Showing 1–20 of 22 results
Sep 25, 2024: Monge-Kantorovich Fitting With Sobolev Budgets
Feb 19, 2024: Query-Based Adversarial Prompt Generation
Oct 29, 2023: Label Poisoning is All You Need
Oct 14, 2022: Zonotope Domains for Lagrangian Neural Network Verification
Jul 23, 2024: Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?
Jun 17, 2025: Sampling from Your Language Model One Byte at a Time
Oct 12, 2022: Few-shot Backdoor Attacks via Neural Tangent Kernels
Jul 2, 2024: PLeaS -- Merging Models with Permutations and Least Squares
Mar 17, 2025: SuperBPE: Space Travel for Language Models
Apr 27, 2023: DataComp: In search of the next generation of multimodal datasets
Feb 6, 2026: Anchored Decoding: Provably Reducing Copyright Risk for Any Language Model
Jan 30, 2026: Are you going to finish that? A Practical Study of the Partial Token Problem
Nov 28, 2023: Scalable Extraction of Training Data from (Production) Language Models
Apr 22, 2021: SPECTRE: Defending Against Backdoor Attacks Using Robust Statistics
Nov 1, 2024: OML: A Primitive for Reconciling Open Access with Owner Control in AI Model Distribution
Jun 23, 2025: Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations
May 24, 2022: Towards a Defense Against Federated Backdoor Attacks Under Continuous Training
Apr 23, 2024: Insufficient Statistics Perturbation: Stable Estimators for Private Least Squares
Mar 11, 2024: Stealing Part of a Production Language Model
Feb 11, 2025: Scalable Fingerprinting of Large Language Models