Date          Name
Oct 8, 2019   Observer Dependent Lossy Image Compression
Sep 21, 2020  Optimal Provable Robustness of Quantum Classification via Quantum Hypothesis Testing
Dec 15, 2023  WordScape: a Pipeline to extract multilingual, visually rich Documents with Layout Annotations from Web Crawl Data
Oct 19, 2021  Toward Reliability in the NISQ Era: Robust Interval Guarantee for Quantum Measurements on Approximate States
Mar 19, 2020  RAB: Provable Robustness Against Backdoor Attacks
Feb 3, 2022   Certifying Out-of-Domain Generalization for Blackbox Functions
Feb 27, 2020  TSS: Transformation-Specific Smoothing for Robustness Certification
Nov 19, 2024  RedPajama: an Open Dataset for Training Large Language Models
Nov 30, 2022  Predicting Properties of Quantum Systems with Conditional Generative Models
Jan 14, 2025  Towards Best Practices for Open Datasets for LLM Training
May 31, 2022  Certifying Some Distributional Fairness with Subpopulation Decomposition
Nov 12, 2018  PennyLane: Automatic differentiation of hybrid quantum-classical computations
Oct 31, 2024  Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Feb 18, 2025  Multilingual Language Model Pretraining using Machine-translated Data
Apr 17, 2025  Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation