Showing 1–20 of 25 results
/ Date/ Name
May 23, 2019Thwarting finite difference adversarial attacks with output randomizationNov 15, 2019Optimal Mini-Batch Size Selection for Fast Gradient DescentApr 17, 2025ZeroSumEval: Scaling LLM Evaluation with Inter-Model CompetitionOct 8, 2020Don't Parse, Insert: Multilingual Semantic Parsing with Insertion Based DecodingFeb 1, 2024When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model LeaderboardsMay 29, 2018Focal onset seizure prediction using convolutional networksMay 23, 2019Deep density ratio estimation for change point detectionMar 10, 2025ZeroSumEval: An Extensible Framework For Scaling LLM Evaluation with Inter-Model CompetitionJan 24, 2023Low-Resource Compositional Semantic Parsing with Concept PretrainingAug 2, 2022AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq ModelJun 15, 2022Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding SystemsJun 8, 2025ConfRAG: Confidence-Guided Retrieval-Augmenting GenerationApr 9, 2019Generation & Evaluation of Adversarial Examples for Malware ObfuscationDec 1, 2022PIZZA: A new benchmark for complex end-to-end task-oriented parsingFeb 2, 2022RescoreBERT: Discriminative Speech Recognition Rescoring with BERTDec 7, 2020Using multiple ASR hypotheses to boost i18n NLU performanceMay 19, 2023Controlling the Extraction of Memorized Data from Large Language Models via Prompt-TuningAug 3, 2025Refine-n-Judge: Curating High-Quality Preference Chains for LLM-Fine-TuningOct 30, 2025CRAG-MM: Multi-modal Multi-turn Comprehensive RAG BenchmarkMar 5, 2022Unfreeze with Care: Space-Efficient Fine-Tuning of Semantic Parsing Models