Showing 1–20 of 23 results
/ Date/ Name
Mar 23, 2026A Brief Comparison of Training-Free Multi-Vector Sequence Compression MethodsMar 20, 2026CoverageBench: Evaluating Information Coverage across Tasks and DomainsOct 11, 2025Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory RewritingOct 8, 2025All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM GenerationsAug 10, 2025Can LLMs Identify Tax Abuse?May 28, 2025Jailbreak Distillation: Renewable Safety BenchmarkingApr 1, 2025WikiVideo: Article Generation from Multiple VideosSep 17, 2024Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language ModelsJun 20, 2024RE-AdaptIR: Improving Information Retrieval through Reverse Engineered AdaptationFeb 28, 2024RORA: Robust Free-Text Rationale EvaluationFeb 22, 2024Enhancing Systematic Decompositional Natural Language Inference Using Informal LogicDec 28, 2023Do Androids Know They're Only Dreaming of Electric Sheep?Nov 16, 2023BLT: Can Large Language Models Handle Basic Legal Text?Sep 16, 2022NELLIE: A Neuro-Symbolic Inference Engine for Grounded, Compositional, and Explainable ReasoningMar 8, 2021InFillmore: Frame-Guided Language Generation with Bidirectional ContextOct 2, 2020Which *BERT? A Survey Organizing Contextualized EncodersApr 2, 2020Causal Inference of Script KnowledgeOct 6, 2019Exact and/or Fast Nearest NeighborsApr 25, 2019Probing What Different NLP Tasks Teach Machines about Function Word ComprehensionSep 20, 2018Predicting the Argumenthood of English Prepositional Phrases