Showing 421–440 of 1,726 results
/ Date/ Name
Sep 26, 2025LUMINA: Detecting Hallucinations in RAG System with Context-Knowledge SignalsSep 25, 2025Hallucination reduction with CASAL: Contrastive Activation Steering For Amortized LearningSep 22, 2025Everyday Physics in Korean Contexts: A Culturally Grounded Physical Reasoning BenchmarkSep 22, 2025Qwen3-Omni Technical ReportSep 22, 2025Generalizable End-to-End Tool-Use RL with Synthetic CodeGymSep 21, 2025ARE: Scaling Up Agent Environments and EvaluationsSep 16, 2025Case-Based Decision-Theoretic Decoding with Quality MemoriesSep 16, 2025The Better You Learn, The Smarter You Prune: Towards Efficient Vision-language-action Models via Differentiable Token PruningSep 15, 2025Fun-ASR Technical ReportSep 14, 2025Towards Better Health Conversations: The Benefits of Context-seekingSep 13, 2025Evaluating Large Language Models for Evidence-Based Clinical Question AnsweringSep 11, 2025LLM Architecture, Scaling Laws, and Economics: A Quick SummarySep 10, 2025Benchmarking Vision-Language Models on Chinese Ancient Documents: From OCR to Knowledge ReasoningSep 9, 2025VStyle: A Benchmark for Voice Style Adaptation with Spoken InstructionsSep 8, 2025LAMDAS: LLM as an Implicit Classifier for Domain-specific Data SelectionSep 8, 2025WebExplorer: Explore and Evolve for Training Long-Horizon Web AgentsSep 7, 2025Understanding the Influence of Synthetic Data for Text EmbeddersSep 6, 2025New Insights into Optimal Alignment of Acoustic and Linguistic Representations for Knowledge Transfer in ASRSep 4, 2025Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?Aug 28, 2025NPG-Muse: Scaling Long Chain-of-Thought Reasoning with NP-Hard Graph Problems