Showing 601–620 of 1,726 results
/ Date/ Name
Feb 22, 2025Efficient LLM Moderation with Multi-Layer Latent PrototypesFeb 21, 2025Self-Taught Agentic Long Context UnderstandingFeb 21, 2025Retrieval-Augmented Speech Recognition Approach for Domain ChallengesFeb 20, 2025SuperGPQA: Scaling LLM Evaluation across 285 Graduate DisciplinesFeb 20, 2025NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localizationFeb 20, 2025Can LLMs Simulate L2-English Dialogue? An Information-Theoretic Analysis of L1-Dependent BiasesFeb 20, 2025MLGym: A New Framework and Benchmark for Advancing AI Research AgentsFeb 19, 2025The Canary's Echo: Auditing Privacy Risks of LLM-Generated Synthetic TextFeb 19, 2025MMTEB: Massive Multilingual Text Embedding BenchmarkFeb 18, 2025Rethinking Diverse Human Preference Learning through Principal Component AnalysisFeb 18, 2025On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response GenerationFeb 17, 2025InfoQuest: Evaluating Multi-Turn Dialogue Agents for Open-Ended Conversations with Hidden ContextFeb 17, 2025HalluEntity: Benchmarking and Understanding Entity-Level Hallucination DetectionFeb 17, 2025AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse VerificationFeb 16, 2025DEEPER Insight into Your User: Directed Persona Refinement for Dynamic Persona ModelingFeb 15, 2025A Tutorial on LLM Reasoning: Relevant Methods behind ChatGPT o1Feb 13, 2025MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and EfficiencyFeb 12, 2025Inference-time sparse attention with asymmetric indexingFeb 12, 2025One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMsFeb 11, 2025CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction