Showing 221–240 of 1,726 results
/ Date/ Name
Apr 15, 2026EuropeMedQA Study Protocol: A Multilingual, Multimodal Medical Examination Dataset for Language Model EvaluationApr 14, 2026Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic ReasoningApr 13, 2026CArtBench: Evaluating Vision-Language Models on Chinese Art Understanding, Interpretation, and AuthenticityApr 13, 2026Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and MusicApr 12, 2026SpectralLoRA: Is Low-Frequency Structure Sufficient for LoRA Adaptation? A Spectral Analysis of Weight UpdatesApr 12, 2026CodaRAG: Connecting the Dots with Associativity Inspired by Complementary LearningApr 11, 2026Why Supervised Fine-Tuning Fails to Learn: A Systematic Study of Incomplete Learning in Large Language ModelsApr 10, 2026Scalable High-Recall Constraint-Satisfaction-Based Information Retrieval for Clinical Trials MatchingApr 9, 2026Seeing but Not Thinking: Routing Distraction in Multimodal Mixture-of-ExpertsApr 9, 2026Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of InterestApr 9, 2026A Decomposition Perspective to Long-context Reasoning for LLMsApr 8, 2026Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLMApr 8, 2026TeamLLM: A Human-Like Team-Oriented Collaboration Framework for Multi-Step Contextualized TasksApr 7, 2026DataSTORM: Deep Research on Large-Scale Databases using Exploratory Data Analysis and Data StorytellingApr 7, 2026ETR: Entropy Trend Reward for Efficient Chain-of-Thought ReasoningApr 6, 2026TriAttention: Efficient Long Reasoning with Trigonometric KV CompressionApr 2, 2026Do We Need Frontier Models to Verify Mathematical Proofs?Mar 31, 2026CounselReflect: A Toolkit for Auditing Mental-Health DialoguesMar 30, 2026Moving Beyond Review: Applying Language Models to Planning and Translation in ReflectionMar 30, 2026Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights