Computation and Language

cs.CL

Showing 341–360 of 1,726 results

Dec 2, 2025 DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
Dec 2, 2025 Process-Centric Analysis of Agentic Software Systems
Dec 1, 2025 Beware of Reasoning Overconfidence: Pitfalls in the Reasoning Process for Multi-solution Tasks
Dec 1, 2025 Learning the Boundary of Solvability: Aligning LLMs to Detect Unsolvable Problems
Nov 27, 2025 DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
Nov 25, 2025 Soft Adaptive Policy Optimization
Nov 24, 2025 DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
Nov 24, 2025 How Learning Rate Decay Wastes Your Best Data in Curriculum-Based LLM Pretraining
Nov 21, 2025 Selective Rotary Position Embedding
Nov 21, 2025 The PLLuM Instruction Corpus
Nov 21, 2025 Closing the Performance Gap Between AI and Radiologists in Chest X-Ray Reporting
Nov 20, 2025 Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation
Nov 19, 2025 MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Nov 18, 2025 ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning
Nov 17, 2025 Dropouts in Confidence: Moral Uncertainty in Human-LLM Alignment
Nov 17, 2025 Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance
Nov 16, 2025 On the Brittleness of LLMs: A Journey around Set Membership
Nov 14, 2025 DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains
Nov 13, 2025 Instella: Fully Open Language Models with Stellar Performance
Nov 13, 2025 AdvancedIF: Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following
