Showing 321–340 of 1,726 results
/ Date/ Name
Jan 9, 2026The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought ReasoningJan 8, 2026Large Language Models Are Bad Dice Players: LLMs Struggle to Generate Random Numbers from Statistical DistributionsJan 8, 2026GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL OptimizationJan 8, 2026Memory Matters More: Event-Centric Memory as a Logic Map for Agent Searching and ReasoningJan 8, 2026MAGA-Bench: Machine-Augment-Generated Text via Alignment Detection BenchmarkJan 7, 2026Tracing the complexity profiles of different linguistic phenomena through the intrinsic dimension of LLM representationsJan 5, 2026ModeX: Evaluator-Free Best-of-N Selection for Open-Ended GenerationDec 31, 2025Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language ModelsDec 29, 2025MiMo-Audio: Audio Language Models are Few-Shot LearnersDec 28, 2025Multimodal Fact-Checking: An Agent-based ApproachDec 24, 2025Parallel Token Prediction for Language ModelsDec 21, 2025Toward Human-Centered AI-Assisted Terminology WorkDec 19, 2025OpenAI GPT-5 System CardDec 15, 2025Olmo 3Dec 15, 2025Memory in the Age of AI AgentsDec 9, 2025SoMe: A Realistic Benchmark for LLM-based Social Media AgentsDec 8, 2025Living the Novel: A System for Generating Self-Training Timeline-Aware Conversational Agents from NovelsDec 8, 2025NeSTR: A Neuro-Symbolic Abductive Framework for Temporal Reasoning in Large Language ModelsDec 6, 2025Policy-based Sentence Simplification: Replacing Parallel Corpora with LLM-as-a-JudgeDec 3, 2025CartoMapQA: A Fundamental Benchmark Dataset Evaluating Vision-Language Models on Cartographic Map Understanding