Showing 501–520 of 1,726 results
/ Date/ Name
Jun 2, 2025AI Debate Aids Assessment of Controversial ClaimsJun 2, 2025Generate, Not Recommend: Personalized Multimodal Content GenerationJun 2, 2025Incentivizing Reasoning for Advanced Instruction-Following of Large Language ModelsJun 1, 2025Towards Predicting Any Human Trajectory In ContextJun 1, 2025GuessBench: Sensemaking Multimodal Creativity in the WildMay 31, 2025Data Swarms: Optimizable Generation of Synthetic Evaluation DataMay 29, 2025ScaleLong: A Multi-Timescale Benchmark for Long Video UnderstandingMay 29, 2025GSO: Challenging Software Optimization Tasks for Evaluating SWE-AgentsMay 29, 2025Revisiting Uncertainty Estimation and Calibration of Large Language ModelsMay 28, 2025VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language ModelsMay 28, 2025Chain-of-Talkers (CoTalk): Fast Human Annotation of Dense Image CaptionsMay 28, 2025Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel DecodingMay 28, 2025Jailbreak Distillation: Renewable Safety BenchmarkingMay 28, 2025Principled Content Selection to Generate Diverse and Personalized Multi-Document SummariesMay 27, 2025Rethinking Data Mixture for Large Language Models: A Comprehensive Survey and New PerspectivesMay 27, 2025Pangu Pro MoE: Mixture of Grouped Experts for Efficient SparsityMay 26, 2025WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and RollbackMay 25, 2025MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent SystemsMay 24, 2025TULUN: Transparent and Adaptable Low-resource Machine TranslationMay 23, 2025ProgRM: Build Better GUI Agents with Progress Rewards