Showing 1–20 of 46 results
/ Date/ Name
Jun 21, 2024Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language ModelsJun 5, 2025Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement LearningMay 24, 2025OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ TasksJan 21, 2026MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled BenchmarksFeb 23, 2026SkillOrchestra: Learning to Route Agents via Skill TransferJun 3, 2025Helpful Agent Meets Deceptive Judge: Understanding Vulnerabilities in Agentic WorkflowsApr 30, 2025COSMOS: Predictable and Cost-Effective Adaptation of LLMsOct 16, 2025LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the WildMay 31, 2024Grammar-Aligned DecodingMar 5, 2026Functionality-Oriented LLM Merging on the Fisher--Rao ManifoldDec 7, 2023Hierarchical Spatio-temporal Decoupling for Text-to-Video GenerationDec 15, 2023DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic ModelsNov 13, 2024EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video GenerationFeb 13, 2025The Widespread Adoption of Large Language Model-Assisted Writing Across SocietyDec 26, 2023Acousto-drag photovoltaic effect by piezoelectric integration of two-dimensional semiconductorsOct 15, 2025Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement MechanismsFeb 8, 2026SPD-Faith Bench: Diagnosing and Improving Faithfulness in Chain-of-Thought for Multimodal Large Language ModelsDec 16, 2021APTSHIELD: A Stable, Efficient and Real-time APT Detection System for Linux HostsDec 6, 2023Quantum Fusion of Independent Networks Based on Multi-user Entanglement SwappingJun 3, 2024UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation