Showing 1–20 of 20 results
/ Date/ Name
Feb 28, 2026SkillCraft: Can LLM Agents Learn to Use Tools Skillfully?Dec 2, 2025DeepSeek-V3.2: Pushing the Frontier of Open Large Language ModelsNov 17, 2025Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and ChallengesOct 29, 2025The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task ExecutionFeb 11, 2025CodeI/O: Condensing Reasoning Patterns via Code Input-Output PredictionJan 22, 2025DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement LearningDec 27, 2024DeepSeek-V3 Technical ReportDec 23, 2024Diving into Self-Evolving Training for Multimodal ReasoningSep 25, 2024Programming Every Example: Lifting Pre-training Data Quality Like Experts at ScaleFeb 19, 2024Reformatted AlignmentFeb 17, 2024Dissecting Human and LLM PreferencesOct 20, 2023Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop ReasoningOct 9, 2023Generative Judge for Evaluating AlignmentDec 16, 2022Self-Prompting Large Language Models for Zero-Shot Open-Domain QAMar 25, 2022Stochastic Trajectory Prediction via Motion Indeterminacy DiffusionOct 16, 2021MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document UnderstandingJul 29, 2021Personalized Trajectory Prediction via Distribution DiscriminationJul 29, 2021Human Trajectory Prediction via Counterfactual AnalysisSep 10, 2020Dialogue-adaptive Language Model Pre-training From Quality EstimationApr 29, 2020Knowledgeable Dialogue Reading Comprehension on Key Turns