Showing 1–20 of 91 results
/ Date/ Name
Nov 24, 2025Fara-7B: An Efficient Agentic Model for Computer UseApr 30, 2025Phi-4-reasoning Technical ReportFeb 14, 2024Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered ApplicationsNov 7, 2024Magentic-One: A Generalist Multi-Agent System for Solving Complex TasksMay 5, 2022Interactive Grounded Language Understanding in a Collaborative Environment: IGLU 2021May 31, 2024Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHFMar 5, 2026Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large ToolspacesNov 1, 2022Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language InstructionsJun 5, 2023Orca: Progressive Learning from Complex Explanation Traces of GPT-4Mar 12, 2022The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of RedundancyOct 13, 2021NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative EnvironmentJul 3, 2024AgentInstruct: Toward Generative Teaching with Agentic FlowsNov 18, 2023Orca 2: Teaching Small Language Models How to ReasonMar 3, 2026Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool UseApr 22, 2018Adversarial Training for Community Question Answer Selection Based on Multi-scale MatchingAug 24, 2022ADMoE: Anomaly Detection with Mixture-of-Experts from Noisy LabelsMay 27, 2022IGLU 2022: Interactive Grounded Language Understanding in a Collaborative Environment at NeurIPS 2022Apr 4, 2024Direct Nash Optimization: Teaching Language Models to Self-Improve with General PreferencesMay 18, 2023Transforming Human-Centered AI Collaboration: Redefining Embodied Agents Capabilities through Interactive Grounded Language InstructionsDec 2, 2023Axiomatic Preference Modeling for Longform Question Answering