Showing 21–40 of 47 results
/ Date/ Name
Jul 3, 2023Facilitating Cooperation in Human-Agent Hybrid Populations through Autonomous AgentsApr 30, 2024Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement LearningJun 19, 2025Large Language Models are Near-Optimal Decision-Makers with a Non-Human Learning BehaviorFeb 12, 2025Stop Overvaluing Multi-Agent Debate -- We Must Rethink Evaluation and Embrace Model HeterogeneityMar 3, 2025Nature-Inspired Population-Based Evolution of Large Language ModelsMar 12, 2025ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement LearningJul 9, 2025Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language ModelAug 18, 2025Beyond GPT-5: Making LLMs Cheaper and Better via Performance-Efficiency Optimized RoutingDec 9, 2025Single-Agent Scaling Fails Multi-Agent Intelligence: Towards Foundation Models with Native Multi-Agent IntelligenceOct 10, 2025ICL-Router: In-Context Learned Model Representations for LLM RoutingFeb 8, 2026MARTI-MARS$^2$: Scaling Multi-Agent Self-Search via Reinforcement Learning for Code GenerationMay 3, 2026Disentangling Intent from Role: Adversarial Self-Play for Persona-Invariant Safety AlignmentFeb 13, 2022Individual-Level Inverse Reinforcement Learning for Mean Field GamesMay 20, 2024Configurable Mirror Descent: Towards a Unification of Decision MakingJun 6, 2024Beyond a binary theorizing of prosocialityFeb 11, 2025EvoFlow: Evolving Diverse Agentic Workflows On The FlyJul 14, 2025Open-Source LLMs Collaboration Beats Closed-Source LLMs: A Scalable Multi-Agent SystemOct 3, 2025The Path of Self-Evolving Large Language Models: Achieving Data-Efficient Learning via Intrinsic FeedbackJun 3, 2025Truly Assessing Fluid Intelligence of Large Language Models through Dynamic Reasoning EvaluationMay 21, 2025Decouple and Orthogonalize: A Data-Free Framework for LoRA Merging