Showing 1–17 of 17 results
Date / Name
Oct 17, 2025 / MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
Oct 7, 2023 / Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning
Jun 3, 2025 / VS-Bench: Evaluating VLMs for Strategic Abilities in Multi-Agent Environments
Feb 7, 2025 / Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
Oct 5, 2023 / Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games
Aug 2, 2024 / A Survey on Self-play Methods in Reinforcement Learning
Mar 24, 2025 / AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models
Oct 29, 2023 / Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
Feb 4, 2026 / WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning
May 7, 2025 / Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning
Feb 4, 2025 / VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
Oct 7, 2025 / EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models
Mar 4, 2026 / MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation
Feb 2, 2026 / Kimi K2.5: Visual Agentic Intelligence
Sep 29, 2025 / RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment
Jun 15, 2022 / Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Nov 18, 2025 / Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn