"au:"Zelai Xu"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Zelai Xu"" — arXiv2 Search

Showing 1–17 of 17 results

/ Date/ Name

Oct 17, 2025MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs Oct 7, 2023Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning Jun 3, 2025VS-Bench: Evaluating VLMs for Strategic Abilities in Multi-Agent Environments Feb 7, 2025Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization Oct 5, 2023Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games Aug 2, 2024A Survey on Self-play Methods in Reinforcement Learning Mar 24, 2025AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models Oct 29, 2023Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game Feb 4, 2026WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning May 7, 2025Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning Feb 4, 2025VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play Oct 7, 2025EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models Mar 4, 2026MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation Feb 2, 2026Kimi K2.5: Visual Agentic Intelligence Sep 29, 2025RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment Jun 15, 2022Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning Nov 18, 2025Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn