"au:"Shuyue Hu"" — arXiv2 Search

Showing 21–40 of 47 results

Jul 3, 2023: Facilitating Cooperation in Human-Agent Hybrid Populations through Autonomous Agents
Apr 30, 2024: Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning
Jun 19, 2025: Large Language Models are Near-Optimal Decision-Makers with a Non-Human Learning Behavior
Feb 12, 2025: Stop Overvaluing Multi-Agent Debate -- We Must Rethink Evaluation and Embrace Model Heterogeneity
Mar 3, 2025: Nature-Inspired Population-Based Evolution of Large Language Models
Mar 12, 2025: ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
Jul 9, 2025: Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model
Aug 18, 2025: Beyond GPT-5: Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing
Dec 9, 2025: Single-Agent Scaling Fails Multi-Agent Intelligence: Towards Foundation Models with Native Multi-Agent Intelligence
Oct 10, 2025: ICL-Router: In-Context Learned Model Representations for LLM Routing
Feb 8, 2026: MARTI-MARS$^2$: Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation
May 3, 2026: Disentangling Intent from Role: Adversarial Self-Play for Persona-Invariant Safety Alignment
Feb 13, 2022: Individual-Level Inverse Reinforcement Learning for Mean Field Games
May 20, 2024: Configurable Mirror Descent: Towards a Unification of Decision Making
Jun 6, 2024: Beyond a binary theorizing of prosociality
Feb 11, 2025: EvoFlow: Evolving Diverse Agentic Workflows On The Fly
Jul 14, 2025: Open-Source LLMs Collaboration Beats Closed-Source LLMs: A Scalable Multi-Agent System
Oct 3, 2025: The Path of Self-Evolving Large Language Models: Achieving Data-Efficient Learning via Intrinsic Feedback
Jun 3, 2025: Truly Assessing Fluid Intelligence of Large Language Models through Dynamic Reasoning Evaluation
May 21, 2025: Decouple and Orthogonalize: A Data-Free Framework for LoRA Merging