"au:"Pengyu Zhao"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Pengyu Zhao"" — arXiv2 Search

Showing 1–5 of 5 results

/ Date/ Name

Mar 10, 2026DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use Jan 29, 2026HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing Sep 8, 2025WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents Jun 16, 2025MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Jan 14, 2025MiniMax-01: Scaling Foundation Models with Lightning Attention