"au:"Pengyu Zhao"" — arXiv2 Search
Showing 1–5 of 5 results
Mar 10, 2026
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
Jan 29, 2026
HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing
Sep 8, 2025
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
Jun 16, 2025
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Jan 14, 2025
MiniMax-01: Scaling Foundation Models with Lightning Attention