arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Tianqi Zhang"" — arXiv2 Search
Showing 1–2 of 2 results
/ Date
/ Name
Apr 10, 2025
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning
Jan 25, 2025
Predictive Lagrangian Optimization for Constrained Reinforcement Learning