arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Jiacheng Yang"" — arXiv2 Search
Showing 1–2 of 2 results
/ Date
/ Name
Feb 27, 2026
Annotation-Free Visual Reasoning for High-Resolution Large Multimodal Models via Reinforcement Learning
Aug 25, 2025
GEPO: Group Expectation Policy Optimization for Stable Heterogeneous Reinforcement Learning