arXiv2
arXiv2 Search: au:"Rihui Xin"
Showing 1–4 of 4 results
May 26, 2025
Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers
Feb 18, 2025
Baichuan-M1: Pushing the Medical Capability of Large Language Models
Sep 2, 2025
DCPO: Dynamic Clipping Policy Optimization
Sep 2, 2025
Baichuan-M2: Scaling Medical Capability with Large Verifier System