arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Jianhe Lin"" — arXiv2 Search
Showing 1–3 of 3 results
/ Date
/ Name
May 30, 2025
Writing-Zero: Bridge the Gap Between Non-verifiable Tasks and Verifiable Rewards
Feb 15, 2026
Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric
Mar 10, 2026
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR