arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Ruipeng Jia"" — arXiv2 Search
Showing 1–5 of 5 results
/ Date
/ Name
Apr 28, 2022
Neural Label Search for Zero-Shot Multi-Lingual Extractive Summarization
May 30, 2025
Writing-Zero: Bridge the Gap Between Non-verifiable Tasks and Verifiable Rewards
Feb 15, 2026
Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric
Sep 23, 2024
Orthogonal Finetuning for Direct Preference Optimization
Aug 25, 2025
Weights-Rotated Preference Optimization for Large Language Models