arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Ruoyu Sun"" — arXiv2 Search
Showing 1–9 of 9 results
/ Date
/ Name
Mar 2, 2026
Adam Converges Without Any Modification On Update Rules
Oct 31, 2025
ORGEval: Graph-Theoretic Evaluation of LLMs in Optimization Modeling
Sep 30, 2025
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
May 27, 2025
Rethinking Data Mixture for Large Language Models: A Comprehensive Survey and New Perspectives
Aug 29, 2024
Preserving Diversity in Supervised Fine-Tuning of Large Language Models
Jun 24, 2024
Adam-mini: Use Fewer Learning Rates To Gain More
Feb 26, 2024
Why Transformers Need Adam: A Hessian Perspective
Oct 12, 2023
LEMON: Lossless model expansion
Aug 20, 2022
Adam Can Converge Without Any Modification On Update Rules