arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Zhi-Quan Luo"" — arXiv2 Search
Showing 1–9 of 9 results
/ Date
/ Name
Mar 2, 2026
Adam Converges Without Any Modification On Update Rules
Sep 30, 2025
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
Nov 25, 2024
Exploring the Generalization Capabilities of AID-based Bi-level Optimization
Aug 29, 2024
Preserving Diversity in Supervised Fine-Tuning of Large Language Models
Jun 24, 2024
Adam-mini: Use Fewer Learning Rates To Gain More
Feb 26, 2024
Why Transformers Need Adam: A Hessian Perspective
Oct 23, 2023
TeleQnA: A Benchmark Dataset to Assess Large Language Models Telecommunications Knowledge
Aug 20, 2022
Adam Can Converge Without Any Modification On Update Rules
May 28, 2022
Efficient-Adam: Communication-Efficient Distributed Adam