arXiv2
Search
Toggle theme
/ Date
/ Name
Search
/ Date
/ Name
"au:"Puxin Xu"" — arXiv2 Search
Showing 1–6 of 6 results
/ Date
/ Name
Apr 19, 2023
A Theory on Adam Instability in Large-Scale Machine Learning
Sep 5, 2023
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Jul 17, 2025
PrefPalette: Personalized Preference Modeling with Latent Attributes
Jul 31, 2024
The Llama 3 Herd of Models
Jul 18, 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
May 18, 2023
LIMA: Less Is More for Alignment