arXiv2
"au:"Bor-Yiing Su"" — arXiv2 Search
Showing 1–7 of 7 results
Dec 28, 2025
MoR: Mixture Of Representations For Mixed-Precision Training
Mar 20, 2020
Deep Learning Training in Facebook Data Centers: Design of Scale-up and Scale-out Systems
Mar 7, 2020
ShadowSync: Performing Synchronization in the Background for Highly Scalable Distributed Training
Jan 30, 2026
Unveiling the Potential of Quantization with MXFP4: Strategies for Quantization Error Reduction
Nov 5, 2020
CPR: Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery
May 2, 2025
Llama-Nemotron: Efficient Reasoning Models
Sep 29, 2025
Pretraining Large Language Models with NVFP4