arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Abhinav Venigalla"" — arXiv2 Search
Showing 1–5 of 5 results
/ Date
/ Name
Mar 25, 2020
Pipelined Backpropagation at Scale: Training Large Models without Batches
Jul 2, 2020
Adaptive Braking for Mitigating Gradient Delay
Mar 29, 2021
Representation range needs for 16-bit neural network training
Mar 27, 2024
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
Dec 29, 2023
MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining