arXiv2
au:"Jared Casper" - arXiv2 Search
Showing 1–3 of 3 results
Apr 14, 2026
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
Nov 9, 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Sep 17, 2019
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism