Showing 1–11 of 11 results
Date / Name
Jan 30, 2026 / Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation
May 20, 2025 / Quartet: Native FP4 Training Can Be Optimal for Large Language Models
Nov 26, 2024 / Pushing the Limits of Large Language Model Quantization via the Linearity Theorem
Jan 11, 2024 / Extreme Compression of Large Language Models via Additive Quantization
Sep 27, 2025 / Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
Feb 7, 2025 / QuEST: Stable Training of LLMs with 1-Bit Weights and Activations
Jun 2, 2025 / Unified Scaling Laws for Compressed Representations
Jan 10, 2024 / Correlated Quantization for Faster Nonconvex Distributed Optimization
Sep 17, 2025 / Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Oct 21, 2025 / CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training
Jun 24, 2024 / Panza: Design and Analysis of a Fully-Local Personalized Text Writing Assistant