"au:"Andrei Panferov"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Andrei Panferov"" — arXiv2 Search

Showing 1–11 of 11 results

/ Date/ Name

Jan 30, 2026Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation May 20, 2025Quartet: Native FP4 Training Can Be Optimal for Large Language Models Nov 26, 2024Pushing the Limits of Large Language Model Quantization via the Linearity Theorem Jan 11, 2024Extreme Compression of Large Language Models via Additive Quantization Sep 27, 2025Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization Feb 7, 2025QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Jun 2, 2025Unified Scaling Laws for Compressed Representations Jan 10, 2024Correlated Quantization for Faster Nonconvex Distributed Optimization Sep 17, 2025Apertus: Democratizing Open and Compliant LLMs for Global Language Environments Oct 21, 2025CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training Jun 24, 2024Panza: Design and Analysis of a Fully-Local Personalized Text Writing Assistant