Showing 1–12 of 12 results
Date / Name
May 4, 2025 / An Empirical Study of Qwen3 Quantization
Apr 8, 2024 / BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models
Apr 18, 2025 / Knitting Robots: A Deep Learning Approach for Reverse-Engineering Fabric Patterns
Jul 15, 2025 / First-Order Error Matters: Accurate Compensation for Quantized Large Language Models
Dec 8, 2024 / BiDM: Pushing the Limit of Quantization for Diffusion Models
Feb 2, 2026 / Token Pruning for In-Context Generation in Diffusion Transformers
Aug 2, 2023 / Isolation and Induction: Training Robust Deep Neural Networks against Model Stealing Attacks
Feb 8, 2024 / Accurate LoRA-Finetuning Quantization of LLMs via Information Retention
Apr 22, 2024 / An empirical study of LLaMA3 quantization: from LLMs to MLLMs
Dec 13, 2024 / AniSora: Exploring the Frontiers of Animation Video Generation in the Sora Era
Apr 7, 2025 / PEAKS: Selecting Key Training Examples Incrementally via Prediction Error Anchored by Kernel Similarity
Sep 25, 2024 / A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms