Showing 1–14 of 14 results
/ Date/ Name
Jan 20, 2026Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision FlowNov 10, 2025StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video GenerationJun 19, 2025SparseLoRA: Accelerating LLM Fine-Tuning with Contextual SparsityOct 25, 2024COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 TrainingJun 18, 2024Immiscible Diffusion: Accelerating Diffusion Training with Noise AssignmentMay 24, 2024Looking Backward: Streaming Video-to-Video Translation with Feature BanksDec 19, 2023StreamDiffusion: A Pipeline-level Solution for Real-time Interactive GenerationSep 25, 2023Aligning Large Multimodal Models with Factually Augmented RLHFFeb 8, 2023Q-Diffusion: Quantizing Diffusion ModelsApr 21, 2022PreTraM: Self-Supervised Pre-training via Connecting Trajectory and MapJun 12, 2020CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on Embedded FPGAsApr 1, 2019Large Batch Optimization for Deep Learning: Training BERT in 76 minutesJan 24, 2019Large-Batch Training for LSTM and BeyondNov 5, 2018Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge