Date / Name
Apr 8, 2026 / Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLM
Apr 6, 2026 / TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
Feb 19, 2026 / Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs
Jan 20, 2026 / Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow
Nov 10, 2025 / StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Nov 6, 2025 / NVIDIA Nemotron Nano V2 VL
Jun 19, 2025 / SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
May 28, 2025 / Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding
Dec 5, 2024 / NVILA: Efficient Frontier Visual Language Models
Oct 25, 2024 / COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Oct 14, 2024 / SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Sep 6, 2024 / VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Jul 26, 2024 / Wolf: Dense Video Captioning with a World Summarization Framework
Jul 24, 2024 / VILA$^2$: VILA Augmented VILA
Mar 28, 2024 / Tiny Machine Learning: Progress and Futures
Oct 26, 2023 / PockEngine: Sparse and Efficient Fine-tuning in a Pocket
Dec 16, 2022 / Biomedical image analysis competitions: The state of current participation practice
Oct 30, 2022 / QuEst: Graph Transformer for Quantum Circuit Reliability Estimation
Jun 30, 2022 / On-Device Training Under 256KB Memory
Jun 19, 2022 / MME-CRS: Multi-Metric Evaluation Based on Correlation Re-Scaling for Evaluating Open-Domain Dialogue