"au:"Ligeng Zhu"" — arXiv2 Search

Showing 1–20 of 20 results

Apr 8, 2026   Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLM
Jan 20, 2026  Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow
Jun 19, 2025  SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
May 28, 2025  Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding
Dec 5, 2024   NVILA: Efficient Frontier Visual Language Models
Oct 25, 2024  COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Oct 14, 2024  SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Sep 6, 2024   VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Jul 26, 2024  Wolf: Dense Video Captioning with a World Summarization Framework
Jul 24, 2024  VILA$^2$: VILA Augmented VILA
Mar 28, 2024  Tiny Machine Learning: Progress and Futures
Oct 26, 2023  PockEngine: Sparse and Efficient Fine-tuning in a Pocket
Jun 30, 2022  On-Device Training Under 256KB Memory
Apr 25, 2022  Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Nov 2, 2020   IOS: Inter-Operator Scheduler for CNN Acceleration
May 28, 2020  HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
Jun 21, 2019  Deep Leakage from Gradients
Dec 2, 2018   ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
Jan 18, 2018  Sparsely Aggregated Convolutional Networks
Dec 5, 2017   Learning to Forecast Videos of Human Activity with Multi-granularity Models and Adaptive Rendering