"au:"Shengen Yan"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Shengen Yan"" — arXiv2 Search

Showing 1–20 of 30 results

/ Date/ Name

Jan 13, 2015Deep Image: Scaling up Image Recognition Feb 19, 2019Optimizing Network Performance for Distributed DNN Training on GPU Clusters: ImageNet/AlexNet Training in 1.5 Minutes Jun 4, 2023Proteus: Simulating the Performance of Distributed DNN Training Jun 12, 2024DiTFastAttn: Attention Compression for Diffusion Transformer Models Apr 2, 2024Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better Apr 16, 2025VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate Jun 4, 2024ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation Mar 28, 2025DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers Feb 17, 2025DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation May 28, 2024MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization Feb 26, 2025AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms May 24, 2025PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs Sep 3, 2021Characterization and Prediction of Deep Learning Workloads in Large-Scale GPU Datacenters Feb 6, 2024LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K Jan 10, 2022A Simulation Platform for Multi-tenant Machine Learning Services on Thousands of GPUs Apr 22, 2017Towards Distributed Machine Learning in Shared Clusters: A Dynamically-Partitioned Approach Jul 1, 2024Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs Sep 16, 2024CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios Dec 18, 2024E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling May 25, 2024HETHUB: A Distributed Training System with Heterogeneous Cluster for Large-Scale Models