"au:"Peng Gao"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Peng Gao"" — arXiv2 Search

Showing 1–15 of 15 results

/ Date/ Name

Oct 14, 2025Migration and spreading of a droplet driven by a chemical step Jun 16, 2025MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Feb 13, 2025MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Jan 14, 2025Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Jul 10, 2024VEnhancer: Generative Space-Time Enhancement for Video Generation May 23, 2024TerDiT: Ternary Diffusion Models with Transformers Feb 8, 2024SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models Nov 13, 2023SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models Apr 28, 2023LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model Mar 9, 2023Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking Aug 6, 2022Frozen CLIP Models are Efficient Video Learners May 8, 2022ConvMAE: Masked Convolution Meets Masked Autoencoders Nov 18, 2020End-to-End Object Detection with Adaptive Clustering Transformer Apr 26, 2020Transmission electron microscopy of organic-inorganic hybrid perovskites: myths and truths Nov 13, 2019Learning Where to Focus for Efficient Video Object Detection