"au:"Yinan He"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Yinan He"" — arXiv2 Search

Showing 1–8 of 8 results

/ Date/ Name

Mar 26, 2026Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Aug 25, 2025InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Jan 14, 2025Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Jun 12, 2024OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Mar 22, 2024InternVideo2: Scaling Foundation Models for Multimodal Video Understanding Mar 11, 2024VideoMamba: State Space Model for Efficient Video Understanding Dec 6, 2022InternVideo: General Video Foundation Models via Generative and Discriminative Learning Nov 16, 2021INTERN: A New Learning Paradigm Towards General Vision