"au:"Yinan He"" — arXiv2 Search
Showing 1–8 of 8 results
Date / Name
Mar 26, 2026 / Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale
Aug 25, 2025 / InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Jan 14, 2025 / Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Jun 12, 2024 / OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Mar 22, 2024 / InternVideo2: Scaling Foundation Models for Multimodal Video Understanding
Mar 11, 2024 / VideoMamba: State Space Model for Efficient Video Understanding
Dec 6, 2022 / InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Nov 16, 2021 / INTERN: A New Learning Paradigm Towards General Vision