"au:"Erfei Cui"" — arXiv2 SearchShowing 1–7 of 7 results
/ Date/ Name
Mar 26, 2026Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion ScaleMar 21, 2026ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent FrameworkMar 10, 2026InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and EditingOct 14, 2025MetaCaptioner: Towards Generalist Visual Captioning with Open-source SuitesAug 25, 2025InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and EfficiencyMay 30, 2025Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in SpacesJun 12, 2024OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text