"au:"Hongjie Zhang"" — arXiv2 SearchShowing 1–6 of 6 results
/ Date/ Name
Mar 26, 2026Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion ScaleMar 21, 2026ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent FrameworkMar 10, 2026InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and EditingAug 25, 2025InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and EfficiencyMar 22, 2024InternVideo2: Scaling Foundation Models for Multimodal Video UnderstandingDec 6, 2022InternVideo: General Video Foundation Models via Generative and Discriminative Learning