"au:"Ganlin Yang"" — arXiv2 Search
Showing 1–5 of 5 results
Date / Name
Mar 21, 2026 / ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework
Mar 10, 2026 / InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing
Oct 13, 2025 / Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning
Aug 25, 2025 / InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
May 30, 2025 / Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces