arXiv2
"au:"Weiyun Wang"" — arXiv2 Search
Showing 1–5 of 5 results
Oct 14, 2025
MetaCaptioner: Towards Generalist Visual Captioning with Open-source Suites
Oct 13, 2025
Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning
Aug 25, 2025
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Jul 19, 2025
Docopilot: Improving Multimodal Models for Document-Level Understanding
Jun 12, 2024
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text