arXiv2
Search
Dark
/ Date
/ Name
Aa
W
Search
/ Date
/ Name
"au:"Wenwen Tong"" — arXiv2 Search
Showing 1–5 of 5 results
/ Date
/ Name
Oct 15, 2025
InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue
Dec 30, 2025
SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning
Mar 18, 2023
3D Data Augmentation for Driving Scenes on Camera
Jun 5, 2023
Scene as Occupancy
Apr 25, 2024
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites