arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Yanghai Wang"" — arXiv2 Search
Showing 1–8 of 8 results
/ Date
/ Name
Sep 23, 2024
OmniBench: Towards The Future of Universal Omni-Language Models
Apr 16, 2026
DR$^{3}$-Eval: Towards Realistic and Reproducible Deep Research Evaluation
Oct 21, 2025
IF-VidCap: Can Video Caption Models Follow Instructions?
Nov 10, 2025
MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
Oct 20, 2025
MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues
Dec 24, 2025
T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation
Aug 22, 2025
M3TQA: Massively Multilingual Multitask Table Question Answering
Oct 12, 2025
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs