"au:"Chaoyou Fu"" — arXiv2 SearchShowing 1–8 of 8 results
/ Date/ Name
Apr 22, 2026SpeechParaling-Bench: A Comprehensive Benchmark for Paralinguistic-Aware Speech GenerationOct 21, 2025CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using AgentFeb 13, 2025MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and EfficiencyFeb 7, 2025Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context AccuracyNov 1, 2024Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLMAug 16, 2024A Survey on Benchmarks of Multimodal Large Language ModelsOct 24, 2023Woodpecker: Hallucination Correction for Multimodal Large Language ModelsJun 23, 2023A Survey on Multimodal Large Language Models