"au:"Zhaoye Fei"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Zhaoye Fei"" — arXiv2 Search

Showing 21–28 of 28 results

/ Date/ Name

Feb 11, 2026MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models Dec 24, 2024VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks Jun 19, 2025InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems Jun 29, 2025XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs Oct 27, 2025RoboOmni: Proactive Robot Manipulation in Omni-modal Context Aug 28, 2025CodecBench: A Comprehensive Benchmark for Acoustic and Semantic Evaluation Mar 30, 2026MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions Feb 9, 2026MOVA: Towards Scalable and Synchronized Video-Audio Generation