arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Zhaoye Fei"" — arXiv2 Search
Showing 21–28 of 28 results
/ Date
/ Name
Feb 11, 2026
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models
Dec 24, 2024
VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
Jun 19, 2025
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems
Jun 29, 2025
XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs
Oct 27, 2025
RoboOmni: Proactive Robot Manipulation in Omni-modal Context
Aug 28, 2025
CodecBench: A Comprehensive Benchmark for Acoustic and Semantic Evaluation
Mar 30, 2026
MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions
Feb 9, 2026
MOVA: Towards Scalable and Synchronized Video-Audio Generation
← Previous