"au:"Jianwei Yu"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Jianwei Yu"" — arXiv2 Search

Showing 1–18 of 18 results

/ Date/ Name

Jan 26, 2026VIBEVOICE-ASR Technical Report Jun 1, 2025CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching May 19, 2025MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix Jan 29, 2024Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings Dec 16, 2023SECap: Speech Emotion Captioning with Large Language Model Sep 25, 2023AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data Aug 14, 2023The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track Dec 1, 2022High Fidelity Speech Enhancement with Band-split RNN Jul 20, 2022Diffsound: Discrete Diffusion Model for Text-to-sound Generation Mar 29, 2022Integrating Lattice-Free MMI into End-to-End Speech Recognition Mar 28, 2022On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition Jan 6, 2022Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model Nov 29, 2021Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition Aug 30, 2021ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding Nov 16, 2020Audio-visual Multi-channel Integration and Recognition of Overlapped Speech May 18, 2020Audio-visual Multi-channel Recognition of Overlapped Speech Jan 6, 2020Audio-visual Recognition of Overlapped speech for the LRS2 dataset Nov 8, 2019Adversarial Attacks on GMM i-vector based Speaker Verification Systems