arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Dongyang Dai"" — arXiv2 Search
Showing 1–9 of 9 results
/ Date
/ Name
May 26, 2020
Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
Jan 2, 2025
learning discriminative features from spectrograms using center loss for speech emotion recognition
Jan 2, 2025
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT
Mar 8, 2024
RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction
Oct 7, 2021
Cloning one's voice using very limited data in the wild
Jun 20, 2020
Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams
Dec 21, 2020
Unsupervised Cross-Lingual Speech Emotion Recognition Using DomainAdversarial Neural Network
Oct 26, 2020
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition
Aug 28, 2024
Multi-modal Adversarial Training for Zero-Shot Voice Cloning