"au:"Xixin Wu"" — arXiv2 Search

Showing 1–20 of 93 results

Jan 13, 2021  Should Ensemble Members Be Calibrated?
Oct 27, 2022  Towards High-Quality Neural TTS for Low-Resource Languages by Learning Compact Speech Representations
Mar 31, 2022  Neural Architecture Search for Speech Emotion Recognition
Mar 2, 2022  A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS
Sep 22, 2022  A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS
Mar 14, 2023  Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection
Mar 14, 2023  A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition
Nov 3, 2020  Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion
Feb 4, 2022  The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge
Jun 18, 2022  Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion
Aug 31, 2023  QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning
May 24, 2023  SAIL: Search-Augmented Instruction Learning
May 25, 2023  Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator
Feb 2, 2023  Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition
Jun 4, 2024  SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models
Jun 5, 2024  Addressing Index Collapse of Large-Codebook Speech Tokenizer with Dual-Decoding Product-Quantized Variational Auto-Encoder
Aug 29, 2023  Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?
Jan 2, 2025  learning discriminative features from spectrograms using center loss for speech emotion recognition
Dec 9, 2024  Not All Errors Are Equal: Investigation of Speech Recognition Errors in Alzheimer's Disease Detection
Sep 13, 2024  Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions