"au:"Helen Meng"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Helen Meng"" — arXiv2 Search

Showing 1–20 of 221 results

/ Date/ Name

Apr 8, 2020Multi-Target Emotional Voice Conversion With Neural Vocoders Jun 20, 2020Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams Nov 29, 2021Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition Oct 28, 2020Replay and Synthetic Speech Detection with Res2net Architecture Sep 7, 2021Countering Online Hate Speech: An NLP Perspective Jul 19, 2021Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks Apr 8, 2021Towards Multi-Scale Style Control for Expressive Speech Synthesis Jan 19, 2021Creation and Evaluation of a Pre-tertiary Artificial Intelligence (AI) Curriculum Apr 14, 2021Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis Feb 18, 2022VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion Aug 28, 2022Bayesian Neural Network Language Modeling for Speech Recognition Aug 18, 2022Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion Mar 31, 2022NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism Mar 31, 2022Neural Architecture Search for Speech Emotion Recognition Aug 10, 2022Towards Cross-speaker Reading Style Transfer on Audiobook Dataset Oct 19, 2019Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification Nov 8, 2019Adversarial Attacks on GMM i-vector based Speaker Verification Systems Mar 2, 2022A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS Oct 25, 2022Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using $β$-VAE Sep 22, 2022A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS