Showing 1–20 of 221 results
/ Date/ Name
Apr 8, 2020Multi-Target Emotional Voice Conversion With Neural VocodersJun 20, 2020Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic PosteriorgramsNov 29, 2021Mixed Precision DNN Qunatization for Overlapped Speech Separation and RecognitionOct 28, 2020Replay and Synthetic Speech Detection with Res2net ArchitectureSep 7, 2021Countering Online Hate Speech: An NLP PerspectiveJul 19, 2021Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech AttacksApr 8, 2021Towards Multi-Scale Style Control for Expressive Speech SynthesisJan 19, 2021Creation and Evaluation of a Pre-tertiary Artificial Intelligence (AI) CurriculumApr 14, 2021Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech SynthesisFeb 18, 2022VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversionAug 28, 2022Bayesian Neural Network Language Modeling for Speech RecognitionAug 18, 2022Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice ConversionMar 31, 2022NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention MechanismMar 31, 2022Neural Architecture Search for Speech Emotion RecognitionAug 10, 2022Towards Cross-speaker Reading Style Transfer on Audiobook DatasetOct 19, 2019Adversarial Attacks on Spoofing Countermeasures of automatic speaker verificationNov 8, 2019Adversarial Attacks on GMM i-vector based Speaker Verification SystemsMar 2, 2022A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTSOct 25, 2022Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using $β$-VAESep 22, 2022A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS