Date / Name
Oct 16, 2018 / Deep neural network based i-vector mapping for speaker verification using short utterances
Feb 19, 2019 / A spelling correction model for end-to-end speech recognition
Mar 11, 2019 / Singing voice conversion with non-parallel data
Apr 2, 2024 / Effective internal language model training and fusion for factorized transducer model
Jul 27, 2020 / Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition
Dec 21, 2024 / Transducer-Llama: Integrating LLMs into Streamable Transducer-based Speech Recognition
Aug 8, 2020 / Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification
Jul 21, 2023 / Prompting Large Language Models with Speech Recognition Abilities
Dec 14, 2020 / REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling
Nov 4, 2022 / Biased Self-supervised learning for ASR
Jul 23, 2024 / Towards scalable efficient on-device ASR with transfer learning
Feb 22, 2022 / VADOI: Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition
Jan 21, 2025 / A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data
Sep 22, 2023 / Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model
Dec 15, 2022 / Improving Fast-slow Encoder based Transducer with Streaming Deliberation