Date | Name
Nov 22, 2022 | COVID-Net Assistant: A Deep Learning-Driven Virtual Assistant for COVID-19 Symptom Prediction and Recommendation
Nov 18, 2022 | Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis
Nov 15, 2022 | Hybrid Transformers for Music Source Separation
Nov 14, 2022 | SNIPER Training: Single-Shot Sparse Training for Text-to-Speech
Nov 11, 2022 | Speech-to-Speech Translation For A Real-world Unwritten Language
Nov 8, 2022 | SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Nov 8, 2022 | Pushing the limits of self-supervised speaker verification using regularized distillation framework
Nov 2, 2022 | data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup
Nov 1, 2022 | SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation
Oct 27, 2022 | Multimodal Transformer Distillation for Audio-Visual Synchronization
Oct 18, 2022 | Simple and Effective Unsupervised Speech Translation
Oct 3, 2022 | Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection
Aug 28, 2022 | Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Aug 16, 2022 | Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition
Jul 29, 2022 | Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition
Jul 20, 2022 | Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Jun 7, 2022 | LegoNN: Building Modular Encoder-Decoder Models
Jun 3, 2022 | Constraining Gaussian processes for physics-informed acoustic emission mapping
May 30, 2022 | StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis
May 16, 2022 | PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification