Showing 21–39 of 39 results
/ Date/ Name
Jul 8, 2021Improved Language Identification Through Cross-Lingual Self-Supervised LearningApr 11, 2022Unified Speech-Text Pre-training for Speech Translation and RecognitionNov 17, 2021XLS-R: Self-supervised Cross-lingual Speech Representation Learning at ScaleOct 24, 2020Multilingual Speech Translation with Efficient Finetuning of Pretrained ModelsApr 14, 2021Large-Scale Self- and Semi-Supervised Learning for Speech TranslationApr 5, 2022Towards End-to-end Unsupervised Speech RecognitionMar 1, 2022Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-TrainingMay 22, 2023Scaling Speech Technology to 1,000+ LanguagesNov 15, 2022Introducing Semantics into Speech EncodersApr 27, 2022Offline Visual Representation Learning for Embodied NavigationJun 27, 2022Wav2Vec-Aug: Improved self-supervised training with limited dataOct 24, 2020A Comparison of Discrete Latent Variable Models for Speech Representation LearningApr 2, 2021Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-TrainingMar 14, 2023OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNavOct 12, 2023Toward Joint Language Modeling for Speech Units and TextApr 6, 2022Simple and Effective Unsupervised Speech SynthesisSep 23, 2021Simple and Effective Zero-shot Cross-lingual Phoneme RecognitionFeb 1, 2021Generative Spoken Language Modeling from Raw AudioJul 31, 2024The Llama 3 Herd of Models