"au:"Alexei Baevski"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Alexei Baevski"" — arXiv2 Search

Showing 21–39 of 39 results

/ Date/ Name

Jul 8, 2021Improved Language Identification Through Cross-Lingual Self-Supervised Learning Apr 11, 2022Unified Speech-Text Pre-training for Speech Translation and Recognition Nov 17, 2021XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale Oct 24, 2020Multilingual Speech Translation with Efficient Finetuning of Pretrained Models Apr 14, 2021Large-Scale Self- and Semi-Supervised Learning for Speech Translation Apr 5, 2022Towards End-to-end Unsupervised Speech Recognition Mar 1, 2022Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training May 22, 2023Scaling Speech Technology to 1,000+ Languages Nov 15, 2022Introducing Semantics into Speech Encoders Apr 27, 2022Offline Visual Representation Learning for Embodied Navigation Jun 27, 2022Wav2Vec-Aug: Improved self-supervised training with limited data Oct 24, 2020A Comparison of Discrete Latent Variable Models for Speech Representation Learning Apr 2, 2021Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training Mar 14, 2023OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav Oct 12, 2023Toward Joint Language Modeling for Speech Units and Text Apr 6, 2022Simple and Effective Unsupervised Speech Synthesis Sep 23, 2021Simple and Effective Zero-shot Cross-lingual Phoneme Recognition Feb 1, 2021Generative Spoken Language Modeling from Raw Audio Jul 31, 2024The Llama 3 Herd of Models