Showing 1–20 of 47 results
Date / Name
Apr 5, 2022 / Towards End-to-end Unsupervised Speech Recognition
Sep 5, 2018 / A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation
Oct 28, 2019 / Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning
Oct 29, 2024 / A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation
Oct 28, 2019 / Sequence-to-sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding
Jan 16, 2024 / Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective
Apr 6, 2022 / Simple and Effective Unsupervised Speech Synthesis
Mar 2, 2025 / UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation
Jan 13, 2026 / Ministral 3
Nov 1, 2020 / Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies
Oct 25, 2023 / Generative Pre-training for Speech with Flow Matching
May 17, 2023 / DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
Nov 2, 2018 / Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Jun 10, 2021 / Cross-Modal Discrete Representation Learning
Apr 1, 2022 / Monarch: Expressive Structured Matrices for Efficient and Accurate Training
Jan 15, 2004 / Evidence for large superhumps in TX Col and V4742 Sgr
Oct 7, 2025 / ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization
Apr 16, 2020 / REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets
May 10, 2021 / Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions
Jul 27, 2023 / How to Train Your YouTube Recommender to Avoid Unwanted Videos