Showing 161–180 of 223 results
/ Date/ Name
Apr 5, 2021AST: Audio Spectrogram TransformerApr 2, 2021Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature RepresentationMar 28, 2021Quantifying Bias in Automatic Speech RecognitionMar 6, 2021Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-SpeechMar 3, 2021Multi-view Audio and Music ClassificationJan 19, 2021UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled DataJan 9, 2021Coupling a generative model with a discriminative learning framework for speaker verificationDec 24, 2020Unsupervised neural adaptation model based on optimal transport for spoken language identificationDec 17, 2020The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networksNov 3, 2020Unsupervised Pattern Discovery from Thematic Speech Archives Based on Multilingual Bottleneck FeaturesOct 22, 2020Similarity Analysis of Self-Supervised Speech RepresentationsOct 18, 2020Self-Attention Generative Adversarial Network for Speech EnhancementAug 22, 2020A Efficient Multimodal Framework for Large Scale Emotion Recognition by Fusing Music and Electrodermal Activity SignalsJul 29, 2020Exploiting Cross-Lingual Knowledge in Unsupervised Acoustic Modeling for Low-Resource LanguagesJul 25, 2020Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware ModelingJul 25, 2020Adaptive music: Automated music composition and distributionMay 18, 2020Audio-visual Multi-channel Recognition of Overlapped SpeechMay 15, 2020WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPUMay 14, 2020Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party ScenarioApr 30, 2020Jukebox: A Generative Model for Music