Showing 161–180 of 235 results
/ Date/ Name
Oct 13, 2021Singer separation for karaoke content generationOct 7, 2021Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0Oct 1, 2021Large-scale ASR Domain Adaptation using Self- and Semi-supervised LearningSep 20, 2021TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage MethodSep 18, 2021SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker VerificationSep 16, 2021PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics TranscriptionSep 9, 2021BeamTransformer: Microphone Array-based Overlapping Speech DetectionAug 30, 2021ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language UnderstandingJul 27, 2021The CORSMAL benchmark for the prediction of the properties of containersJul 21, 2021StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice ConversionJul 20, 2021A Real-time Speaker Diarization System Based on Spatial SpectrumJul 14, 2021FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared TaskJul 14, 2021Multi-Task Audio Source SeparationApr 26, 2021Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion PredictionApr 6, 2021LT-LM: a novel non-autoregressive language model for single-shot lattice rescoringApr 2, 2021Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature RepresentationMar 28, 2021Quantifying Bias in Automatic Speech RecognitionMar 6, 2021Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-SpeechMar 3, 2021Multi-view Audio and Music ClassificationJan 19, 2021UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data