Date / Name
Oct 27, 2021 / LSTM-RPA: A Simple but Effective Long Sequence Prediction Algorithm for Music Popularity Prediction
Oct 18, 2021 / Personalized Speech Enhancement: New Models and Comprehensive Evaluation
Oct 17, 2021 / VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis
Oct 14, 2021 / DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utterances
Oct 14, 2021 / Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech
Oct 14, 2021 / Revisiting IPA-based Cross-lingual Text-to-speech
Oct 13, 2021 / Singer separation for karaoke content generation
Oct 7, 2021 / Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0
Oct 1, 2021 / Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Sep 20, 2021 / TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method
Sep 18, 2021 / SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification
Sep 16, 2021 / PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
Sep 9, 2021 / BeamTransformer: Microphone Array-based Overlapping Speech Detection
Aug 30, 2021 / ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding
Jul 27, 2021 / The CORSMAL benchmark for the prediction of the properties of containers
Jul 21, 2021 / StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Jul 20, 2021 / A Real-time Speaker Diarization System Based on Spatial Spectrum
Jul 14, 2021 / FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task
Jul 14, 2021 / Multi-Task Audio Source Separation
Apr 26, 2021 / Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction