Date | Name
Mar 31, 2022 | PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations
Mar 30, 2022 | Span Classification with Structured Information for Disfluency Detection in Spoken Utterances
Mar 30, 2022 | Multiple Narrow-band signals Direction Finding with TMLA by Nonuniform Period Modulation
Mar 29, 2022 | Integrating Lattice-Free MMI into End-to-End Speech Recognition
Mar 28, 2022 | On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition
Mar 25, 2022 | DeLoRes: Decorrelating Latent Spaces for Low-Resource Audio Representation Learning
Mar 25, 2022 | Automatic Song Translation for Tonal Languages
Mar 13, 2022 | CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification
Mar 7, 2022 | Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language
Feb 15, 2022 | General-purpose, long-context autoregressive modeling with Perceiver AR
Feb 8, 2022 | Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge
Jan 6, 2022 | Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
Nov 29, 2021 | Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition
Nov 28, 2021 | Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information
Oct 27, 2021 | LSTM-RPA: A Simple but Effective Long Sequence Prediction Algorithm for Music Popularity Prediction
Oct 18, 2021 | Personalized Speech Enhancement: New Models and Comprehensive Evaluation
Oct 17, 2021 | VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis
Oct 14, 2021 | DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utterances
Oct 14, 2021 | Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech
Oct 14, 2021 | Revisiting IPA-based Cross-lingual Text-to-speech