Showing 1–14 of 14 results
/ Date/ Name
Jun 12, 2024Multimodal Representation Loss Between Timed Text and Audio for Regularized Speech SeparationOct 19, 2025Adaptive Deterministic Flow Matching for Target Speaker ExtractionOct 19, 2025Towards Real-Time Generative Speech Restoration with Flow-MatchingOct 28, 2020Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech EnhancementApr 6, 2020WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech EnhancementJul 18, 2025TGIF: Talker Group-Informed Familiarization of Target Speaker ExtractionApr 8, 2021MetricGAN+: An Improved Version of MetricGAN for Speech EnhancementJan 15, 2024On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech EnhancementJan 25, 2026AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and ThinkingNov 10, 2021OSSEM: one-shot speaker adaptive speech enhancement using meta learningOct 21, 2025That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code GenerationJun 18, 2020Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processingJun 9, 2021Speech Recovery for Real-World Self-powered Intermittent DevicesNov 2, 2022Inference and Denoise: Causal Inference-based Neural Speech Enhancement