Date / Name
Sep 29, 2025 / Paired by the Teacher: Turning Unpaired Data into High-Fidelity Pairs for Low-Resource Text Generation
Oct 7, 2025 / Latent Speech-Text Transformer
Aug 13, 2020 / Incorporating Broad Phonetic Information for Speech Enhancement
Jul 19, 2022 / ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
Dec 5, 2024 / CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing
Dec 16, 2025 / Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization
Jul 25, 2021 / A Study on Speech Enhancement Based on Diffusion Probabilistic Model
Nov 15, 2020 / Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information
Feb 24, 2022 / Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge
Feb 24, 2025 / Mutual Reinforcement of LLM Dialogue Synthesis and Summarization Capabilities for Few-Shot Dialogue Summarization
Oct 9, 2021 / An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Sep 12, 2024 / SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer
Aug 6, 2025 / Enhancing Dialogue Annotation with Speaker Characteristics Leveraging a Frozen LLM
Feb 10, 2022 / Conditional Diffusion Probabilistic Model for Speech Enhancement
Jun 7, 2015 / A Multi-layered Acoustic Tokenizing Deep Neural Network (MAT-DNN) for Unsupervised Discovery of Linguistic Units and Generation of High Quality Features
Dec 17, 2021 / Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Jun 18, 2020 / Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing