Date          Name
Nov 22, 2022  Disentangled Feature Learning for Real-Time Neural Speech Coding
May 27, 2025  Text-Queried Audio Source Separation via Hierarchical Modeling
Jul 3, 2022   Towards Error-Resilient Neural Speech Coding
Jul 7, 2022   Cross-Scale Vector Quantization for Scalable Neural Speech Coding
Mar 15, 2025  Universal Speech Token Learning via Low-Bitrate Neural Codec and Pretrained Representations
Apr 8, 2021   Phoneme-based Distribution Regularization for Speech Enhancement
Jul 4, 2022   Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation
Jan 24, 2022  End-to-End Neural Speech Coding for Real-Time Communications
Dec 17, 2020  Interactive Speech and Noise Modeling for Speech Enhancement
Sep 12, 2017  End-to-End United Video Dehazing and Detection
Jul 20, 2017  An All-in-One Network for Dehazing and Beyond
Jul 18, 2022  Latent-Domain Predictive Neural Speech Coding
Jan 20, 2026  Hierarchical Long Video Understanding with Audiovisual Entity Cohesion and Agentic Search
Oct 13, 2023  Low-latency Speech Enhancement via Speech Token Generation
May 26, 2023  ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression
Feb 26, 2023  Contrast-PLC: Contrastive Learning for Packet Loss Concealment
Feb 21, 2023  DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Feb 20, 2023  Improving Speech Enhancement via Event-based Query
Feb 25, 2023  Time-Variance Aware Real-Time Speech Enhancement
Jan 29, 2024  Masked Audio Modeling with CLAP and Multi-Objective Learning