"au:"Meng Ge"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Meng Ge"" — arXiv2 Search

Showing 1–20 of 41 results

/ Date/ Name

May 22, 2025PCMamba: Physics-Informed Cross-Modal State Space Model for Dual-Camera Compressive Hyperspectral Imaging May 21, 2025FRN: Fractal-Based Recursive Spectral Reconstruction Network Sep 17, 2025Process-Supervised Reinforcement Learning for Interactive Multimodal Tool-Use Agents Nov 19, 2020Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals Feb 21, 2022L-SpEx: Localized Target Speaker Extraction Jul 15, 2022MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources Oct 9, 2022VCSE: Time-Domain Visual-Contextual Speaker Extraction Network Mar 9, 2024sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks Jan 5, 2024Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio Sep 15, 2023Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech Aug 31, 2024Progressive Residual Extraction based Pre-training for Speech Representation Learning Dec 22, 2024Time-Graph Frequency Representation with Singular Value Decomposition for Neural Speech Enhancement Dec 26, 2023The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge Sep 19, 2023USED: Universal Speaker Extraction and Diarization Sep 29, 2025Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis Jan 24, 2025Efficient Emotion and Speaker Adaptation in LLM-Based TTS via Characteristic-Specific Partial Fine-Tuning Sep 13, 2023PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network Mar 31, 2022A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction Sep 24, 2024WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction Jan 5, 2025Reducing the Gap Between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module