"au:"Ruchao Fan"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Ruchao Fan"" — arXiv2 Search

Showing 1–20 of 27 results

/ Date/ Name

Aug 8, 2020Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification Apr 15, 2023A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition Oct 16, 2022Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding Jun 18, 2021An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition Dec 2, 2024AlignFormer: Modality Matching Can Achieve Better Zero-shot Instruction-Following Speech-LLM Jun 16, 2022DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR Apr 28, 2023Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR Jun 15, 2024Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models Oct 28, 2020CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition Jun 15, 2024SOA: Reducing Domain Mismatch in SSL Pipeline by Speech Only Adaptation for Low Resource ASR Feb 14, 2024UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models Oct 16, 2022CTCBERT: Advancing Hidden-unit BERT with CTC Objectives Feb 12, 2021Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-training and Its Application to Children's ASR Sep 4, 2025OleSpeech-IV: A Large-Scale Multispeaker and Multilingual Conversational Speech Dataset with Diverse Topics Nov 20, 2025Train Short, Infer Long: Speech-LLM Enables Zero-Shot Streamable Joint ASR and Diarization on Long Audio Jun 4, 2025Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model Oct 7, 2024CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation Jun 4, 2025SLM-S2ST: A multimodal language model for direct speech-to-speech translation Nov 13, 2018An Online Attention-based Model for Speech Recognition Jun 18, 2021Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System