Showing 1–20 of 22 results
/ Date/ Name
Jan 10, 2023Video Surveillance System Incorporating Expert Decision-making Process: A Case Study on Detecting Calving Signs in CattleJan 10, 2023Deep Multi-stream Network for Video-based Calving Sign DetectionSep 9, 2023Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech RecognitionNov 2, 2022BECTRA: Transducer-based End-to-End ASR with BERT-Enhanced EncoderNov 2, 2022InterMPL: Momentum Pseudo-Labeling with Intermediate CTC LossJun 9, 2025Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech RecognitionSep 1, 2023Remixing-based Unsupervised Source Separation from ScratchJan 21, 2020Block-wise Scrambled Image Recognition Using Adaptation NetworkSep 19, 2023Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech RecognitionApr 28, 2025A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation ModelsOct 12, 2023A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, and ExtractionOct 29, 2022BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language ModelDec 20, 2020Exploring Effectiveness of Inter-Microtask Qualification Tests in CrowdsourcingOct 26, 2020Improved Mask-CTC for Non-Autoregressive End-to-End ASRMar 13, 2023Neural Diarization with Non-autoregressive Intermediate AttractorsNov 18, 2022Self-Remixing: Unsupervised Speech Separation via Separation and RemixingMay 18, 2020Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask PredictOct 20, 2021An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASROct 8, 2021Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword UnitsMar 26, 2022Remix-cycle-consistent Learning on Adversarially Learned Separator for Accurate and Stable Unsupervised Speech Separation