"au:"Tetsuji Ogawa"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Tetsuji Ogawa"" — arXiv2 Search

Showing 1–20 of 22 results

/ Date/ Name

Jan 10, 2023Video Surveillance System Incorporating Expert Decision-making Process: A Case Study on Detecting Calving Signs in Cattle Jan 10, 2023Deep Multi-stream Network for Video-based Calving Sign Detection Sep 9, 2023Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition Nov 2, 2022BECTRA: Transducer-based End-to-End ASR with BERT-Enhanced Encoder Nov 2, 2022InterMPL: Momentum Pseudo-Labeling with Intermediate CTC Loss Jun 9, 2025Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition Sep 1, 2023Remixing-based Unsupervised Source Separation from Scratch Jan 21, 2020Block-wise Scrambled Image Recognition Using Adaptation Network Sep 19, 2023Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech Recognition Apr 28, 2025A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models Oct 12, 2023A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, and Extraction Oct 29, 2022BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model Dec 20, 2020Exploring Effectiveness of Inter-Microtask Qualification Tests in Crowdsourcing Oct 26, 2020Improved Mask-CTC for Non-Autoregressive End-to-End ASR Mar 13, 2023Neural Diarization with Non-autoregressive Intermediate Attractors Nov 18, 2022Self-Remixing: Unsupervised Speech Separation via Separation and Remixing May 18, 2020Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict Oct 20, 2021An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR Oct 8, 2021Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units Mar 26, 2022Remix-cycle-consistent Learning on Adversarially Learned Separator for Accurate and Stable Unsupervised Speech Separation