Showing 1–20 of 42 results
/ Date/ Name
Aug 8, 2018End-to-end Speech Recognition with Word-based RNN Language ModelsNov 2, 2018Cycle-consistency training for end-to-end speech recognitionApr 19, 2021Advanced Long-context End-to-end Speech Recognition Using Context-expanded TransformersJan 16, 2025Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech RecognitionJun 8, 2017Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LMMay 15, 2018A Purely End-to-end System for Multi-speaker Speech RecognitionApr 30, 2019Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and TextJul 28, 2018Back-Translation-Style Data Augmentation for End-to-End ASRNov 12, 2018Multi-encoder multi-resolution framework for end-to-end speech recognitionNov 7, 2018Analysis of Multilingual Sequence-to-Sequence speech recognition systemsNov 7, 2018CNN-based MultiChannel End-to-End Speech Recognition for everyday home environmentsNov 26, 2020Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-TrainingOct 11, 2021Advancing Momentum Pseudo-Labeling with Conformer and Initialization StrategyMar 30, 2018ESPnet: End-to-End Speech Processing ToolkitJan 11, 2017Attention-Based Multimodal Fusion for Video DescriptionNov 12, 2018Stream attention-based multi-array end-to-end speech recognitionNov 12, 2018Vectorization of hypotheses and speech for faster beam search in encoder decoder-based speech recognitionApr 7, 2021Capturing Multi-Resolution Context by Dilated Self-AttentionSep 21, 2016Joint CTC-Attention based End-to-End Speech Recognition using Multi-task LearningNov 1, 2024Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval