"au:"Jagadeesh Balam"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Jagadeesh Balam"" — arXiv2 Search

Showing 1–20 of 45 results

/ Date/ Name

Oct 23, 2020Improving Noise Robustness of an End-to-End Neural Model for Automatic Speech Recognition Apr 5, 2021Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition Jul 22, 2021CarneliNet: Neural Mixture Model for Automatic Speech Recognition Oct 27, 2022A Compact End-to-End Model with Local and Global Context for Spoken Language Identification Jul 22, 2024Schrödinger Bridge for Generative Speech Enhancement Oct 23, 2024VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning Jul 13, 2023Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling Jun 28, 2024Less is More: Accurate Speech Recognition & Translation without Web-Scale Data Aug 23, 2024NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks Jul 29, 2024Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models Sep 18, 2023Investigating End-to-End ASR Architectures for Long Form Audio Transcription Apr 5, 2021SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition May 19, 2025Granary: Speech Recognition and Translation Dataset in 25 European Languages Oct 18, 2023The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System Oct 18, 2023Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation Mar 14, 2024Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer Dec 27, 2023Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition May 21, 2025SALM-Duplex: Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model Sep 17, 2025Canary-1B-v2 & Parakeet-TDT-0.6B-v3: Efficient and High-Performance Models for Multilingual ASR and AST Feb 27, 2026Chunk-wise Attention Transducers for Fast and Accurate Streaming Speech-to-Text