"au:"Atsushi Ando"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Atsushi Ando"" — arXiv2 Search

Showing 1–19 of 19 results

/ Date/ Name

Oct 28, 2022On the Use of Modality-Specific Large-Scale Pre-Trained Encoders for Multimodal Sentiment Analysis Jun 27, 2024Factor-Conditioned Speaking-Style Captioning Aug 31, 2023Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness Tradeoff Feb 11, 2024Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis Jul 12, 2025Can We Really Repurpose Multi-Speaker ASR Corpus for Speaker Diarization?Sep 22, 2023NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization Jul 11, 2022Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data Mar 29, 2019Does the Lombard Effect Improve Emotional Communication in Noise? - Analysis of Emotional Speech Acted in Noise -Jun 4, 2023End-to-End Joint Target and Non-Target Speakers ASR Aug 30, 2024Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings Jul 1, 2024SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling May 30, 2025Pretraining Multi-Speaker Identification for Neural Speaker Diarization Jun 13, 2025Dissecting the Segmentation Model of End-to-End Diarization with Vector Clustering Jun 14, 2025Mitigating Non-Target Speaker Bias in Guided Speaker Embedding Jul 28, 2018Ultrafast Dynamics of Electron-phonon Coupling in Transition-metal Dichalcogenides Feb 14, 2025Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge Sep 9, 2024NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge Oct 9, 2024Mamba-based Segmentation Model for Speaker Diarization Oct 16, 2024Guided Speaker Embedding