"au:"Joon Son Chung"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Joon Son Chung"" — arXiv2 Search

Showing 1–20 of 128 results

/ Date/ Name

Sep 3, 2018LRS3-TED: a large-scale dataset for visual speech recognition Sep 6, 2018Deep Audio-Visual Speech Recognition Dec 5, 2019VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge Jun 24, 2019Who said that?: Audio-visual speaker diarisation of real-world meetings May 18, 2020Metric Learning for Keyword Spotting Aug 6, 2016Signs in time: Encoding human motion as a temporal image Aug 10, 2020Self-Supervised Learning of Audio-Visual Objects from Video Oct 29, 2020The ins and outs of speaker recognition: lessons from VoxSRC 2020 Nov 10, 2020Supervised attention for speaker recognition Apr 29, 2020Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervision Jul 2, 2020Spot the conversation: speaker diarisation in the wild Sep 21, 2018Perfect match: Improved cross-modal embeddings for audio-visual synchronisation Jun 15, 2018Deep Lip Reading: a comparison of models and an online application Apr 11, 2018The Conversation: Deep Audio-Visual Speech Enhancement Nov 1, 2022Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition Oct 30, 2023Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion Model May 14, 2020FaceFilter: Audio-visual speech separation using still images Jun 25, 2019Naver at ActivityNet Challenge 2019 -- Task B Active Speaker Detection (AVA)Feb 20, 2020Disentangled Speech Embeddings using Cross-modal Self-supervision Mar 26, 2020In defence of metric learning for speaker recognition