Showing 1–20 of 128 results
/ Date/ Name
Sep 3, 2018LRS3-TED: a large-scale dataset for visual speech recognitionSep 6, 2018Deep Audio-Visual Speech RecognitionDec 5, 2019VoxSRC 2019: The first VoxCeleb Speaker Recognition ChallengeJun 24, 2019Who said that?: Audio-visual speaker diarisation of real-world meetingsMay 18, 2020Metric Learning for Keyword SpottingAug 6, 2016Signs in time: Encoding human motion as a temporal imageAug 10, 2020Self-Supervised Learning of Audio-Visual Objects from VideoOct 29, 2020The ins and outs of speaker recognition: lessons from VoxSRC 2020Nov 10, 2020Supervised attention for speaker recognitionApr 29, 2020Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervisionJul 2, 2020Spot the conversation: speaker diarisation in the wildSep 21, 2018Perfect match: Improved cross-modal embeddings for audio-visual synchronisationJun 15, 2018Deep Lip Reading: a comparison of models and an online applicationApr 11, 2018The Conversation: Deep Audio-Visual Speech EnhancementNov 1, 2022Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language RecognitionOct 30, 2023Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion ModelMay 14, 2020FaceFilter: Audio-visual speech separation using still imagesJun 25, 2019Naver at ActivityNet Challenge 2019 -- Task B Active Speaker Detection (AVA)Feb 20, 2020Disentangled Speech Embeddings using Cross-modal Self-supervisionMar 26, 2020In defence of metric learning for speaker recognition