Showing 1–10 of 10 results
/ Date/ Name
Feb 27, 2023Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face VideoFeb 17, 2024CoLLaVO: Crayon Large Language and Vision mOdelMar 7, 2024Persona Extraction Through Semantic Similarity for Emotional Support Conversation GenerationMay 24, 2024Meteor: Mamba-based Traversal of Rationale for Large Language and Vision ModelsJun 18, 2024TroL: Traversal of Layers for Large Language and Vision ModelsJun 12, 2024Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face ConversationMar 8, 2025Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech RepresentationsMar 12, 2024MoAI: Mixture of All Intelligence for Large Language and Vision ModelsSep 2, 2024Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and LanguageSep 23, 2024Phantom of Latent for Large Language and Vision Models