Date          Name
Apr 21, 2026  SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models
Apr 22, 2025  Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation
Dec 16, 2024  Transparent and Coherent Procedural Mistake Detection
Oct 31, 2024  Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use
Dec 17, 2023  Bridging Language and Action: A Survey of Language-Conditioned Robot Manipulation
Nov 2, 2023   MetaReVision: Meta-Learning with Retrieval for Visually Grounded Compositional Concept Acquisition
Nov 1, 2023   Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?
Oct 19, 2023  CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation
Oct 12, 2023  Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation
Sep 21, 2023  LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent
Jul 5, 2023   Human Inspired Progressive Alignment and Comparative Learning for Grounded Word Acquisition
Jun 14, 2023  World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models
Nov 9, 2022   Prompting Large Pre-trained Vision-Language Models For Compositional Concept Learning
Oct 22, 2022  DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents
Oct 22, 2022  DANLI: Deliberative Agent for Following Natural Language Instructions
Mar 25, 2022  Learning to Mediate Disparities Towards Pragmatic Communication
Jan 23, 2022  Partition-Based Active Learning for Graph Neural Networks
Sep 13, 2021  MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks
Sep 10, 2021  Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding
Apr 21, 2020  Experience Grounds Language