Date / Name
May 15, 2015  WhittleSearch: Interactive Image Search with Relative Attribute Feedback
May 15, 2015  Discovering Attribute Shades of Meaning with the Crowd
Dec 28, 2018  Artistic Object Recognition by Unsupervised Style Adaptation
May 8, 2018  Image Retrieval with Mixed Initiative and Multimodal Feedback
Oct 31, 2019  Predicting the Politics of an Image Using Webly Supervised Data
Jul 10, 2017  Automatic Understanding of Image and Video Advertisements
Oct 2, 2024  Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval
Dec 9, 2022  Contrastive View Design Strategies to Enhance Robustness to Domain Shifts in Downstream Object Detection
Mar 19, 2026  Learning Consistent Temporal Grounding between Related Tasks in Sports Coaching
May 12, 2022  Weakly-Supervised Action Detection Guided by Audio Narration
Dec 5, 2021  Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal Characterization
Mar 29, 2021  Domain-robust VQA with diverse datasets and methods but no target labels
Aug 20, 2015  Seeing Behind the Camera: Identifying the Authorship of a Photograph
Sep 15, 2024  Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition
Apr 19, 2025  A Multimodal Recaptioning Framework to Account for Perceptual Diversity Across Languages in Vision-Language Modeling
Mar 16, 2023  VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection
Mar 20, 2023  Boosting Weakly Supervised Object Detection using Fusion and Priors from Hallucinated Depth
Sep 23, 2022  Comparison of Lexical Alignment with a Teachable Robot in Human-Robot and Human-Human-Robot Interactions
Sep 24, 2023  Semi-Supervised Domain Generalization for Object Detection via Language-Guided Feature Alignment
Apr 25, 2023  Hypernymization of named entity-rich captions for grounding-based multi-modal pretraining