Date | Name
Dec 3, 2018 | Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning
May 24, 2022 | Reassessing Evaluation Practices in Visual Question Answering: A Case Study on Out-of-Distribution Generalization
Aug 31, 2016 | Measuring Machine Intelligence Through Visual Question Answering
Dec 1, 2017 | Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Jul 15, 2024 | Benchmarking Vision Language Models for Cultural Understanding
May 3, 2015 | VQA: Visual Question Answering
May 24, 2023 | An Examination of the Robustness of Reference-Free Image Captioning Evaluation Metrics
Jun 23, 2016 | Analyzing the Behavior of Visual Question Answering Models
Jul 23, 2024 | VisMin: Visual Minimal-Change Understanding
Mar 28, 2026 | Communicating about Space: Language-Mediated Spatial Integration Across Partial Views
Apr 26, 2017 | C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset
Jun 15, 2023 | Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding
Oct 4, 2023 | Improving Automatic VQA Evaluation Using Large Language Models
Mar 26, 2024 | Improving Text-to-Image Consistency via Automatic Prompt Optimization
Jun 10, 2025 | CulturalFrames: Assessing Cultural Expectation Alignment in Text-to-Image Models and Evaluation Metrics
Aug 22, 2025 | WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
May 12, 2023 | Measuring Progress in Fine-grained Vision-and-Language Understanding
Oct 8, 2018 | Overcoming Language Priors in Visual Question Answering with Adversarial Regularization
Aug 1, 2025 | The Promise of RL for Autoregressive Image Editing
Jul 10, 2024 | Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison