Date          Name
Dec 24, 2019  Barycenters of Natural Images -- Constrained Wasserstein Barycenters for Image Morphing
May 29, 2018  Adversarial Noise Attacks of Deep Learning Architectures -- Stability Analysis via Sparse Modeled Signals
Jan 23, 2020  Ada-LISTA: Learned Solvers Adaptive to Varying Models
May 8, 2022   Multimodal Semi-Supervised Learning for Text Recognition
Jan 18, 2023  CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
Apr 25, 2018  Multi-Layer Sparse Coding: The Holistic Way
Jun 2, 2018   On Multi-Layer Basis Pursuit, Efficient Algorithms and Convolutional Neural Networks
Dec 20, 2020  Sequence-to-Sequence Contrastive Learning for Text Recognition
Jun 28, 2020  When and How Can Deep Generative Models be Inverted?
Dec 23, 2020  On Calibration of Scene-Text Recognition Models
Jan 18, 2023  Towards Models that Can See and Read
Dec 11, 2024  DocVLM: Make Your VLM an Efficient Reader
Nov 7, 2024   TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
Feb 21, 2026  DREAM: Deep Research Evaluation with Agentic Metrics
Jan 7, 2024   GRAM: Global Reasoning for Multi-Page VQA
Sep 14, 2022  Out-of-Vocabulary Challenge Report
Feb 8, 2024   Question Aware Vision Transformer for Multimodal Reasoning