arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Jianqiang Wan"" — arXiv2 Search
Showing 1–8 of 8 results
/ Date
/ Name
Mar 28, 2024
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
May 30, 2020
Super-BPD: Super Boundary-to-Pixel Direction for Fast Image Segmentation
Mar 11, 2026
CodePercept: Code-Grounded Visual STEM Perception for MLLMs
Feb 22, 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Nov 26, 2025
Qwen3-VL Technical Report
Dec 3, 2024
CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy
Apr 29, 2022
Vision-Language Pre-Training for Boosting Scene Text Detectors
Feb 19, 2025
Qwen2.5-VL Technical Report