"au:"Qunyi Xie"" — arXiv2 SearchShowing 1–8 of 8 results
/ Date/ Name
Apr 2, 2025On Data Synthesis and Post-training for Visual Abstract ReasoningJun 2, 20253DRS: MLLMs Need 3D-Aware Representation Supervision for Scene UnderstandingJul 24, 2023MataDoc: Margin and Text Aware Document Dewarping for Arbitrary BoundaryMay 19, 2023Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document UnderstandingOct 23, 2024Theorem-Validated Reverse Chain-of-Thought Problem Generation for Geometric ReasoningFeb 4, 2026ERNIE 5.0 Technical ReportMay 31, 2024StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and BeyondNov 26, 2025Agentic Learner with Grow-and-Refine Multimodal Semantic Memory