arXiv2
arXiv2 Search: au:"Xiangtai Li"
Showing 1–6 of 6 results
Dec 11, 2025
WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
Oct 30, 2025
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark
Oct 21, 2025
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs
Jul 10, 2025
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology
Jul 2, 2025
Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning
Jan 4, 2024
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model