arXiv2
arXiv2 Search: au:"Jiani Zheng"
Showing 1–4 of 4 results
Oct 21, 2025
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs
Jul 10, 2025
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology
Nov 18, 2023
Make Pixels Dance: High-Dynamic Video Generation
Jul 5, 2023
What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?