arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Ji Zhang"" — arXiv2 Search
Showing 1–4 of 4 results
/ Date
/ Name
Apr 25, 2025
SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models
Mar 20, 2025
IRef-VLA: A Benchmark for Interactive Referential Grounding with Imperfect Language in 3D Scenes
Nov 5, 2024
VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation
Nov 17, 2021
Achieving Human Parity on Visual Question Answering