"au:"Juexiao Zhang"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Juexiao Zhang"" — arXiv2 Search

Showing 1–13 of 13 results

/ Date/ Name

Oct 15, 2024Multiview Scene Graph Mar 8, 2024ActFormer: Scalable Collaborative Perception via Active Queries Jun 20, 2025Co-VisiON: Co-Visibility ReasONing on Sparse Image Sets of Indoor Scenes Jun 11, 2025From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models Nov 26, 2024CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos Jun 25, 2024Tell Me Where You Are: Multimodal LLMs Meet Place Recognition Oct 6, 2023URLOST: Unsupervised Representation Learning without Stationarity or Topology Oct 11, 2024VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model Feb 26, 2026CRAG: Can 3D Generative Models Help 3D Assembly?Mar 19, 2024LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images Feb 19, 2026Understanding Nature Engagement Experiences of Blind People Oct 9, 2019Word Embedding Visualization Via Dictionary Learning Jan 17, 2025When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis