"au:"Xiyang Wu"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Xiyang Wu"" — arXiv2 Search

Showing 1–18 of 18 results

/ Date/ Name

Feb 15, 2024On the Vulnerability of LLM/VLM-Controlled Robotics May 2, 2025VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding Mar 10, 2026MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data Jun 9, 2023iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning Nov 23, 2025MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models Sep 30, 2023LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments Jun 18, 2025Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation Mar 26, 2026SABER: A Stealthy Agentic Black-Box Attack Framework for Vision-Language-Action Models Oct 31, 2020FireCommander: An Interactive, Probabilistic Multi-agent Environment for Heterogeneous Robot Teams Apr 22, 2026Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks Apr 7, 2026Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills Jun 16, 2024AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models Jan 4, 2025A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges Nov 19, 2025First Frame Is the Place to Go for Video Content Customization Oct 23, 2023HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models Apr 4, 2024AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales Jun 23, 2025CaughtCheating: Is Your MLLM a Good Cheating Detective? Exploring the Boundary of Visual Perception and Reasoning Sep 26, 2024FALCON: Future-Aware Learning with Contextual Object-Centric Pretraining for UAV Action Recognition