Showing 1–18 of 18 results
/ Date/ Name
Feb 15, 2024On the Vulnerability of LLM/VLM-Controlled RoboticsMay 2, 2025VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video UnderstandingMar 10, 2026MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero DataJun 9, 2023iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement LearningNov 23, 2025MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language ModelsSep 30, 2023LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured EnvironmentsJun 18, 2025Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form GenerationMar 26, 2026SABER: A Stealthy Agentic Black-Box Attack Framework for Vision-Language-Action ModelsOct 31, 2020FireCommander: An Interactive, Probabilistic Multi-agent Environment for Heterogeneous Robot TeamsApr 22, 2026Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon TasksApr 7, 2026Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent SkillsJun 16, 2024AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language ModelsJan 4, 2025A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and ChallengesNov 19, 2025First Frame Is the Place to Go for Video Content CustomizationOct 23, 2023HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language ModelsApr 4, 2024AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying ScalesJun 23, 2025CaughtCheating: Is Your MLLM a Good Cheating Detective? Exploring the Boundary of Visual Perception and ReasoningSep 26, 2024FALCON: Future-Aware Learning with Contextual Object-Centric Pretraining for UAV Action Recognition