"au:"Jindong Gu"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Jindong Gu"" — arXiv2 Search

Showing 1–7 of 7 results

/ Date/ Name

Jul 7, 2025Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Jun 23, 2025AViLA: Asynchronous Vision-Language Agent for Streaming Multimodal Data Interaction Apr 2, 2025On the Role of Feedback in Test-Time Scaling of Agentic AI Workflows Sep 28, 2024Visual Question Decomposition on Multimodal Large Language Models Jul 24, 2023A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models Apr 17, 2023Towards Robust Prompts on Vision-Language Models Jul 25, 2022SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness