"au:"Ziyu Ma"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Ziyu Ma"" — arXiv2 Search

Showing 1–10 of 10 results

/ Date/ Name

Feb 4, 2024GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering Jun 18, 2024DrVideo: Document Retrieval Based Long Video Understanding Oct 16, 2021Hybrid Mutimodal Fusion for Dimensional Emotion Recognition Aug 21, 2025An Empirical Study on How Video-LLMs Answer Video Questions Apr 17, 2026CoEvolve: Training LLM Agents via Agent-Data Mutual Evolution Apr 9, 2026SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Jan 8, 2026Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Nov 11, 2025Where and What Matters: Sensitivity-Aware Task Vectors for Many-Shot Multimodal In-Context Learning Jul 5, 2022Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation Sep 25, 2025Tree Search for LLM Agent Reinforcement Learning