arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Dasen Dai"" — arXiv2 Search
Showing 1–7 of 7 results
/ Date
/ Name
Mar 31, 2026
Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis
Apr 10, 2026
UIPress: Bringing Optical Token Compression to UI-to-Code Generation
Jan 5, 2026
FMVP: Masked Flow Matching for Adversarial Video Purification
Mar 2, 2026
VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models
Mar 24, 2026
PaperVoyager : Building Interactive Web with Visual Language Models
Feb 23, 2025
Human Cognitive Benchmarks Reveal Foundational Visual Gaps in MLLMs
May 6, 2026
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents