"au:"Kaichen Zhang"" — arXiv2 Search

Showing 1–17 of 17 results

Dec 3, 2025 DINO-RotateMatch: A Rotation-Aware Deep Framework for Robust Image Matching in Large-Scale 3D Reconstruction
Nov 13, 2023 The Impact of Generative Artificial Intelligence on Market Equilibrium: Evidence from a Natural Experiment
May 23, 2024 Optimized Cost Per Click in Online Advertising: A Theoretical Analysis
Nov 22, 2024 Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Jul 17, 2024 LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models
Apr 28, 2025 GVPO: Group Variance Policy Optimization for Large Language Model Post-Training
Aug 2, 2025 RSPO: Risk-Seeking Policy Optimization for Pass@k and Max@k Metrics in Large Language Models
Nov 20, 2025 OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
Aug 6, 2024 LLaVA-OneVision: Easy Visual Task Transfer
Oct 17, 2024 MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures
Jun 24, 2024 Long Context Transfer from Language to Vision
Apr 30, 2026 Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
May 14, 2025 Streaming Multi-agent Pathfinding
Oct 15, 2025 UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning
Nov 25, 2025 LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
May 8, 2024 Robust Reward Placement under Uncertainty
May 6, 2024 WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning