Showing 1–17 of 17 results
/ Date/ Name
Dec 3, 2025DINO-RotateMatch: A Rotation-Aware Deep Framework for Robust Image Matching in Large-Scale 3D ReconstructionNov 13, 2023The Impact of Generative Artificial Intelligence on Market Equilibrium: Evidence from a Natural ExperimentMay 23, 2024Optimized Cost Per Click in Online Advertising: A Theoretical AnalysisNov 22, 2024Large Multi-modal Models Can Interpret Features in Large Multi-modal ModelsJul 17, 2024LMMs-Eval: Reality Check on the Evaluation of Large Multimodal ModelsApr 28, 2025GVPO: Group Variance Policy Optimization for Large Language Model Post-TrainingAug 2, 2025RSPO: Risk-Seeking Policy Optimization for Pass@k and Max@k Metrics in Large Language ModelsNov 20, 2025OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General RecipeAug 6, 2024LLaVA-OneVision: Easy Visual Task TransferOct 17, 2024MixEval-X: Any-to-Any Evaluations from Real-World Data MixturesJun 24, 2024Long Context Transfer from Language to VisionApr 30, 2026Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World ModelingMay 14, 2025Streaming Multi-agent PathfindingOct 15, 2025UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding LearningNov 25, 2025LongVT: Incentivizing "Thinking with Long Videos" via Native Tool CallingMay 8, 2024Robust Reward Placement under UncertaintyMay 6, 2024WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning