Date  Name
Dec 15, 2023  Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models
Jun 19, 2024  LIVE: Learnable In-Context Vector for Visual Question Answering
Mar 10, 2025  LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
Sep 20, 2024  First Place Solution to the Multiple-choice Video QA Track of The Second Perception Test Challenge
Apr 11, 2025  Mimic In-Context Learning for Multimodal Tasks
Aug 7, 2025  On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
Oct 31, 2024  Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks
Jul 13, 2024  ICCV23 Visual-Dialog Emotion Explanation Challenge: SEU_309 Team Technical Report
Jul 17, 2025  DeQA-Doc: Adapting DeQA-Score to Document Image Quality Assessment
Jul 11, 2025  L-CLIPScore: a Lightweight Embedding-based Captioning Metric for Evaluating and Training