Showing 1–20 of 25 results
/ Date/ Name
Jan 27, 2026Youtu-VL: Unleashing Visual Potential via Unified Vision-Language SupervisionDec 31, 2025Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language ModelsNov 4, 2025LTD-Bench: Evaluating Large Language Models by Letting Them DrawOct 21, 2025CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using AgentOct 18, 2025Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic RewardsOct 17, 2025FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-IdentificationAug 12, 2025ASPD: Unlocking Adaptive Serial-Parallel Decoding by Exploring Intrinsic Parallelism in LLMsJun 2, 2025Incentivizing Reasoning for Advanced Instruction-Following of Large Language ModelsFeb 7, 2025Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context AccuracyNov 1, 2024Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLMJun 16, 2024Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL DivergenceMar 10, 2024RESTORE: Towards Feature Shift for Vision-Language Prompt LearningFeb 19, 2024FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning SchemaOct 24, 2023Woodpecker: Hallucination Correction for Multimodal Large Language ModelsJun 23, 2023A Survey on Multimodal Large Language ModelsMar 14, 2023Co-Salient Object Detection with Co-Representation PurificationOct 18, 2021Mitigating Memorization of Noisy Labels via Regularization between RepresentationsMar 2, 2021Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial QueryJan 19, 2021An Empirical Study and Analysis on Open-Set Semi-Supervised LearningSep 30, 2020Pruning Filter in Filter