"au:"Xing Sun"" — arXiv2 Search

Showing 1–20 of 25 results

Jan 27, 2026  Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision
Dec 31, 2025  Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
Nov 4, 2025  LTD-Bench: Evaluating Large Language Models by Letting Them Draw
Oct 21, 2025  CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent
Oct 18, 2025  Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic Rewards
Oct 17, 2025  FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification
Aug 12, 2025  ASPD: Unlocking Adaptive Serial-Parallel Decoding by Exploring Intrinsic Parallelism in LLMs
Jun 2, 2025  Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models
Feb 7, 2025  Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
Nov 1, 2024  Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
Jun 16, 2024  Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence
Mar 10, 2024  RESTORE: Towards Feature Shift for Vision-Language Prompt Learning
Feb 19, 2024  FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema
Oct 24, 2023  Woodpecker: Hallucination Correction for Multimodal Large Language Models
Jun 23, 2023  A Survey on Multimodal Large Language Models
Mar 14, 2023  Co-Salient Object Detection with Co-Representation Purification
Oct 18, 2021  Mitigating Memorization of Noisy Labels via Regularization between Representations
Mar 2, 2021  Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query
Jan 19, 2021  An Empirical Study and Analysis on Open-Set Semi-Supervised Learning
Sep 30, 2020  Pruning Filter in Filter