Showing 1–19 of 19 results
/ Date/ Name
Jan 27, 2026Youtu-VL: Unleashing Visual Potential via Unified Vision-Language SupervisionJan 8, 2026SRU-Pix2Pix: A Fusion-Driven Generator Network for Medical Image Translation with Few-Shot LearningDec 31, 2025Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language ModelsNov 24, 2025EEG-VLM: A Hierarchical Vision-Language Model with Multi-Level Feature Alignment and Visually Enhanced Language-Guided Reasoning for EEG Image-Based Sleep Stage PredictionOct 21, 2025CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using AgentOct 18, 2025Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic RewardsJun 25, 2025Visual-Semantic Knowledge Conflicts in Operating Rooms: Synthetic Data Curation for Surgical Risk Perception in Multimodal Large Language ModelsMar 11, 2025Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language AgentsFeb 17, 2025AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse VerificationFeb 12, 2025One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMsJan 26, 2025SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science DomainDec 25, 2024An Attentive Dual-Encoder Framework Leveraging Multimodal Visual and Semantic Information for Automatic OSAHS DiagnosisSep 23, 2024Adaptive Learning on User Segmentation: Universal to Specific Representation via Bipartite Neural InteractionSep 23, 2024FedSlate:A Federated Deep Reinforcement Learning Recommender SystemJul 17, 2024Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language ModelsJul 7, 2024MINDECHO: Role-Playing Language Agents for Key Opinion LeadersMar 22, 2024Subequivariant Reinforcement Learning Framework for Coordinated Motion ControlJan 29, 2022Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point ProcessesJun 16, 2020Model Embedding Model-Based Reinforcement Learning