Date / Name
Sep 26, 2025 / MTRec: Learning to Align with User Preferences via Mental Reward Models
Oct 16, 2025 / Model-agnostic Selective Labeling with Provable Statistical Guarantees
May 6, 2026 / Strat-Reasoner: Reinforcing Strategic Reasoning of LLMs in Multi-Agent Games
Sep 21, 2020 / Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
Aug 25, 2022 / The ReprGesture entry to the GENEA Challenge 2022
Oct 18, 2021 / Empirical Policy Optimization for $n$-Player Markov Games
Apr 1, 2017 / Vehicle Traffic Driven Camera Placement for Better Metropolis Security Surveillance
Aug 6, 2025 / GeRe: Towards Efficient Anti-Forgetting in Continual Learning of LLM via General Samples Replay
Oct 10, 2020 / Event-Triggered Multi-agent Reinforcement Learning with Communication under Limited-bandwidth Constraint
Aug 24, 2021 / CMML: Contextual Modulation Meta Learning for Cold-Start Recommendation
Dec 1, 2025 / ViRectify: A Challenging Benchmark for Video Reasoning Correction with Multimodal Large Language Models
Aug 14, 2021 / Modeling Scale-free Graphs with Hyperbolic Geometry for Knowledge-aware Recommendation
Dec 17, 2023 / A Unified Framework for Multi-Domain CTR Prediction via Large Language Models
Nov 10, 2025 / PADiff: Predictive and Adaptive Diffusion Policies for Ad Hoc Teamwork