Showing 1–10 of 10 results
/ Date/ Name
Jan 3, 2023Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some AlternativesAug 6, 2025FaST: Feature-aware Sampling and Tuning for Personalized Preference Alignment with Limited DataNov 28, 2023SARDINE: A Simulator for Automated Recommendation in Dynamic and Interactive EnvironmentsMay 3, 2021SmoothI: Smooth Rank Indicators for Differentiable IR MetricsJun 26, 2018Deep $k$-Means: Jointly clustering with $k$-Means and learning representationsJan 20, 2023Generative Slate Recommendation with Reinforcement LearningMar 29, 2024ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language ModelsOct 9, 2024Guaranteed Generation from Large Language ModelsSep 17, 2025Findings of the Third Automatic Minuting (AutoMin) ChallengeFeb 20, 2025Drift: Decoding-time Personalized Alignments with Implicit User Preferences