Showing 1–20 of 20 results
/ Date/ Name
Jan 20, 2023Generative Slate Recommendation with Reinforcement LearningApr 19, 2023An Offline Metric for the Debiasedness of Click ModelsJan 3, 2023Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some AlternativesFeb 7, 2022Introducing the Expohedron for Efficient Pareto-optimal Fairness-Utility Amortizations in Repeated RankingsMay 3, 2021SmoothI: Smooth Rank Indicators for Differentiable IR MetricsDec 27, 2017Active Search for High Recall: a Non-Stationary Extension of Thompson SamplingAug 31, 2020Interactive and Explainable Point-of-Interest Recommendation using Look-alike GroupsMay 5, 2016LSTM-based Mixture-of-Experts for Knowledge-Aware DialoguesJul 18, 2016Joint Event Detection and Entity Resolution: a Virtuous CycleApr 3, 2024Unbiased Learning to Rank Meets Reality: Lessons from Baidu's Large-Scale Search DatasetFeb 3, 2020Modeling ASR Ambiguity for Dialogue State Tracking Using Word Confusion NetworksMay 26, 2023Distributional Reinforcement Learning with Dual Expectile-Quantile RegressionFeb 27, 2026Robust Skills, Brittle Grounding: Diagnosing Restricted Generalization in Vision-Language Action Policies via Multi-Object PickingMay 12, 2026Behavioral Mode Discovery for Fine-tuning Multimodal Generative PoliciesNov 28, 2023SARDINE: A Simulator for Automated Recommendation in Dynamic and Interactive EnvironmentsFeb 1, 2024SLIM: Skill Learning with Multiple CriticsMar 14, 2025Disentangled Object-Centric Image Representation for Robotic ManipulationJun 12, 2020Real-Time Optimization Of Web Publisher RTB RevenuesMay 16, 2022Pareto-Optimal Fairness-Utility Amortizations in Rankings with a DBN Exposure ModelMar 16, 2017Efficient Online Learning for Optimizing Value of Information: Theory and Application to Interactive Troubleshooting