Showing 1–12 of 12 results
/ Date/ Name
Jun 21, 2021Emphatic Algorithms for Deep Reinforcement LearningMar 5, 2018Beyond Greedy Ranking: Slate Optimization via List-CVAEFeb 27, 2019Degenerate Feedback Loops in Recommender SystemsJul 12, 2021Learning Expected Emphatic Traces for Deep RLFeb 9, 2023Scaling Goal-based Exploration via Pruning Proto-goalsJul 28, 2019Wasserstein Fair ClassificationNov 8, 2019Reducing Sentiment Bias in Language Models via Counterfactual EvaluationSep 15, 2022Human-level Atari 200x fasterSep 17, 2025Discovery of Unstable SingularitiesAug 7, 2023AlphaStar Unplugged: Large-Scale Offline Reinforcement LearningFeb 7, 2020Causally Correct Partial Models for Reinforcement LearningJul 24, 2018Learning from Delayed Outcomes via Proxies with Applications to Recommender Systems