Date          Name
Oct 6, 2023   Demystifying Embedding Spaces using Large Language Models
May 25, 2023  DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Feb 4, 2023   Reinforcement Learning with History-Dependent Dynamic Contexts
Feb 6, 2022   Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors
Feb 11, 2021  Meta-Thompson Sampling
Feb 8, 2020   BRPO: Batch Residual Policy Optimization
Nov 20, 2019  Gradient-based Optimization for Bayesian Preference Elicitation
Sep 26, 2019  CAQL: Continuous Action Q-Learning
Sep 11, 2019  RecSim: A Configurable Simulation Platform for Recommender Systems
Jun 21, 2019  Randomized Exploration in Generalized Linear Bandits
May 29, 2019  Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology
May 29, 2019  Advantage Amplification in Slowly Evolving Latent-State Environments
Mar 21, 2019  Perturbed-History Exploration in Stochastic Linear Bandits
Feb 26, 2019  Perturbed-History Exploration in Stochastic Multi-Armed Bandits
Oct 4, 2018   Seq2Slate: Re-ranking and Slate Optimization with RNNs
May 7, 2018   Planning and Learning with Stochastic Action Sets
Mar 6, 2013   The Probability of a Possibility: Adding Uncertainty to Default Rules
Jan 23, 2013  Continuous Value Function Approximation for Sequential Bidding Policies
Jul 11, 2012  Regret Minimizing Equilibria and Mechanisms for Games with Strict Type Uncertainty
Jun 13, 2011  Eliciting Forecasts from Self-interested Experts: Scoring Rules for Decision Makers