Showing 1–20 of 22 results
/ Date/ Name
Oct 12, 2021Query-Reward Tradeoffs in Multi-Armed BanditsFeb 5, 2021Confidence-Budget Matching for Sequential Budgeted LearningAug 13, 2020Reinforcement Learning with Trajectory FeedbackMay 8, 2019Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit ProblemJun 17, 2024Improved Algorithms for Contextual Dynamic PricingJan 15, 2026Reinforcement Learning with Multi-Step Lookahead Information Via Adaptive BatchingFeb 13, 2020Tight Lower Bounds for Combinatorial Multi-Armed BanditsNov 11, 2025Online Linear Regression with Paid Stochastic FeaturesMar 18, 2024The Value of Reward Lookahead in Reinforcement LearningAug 10, 2020Lenient Regret for Multi-Armed BanditsJun 4, 2024Reinforcement Learning with Lookahead InformationOct 2, 2019Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement LearningSep 6, 2018Learn What Not to Learn: Action Elimination with Deep Reinforcement LearningMay 27, 2019Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy PoliciesMay 26, 2024On Bits and Bandits: Quantifying the Regret-Information Trade-offNov 5, 2024Stable Matching with Ties: Approximation Ratios and LearningFeb 4, 2023Reinforcement Learning with History-Dependent Dynamic ContextsMay 31, 2022On Preemption and Learning in Stochastic SchedulingMay 30, 2022Reinforcement Learning with a TerminatorMay 24, 2023Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics