Showing 1–20 of 21 results
/ Date/ Name
Jan 8, 2025Rising Rested MAB with Linear DriftSep 13, 2024Batch Ensemble for Variance Dependent Regret in Stochastic BanditsMar 2, 2023Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function ApproximationFeb 1, 2023Uniswap Liquidity Provision: An Online Learning ApproachSep 27, 2022Dueling Convex Optimization with General PreferencesMar 2, 2022Learning Efficiently Function Approximation for Contextual MDPFeb 23, 2022Finding Safe Zones of policies Markov Decision ProcessesJan 31, 2022Cooperative Online Learning in Stochastic and Adversarial MDPsDec 29, 2020Learning Adversarial Markov Decision Processes with Delayed FeedbackMay 28, 2019ROI Maximization in Stochastic Online Decision-MakingApr 7, 2019Competitive ratio versus regret minimization: achieving the best of both worldsFeb 17, 2019Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ RegretOct 24, 2018Optimal Algorithm for Bayesian Incentive-Compatible ExplorationOct 22, 2018Adversarial Online Learning with noiseJun 19, 2018Online Linear Quadratic ControlMay 12, 2018Fair Leader Election for Rational Agents in Asynchronous Rings and NetworksMay 7, 2018Planning and Learning with Stochastic Action SetsMay 24, 2016When should an expert make a prediction?Mar 21, 2016Online Learning with Low Rank ExpertsJan 14, 2015Classification with Low Rank and Missing Data