"au:"Julian Zimmert"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Julian Zimmert"" — arXiv2 Search

Showing 1–20 of 35 results

/ Date/ Name

Jan 3, 2024Optimal cross-learning for contextual bandits with unknown context distributions Nov 11, 2024Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit Feedback Oct 6, 2021Efficient Methods for Online Multiclass Logistic Regression Jul 4, 2018Factored Bandits Oct 14, 2019An Optimal Algorithm for Adversarial Bandits with Arbitrary Delays Aug 23, 2022A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning Feb 6, 2022Pushing the Efficiency-Regret Pareto Frontier for Online Learning of Portfolios and Quantum States May 28, 2019Connections Between Mirror Descent, Thompson Sampling and the Information Ratio Oct 7, 2021A Model Selection Approach for Corruption Robust Reinforcement Learning Jun 3, 2025Non-stationary Bandit Convex Optimization: A Comprehensive Study Oct 17, 2022A Unified Algorithm for Stochastic Path Problems Feb 20, 2023A Blackbox Approach to Best of Both Worlds in Bandits and Beyond Feb 18, 2023Best of Both Worlds Policy Optimization Jul 19, 2018Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits Jul 12, 2021Adapting to Misspecification in Contextual Bandits Feb 4, 2025A Scalable Crawling Algorithm Utilizing Noisy Change-Indicating Signals Dec 10, 2025Contextual Dynamic Pricing with Heterogeneous Buyers Jan 25, 2019Beating Stochastic and Adversarial Semi-bandits Optimally and Simultaneously Oct 25, 2021The Pareto Frontier of model selection for general Contextual Bandits May 10, 2024Incentive-compatible Bandits: Importance Weighting No More