"au:"Georg Ostrovski"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Georg Ostrovski"" — arXiv2 Search

Showing 1–20 of 21 results

/ Date/ Name

Dec 15, 2011Fixed Point Theorem for Non-Self Maps of Regions in the Plane Jan 11, 2023An Analysis of Quantile Temporal-Difference Learning Oct 26, 2021The Difficulty of Passive Learning in Deep Reinforcement Learning Dec 14, 2019Adapting Behaviour for Learning Progress Nov 9, 2010Piecewise Linear Hamiltonian Flows Associated to Zero-Sum Games: Transition Combinatorics and Questions on Ergodicity Feb 25, 2021On The Effect of Auxiliary Tasks on Representation Dynamics Jun 2, 2020Temporally-Extended ε-Greedy Exploration Jun 1, 2022The Phenomenon of Policy Churn Aug 26, 2021When should agents explore?May 11, 2021Return-based Scaling: Yet Another Normalisation Trick for Deep RL Jun 14, 2018Autoregressive Quantile Networks for Generative Modeling May 18, 2013Dynamics of a Continuous Piecewise Affine Map of the Square Jun 14, 2018Implicit Quantile Networks for Distributional Reinforcement Learning Mar 3, 2017Count-Based Exploration with Neural Density Models Aug 19, 2013Payoff Performance of Fictitious Play Jul 5, 2022An Empirical Study of Implicit Regularization in Deep Offline RL Dec 15, 2015Increasing the Action Gap: New Operators for Reinforcement Learning May 24, 2023Deep Reinforcement Learning with Plasticity Injection Nov 14, 2017Symmetric Decomposition of Asymmetric Games Jun 6, 2016Unifying Count-Based Exploration and Intrinsic Motivation