Date / Name
Dec 15, 2011 / Fixed Point Theorem for Non-Self Maps of Regions in the Plane
Jan 11, 2023 / An Analysis of Quantile Temporal-Difference Learning
Oct 26, 2021 / The Difficulty of Passive Learning in Deep Reinforcement Learning
Dec 14, 2019 / Adapting Behaviour for Learning Progress
Nov 9, 2010 / Piecewise Linear Hamiltonian Flows Associated to Zero-Sum Games: Transition Combinatorics and Questions on Ergodicity
Feb 25, 2021 / On The Effect of Auxiliary Tasks on Representation Dynamics
Jun 2, 2020 / Temporally-Extended ε-Greedy Exploration
Jun 1, 2022 / The Phenomenon of Policy Churn
Aug 26, 2021 / When should agents explore?
May 11, 2021 / Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Jun 14, 2018 / Autoregressive Quantile Networks for Generative Modeling
May 18, 2013 / Dynamics of a Continuous Piecewise Affine Map of the Square
Jun 14, 2018 / Implicit Quantile Networks for Distributional Reinforcement Learning
Mar 3, 2017 / Count-Based Exploration with Neural Density Models
Aug 19, 2013 / Payoff Performance of Fictitious Play
Jul 5, 2022 / An Empirical Study of Implicit Regularization in Deep Offline RL
Dec 15, 2015 / Increasing the Action Gap: New Operators for Reinforcement Learning
May 24, 2023 / Deep Reinforcement Learning with Plasticity Injection
Nov 14, 2017 / Symmetric Decomposition of Asymmetric Games
Jun 6, 2016 / Unifying Count-Based Exploration and Intrinsic Motivation