Showing 1–20 of 23 results
/ Date/ Name
Mar 7, 2016Learning Shared Representations in Multi-task Reinforcement LearningJun 9, 2015The Wreath Process: A totally generative model of geometric shape based on nested symmetriesDec 18, 2018Universal Successor Features ApproximatorsApr 25, 2019Ray Interference: a Source of Plateaus in Deep Reinforcement LearningJun 17, 2022Generalised Policy Improvement with Geometric Policy CompositionAug 26, 2021When should agents explore?May 11, 2021Return-based Scaling: Yet Another Normalisation Trick for Deep RLJul 3, 2020Expected Eligibility TracesJun 24, 2021The Option Keyboard: Combining Skills in Reinforcement LearningMay 1, 2025Wasserstein Policy OptimizationJan 22, 2025Optimizing Return Distributions with Distributional Dynamic ProgrammingDec 14, 2019Adapting Behaviour for Learning ProgressFeb 20, 2022Selective Credit AssignmentFeb 26, 2019The Termination CriticOct 16, 2019Conditional Importance Sampling for Off-Policy LearningJul 8, 2019General non-linear Bellman equationsSep 7, 2023A State Representation for Diminishing RewardsJan 30, 2019Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy ImprovementJun 20, 2017Observational Learning by Reinforcement LearningDec 8, 2021Model-Value Inconsistency as a Signal for Epistemic Uncertainty