Showing 1–20 of 42 results
/ Date/ Name
Dec 18, 2023Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real RobotsJun 14, 2018Maximum a Posteriori Policy OptimisationJun 15, 2021On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and FinetuningDec 5, 2018Relative Entropy Regularized Policy IterationOct 5, 2024Learning from negative feedback, or positive feedback or bothFeb 19, 2020Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement LearningJun 18, 2019Robust Reinforcement Learning for Continuous Control with Model MisspecificationOct 7, 2021Evaluating model-based planning and planner amortization for continuous controlJun 1, 2020Acme: A Research Framework for Distributed Reinforcement LearningMay 6, 2022How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic ManipulationSep 5, 2024Game On: Towards Language Models as RL ExperimentersJul 7, 2025Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic CapabilitiesApr 12, 2022Forgetting and Imbalance in Robot Lifelong Learning with Off-policy DataOct 29, 2020"What, not how": Solving an under-actuated insertion task from scratchJun 26, 2019Compositional Transfer in Hierarchical Reinforcement LearningOct 1, 2019Augmenting learning using symmetry in a biologically-inspired domainMay 25, 2021From Motor Control to Team Play in Simulated Humanoid FootballJan 2, 2018DeepMind Control SuiteJul 9, 2025Value from Observations: Towards Large-Scale Imitation Learning via Self-ImprovementNov 24, 2022SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration