Showing 21–35 of 35 results
/ Date/ Name
Oct 24, 2025Enhancing Tactile-based Reinforcement Learning for Robotic ControlFeb 17, 2026Fairness over Equality: Correcting Social Incentives in Asymmetric Sequential Social DilemmasMay 23, 2024Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human InputFeb 24, 2026Probing Dec-POMDP Reasoning in Cooperative MARLMar 2, 2019Discovering Options for Exploration by Minimizing Cover TimeFeb 8, 2020Learning State Abstractions for Transfer in Continuous ControlMar 8, 2025Studying the Interplay Between the Actor and Critic Representations in Reinforcement LearningJan 22, 2025Optimizing Return Distributions with Distributional Dynamic ProgrammingJun 9, 2025Memory Allocation in Resource-Constrained Reinforcement LearningJul 24, 2025Remembering the Markov Property in Cooperative MARLNov 6, 2025Forgetting is EverywhereDec 31, 2025Inter-Agent Relative Representations for Multi-Agent Option DiscoverySep 13, 2022Meta-Gradients in Non-Stationary EnvironmentsFeb 13, 2020The Efficiency of Human Cognition Reflects Planned Information ProcessingFeb 27, 2021Revisiting Peng's Q($λ$) for Modern Reinforcement Learning