"au:"Michael Littman"" — arXiv2 Search

Showing 1–20 of 80 results

Date / Name

Sep 1, 2017 / Mean Actor Critic
Jan 16, 2019 / ReNeg and Backseat Driver: Learning from Demonstration with Continuous Human Feedback
May 9, 2012 / A Bayesian Sampling Approach to Exploration in Reinforcement Learning
Jan 10, 2013 / Graphical Models for Game Theory
Jun 27, 2012 / An Efficient Optimal-Equilibrium Algorithm for Two-player Game Trees
Jun 13, 2012 / CORL: A Continuous-state Offset-dynamics Reinforcement Learner
Jan 15, 2017 / Near Optimal Behavior via Approximate State Abstraction
Dec 16, 2016 / An Alternative Softmax Operator for Reinforcement Learning
Jul 10, 2024 / Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
Jul 4, 2018 / Transfer with Model Features in Reinforcement Learning
Jul 15, 2011 / On the Computational Complexity of Stochastic Controller Optimization in POMDPs
Feb 14, 2012 / Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search
Jan 10, 2005 / Combining Independent Modules in Lexical Multiple-Choice Problems
Jul 19, 2019 / Interactive Learning of Environment Dynamics for Sequential Tasks
Jan 15, 2020 / Lipschitz Lifelong Reinforcement Learning
Apr 14, 2017 / Environment-Independent Task Specifications via GLTL
Aug 23, 2005 / Corpus-based Learning of Analogies and Semantic Relations
Oct 27, 2022 / Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report
Jun 27, 2012 / Incremental Model-based Learners With Formal Learning-Time Guarantees
Sep 19, 2017 / Summable Reparameterizations of Wasserstein Critics in the One-Dimensional Setting