"au:"Michael L. Littman"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Michael L. Littman"" — arXiv2 Search

Showing 1–20 of 56 results

/ Date/ Name

Jul 4, 2018Transfer with Model Features in Reinforcement Learning Jul 15, 2011On the Computational Complexity of Stochastic Controller Optimization in POMDPs Nov 1, 2021On the Expressivity of Markov Reward Dec 8, 2019Individual predictions matter: Assessing the effect of data ordering in training fine-tuned CNNs for medical imaging Feb 13, 2020The Efficiency of Human Cognition Reflects Planned Information Processing Sep 15, 2021Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback Feb 14, 2012Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search Jan 10, 2005Combining Independent Modules in Lexical Multiple-Choice Problems Jun 10, 2021Brittle AI, Causal Confusion, and Bad Mental Models: Challenges and Successes in the XAI Program Dec 7, 2022Tiered Reward: Designing Rewards for Specification and Fast Learning of Desired Behavior May 14, 2021People construct simplified mental representations to plan Aug 23, 2019Stackelberg Punishment and Bully-Proofing Autonomous Vehicles Nov 7, 2022Reward-Predictive Clustering May 9, 2012A Bayesian Sampling Approach to Exploration in Reinforcement Learning Jan 10, 2013Graphical Models for Game Theory Jun 27, 2012An Efficient Optimal-Equilibrium Algorithm for Two-player Game Trees Jun 13, 2012CORL: A Continuous-state Offset-dynamics Reinforcement Learner Jan 15, 2017Near Optimal Behavior via Approximate State Abstraction Dec 16, 2016An Alternative Softmax Operator for Reinforcement Learning Jul 3, 2024Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages