Showing 1–20 of 80 results
/ Date/ Name
Sep 1, 2017Mean Actor CriticJan 16, 2019ReNeg and Backseat Driver: Learning from Demonstration with Continuous Human FeedbackMay 9, 2012A Bayesian Sampling Approach to Exploration in Reinforcement LearningJan 10, 2013Graphical Models for Game TheoryJun 27, 2012An Efficient Optimal-Equilibrium Algorithm for Two-player Game TreesJun 13, 2012CORL: A Continuous-state Offset-dynamics Reinforcement LearnerJan 15, 2017Near Optimal Behavior via Approximate State AbstractionDec 16, 2016An Alternative Softmax Operator for Reinforcement LearningJul 10, 2024Mitigating Partial Observability in Sequential Decision Processes via the Lambda DiscrepancyJul 4, 2018Transfer with Model Features in Reinforcement LearningJul 15, 2011On the Computational Complexity of Stochastic Controller Optimization in POMDPsFeb 14, 2012Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree searchJan 10, 2005Combining Independent Modules in Lexical Multiple-Choice ProblemsJul 19, 2019Interactive Learning of Environment Dynamics for Sequential TasksJan 15, 2020Lipschitz Lifelong Reinforcement LearningApr 14, 2017Environment-Independent Task Specifications via GLTLAug 23, 2005Corpus-based Learning of Analogies and Semantic RelationsOct 27, 2022Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel ReportJun 27, 2012Incremental Model-based Learners With Formal Learning-Time GuaranteesSep 19, 2017Summable Reparameterizations of Wasserstein Critics in the One-Dimensional Setting