Showing 1–20 of 56 results
/ Date/ Name
Jul 4, 2018Transfer with Model Features in Reinforcement LearningJul 15, 2011On the Computational Complexity of Stochastic Controller Optimization in POMDPsNov 1, 2021On the Expressivity of Markov RewardDec 8, 2019Individual predictions matter: Assessing the effect of data ordering in training fine-tuned CNNs for medical imagingFeb 13, 2020The Efficiency of Human Cognition Reflects Planned Information ProcessingSep 15, 2021Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage FeedbackFeb 14, 2012Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree searchJan 10, 2005Combining Independent Modules in Lexical Multiple-Choice ProblemsJun 10, 2021Brittle AI, Causal Confusion, and Bad Mental Models: Challenges and Successes in the XAI ProgramDec 7, 2022Tiered Reward: Designing Rewards for Specification and Fast Learning of Desired BehaviorMay 14, 2021People construct simplified mental representations to planAug 23, 2019Stackelberg Punishment and Bully-Proofing Autonomous VehiclesNov 7, 2022Reward-Predictive ClusteringMay 9, 2012A Bayesian Sampling Approach to Exploration in Reinforcement LearningJan 10, 2013Graphical Models for Game TheoryJun 27, 2012An Efficient Optimal-Equilibrium Algorithm for Two-player Game TreesJun 13, 2012CORL: A Continuous-state Offset-dynamics Reinforcement LearnerJan 15, 2017Near Optimal Behavior via Approximate State AbstractionDec 16, 2016An Alternative Softmax Operator for Reinforcement LearningJul 3, 2024Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages