Showing 1–20 of 29 results
/ Date/ Name
Aug 19, 2022Game-Theoretic Algorithms for Conditional Moment MatchingMay 30, 2022Minimax Optimal Online Imitation Learning via Replay EstimationMar 26, 2023Inverse Reinforcement Learning without Reinforcement LearningJan 26, 2026Gained in Translation: Privileged Pairwise Judges Enhance Multilingual ReasoningFeb 2, 2022Causal Imitation Learning under Temporally Correlated NoiseMar 4, 2021Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation GapAug 3, 2022Sequence Model Imitation Learning with Unobserved ContextsSep 1, 2023Learning Shared Safety Constraints from Multi-task DemonstrationsFeb 13, 2024Hybrid Inverse Reinforcement LearningJan 26, 2025Your Learned Constraint is Secretly a Backward Reachable TubeJun 24, 2018Generative Models for Pose TransferJan 4, 2019On the Utility of Model Learning in HRIOct 5, 2021A Critique of Strictly Batch Imitation LearningJan 8, 2024A Minimaximalist Approach to Reinforcement Learning from Human FeedbackMar 3, 2025All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-TuningSep 22, 2019Scaled Autonomy: Enabling Human Operators to Control Robot FleetsJun 15, 2024EvIL: Evolution Strategies for Generalisable Imitation LearningApr 25, 2024REBEL: Reinforcement Learning via Regressing Relative RewardsMay 28, 2025Scaling Offline RL via Efficient and Expressive Shortcut ModelsOct 6, 2024Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF