"au:"Gokul Swamy"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Gokul Swamy"" — arXiv2 Search

Showing 1–20 of 29 results

/ Date/ Name

Aug 19, 2022Game-Theoretic Algorithms for Conditional Moment Matching May 30, 2022Minimax Optimal Online Imitation Learning via Replay Estimation Mar 26, 2023Inverse Reinforcement Learning without Reinforcement Learning Jan 26, 2026Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning Feb 2, 2022Causal Imitation Learning under Temporally Correlated Noise Mar 4, 2021Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap Aug 3, 2022Sequence Model Imitation Learning with Unobserved Contexts Sep 1, 2023Learning Shared Safety Constraints from Multi-task Demonstrations Feb 13, 2024Hybrid Inverse Reinforcement Learning Jan 26, 2025Your Learned Constraint is Secretly a Backward Reachable Tube Jun 24, 2018Generative Models for Pose Transfer Jan 4, 2019On the Utility of Model Learning in HRI Oct 5, 2021A Critique of Strictly Batch Imitation Learning Jan 8, 2024A Minimaximalist Approach to Reinforcement Learning from Human Feedback Mar 3, 2025All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning Sep 22, 2019Scaled Autonomy: Enabling Human Operators to Control Robot Fleets Jun 15, 2024EvIL: Evolution Strategies for Generalisable Imitation Learning Apr 25, 2024REBEL: Reinforcement Learning via Regressing Relative Rewards May 28, 2025Scaling Offline RL via Efficient and Expressive Shortcut Models Oct 6, 2024Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF