"au:"Cassidy Laidlaw"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Cassidy Laidlaw"" — arXiv2 Search

Showing 1–13 of 13 results

/ Date/ Name

Nov 25, 2019Playing it Safe: Adversarial Robustness with an Abstain Option Apr 22, 2022The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models Dec 13, 2023The Effective Horizon Explains Deep RL Performance in Stochastic Environments Mar 5, 2024Correlated Proxies: A New Definition and Improved Mitigation for Reward Hacking May 29, 2019Functional Adversarial Attacks Jun 19, 2021Uncertain Decisions Facilitate Better Preference Learning Dec 13, 2023Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF Apr 19, 2023Bridging RL Theory and Practice with the Effective Horizon Jun 22, 2020Perceptual Adversarial Robustness: Defense Against Unseen Threat Models Apr 9, 2025AssistanceZero: Scalably Solving Assistance Games Dec 15, 2023Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping May 8, 2019Capture, Learning, and Synthesis of 3D Speaking Styles Jan 14, 2025Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision