Date  Name
Dec 27, 2022  Learning Individual Policies in Large Multi-agent Systems through Local Variance Minimization
Jan 27, 2023  Solving Richly Constrained Reinforcement Learning through State Augmentation and Reward Penalties
Nov 20, 2019  Solving Online Threat Screening Games using Constrained Action Space Reinforcement Learning
Mar 27, 2018  Entropy based Independent Learning in Anonymous Multi-Agent Settings
Jul 13, 2024  Preserving the Privacy of Reward Functions in MDPs through Deception
Dec 16, 2023  Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning
Feb 8, 2025  Improving Environment Novelty Quantification for Effective Unsupervised Environment Design
Jun 14, 2024  Bootstrapping Language Models with DPO Implicit Rewards
Feb 10, 2026  Efficient Unsupervised Environment Design through Hierarchical Policy Representation Learning
Oct 1, 2025  On Discovering Algorithms for Adversarial Imitation Learning
Sep 30, 2023  Enhancing the Hierarchical Environment Design via Generative Trajectory Modeling
Feb 4, 2023  Diversity Induced Environment Design via Self-Play
Feb 21, 2023  Future Aware Pricing and Matching for Sustainable On-demand Ride Pooling
Feb 21, 2023  Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning
Sep 13, 2020  Zone pAth Construction (ZAC) based Approaches for Effective Real-Time Ridesharing
Sep 16, 2021  Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-Profits in Improving Maternal and Child Health
Dec 1, 2021  Conditional Expectation based Value Decomposition for Scalable On-Demand Ride Pooling
Dec 7, 2024  Semantic Loss Guided Data Efficient Supervised Fine Tuning for Safe Responses in LLMs
Oct 10, 2024  UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations
Feb 20, 2024  SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning