Showing 1–20 of 24 results
/ Date/ Name
Sep 18, 2025Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes VariationJun 20, 2024Tractable Equilibrium Computation in Markov Games through Risk AversionDec 2, 2021Sample Complexity of Robust Reinforcement Learning with a Generative ModelAug 10, 2022Robust Reinforcement Learning using Offline DataMay 23, 2025KL-regularization Itself is Differentially Private in Bandits and RLHFJun 12, 2025TARDIS STRIDE: A Spatio-Temporal Road Image Dataset and World Model for AutonomyFeb 4, 2025Robust LLM Alignment via Distributionally Robust Direct Preference OptimizationJan 27, 2026Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM ReasoningMay 25, 2025Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity GuaranteesMar 3, 2020Bounded Regret for Finitely Parameterized Multi-Armed BanditsOct 27, 2023Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data CoverageDec 18, 2021Off-Policy Evaluation Using Information Borrowing and Context-Based SwitchingFeb 11, 2026Distributionally Robust Cooperative Multi-Agent Reinforcement Learning via Robust Value FactorizationMar 5, 2023Improved Sample Complexity Bounds for Distributionally Robust Reinforcement LearningNov 28, 2022Personalized Reward Learning with Interaction-Grounded Learning (IGL)Jun 20, 2020Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance GuaranteesMay 8, 2024Model-Free Robust $φ$-Divergence Reinforcement Learning Using Both Offline and Online DataJun 22, 2024Distributionally Robust Constrained Reinforcement Learning under Strong DualityJun 26, 2025Risk-Averse Total-Reward Reinforcement LearningNov 6, 2024Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data