Date          Name
Jan 27, 2019  Deconstructing Generative Adversarial Networks
May 28, 2020  Robust estimation via generalized quasi-gradients
Sep 19, 2019  Generalized Resilience and Robust Statistics
May 24, 2022  Byzantine-Robust Federated Learning with Optimal Statistical Rates and Privacy Guarantees
Aug 9, 2018   Joint Transceiver Optimization for Wireless Communication PHY with Convolutional Neural Network
Jan 26, 2023  Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons
Nov 10, 2022  The Sample Complexity of Online Contract Design
May 19, 2023  Online Learning in a Creator Economy
Jun 21, 2023  On the Optimal Bounds for Noisy Computing
Jan 29, 2024  Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF
Jun 3, 2023   On Optimal Caching and Model Multiplexing for Large Model Inference
Jun 4, 2023   Fine-Tuning Language Models with Advantage-Induced Policy Alignment
Jun 1, 2023   Doubly Robust Self-Training
Feb 2, 2022   Robust Estimation for Nonparametric Families via Generative Adversarial Networks
Jan 21, 2020  When does the Tukey median work?
Feb 20, 2024  Generative AI Security: Challenges and Countermeasures
Jan 19, 2021  Minimax Off-Policy Evaluation for Multi-Armed Bandits
Sep 18, 2023  Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration
Dec 13, 2023  The Effective Horizon Explains Deep RL Performance in Stochastic Environments
Jun 17, 2024  From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline