"au:"Banghua Zhu"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Banghua Zhu"" — arXiv2 Search

Showing 1–20 of 47 results

/ Date/ Name

Jan 27, 2019Deconstructing Generative Adversarial Networks May 28, 2020Robust estimation via generalized quasi-gradients Sep 19, 2019Generalized Resilience and Robust Statistics May 24, 2022Byzantine-Robust Federated Learning with Optimal Statistical Rates and Privacy Guarantees Aug 9, 2018Joint Transceiver Optimization for Wireless Communication PHY with Convolutional Neural Network Jan 26, 2023Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons Nov 10, 2022The Sample Complexity of Online Contract Design May 19, 2023Online Learning in a Creator Economy Jun 21, 2023On the Optimal Bounds for Noisy Computing Jan 29, 2024Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF Jun 3, 2023On Optimal Caching and Model Multiplexing for Large Model Inference Jun 4, 2023Fine-Tuning Language Models with Advantage-Induced Policy Alignment Jun 1, 2023Doubly Robust Self-Training Feb 2, 2022Robust Estimation for Nonparametric Families via Generative Adversarial Networks Jan 21, 2020When does the Tukey median work?Feb 20, 2024Generative AI Security: Challenges and Countermeasures Jan 19, 2021Minimax Off-Policy Evaluation for Multi-Armed Bandits Sep 18, 2023Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration Dec 13, 2023The Effective Horizon Explains Deep RL Performance in Stochastic Environments Jun 17, 2024From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline