"au:"Ray Jiang"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Ray Jiang"" — arXiv2 Search

Showing 1–12 of 12 results

/ Date/ Name

Jun 21, 2021Emphatic Algorithms for Deep Reinforcement Learning Mar 5, 2018Beyond Greedy Ranking: Slate Optimization via List-CVAE Feb 27, 2019Degenerate Feedback Loops in Recommender Systems Jul 12, 2021Learning Expected Emphatic Traces for Deep RL Feb 9, 2023Scaling Goal-based Exploration via Pruning Proto-goals Jul 28, 2019Wasserstein Fair Classification Nov 8, 2019Reducing Sentiment Bias in Language Models via Counterfactual Evaluation Sep 15, 2022Human-level Atari 200x faster Sep 17, 2025Discovery of Unstable Singularities Aug 7, 2023AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning Feb 7, 2020Causally Correct Partial Models for Reinforcement Learning Jul 24, 2018Learning from Delayed Outcomes via Proxies with Applications to Recommender Systems