Showing 1–20 of 33 results
/ Date/ Name
Apr 12, 2019Distributed Bandit Learning: Near-Optimal Regret with Efficient CommunicationJun 25, 2023Is RLHF More Difficult than Standard RL?Jan 11, 201816-qubit IBM universal quantum computer can be fully entangledOct 20, 2022Learning Rationalizable Equilibria in Multiplayer GamesMar 23, 2021An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality GapMar 11, 2025GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and EditingJun 11, 2020Improved Algorithms for Convex-Concave Minimax OptimizationAug 21, 2020Refined Analysis of FPL for Adversarial Markov Decision ProcessesJul 14, 2025Tie-breaking Agnostic Lower Bound for Fictitious PlayMay 16, 2025Diff-Unfolding: A Model-Based Score Learning Framework for Inverse ProblemsOct 16, 2019On Solving Minimax Optimization Locally: A Follow-the-Ridge ApproachAug 17, 2020On the Suboptimality of Negative Momentum for Minimax OptimizationFeb 28, 2022Neural Adaptive SCEne TracingOct 2, 2024FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing ImagesFeb 13, 2023Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function ApproximationFeb 4, 2022NeAT: Neural Adaptive TomographyMar 14, 2022Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental LimitsOct 28, 2020Online Learning in Unknown Markov GamesFeb 26, 2018A Low-latency Pipeline for GRB Light Curve and Spectrum using Fermi/GBM Near Real-time DataOct 24, 2017In-flight energy calibration of the space-borne Compton polarimeter POLAR