Date          Name
Aug 26, 2023  Transaction fee mechanism for Proof-of-Stake protocol
Jun 21, 2022  Polynomial Voting Rules
Jan 5, 2006   Heavy-Traffic Optimality of a Stochastic Network under Utility-Maximizing Resource Control
Feb 3, 2025   Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
May 30, 2023  Policy Optimization for Continuous Reinforcement Learning
Oct 12, 2025  Understanding Sampler Stochasticity in Training Diffusion Models for RLHF
Sep 4, 2025   Diffusion Generative Models Meet Compressed Sensing, with Applications to Imaging and Finance
Oct 5, 2024   RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Mar 13, 2025  RPO: Fine-Tuning Visual Generative Models via Rich Vision-Language Preferences
Sep 12, 2024  Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning
Sep 17, 2024  Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Jul 26, 2022  Trading under the Proof-of-Stake Protocol -- a Continuous-Time Control Approach
Jan 28, 2015  Matching Supply and Demand in Production-Inventory Systems: Asymptotics and Optimization