Showing 1–20 of 20 results
/ Date/ Name
Apr 1, 2026CliffSearch: Structured Agentic Co-Evolution over Theory and Code for Scientific Algorithm DiscoveryMay 28, 2025Revisiting Group Relative Policy Optimization: Insights into On-Policy and Off-Policy TrainingMar 9, 2025Reinforcement Learning with Verifiable Rewards: GRPO's Effective Loss, Dynamics, and Success AmplificationJun 9, 2024Distributional Preference Alignment of LLMs via Optimal TransportOct 11, 2023Risk Aware Benchmarking of Large Language ModelsJun 17, 2021Large-Scale Chemical Language Representations Capture Molecular Structure and PropertiesDec 21, 2020Image Captioning as an Assistive Technology: Lessons Learned from VizWiz 2020 ChallengeDec 21, 2020Alleviating Noisy Data in Image Captioning with Cooperative DistillationNov 4, 2020On the Convergence of Gradient Descent in GANs: MMD GAN As a Gradient FlowNov 3, 2020Tabular Transformers for Modeling Multivariate Time SeriesSep 29, 2020Unbalanced Sobolev DescentAug 24, 2020Active learning of deep surrogates for PDEs: Application to metasurface designOct 31, 2019Sobolev Independence CriterionMay 30, 2019Wasserstein Style TransferMay 16, 2018Regularized Finite Dimensional Kernel Sobolev DiscrepancyNov 14, 2017Sobolev GANMay 26, 2017Fisher GANFeb 27, 2017McGan: Mean and Covariance Feature Matching GANFeb 21, 2013q-ary Compressive SensingSep 6, 2012Multiclass Learning with Simplex Coding