Date / Name
Feb 1, 2021 / Doubly robust Thompson sampling for linear payoffs
Jun 17, 2025 / Adaptive Data Augmentation for Thompson Sampling
Jan 24, 2026 / PatchIsland: Orchestration of LLM Agents for Continuous Vulnerability Repair
Sep 15, 2022 / Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits
Oct 23, 2023 / A Doubly Robust Approach to Sparse Reinforcement Learning
Jan 31, 2023 / Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback
Jun 11, 2022 / Squeeze All: Novel Estimator and Self-Normalized Bound for Linear Contextual Bandits
May 31, 2023 / Learning the Pareto Front Using Bootstrapped Observation Samples
Sep 10, 2025 / DiTTO-LLM: Framework for Discovering Topic-based Technology Opportunities via Large Language Model
Feb 10, 2025 / Linear Bandits with Partially Observable Features
Jun 5, 2020 / Principled learning method for Wasserstein distributionally robust optimization with local perturbations
Jan 28, 2019 / Principled analytic classifier for positive-unlabeled learning via weighted integral probability metric
Sep 29, 2025 / Takedown: How It's Done in Modern Coding Agent Exploits
Sep 18, 2025 / ATLANTIS: AI-driven Threat Localization, Analysis, and Triage Intelligence System