Showing 1–20 of 23 results
/ Date/ Name
May 26, 2019Robust Classification using Robust Feature AugmentationJun 2, 2025Act Only When It Pays: Efficient Reinforcement Learning for LLM Reasoning via Selective RolloutsOct 28, 2022Coverage-centric Coreset Selection for High Pruning RatesSep 20, 2017Smoke Screener or Straight Shooter: Detecting Elite Sybil Attacks in User-Review Social NetworksFeb 3, 2025Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at ScaleJul 17, 2020Understanding and Diagnosing Vulnerability under Adversarial AttacksJun 6, 2024ELFS: Label-Free Coreset Selection with Proxy Training DynamicsOct 11, 2023Leveraging Hierarchical Feature Sharing for Efficient Dataset CondensationMay 27, 2019Analyzing the Interpretability Robustness of Self-Explaining ModelsDec 27, 2019Efficient Adversarial Training with Transferable Adversarial ExamplesOct 1, 2025Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs?Feb 9, 2024Learn To be Efficient: Build Structured Sparsity in Large Language ModelsFeb 5, 2026Jackpot: Optimal Budgeted Rejection Sampling for Extreme Actor-Policy Mismatch Reinforcement LearningSep 30, 2025OPPO: Accelerating PPO-based RLHF via Pipeline OverlapOct 22, 2025RLBoost: Harvesting Preemptible Resources for Cost-Efficient Reinforcement Learning on LLMsJul 15, 2025Class-Proportional Coreset Selection for Difficulty-Separable DataOct 15, 2025When "Correct" Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents?Jun 1, 2023CALICO: Self-Supervised Camera-LiDAR Contrastive Pre-training for BEV PerceptionJun 5, 2025Kinetics: Rethinking Test-Time Scaling LawsFeb 19, 2024Plato: Plan to Efficiently Decode for Large Language Model Inference