Showing 1–17 of 17 results
/ Date/ Name
Apr 20, 2026Beyond Indistinguishability: Measuring Extraction Risk in LLM APIsJan 23, 2026White-Box Sensitivity Auditing with Steering VectorsApr 23, 2025Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" ControlOct 24, 2023SoK: Memorization in General-Purpose Large Language ModelsMar 21, 2023Manipulating Transfer Learning for Property InferenceDec 21, 2022SoK: Let the Privacy Games Begin! A Unified Treatment of Data Inference Privacy in Machine LearningJan 19, 2022Cold Atoms in Space: Community Workshop Summary and Proposed Road-MapSep 13, 2021Formalizing and Estimating Distribution Inference RisksJun 7, 2021Formalizing Distribution Inference RisksMar 24, 2021Improved Estimation of Concentration Under $\ell_p$-Norm Distance Metrics Using Half SpacesJun 30, 2020Model-Targeted Poisoning Attacks with Provable ConvergenceApr 21, 2020Certifying Joint Adversarial Robustness for Model EnsemblesMar 1, 2020Understanding the Intrinsic Robustness of Image Distributions using Conditional Generative ModelsMay 29, 2019Empirically Measuring Concentration: Fundamental Limits on Intrinsic RobustnessJan 28, 2019Context-aware Monitoring in Robotic SurgeryJun 15, 2017Horcrux: A Password Manager for ParanoidsJun 11, 2017Decentralized Certificate Authorities