Showing 1–20 of 27 results
/ Date/ Name
Jan 7, 2023REaaS: Enabling Adversarially Robust Downstream Classifiers via Robust Encoder as a ServiceOct 3, 2022MultiGuard: Provably Robust Multi-label Classification against Adversarial ExamplesDec 6, 2022Pre-trained Encoders in Self-Supervised Learning Improve Secure and Privacy-preserving Supervised LearningMar 12, 2025Prompt Inversion Attack against Collaborative Inference of Large Language ModelsMay 25, 2022jTrans: Jump-Aware Transformer for Binary Code SimilarityAug 25, 2021EncoderMI: Membership Inference against Pre-trained Encoders in Contrastive LearningJan 30, 2024Provably Robust Multi-bit Watermarking for AI-generated TextMar 4, 2026Self-Sovereign AgentAug 29, 2025RepoMark: A Data-Usage Auditing Framework for Code Large Language ModelsMar 12, 2025Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion ModelsApr 22, 2025A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and DeploymentOct 3, 2025DMark: Order-Agnostic Watermarking for Diffusion Large Language ModelsFeb 27, 2025Towards Collaborative Anti-Money Laundering Among Financial InstitutionsMay 21, 2025Silent Leaks: Implicit Knowledge Extraction Attack on RAG Systems through Benign QueriesMar 9, 2025Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation VariationFeb 26, 2026IMMACULATE: A Practical LLM Auditing Framework via Verifiable ComputationMar 3, 2026V3DB: Audit-on-Demand Zero-Knowledge Proofs for Verifiable Vector Search over Committed SnapshotsApr 5, 2023A Certified Radius-Guided Attack Framework to Image Segmentation ModelsMay 26, 2025Efficient Reasoning via Chain of Unconscious ThoughtAug 9, 2025Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models