arXiv2
"au:"Zhenlan Ji"" — arXiv2 Search
Showing 1–3 of 3 results
Mar 23, 2025
STShield: Single-Token Sentinel for Real-Time Jailbreak Detection in Large Language Models
Jun 8, 2024
SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner
Oct 10, 2023
Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach