Date           Name
Apr 28, 2025   Prompt Injection Attack to Tool Selection in LLM Agents
Feb 21, 2023   BadGPT: Exploring Security Vulnerabilities of ChatGPT via Backdoor Attacks to InstructGPT
Mar 26, 2024   Optimization-based Prompt Injection Attack to LLM-as-a-Judge
Jul 1, 2024    Self-Cognition in Large Language Models: An Exploratory Study
Mar 20, 2025   BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models
Oct 4, 2023    MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
Jun 6, 2024    AutoJailbreak: Exploring Jailbreak Attacks and Defenses through a Dependency Lens
Apr 10, 2026   BadSkill: Backdoor Attacks on Agent Skills via Model-in-Skill Poisoning
Feb 20, 2025   On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective
Mar 8, 2025    Poisoned-MRAG: Knowledge Poisoning Attacks to Multimodal Retrieval Augmented Generation
May 29, 2025   Merge Hijacking: Backdoor Attacks to Model Merging of Large Language Models
May 23, 2025   SafeAgent: Safeguarding LLM Agents via an Automated Risk Simulator