"au:"Zongjie Li"" — arXiv2 Search

Showing 1–20 of 20 results

Apr 4, 2026  Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code's Auto Mode
Mar 31, 2026  SkillReducer: Optimizing LLM Agent Skills for Token Efficiency
Mar 22, 2026  WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making
Sep 6, 2025  Red-Teaming Coding Agents from a Tool-Invocation Perspective: An Empirical Security Assessment
Jun 20, 2025  Differentiation-Based Extraction of Proprietary Data from Fine-Tuned LLMs
Jun 11, 2025  Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models
Mar 23, 2025  STShield: Single-Token Sentinel for Real-Time Jailbreak Detection in Large Language Models
Aug 15, 2024  API-guided Dataset Synthesis to Finetune Large Code Models
Jun 8, 2024  SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner
May 8, 2024  SPVR: syntax-to-prompt vulnerability repair based on large language models
Jan 27, 2024  An Empirical Study on Large Language Models in Accuracy and Robustness under Chinese Industrial Scenarios
Dec 7, 2023  VRPTEST: Evaluating Visual Referring Prompting in Large Multimodal Models
Oct 10, 2023  Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach
Oct 10, 2023  Refining Decompiled C Code with Large Language Models
Sep 29, 2023  Split and Merge: Aligning Position Biases in LLM-based Evaluators
May 4, 2023  "Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process
Mar 6, 2023  On Extracting Specialized Code Abilities from Large Language Models: A Feasibility Study
Aug 17, 2022  CCTEST: Testing and Repairing Code Completion Systems
Apr 20, 2022  Unleashing the Power of Compiler Intermediate Representation to Enhance Neural Program Embeddings
Dec 2, 2020  CRaDLe: Deep Code Retrieval Based on Semantic Dependency Learning