"au:"Xuansheng Wu"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Xuansheng Wu"" — arXiv2 Search

Showing 1–20 of 28 results

/ Date/ Name

Sep 30, 2023From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning Jun 29, 2023Could Small Language Models Serve as Recommenders? Towards Data-centric Cold-start Recommendations Feb 24, 2023NoPPA: Non-Parametric Pairwise Attention Random Walk Model for Sentence Representation Oct 17, 2025Soundness-Aware Level: A Microscopic Signature that Predicts LLM Reasoning Potential Feb 21, 2025Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders Jul 4, 2024Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring Feb 19, 2025Self-Regularization with Sparse Autoencoders for Controllable LLM-based Classification Mar 21, 2023Black-box Backdoor Defense via Zero-shot Image Purification Jan 20, 2023Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt Learning for Automatic Scoring in Science Education Mar 13, 2023A Survey of Graph Prompting Methods: Techniques, Applications, and Challenges Mar 28, 2024Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering Mar 13, 2024Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era Nov 11, 2025Investigating CoT Monitorability in Large Reasoning Models Feb 11, 2026Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs May 31, 2025Concept-Centric Token Interpretation for Vector-Quantized Generative Models Jan 7, 2024InFoBench: Evaluating Instruction Following Ability in Large Language Models May 15, 2025Artificial Intelligence Bias on English Language Learners in Automatic Scoring May 12, 2025Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders Nov 30, 2023Applying Large Language Models and Chain-of-Thought for Automatic Scoring Jun 24, 2025Is Long-to-Short a Free Lunch? Investigating Inconsistency and Reasoning Efficiency in LRMs