Showing 1–20 of 28 results
/ Date/ Name
Sep 30, 2023From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction TuningJun 29, 2023Could Small Language Models Serve as Recommenders? Towards Data-centric Cold-start RecommendationsFeb 24, 2023NoPPA: Non-Parametric Pairwise Attention Random Walk Model for Sentence RepresentationOct 17, 2025Soundness-Aware Level: A Microscopic Signature that Predicts LLM Reasoning PotentialFeb 21, 2025Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse AutoencodersJul 4, 2024Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic ScoringFeb 19, 2025Self-Regularization with Sparse Autoencoders for Controllable LLM-based ClassificationMar 21, 2023Black-box Backdoor Defense via Zero-shot Image PurificationJan 20, 2023Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt Learning for Automatic Scoring in Science EducationMar 13, 2023A Survey of Graph Prompting Methods: Techniques, Applications, and ChallengesMar 28, 2024Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question AnsweringMar 13, 2024Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM EraNov 11, 2025Investigating CoT Monitorability in Large Reasoning ModelsFeb 11, 2026Less is Enough: Synthesizing Diverse Data in Feature Space of LLMsMay 31, 2025Concept-Centric Token Interpretation for Vector-Quantized Generative ModelsJan 7, 2024InFoBench: Evaluating Instruction Following Ability in Large Language ModelsMay 15, 2025Artificial Intelligence Bias on English Language Learners in Automatic ScoringMay 12, 2025Beyond Input Activations: Identifying Influential Latents by Gradient Sparse AutoencodersNov 30, 2023Applying Large Language Models and Chain-of-Thought for Automatic ScoringJun 24, 2025Is Long-to-Short a Free Lunch? Investigating Inconsistency and Reasoning Efficiency in LRMs