Date / Name
Sep 5, 2023 / CodeApex: A Bilingual Programming Evaluation Benchmark for Large Language Models
Oct 13, 2025 / DebugTA: An LLM-Based Agent for Simplifying Debugging and Teaching in Programming Education
Jul 4, 2025 / CoreCodeBench: Decoupling Code Intelligence via Fine-Grained Repository-Level Tasks
Oct 28, 2025 / Automatically Benchmarking LLM Code Agents through Agent-Driven Annotation and Evaluation
Apr 7, 2025 / AdvKT: An Adversarial Multi-Step Training Framework for Knowledge Tracing
Oct 30, 2025 / CATArena: Evaluating Evolutionary Capabilities of Code Agents via Iterative Tournaments
Jul 1, 2024 / SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model
Nov 29, 2024 / Train Once for All: A Transitional Approach for Efficient Aspect Sentiment Triplet Extraction
Jun 17, 2022 / An F-shape Click Model for Information Retrieval on Multi-block Mobile Pages
May 3, 2024 / CodeGRAG: Bridging the Gap between Natural Language and Programming Language via Graphical Retrieval Augmented Generation
Dec 27, 2023 / Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges
May 14, 2025 / LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis