Showing 81–100 of 169 results
/ Date/ Name
May 29, 2025GSO: Challenging Software Optimization Tasks for Evaluating SWE-AgentsMay 28, 2025Jailbreak Distillation: Renewable Safety BenchmarkingApr 27, 2025Critical Considerations on Effort-aware Software Defect Prediction MetricsApr 24, 2025Detection, Classification and Prevalence of Self-Admitted Aging DebtApr 12, 2025SmartShift: A Secure and Efficient Approach to Smart Contract MigrationApr 9, 2025R2E-Gym: Procedural Environments and Hybrid Verifiers for Scaling Open-Weights SWE AgentsMar 28, 2025Challenges and Paths Towards AI for Software EngineeringMar 24, 2025What is Business Process Automation Anyway?Feb 19, 2025Where's the Bug? Attention Probing for Scalable Fault LocalizationFeb 12, 2025Flow-of-Action: SOP Enhanced LLM-Based Multi-Agent System for Root Cause AnalysisJan 12, 2025AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous CloudsDec 18, 2024Syzygy: Dual Code-Test C to (safe) Rust Translation using LLMs and Dynamic AnalysisNov 5, 2024GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation ModelsSep 24, 2024Preference-Guided Refactored Tuning for Retrieval Augmented Code GenerationAug 15, 2024API-guided Dataset Synthesis to Finetune Large Code ModelsJul 26, 2024Optimizing Checkpoint-Restart Mechanisms for HPC with DMTCP in Containers at NERSCJul 16, 2024Building AI Agents for Autonomous Clouds: Challenges and Design PrinciplesJul 2, 2024Mining Constraints from Reference Process Models for Detecting Best-Practice Violations in Event LogsJun 27, 2024Failure Diagnosis in Microservice Systems: A Comprehensive Survey and AnalysisJun 20, 2024CREF: An LLM-based Conversational Software Repair Framework for Programming Tutors