Date          Name
Sep 29, 2021  DeepAnalyze: Learning to Localize Crashes at Scale
May 26, 2022  AutoTSG: Learning and Synthesis for Incident Troubleshooting
Jul 16, 2024  Building AI Agents for Autonomous Clouds: Challenges and Design Principles
Jan 15, 2021  SoftNER: Mining Knowledge Graphs From Cloud Incidents
Dec 18, 2024  Syzygy: Dual Code-Test C to (safe) Rust Translation using LLMs and Dynamic Analysis
Dec 23, 2023  CodeScholar: Growing Idiomatic Code Examples
Jul 10, 2020  Neural Knowledge Extraction From Cloud Service Incidents
May 29, 2025  GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents
Jan 12, 2025  AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds
Apr 9, 2025   R2E-Gym: Procedural Environments and Hybrid Verifiers for Scaling Open-Weights SWE Agents
Jan 17, 2026  Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
Mar 28, 2025  Challenges and Paths Towards AI for Software Engineering
Dec 20, 2023  DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines