Showing 1–20 of 24 results
/ Date/ Name
Jan 19, 2026A Benchmark for Language Models in Real-World System BuildingNov 2, 2025Can Language Models Go Beyond Coding? Assessing the Capability of Language Models to Build Real-World SystemsJan 12, 2025AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous CloudsSep 20, 2024CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation InformationJul 16, 2024Building AI Agents for Autonomous Clouds: Challenges and Design PrinciplesJul 9, 2024A Scenario-Oriented Benchmark for Assessing AIOps Algorithms in Microservice ManagementJun 27, 2024Failure Diagnosis in Microservice Systems: A Comprehensive Survey and AnalysisMay 24, 2024Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly DetectionFeb 8, 2024UFO: A UI-Focused Agent for Windows OS InteractionFeb 5, 2024Revisiting VAE for Unsupervised Time Series Anomaly Detection: A Frequency PerspectiveJan 24, 2024Automated Root Causing of Cloud Incidents using In-Context Learning with GPT-4Dec 19, 2023Xpert: Empowering Incident Management with Query Recommendations via Large Language ModelsNov 29, 2023TaskWeaver: A Code-First Agent FrameworkNov 7, 2023Everything of Thoughts: Defying the Law of Penrose Triangle for Thought GenerationOct 28, 2023TraceDiag: Adaptive, Interpretable, and Efficient Root Cause Analysis on Large-Scale Microservice SystemsOct 11, 2023OpsEval: A Comprehensive IT Operations Benchmark Suite for Large Language ModelsAug 1, 2023A Survey of Time Series Anomaly Detection Methods in the AIOps DomainJul 3, 2023ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly DetectionMay 29, 2023Assess and Summarize: Improve Outage Understanding with Large Language ModelsMay 25, 2023Automatic Root Cause Analysis via Large Language Models for Cloud Incidents