Showing 1–13 of 13 results
/ Date/ Name
Feb 19, 2026Computer-Using World ModelJan 19, 2026A Benchmark for Language Models in Real-World System BuildingNov 2, 2025Can Language Models Go Beyond Coding? Assessing the Capability of Language Models to Build Real-World SystemsMay 24, 2024Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly DetectionMar 18, 2024QueryAgent: A Reliable and Efficient Reasoning Framework with Environmental Feedback-based Self-CorrectionFeb 8, 2024UFO: A UI-Focused Agent for Windows OS InteractionDec 19, 2023Xpert: Empowering Incident Management with Query Recommendations via Large Language ModelsNov 29, 2023TaskWeaver: A Code-First Agent FrameworkNov 7, 2023Everything of Thoughts: Defying the Law of Penrose Triangle for Thought GenerationOct 28, 2023TraceDiag: Adaptive, Interpretable, and Efficient Root Cause Analysis on Large-Scale Microservice SystemsJul 3, 2023ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly DetectionMay 29, 2023Assess and Summarize: Improve Outage Understanding with Large Language ModelsMay 25, 2023Automatic Root Cause Analysis via Large Language Models for Cloud Incidents