Showing 1–20 of 29 results
/ Date/ Name
Mar 6, 2020Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud ProviderJul 12, 2016Compiling Stateful Network Properties for Runtime VerificationJun 17, 2019Sample-Efficient Neural Architecture Search by Learning Action SpaceJul 23, 2018Scanning the Internet for ROS: A View of Security in Robotics ResearchNov 21, 2018SuperNeurons: FFT-based Gradient Sparsification in the Distributed Training of Deep Neural NetworksJan 30, 2018FITing-Tree: A Data-aware Index StructureApr 13, 2026Nanvix: A Multikernel OS Design for High-Density Serverless DeploymentsSep 11, 2023PACE-LM: Prompting and Augmentation for Calibrated Confidence Estimation with GPT-4 in Cloud Incident Root Cause AnalysisMar 7, 2024Exploring LLM-based Agents for Root Cause AnalysisFeb 9, 2025Intent-based System Design and OperationMay 18, 2018Neural Architecture Search using Deep Neural Networks and Monte Carlo Tree SearchJan 28, 2025Towards Resource-Efficient Compound AI SystemsFeb 2, 2025ModServe: Modality- and Stage-Aware Resource Disaggregation for Scalable Multimodal Model ServingDec 8, 2025A Performance Analyzer for a Public Cloud's ML-Augmented VM AllocatorAug 22, 2025Murakkab: Resource-Efficient Agentic Workflow Orchestration in Cloud PlatformsMar 6, 2026StreamWise: Serving Multi-Modal Generation in Real-Time at ScaleJul 1, 2020Learning Search Space Partition for Black-box Optimization using Monte Carlo Tree SearchJan 15, 2025Octopus: Enhancing CXL Memory Pods via Sparse TopologyApr 28, 2021Faa$T: A Transparent Auto-Scaling Cache for Serverless ApplicationsMay 31, 2021With Great Freedom Comes Great Opportunity: Rethinking Resource Allocation for Serverless Functions