Showing 1–12 of 12 results
/ Date/ Name
Aug 7, 2021Asymmetry-aware Scalable LockingMar 19, 2022No Provisioned Concurrency: Fast RDMA-codesigned Remote Fork for Serverless ComputingApr 27, 2026Characterizing Vision-Language-Action Models across XPUs: Constraints and Acceleration for On-Robot DeploymentMay 2, 2026VUDA: Breaking CUDA-Vulkan Isolation for Spatial Sharing of Compute and Graphics on the Same GPUJul 21, 2023Transactional Indexes on (RDMA or CXL-based) Disaggregated Memory with Repairable TransactionMay 8, 2025PIDiff: Image Customization for Personalized Identities with Diffusion ModelsAug 14, 2025Leveraging OS-Level Primitives for Robotic Action ManagementMay 20, 2024PhoenixOS: Concurrent OS-level GPU Checkpoint and Restore with Validated SpeculationJan 24, 2024Characterizing Network Requirements for GPU API Remoting in AI ApplicationsMar 25, 2026SOMA: Strategic Orchestration and Memory-Augmented System for Vision-Language-Action Model Robustness via In-Context AdaptationMar 10, 2026FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource IsolationNov 17, 2025TZ-LLM: Protecting On-Device Large Language Models with Arm TrustZone