Showing 1–20 of 59 results
Date | Name
Feb 4, 2022 | MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks
Oct 29, 2020 | Accordion: Adaptive Gradient Communication via Critical Learning Regime Identification
Jun 4, 2018 | Learning a Code: Machine Learning for Approximate Non-Linear Coded Computation
Apr 4, 2026 | Minos: Systematically Classifying Performance and Power Characteristics of GPU Workloads on HPC Clusters
Mar 10, 2022 | LlamaTune: Sample-Efficient DBMS Configuration Tuning
Oct 11, 2019 | Blink: Fast and Generic Collectives for Distributed ML
Feb 4, 2025 | LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models
Feb 2, 2021 | AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning
Oct 6, 2020 | Move Fast and Meet Deadlines: Fine-grained Real-time Stream Processing with Cameo
Sep 30, 2022 | Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning
Feb 20, 2017 | Hemingway: Modeling Distributed Optimization Algorithms
Oct 29, 2016 | KeystoneML: Optimizing Pipelines for Large-Scale Advanced Analytics
Jan 30, 2025 | Scaling Inference-Efficient Language Models
May 2, 2019 | Parity Models: A General Framework for Coding-Based Resilience in ML Inference
Jan 6, 2023 | Does compressing activations help model parallel training?
Feb 13, 2017 | Occupy the Cloud: Distributed Computing for the 99%
Aug 23, 2022 | Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems
Jan 20, 2021 | Marius: Learning Massive Graph Embeddings on a Single Machine
Feb 24, 2022 | BagPipe: Accelerating Deep Recommendation Model Training
Aug 21, 2024 | PAL: A Variability-Aware Policy for Scheduling ML Workloads in GPU Clusters