Specializing Coherence, Consistency, and Push/Pull for GPU Graph Analytics
/ Authors
/ Abstract
This work explores the interaction of three communication-centric design dimensions for graph workloads on emerging integrated CPU-GPU systems: update propagation with and without fine-grained synchronization (push vs. pull), emerging coherence protocols (GPU vs. DeNovo coherence), and software-centric consistency models (DRF0, DRF1, and DRFrlx). We show that these dimensions are inter-dependent and the best design depends on the graph algorithm and input. We develop a model to predict this best design, motivating flexible and hardware-software co-designed GPU memory systems.
Journal: 2020 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)