Revisiting Huffman Coding: Toward Extreme Performance on Modern GPU Architectures — arXiv2