An Effective Information Theoretic Framework for Channel Pruning — arXiv2