High-entropy Advantage in Neural Networks' Generalizability — arXiv2