NICO++: Towards Better Benchmarking for Domain Generalization — arXiv2