Implicit Regularization and Convergence for Weight Normalization — arXiv2