The Impact of the Mini-batch Size on the Variance of Gradients in Stochastic Gradient Descent — arXiv2