Non-deterministic learning dynamics in large neural networks due to structural data bias

/ Authors

/ Abstract

We study the dynamics of on-line learning in large (N→∞) perceptrons, for the case of training sets with a structural (N0) bias of the input vectors, by deriving exact and closed macroscopic dynamical laws using non-equilibrium statistical mechanical tools. In sharp contrast to the more conventional theories developed for homogeneously distributed or only weakly biased data, these laws are found to describe a non-trivial and persistently non-deterministic macroscopic evolution, and a generalization error which retains both stochastic and sample-to-sample fluctuations, even for infinitely large networks. Furthermore, for the standard error-correcting microscopic algorithms (such as the perceptron learning rule) one obtains learning curves with distinct bias-induced phases. Our theoretical predictions find excellent confirmation in numerical simulations.

Journal: Journal of Physics A

DOI: 10.1088/0305-4470/33/48/309