Correspondence between neuroevolution and gradient descentDownload PDF

Anonymous

28 Sept 2020 (modified: 05 May 2023)ICLR 2021 Conference Desk Rejected SubmissionReaders: Everyone
Keywords: neuroevolution, gradient descent, theoretical description of learning
Abstract: We show analytically that training a neural network by stochastic mutation or neuroevolution of its weights is equivalent, in the limit of small mutations, to gradient descent on the loss function in the presence of Gaussian white noise. Averaged over independent realizations of the learning process, neuroevolution is equivalent to gradient descent on the loss function. We use numerical simulation to show that this correspondence can be observed for finite mutations. Our results provide a connection between two distinct types of neural-network training, and provide justification for the empirical success of neuroevolution.
One-sentence Summary: We show analytically that training a neural network by stochastic mutation or neuroevolution of its weights is equivalent, in the limit of small mutations, to gradient descent on the loss function in the presence of Gaussian white noise.
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
1 Reply

Loading