Training Autoencoders by Alternating Minimization
Sneha Kudugunta, Adepu Shankar, Surya Chavali, Vineeth Balasubramanian, Purushottam Kar
Feb 15, 2018 (modified: Feb 15, 2018) · ICLR 2018 Conference Blind Submission
Abstract: We present DANTE, a novel method for training neural networks, in particular autoencoders, using the alternating minimization principle. DANTE offers a distinct alternative to the traditional gradient-based backpropagation techniques commonly used to train deep networks. It adapts quasi-convex optimization techniques to cast autoencoder training as a bi-quasi-convex optimization problem. We show that for autoencoder configurations with both differentiable (e.g. sigmoid) and non-differentiable (e.g. ReLU) activation functions, the alternating steps can be carried out effectively. DANTE extends naturally to networks with multiple hidden layers and varying network configurations. In experiments on standard datasets, autoencoders trained with the proposed method compared favorably with those trained using traditional backpropagation, in terms of training speed as well as feature-extraction and reconstruction performance.
TL;DR: We use the alternating minimization principle to derive a novel and effective technique for training deep autoencoders.
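To make the alternating-minimization idea concrete, below is a minimal sketch for a one-hidden-layer autoencoder. This illustrates the general principle only, not the paper's DANTE algorithm: DANTE solves each half-problem with quasi-convex optimization techniques for generalized linear models, whereas this sketch simply runs SGD on one set of weights while the other is frozen. All names, dimensions, and hyperparameters here are illustrative assumptions.

```python
import torch
import torch.nn as nn

# Sketch of alternating minimization for a one-hidden-layer autoencoder.
# NOT the authors' DANTE procedure: each phase below is solved with plain
# SGD for simplicity, rather than the quasi-convex techniques in the paper.

torch.manual_seed(0)
X = torch.randn(256, 64)      # toy data: 256 samples, 64 features (assumed)

encoder = nn.Linear(64, 16)   # maps inputs to a 16-dim hidden code
decoder = nn.Linear(16, 64)   # reconstructs inputs from the hidden code
loss_fn = nn.MSELoss()

def reconstruction_loss():
    # Sigmoid hidden activation, as in the differentiable case discussed above.
    return loss_fn(decoder(torch.sigmoid(encoder(X))), X)

for outer in range(10):
    # Phase 1: freeze the decoder, minimize over the encoder weights only.
    opt = torch.optim.SGD(encoder.parameters(), lr=0.1)
    for _ in range(20):
        opt.zero_grad()
        loss = reconstruction_loss()
        loss.backward()
        opt.step()

    # Phase 2: freeze the encoder, minimize over the decoder weights only.
    opt = torch.optim.SGD(decoder.parameters(), lr=0.1)
    for _ in range(20):
        opt.zero_grad()
        loss = reconstruction_loss()
        loss.backward()
        opt.step()

    print(f"outer step {outer}: reconstruction loss = {loss.item():.4f}")
```

The key structural point is that each phase holds one layer fixed, so each subproblem reduces to fitting a single generalized linear model; the paper's contribution is showing these subproblems are quasi-convex and can be minimized effectively.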