EXPLORING DEEP LEARNING USING INFORMATION THEORY TOOLS AND PATCH ORDERING

Henok Ghebrechristos; Gita Alaghband

27 Sept 2018 (modified: 05 May 2023), ICLR 2019 Conference Withdrawn Submission, Readers: Everyone

Abstract: We present a framework for automatically ordering image patches that enables in-depth analysis of how a dataset relates to the learnability of a classification task using a convolutional neural network (CNN). An image patch is a group of pixels residing in a contiguous area of a sample. Our preliminary experimental results show that an informed shuffling of patches at the sample level can expedite training by exposing important features at early stages of training. In addition, we conduct systematic experiments and provide evidence that a CNN's generalization capabilities do not correlate with the human-recognizable features present in training samples. We use the framework not only to show that the spatial locality of features within samples does not correlate with generalization, but also to expedite convergence while achieving similar generalization performance. Using multiple network architectures and datasets, we show that ordering image regions by a mutual information measure between adjacent patches enables CNNs to converge in a third of the total steps required to train the same network without patch ordering.

Keywords: CNN, Deep Learning, Feature Extraction, Patch Ordering, Convergence, Image Classification

TL;DR: We develop new techniques that rely on patch reordering to enable detailed analysis of how a dataset relates to training and generalization performance.
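
To make the idea in the abstract concrete, the following is a minimal sketch of ordering patches by mutual information between adjacent patches. It is not the authors' implementation: the function names (`split_into_patches`, `mutual_information`, `order_patches_by_mi`), the histogram-based estimator with 16 bins, the definition of "adjacent" as the next patch in raster order (with wraparound), and the descending sort are all assumptions made for illustration on a single-channel 8-bit image.

```python
import numpy as np


def split_into_patches(image, patch_size):
    """Split a 2-D image into non-overlapping square patches (raster order)."""
    h, w = image.shape
    if h % patch_size or w % patch_size:
        raise ValueError("image dimensions must be multiples of patch_size")
    rows, cols = h // patch_size, w // patch_size
    patches = image.reshape(rows, patch_size, cols, patch_size).swapaxes(1, 2)
    return patches.reshape(rows * cols, patch_size, patch_size)


def mutual_information(a, b, bins=16):
    """Plug-in estimate of I(A; B) in bits from a joint intensity histogram."""
    joint, _, _ = np.histogram2d(
        a.ravel(), b.ravel(), bins=bins, range=[[0, 256], [0, 256]]
    )
    pxy = joint / joint.sum()
    px = pxy.sum(axis=1, keepdims=True)   # marginal of A, shape (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)   # marginal of B, shape (1, bins)
    nz = pxy > 0                          # skip empty cells to avoid log(0)
    return float(np.sum(pxy[nz] * np.log2(pxy[nz] / (px @ py)[nz])))


def order_patches_by_mi(image, patch_size, bins=16):
    """Reassemble the image with patches sorted by MI with their neighbor.

    Assumption: each patch is scored against the next patch in raster order,
    and patches are placed in descending order of that score.
    """
    patches = split_into_patches(image, patch_size)
    n = len(patches)
    scores = np.array(
        [mutual_information(patches[i], patches[(i + 1) % n], bins) for i in range(n)]
    )
    order = np.argsort(-scores, kind="stable")
    rows, cols = image.shape[0] // patch_size, image.shape[1] // patch_size
    reordered = (
        patches[order]
        .reshape(rows, cols, patch_size, patch_size)
        .swapaxes(1, 2)
        .reshape(image.shape)
    )
    return reordered, order
```

Under these assumptions, calling `order_patches_by_mi(image, 4)` on a 32×32 grayscale array returns an image of the same shape whose 4×4 patches are a permutation of the originals, together with the permutation itself. Other adjacency definitions, estimators, or sort directions would fit the abstract's description equally well.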
