ParseNet: Looking Wider to See BetterDownload PDF

29 Mar 2024 (modified: 16 Feb 2016)ICLR 2016 workshop submissionReaders: Everyone
CMT Id: 240
Abstract: We present a technique for adding global context to fully convolutional networks for semantic segmentation. The approach is simple, using the average feature for a layer to augment the features at each location. In addition, we study several idiosyncrasies of training, significantly increasing the performance of baseline networks (e.g. from FCN~\cite{long2014fully}). When we add our proposed global feature, and a technique for learning normalization parameters, accuracy increases consistently even over our improved versions of the baselines. Our proposed approach, ParseNet, achieves state-of-the-art performance on SiftFlow and PASCAL-Context with small additional computational cost over baselines, and near state-of-the-art performance on PASCAL VOC 2012 semantic segmentation with a simple approach. Code is available at \url{https://github.com/weiliu89/caffe/tree/fcn} .
Conflicts: cs.unc.edu, magicleap.com
0 Replies

Loading