Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features

Chieh Hubert Lin; Hsin-Ying Lee; Hung-Yu Tseng; Maneesh Kumar Singh; Ming-Hsuan Yang

Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features

Chieh Hubert Lin, Hsin-Ying Lee, Hung-Yu Tseng, Maneesh Kumar Singh, Ming-Hsuan Yang

16 May 2022 (modified: 06 Apr 2025)NeurIPS 2022 SubmittedReaders: Everyone

Keywords: positional information, position encoding, padding, CNN

TL;DR: We develop a reliable metric for measuring the positional information from CNN paddings and use it to conduct a large scale study.

Abstract: Recent studies show that paddings in convolutional neural networks encode absolute position information which can negatively affect the model performance for certain tasks. However, existing metrics for quantifying the strength of positional information remain unreliable and frequently lead to erroneous results. To address this issue, we propose novel metrics for measuring (and visualizing) the encoded positional information. We formally define the encoded information as PPP (Position-information Pattern from Padding) and conduct a series of experiments to study its properties as well as its formation. The proposed metrics measure the presence of positional information more reliably than the existing metrics based on PosENet and a test in F-Conv. We also demonstrate that for any extant (and proposed) padding schemes, PPP is primarily a learning artifact and is less dependent on the characteristics of the underlying padding schemes.

Supplementary Material: pdf

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 3 code implementations](https://www.catalyzex.com/paper/unveiling-the-mask-of-position-information/code)

13 Replies

Loading