Abstract: Highlights
• An encoder-decoder architecture with mirror skip connections is proposed for text localization.
• Linear, parametric, and convolutional mirror skip connections are implemented.
• Three models with different kernel sizes are ensembled to capture multi-scale features.
• The proposed model outperforms state-of-the-art pixel-level classifiers.
• Datasets used: ICDAR 2003, ICDAR 2015, SVT, and Total-Text.
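
The highlights name three kinds of mirror skip connection and an ensemble of three models, but do not define them. The following is a minimal sketch of one plausible reading, not the authors' implementation. It assumes that a "linear" skip adds the encoder feature to its mirrored decoder feature unchanged, that a "parametric" skip scales it by a scalar first, that a "convolutional" skip filters it first, and that the ensemble averages per-pixel text probabilities. All function names, the 1-D feature vectors, and the averaging rule are hypothetical choices made for illustration.

```python
# Illustrative sketch only. The three skip variants and the averaging rule are
# assumed readings of the highlights; none of this is taken from the paper.

def linear_skip(enc, dec):
    """Assumed 'linear' skip: add the encoder feature to the mirrored decoder feature."""
    return [e + d for e, d in zip(enc, dec)]


def parametric_skip(enc, dec, alpha):
    """Assumed 'parametric' skip: scale the encoder feature by alpha, then add.

    In a trained network alpha would be learned; here it is a fixed number.
    """
    return [alpha * e + d for e, d in zip(enc, dec)]


def convolutional_skip(enc, dec, kernel):
    """Assumed 'convolutional' skip: filter the encoder feature, then add.

    Uses zero padding so the output keeps the input length, and slides the
    kernel without flipping it (cross-correlation, as deep learning frameworks
    do). The kernel length is expected to be odd.
    """
    half = len(kernel) // 2
    padded = [0.0] * half + list(enc) + [0.0] * half
    filtered = [
        sum(k * padded[i + j] for j, k in enumerate(kernel))
        for i in range(len(enc))
    ]
    return [f + d for f, d in zip(filtered, dec)]


def ensemble(prob_maps):
    """Assumed fusion rule: average per-pixel text probabilities across models."""
    n = len(prob_maps)
    return [sum(vals) / n for vals in zip(*prob_maps)]


if __name__ == "__main__":
    enc = [1.0, 2.0, 3.0]   # toy encoder feature
    dec = [0.5, 0.5, 0.5]   # toy mirrored decoder feature
    print(linear_skip(enc, dec))
    print(parametric_skip(enc, dec, alpha=0.5))
    print(convolutional_skip(enc, dec, kernel=[1.0, 1.0, 1.0]))
    # One probability map per model, e.g. models with different kernel sizes.
    print(ensemble([[0.25, 0.75], [0.5, 0.5], [0.75, 1.0]]))
```

Under this reading, the three variants differ only in how the encoder feature is transformed before it is merged, so they can be swapped without changing the rest of the encoder-decoder.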