Abstract: Highlights•We construct a scale-aware hierarchical attention network for scene text recognition.•We exploit a pyramidal structure to learn multi-scale features for characters.•A hierarchical attention decoder is proposed for the most fine-grained prediction.
External IDs:doi:10.1016/j.patrec.2020.06.009
Loading