Abstract: In this paper, we propose a new network architecture called Convolutional Multi-directional Recurrent Network (CDRN) for offline handwritten text recognition. The conventional recurrent neural network model obtains the local context from limited directions, whereas we build up the multi-directional long short-term memory (MDirLSTM) module to abstract contextual information in various directions. Moreover, we develop a shortcut connection strategy in our proposed architecture for faster yet better convergence. In cooperation with the aforementioned methods, the proposed architecture also benefits from the following properties: (1) it obtains informative features of the input directly without involving hand-crafted features and segmentation, and (2) it is an end-to-end trainable model whose components are trained conjointly. We evaluate the performance of the proposed method on two databases: IAM words and IRONOFF. Our experimental results demonstrate a significant increase in recognition performance using MDirLSTM and shortcut connections, which suggests the effectiveness of these two proposed methods.
0 Replies
Loading