Information Extraction from Text Regions with Complex Tabular Structure

Kaixuan Zhang; Zejiang Shen; Jie Zhou; Melissa Dell

Information Extraction from Text Regions with Complex Tabular Structure

Kaixuan Zhang, Zejiang Shen, Jie Zhou, Melissa Dell

Published: 01 Nov 2019, Last Modified: 05 May 2023DI 2019Readers: Everyone

Abstract: Recent innovations have improved layout analysis of document images, significantly improving our ability to identify text and non-text regions. However, extracting information from within text regions remains quite challenging because the text region may have a complex structure. In this paper, we present a new dataset with complex text structure, and propose new methods to robustly retrieve information from the complex text region.

1 Reply

Loading