Page Classification through Logical Labelling

Published: 2002, Last Modified: 30 Sept 2024ICPR (3) 2002EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: We propose an integrated approach to page classification and logical labelling. Layout is represented by a fully connected attributed relational graph that is matched to the graph of an unknown document, achieving classification and labelling simultaneously. By incorporating global constraints in an integrated fashion, ambiguity at the zone level can be reduced, providing robustness to noise and variation. Models are automatically trained from sample documents. Experimental results show promise for the classification and labelling of technical article title pages, and supports the idea of a hierarchical model base.
Loading