This are the Supplementary Materials associated with the paper titled `Measuring the State of Document Understanding` send to NeurIPS'21 Dataset and Benchmarks track. 

#Content

## code_to_reproduce
This is the directory that provide tools to download data, reproduce the baseline results and evaluation.

## schema 
Contains a specification for JSON based format for defining the structure of benchmark's datasets.
Also contains examples - definitions of a dataset-level schema that describe version, license, name, web url and other metadata.

## appendices
Contains pdf file with appendices that are part of paper submission.
