Abstract: Highlights•A parser to retrieve the resume structure from a given pdf document.•A simple approach to ensure the correct resume reading order.•Two segmentation models to extract the sections and subsections from a resume.•An anonymized dataset for the resume template identification problem.