Affiliation strings for 1008 math publications (preview version)
Abstract: This dataset contains affiliation strings extracted for 1,008 mathematics publications. The data was selected from zbMATH Open data, by selecting publications from journal issues in the category FAST TRACK/COVER for the years 2016–2025. For each publication, internal zbMATH affiliation strings were reviewed and manually checked by comparing with the version of record. The resulting dataset consists of three columns: line_no: line number (for debugging) zbl: Unique identifier from zbMATH Open. Prefix with https://zbmath.org/ to visit additional information on the article. For example, 5635019 is associated with https://zbmath.org/5635019 aff_str: the affiliation string as it appears in the source metadata This resource can be used for tasks such as analyzing institutional contributions to mathematical research, studying author–affiliation patterns, or testing affiliation string parsing methods. With the zbl field one can derive additional metadata such as the DOI for each paper. In future versions of the dataset, we will include an extra table with the most important meatdata. In addition we plan to release signatures (combination of unique author identifiers and unique institution identifiers) in addition to the affiliation strings. Moreover, we will provide negative samples for publication that had incorrect affiliation strings, accoring to our records and discuss possible reasons for incorrect, incomplete or missing affiliation strings. For now, the dataset only contains positive samples. Another addition we are planning to work on is statistics on the distribtion of the data (by year, publisher, country, etc.). As this is a preview version of the dataset, we are very happy about any feedback. Please contact the first author of the dataset for feedback and suggestions.
External IDs:doi:10.5281/zenodo.17105141
Loading