MEDFORD in a Box: Improvements and Future Directions for a Metadata Description Language

Polina Shpilker, Benjamin Stubbs, Michael Sayers, Yumin Lee, Lenore Cowen, Donna K. Slonim, Shaun Wallace, Alva L. Couch, Noah M. Daniels

Published: 2026, Last Modified: 07 Mar 2026CoRR 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Scientific research metadata is vital to ensure the validity, reusability, and cost-effectiveness of research efforts. The MEDFORD metadata language was previously introduced to simplify the process of writing and maintaining metadata for non-programmers. However, barriers to entry and usability remain, including limited automatic validation, difficulty of data transport, and user unfamiliarity with text file editing. To address these issues, we introduce MEDFORD-in-a-Box (MIAB), a documentation ecosystem to facilitate researcher adoption and earlier metadata capture. MIAB contains many improvements, including an updated MEDFORD parser with expanded validation routines and BagIt export capability. MIAB also includes an improved VS Code extension that supports these changes through a visual IDE. By simplifying metadata generation, this new tool supports the creation of correct, consistent, and reusable metadata, ultimately improving research reproducibility.
Loading