Abstract: Across many domains, the growing amount of data presents a challenge in extracting meaningful insights. A significant hurdle is the accurate interpretation and integration of data from diverse sources, often dictated by their specific applications. The RDF Mapping Language (RML), based on the W3C recommendation R2RML, can be used to transform heterogeneous data formats to RDF using defined mappings. However, existing RML implementations only support a limited set of (semi-)structured data sources such as CSV, SQL, XML, and JSON, neglecting numerous use-cases relying on other formats. This work overcomes this limitation by proposing a methodology to flexibly extend RML to support additional source formats. We systematically analyze RML and its implementations to derive a generic concept for the extension of RML. Our contributions include a general workflow for extending RML with new formats and demonstrative implementations of the RML Mapper for two examples from Building Information
Loading