Towards a new Curation Workflow for the CMC Corpora Resource Family

Published: 30 Nov 2025, Last Modified: 13 May 202612th International Conference on Computer Mediated Communication and Social Media Corpora for the HumanitiesEveryoneCC BY 4.0
Abstract: This paper aims at raising awareness regarding a recently started and ongoing effort of CLARIN ERIC and the CLARIN Knowledge Centre for CMC and Social Media Corpora (CKCMC) to enhance the visibility and accessibility of the CMC community’s datasets through the CLARIN CMC Corpora Resource Family (CMC-RF), which, as of May 2025, the CKCMC officially adopted, that is, it took over responsibility. We offer some possible scenarios regarding how curation (addition, change and deletion of entries) of the CMC-RF could be approached, with the objective to prepare a productive context for a roundtable at the forthcoming 2025 edition of the CMC-Corpora conference. We intend to use the outcomes of this roundtable to devise a grounded and informed first version of Curation Guidelines for the CMC-RF, together with a community-oriented procedure to update it.
Loading