GDPR compliant collection of Therapist-Patient-DialoguesDownload PDF

Published: 21 Oct 2022, Last Modified: 05 May 2023PAI4MH 2022 PosterReaders: Everyone
Keywords: data collection, GDPR, Natural Language Processing
Abstract: According to the Global Burden of Disease list provided by the WHO, mental disorders are among the most debilitating disorders. To improve diagnosis and therapy effectiveness, in recent years, researchers tried to identify individual biomarkers. Gathering neurobiological data however, is costly and time-consuming. Another potential source of information, which is already part of the clinical routine, are therapist-patient dialogues. While there are some pioneering works investigating the role of language as predictors for various therapeutic parameters, for example patient-therapist alliance, there are no large-scale studies. A major obstacle to conduct these studies is the availability of sizeable datasets, which are needed to train machine learning models. While these conversations are part of the daily routine of clinicians, gathering them is usually hindered by various ethical (purpose of data usage), legal (data privacy) and technical (data formatting) limitations. Some of which are particular to the domain of therapy dialogues, like the increased difficulty in anonymisation, or the transcription of the recordings. In this paper, we elaborate on the challenges we faced in starting our collection of therapist-patient dialogues in a psychiatry clinic under the General Data Privacy Regulation of the European Union with the goal to use the data for NLP research. We give an overview over each step in our procedure and point out potential pitfalls to motivate further research in this field.
1 Reply