Integrating linguistic knowledge into DNNs: Application to online grooming detectionDownload PDF

28 Sept 2020 (modified: 05 May 2023)ICLR 2021 Conference Blind SubmissionReaders: Everyone
Keywords: Machine Learning, Corpus Linguistics
Abstract: Online grooming (OG) of children is a pervasive issue in an increasingly interconnected world. We explore various complementary methods to incorporate Corpus Linguistics (CL) knowledge into accurate and interpretable Deep Learning (DL) models. They provide an implicit text normalisation that adapts embedding spaces to the groomers' usage of language, and they focus the DNN's attention onto the expressions of OG strategies. We apply these integration to two architecture types and improve on the state-of-the-art on a new OG corpus.
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
One-sentence Summary: Incorporating Corpus Linguistic knowledge in Deep Learning models to create accurate and interpretable models.
Supplementary Material: zip
Reviewed Version (pdf): https://openreview.net/references/pdf?id=LampF4VckO
16 Replies

Loading