Semi Supervised Approach for Relation Extraction in Agriculture Documents

Published: 01 Jan 2022, Last Modified: 20 May 2025, OCIT 2022, CC BY-SA 4.0
Abstract: In this work, we propose a semi-supervised bootstrapping approach for relation extraction in domain-specific texts, focusing on the agricultural domain. Our approach combines the BERT model with dependency parsing for relation extraction. The proposed model focuses on identifying five inter-subdomain relations, viz., Soil_Location, Soil_Crop, Disease_Pathogen, Pathogen_Crop, and Chemical_Crop. We created a corpus of 30,000 sentences extracted from recognised agriculture sites to evaluate the model. The labeled relations were then manually checked to evaluate the prediction accuracy. For the evaluation, we used a test corpus of 700 sentences containing 3500 triplets. The proposed approach achieves an average macro F-score of 86.4%, which is promising for semi-supervised domain-specific relation extraction systems. Experimental results show the efficacy of the proposed approach in classifying relational phrases in a semi-supervised set-up for the agricultural domain.
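For readers unfamiliar with the metric, the following is a minimal sketch of how a macro F-score over the five relation labels could be computed. It is not the authors' evaluation code: the function name `macro_f1` and the toy `gold` and `pred` lists are hypothetical and invented for illustration, and the sketch assumes each triplet receives exactly one gold label and one predicted label.

```python
from collections import Counter

# The five relation labels named in the abstract.
RELATIONS = [
    "Soil_Location",
    "Soil_Crop",
    "Disease_Pathogen",
    "Pathogen_Crop",
    "Chemical_Crop",
]


def macro_f1(gold, pred, labels=RELATIONS):
    """Unweighted mean of per-label F1 scores (hypothetical helper)."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted label p was wrong
            fn[g] += 1  # gold label g was missed
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the label never occurs.
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy data, invented for illustration only (not from the paper's test corpus).
gold = ["Soil_Crop", "Soil_Crop", "Soil_Location",
        "Disease_Pathogen", "Pathogen_Crop", "Chemical_Crop"]
pred = ["Soil_Crop", "Soil_Location", "Soil_Location",
        "Disease_Pathogen", "Pathogen_Crop", "Chemical_Crop"]

score = macro_f1(gold, pred)
print(f"macro F1 = {score:.4f}")
```

Because the macro average weights every relation equally, a rare relation with poor performance lowers the score as much as a frequent one would.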