Automatic Generation of Annotated Collection for Recognition of Sentiment Frames

Published: 08 Apr 2021, Last Modified: 08 Aug 2024OpenReview Archive Direct UploadEveryoneCC BY 4.0
Abstract: While addressing the challenge of sentiment analysis, it is crucial to consider not only the polarity of certain words but also the polarity between them, particularly between the arguments of a predicate. For this purpose, the RuSentiFrames lexicon was created. But the training of the ML model requires an annotated collection of data, and since the manual annotation is laborious and expensive, the automation of the process is preferable. In this paper, we describe a rule-based approach to automatic annotation of semantic roles for the predicates of the RuSentiFrames lexicon. The implementation of the algorithm includes the search of the entities with certain morpho-syntactic features in the order that depends on the case of the entity and is based on calculation of the posterior probabilities of the co-occurrence of a certain case and a certain type of predicate arguments. The results of the algorithm evaluation, based on several different characteristics, were relatively high. The solutions of problematic cases have been suggested and are expected to be implemented in further research.
Loading