Abstract: A hybrid technique for extraction of bigram and trigram multiword expressions is presented. This technique works in two
phases as first statistical technique is applied to filter the extracted bigrams and trigrams from English text, and after it multiword expressions are extracted from this list using some linguistic
rules. Two methods for threshold decision in statistical technique
are also presented, first is by minimizing the error in classification,
and the second is based on maximizing the recall value.
Loading