期刊名称:Journal of Theoretical and Applied Information Technology
印刷版ISSN:1992-8645
电子版ISSN:1817-3195
出版年度:2014
卷号:70
期号:3
出版社:Journal of Theoretical and Applied
摘要:The automatic extraction of semantic relations between words from textual corpora is an extremely challenging task. The increasing need for language resources supporting Natural language processing (NLP) applications has encouraged the development of automated methods for the extraction of semantic relations between words. The use of corpus statistical and similarity distribution methods can help in the task of semantic relation extraction between pairs of words. In this paper, we present a pattern-based bootstrapping approach using Arabic language corpora and a corpus analysis tool (Sketch Engine) to extract the semantic relations (antonyms) between word pairs. The algorithm uses LogDice and pattern co-occurrence to classify the extracted pairs into antonyms. Results of evaluation show that our approach is able to extract the antonym relations with a precision of 76%.