
Article Information

  • Title: Speculation and Negation Annotation for Arabic Biomedical Texts: BioArabic Corpus
  • Author: Fatima T. AL-Khawaldeh
  • Journal: The World of Computer Science and Information Technology Journal
  • Print ISSN: 2221-0741
  • Year: 2016
  • Volume: 6
  • Issue: 1
  • Pages: 4
  • Language: English
  • Publisher: WCSIT Publishing
  • Abstract: Negation and speculation are two common linguistic phenomena in natural language processing that require deeper semantic understanding of text. They are used to determine the factuality of a text: negation expresses the opposite of a statement, and speculation indicates its degree of certainty. Biomedical text mining is the main natural language processing application concerned with negation and speculation, since it must distinguish facts from uncertain or negated information in biomedical text. To our knowledge, there is no previous research on annotating Arabic biomedical text to identify negative or speculative expressions, and there are no publicly available standard corpora of suitable size that are practical for evaluating tools for automatic negation and speculation detection and scope determination. This paper presents a corpus of Arabic biomedical texts annotated for negation and speculation (we call this corpus the BioArabic corpus). The goal of building the BioArabic corpus is to help biologists and computational linguists who develop tools for identifying negation and speculation to train and evaluate these tools, since assumptions, experimental results, and negative results are used extensively in the language of biomedical texts. We report statistics on corpus size and the consistency of annotations.