首页    期刊浏览 2025年03月09日 星期日
登录注册

文章基本信息

  • 标题:Efficient Reduction of Overgeneration Errors for Automatic Controlled Indexing with an Application to the Biomedical Domain
  • 本地全文:下载
  • 作者:Samassi Adama ; Brou Konan Marcellin ; Gooré Bi Tra
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2018
  • 卷号:9
  • 期号:12
  • DOI:10.14569/IJACSA.2018.091225
  • 出版社:Science and Information Society (SAI)
  • 摘要:Studies on MetaMap and MaxMatcher has shown that both concept extraction systems suffer from overgeneration problems. Over-generation occurs when the extraction systems mistakenly select an irrelevant concept. One of the reasons for these errors is that these systems use the words to weight the terms of the concepts. In this paper, an Integer Linear Programming model is used to select the optimal subset of extracted concept mentions covering the largest number of important words in the document to be indexed. Then each concept mentions that this set is mapped to a unique concept in UMLS using an information retrieval model.
  • 关键词:Concept extraction; concept recognition; automatic controlled indexing; controlled vocabulary; information retrieval
国家哲学社会科学文献中心版权所有