首页    期刊浏览 2024年07月06日 星期六
登录注册

文章基本信息

  • 标题:An Attention-Based Model Using Character Composition of Entities in Chinese Relation Extraction
  • 本地全文:下载
  • 作者:Xiaoyu Han ; Yue Zhang ; Wenkai Zhang
  • 期刊名称:Information
  • 电子版ISSN:2078-2489
  • 出版年度:2020
  • 卷号:11
  • 期号:2
  • 页码:79-95
  • DOI:10.3390/info11020079
  • 出版社:MDPI Publishing
  • 摘要:Relation extraction is a vital task in natural language processing. It aims to identify the relationship between two specified entities in a sentence. Besides information contained in the sentence, additional information about the entities is verified to be helpful in relation extraction. Additional information such as entity type getting by NER (Named Entity Recognition) and description provided by knowledge base both have their limitations. Nevertheless, there exists another way to provide additional information which can overcome these limitations in Chinese relation extraction. As Chinese characters usually have explicit meanings and can carry more information than English letters. We suggest that characters that constitute the entities can provide additional information which is helpful for the relation extraction task, especially in large scale datasets. This assumption has never been verified before. The main obstacle is the lack of large-scale Chinese relation datasets. In this paper, first, we generate a large scale Chinese relation extraction dataset based on a Chinese encyclopedia. Second, we propose an attention-based model using the characters that compose the entities. The result on the generated dataset shows that these characters can provide useful information for the Chinese relation extraction task. By using this information, the attention mechanism we used can recognize the crucial part of the sentence that can express the relation. The proposed model outperforms other baseline models on our Chinese relation extraction dataset.
  • 关键词:relation extraction; Chinese; character; attention; distant supervision relation extraction ; Chinese ; character ; attention ; distant supervision
国家哲学社会科学文献中心版权所有