首页    期刊浏览 2024年11月25日 星期一
登录注册

文章基本信息

  • 标题:Evaluating the Reliability and Interaction of Recursively Used Feature Classes for Terminology Extraction
  • 本地全文:下载
  • 作者:Anna Hätty ; Michael Dorna ; Sabine Schulte im Walde
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2017
  • 卷号:2017
  • 页码:113-121
  • 语种:English
  • 出版社:ACL Anthology
  • 摘要:Feature design and selection is a crucial aspect when treating terminology extraction as a machine learning classification problem. We designed feature classes which characterize different properties of terms based on distributions, and propose a new feature class for components of term candidates. By using random forests, we infer optimal features which are later used to build decision tree classifiers. We evaluate our method using the ACL RD-TEC dataset. We demonstrate the importance of the novel feature class for downgrading termhood which exploits properties of term components. Furthermore, our classification suggests that the identification of reliable term candidates should be performed successively, rather than just once.
国家哲学社会科学文献中心版权所有