
Article Information

  • Title: A Multiclass-based Classification Strategy for Rhetorical Sentence Categorization from Scientific Papers
  • Authors: Dwi H. Widyantoro; Masayu L. Khodra; Bambang Riyanto Trilaksono
  • Journal: Journal of ICT Research and Applications
  • Print ISSN: 2337-5787
  • Online ISSN: 2338-5499
  • Year: 2013
  • Volume: 7
  • Issue: 3
  • Pages: 235-249
  • Language: English
  • Publisher: Institut Teknologi Bandung
  • Abstract: Rapid identification of content structures in a scientific paper is of great importance, particularly for those who actively engage in frontier research. This paper presents a multi-classifier approach to identifying such structures by classifying rhetorical sentences in scientific papers. The approach is based on the observation that no single classifier performs best on all rhetorical categories of sentences. Therefore, the approach learns which classifiers are good at which categories, assigns the classifiers to those categories, and applies only the assigned classifier when classifying a given category. The paper employs k-fold cross-validation over the training data to obtain the category-classifier mapping and then re-learns the classification model of the corresponding classifier on that category using the full training data. The approach has been evaluated on identifying sixteen different rhetorical categories in sentences collected from the ACL-ARC paper collection. The experimental results show that the multi-classifier approach can significantly improve classification performance over multi-label classifiers.
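
The selection procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the two toy classifiers (`KeywordClassifier`, `LengthClassifier`), the F1 selection criterion, the one-vs-rest framing, the fold assignment, and the sample sentences and category names are all assumptions made for the example. The abstract does not say which classifiers or scoring metric the paper uses.

```python
# Sketch of per-category classifier selection via k-fold cross-validation.
# All classifiers, data, and the F1 criterion are hypothetical stand-ins.

class KeywordClassifier:
    """Toy binary classifier: positive if a word seen only in positives occurs."""
    def fit(self, sentences, labels):
        pos, neg = set(), set()
        for s, y in zip(sentences, labels):
            (pos if y else neg).update(s.lower().split())
        self.keywords = pos - neg
        return self

    def predict(self, sentences):
        return [any(w in self.keywords for w in s.lower().split()) for s in sentences]


class LengthClassifier:
    """Toy binary classifier: positive if length is nearer the positive-class mean."""
    def fit(self, sentences, labels):
        mean = lambda xs: sum(xs) / len(xs) if xs else 0.0
        self.pos_mean = mean([len(s.split()) for s, y in zip(sentences, labels) if y])
        self.neg_mean = mean([len(s.split()) for s, y in zip(sentences, labels) if not y])
        return self

    def predict(self, sentences):
        lengths = [len(s.split()) for s in sentences]
        return [abs(n - self.pos_mean) < abs(n - self.neg_mean) for n in lengths]


def f1(y_true, y_pred):
    """Binary F1 score; returns 0.0 when there are no true positives."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0


def select_classifiers(sentences, categories, factories, k=3):
    """Map each category to the classifier with the best mean k-fold F1."""
    n = len(sentences)
    mapping = {}
    for cat in sorted(set(categories)):
        y = [c == cat for c in categories]  # one-vs-rest labels for this category
        scores = {}
        for name, make in factories.items():
            total = 0.0
            for fold in range(k):
                train = [i for i in range(n) if i % k != fold]
                test = [i for i in range(n) if i % k == fold]
                model = make().fit([sentences[i] for i in train], [y[i] for i in train])
                pred = model.predict([sentences[i] for i in test])
                total += f1([y[i] for i in test], pred)
            scores[name] = total / k
        mapping[cat] = max(scores, key=scores.get)
    return mapping


def train_final(sentences, categories, factories, mapping):
    """Re-learn each category's assigned classifier on the full training data."""
    return {cat: factories[name]().fit(sentences, [c == cat for c in categories])
            for cat, name in mapping.items()}


def predict_categories(models, sentence):
    """Apply only the assigned classifier for each category (assumed combination rule)."""
    return [cat for cat, m in sorted(models.items()) if m.predict([sentence])[0]]


FACTORIES = {"keyword": KeywordClassifier, "length": LengthClassifier}
SENTENCES = [
    "we propose a new parser",
    "results improve over the baseline by a large margin on all test sets",
    "we propose a tagging model",
    "results show gains over prior systems across every evaluation set used here",
    "we propose a faster decoder",
    "results confirm the trend on held out data from three separate domains",
]
CATEGORIES = ["AIM", "RESULT", "AIM", "RESULT", "AIM", "RESULT"]

MAPPING = select_classifiers(SENTENCES, CATEGORIES, FACTORIES, k=3)
MODELS = train_final(SENTENCES, CATEGORIES, FACTORIES, MAPPING)
```

Calling `predict_categories(MODELS, "we propose a small model")` then consults, for each category, only the classifier that cross-validation assigned to it. In a realistic setting the toy classifiers would be replaced by proper learners, and a rule for resolving sentences claimed by several categories (or by none) would be needed; the abstract does not describe that step.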