首页    期刊浏览 2025年09月15日 星期一
登录注册

文章基本信息

  • 标题:A Novel Multi-View Ensemble Learning Architecture to Improve the Structured Text Classification
  • 本地全文:下载
  • 作者:Carlos Adriano Gonçalves ; Adrián Seara Vieira ; Célia Talma Gonçalves
  • 期刊名称:Information
  • 电子版ISSN:2078-2489
  • 出版年度:2022
  • 卷号:13
  • 期号:6
  • 页码:283
  • DOI:10.3390/info13060283
  • 语种:English
  • 出版社:MDPI Publishing
  • 摘要:Multi-view ensemble learning exploits the information of data views. To test its efficiency for full text classification, a technique has been implemented where the views correspond to the document sections. For classification and prediction, we use a stacking generalization based on the idea that different learning algorithms provide complementary explanations of the data. The present study implements the stacking approach using support vector machine algorithms as the baseline and a C4.5 implementation as the meta-learner. Views are created with OHSUMED biomedical full text documents. Experimental results lead to the sustained conclusion that the application of multi-view techniques to full texts significantly improves the task of text classification, providing a significant contribution for the biomedical text mining research. We also have evidence to conclude that enriched datasets with text from certain sections are better than using only titles and abstracts.
国家哲学社会科学文献中心版权所有