首页    期刊浏览 2025年02月21日 星期五
登录注册

文章基本信息

  • 标题:Using AdaBoost Meta-Learning Algorithm for Medical News Multi-Document Summarization
  • 本地全文:下载
  • 作者:Mahdi Gholami Mehr
  • 期刊名称:Intelligent Information Management
  • 印刷版ISSN:2150-8194
  • 电子版ISSN:2150-8208
  • 出版年度:2013
  • 卷号:05
  • 期号:06
  • 页码:182-190
  • DOI:10.4236/iim.2013.56020
  • 语种:English
  • 出版社:Scientific Research Publishing
  • 摘要:Automatic text summarization involves reducing a text document or a larger corpus of multiple documents to a short set of sentences or paragraphs that convey the main meaning of the text. In this paper, we discuss about multi-document summarization that differs from the single one in which the issues of compression, speed, redundancy and passage selection are critical in the formation of useful summaries. Since the number and variety of online medical news make them difficult for experts in the medical field to read all of the medical news, an automatic multi-document summarization can be useful for easy study of information on the web. Hence we propose a new approach based on machine learning meta-learner algorithm called AdaBoost that is used for summarization. We treat a document as a set of sentences, and the learning algorithm must learn to classify as positive or negative examples of sentences based on the score of the sentences. For this learning task, we apply AdaBoost meta-learning algorithm where a C4.5 decision tree has been chosen as the base learner. In our experiment, we use 450 pieces of news that are downloaded from different medical websites. Then we compare our results with some existing approaches.
  • 关键词:Multi-Document Summarization; Machine Learning; Decision Trees; AdaBoost; C4.5; Medical Document Summarization
国家哲学社会科学文献中心版权所有