首页    期刊浏览 2024年11月26日 星期二
登录注册

文章基本信息

  • 标题:Predictive and perspective analysis of cancer image data set using machine learning algorithms
  • 本地全文:下载
  • 作者:Divya Chauhan ; Kishori Lal Bansal
  • 期刊名称:International Journal of Advanced Computer Research
  • 印刷版ISSN:2249-7277
  • 电子版ISSN:2277-7970
  • 出版年度:2020
  • 卷号:10
  • 期号:49
  • 页码:1-7
  • 出版社:Association of Computer Communication Education for National Triumph (ACCENT)
  • 摘要:Classification and prediction of the images are fairly easy task for humans, but it takes more effort for a machine to do the same. Machine learning helps to attain this goal. It automates the task of classifying a large collection of images into different classes by labelling the incoming data and recognizes patterns in it, which is subsequently translated into valuable insights. The aim of this paper is to classify the image data set of five cancer types, namely Osteosarcoma, Prostate Cancer, Brain Cancer, Breast Cancer and Acute Myeloid Leukaemia. Furthermore, the prediction of Osteosarcoma case for one of the four classes of tumor namely Non tumor, Non-Viable tumor, viable tumor, Viable: Non-Viable tumor has to be done. The quantitative analysis is done using various machine learning libraries of python. The three classification algorithms used for image analysis are random forest, SVM, and logistic regression. The metrics used for performing perspective analysis are precision, recall and F1 Score. The results show that the random forest algorithm has performed best amongst the three classification algorithms when given with less complicated scenario, with prediction accuracy, precision, recall and f1 score of 100%. But the performance of every classification algorithm degrades when provided with the cases of Osteosarcoma which has got more complicated scatter graph. However, the logistic regression retains its performance by predicting tumor cases with 99% accuracy.
  • 关键词:Data mining; Big data; Hadoop; Mahout; Clustering; Health care.
国家哲学社会科学文献中心版权所有