Article information

  • Title: Applying Statistical Machine Learning Methods to Analysis Differences in the Severity Level of COVID-19 among Countries
  • Authors: Wen Yin; Chenchen Pan; Nanyi Deng
  • Journal: Journal of Software
  • Print ISSN: 1796-217X
  • Year of publication: 2021
  • Volume: 16
  • Issue: 5
  • Pages: 219-234
  • DOI: 10.17706/jsw.16.5.219-234
  • Language: English
  • Publisher: Academy Publisher
  • Abstract: The COVID-19 pandemic has had a significant negative impact on countries around the world, and there appears to be an observable difference in severity among nations. This study aims to provide insight into the roles that social and economic factors played in this variation. By investigating potential patterns through exploratory data analysis, and then constructing models with several popular machine learning techniques, we examine the validity of the underlying assumptions and identify potential limitations. Total deaths per million population is used as the dependent variable, with a log transformation applied to reduce the influence of outliers. The dataset provides a set of factors such as life expectancy, unemployment rate and population. After removing and transforming outliers, various machine learning methods are fitted with cross-validation, and the best model is selected by predefined metrics: root-mean-squared error (RMSE) and mean absolute error (MAE). The results show that the Gradient Boosting Machine (GBM) technique achieves the lowest RMSE and MAE. The RMSE and MAE values indicate no overfitting issues, and the GBM algorithm captures the most influential factors, such as life expectancy, healthcare expense per Gross Domestic Product (GDP) and GDP per capita, which are critical explanatory variables for predicting total deaths per million population.
  • Keywords: COVID-19; machine learning; social and economic factors
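
The evaluation loop described in the abstract (log-transform the target, cross-validate, compare RMSE and MAE) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' code: the figures are made up, `log1p` stands in for the unspecified log transformation, the folds are plain contiguous splits, and a predict-the-training-mean baseline replaces the GBM and other models the paper actually fits. The helper names are hypothetical.

```python
import math


def rmse(actual, predicted):
    """Root-mean-squared error between two equal-length sequences."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


def mae(actual, predicted):
    """Mean absolute error between two equal-length sequences."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def k_fold_indices(n, k):
    """Split indices 0..n-1 into k contiguous folds of near-equal size."""
    folds, start = [], 0
    for i in range(k):
        size = n // k + (1 if i < n % k else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def cross_validate_mean_baseline(values, k=3):
    """Score a mean-of-training-data baseline on log1p-transformed values.

    Returns one (RMSE, MAE) pair per fold, both on the log scale.
    """
    logged = [math.log1p(v) for v in values]  # assumption: log(1 + x)
    scores = []
    for fold in k_fold_indices(len(logged), k):
        held_out = set(fold)
        train = [y for i, y in enumerate(logged) if i not in held_out]
        test = [logged[i] for i in fold]
        prediction = [sum(train) / len(train)] * len(test)
        scores.append((rmse(test, prediction), mae(test, prediction)))
    return scores


# Made-up deaths-per-million figures, for illustration only (not from the paper).
deaths_per_million = [3.0, 45.0, 120.0, 800.0, 15.0, 1500.0]
for fold_rmse, fold_mae in cross_validate_mean_baseline(deaths_per_million):
    print(f"RMSE={fold_rmse:.3f}  MAE={fold_mae:.3f}")
```

Because RMSE squares each residual before averaging, it is never smaller than MAE on the same fold, and a wide gap between the two points to a few large errors. Comparing both metrics between training and held-out folds is one common way to check for overfitting.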