Article Information

  • Title: Improving Air Pollution Prediction Modelling Using Wrapper Feature Selection
  • Authors: Ul-Saufie, Ahmad Zia; Hamzan, Nurul Haziqah; Zahari, Zulaika
  • Journal: Sustainability
  • Print ISSN: 2071-1050
  • Year of publication: 2022
  • Volume: 14
  • Issue: 18
  • Pages: 1-16
  • DOI: 10.3390/su141811403
  • Language: English
  • Publisher: MDPI, Open Access Journal
  • Abstract: Feature selection is considered one of the essential steps in data pre-processing. However, previous studies on predicting PM10 concentration in Malaysia have been limited to statistical feature selection methods, and none has used machine-learning approaches. The objectives of this research are therefore to identify the influential variables of the PM10 prediction model using wrapper feature selection, to compare the prediction performance of different wrapper feature selection methods, and to predict the PM10 concentration for the next day. The research uses 10 years of daily pollutant concentration data, from 2009 to 2018, from two stations (Klang and Shah Alam), obtained from the Department of Environment Malaysia (DOE). Six wrapper methods (forward selection, backward elimination, stepwise, brute-force, weight-guided, and genetic algorithm evolution) and two predictive models, multiple linear regression (MLR) and artificial neural network (ANN), were implemented. The study found that brute-force is the dominant wrapper method among the best models for selecting important features for MLR. Moreover, compared to MLR, ANN provides more advantages regarding model accuracy and permits feature selection in predicting PM10. For next-day prediction, the RMSE in Klang is 20.728 and the AE is 15.69; in Shah Alam, the RMSE is 10.004 and the AE is 7.982. All of the prediction models for Klang and Shah Alam can be used to predict PM10 concentrations. The proposed model can serve as a tool for an early warning system that gives air quality information to local authorities so that they can formulate air-quality-improvement strategies.
  • Keywords: hybrid models; air pollution modelling; feature selection; wrapper method; artificial neural network
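
The wrapper idea summarised in the abstract (score each candidate feature subset by fitting the predictive model and measuring its error on held-out data) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say which software they used, the data here are synthetic rather than DOE measurements, and every name (`fit_mlr`, `score`, `brute_force`, `forward_selection`) is hypothetical. Only two of the six wrapper methods are shown, both wrapped around a plain multiple linear regression and scored by RMSE.

```python
import itertools
import math
import random


def fit_mlr(X, y):
    """Ordinary least squares with an intercept, via the normal equations."""
    rows = [[1.0] + list(r) for r in X]  # prepend the intercept column
    p = len(rows[0])
    # Augmented matrix [X'X | X'y]
    A = [[sum(r[i] * r[j] for r in rows) for j in range(p)]
         + [sum(r[i] * t for r, t in zip(rows, y))] for i in range(p)]
    # Gauss-Jordan elimination with partial pivoting
    for c in range(p):
        piv = max(range(c, p), key=lambda k: abs(A[k][c]))
        A[c], A[piv] = A[piv], A[c]
        for k in range(p):
            if k != c:
                f = A[k][c] / A[c][c]
                A[k] = [a - f * b for a, b in zip(A[k], A[c])]
    return [A[i][p] / A[i][i] for i in range(p)]


def rmse(beta, X, y):
    """Root mean square error of the fitted model on (X, y)."""
    preds = [beta[0] + sum(b * v for b, v in zip(beta[1:], r)) for r in X]
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(preds, y)) / len(y))


def score(subset, X_tr, y_tr, X_va, y_va):
    """Wrapper criterion: validation RMSE of an MLR fitted on `subset` only."""
    def take(X):
        return [[r[j] for j in subset] for r in X]
    return rmse(fit_mlr(take(X_tr), y_tr), take(X_va), y_va)


def brute_force(n_features, *data):
    """Evaluate every non-empty subset and keep the one with the lowest RMSE."""
    subsets = (s for k in range(1, n_features + 1)
               for s in itertools.combinations(range(n_features), k))
    return min(subsets, key=lambda s: score(s, *data))


def forward_selection(n_features, *data):
    """Greedily add the feature that lowers validation RMSE the most."""
    chosen, best = [], float("inf")
    while len(chosen) < n_features:
        cand = min((j for j in range(n_features) if j not in chosen),
                   key=lambda j: score(chosen + [j], *data))
        cand_score = score(chosen + [cand], *data)
        if cand_score >= best:  # stop when no candidate improves the score
            break
        chosen.append(cand)
        best = cand_score
    return sorted(chosen)


# Synthetic stand-in data (assumption): the target depends on features 0 and 2;
# feature 1 is pure noise.
rng = random.Random(0)
X = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(200)]
y = [2.0 * r[0] - 3.0 * r[2] + rng.gauss(0, 0.5) for r in X]
X_tr, y_tr, X_va, y_va = X[:150], y[:150], X[150:], y[150:]

best_bf = brute_force(3, X_tr, y_tr, X_va, y_va)
best_fs = forward_selection(3, X_tr, y_tr, X_va, y_va)
print("brute force:", best_bf)
print("forward selection:", best_fs)
```

Brute force fits one model per non-empty subset, so its cost grows as 2^n - 1 in the number of candidate features, whereas forward selection needs at most n(n + 1)/2 fits but can miss the best subset. That trade-off is one reason a study might compare several wrapper methods rather than rely on one.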