首页    期刊浏览 2025年04月27日 星期日
登录注册

文章基本信息

  • 标题:High Performance Machine Learning Models of Large Scale Air Pollution Data in Urban Area
  • 本地全文:下载
  • 作者:Snezhana G. Gocheva-Ilieva ; Atanas V. Ivanov ; Ioannis E. Livieris
  • 期刊名称:Cybernetics and Information Technologies
  • 印刷版ISSN:1311-9702
  • 电子版ISSN:1314-4081
  • 出版年度:2020
  • 卷号:20
  • 期号:6
  • 页码:49-60
  • DOI:10.2478/cait-2020-0060
  • 语种:English
  • 出版社:Bulgarian Academy of Science
  • 摘要:Preserving the air quality in urban areas is crucial for the health of thepopulation as well as for the environment. The availability of large volumes ofmeasurement data on the concentrations of air pollutants enables their analysis andmodelling to establish trends and dependencies in order to forecast and preventfuture pollution. This study proposes a new approach for modelling air pollutantsdata using the powerful machine learning method Random Forest (RF) and Auto-Regressive Integrated Moving Average (ARIMA) methodology. Initially, a RF modelof the pollutant is built and analysed in relation to the meteorological variables. Thismodel is then corrected through subsequent modelling of its residuals using theunivariate ARIMA. The approach is demonstrated for hourly data on seven airpollutants (O 3 , NOx, NO, NO 2 , CO, SO 2 , PM 10 ) in the town of Dimitrovgrad,Bulgaria over 9 years and 3 months. Six meteorological and three time variables areused as predictors. High-performance models are obtained explaining the data withR 2 = 90%-98%.
  • 关键词:Machine learning; Random Forest; Autoregressive integrated moving average; error correction; time series; forecasting.
国家哲学社会科学文献中心版权所有