Modeling PM2.5 Urban Pollution Using Machine Learning and Selected Meteorological Parameters

Jan Kleine Deters, Rasa Zalakeviciute, Mario Gonzalez, Yves Rybarczyk*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

79 Citations (Scopus)
17 Downloads (Pure)


Outdoor air pollution costs millions of premature deaths annually, mostly due to anthropogenic fine particulate matter (or PM2.5). Quito, the capital city of Ecuador, is no exception in exceeding the healthy levels of pollution. In addition to the impact of urbanization, motorization, and rapid population growth, particulate pollution is modulated by meteorological factors and geophysical characteristics, which complicate the implementation of the most advanced models of weather forecast. Thus, this paper proposes a machine learning approach based on six years of meteorological and pollution data analyses to predict the concentrations of PM2.5 from wind (speed and direction) and precipitation levels. The results of the classification model show a high reliability in the classification of low (<10 μg/m3) versus high (>25 μg/m3) and low (<10 μg/m3) versus moderate (10-25 μg/m3) concentrations of PM2.5. A regression analysis suggests a better prediction of PM2.5 when the climatic conditions are getting more extreme (strong winds or high levels of precipitation). The high correlation between estimated and real data for a time series analysis during the wet season confirms this finding. The study demonstrates that the use of statistical models based on machine learning is relevant to predict PM2.5 concentrations from meteorological data.

Original languageEnglish
Article number5106045
JournalJournal of Electrical and Computer Engineering
Publication statusPublished - 18 Jun 2017
Externally publishedYes


Dive into the research topics of 'Modeling PM2.5 Urban Pollution Using Machine Learning and Selected Meteorological Parameters'. Together they form a unique fingerprint.

Cite this