We developed an integrated approach coupling a chemical transport model (CTM) with machine learning (ML) techniques to produce high spatial resolution NO2 and O3 daily concentration fields over Italy. Three years (2013-2015) simulations, at a spatial resolution of 5 km, performed by the Flexible Air quality RegionalModel (FARM) were used as predictors, together with other spatial-temporal data, such as population, land-use, surface greenness and road networks, by a ML Random Forest (MLRF) algorithm to produce daily concentrations at higher resolution (1 km) over the national territory. The evaluation of the adopted integrated approach was based on NO2 and O3 observations available from530 and 293 monitoring stations across Italy, respectively. A good performance for NO2 and excellent results for O3 were obtained from the application of the CTM; as for NO2, the levels at urban traffic stations were not captured by the simulations due to the adopted horizontal resolution and related emissions uncertainties. Performance improvements were achieved with ML-RF predictions, reducing NO2 underestimation (near zero fractional bias results) and better capturing spatial contrasts. The results obtained in this work were used to support the national exposure assessment and environmental epidemiology studies planned in the BEEP (Big data in Environmental and occupational Epidemiology) project and confirm the potential of machine learning methods to adequately predict air pollutant levels at high spatial and temporal resolutions.

Spatial-temporal prediction of ambient nitrogen dioxide and ozone levels over Italy using a Random Forest model for population exposure assessment

Viegi G
2021

Abstract

We developed an integrated approach coupling a chemical transport model (CTM) with machine learning (ML) techniques to produce high spatial resolution NO2 and O3 daily concentration fields over Italy. Three years (2013-2015) simulations, at a spatial resolution of 5 km, performed by the Flexible Air quality RegionalModel (FARM) were used as predictors, together with other spatial-temporal data, such as population, land-use, surface greenness and road networks, by a ML Random Forest (MLRF) algorithm to produce daily concentrations at higher resolution (1 km) over the national territory. The evaluation of the adopted integrated approach was based on NO2 and O3 observations available from530 and 293 monitoring stations across Italy, respectively. A good performance for NO2 and excellent results for O3 were obtained from the application of the CTM; as for NO2, the levels at urban traffic stations were not captured by the simulations due to the adopted horizontal resolution and related emissions uncertainties. Performance improvements were achieved with ML-RF predictions, reducing NO2 underestimation (near zero fractional bias results) and better capturing spatial contrasts. The results obtained in this work were used to support the national exposure assessment and environmental epidemiology studies planned in the BEEP (Big data in Environmental and occupational Epidemiology) project and confirm the potential of machine learning methods to adequately predict air pollutant levels at high spatial and temporal resolutions.
2021
Istituto di Fisiologia Clinica - IFC
Istituto per la Ricerca e l'Innovazione Biomedica -IRIB
Chemical transport model
FARM
WRF
Machine learning
Random Forest
Nitrogen dioxide
Ozone
File in questo prodotto:
File Dimensione Formato  
prod_446719-doc_160664.pdf

solo utenti autorizzati

Descrizione: Spatial-temporal prediction of ambient nitrogen dioxide and ozone levels over Italy using a Random Forest model for population exposure assessment
Tipologia: Documento in Post-print
Dimensione 2.94 MB
Formato Adobe PDF
2.94 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/426742
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 32
  • ???jsp.display-item.citation.isi??? ND
social impact