Enterprise AI Analysis

High-Resolution NO2, O3, and PM, Estimation in Puglia: Leveraging AI and Explainability Techniques

This study developed an explainable machine learning model to predict daily surface concentrations of NO2, O3, PM10, and PM2.5 at a high spatial resolution (300m) in Apulia, Italy. Using ARPA station data (2019-2022) combined with meteorological, geographic, land-use, and temporal variables, an XGBoost model was trained. The model achieved an average R² of 0.71 (0.77 for NO2, 0.78 for O3, 0.67 for PM2.5, 0.64 for PM10) through repeated cross-validation. Explainable AI (XAI) methods, specifically SHAP, confirmed the model's physical consistency and provided insights into pollutant distribution drivers. This framework supports high-resolution exposure assessment for public health and environmental justice, aligning with new EU Air Quality Directives.

Schedule Your Strategy Session

Executive Impact: Key Performance Indicators

Leveraging advanced AI for environmental monitoring provides unprecedented accuracy and granular insights, driving more effective policy and health interventions.

0.71 Average R² across all pollutants

0.77 R² for NO2 prediction

0.78 R² for O3 prediction

300m Spatial Resolution

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Enterprise Process Flow

Satellite & WRF Data Collection

→

Spatial Preprocessing & Resampling

→

ARPA Ground Measurements Integration

→

Final Dataset Creation

→

XGBoost Model Training & Validation

→

Explainable AI (SHAP) Interpretation

The research outlines a robust data-fusion pipeline: starting from diverse data sources, rigorous preprocessing integrates them with ground-truth measurements. An XGBoost model then predicts pollutant concentrations, with SHAP providing critical interpretability for enterprise decision-making.

Discuss Your Implementation

XGBoost vs. Linear Model Performance (R²)

Pollutant	Linear Model (R²)	XGBoost Model (R²)
NO₂	0.39 ± 0.01	0.77 ± 0.01
O₃	0.50 ± 0.01	0.78 ± 0.01
PM₂.₅	0.18 ± 0.01	0.67 ± 0.01
PM₁₀	0.18 ± 0.01	0.64 ± 0.01

The XGBoost model consistently outperforms the linear model across all pollutants, demonstrating its superior ability to capture non-linear interactions and achieve higher predictive accuracy, especially for particulate matter. This validates the use of advanced ML for complex environmental data.

Explore Performance Metrics

Traffic & Industrial Emissions Dominant NO₂ Sources identified by SHAP

SHAP analysis confirmed that NO₂ concentrations are strongly influenced by land-use and anthropogenic predictors such as road network density, built-up/industrial fabric, and population density. Wind speed plays a crucial role in dilution and dispersion, reducing concentrations at higher intensities. This aligns with known atmospheric processes, validating the model's mechanistic understanding.

Understand XAI Insights

Ozone: Challenges in Temporal Transferability

Problem: Ozone (O₃) predictions showed a lower R² (0.53 daily) under Leave-One-Year-Out (LOYO) validation compared to random cross-validation (0.78). This highlights the difficulty in extrapolating O₃ behavior across different years due to interannual variability in meteorology and photochemistry.

Approach: The model still utilized Sentinel-5P O₃ column data in interaction with meteorological and land-use variables, demonstrating that even with low raw linear correlation, non-linear ML can extract useful signals. Temperature and emissivity (surface energy balance proxy) were strong positive drivers, consistent with photochemical formation.

Impact: While daily LOYO performance for O₃ is challenging, aggregation to monthly/annual means significantly improves R² (0.72), making the model reliable for long-term exposure assessment in epidemiological and policy studies, where such averages are often used. This suggests the model captures seasonal cycles robustly.

Despite challenges in daily O₃ temporal transferability, the model effectively captures seasonal trends and is reliable for long-term exposure assessments crucial for public health and policy.

Learn More

Advanced ROI Calculator

Estimate the potential annual cost savings and hours reclaimed by implementing enterprise AI solutions for environmental monitoring and data analysis, based on your organization's specifics and the insights from this research.

Your Industry

Number of Employees (Impacted by data tasks)

Average Weekly Hours on Manual Data Tasks

Average Hourly Cost of Labor ($)

Estimated Annual Savings

Annual Hours Reclaimed

Get Your Custom ROI Analysis

Your AI Implementation Roadmap

A structured approach to integrating high-resolution environmental AI for actionable insights.

Data Ingestion & Harmonization

Consolidate satellite, meteorological, land-use, and ground-truth data into a unified, clean dataset.

Model Training & Validation

Develop and train advanced ML models (e.g., XGBoost) using robust cross-validation and temporal transferability protocols.

Explainable AI Integration

Apply XAI techniques (SHAP) to interpret model predictions, ensuring transparency and scientific consistency for stakeholder buy-in.

Deployment & Monitoring

Implement the validated model in an operational environment for continuous, high-resolution air quality mapping and real-time monitoring.

Policy & Health Impact Assessment

Utilize high-resolution outputs for environmental justice analyses, public health studies, and compliance with regulatory directives.

Plan Your AI Journey

Ready to Transform Your Environmental Monitoring?

Our explainable AI solutions deliver unparalleled clarity and accuracy for critical environmental insights. Schedule a personalized consultation to discuss how these advanced techniques can benefit your enterprise.

Book Your Free Consultation

Enterprise AI Analysis

High-Resolution NO2, O3, and PM, Estimation in Puglia: Leveraging AI and Explainability Techniques

Executive Impact: Key Performance Indicators

Deep Analysis & Enterprise Applications

Enterprise Process Flow

XGBoost vs. Linear Model Performance (R²)

Ozone: Challenges in Temporal Transferability

Advanced ROI Calculator

Your AI Implementation Roadmap

Data Ingestion & Harmonization

Model Training & Validation

Explainable AI Integration

Deployment & Monitoring

Policy & Health Impact Assessment

Ready to Transform Your Environmental Monitoring?

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs

Select Time Zone

Big Competitive Advantage With Ai

Learn More

Our Demos

Research Center

Contact Us

1 888 985 3025

Solutions@OwnYourAi.com

Get Your Ai