Who gets clean air, and who bears the burden of pollution?
Using EPA AQI data, we apply machine learning and data visualization to uncover hidden air quality risk patterns across U.S. counties.
How does air quality vary by region?
Visualize trends over time in AQI: identify improving or worsening regions
Compare exposure to specific pollutants (PM2.5, Ozone, NO₂) across counties
Identify areas at the intersection of high pollution and vulnerable populations
To uncover hidden patterns in air quality across U.S. counties and identify future air quality risks, we used K-means and XGBoost algorithms to perform cluster analysis on long-term air quality data and predict the most likely air quality conditions in different regions in 2026. We aim to reveal the underlying geographical, climatic, industrial, and natural factors behind these patterns and help policymakers and community leaders identify high-risk areas and prioritize air purification efforts.
Select a county to see its air quality cluster classification and 2026 risk prediction based on long-term pollution patterns.
⚠️ Some counties may not have a 2026 risk prediction due to insufficient data.