Air pollution silently claims over 7 million lives every year.
It's one of the most urgent yet overlooked global health crises, and its effects aren't equally shared. At the heart of this issue is PM₂.₅, a microscopic pollutant that quietly inflames lungs, damages hearts, and shortens lives, especially in less-developed countries.
Air pollution is often framed as an environmental issue, but it's also a public health emergency. Among all pollutants, PM₂.₅ (particulate matter smaller than 2.5 micrometers) is particularly dangerous, it's small enough to bypass your body's defenses and settle deep in your lungs and bloodstream.
While everyone is affected by air pollution, our analysis shows that not all populations are equally vulnerable. Countries with lower development levels face disproportionately higher health burdens from the same levels of pollution.
PM₂.₅ is made up of ultrafine particles emitted from various sources. Because of its size, PM₂.₅ can penetrate deep into the lungs, cross into the bloodstream, and contribute to serious health conditions.
Understanding the global impact of air pollution on human health through comprehensive data analysis
Important Note: Our research focuses specifically on ambient (outdoor) PM₂.₅ air pollution, examining its long-term health impacts across different countries and development levels.
We applied several statistical and machine learning techniques to uncover patterns in the data
Research methodology and statistical analysis process
Annual averages at country level from WHO Global Health Observatory
Disability-Adjusted Life Years from the IHME Global Burden of Disease study
A measure of development that combines income, education, and fertility
We grouped our results by disease type: Cardiovascular disease, Stroke, Chronic respiratory disease, All-cause DALYs, and COVID-19 mortality
Our analysis reveals significant disparities in how air pollution affects different populations worldwide
In short: the same air pollution is more dangerous if you live in a less-developed country.
Socio-demographic development (SDI) consistently explains the largest share of variation in DALYs across countries.
Models that used 3–5 year rolling averages performed better than single-year exposure.
Long-term PM₂.₅ exposure showed a modest but consistent association with COVID-19 mortality in 2020.
Explore our complete collection of data visualizations and analysis figures
We're working on an interactive predictor tool to help visualize the health impacts of PM₂.₅
Our team is currently developing an interactive tool that will allow you to explore predicted health impacts based on PM₂.₅ levels and country development factors.
Everyone has a role to play in addressing air pollution and protecting public health
We all have a role to play when it comes to protecting our health. If you live in an area with high pollution, try to check local air quality updates regularly and limit outdoor activity when levels are high, especially vulnerable groups such as children, the elderly, and those with chronic conditions.
Small actions like using air purifiers indoors or choosing cleaner modes of transport can also make a difference.
We urge governments and environmental agencies to go beyond general air quality indices and develop a comprehensive Health-Adjusted Pollution Index, one that clearly links pollutant levels to measurable health risks.
This is about prioritizing health in every environmental policy.
Term | Meaning |
---|---|
PM₂.₅ | Fine particulate matter smaller than 2.5 micrometers in diameter. These airborne particles can penetrate deep into the lungs and enter the bloodstream, causing serious health effects. |
DALYs (Disability-Adjusted Life Years) | A measure of overall disease burden, calculated as the sum of years of life lost due to premature death and years lived with disability. |
SDI (Socio-Demographic Index) | A composite index combining income, education level, and fertility rate to represent a country's development level. |
Rolling Average | A multi-year average of data used to smooth out year-to-year fluctuations and capture longer-term trends. |
Lagged Average | An average of exposure values from previous years used to estimate delayed health effects of pollution. |
Linear Regression | A basic statistical method that models the relationship between a continuous outcome and one or more predictors by fitting a straight line. Used as a baseline to estimate average effects of PM₂.₅. |
Random Forest Regression | A machine learning method that uses an ensemble of decision trees to model complex and nonlinear relationships between variables. Helps detect patterns missed by linear models. |
Log-Transformed Regression | A version of linear regression where the dependent variable (e.g., COVID-19 deaths) is log-transformed to handle skewed data and stabilize variance. |
Interaction Models | Regression models that test whether the effect of one variable (e.g., PM₂.₅) changes depending on another (e.g., SDI), revealing effect modification. |
Quantile Regression | A statistical technique that estimates relationships at different points (quantiles) of the outcome distribution, such as the 25th or 75th percentile. Useful for understanding how effects differ across countries with low vs. high disease burden. |
Atmosfear is a student-led data science research project from the MIT Emerging Talent Program. We investigate the global health impacts of long-term exposure to air pollution, focusing on how socioeconomic disparities shape vulnerability.
Our goal is to bridge data science and public health to uncover hidden patterns, reveal inequities, and inform smarter global action.
"We want to humanize the numbers and make the invisible visible."
Interested in collaboration or media inquiries? Reach out to us.
Legal & Acknowledgements: This site is for research communication purposes only. Data used is sourced from IHME GBD and WHO. All findings are based on statistical modeling and should not be used for clinical or policy decisions without further validation.