logo
ResearchBunny Logo
Dissecting heritability, environmental risk, and air pollution causal effects using > 50 million individuals in MarketScan

Medicine and Health

Dissecting heritability, environmental risk, and air pollution causal effects using > 50 million individuals in MarketScan

D. Mcguire, H. Markus, et al.

Discover groundbreaking research by Daniel McGuire and colleagues at Penn State College of Medicine, revealing how genetics and environmental factors converge to influence disease risk. This study leverages a vast dataset to provide insights into the causal effects of air pollution on a range of diseases, showcasing innovative modeling strategies that can redefine our understanding of health.

00:00
00:00
~3 min • Beginner • English
Abstract
Large national-level electronic health record (EHR) datasets offer new opportunities for disentangling the role of genes and environment through deep phenotype information and approximate pedigree structures. Here we use the approximate geographical locations of patients as a proxy for spatially correlated community-level environmental risk factors. We develop a spatial mixed linear effect (SMILE) model that incorporates both genetics and environmental contribution. We extract EHR and geographical locations from 257,620 nuclear families and compile 1083 disease outcome measurements from the MarketScan dataset. We augment the EHR with publicly available environmental data, including levels of particulate matter 2.5 (PM₂.₅), nitrogen dioxide (NO₂), climate, and sociodemographic data. We refine the estimates of genetic heritability and quantify community-level environmental contributions. We also use wind speed and direction as instrumental variables to assess the causal effects of air pollution. In total, we find PM₂.₅ or NO₂ have statistically significant causal effects on 135 diseases, including respiratory, musculoskeletal, digestive, metabolic, and sleep disorders, where PM₂.₅ and NO₂ tend to affect biologically distinct disease categories. These analyses showcase several robust strategies for jointly modeling genetic and environmental effects on disease risk using large EHR datasets and will benefit upcoming biobank studies in the era of precision medicine.
Publisher
Nature Communications
Published On
Jun 25, 2024
Authors
Daniel McGuire, Havell Markus, Lina Yang, Jingyu Xu, Austin Montgomery, Arthur Berg, Qunhua Li, Laura Carrel, Dajiang J. Liu, Bibo Jiang
Tags
genetics
environment
disease risk
air pollution
EHR dataset
causal effects
heredity
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny