Medicine and Healthnpj Women's Health

Quantifying disparities in intimate partner violence: a machine learning method to correct for underreporting

D. Shanmugam, K. Hou, et al.

Discover PURPLE, an innovative machine learning approach developed by Divya Shanmugam, Kaihua Hou, and Emma Pierson, that accurately estimates the prevalence of underreported health conditions like intimate partner violence. By addressing underreporting's challenges, PURPLE reveals critical insights into demographic disparities in health data, ultimately providing more plausible estimates than traditional methods.... show more

General Summary Metrics

Abstract

The first step towards reducing the pervasive disparities in women's health is to quantify them. Accurate estimates of the relative prevalence across groups—capturing, for example, that a condition affects Black women more frequently than white women—facilitate effective and equitable health policy that prioritizes groups who are disproportionately affected by a condition. However, it is difficult to estimate relative prevalence when a health condition is underreported, as many women's health conditions are. In this work, we present PURPLE, a method for accurately estimating the relative prevalence of underreported health conditions which builds upon the literature in positive unlabeled learning. We show that under a commonly made assumption—that the probability of having a health condition given a set of symptoms remains constant across groups—we can recover the relative prevalence, even without restrictive assumptions commonly made in positive unlabeled learning and even if it is impossible to recover the absolute prevalence. We conduct experiments on synthetic and real health data which demonstrate PURPLE's ability to recover the relative prevalence more accurately than do previous methods. We then use PURPLE to quantify the relative prevalence of intimate partner violence (IPV) in two large emergency department datasets. We find higher prevalences of IPV among patients who are on Medicaid, not legally married, and non-white, and among patients who live in lower-income zip codes or in metropolitan counties. We show that correcting for underreporting is important to accurately quantify these disparities and that failing to do so yields less plausible estimates. Our method is broadly applicable to underreported conditions in women's health, as well as to gender biases beyond healthcare.

Publisher

npj Women's Health

Published On

May 15, 2024

Authors

Divya Shanmugam, Kaihua Hou, Emma Pierson

DOI

https://doi.org/10.1038/s44294-024-00011-5

Explore these studies to deepen your understanding

Adjacent work that informs or extends this paper's methodology and findings.

Medicine and Health

Pre-deployment risk factors for PTSD in active-duty personnel deployed to Afghanistan: a machine-learning approach for analyzing multivariate predictors

K. Schultebraucks, M. Qian, et al.

Medicine and Health

HIDDEN: a machine learning method for detection of disease-relevant populations in case-control single-cell transcriptomics data

A. Goeva, M. Dolan, et al.

Computer Science

Using the interest theory of rights and Hohfeldian taxonomy to address a gap in machine learning methods for legal document analysis

A. Izzidien

Earth Sciences

A machine learning paradigm for necessary observations to reduce uncertainties in aerosol climate forcing

J. Redemann and L. Gao

Listen, Learn & Level Up

Over 10,000 hours of research content in 25+ fields, available in 22+ languages.

No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.

listen to research audio papers with researchbunny