logo
ResearchBunny Logo
Addressing gaps in data on drinking water quality through data integration and machine learning: evidence from Ethiopia

Environmental Studies and Forestry

Addressing gaps in data on drinking water quality through data integration and machine learning: evidence from Ethiopia

A. A. Ambel, R. Bain, et al.

This study, conducted by Alemayehu A. Ambel, Robert Bain, Tefera Bekele Degefu, Ayca Donmez, Richard Johnston, and Tom Slamyaker, tackles the pressing issue of poor drinking water quality data in Ethiopia. By employing advanced machine learning techniques on 2016 survey data, the research accurately predicts household water contamination, revealing potential pathways for improved public health initiatives.

00:00
00:00
~3 min • Beginner • English
Abstract
Monitoring access to safely managed drinking water services requires information on water quality. An increasing number of countries have integrated water quality testing in household surveys; however, it is not anticipated that such tests will be included in all future surveys. Using water testing data from the 2016 Ethiopia Socio-Economic Survey (ESS) we developed predictive models to identify households using contaminated (≥1 E. coli per 100 mL) drinking water sources based on common machine learning classification algorithms. These models were then applied to the 2013–2014 and 2018–2019 waves of the ESS that did not include water testing. The highest performing model achieved good accuracy (88.5%; 95% CI 86.3%, 90.6%) and discrimination (AUC 0.91; 95% CI 0.89, 0.94). The use of demographic, socioeconomic, and geospatial variables provided comparable results to that of the full featured model whereas a model based exclusively on water source type performed poorly. Drinking water quality at the point of collection can be predicted from demographic, socioeconomic, and geospatial variables that are often available in household surveys.
Publisher
npj Clean Water
Published On
Sep 08, 2023
Authors
Alemayehu A. Ambel, Robert Bain, Tefera Bekele Degefu, Ayca Donmez, Richard Johnston, Tom Slamyaker
Tags
drinking water quality
machine learning
E. coli
Ethiopia
predictive modeling
socioeconomic factors
water safety monitoring
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny