Medicine and Health

Sounds of COVID-19: exploring realistic performance of audio-based digital testing

J. Han, T. Xia, et al.

This groundbreaking research conducted by Jing Han, Tong Xia, and their colleagues investigates the effectiveness of audio-based methods for detecting COVID-19. With a unique dataset comprised of cough, breath, and voice samples, they reveal the realistic performance of these testing approaches, emphasizing the need for robust evaluation strategies to avoid misleading results.... show more

Abstract

To identify Coronavirus disease (COVID-19) cases efficiently, affordably, and at scale, recent work has shown how audio (including cough, breathing and voice) based approaches can be used for testing. However, there is a lack of exploration of how biases and methodological decisions impact these tools’ performance in practice. In this paper, we explore the realistic performance of audio-based digital testing of COVID-19. To investigate this, we collected a large crowdsourced respiratory audio dataset through a mobile app, alongside symptoms and COVID-19 test results. Within the collected dataset, we selected 5240 samples from 2478 English-speaking participants and split them into participant-independent sets for model development and validation. In addition to controlling the language, we also balanced demographics for model training to avoid potential acoustic bias. We used these audio samples to construct an audio-based COVID-19 prediction model. The unbiased model took features extracted from breathing, coughs and voice signals as predictors and yielded an AUC-ROC of 0.71 (95% CI: 0.65-0.77). We further explored several scenarios with different types of unbalanced data distributions to demonstrate how biases and participant splits affect the performance. With these different, but less appropriate, evaluation strategies, the performance could be overestimated, reaching an AUC up to 0.90 (95% CI: 0.85-0.95) in some circumstances. We found that an unrealistic experimental setting can result in misleading, sometimes over-optimistic, performance. Instead, we reported complete and reliable results on crowd-sourced data, which would allow medical professionals and policy makers to accurately assess the value of this technology and facilitate its deployment.

Publisher

npj Digital Medicine

Published On

Jan 28, 2022

Authors

Jing Han, Tong Xia, Dimitris Spathis, Erika Bondareva, Chloë Brown, Jagmohan Chauhan, Ting Dang, Andreas Grammenos, Apinan Hasthanasombat, Andres Floto, Pietro Cicuta, Cecilia Mascolo

DOI

https://doi.org/10.1038/s41746-021-00553-x

Related Publications

Explore these studies to deepen your understanding of the subject.

Medicine and Health

Exploring the effectiveness of a COVID-19 contact tracing app using an agent-based model

J. Almagor and S. Picascia

Medicine and Health

The role of the mass vaccination programme in combating the COVID-19 pandemic: An LSTM-based analysis of COVID-19 confirmed cases

S. Hansun, V. Charles, et al.

Health and Fitness

Supporting Weight Management during COVID-19 (SWIM-C): twelve-month follow-up of a randomised controlled trial of a web-based, ACT-based, guided self-help intervention

J. Mueller, R. Richards, et al.

Business

Digital green value co-creation behavior, digital green network embedding and digital green innovation performance: moderating effects of digital green network fragmentation

S. Yin and Y. Zhao

Listen, Learn & Level Up

Over 10,000 hours of research content in 25+ fields, available in 12+ languages.

No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.

listen to research audio papers with researchbunny