Psychologynpj Digital Medicine

Assessing the accuracy of automatic speech recognition for psychotherapy

A. S. Miner, A. Haque, et al.

This study, conducted by a team of experts including Adam S. Miner and Albert Haque, explores the potential of a HIPAA-compliant automatic speech recognition system to enhance psychotherapy audio transcription. While demonstrating promising results in interpreting depression-related utterances, the system's accuracy reveals further development is needed before it can guarantee individual safety monitoring.... show more

General Summary Metrics

Abstract

Accurate transcription of audio recordings in psychotherapy would improve therapy effectiveness, clinician training, and safety monitoring. Although automatic speech recognition software is commercially available, its accuracy in mental health settings has not been well described. It is unclear which metrics and thresholds are appropriate for different clinical use cases, which may range from population descriptions to individual safety monitoring. Here we show that automatic speech recognition is feasible in psychotherapy, but further improvements in accuracy are needed before widespread use. Our HIPAA-compliant automatic speech recognition system demonstrated a transcription word error rate of 25%. For depression-related utterances, sensitivity was 80% and positive predictive value was 83%. For clinician-identified harm-related sentences, the word error rate was 34%. These results suggest that automatic speech recognition may support understanding of language patterns and subgroup variation in existing treatments but may not be ready for individual-level safety surveillance.

Publisher

npj Digital Medicine

Published On

Jun 03, 2020

Authors

Adam S. Miner, Albert Haque, Jason A. Fries, Scott L. Fleming, Denise E. Wilfley, G. Terence Wilson, Arnold Milstein, Dan Jurafsky, Bruce A. Arnow, W. Stewart Agras, Li Fei-Fei, Nigam H. Shah

DOI

https://doi.org/10.1038/s41746-020-0285-8

Explore these studies to deepen your understanding

Adjacent work that informs or extends this paper's methodology and findings.

Medicine and Health

Novel statistical approach for assessing the persistence of the circadian rhythms of social activity from telephone call detail records in older adults

T. Aubourg, J. Demongeot, et al.

Environmental Studies and Forestry

The role of the IPCC in assessing actionable evidence for climate policymaking

H. Pollitt, J. Mercure, et al.

Psychology

Accuracy prompts are a replicable and generalizable approach for reducing the spread of misinformation

G. Pennycook and D. G. Rand

Biology

Assessing the effectiveness of a national protected area network for carnivore conservation

J. Terraube, J. V. Doninck, et al.

Listen, Learn & Level Up

Over 10,000 hours of research content in 25+ fields, available in 22+ languages.

No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.

listen to research audio papers with researchbunny