Abstract
Data accuracy is essential for scientific research and policy development. The National Violent Death Reporting System (NVDRS) data is widely used for discovering patterns and causal factors of death. Recent studies have suggested that annotation inconsistencies exist within the NVDRS and may result in erroneous suicide-circumstance attributions. We present an empirical Natural Language Processing (NLP) approach to detect annotation inconsistencies and adopt a cross-validation-like paradigm to identify possible label errors. We analyzed 267,804 suicide death incidents between 2003 and 2020 from the NVDRS. We measured annotation inconsistency by the degree of change in the F-1 score. Our results show that incorporating the target state's data into training the suicide-circumstance classifier increases the F-1 score by 5.4% on the target state's test set and decreases it by 1.1% on the other states' test sets. To conclude, we present an NLP framework to detect annotation inconsistencies, demonstrate the effectiveness of identifying and rectifying possible label errors, and propose recommendations to improve the coding consistency of human annotators.
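The comparison described above can be illustrated with a minimal sketch (not the authors' released code): train a circumstance classifier with and without the target state's data and compare the F-1 score on the target state's held-out test split. The column names ("narrative", "label", "state"), the model choice, and the split sizes here are illustrative assumptions.

```python
# Minimal sketch of the with/without-target-state F-1 comparison.
# Assumptions: a DataFrame with 'narrative' (free text), 'label' (circumstance
# annotation), and 'state' columns; a TF-IDF + logistic regression classifier.
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline


def f1_on_target(df: pd.DataFrame, target_state: str, include_target: bool) -> float:
    """Train on other states (optionally plus the target state's training split)
    and report the F-1 score on the target state's held-out test split."""
    target = df[df["state"] == target_state]
    others = df[df["state"] != target_state]

    # Fixed random_state keeps the target test split identical across both runs.
    tgt_train, tgt_test = train_test_split(target, test_size=0.2, random_state=0)
    train = pd.concat([others, tgt_train]) if include_target else others

    clf = make_pipeline(TfidfVectorizer(min_df=2), LogisticRegression(max_iter=1000))
    clf.fit(train["narrative"], train["label"])
    preds = clf.predict(tgt_test["narrative"])
    return f1_score(tgt_test["label"], preds, average="macro")


# Example usage (df would hold NVDRS-style incident narratives and labels):
# delta = f1_on_target(df, "TX", include_target=True) - f1_on_target(df, "TX", include_target=False)
# A large positive delta suggests the target state's annotations diverge from other states'.
```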
Publisher
Communications Medicine
Published On
Oct 14, 2024
Authors
Song Wang, Yiliang Zhou, Ziqiang Han, Cui Tao, Yunyu Xiao, Ying Ding, Joydeep Ghosh, Yifan Peng
Tags
data accuracy
Natural Language Processing
annotation inconsistencies
suicide-circumstance classifier
F-1 score
NVDRS
label errors