logo
ResearchBunny Logo
A machine learning approach predicts future risk to suicidal ideation from social media data

Medicine and Health

A machine learning approach predicts future risk to suicidal ideation from social media data

A. Roy, K. Nikolitch, et al.

This groundbreaking study introduces SAIPH, a state-of-the-art algorithm designed to predict suicidal ideation risk using Twitter data. Conducted by a team of experts including Arunima Roy and Zachary A. Kaminsky, the research promises insights into suicide risk behaviors, offering a potential clinical decision tool for screening and monitoring.

00:00
00:00
~3 min • Beginner • English
Abstract
Machine learning analysis of social media data represents a promising way to capture longitudinal environmental influences contributing to individual risk for suicidal thoughts and behaviors. Our objective was to generate an algorithm termed "Suicide Artificial Intelligence Prediction Heuristic (SAIPH)" capable of predicting future risk to suicidal thought by analyzing publicly available Twitter data. We trained a series of neural networks on Twitter data queried against suicide associated psychological constructs including burden, stress, loneliness, hopelessness, insomnia, depression, and anxiety. Using 512,526 tweets from N = 283 suicidal ideation (SI) cases and 3,518,494 tweets from 2655 controls, we then trained a random forest model using neural network outputs to predict binary SI status. The model predicted N = 830 SI events derived from an independent set of 277 suicidal ideators relative to N = 3159 control events in all non-SI individuals with an AUC of 0.88 (95% CI 0.86–0.90). Using an alternative approach, our model generates temporal prediction of risk such that peak occurrences above an individual specific threshold denote a ~7 fold increased risk for SI within the following 10 days (OR = 6.7 ± 1.1, P = 9 × 10⁻⁷¹). We validated our model using regionally obtained Twitter data and observed significant associations of algorithm SI scores with county-wide suicide death rates across 16 days in August and in October, 2019, most significantly in younger individuals. Algorithmic approaches like SAIPH have the potential to identify individual future SI risk and could be easily adapted as clinical decision tools aiding suicide screening and risk monitoring using available technologies.
Publisher
npj Digital Medicine
Published On
May 26, 2020
Authors
Arunima Roy, Katerina Nikolitch, Rachel McGinn, Safiya Jinah, William Klement, Zachary A. Kaminsky
Tags
suicidal ideation
Twitter data
algorithm
neural networks
mental health
risk prediction
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny