logo
ResearchBunny Logo
Digital Disease Surveillance for Emerging Infectious Diseases: An Early Warning System Using the Internet and Social Media Data for COVID-19 Forecasting in Canada

Medicine and Health

Digital Disease Surveillance for Emerging Infectious Diseases: An Early Warning System Using the Internet and Social Media Data for COVID-19 Forecasting in Canada

Y. Yang, S. Tsao, et al.

Dive into this compelling study by Yang Yang, Shu-Feng Tsao, Mohammad A Basri, Helen H Chen, and Zahid A Butt, exploring how internet search trends and social media data can act as early warning signals for COVID-19 surveillance in Canada. Discover how symptom searches and social media hashtags correlated with case incidences and the innovative forecasting techniques employed!

00:00
00:00
~3 min • Beginner • English
Abstract
Background: Emerging Infectious Diseases (EID) are a significant threat to population health globally. We aimed to examine the relationship between internet search engine queries and social media data on COVID-19 and determine if they can predict COVID-19 cases in Canada. Methods: We analyzed Google Trends (GT) and Twitter data from 1/1/2020 to 3/31/2020 in Canada and used various signal-processing techniques to remove noise from the data. Data on COVID-19 cases was obtained from the COVID-19 Canada Open Data Working Group. We conducted time-lagged cross-correlation analyses and developed the long short-term memory model for forecasting daily COVID-19 cases. Results: Among symptom keywords, "cough," "runny nose," and "anosmia" were strong signals with high cross-correlation coefficients >0.8, showing that searching for these terms on GT correlated with the incidence of COVID-19 and peaked 9, 11, and 3 days earlier than the incidence peak, respectively. For symptoms- and COVID-related Tweet counts, the cross-correlations of Tweet signals and daily cases were r = 0.868 at 11 days earlier and r = 0.840 at 10 days earlier, respectively. The LSTM forecasting model achieved the best performance (MSE = 124.78, R2 = 0.88, adjusted R2 = 0.87) using GT signals with cross-correlation coefficients >0.75. Combining GT and Tweet signals did not improve the model performance. Conclusion: Internet search engine queries and social media data can be used as early warning signals for creating a real-time surveillance system for COVID-19 forecasting, but challenges remain in modelling.
Publisher
This information was not provided in the source document.
Published On
Jan 01, 2023
Authors
Yang Yang, Shu-Feng Tsao, Mohammad A Basri, Helen H Chen, Zahid A Butt
Tags
COVID-19
Google Trends
social media
forecasting
real-time surveillance
data analysis
LSTM model
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny