BMC Medical Research Methodology | |
Need of care in interpreting Google Trends-based COVID-19 infodemiological study results: potential risk of false-positivity | |
Kenichiro Sato1  Tatsushi Toda1  Tatsuo Mano1  Atsushi Iwata2  | |
[1] Department of Neurology, Graduate School of Medicine, University of Tokyo, Tokyo, Japan;Department of Neurology, Graduate School of Medicine, University of Tokyo, Tokyo, Japan;Department of Neurology, Tokyo Metropolitan Geriatric Medical Center Hospital, Tokyo, Japan; | |
关键词: COVID-19; Google Trends; Infodemiology; Vector autoregression model; Granger causality; | |
DOI : 10.1186/s12874-021-01338-2 | |
来源: Springer | |
【 摘 要 】
BackgroundGoogle Trends (GT) is being used as an epidemiological tool to study coronavirus disease (COVID-19) by identifying keywords in search trends that are predictive for the COVID-19 epidemiological burden. However, many of the earlier GT-based studies include potential statistical fallacies by measuring the correlation between non-stationary time sequences without adjusting for multiple comparisons or the confounding of media coverage, leading to concerns about the increased risk of obtaining false-positive results. In this study, we aimed to apply statistically more favorable methods to validate the earlier GT-based COVID-19 study results.MethodsWe extracted the relative GT search volume for keywords associated with COVID-19 symptoms, and evaluated their Granger-causality to weekly COVID-19 positivity in eight English-speaking countries and Japan. In addition, the impact of media coverage on keywords with significant Granger-causality was further evaluated using Japanese regional data.ResultsOur Granger causality-based approach largely decreased (by up to approximately one-third) the number of keywords identified as having a significant temporal relationship with the COVID-19 trend when compared to those identified by Pearson or Spearman’s rank correlation-based approach. “Sense of smell” and “loss of smell” were the most reliable GT keywords across all the evaluated countries; however, when adjusted with their media coverage, these keyword trends did not Granger-cause the COVID-19 positivity trends (in Japan).ConclusionsOur results suggest that some of the search keywords reported as candidate predictive measures in earlier GT-based COVID-19 studies may potentially be unreliable; therefore, caution is necessary when interpreting published GT-based study results.
【 授权许可】
CC BY
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202108125334337ZK.pdf | 1647KB | download |