
Correlations between Web Searches and COVID-19 Epidemiological Indicators in Brazil

Abstract:

COVID-19 spread rapidly across the world in an unprecedented outbreak, with a massive number of infections and fatalities. The pandemic was heavily discussed and searched on the internet, generating large amounts of related data and raising the possibility of forecasting coronavirus indicators from internet data. In this study, Google Trends statistics for 124 selected search terms related to the pandemic were used to identify which keywords had the best lagged Spearman correlations with epidemiological indicators, and to build a forecasting model. Keywords related to coronavirus testing, along with some others such as “I have contracted covid”, showed high correlations (≥0.7) at lags of a few weeks (≤4 weeks). In addition, an ARIMAX model using those keywords gave promising results in predicting whether epidemiological indicators would increase or decrease, although it could not predict their exact values. Thus, Google Trends data may be useful for predicting coronavirus outbreaks a few weeks before they happen, and may serve as an auxiliary tool for monitoring and forecasting the disease in Brazil.

Keywords:
Google Trends; infodemiology; epidemiological predictions; digital health

HIGHLIGHTS

  • Google Trends data could be useful for predicting COVID-19.

  • High correlations (≥0.7) were found between keywords and indicators when using a lag.

  • ARIMAX model could help predict COVID-19 cases and deaths per week.
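
The lagged Spearman correlation mentioned in the abstract and highlights can be illustrated with a minimal sketch. This is not the authors' code: the function names (`average_ranks`, `spearman`, `lagged_spearman`), the `max_lag` parameter, and the weekly numbers are assumptions made up for illustration, and the pairing rule (search interest in week t against the indicator in week t + lag) is one plausible reading of "correlation with a lag".

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation, computed as the Pearson correlation of ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        return float("nan")  # undefined when one series is constant
    return cov / (var_x * var_y) ** 0.5


def lagged_spearman(searches, indicator, max_lag=4):
    """Correlate searches in week t with the indicator in week t + lag.

    Both series are assumed to be weekly and of equal length.
    """
    results = {}
    for lag in range(max_lag + 1):
        x = searches[: len(searches) - lag]
        y = indicator[lag:]
        results[lag] = spearman(x, y)
    return results


# Hypothetical weekly values (not data from the study): the indicator
# repeats the search series two weeks later.
searches = [5, 9, 20, 14, 30, 22, 41, 35, 50, 44, 60, 52]
indicator = [0, 0] + searches[:-2]

correlations = lagged_spearman(searches, indicator)
best_lag = max(correlations, key=correlations.get)
print(best_lag, correlations[best_lag])
```

In this toy series the correlation peaks at a lag of two weeks, because the indicator was constructed as a two-week-delayed copy of the searches. On real data, a keyword would be of interest when its best lagged correlation is high (the study uses ≥0.7) at a lag of a few weeks.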

Instituto de Tecnologia do Paraná - Tecpar, Rua Prof. Algacyr Munhoz Mader, 3775 - CIC, 81350-010 Curitiba - PR - Brazil, Tel.: +55 41 3316-3052/3054, Fax: +55 41 3346-2872
E-mail: babt@tecpar.br