Acessibilidade / Reportar erro

Mathematical Modeling and Georeferenced Forecasting for the COVID-19 at the State of RS, Brazil

ABSTRACT

In this contribution, we present a predictive tool developed to help in the management of the evolution of the COVID-19 pandemic situation in Rio Grande do Sul (RS)-Brazil. This tool is the result of georeferenced data analysis, mathematical modeling, and parameter calibration for the dynamics of a SIR-type model defined on a spatial structure that allows distinct subpopulations to interact, similar to the controlled distancing (A l for l=1, ···, 21)) groupings proposed by the RS government and public health authorities. The predictive analysis, updated biweekly, provides three distinct scenarios per month (milder, average, and severe) and is made available as WebSIGs (Geographic Information System-GIS). The forecast of the average scenario for each A l group is the result of a simulation of the proposed SIR-type dynamics with calibrated parameters derived from an augmented Lagrangian maximum a posteriori estimation and data on the number of infected cases made available by the RS Health Secretariat. The milder and severe scenarios are obtained from the average scenario, with changes in the contagion rates of each A l group. When compared to the number of infections reported in each A l group, the modeling predictions for a biweekly time window (the first two weeks) were quite satisfactory, with errors ranging from 0% to 5.13%, gradually increasing over time. Therefore, we suggest a biweekly re-calibration of the parameters and corresponding forecasts as a wise strategy.

Keywords:
COVID-19 forecast; SIR-type model; parameter calibration; georeferenced data

1 INTRODUCTION

Since the first reported case of COVID-19, there has been a great mobilization of the scientific community to understand the dynamics of SARS-CoV-2 (the virus that causes COVID-191919 World Health Organization. WHO Coronavirus Disease (COVID-19) Dashboard (2020). URL https://covid19.who.int.2020/7/29.
https://covid19.who.int.2020/7/29...
) and how the virus spreads. In this scenario, the increased demand for drugs, and the rapid development of vaccines, become the focus of most researchers. While known and tested drugs have shown little or no efficacy, and mass vaccination of the entire world population is unthinkable in the short term due to the time required for manufacturing, distribution, and application, non-pharmacological procedures have remained an important alternative to tightening pandemic effects. In this context, mathematical modeling has been an indispensable tool to formulate forecasts and to give assistance to public leaders in the making of public policies to slow the spread of the disease, e.g.,11 V.V.L. Albani, R.M. Velho & J.P. Zubelli. Estimating, monitoring, and forecasting COVID-19 epidemics: a spatiotemporal approach applied to NYC data. Sci Rep, 11 (2021), 9089.), (1111 J. Govaert. Coronavirus: UK changes course amid death toll fears. (2020). URL https://www.bbc.com/news/health-51915302.
https://www.bbc.com/news/health-51915302...
), (1414 M.J. Lazo & A. De Cezaro. Why can we observe a plateau even in an out of control epidemic outbreak? A SEIR model with the interaction of n distinct populations for Covid-19 in Brazil (2020). and references therein.

On the other hand, it was also clear that science is not enough to contest the effects of the pandemic without an efficient way of spreading the relevant information. In this sense, the use of geotechnologies has also proved to be of great value, especially Digital Cartography and Geographic Information Systems (GIS) as tools for georeferencing and spatial representation of the occurrences of COVID-19. Several initiatives have used digital maps and cybercartography for the cartographic organization of data, didactically illustrating the density of the epidemic and, mainly, supporting decision-making in the effective application of control devices to reduce the transmission of the virus22 M.d.G. Albuquerque, J.R.d. Santos, A.F. Silveira, D.L. Bornia Junior, R.B. Nunes, T.B.R. Gandra, A.L.C.d. Silva, G.B.d. Miranda & A.R. Trevisol. Influence of socio-economic indicators and territorial networks at the spatiotemporal spread dynamics of Covid-19 in Brazil. Sociedade & Natureza, 33 (2021). doi: 10.14393/SN-v33-2021-59688. URL https://seer.ufu.br/index.php/sociedadenatureza/article/view/59688.
https://seer.ufu.br/index.php/sociedaden...
), (99 D.P. de Paula, D.H. Miranda de Medeiros, E. Lacerda Barros, R.G. Pinheiro Guerra, J.d.O. Santos, J.S. Queiroz Lima & R.M. Leite Monteiro. Diffusion of Covid-19 in the Northern Metropolis in Northeast Brazil: territorial dynamics and risks associated with Social Vulnerability. Sociedade & Natureza, 32 (2020), 639-656. doi: 10.14393/SN-v32-2020-56098. URL https://seer.ufu.br/index.php/sociedadenatureza/article/view/56098.
https://seer.ufu.br/index.php/sociedaden...
. Such initiatives, both in the academic sphere and in territorial management, follow an extensive tradition of Health Cartography as a technical-scientific instrument to support epidemiological analysis.

The present study aims to add to the understanding, prediction, dissemination to society, and support for decision-making regarding the dynamics and spread of the COVID-19 pandemic by proposing a spatially structured SIR model integrated into georeferencing tools. We present a predictive tool developed to help in the management of the evolution of the COVID-19 pandemic in the Rio Grande do Sul (RS)-Brazil. The work consists of three steps as follows: 1) Organization and treatment of the reported data of infected people, in a georeferenced manner, which serves as an entry in mathematical modeling (initial conditions and data for parameters calibration for the proposed SIR type model), as detailed in Section 2; 2) Mathematical modeling and parameter calibration of COVID-19 dynamics by a SIR model with interacting multiple populations, as proposed in Sections 3 and 4; 3) Organization of COVID-19 spread scenarios in WebSIGs set-up as described in Section 4.

2 ORGANIZATION AND DATA OBSERVATION

Online digital mapping, also known as cybercartography, is an informational solution for publishing georeferenced data, making it more accessible to other users1010 R. Ferreira. Atlas, Cibercartografia e Neogeografia. Revista Portuguesa de Estudos Regionais, (2016), 31-44.. The methodology consists of three stages: the organization and processing of data provided by the State Health Department; mathematical modeling; and the creation of the WebSIG (Figure 1).

Figure 1
Flowchart representative of the stages involved in the work.

The creation of the geographic information plan requires data only from official sources. The 2019 vector base in the shapefile format provided by the Brazilian Institute of Geography and Statistics (IBGE) was admitted as a municipal area1 1 Available in https://www.ibge.gov.br/ . The regionalization of the modeling, in the form of regional blocks or clusters, had as reference the information provided by the portal of planning, state budget, and health management of RS, which were distributed into twenty-one clusters provided by the controlled distancing model of the state, corresponding to the 30 health regions (see [66 A. De Cezaro & et al. Georeferenced data from the COVID-19 pandemic for the state of Rio Grande do Sul: Reported positive cases and simulated forecasts. Latin American Data in Science, 1(2) (2021), 49-62., Fig. 2]).

To this end, the thirty Health Regions of RS, which subsidize actions in the sphere of planning and infrastructure of the State’s health, served as the basis for the construction of the twenty-one groups provided for by the Controlled Distancing Model of RS (Data control COVID-19 CORONAVÍRUS, 2020) and in this work the same criteria were adopted. According to the epidemiological action plan, which includes the classification flags of the degree of risk and the respective social distancing protocols, such groups were delimited using criteria of health and economic activity.

Each cluster is composed of one or more health regions that aggregate, respectively, a fraction of the 497 municipalities that make up the state of RS. In this study, the same special criteria proposed by the Controlled Distance Model of RS were adopted, facilitating access to the registry of reported infected people, data made available daily by the Painel Coronavirus-RS2 2 Available in https://ti.saude.rs.gov.br/covid19/ , as well as comparisons between the epidemiological actions and the results obtained by the modeling over the past few months. Because the data is cumulative for each day, a subtraction was performed with the count of the cases from the previous day to obtain the new cases for each day.

2.1 Organization and processing of geographic data: COVID-19 positive cases reported

The methodologies of organization, acquisition, and processing of data on the number of infected cases by Sars-Cov-2 officially reported by the Health Department of RS, available in electronic means, are presented in this work between the months of March 2020 and June 2021. In addition, we demonstrated how such a set of data is used to calibrate the parameters for a mathematical modeling of the spread of the disease in the 21 health groups of the state, which results in a set of predictive data for three distinct scenarios for the spread of COVID-19, in RS over a period of 28 days from the initial date of the data of positive cases reported. Such a set of predictive data can be used by health authorities as an auxiliary means in the decision-making for measures to control and combat the spread of COVID-19 and these are available for access through interactive maps WebSIGs with the spatial distribution of the projected cases. In this manuscript, we present a small sample of the collected and stored data as well as the projected data, the complete set of which is in the repository https://github.com/exactum-furg or as complementary material in66 A. De Cezaro & et al. Georeferenced data from the COVID-19 pandemic for the state of Rio Grande do Sul: Reported positive cases and simulated forecasts. Latin American Data in Science, 1(2) (2021), 49-62.. In a simplified procedure, the processes described above can be represented by the following flowchart:

First, we performed the creation process of the Geographic Information Plan of RS, which corresponds to the first step of the flowchart presented in Figure 1. IBGE 2021 data in the form of a shapefile from an automatic recovery system was used for this purpose. This information was used as the basis of the territorial and demographic division of the municipalities of RS and reorganization in the form of regional blocks or clusters, covering the 21 health groupings provided by the social-distancing plan developed by the State Government, based on the information provided by the Portal of Planning, Budget, and Health Management of RS. In [66 A. De Cezaro & et al. Georeferenced data from the COVID-19 pandemic for the state of Rio Grande do Sul: Reported positive cases and simulated forecasts. Latin American Data in Science, 1(2) (2021), 49-62., Fig. 2] it is possible to notice this geographical division.

Subsequently, the reported data of positive cases for COVID-19 from each of the 21 groupings was accessed daily. The positive case data accumulated is downloaded and systematized in an Excel spreadsheet in a georeferenced way. The georeferencing of the data allows us to link the characteristics of each of the 21 health groupings that are used as input data for mathematical modeling. Such characteristics of the health groups include the population, the area, the name of the group, the code of the regions that make-up each group and the number reported of infected cases (number of positive cases of COVID-19 reported). See the details in Table 1. According to these data, it is possible to calculate important variables for the study, such as: new cumulative cases of the week, population density, as well as the total number of infected cases reported for each group.

Table 1
Data organization according to the planning regions of the state of Rio Grande do Sul, t01=24/03/21 and t02=07/04/21.

The last step is the preparation of the acquired data of reported COVID-19 positivity cases for use in mathematical modeling. To perform it, a conversion factor/pixel to standardize the output spatial size of the cell (spatial resolution) of data rasters (vector format) generated by converting formats is used. For the standardization of this factor, we adopted the combined relationship between the centroids of the polygonal shape of each cluster and its maximum area, thus resulting in a conversion factor of 0.1, which results in a spatial resolution of 10×10 km cell/pixel. This resolution best matched the geometry of the health groups, preserving their original format1515 P. Longley. Geographical Information Systems: a renaissance of geodemographics for public service delivery. Progress in Human Geography, 29(1) (2005), 57-63. doi: 10.1191/0309132505ph528pr. URL https://doi.org/10.1191/0309132505ph528pr.
https://doi.org/10.1191/0309132505ph528p...
. This process allows the organization of the database of people infected by COVID-19 collected from the Coronavirus panel, for each of the 21 health clusters, available in table format and, by processing, feed files of the vector type (shapefiles) that are converted into arrays with coordinate registration (raster). Finally, the database is transformed into ASCII format for later use as initial conditions and data for mathematical modeling and parameter calibration.

It is worth noting that every day at 6 p.m., data on the number of reported positive COVID-19 infections is accessed and downloaded from the Coronavirus panel. Afterward, these are grouped by the weekly cumulative, to correct any issues related to the data reported at the weekends. As a result, for each new date defined as the basis, the number of cases will be presented week by week, beginning with the first week.

2.1.1 Spatial resolution

The choice of spatial resolution went through two important criteria for the generation of results: a) The size of the pixels in relation to the geometry (area) of the Health Regions, that is, whether the pixel quantity satisfactorily respects the contour of the Regions of Health, b) result processing time.

For the first criterion (pixels×geometry), it was evident that the 30 km resolution overgeneralized (vector simplification criteria) the polygons that form the health clusters, mischaracterizing them. Thus, the pixels of resolutions of 10 km and 5 km better represent the geometries of the clusters, that is, a generalization that does not deform the geometry of the clusters.

The last criterion takes into account the processing time of each scenario, with the difference between the 5 km resolutions and the 10 and 30 km resolutions being evident. While the 10 km resolution takes 4 min to process all the data from the projected cases, the 5 km resolution takes approximately 40 min, taking more computational time for processing. Whereas the modeling was performed using a notebook with Inter® Core™ i7-7500U CPU @ 2.70GHz x 4, with 7.6GiB of memory and Mesa Inter® HD Graphic 620 (KBL GT2) video graphics.

In view of the criteria, the 10 km resolution presented better results in general without significant loss of area or boundary deformation. This is responsible for good coefficients in the cross-analysis; it represents the geometric features of the health region clusters and a relatively low processing time for the amount of data analyzed.

3 MATHEMATICAL MODELING

The georeferenced forecasting for COVID-19 proposed here assumes that the constant population N is subdivided into a spatially structured network with a size corresponding to the size of the state of RS sub-divisions as described in Section 2. Indeed, this network is composed of 68×83 sites (regions). This network is organized so that each cluster is composed of a certain number of sites established in the data organization step described in Section 2. It should be noted that this number depends on the precision adopted in the rasterization process and on the grouping area. According to the Coupled Map Network (CMN) theory, e.g.1818 L. Rodrigues, M. Varriale, W. Godoy & D. Mistro. “Coupled Map Lattice Model for Insects and Spreadable Substances” (2014), 141-169 p. doi: 10.1007/978-3-319-06877-07.
https://doi.org/10.1007/978-3-319-06877-...
and references therein, each site x=(x i , x j ) of this network is represented by integer coordinates x i =is, x j =js, where s is the size of the site, for i∈{1, ···, 63} , j∈{1, ···, 83}.

The total population N i, j of each (i, j) site is normalized with respect to the population density and assumed constant, such that the total population in the lattice satisfies i=168j=183Ni,j=1. Such assumptions have implications for the magnitude of the model parameters (see Table 2). Furthermore, the model is considered as a SIR-type compartmental model, e.g.,1212 H.W. Hethcote. The mathematics of infectious diseases. SIAM review, 42(4) (2000), 599-653.), (1313 W. Kermack & A. Mckendrick. A Contribution to the Mathematical Theory of Epidemics. Proceedings of the Royal Society of London. Series A, Containing papers of a Mathematical and Physical Character, 115(772) (1927), 700-721.), (1414 M.J. Lazo & A. De Cezaro. Why can we observe a plateau even in an out of control epidemic outbreak? A SEIR model with the interaction of n distinct populations for Covid-19 in Brazil (2020). and references therein, where the total population of each site N i, j is divided into the susceptible fractions S i, j (t), infected I i, j (t) and removed R i, j (t), at each time instant t≥0, following a compartmental SIR-type model1212 H.W. Hethcote. The mathematics of infectious diseases. SIAM review, 42(4) (2000), 599-653.), (1313 W. Kermack & A. Mckendrick. A Contribution to the Mathematical Theory of Epidemics. Proceedings of the Royal Society of London. Series A, Containing papers of a Mathematical and Physical Character, 115(772) (1927), 700-721..

Table 2
Calibrated parameters from the COVID-19 infected people reported in t 0=24/03/21 and t 0=07/04/21, obtained from algorithm ((4.6)) for the corresponding health clusters. The relative errors between predicted and reported cases for the two weeks following the day t 0 for the average scenario are ER 1% and ER 2%. For the severe scenario, ERP 2% is the relative error of the second subsequent week from the day t 0.

We assume that the dynamics of evolution occurs through the coupled system (compartmental model of the SIR type with the interaction of multiple populations and without vital dynamics in the i∈{1, ···, 68} and j∈{1, ···, 83} sites) given by

S ˙ i , j t = - S i , j t β i , j t I i , j t + i ^ , j ^ V i , j β i ^ , j ^ t I i ^ , j ^ t I ˙ i , j t = S i , j t β i , j t I i , j t + i ^ , j ^ V i , j β i ^ , j ^ t I i ^ , j ^ t - γ i , j t I i , j t R ˙ i , j t = γ i , j t I i , j t ,

The initial conditions for the model are obtained at each site (i, j) from the fraction of infected reported cases I i , j(t 0) (at the instant t 0), given by the relationship

S i , j t 0 = 1 - I i , j t 0 - R i , j t 0 , I i , j t 0 0 , R i , j t 0 0 , (3.1)

Furthermore, because the model does not account for reinfection, R i, j (t 0) is the accumulated proportion of individuals removed during the entire course of the pandemic up to time t 0. Thus, R i, j (t 0) represents the population density in each (i, j) site that is not yet infected.

Next, we make some observations regarding the dynamics (3.1)-(3.1), where we take the opportunity to describe the parameters and the notation used. The dynamic (3.1) should be interpreted as follows: the probability of infection is due to contact between susceptible and infected individuals from the same site and is proportional to the effective contact rate β i, j (t), or by contact with infected individuals from neighboring sites, V î,ĵ to î∈{1, ..., 68, :îi} and ĵ∈{1, ..., 83, : ĵj}, which is proportional to the rate β î,ĵ (t). The rate of removals (recovered or dead) is given by γ i, j (t). If β î,ĵ (t)=0 for every î∈{1, ..., 68, :îi} and ĵ∈{1, ..., 83, : ĵj}, then (3.1) reduces to the SIR model for the isolated population at each site, e.g.,1414 M.J. Lazo & A. De Cezaro. Why can we observe a plateau even in an out of control epidemic outbreak? A SEIR model with the interaction of n distinct populations for Covid-19 in Brazil (2020). and references therein. The central question is how the disease spreads through the network given the interaction with the neighborhoods V î,ĵ and, in this case, the respective β î,ĵ (t)≠0. As a result, we will assume that the interaction occurs in an infinitely small time-step relative to the time of the dynamics t, with no migration. In other words, individuals from different populations interact and return to their reference sites faster than the time t→t+t.

We ended this section by showing the well-posedness of the initial value problem (3.1)-(3.1).

Theorem 3.1.Let T>t0≥0 be given. Assume the model parameters be continuous in [t0, T]. Then, the model(3.1)-(3.1)has a unique positive solution in [t0, T]. This means that its state variables remain non-negative for any trajectory initialized at non-negative initial conditions. Moreover, the solution is continuous with respect to the initial data and the model parameters in the interval [t0, T].

Proof. The continuity of the parameters implies the continuity of the right hand side with respect to t. Furthermore, a direct calculation shows that all the coordinates of the Jacobian matrix on the right hand side of (3.1) are continuous. As a result, uniformly bounded in [t 0 , T]. Then, the mean value theorem implies that the right hand site of (3.1) is Lipschitz continuous with respect to the second variable. Hence, the assertions adhere to the general theory of the well-posedness of initial value problems, for example, e.g.1616 L. Perko. “Differential Equations and Dynamical Systems”. Springer (2001).. □

The results in Theorem 3.1 may possibly have an extension to [t 0 , ∞[ (although we do not have a proof for it). Since we are interested in short-running forecasted scenarios of the dynamics (3.1)-(3.1), we restrict ourselves to finite time intervals in Theorem 3.1.

4 ON AN AUGMENTED LAGRANGIAN MAXIMUM A POSTERIORI (AL-MAP) STRATEGY FOR CALIBRATING THE MODEL PARAMETERS

There are three main unknown parameters to be calibrated in the model (3.1). Indeed, the contagious rate within the population β i, j , the contagious rates for interacting populations β î,ĵ and the removing rate γ i, j .

It is worth mentioning that parameter calibration for the model (3.1) is an inverse problem, e.g.,77 A. De Cezaro, A. Marinho de Oliveira & F. Travessini De Cezaro. “Identificação de parâmetros em Equações Diferenciais: Teoria e Aplicações”. UFPI (2012). and references therein.

Since daily reported of COVID-19 infected cases are sparce and subject to many uncertainties, mainly the sub-notifications during the weekends, we use the weekly cumulative of new COVID-19 reported cases Yi,j=Ii,jt0,,Ii,jt0+kτ as observed data, where τ represents one week in the time scale3 3 In the numerical simulations presented in this contribution, we use the data of five consecutive weeks. Hence, k=4. In particular, in the simulations, where we use the Ii,jt0 as the initial conditions and Ii,jt0+τ for the first step in the parameter calibration and Ii,jt0+2τ as the initial conditions and Ii,jt0+3τ for the parameter re-calibration. , for each of the (i, j)-sites.

In this contribution, we propose calibrating the vector of constant parameter Xi,jkt:=βi,jt0+kτt,βi^,j^t0+kτt,γi,jt0+kτt=βi,jk,βi^,j^k,γi,jk, for t∈[t 0 , t 0+τ], using a constrained maximum a posteriori probability (MAP) estimate approach11 V.V.L. Albani, R.M. Velho & J.P. Zubelli. Estimating, monitoring, and forecasting COVID-19 epidemics: a spatiotemporal approach applied to NYC data. Sci Rep, 11 (2021), 9089.), (44 G. Chowell. Fitting dynamic models to epidemic outbreaks with quantified uncertainty: a primer for parameter uncertainty, identifiability, and forecasts. Infectious Disease Modelling, 2(3) (2017), 379-398.. It is given by the following constrained optimization problem

X ^ i , j k a r g m a x X Π p o s t k X i , j k | I i , j t 0 + k τ , s u b j e c t t o C i , j t 0 + k τ X i , j k = 0 (4.1)

where, Ci,jt0+kτXi,jk=I^i,jt0+τ-Ii,jt0+kτ, for I^i,jt0+τ:=Ii,jX^i,jkt0+τ representing cumulative infected populations at the site (i, j) given by the unique solution of (3.1), with initial conditions (3.1) the set of values at t=t 0, as a function of the parameters Xi,jk at t=t 0+τ. ΠpostkXi,jk|Ii,jt0+kτ is the log-posterior density functional given by

Π p o s t t 0 + k τ X i , j k | I i , j t 0 + k τ Π k I i , j t 0 + k τ | X i , j k + Π p r i o r k X i , j k | X i , j k - 1 , (4.2)

for the log-likelihood density

Π k I i , j t 0 + k τ | X i , j k I i , j t 0 + k τ log σ I ^ i , j t 0 + k τ - σ I ^ i , j t 0 + k τ - log I i , j t 0 + k τ ! . (4.3)

Πt0+kτIi,jt0+kτ|Xi,jk is the logarithm of the Poisson distribution with mean given by σI^i,jt0+kτ:=σIi,jX^i,jt0+kτ the unique solution of (3.1)-(3.1), for t=t 0+kτ. The logIi,jt0+kτ! is approximated by the Stirling’s formula:

log I i , j t 0 + k τ ! ~ 1 / 2 log 2 π I i , j t 0 + k τ + I i , j t 0 + k τ log I i , j t 0 + k τ - I i , j t 0 + k τ ,

in the data time-series Yi, j .

Moreover, we use the Gaussian priors in the posterior densities

Π p r i o r t 0 + k τ X i , j k | X i , j k - 1 - α X i , j k - X i , j k - 1 2 (4.4)

as the regularization term that accounts for the uncertainties in the data Ii,jt0+kτ, with α>0 the regularization parameter77 A. De Cezaro, A. Marinho de Oliveira & F. Travessini De Cezaro. “Identificação de parâmetros em Equações Diferenciais: Teoria e Aplicações”. UFPI (2012).. It is worth mentioning that the (MAP) is a log-concave estimate of an unknown quantity that equals the mode of the posterior distribution, resulting in a point estimate of an unobserved quantity based on empirical data. (MAP) estimations are closely related to the method of maximum likelihood (ML) estimation. However, (MAP) employs an augmented optimization objective that incorporates a prior distribution (that quantifies the additional information available through prior knowledge of a related event) over the quantity one wants to estimate. Hence, MAP estimation can be seen as a regularization of maximum likelihood estimation. In particular, the (MAP) estimation proposed in (4.1) can be interpreted as an iterated-Tikhonov regularized solution, e.g., [55 A. De Cezaro. “On a Parabolic Inverse Problem Arising in Quantitative Finance: Convex and Iterative Regularization”. PhD Theses-IMPA (2010)., Section 2.2] or88 A. De Cezaro, O. Scherzer & J. Zubelli. Convex regularization of local volatility models from option prices: Convergence analysis and rates. Nonlinear Analysis: Theory, Methods & Applications, 75(4) (2012), 2398-2415..

Due to the constraints in the optimization problem (4.1), the (MAP) estimation cannot guarantee that any maximizers of Πpostt0+kτXi,jk|Ii,jt0+kτ satisfies the constraints. For that fate, we propose the augmented Lagrangian (AL-MAP). It consists in the introduction of the augmented Lagrange functional L, which is formally defined by4 4 Remember that maxG=min−G, whenever a solution for the problem exist.

L i , j k X i , j k ; λ i , j k , μ i , j k : = - Π p o s t t 0 + k τ X i , j k | I i , j t 0 + k τ + λ i , j k C i , j t 0 + k τ + μ i , j k C i , j t 0 + k τ 2 . (4.5)

The Augmented Lagrangian replaces a constrained optimization problem by a series of unconstrained problems through the so-called duality theory33 D. Bertsekas. “Constrained optimization and Lagrange multiplier methods”. Computer Science and Applied Mathematics. Academic Press, New York (1982).. In (4.5), Xi,jk is the primal variable λi,jk,μi,jk are the dual variables. In particular, λi,jk can be interpreted as a “generalized” Lagrange multiplier and the scalar μi,jk>0 is a penalty factor that allows one to establish a duality relation for problems of non-convex (concave) type as (4.1).

Next, we present a framework from which an algorithm can be derived for the constrained optimization problem (4.1), in terms of the proposed (AL-MAP) approach. The main ideas and necessary definitions can be found in [1717 R. Rockafellar & R.B. Wets. “Variational analysis”, volume 317 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin (1998), xiv+733 p., Cap. 11, Sec. K*] or in [33 D. Bertsekas. “Constrained optimization and Lagrange multiplier methods”. Computer Science and Applied Mathematics. Academic Press, New York (1982)., Section 4.4].

Theorem 4.1. Let the general assumption on this contributions holds true. Then:

  • i) There exists a pairλ¯i,jk,μ¯i,jksupporting an exact penalty representation as in Definition 11.601717 R. Rockafellar & R.B. Wets. “Variational analysis”, volume 317 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin (1998), xiv+733 p..

  • ii) The optimal solutions of the primal and augmented dual problems are characterized as saddle points of the augmented Lagrangian. In other words

X ^ i , j k = inf X L i , j k X i , j k ; λ ¯ i , j k , μ ¯ i , j k , λ ¯ i , j k = sup λ L i , j k X ^ i , j k ; λ i , j k , μ i , j k , f o r a n y , μ i , j k μ ¯ i , j k . (4.6)

  • iii) LetXi,jk,0,λi,jk,0,μi,jk,0and appropriated initial guess given, withμi,jk,0large enough. Then, the sequenceXi,jk,l,λi,jk,l,μi,jk,lgenerated as

X i , j k , l = inf X L i , j k , l X i , j k ; λ i , j k , l - 1 , μ i , j k , l - 1 , μ i , j k , l = 1 + ξ l μ i , j k , l - 1 λ i , j k , l = λ i , j k + μ i , j k , l C i , j k X i , j k , l , (4.7)

  • for positive of ξl , converges towards the saddle point of the augmented LagrangianLi,jkXi,jk;λi,jk,μi,jk.

Proof. Notice that augmented Lagrangian (4.5) is a proximal Lagrangian in the sense of [1717 R. Rockafellar & R.B. Wets. “Variational analysis”, volume 317 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin (1998), xiv+733 p., Exemple 11.57], for a convex function -Πpostt0+kτXi,jk|Ii,jt0+kτ. Hence, the assertions on items i) and ii) follows from [1717 R. Rockafellar & R.B. Wets. “Variational analysis”, volume 317 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin (1998), xiv+733 p., Theorem 11.61] and [1717 R. Rockafellar & R.B. Wets. “Variational analysis”, volume 317 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin (1998), xiv+733 p., Theorem 11.59], respectively. Item iii) follows from the theory developed in [33 D. Bertsekas. “Constrained optimization and Lagrange multiplier methods”. Computer Science and Applied Mathematics. Academic Press, New York (1982)., Section 4.4]. □

5 COVID-19 FORECASTED SCENARIOS FOR RS

In this section, we presented the COVID-19 infected population forecasted scenarios for RS, in one of the hostile phases of the pandemic, namely, between March and May of 2021, in which: a)-called average scenario-in which the forecasts are driven by the dynamics (3.1)-(3.1) with the parameters X^i,jk calibrated from the (AL-MAP) algorithm (4.6) and b) based on changes in the parameters of the situation a), we determine milder scenarios (or optimistic) and more severe (or pessimistic) scenarios by decreasing or increasing the contagion rate β^i,jk calibrated in a). These last two scenarios emulate low or high population adherence to control measures of COVID-19 dissemination.

The presented scenarios are obtained by following the steps below. The numerical routines are implemented by the authors in Octave using the lsqnonlin routine for the nonlinear minimization of the augmented Lagrangian.

  1. Select date t 0 for the database (as in Section 2). Let the initial conditions (3.1) be given at t 0. It corresponds to k=0 in the corresponding data set Yi, j . Let T>t 0, where T is final time in which the predictive dynamics is considered. In our simulated scenarios T corresponds to 28 days.

  2. Calibrate the model parameters with the algorithm (4.6), and then treat them as constant for t 0t≤T.

  3. While t 0t≤T , the forecasts follow the discrete-time (3.1) dynamics corresponding to Euler’s method, with step-size h=1/2.

  4. The forecast is assumed to follow the dynamic (3.1) as long as t≤T or until a new base (and a new t 0, for example t 0t 0+2τ) becomes available. In any case, go back to item 1).

Then, the forecasted scenarios are georeferenced to the corresponding A l health clusters in a process that is, basically, inverse to the data acquisition presented in Section 2. In the end, five distinct layers (in color scales) are generated in the health clusters map, representing, respectively, the number of infected people reported in the analyzed week and the number of infected predicted for the next 7, 14, 21 and 28 days, for each of the previously announced scenarios. These results are made available in a WebSIG (interactive and georeferenced format) on the project’s website5 5 Available in https://exactum.furg.br/ .

5.1 Forcasted example

In this subsection, we present the results of predictions for the infected population by COVID-19 with our proposed strategy. It differs from the ones available on the project’s website4, since it appears below as the graphics of the infected population dynamics of the clusters for the health clusters determined by the controlled distancing model proposed by the government of the state of RS. The parameters are calibrated from algorithm (4.6) based on reported COVID-19 infected people at two databases t 0, in 24/03/21 and 07/04/21, respectively.

We only present the results from the clusters A 1-Santa Maria, A 2-Uruguaiana, A 8-Porto Alegre, A 15-Passo Fundo, A 16-Pelotas and A 18-Caxias do Sul, as shown in Figures 2-3, respectively. Such selections represent clustering with high population density, such as the RS capital Porto Alegre and the industrialized region of Caxias do Sul; medium-populated clustering regions (Passo Fundo, Pelotas, and Santa Maria); and sparsely populated areas, such as Uruguaiana. Hence, it covers the most important population dynamics in the state of RS. It is worth mentioning that the simulations are performed for all the 21 clusters simultaneously.

Figure 2
Infected forecast scenarios at each databased for the corresponding health clusters. The average scenario corresponds to the calibrated parameters presented in Table 2.

Figure 3
Infected forecast scenarios at each databased for the corresponding health clusters. The average scenario corresponds to the calibrated parameters presented in Table 2.

In Table 2, we present the calibrated parameters for the average scenario (the one obtained with the parameters calibrated using the observed data and algorithm (4.6)). The COVID-19 reported cases used as initial conditions and database in simulations are given in the data paper66 A. De Cezaro & et al. Georeferenced data from the COVID-19 pandemic for the state of Rio Grande do Sul: Reported positive cases and simulated forecasts. Latin American Data in Science, 1(2) (2021), 49-62..

Figures 2-3 show the predictions of infected population from the simulations obtained from the calibrated parameters shown in Table 2. The continuous lines in Figures 2-3 correspond to the COVID-19 infected people predictions (for the 3 scenarios-(green, blue, red) correspond to (milder, average, severe) scenarios, respectively. ) for the dates of 07, 14 and 21/04/21 with the parameters calibrated based on the reported infected cases on t 0=24/03/21 as initial conditions. The parameters are then re-calibrated based on the number of infected people reported in the week of the day t 0=07/04/21. The predictions corresponding to the 3 scenarios (with recalibrated parameters as in Table 2 corresponding to t 0=07/04/21) are presented as the dashed curves in Figures 2-3. It corresponds to predicted infected cases of COVID-19, for the dates 21/04, 28/04 and 5/05. The relative error ER j %, for the first two weeks after calibration, is presented in the Table 2.

Note that when the first prediction completes 14 days (continuous lines in 07/04), the errors of the results of the first base start to increase and the curves of the three scenarios deviate from the reported data. However, with the strategy of re-calibrating the parameters and setting new predictions every two weeks, when the “old” predictions start to be less reliable, we already have them replaced with the predictions based on the new database. Despite the many external factors that are not taken into account in the proposed COVID-19 modeling, the presented scenarios show a significant agreement with the reported cases for short time periods (around 15 days)-see the errors in Table 2). In particular, COVID-19 reported cases are enclosed by the forecasted scenarios envelope, in the time horizon of 15 days.

One final observation regarding the results presented in Figures 2-3 is that, for some clusters, the milder scenario describes the data for the long time horizon better than the one predicted by the average scenario (the one corresponding to the calibrated parameters-see last column in Table 2). It is because data reported from the end of March/21 to the beginning of May/21 corresponds to the period in which RS went through the most severe restriction (black flag) of economic activity since the beginning of the COVID-19 pandemic.

6 CONCLUSIONS AND FUTURE DIRECTIONS

In this contribution we present the results of the development of a predictive tool with the capability to help manage the evolution of the COVID-19 pandemic in the state of RS, whose interactive form can be found on the EXACTUM website. This tool is the result of the analysis of georeferenced data supported by the mathematical modeling dynamics of the SIR-type model with a spatial structure and interaction of multiple populations (model 3.1), corresponding to the controlled distance clusters, thus encompassing a forecast for the state as a whole.

We propose an augmented Lagrangian maximum a posteriori estimation method for calibrating the model parameters, based on data from people infected with COVID-19 reported by health authorities. The obtained forecasts provide a good description of the evolution of the reported cases of newly infected (see Table 2 and Figures 2-3) for a short time horizon (15 days). Therefore, we suggest calibrating the parameters in a time window that corresponds to a maximum of two weeks. With this strategy, we verify that the number of infected for future dates remains enveloped by the predictions obtained by the three proposed scenarios.

Future developments of this approach include the possibility of extending the results to all Brazilian states. The difficulties of this extension lie in the need for a higher computing capacity than that used by the team in this project, where the computation was done on a personal computer.

Moreover, there is an urgent need for standardization in the way the reported data is handled by the health authorities of each state or by the federal ministry of health.

Finally, it is important to point out that the inclusion of other relevant information in the modeling, such as ICU bed occupancy forecast and, more recently, vaccination, is easily incorporated into the dynamics 3.1. In this way, extensions in this sense are also possible.

ACKNOWLEDGMENTS

We thank Fundação de Amparo à Pesquisa do Estado do Rio Grande do Sul (FAPERGS), Brazil, Grant No. 20/2551-0000272-1. D.A. Sales acknowledges the support of CNPq, Conselho Nacional de Desenvolvimento Científico e Tecnológico-Brazil.

REFERENCES

  • 1
    V.V.L. Albani, R.M. Velho & J.P. Zubelli. Estimating, monitoring, and forecasting COVID-19 epidemics: a spatiotemporal approach applied to NYC data. Sci Rep, 11 (2021), 9089.
  • 2
    M.d.G. Albuquerque, J.R.d. Santos, A.F. Silveira, D.L. Bornia Junior, R.B. Nunes, T.B.R. Gandra, A.L.C.d. Silva, G.B.d. Miranda & A.R. Trevisol. Influence of socio-economic indicators and territorial networks at the spatiotemporal spread dynamics of Covid-19 in Brazil. Sociedade & Natureza, 33 (2021). doi: 10.14393/SN-v33-2021-59688. URL https://seer.ufu.br/index.php/sociedadenatureza/article/view/59688
    » https://doi.org/10.14393/SN-v33-2021-59688» https://seer.ufu.br/index.php/sociedadenatureza/article/view/59688
  • 3
    D. Bertsekas. “Constrained optimization and Lagrange multiplier methods”. Computer Science and Applied Mathematics. Academic Press, New York (1982).
  • 4
    G. Chowell. Fitting dynamic models to epidemic outbreaks with quantified uncertainty: a primer for parameter uncertainty, identifiability, and forecasts. Infectious Disease Modelling, 2(3) (2017), 379-398.
  • 5
    A. De Cezaro. “On a Parabolic Inverse Problem Arising in Quantitative Finance: Convex and Iterative Regularization”. PhD Theses-IMPA (2010).
  • 6
    A. De Cezaro & et al. Georeferenced data from the COVID-19 pandemic for the state of Rio Grande do Sul: Reported positive cases and simulated forecasts. Latin American Data in Science, 1(2) (2021), 49-62.
  • 7
    A. De Cezaro, A. Marinho de Oliveira & F. Travessini De Cezaro. “Identificação de parâmetros em Equações Diferenciais: Teoria e Aplicações”. UFPI (2012).
  • 8
    A. De Cezaro, O. Scherzer & J. Zubelli. Convex regularization of local volatility models from option prices: Convergence analysis and rates. Nonlinear Analysis: Theory, Methods & Applications, 75(4) (2012), 2398-2415.
  • 9
    D.P. de Paula, D.H. Miranda de Medeiros, E. Lacerda Barros, R.G. Pinheiro Guerra, J.d.O. Santos, J.S. Queiroz Lima & R.M. Leite Monteiro. Diffusion of Covid-19 in the Northern Metropolis in Northeast Brazil: territorial dynamics and risks associated with Social Vulnerability. Sociedade & Natureza, 32 (2020), 639-656. doi: 10.14393/SN-v32-2020-56098. URL https://seer.ufu.br/index.php/sociedadenatureza/article/view/56098
    » https://doi.org/10.14393/SN-v32-2020-56098» https://seer.ufu.br/index.php/sociedadenatureza/article/view/56098
  • 10
    R. Ferreira. Atlas, Cibercartografia e Neogeografia. Revista Portuguesa de Estudos Regionais, (2016), 31-44.
  • 11
    J. Govaert. Coronavirus: UK changes course amid death toll fears. (2020). URL https://www.bbc.com/news/health-51915302
    » https://www.bbc.com/news/health-51915302
  • 12
    H.W. Hethcote. The mathematics of infectious diseases. SIAM review, 42(4) (2000), 599-653.
  • 13
    W. Kermack & A. Mckendrick. A Contribution to the Mathematical Theory of Epidemics. Proceedings of the Royal Society of London. Series A, Containing papers of a Mathematical and Physical Character, 115(772) (1927), 700-721.
  • 14
    M.J. Lazo & A. De Cezaro. Why can we observe a plateau even in an out of control epidemic outbreak? A SEIR model with the interaction of n distinct populations for Covid-19 in Brazil (2020).
  • 15
    P. Longley. Geographical Information Systems: a renaissance of geodemographics for public service delivery. Progress in Human Geography, 29(1) (2005), 57-63. doi: 10.1191/0309132505ph528pr. URL https://doi.org/10.1191/0309132505ph528pr
    » https://doi.org/10.1191/0309132505ph528pr» https://doi.org/10.1191/0309132505ph528pr
  • 16
    L. Perko. “Differential Equations and Dynamical Systems”. Springer (2001).
  • 17
    R. Rockafellar & R.B. Wets. “Variational analysis”, volume 317 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin (1998), xiv+733 p.
  • 18
    L. Rodrigues, M. Varriale, W. Godoy & D. Mistro. “Coupled Map Lattice Model for Insects and Spreadable Substances” (2014), 141-169 p. doi: 10.1007/978-3-319-06877-07.
    » https://doi.org/10.1007/978-3-319-06877-07
  • 19
    World Health Organization. WHO Coronavirus Disease (COVID-19) Dashboard (2020). URL https://covid19.who.int.2020/7/29
    » https://covid19.who.int.2020/7/29
  • 1
  • 2
  • 3
    In the numerical simulations presented in this contribution, we use the data of five consecutive weeks. Hence, k=4. In particular, in the simulations, where we use the Ii,jt0 as the initial conditions and Ii,jt0+τ for the first step in the parameter calibration and Ii,jt0+2τ as the initial conditions and Ii,jt0+3τ for the parameter re-calibration.
  • 4
    Remember that maxG=minG, whenever a solution for the problem exist.
  • 5

Publication Dates

  • Publication in this collection
    28 July 2023
  • Date of issue
    Jul-Sep 2023

History

  • Received
    22 Dec 2021
  • Accepted
    20 Dec 2022
Sociedade Brasileira de Matemática Aplicada e Computacional - SBMAC Rua Maestro João Seppe, nº. 900, 16º. andar - Sala 163, Cep: 13561-120 - SP / São Carlos - Brasil, +55 (16) 3412-9752 - São Carlos - SP - Brazil
E-mail: sbmac@sbmac.org.br