1*Doctorado (Cum Laude) en Ingeniería, jgomez@unimagdalena.edu.co. ORCID: 0000-0002-0840-8743. Universidad del Magdalena,Santa Marta, Colombia.
2*Estudiante de Ingeniería Electrónica, jeraldynredondoep@unimagdalena.edu.co. ORCID: 0000-0003-2453-5360. Universidad del Magdalena,Santa Marta, Colombia.
3*Estudiante de Ingeniería Electrónica, mariatovarsh@unimagdalena.edu.co. ORCID: 0000-0003-1431-2059. Universidad del Magdalena,Santa Marta, Colombia.
How to cite: J. Gómez-Rojas, J.Redondo-Pérez, M. Tovar-Hernández, “Análisis y estimación gráfica del comportamiento del COVID-19 en Colombia, Santa Marta y Cartagena enfocado a la letalidad”, Respuestas, vol. 26, no. 1, pp. 118-131, 2021.
© Peer review is the responsibility of the Universidad Francisco de Paula Santander.This is an article under the license CC BY-NC 4.0.
Received: November 11, 2020
Approved: December 5, 2020.
Keywords
COVID -19, Curve, Deaths, Estimation, Normal Distribution.
Abstract
This document presents the analysis of the behavior of daily cases of contagion and deaths from COVID-19. This is carried out to demonstrate the advance of the pandemic from its beginnings and to estimate the impact of this soon so that health entities are prepared to execute mitigation and containment plans in the population. To achieve its realization, the method of Polynomial Regression and Normal Distribution was used. The results obtained are the curve that models the behavior of cases of contagion and death from the beginning and the prediction of how this will proceed until the end of 2020. The estimation charts model the behavior of COVID-19 based on its ideal procedure.
Palabras claves
COVID -19, Curva, Distribución Normal, Estimación, Muertes.
Resumen
En el presente documento se exterioriza el análisis del comportamiento de los casos diarios de contagio y de muertes por COVID - 19. Esto se lleva a cabo para evidenciar el avance de la pandemia desde sus principios y estimar el impacto de esta en un futuro cercano de forma que los entes de salud estén preparados para ejecutar planes de mitigación y contención en la población. Para lograr su realización se utilizó el método de Regresión Polinomial y Distribución Normal. Los resultados obtenidos son la curva que modela el comportamiento de los casos de contagio y de muerte desde un principio y la predicción del proceder de esta hasta finales del 2020. Las gráficas de estimación modelan el comportamiento del COVID-19 en función de su proceder ideal.
Introduction
COVID-19 is an infectious disease caused by type 2 coronavirus, discovered in late 2019, responsible for severe acute respiratory syndrome, better known as SARS-CoV-2, which affects the respiratory system with frequent symptoms such as nasal congestion, choking sensation, and intense cough [1]. In addition, it can spread rapidly in the population because it can enter through the nostrils, eyes, or mouth to infect a person, in response to this protective measures have been taken as social distancing, hand washing, and use of masks that cover the nose and mouth. This highly contagious disease due to its zoonotic capacity originated in the Wuhan market, the People's Republic of China, which reported on December 12, 2019, that there were 27 cases of patients with a febrile respiratory condition [3]. However, it was not notified to the WHO until December 31, 2019, so that the next day the Wuhan market is closed and finally on January 9, 2020, is published the existence of Coronavirus by the People's Republic of China. Subsequently, the virus spread in such a way that on January 13 the first case was reported in Thailand, i.e. it had already migrated to other places in the region. Finally, on January 30, WHO declared the Coronavirus as a global health threat [2].
Approximately one month later, exactly on February 26th, the first case was published in South America, specifically in Brazil, so that all countries in the region including Colombia began to prepare to face the arrival of possible cases of this virus to the nation [3]. Even though Latin American countries in comparison with most European and Asian countries have higher levels of poverty and health centers were not well prepared to face a pandemic, they managed to take advantage of the time it took to arrive, taking preventive measures, such as mandatory quarantine of travelers coming from other countries, border closures and airport closures [4]. The virus arrived in Colombia on March 6, 2020, when Europe was experiencing the most critical moments of the Pandemic. By that date, the average number of people infecting others in the country was 2.28, relatively lower than in other countries in the region, and on March 23 a national quarantine was decreed [5]. Months earlier the Pandemic had reached Europe, leaving among the most affected countries Italy, which is the European state with the highest number of deaths so far, December 2020, an estimated 55,000 [6]. European countries were hit hard by the Pandemic due to several factors, including the fact that the majority of the population is elderly, it has been estimated that by 2050 36% of the population will be older than 54 years [7], and this virus has a high capacity to lethally affect the elderly and/or people with comorbidities [8]. Among the most affected countries was Spain, which implemented the Daily Mortality Monitoring System (MoMo), which is responsible for estimating mortality for the following days based on death data from January 1, 2008, to one year before the date of analysis, this was very useful because it was able to show the "excess deaths" [9]. Most of the cases from Latin America were imported and originated from the northern cities of Italy [1].
The appearance of COVID-19 in the world has not only brought a new lifestyle but also many technological elements have been created for the control and mitigation of this. Among these, there have been mobile applications focused on the pandemic, which have different objectives, from the daily display of the numbers of infections and death to the tracking of contacts or exposures of those infected. Among these apps we have "TraceTogether", which is responsible for tracking the exposures of an infected person with people around him, was created by the government of Singapore [10], another is one developed in Spain by a research group of the University of Santiago de Compostela (USC) which is called "Modestya", this app can predict the behavior of Coronavirus in 3 days, according to data taken 15 days before [11]. Likewise, a mobile application was created in Colombia that aims to track the health status of people in search of possible asymptomatic cases in the population or mild symptoms, this in the company of the Polymerase Chain Reaction (PCR) tests and thus stop the spread of the virus. This mobile application provides users with updated information on the progress of diagnosed cases and deaths in the country, department, and municipalities, as well as offers educational aids to ensure timely care for the well-being of the individual and the collective benefit [12]. In addition, Colombia is also carrying out an analysis called "COVID-19 coronavirus transmission model" which aims to make the adjustment to a SIR model with a probabilistic component and to help eradicate the virus, estimates possible transmission scenarios in the country [13]. On the other hand, three students from different recognized universities in the country created a "Polynomial regression application for the characterization of the curve of COVID-19, using Machine Learning techniques", which allows the treatment of data by polynomial regression leading to obtaining a prediction of the future behavior of the data [14]. Likewise, a "COVID 19 APP" was developed with MATLAB software, which allows an analysis of cases in the country of interest, as well as curve fitting for the coming days [15]. Likewise, collaborators from different countries created an interactive APP that allows us to manage very easily because it offers us the world map with the different figures of total cases, recovered and deceased instantly and compares through graphs the figures of cases, deaths, and other specific countries [16]. The INS of Colombia also has a web page, very dynamic, which covers a lot of information through different types of graphics, from which we can obtain figures of cases and deaths, classified by regions, gender of people,e and others [17]. These Apps have been developed because the information systems and digital media allow us to address the data on time, as the situation merits it, thanks to which prevention and control measures can be taken in time [18].
The excessive number of deaths from Coronavirus since its inception is evidence of its high degree of lethality. However, comparing this with previous pandemics in history, it does not manage to be the most deadly, because to date the COVID-19 has claimed 1.5 million lives [6], and although it is a considerably large figure, the Black Death left approximately 200 dead [19]. In Colombia, when the virus had just arrived, an analysis was conducted to discern how it would proceed, having observed its behavior since its beginnings in Europe and the factors that had made its population more susceptible to have higher numbers of cases and deaths, among them were differentiated: the geographical and climatic conditions, the percentage of the population at risk, among others [20].
In this article, we will show the operation of an App that allows observing the prediction of the behavior of the curve having as a base tool the use of MATLAB and the commands polyfit and polyval for the adjustment and extrapolation of the curve using the polynomial regression, according to what has been investigated in Google Scholar some investigations use this method, but with a different technique. As well as the implementation of the use of mathematical-computational models to examine how the behavior by COVID-19 develops, in what time it will develop and what levels of infection it can reach, is vital for the control of infection, the good use of resources and health services and the reopening of normal daily life [21]. For the development of these mobile applications, prediction techniques must be taken into account, which aims to know future values using the information observed up to the present time, among them, we find the simple exponential smoothing, the winters method, and the hold method [22]. One of the mathematical bases for the realization of this project was the Gaussian function or normal distribution, which models the figure of a bell and requires data such as standard deviation and mean [23], this was applied in conjunction with the polynomial regression that was performed in a way that modeled the curve, because this, being the number of deaths and daily cases, at some point should fall by the behavior of the pandemic.
Materials and methods
Materials
A. Data collection, first stage.
To obtain the necessary data to model the covid-19 curve in Colombia, Santa Marta and Cartagena were exported the reports on the official page of the Ministry of Health of Colombia [24]. The most relevant data are the cases and deaths accumulated each day, which are treated interactively to give an organized and compact form.
For the analysis of these, they were plotted cumulatively and individually (bar graph).
Mathematical modeling, second stage.
To estimate the behavior of deaths from Covid-19 in Colombia, two mathematical models were used, the normal distribution and polynomial regression, which yield statistical values such as the mean and standard deviation that facilitate the treatment of the data. The normal distribution or Gaussian bell is a statistical model capable of approximating the value of a random variable to an ideal situation, in which the successive standard deviations concerning the mean establish reference values to estimate the percentage of observations of the data [24].
To work with this model we use (1) and (2).
Polynomial regression is the prediction of a quantitative response variable from a quantitative predictor variable, where the relationship is modeled as a polynomial function of order n, polynomial models can be fitted by linear least-squares regression because, although they generate nonlinear models, their formulation is still a linear equation with predictors. [26] The objective of this is to determine values for the parameters of the equation which make the curve fit the data points better [27]. Therefore, starting from the linear model (3).
A polynomial model of degree n is obtained from (4).
B. Strategy for visualization and extrapolation, third stage.
In MATLAB, there are several tools for curve fitting and one of them is the "fit" command which allows you to fit a curve in various ways such as linear, cubic, and many more. To work with the normal distribution we used the "Gauss1" mode which is the Gaussian bell of order 1 that fits the curve and interpolates the missing data to shape it; at most it yields three variables that correspond to the components of the equation that describes this behavior.
As you can see (1) and (5) are equivalent.
The "polyfit" and "polyval" commands are also used, which are related to polynomial regression, which consists of adding curvature to the model by introducing new predictors that are obtained by raising the original ones to different powers.
The polyfit function returns the coefficients for a polynomial p(x) of degree n that is the best solution (in the least-squares sense) for the y data. This function also returns mu which is a two-element vector with centering and scaling values, mu (1) corresponds to the mean, and mu (2) is the standard deviation [28]. For this process, the software uses (6).
Polyval allows you to give values to the previous result.
After fitting the curve to a polynomial of degree 7 which was the best fit to the data since the degree varies according to the dispersion of the data. This process was done with the cases and deaths. As for the approximation of the curve, as of December 31, the two commands were handled separately, both were adjusted to the normal distribution formula with certain variants, and only the estimate of daily deaths by COVID-19 was made.
An important tool in the extrapolation process is Matlab's anonymous function, which allows mathematical expressions to take on future values or be treated as a variable that depends on the values entered by the user.
By having the data reported up to the end of November, the number of days remaining until the end of the year was calculated and this figure was added to the number of data reported. With this, the data found were plotted. For example, in Colombia from the first case which was reported on March 6, 2020, until November 24 of the same year there are 249 reported days, from this last date until December 31 there are 37 days, therefore, for the estimation 286 evaluated and reported days will be taken into account.
The only variation of the process between the fit command and the polyfit is that for the second process (2) is used since it works more generally with polynomially analyzed data.
Results and Analysis
From the data tables with cases and deaths, we obtained the growth curve and the bar graph, which describe the behavior of this in Colombia, Santa Marta, and Cartagena.
In this process we observe that in the two cities analyzed there is a similar behavior until November, being Cartagena the one that presents a range of high data. In general, in Colombia, the data is more uniform.
Concerning curve fitting, the Gaussian is a little more accurate in what you want to model, but the polynomial fits a little more to the real data because it can present more than one peak, while the Gaussian is faithful to its principle, which is based on an exponential rise followed by a peak and an exponential fall. Particularly, when trying to fit the curve polynomially, if very scattered data is presented the curve tries to get as close as possible to the real thing but it always moves away from little from the approximations, this is mainly because the equation that describes the polynomial is being adapted to the form of the gaussian bell and although the standard deviation and the mean are similar to the results of the first method, the expression that multiplies the complex exponential varies and modifies the amplitude of the graph, but even so, the polynomial curve adjustment provides a slightly slower decent speed, which is a little closer to the behavior of coronavirus mortality.
When the curve is fitted, it is extrapolated and shaped up to 31 December 2020.
In the extrapolation, the graphs made by the Gaussian method present a sharper shape this is presented due to the equation that prioritizes the peak to predict how the graph goes down, but the polynomial extrapolation corresponds a little more to the real model of the coronavirus. Although, the polynomial analysis handles a very small scale that does not fit the real data.
Conclusions
With the results presented in this article we note that, the data could be successfully materialized and show the behavior of COVID-19 in Santa Marta and Cartagena along with the mortality associated with this virus, thanks to this we can see that being the two cities tourist and cultural districts, in addition to having different population density presented its first date of infection relatively close to that of the country, specifically in Bogotá D.C., but Cartagena also reported before the contagion had a higher peak and closer to the original date of diagnosis [24]. These differences are due to the flow of tourists in each city and the rapid spread of the virus from Bogota to these urban centers.
Emphasizing the extrapolation, it can be inferred that it is an ideal case because the graphs represent the approximate behavior of the curve if the mandatory quarantine were still in force and many economic sectors of society had not been reactivated. The graph shows that in December, deaths decrease notably. For this reason, the extrapolation is rather an estimate of deaths in Colombia until the end of 2020, because the curve is adapting to the shape of the mathematical model of normal distribution and represents the behavior of deaths with the guidelines and measures necessary to weaken the impact of COVID-19 on the health of Colombians; currently, the real curve has new peaks.
Similarly between Colombia and Spain, which are countries with a similar population (46.94 million for Spain and 49.65 million for Colombia); their behavior differs in many ways, Spain for its part has a rather irregular curve of deaths as it suffers unexpected fluctuations and even a constant period of low cases, however, then presents the highest peak. These differences are due to the first interaction of each country with the virus, the preparation they had to face it and the speed of response to the collapse of the health system, taking into account that in Spain the first case of COVID-19 was recognized in late January [29] and in Colombia, it was reported in early March. It is worth noting that even at their highest peaks there is a difference of almost 500 deaths (approximately 1000 for Spain and 400 for Colombia).
At an international level, the behavior of COVID-19 mortality in Argentina presents a curve very similar to the Gaussian bell, since they began with increasing mortality and upon reaching the peak began to decay exponentially, which shows that with appropriate response and prevention measures the curve can be shaped to the ideal situation, knowing that the first dates of infection between Argentina and Colombia are similar [30].
To conclude, this research shows that deaths from COVID-19 cannot be avoided because it is a utopian situation, but with strict measures by the control agencies, quality care, good management of medical resources, and essentially the compliance and responsibility of the prevention rules by the community, it is possible to reach a fairly low lethality.
References
[1] M. Á. S. Valdés, “Infección respiratoria aguda por COVID-19: una amenaza evidente”, Revista Habanera de Ciencias Médicas, vol. 19, nº 1, febrero 2020.
[2] B. NEWS, “Coronavirus: ¿Qué hizo China con respecto al brote temprano?”. 9 junio 2020. [Online]. Disponible en: https://www.bbc.com/news/world-52573137.
[3] E. ESPECTADOR, Redacción Internacional, “América Latina: los retos en seguridad en tiempos de crisis”, 22 noviembre 2020. [Online]. Disponible en: https://www.elespectador.com/noticias/el-mundo/america-latina-los-retos-en-seguridad-en-tiempos-de-crisis/.
[4] L. D. Acosta, “Capacidad de respuesta frente a la pandemia de COVID-19 en América Latina y el Caribe”, Revista Panamericana de Salud Pública, vol. 44, nº e109, septiembre 2020.
[5] D. Rosselli, “Covid-19 en Colombia: los primeros 90 días”, Acta Neurológica Colombiana, vol. 36, nº 1, mayo 2020.
[6] Statista, Abigail Orús, “Número de personas fallecidas a causa del coronavirus en el mundo a fecha de 23 de noviembre de 2020, por país”,23 noviembre 2020. [Online]. Disponible en: https://es.statista.com/estadisticas/1095779/numero-de-muertes-causadas-por-el-coronavirus-de-wuhan-por-pais/.
[7] Centro Internacional sobre el envejecimiento, Juan Tomás Martín,” Europa se tiñe de gris”, 9 noviembre 2018. [Online]. Disponible en: https://cenie.eu/es/blog/europa-se-tine-de-gris. [Último acceso: 23 noviembre 2020].
[8] et al., “Preparación y control de la enfermedad por coronavirus 2019 (COVID-19) en América Latina”, Acta Médica Peruana, vol. 37, nº 1, marzo 2020.
[9] et al., «Encajando el puzle: Una estimación rápida del número de infectados por COVID-19 en España a partir de fuentes indirectas,» Documentos de trabajo (FEDEA), nº 5, 2020.
[10] arXiv, H. C., D. I., Y. W.Y., “Contact Tracing Mobile Apps for COVID-19: Privacy Considerations and Related Trade-offs”, 30 Marzo 2020. [Online]. Disponible en: https://arxiv.org/abs/2003.11511.
[11] E. NOTICIA, “Investigadores desarrollan una app que predice la evolución del coronavirus los próximos tres días”, 04 abril 2020. [Online]. Disponible en: https://www.20minutos.es/noticia/4216598/0/investigadores-desarrollan-app-predice-. evolucion-coronavirus-proximos-tres-dias/?autoref=true.
[12] D. M. López, “Uso de tecnologías en el lugar de atención para el manejo de la pandemia por COVID-19 en Colombia”, Revista Panamericana de Salud Pública, vol. 44, nº e97, agosto 2020.
[13] Instituto Nacional de Salud, et al., “Modelo de Transmisión del Coronavirus COVID-19”, 22 abril 2020. [Online]. Disponible en: https://www.ins.gov.co/Direcciones/ONS/SiteAssets/Modelo%20COVID-19%20Colombia%20INS_v5.pdf.
[14]W. C. M. S. M. Gabriel Chanchí Golondrino, «Aplicación de la regresión polinomial para la caracterización de la curva del COVID-19, mediante técnicas de machine learning,» Investigación e Innovación en Ingenierías, vol. 8, nº 2, 2020.
[15] MATLAB Central File Exchange, C. Tapia, “APP de COVID 19 con MATLAB”, febrero 2020. [Online]. Disponible en: https://la.mathworks.com/matlabcentral/fileexchange/75152-app-de-covid-19-con-matlab.
[16] OpenStreetMap Contributors, et al., “The Coronavirus App”, Open Database License, 2020. [Online]. Disponible en: https://coronavirus.app/map.
[17] Instituto Nacional de Salud, «INS», 2020. [Online]. Disponible en: https://www.ins.gov.co/Noticias/Paginas/coronavirus-casos.aspx.
[18] Organización Panamericana de la salud, “COVID-19 Y LA IMPORTANCIA DE FORTALECER LOS SISTEMAS DE INFORMACIÓN”, 2020. [Online]. Disponible en: https://iris.paho.org/bitstream/handle/10665.2/52128/COVID-19FactsheetIS4H_spa.pdf?sequence=14&isAllowed=y.
[19] ABC Sociedad, L.A., “El coronavirus, comparado con las pandemias más letales de la historia”, 14 abril 2020. [Online]. Disponible en: https://www.abc.es/sociedad/abci-coronavirus-no-lejos-peor-epidemias-mas-letales-historia-202003201350_noticia.html.
[20] et al., “COVID-19 in Colombia endpoints. Are we different, like Europe?”, Research in Social and Administrative Pharmacy, vol. 17, nº 1, enero 2021.
[21] M. A. V. K. H. Antonio J Bravo, «Modelos matemáticos estimadores de la infección por COVID-19: Consideraciones esenciales y proyecciones en Colombia,» Revistas de Salud Pública, vol. 22, nº 3, mayo 2020.
[22] Universitat de Barcelona, “TÉCNICAS DE PREDICCIÓN”, 2020. [Online]. Disponible en: http://www.ub.edu/aplica_infor/spss/cap8-5.htm.
[23] G. M. Reyes, “La función de probabilidad normal: Características y aplicaciones”, eXtoikos, nº 6, 2012.
[24] Ministerio de Salud y Protección Social Colombiano, “Datos Abiertos”,2020. [Online]. Disponible en: https://www.datos.gov.co/Salud-y-Protecci-n-Social/Casos-positivos-de-COVID-19-en-Colombia/gt2j-8ykr/data.
[25] J. D. S, “LA DISTRIBUCIÓN NORMAL”, Revista Chilena de Anestesia, vol. 43, nº 2, 2014.
[26] J. A. Rodrigo, “Métodos de regresión no lineal: regresión polinómica, regression splines, smooth splines y GAMs”, febrero 2017 [Online]. Disponible en: https://www.cienciadedatos.net/documentos/32_metodos_de_regresion_no_lineal_polinomica_splines_gams.
[27] H. J. M. L. A. Ransnas, “Fitting curves to data using nonlinear regression: a practical and nonmathematical review”, The FASEB Journal, vol. 1, nº 5, 1987.
[28] I. © The MathWorks, “polyfit”, MathWorks, 2005. [Online]. Disponible en: https://la.mathworks.com/help/matlab/ref/polyfit.html.
[29] R. Ferrer, “Pandemia por COVID-19: el mayor reto de la historia del intensivismo”, ScienceDirect, vol. 44, nº 6, pp. 323-324, 2020.
[30] L. S.-C. N. T. María Cecilia Johnson, “Emociones, preocupaciones y reflexiones frente a la pandemia del COVID-19 en Argentina”, SciELO, vol. 25, p. 2447-2456, 2020.