Access the full text.

Sign up today, get DeepDyve free for 14 days.

Open Computer Science
, Volume 12 (1): 10 – Jan 1, 2022

/lp/de-gruyter/predicting-and-monitoring-covid-19-epidemic-trends-in-india-using-PzBc7PJaGV

- Publisher
- de Gruyter
- Copyright
- © 2022 Koyel Datta Gupta et al., published by De Gruyter
- eISSN
- 2299-1093
- DOI
- 10.1515/comp-2020-0221
- Publisher site
- See Article on Publisher Site

1IntroductionAs on May 30, 2020, India has the highest number of SARS-CoV-2 cases in Asia, with total cases counting up to 173,763 cases, 4,971 deaths, and 82,370 recoveries [1]. The fatality rate is near 3.09%. The first confirmed case of India began in Kerala at the end of January 2020 and has since reported a total of 33,050 cases till April 30, 2020. In the month of March, the basic reproduction rate (R0) of the SARS-CoV-2 in India was around 1.7 [2], which was on the lower side with respect to the major affected countries. Now, the reproductive rate ranges between 1.22 and 1.23. However, according to some beliefs, the lower transmission rate of the virus can be due to the lack of mass testing facility in India. Till February 3, 2020, three confirmed COVID-19 cases were reported in India. On February 20, 2020, the patients in Kerala recovered and were discharged. However, by March 9, 2020 the number of confirmed SARS-CoV-2-infected patients in India became 44 with 4 from New Delhi, the capital of India. Six new cases were reported in Kerala, 14 in Haryana, 9 in Uttar Pradesh, 2 cases each in Ladakh and Rajasthan, and 1 each in Tamil Nadu, Punjab, Telangana, and Jammu including 16 cases of foreign patients. The World Health Organization declared the outbreak of COVID-19 as a pandemic. By March 14, 2020 India reported 84 corona positive cases with two fatalities. India being the second highest populous country needed to act fast to control the COVID-19 outbreak. Initially government started by shutting down educational institutes and placed a stop on tourism. A voluntary curfew was observed on March 22, 2020, on the request of Shri Narendra Modi, the Prime Minister of India. With the continuous rise in the number of COVID-19 patients, the Prime Minister of India ordered a complete India-wide lockdown, under section 6(2)(i) of the disaster management Act, 2005, effective from March 25, 2020. By April 12, 2020, the number of confirmed cases rose to 8,447 with 273 deaths. On April 14, the lockdown was extended till May 3. The four states with a maximum number of COVID-19 cases are Maharashtra, Tamil Nadu, Delhi, and Gujarat. The lockdown has further been extended till May 31, 2020. However, since April 20, 2020, certain relaxation has been provided to the places that have restricted the spread of the virus. The lockdown may have reduced the rate of spread of SARS-CoV-2, but has hit the economy of India badly. Even with the tremendous effort from government and other sources to ease the impact of lockdown on daily wagers, small-scale entrepreneurs, and others who have lost their source of income, it has become an extremely difficult situation. India is facing huge financial challenges due to the loss of business in international and domestic markets, loss in revenues, crisis in tourism and other industries, and the economic liability for controlling COVID-19 in terms of management and treatment. The trajectory of the pandemic already has a critical impact on the Indian economy. Now with the restrictions of the lockdown being removed gradually in India, the outbreak is observing a steep rise in the number of COVID-19 cases. The forecast of the number of cumulative COVID-19 patients in India is the primary objective of this article. India being a densely populated country, it is important to understand the pattern of the outbreak under extreme measures taken by Government to curb the virus menace. To tackle the financial difficulties, states of the nation have eased the restrictions on certain business activities. It is important to foresee the results of these actions on the pandemic spread in the country. This article focuses on the prediction of the basic transmission rate of COVID-19 in India, projecting the possible number of cases by mid-June and the challenges involved. For predicting any temporal data, machine learning algorithms is usuallya primary choice. In this article, we have modeled an efficient Seq2Seq model designed optimally by deploying root mean square propagation (RMSProp) technique. This model relies on the training from the previous records of COVID-19 cases. However, to understand the reproduction rate of the virus, we use an adaptive SEIR model to forecast the reproductive number of SARS-CoV-2in India. With the help of the results obtained, an analysis of the performance of both the methods is estimated in terms of root-mean-squared error (RMSE). The next task is analyzing the performance of a deep learning model and a disease model and how the change in Government policies is affecting the pandemic curve. The rest of this article is organized as follows. The related works are presented in Section 2. Section 3 proposes the methodology. Section 4 shows the simulation. The research work is concluded in Section 5.2Related worksThe last few decades have witnessed the outbreak of many zoonotic diseases, which were highly infectious and fatal in many cases. Several studies were presented for predicting the outbreak of these diseases. The researchers have used machine learning models, traditional disease models, and even statistical models to estimate the outbreak of the epidemics. An adaptive neuro-fuzzy inference system combining the properties of ANN and fuzzy logic systems is proposed in ref. [3] for temporal data prediction. A mathematical model is presented in ref. [4] to monitor the infection trends of the SARS epidemic during 2003. A comparative study of four temporal data forecasting models is proposed in ref. [5]. The study uses 13 years records of Hepatitis A infection in Turkey and estimates the performance of prediction of the disease using different ANN models (MLP, time-delay neural networks, and RBF) and statistical model (ARIMA). The article concludes that MLP performs best in the given condition. The article [6] presents a Bayesian inference-based dynamic model to anticipate the deadly Ebola virus spread in African nations. In ref. [7], the authors proposed a model to forecast West Nile virus (WNV) transmission risk using the records of the WNV spread in long Island during 2001–2014. For monitoring the outbreak of influenza A (H1N1-2009), a forecasting model is proposed in ref. [8]. In ref. [9], a probability-based model is deployed to predict the worldwide spread of the MERS. The authors in ref. [10] emphasized the use of IoT, blockchaining, big-data analytics, AI, and deep learning to build a prediction model for COVID-19 and estimate its effect on the healthcare sector. In ref. [11], an ARIMA model is deployed in to forecast the spread of COVID- 2019. To estimate the disease reproductive rate, many researchers have deployed several epidemiological models. The SIR and SEIR is a disease model [12] with independent variable time and dependent variables which is the population expressed as a function of time. The population can be categorized as susceptible, infected and removed and in some cases as exposed. A logistic and exponential model is presented in ref. [13] for disease prediction. In ref. [14], the serial interval of MERS and SARS to assess the serial interval of COVID-19. The authors in ref. [15] assess the rate of virus reproduction using a Reservoir-People transmission network model for analyzing the transmission pattern of the disease. The work [16] proposes a delay adaptive SIR model that and evaluates the performance with various refrainment strategies. In ref. [17], the nature of exposure to the virus is utilized in computing its impact on patients. A modified SEIR model is used in ref. [18] with a moving population for susceptible and exposed category. Both the articles [19,20] propose modified mathematical and statistical models to forecast COVID-19 trends. The authors in ref. [21] modified the SIR model by including variable denoting the death of patients. The aims of ref. [22] are to predict COVID-19 and to identify individuals who may exhibit severe symptoms. The authors in ref. [23] build two models to improve the performance of CNN and ConvLSTM for COVID-19 detection, using data augmentation. A flower pollination-based optimized GRNN model is proposed in ref. [24] for the prediction of COVID-19 in Europe. The authors in ref. [25] presented deep learning techniques based on CNN to determine SARS-CoV-2 using X-ray and CT images and claims to achieve an accuracy of 100.3MethodologyThe highly contagious characteristic of the novel corona virus makes it even more deadly. To study the future of this pandemic in India, the dataset has been collected from several sources. After the first confirmed case in January 2020, there were only three confirmed cases of COVID-19 in India during the entire month of February 2020. Even with the government-imposed social distancing, the number of cases started rising abruptly from the end of March and is still continuing. To estimate the pattern of this time series, we have used two approaches. The first approach tries to estimate the reproduction rate of the virus. The varied scenario in India and diverse measure, taken to restrict the epidemic outbreak have inspired us to choose an epidemiological model (adaptive SEIR) to predict the transition of different stages of SARS-CoV-2 transmission. The second approach uses an efficient Seq2Seq model optimally configured using the RMSProp for the faster convergence rate.3.1Adaptive SEIRThe SEIR model used in his work is modified to accommodate the changing conditions in India. Unlike many countries, India has started taking measures like isolation of infected and exposed population and lockdown for maintaining social distancing among entire population to reduce the rate of COVID-19 transmission. Another important factor is that the testing for the disease is quite low in India, which may mean that many asymptotic patients are undetected for the virus. We have incorporated the concept of herd immunity. The infected person in this case does not exhibit any symptoms of the virus and eventually develops antibodies for the virus. But these undetected patients can still spread the infection to other susceptible individuals. The overall population (75% of the total population of the country) is divided (shown in Figure 1) into susceptible (S ∈\in population who have neither been exposed nor contracted the disease), exposed (E ∈\in population who have been in contact with an infected person, may have been identified for isolation, not yet infected, considering incubation period of the virus), infected (I ∈\in population who are infected, may be asymptomatic or otherwise, may have been detected or otherwise), and removed (R ∈\in population who have recovered or succumbed to the disease). The birth and death rates have been assumed to be a constant.Figure 1State transition diagram depicting transmission stages of COVID-19.To explain the working of the adaptive SEIR model, the essential symbolic notations used in the equations are described in Table 1.Table 1Symbolic notation for adaptive SEIRS. NoSymbolDetails1β\beta Rate of transmission from S to E stage2σ1{\sigma }_{1}Rate of incubation and the developing asymptomatic infection3σ2{\sigma }_{2}Rate of incubation and the developing of infection with symptoms4γ1{\gamma }_{1}Rate of recovery for asymptomatic cases5α1{\alpha }_{1}Rate of recovery for detected symptomatic cases6α2{\alpha }_{2}Rate of death for detected symptomatic cases7R0{R}_{0}Basic rate of reproduction8NNTotal populationDefinition 1The rate of change in susceptible population is given by the product of the rate of transmission from stage S to E stage and subsequently to I stage due to contact between a susceptible subject (S) and an infected (I).(1)dsdt=−βSIN.\frac{{\rm{d}}s}{{\rm{d}}t}=\frac{-\beta {\rm{SI}}}{N}.Definition 2The rate of change of exposed population is given by the fraction of susceptible population transforming to exposed (equation (1)) minus the fraction of exposed person who becomes infective (asymptomatic and symptomatic).(2)dEdt=βSIN−σ1E−σ2E.\frac{{\rm{d}}E}{{\rm{d}}t}=\frac{\beta {\rm{SI}}}{N}-{\sigma }_{1}E-{\sigma }_{2}E.Definition 3The rate of change of infected population (asymptomatic) is given by the fraction of the exposed population who becomes infected without exhibiting any symptoms (equation (2)) minus the fraction of asymptomatic infected persons who have become resistant/healed (considering the concept of herd immunity).(3)dIadt=σ1E−γ1Ia.\frac{{\rm{d}}{I}_{a}}{{\rm{d}}t}={\sigma }_{1}E-{\gamma }_{1}{I}_{a}.Definition 4The rate of change of infected population (symptomatic) is given by the fraction of the exposed population who becomes infected, exhibiting symptoms of viral infection (equation (2)) minus the fraction of these infected persons who either become resistant/healed or succumbs to the disease.(4)dIddt=σ2E−γ2Id(I=Ia+Id).\frac{{\rm{d}}{I}_{d}}{{\rm{d}}t}={\sigma }_{2}E-{\gamma }_{2}{I}_{d}\hspace{0.33em}\left(I={I}_{a}+{I}_{d}).Definition 5The rate of change of resistant/recovered population is given by the fraction of asymptomatic infected population and detected infected population of viral infection who gets cured and become resistant to the virus (equations (3) and (4)).(5)dRcdt=γ1Ia+α1Id.\frac{{\rm{d}}{R}_{c}}{{\rm{d}}t}={\gamma }_{1}{I}_{a}+{\alpha }_{1}{I}_{d}.Definition 6The rate of change of the deceased population is given by the fraction of the detected infected population of viral infection who succumbs to the disease (equation (4)).(6)dRddt=α2Id(γ2=α1+α2).\frac{{\rm{d}}{R}_{d}}{{\rm{d}}t}={\alpha }_{2}{I}_{d}\hspace{0.33em}\hspace{0.33em}\left({\gamma }_{2}={\alpha }_{1}+{\alpha }_{2}).Definition 7The rate of reproduction/transmission of the virus is given by the ratio of the rate of transmission from S to E and subsequently to I and the rate of recovery and fatality of the infected population.(7)R0=βγ1+γ2.{R}_{0}=\frac{\beta }{{\gamma }_{1}+{\gamma }_{2}}.3.1.1Algorithm 1Algorithm 1: Adaptive SEIR1:Initialize the variable NNto the total population of India. The initial values of susceptible population is computed as S=(N−I)/NS=\left(N-I)\hspace{0.1em}\text{/}\hspace{0.1em}N.2:Set maxdays = 1323:Initialize the values of β\beta , γ1{\gamma }_{1}, γ2{\gamma }_{2}, σ1{\sigma }_{1}, and σ2{\sigma }_{2}4:The change in the number of susceptible, exposed, infected, and recovered/dead population is calculated using equations (1)–(6)5:Step 3 is repeated for maxdays and fitted to existing data and the values of β\beta , γ1{\gamma }_{1}, γ2{\gamma }_{2}, σ1{\sigma }_{1}, and σ2{\sigma }_{2}are updated.6:The reproduction rate R0{R}_{0}is computed using equation (7).7:The values of are used to predict the values of infected people in the month of June.8:The method is validated using “holdout.”3.2Efficient sequence-to-sequence algorithmDeep learning is a machine learning technique that builds models to imitate human reasoning and learning. Models are trained to perform the task of classification or prediction by using a large dataset and multi-layered neural network architectures. The recurrent neural network (RNN) is a category of deep neural networks that has feedback loops within the network enabling them to store past record to train previous series [26]. The basic RNN model experiences the problems of vanishing and exploding gradient. In the case of long sequences, the back-propagated gradient values get reduced, which causes failure in updates of the weights and large gradients may explode for long temporal sequences generating unstable weights. The long-short-term-memory (LSTM) cell was introduced in ref. [27], which are capable of saving long-term dependencies in temporal sequences. These RNNs with LSTM core are capable of automatic feature extraction and are efficient in time series prediction since they integrate the knowledge of related information from the previous inputs. These networks work well for non linear patterns. This type of RNNs exhibit better prediction accuracy due to their extended range dependence [28]. A single LSTM unit consists of a memory unit/cell and three gates, the input, the output, and the forget. Activation functions are applied at the gates. A high activation value at the input signifies that the input information needs to be saved in the memory, a high value at the output triggers the release of the stored data to the next neuron, and a high value at the forget gate erases the data from the memory unit. Now, sequence-to-sequence (Seq2Seq) [29] architecture (Figure 2) was initially used for neural machine translation. The model is suitable for ML tasks for time series prediction. A Seq2Seq model has three modules including an encoder, an intermediate vector, and a decoder. LSTM cells constitute the encoder. The input sequence is presented to the encoder which extracts features of the input elements, and the last hidden state is called the context vector. This vector encodes the entire information from the input data. Multiple LSTM units constitute the decoder, and each unit calculate its individual hidden state and generates an output data. This article proposes applying the Seq2Seq model to predict the COVID-19 curve in India. To explain the encoder–decoder model used in this study, the essential symbolic notations used in the equation are presented in Table 2.Figure 2Basic sequence to sequence model.Table 2Symbolic notation for sequence-to-sequenceS. NoSymbolDetails1xt{x}_{t}Input vector to the LSTM encoder2ht{h}_{t}Hidden state vector of the encoder3st{s}_{t}Hidden state vector of the decoder4yt{y}_{t}Output vector of the decoder5whx{w}^{hx}Weights integrating input vector information6whh{w}^{hh}Weights integrating information from the previous time-step7ws{w}^{s}Weights at the last hidden state8v,wv,wWeight vectors9ct{c}_{t}Semantic vector at time tt10eti{e}_{ti}Degree of influence of the hidden state hi{h}_{i}at time tt11ζti{\zeta }_{ti}Normalized value of the Softmax function3.2.1EncoderIt consists of one or more LSTM units each accepting input sequence, assimilating information, and forwarding it. The equation for the encoder [30] is given as follows: (8)ht=f(whh∗ht−1+whx∗xt).{h}_{t}=f\left({w}^{hh}\ast {h}_{t-1}+{w}^{hx}\ast {x}_{t}).3.2.2Intermediate vectorThe last hidden state of the encoder generates the intermediate vector using equation (8). This vector aims to assimilate the source sequence information and acts as the decoder component’s first hidden state3.2.3DecoderIt consists of one or more LSTM units. Each LSTM cell is provided the trailing hidden state and generates output and current hidden state. The hidden state at current step, tti.e., hthtis computed by the following equation [30]: (9)st=f(whh∗st−1).{s}_{t}=f\left({w}^{hh}\ast {s}_{t-1}).The output at time step is given by the equation [30]: (10)yt=f(ws∗st).{y}_{t}=f\left({w}^{s}\ast {s}_{t}).Softmax is used to generate the output sequence. An attention mechanism can be used to save the output sequence of the encoder’s source. Bahdanau attention [31] arranges the input and output sequences, with an alignment value. It ensures that the most important information in the input receives attention.(11)ct=∑i=1T(ζti∗hi).{c}_{t}=\mathop{\sum }\limits_{i=1}^{T}\left({\zeta }_{ti}\ast {h}_{i}).(12)ζti=exp(eti)∑k=1Texp(etk).{\zeta }_{ti}=\frac{\exp \left({e}_{ti})}{{\sum }_{k=1}^{T}\exp \left({e}_{tk})}.(13)eti=vT∗tanh(w[st−1,hi]).{e}_{ti}={v}^{T}\ast \tanh \left(w\left[{s}_{t-1},{h}_{i}]).(14)st=tanh(w[st−1,yt−1,ct]).{s}_{t}=\tanh \left(w\left[{s}_{t-1},{y}_{t-1},{c}_{t}]).RMSprop is used to optimize the learning rate of the LSTM. Moving average of gradient-square to normalize. This enables to increase the step size vanishing gradient and to decrease the step for large gradient.3.2.4Algorithm 2Algorithm 2: Sequence-to-sequence1:Load input data.2:Built the Seq2Seq model using equations (8)–(10).3:Bahdanau attention (equations (11)–(14)) is applied on the encoder outputs.4:Split the input data into training and testing set as 3:1.5:During training RMSProp is used to optimize the LSTM units.6:After the completion of the training phase, testing data is loaded and the corresponding output is analyzed. The validation is done using “holdout” method.7:The final model is used for prediction of the pandemic curve in India in the month of June.4SimulationWith the first COVID-19 patient detected on January 30, India as on 9 June has 2, 70,876 confirmed cases. Figure 3 shows the number of total, currently active, recovered, and death cases in India from January 30, 2020, to June 9,2020.Figure 3COVID-19 cases in India.Figure 4a and b depicts the recovery rate and death rate till June 9, 2020.Figure 4(a) Recovery rate and (b) death rate of COVID-19 in India.The dataset of COVID-19 patients (records till June 9,2020) of India has been used to analyze the percentage of death among infected person on the basis of the age group. However, many patient records have no information related to age or gender. We have tried to find the impact of the age of a COVID-19 patient on their likelihood of death. Figure 5 shows the percentage of death of the infected population (based on only those patient records where information of their age was present). As evident, the population over 60 years is most susceptible to succumb to COVID-19. In India, the patients whose age is below 40 showed less probability of dying die due to SARS-CoV-2 disease. However, due to the lack of complete age information about every corona patient in India, it is difficult to draw a concrete supposition about the relation of the infected individual’s age and the probability of death due to the disease. For building the adaptive SEIR model, the effects of lockdown, isolation of exposed, or infected population and unlocking are considered while forecasting the pandemic pattern in India.Figure 5Age-group wise percentage of death of COVID-19 patients in India.4.1Adaptive SEIR setupThe proposed model is implemented in Python. The model predicts the daily reported COVID-19 cases in India. The parameters are calculated to fit the COVID-19 data of India. It is assumed that initially the count of the susceptible population S(N−I)/N≈NS\left(N-I)\hspace{0.1em}\text{/}\hspace{0.1em}N\approx N, where the total population is NNand count of the infected population I=1I=1(as on January 30, 2020). The initial value of R0{R}_{0}is set to 3.2 at time t=0t=0. The value of β\beta was computed equal to 0.63 and value of 0.31. Equation (7) gives the rate of spread of R0=2.03{R}_{0}=2.03.4.2Sequence-to-sequence model set upThe sequence-to-sequence model built in this work has two LSTM encoder units each in the encoder and two LSTM decoder units constitute the decoder. Bahdanau attention is used on the encoder’s output. The dropout was set at 0.3. The initial learning rate of the experiment was set to 0.01, and RMSProp was used for optimizing the hyper parameters.4.3Experimental resultsTo estimate the value of the parameters of the modified SEIR model and sequence-to-sequence model, datasets of COVID-19 cases in India are collected from various sources [32,33, 34,35]. The dataset starts from January 30, 2020 and recorded cases till June 9, 2020. The dataset is split for training and validation for the Seq2Seq model. For training, we used cases from January 30, 2020 till May 9, 2020. The validation of the model is done using data from 10th January to June 8. The adaptive SEIR curve is fitted to the available dataset. The data is also used to train and test LSTM and support vector regression (SVR) model. The RMSE and root mean squared log error values are presented in Table 3.Table 3Comparison of adaptive SEIR and Seq2Seq modelsModelRMSERMSLESeq2Seq0.694360.023Adaptive SEIR1316.25751.81LSTM954.280.029SVR6843.241.69The prediction is done from June, 9 to June 28, 2020. The number of cases predicted by the SVR, LSTM, and adaptive SEIR and Seq2Seq models is shown in Figure 6. The projected values of the cumulative number of COVID-19 cases by SEIR and sequence-to-sequence model are 528557 and 470783, respectively.Figure 6Prediction of total number of COVID-19 cases in India.The number of new cases after June 8, 2020, is depicted in Figure 7. The Seq2Seq model successfully identifies the ups and downs in the number of daily counts of patients. The models predict a value of more than 13,000 COVID-19 by the last week of June.Figure 7Prediction of day-wise new cases of COVID-19 cases in India.The cumulative number of predicted death cases due to COVID-19 is predicted in Figure 8. The first death in India was reported on March 12, 2020.Figure 8Prediction of cumulative number of death cases caused by COVID-19 in India.4.4DiscussionFrom the results shown, it is seen that the adaptive SEIR model cannot properly depict the actual pandemic curve unlike deep learning algorithms. The simplicity of the disease model causes it to discard information over the span of time. This causes severe lack of ability of epidemiological model to reproduce the pattern of a real pandemic outbreak. The adaptive SEIR model used in this work is able to show the overall pattern of the graph but fails to identify the exact points of daily increase (or decrease) in the number of COVID-19 patients. Though the modified SEIR model showed the change in the curve during periods of restrictions (lockdown) and the effect of the unlocking phase, the RMSE value of the Seq2Seq model proves a more precise prediction. Seq2Seq converts one data sequence into one more form of sequence. This conversion is achieved by the use of GRU or LSTM encoder–decoder. The context for every input is the output from the earlier stage. The encoder stage converts an input and its context into a hidden vector. The decoder converts the hidden vector into corresponding output data, using the prior output. The Seq2Seq model thus has the capacity to predict a time series data, by studying the influence of past record on present input and output and predict the output sequence. It outperforms other machine learning techniques like LSTM and SVR. The COVID-19 dataset is used to train the LSTM encoder–decoder model to predict the future trend. However, size of the dataset was small, and the accuracy of the deep learning model may be enhanced with more training data. Due to the enhanced learning capacity of the model, the Seq2Seq model has a higher accuracy than the SEIR model and has an RMSE value of 0.69436 as opposed to 1316.2575 of SEIR.5ConclusionThe restrictions imposed by Indian government have curbed the exponential growth of SARS-CoV-2 transmission. However, these measures could not flatten the curve as expected because of practical challenges. With the present economic situation, extending the lockdown beyond May 17, 2020, can lead to severe financial implication for the nation and its people. Moreover, people may violate the norms of lockdown and isolation, heightening the chance of community transmission. The study presented here predicted the pattern of the COVID-19 outbreak course and provided the comparison of a disease model and an LSTM-based encoder–decoder model. Modification has been made to the existing SEIR model in terms of the Indian situation. One very important fact in India is the lack of testing kits, and this has made it difficult to detect asymptomatic cases. These asymptomatic population is, however, capable of infecting other susceptible population. The lockdown and social isolation definitely contributed to the decay in the growth of the disease spread pattern. The data of COVID-19 cases obtained are small in dimension, which might be insufficient to fully estimate the underlying disease spread rate. The proposed models were deployed to predict the data for 20 days using the epidemiological model and deep neural networks, respectively. The models were analyzed based on the results, and it is concluded that epidemiological models cannot precisely follow the trends of a disease spread curve and can predict a curve depending on the disease information. The prime reason behind this behavior of the disease model can be attributed to the fact that it cannot save information over a time-length. On the contrary, the LSTM encoder–decoder model has the capability to emphasize past relevant information. The Bahdanau mechanism enables one to focus on the most vital source information. This further enhances the capacity of the network to predict the pandemic curve precisely. The sequence-to-sequence model with attention can be used to forecast the COVID-19 spread curve and take precautionary measures. The change in the future transmission of COVID-19 extensively depends on the government policies and the extent of social restrictions followed by people. The dynamic of COVID-19 is varying frequently due to changing social situations. The outcome of the research shown here relies on the current data and rules imposed by the government. The actual count may differ if there is a change in the current the socio-economic condition.

Open Computer Science – de Gruyter

**Published: ** Jan 1, 2022

**Keywords: **COVID-19; sequence to sequence; SEIR; transmission rate

Loading...

You can share this free article with as many people as you like with the url below! We hope you enjoy this feature!

Read and print from thousands of top scholarly journals.

System error. Please try again!

Already have an account? Log in

Bookmark this article. You can see your Bookmarks on your DeepDyve Library.

To save an article, **log in** first, or **sign up** for a DeepDyve account if you don’t already have one.

Copy and paste the desired citation format or use the link below to download a file formatted for EndNote

Access the full text.

Sign up today, get DeepDyve free for 14 days.

All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.