Open Access   Article Go Back

Data Based Model for Predicting COVID-19 Incidence Using Data Mining

Rachana Yadav1 , Amit Kumar Manjhwar2

Section:Research Paper, Product Type: Journal Paper
Volume-10 , Issue-5 , Page no. 65-73, May-2022

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v10i5.6573

Online published on May 31, 2022

Copyright © Rachana Yadav, Amit Kumar Manjhwar . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Rachana Yadav, Amit Kumar Manjhwar, “Data Based Model for Predicting COVID-19 Incidence Using Data Mining,” International Journal of Computer Sciences and Engineering, Vol.10, Issue.5, pp.65-73, 2022.

MLA Style Citation: Rachana Yadav, Amit Kumar Manjhwar "Data Based Model for Predicting COVID-19 Incidence Using Data Mining." International Journal of Computer Sciences and Engineering 10.5 (2022): 65-73.

APA Style Citation: Rachana Yadav, Amit Kumar Manjhwar, (2022). Data Based Model for Predicting COVID-19 Incidence Using Data Mining. International Journal of Computer Sciences and Engineering, 10(5), 65-73.

BibTex Style Citation:
@article{Yadav_2022,
author = {Rachana Yadav, Amit Kumar Manjhwar},
title = {Data Based Model for Predicting COVID-19 Incidence Using Data Mining},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2022},
volume = {10},
Issue = {5},
month = {5},
year = {2022},
issn = {2347-2693},
pages = {65-73},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=5470},
doi = {https://doi.org/10.26438/ijcse/v10i5.6573}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v10i5.6573}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=5470
TI - Data Based Model for Predicting COVID-19 Incidence Using Data Mining
T2 - International Journal of Computer Sciences and Engineering
AU - Rachana Yadav, Amit Kumar Manjhwar
PY - 2022
DA - 2022/05/31
PB - IJCSE, Indore, INDIA
SP - 65-73
IS - 5
VL - 10
SN - 2347-2693
ER -

VIEWS PDF XML
162 285 downloads 135 downloads
  
  
           

Abstract

Since, covid19 is affecting many countries in the world therefore, to take the necessary steps in order to control the outbreak or incidence of covid19, which is possible if we know the outbreak or incidence of covid19 which is possible with machine learning. Therefore, in this study we are analyzing the covid19 data of India and performing EDA (exploratory data analysis) and proposing various machine learning algorithm in order to predict the outbreak of covid19. We are using various machine learning algorithms like Linear regression, Gaussian naïve bayes, Decision tree and ensemble learning like random forest, gradient boosting and then finding the best algorithm by comparing their accuracy score. With the help of best algorithm, the outbreak of covid19 to manage the health crisis in each country can be controlled by taking the essential steps.

Key-Words / Index Term

Covid19,Machine Learning, EDA (Exploratory data analysis), Linear Regression, Gaussian Naïve Bayes, Decision Tree, Ensemble learning, Random forest, Gradient boosting

References

[1] Cucinotta D, Vanelli M. WHO Declares COVID-19 a Pandemic. Acta Biomed. 91:157–160, 2020.
[2] Zhao Y, Shou M, Wang Z. Prediction of the Number of Patients Infected with COVID-19 Based on Rolling Grey Verhulst Models. Int J Environ Res Public Health. 17(12):4582, 2020.
[3] Leeuwenberg AM, Schuit E. Prediction models for COVID-19 clinical decision making. The Lancet Digital Health. 2:e496–e497, 2020.
[4] M. I. Jordan1 and T. M. Mitchell, Machine learning: Trends, perspectives, and prospects (sciencemag.org) 2019.
[5] Godio A, Pace F, Vergnano A. SEIR Modeling of the Italian Epidemic of SARS-CoV-2 Using Computational Swarm Intelligence. Int J Environ Res Public Health. 17(10):3535, 2020.
[6] Sharma, V.K., Nigam, U.: Modelling and forecasting of COVID-19 growth curve in India. Trans. Indian Nat. Acad. Eng. 5, 697–710, 2020.
[7] Saeed, S., Humayun, M.: Quantitative analysis of COVID-19 patients: a preliminary statistical result of deep learning artificial intelligence framework. In: Book Series: ICT Solutions for Improving Smart Communities in Asia, IGI Gobal, pp. 218–242, 2021.
[8] Ghosal, S., Sengupta, S., Majumder, M., Sinha, B.: Linear regression analysis to predict the number of deaths in India due to SARS-CoV-2 at 6 weeks from day 0 (100 cases - March 14th 2020). Diabetes Metab. Syndr. Clin. Res. Rev. 14(January), 311–315, 2020.
[9] Rushing J, Ramachandran R, Nair U, Graves S, Welch R, Lin H. ADaM: a data mining toolkit for scientists and engineers. Computers & Geosciences 31(5):607-618, Jun. 2005.
[10] Dangare CS, Apte SS. Improved Study of Heart Disease Prediction System using Data Mining Classification Techniques. IJCA 30;47(10):44-48, Jun 2012.
[11] R.S. Yadav, Data analysis of COVID-2019 epidemic using machine learning methods: a case study of India. Int. J. Inf. Technol., pp. 1-10, 2020.
[12] S.F. Ardabili, A. Mosavi, P. Ghamisi, F. Ferdinand, A.R. Varkonyi-Koczy, U. Reuter, et al., COVID-19 outbreak prediction with machine learning, medRxiv (2020), 2020.04.17.20070094
[13] P. Bedi, S. Dhiman, P. Gole, N. Gupta, V. Jindal, Prediction of COVID-19 trend in India and its four worst-affected states using modified SEIRD and LSTM models, SN Comput. Sci., 2(3), p. 224, 2021.
[14] T. Chakraborty, A. Bhattacharyya, M. Pattnaik, Theta autoregressive neural network model for COVID-19 outbreak predictions, medRxiv (2020), 2020.10.01.20205021
[15] A. Chatterjee, M.W. Gerdes, S.G. Martinez, Statistical explorations and univariate timeseries analysis on COVID-19 datasets to understand the trend of disease spreading and death, Sensors, 20(11), 2020.
[16] N.P. Dharani, P. Bojja, P. Raja Kumari, Evaluation of performance of an LR and SVR models to predict COVID-19 pandemic, Mater. Today Proc. 2021.
[17] S.W. Doe, T.R. Seekins, D. Fitzpatrick, D. Blanchard, S.Y. Sekeh, Adaptive County Level COVID-19 Forecast Models: Analysis and Improvement (2020), arXiv preprint arXiv:200612617
[18] S. Ganiny, O. Nisar, Mathematical modeling and a month ahead forecast of the coronavirus disease 2019 (COVID-19) pandemic: an Indian scenario, medRxiv (2020), 2020.09.10.20192195
[19] K.K.A. Ghany, H.M. Zawbaa, H.M. Sabri, COVID-19 prediction using LSTM algorithm: GCC case study, Inform. Med. Unlocked, 23, p. 100566, 2021.
[20] N.M. Ghazaly, M.A. Abdel-Fattah, A.A. Abd El-Aziz, Novel coronavirus forecasting model using nonlinear autoregressive artificial neural network, Int. J. Adv. Sci. Technol., 29, 5 Special Issue, pp. 1831-1849, 2020.
[21] Q. Guo, Z. He, Prediction of the confirmed cases and deaths of global COVID-19 using artificial intelligence, Environ. Sci. Pollut. Res. Int., 28 (9), pp. 11672-11682, 2021.
[22] K.T. Hasan, M.M. Rahman, M.M. Ahmmed, A.A. Chowdhury, M.K. Islam, 4P model for dynamic prediction of COVID-19: a statistical and machine learning approach, Cognit. Comput., pp. 1-14, 2021.
[23] B.B. Hazarika, D. Gupta Modelling and forecasting of COVID-19 spread using wavelet-coupled random vector functional link networks, Appl. Soft Comput., 96, p. 106626, 2020.
[24] M. Hawas, Generated time-series prediction data of COVID-19`s daily infections in Brazil by using recurrent neural networks, Data Brief, p.106175, 2020.
[25] B. Heni, COVID-19, Bacille Calmette-Guerin (BCG) and tuberculosis: cases and recovery previsions with deep learning sequence prediction, Ingénierie Des. Systèmes Inf., 25 (2), pp. 165-172, 2020.
[26] A.-E.E. Hridoy, M. Naim, N.U. Emon, I.H. Tipo, S. Alam, A. Al Mamun, et al., Forecasting COVID-19 dynamics and endpoint in Bangladesh: a data-driven approach, medRxiv (2020), 2020.06.26.20140905