Prediction and Distribution of Disease Using Hybrid Clustering Algorithm in Big Data


Vinston Raja R, Deepak Kumar A, Prabu Sankar N, Senthamilarasi N, Chenni Kumaran J


The coronavirus disease 2019 (COVID-19) pandemic has caused a global health crisis with a very high risk of spreading and affecting the entire world. New cases have been reported in nearly every country. As the number of tests grows in all countries, manual clustering of COVID-19 and clinical disease data samples becomes tedious and requires highly skilled labor. Recently, several algorithms have been used to cluster clinical datasets deterministically; however, these formulations have not been effective in grouping and analyzing clinical diseases. This work ranks and scores more than 200 countries according to COVID-19 cases and fatalities in 2020 and compares the results with existing pandemic vulnerability prediction models and with results produced by standard data clustering scoring methods. Data clustering is the process of arranging similar data into groups. A clustering algorithm partitions a data set into several clusters such that the similarity within a cluster is greater than the similarity between clusters. This paper proposes a new hybrid clustering algorithm, KMHC, based on the K-Means and hierarchical clustering algorithms. KMHC first divides the countries into groups or segments based on the COVID patient count. Second, clustering models are built across the countries of the world and across the states of India, and their performance is analyzed. The paper also forecasts the future COVID count for India. Using these results, the set of countries with higher COVID counts can be easily visualized and appropriate action can be taken to reduce the count.
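The abstract does not spell out how KMHC combines its two ingredients, so the following is only a minimal sketch of one common way to hybridize K-Means with hierarchical clustering, not the authors' implementation. It assumes that K-Means first splits the case counts into a larger number of small "micro-clusters" and that an agglomerative step then merges the closest micro-clusters until the desired number of tiers remains. The function names, the log scaling, the quantile initialization, and the sample counts are all illustrative assumptions.

```python
import math


def kmeans_1d(values, k, iters=100):
    """Lloyd's k-means on 1-D data with deterministic quantile initialization."""
    k = min(k, len(values))
    ordered = sorted(values)
    # Assumed initialization: evenly spaced quantiles instead of random seeds.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]

    def assign(cents):
        # Assignment step: index of the nearest centroid for every value.
        return [min(range(k), key=lambda c: abs(v - cents[c])) for v in values]

    for _ in range(iters):
        labels = assign(centroids)
        updated = []
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            # Update step: cluster mean (an empty cluster keeps its old centroid).
            updated.append(sum(members) / len(members) if members else centroids[c])
        if updated == centroids:
            break
        centroids = updated
    return centroids, assign(centroids)


def merge_closest(centroids, sizes, n_groups):
    """Agglomerative step: merge the two closest micro-clusters until n_groups remain."""
    groups = [
        {"micro": [i], "center": c, "size": s}
        for i, (c, s) in enumerate(zip(centroids, sizes))
        if s > 0
    ]
    while len(groups) > n_groups:
        pairs = [(i, j) for i in range(len(groups)) for j in range(i + 1, len(groups))]
        a, b = min(pairs, key=lambda p: abs(groups[p[0]]["center"] - groups[p[1]]["center"]))
        ga, gb = groups[a], groups[b]
        size = ga["size"] + gb["size"]
        merged = {
            "micro": ga["micro"] + gb["micro"],
            # Size-weighted mean of the two centers (centroid linkage).
            "center": (ga["center"] * ga["size"] + gb["center"] * gb["size"]) / size,
            "size": size,
        }
        groups = [g for idx, g in enumerate(groups) if idx not in (a, b)] + [merged]
    return groups


def hybrid_cluster(counts, n_micro=6, n_groups=3):
    """Return a tier label per count: 0 = lowest case counts, n_groups - 1 = highest."""
    # Log scale (an assumption) so that very large counts do not dominate distances.
    logs = [math.log10(c + 1) for c in counts]
    centroids, labels = kmeans_1d(logs, n_micro)
    sizes = [labels.count(c) for c in range(len(centroids))]
    groups = sorted(merge_closest(centroids, sizes, n_groups), key=lambda g: g["center"])
    micro_to_tier = {m: tier for tier, g in enumerate(groups) for m in g["micro"]}
    return [micro_to_tier[lab] for lab in labels]


if __name__ == "__main__":
    # Made-up case counts for twelve hypothetical regions, not real data.
    sample = [120, 150, 90, 200, 15000, 18000, 22000, 12000,
              2500000, 3100000, 1900000, 4000000]
    print(hybrid_cluster(sample))
```

With the made-up sample above, the three magnitude bands come out as tiers 0, 1, and 2. Running K-Means first keeps the pairwise merging step cheap, because it operates on a handful of micro-cluster centers rather than on every region.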

Article Details

How to Cite
Vinston Raja R, et al. (2023). Prediction and Distribution of Disease Using Hybrid Clustering Algorithm in Big Data. International Journal on Recent and Innovation Trends in Computing and Communication, 11(10), 91–98.
Author Biography

Vinston Raja R (1), Deepak Kumar A (2), Prabu Sankar N (3), Senthamilarasi N (4), Dr. Chenni Kumaran J (5)

(1) Assistant Professor, Information Technology, Panimalar Engineering College, Chennai, India.

(2) Assistant Professor, Computer Science and Engineering, St. Joseph's Institute of Technology, Chennai, India.

(3) Assistant Professor, Department of Information Technology, Panimalar Engineering College, Chennai, India.

(4) Assistant Professor, Computer Science and Engineering, Sathyabama Institute of Science and Technology.

(5) Professor, Department of Computer Science and Engineering, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences (SIMATS).

