Improved K-means clustering on Hadoop

Main Article Content

Kaustubh Chaturbhuj, Gauri Chaudhary

Abstract

Clustering is the portioning method in which we grouped similar attribute items. Recently data grows rapidly so data analysis using clustering getting difficult. K-means is traditional clustering method. K-means is easy to implement and scalable but it suffers from local minima and sensitive to initial cluster centroids. Particle swarm optimization is mimic behavior based clustering algorithm based on particle’s velocity but it suffers from number of iterations. So we use PSO for finding initial cluster center and then use this centroids for K-means clustering which is running parallel on Hadoop. Hadoop is used for large database. We try to find global clusters in limited iterations.

Article Details

How to Cite
, K. C. G. C. (2016). Improved K-means clustering on Hadoop. International Journal on Recent and Innovation Trends in Computing and Communication, 4(4), 601–604. https://doi.org/10.17762/ijritcc.v4i4.2062
Section
Articles