Improved K-means clustering on Hadoop

Kaustubh Chaturbhuj, Gauri Chaudhary

doi:10.17762/ijritcc.v4i4.2062

Published: Apr 30, 2016

Abstract

Clustering is a partitioning method that groups items with similar attributes. Data volumes have grown rapidly in recent years, which makes data analysis by clustering increasingly difficult. K-means is a traditional clustering method. It is easy to implement and scalable, but it can get stuck in local minima and is sensitive to the initial cluster centroids. Particle swarm optimization (PSO) is a clustering algorithm that mimics swarm behavior and moves candidate solutions according to each particle's velocity, but it needs a large number of iterations. We therefore use PSO to find the initial cluster centers and then use these centroids for K-means clustering, which runs in parallel on Hadoop. Hadoop is used to handle large datasets. We aim to find global clusters in a limited number of iterations.
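
The two-stage idea in the abstract can be illustrated with a minimal single-machine sketch. It is not the authors' implementation. The function names, the PSO parameters (`w`, `c1`, `c2`, `n_particles`, `n_iters`), and the use of the sum of squared errors as the fitness function are all assumptions made for illustration. The K-means step is written as separate map and reduce functions only to suggest how it might be split across Hadoop workers; no Hadoop API is used here.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def sse(points, centroids):
    """Sum of squared distances from each point to its nearest centroid."""
    return sum(min(sq_dist(p, c) for c in centroids) for p in points)

def pso_initial_centroids(points, k, n_particles=10, n_iters=20,
                          w=0.7, c1=1.5, c2=1.5, seed=0):
    """Short PSO run; each particle is a flat vector encoding k centroids."""
    rng = random.Random(seed)
    dim = len(points[0])

    def to_centroids(pos):
        return [tuple(pos[i * dim:(i + 1) * dim]) for i in range(k)]

    # Start every particle from k randomly sampled data points.
    positions = [[x for p in rng.sample(points, k) for x in p]
                 for _ in range(n_particles)]
    velocities = [[0.0] * (k * dim) for _ in range(n_particles)]
    pbest = [pos[:] for pos in positions]
    pbest_cost = [sse(points, to_centroids(pos)) for pos in positions]
    g = min(range(n_particles), key=pbest_cost.__getitem__)
    gbest, gbest_cost = pbest[g][:], pbest_cost[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(k * dim):
                r1, r2 = rng.random(), rng.random()
                # Standard velocity update: inertia + personal pull + global pull.
                velocities[i][d] = (w * velocities[i][d]
                                    + c1 * r1 * (pbest[i][d] - positions[i][d])
                                    + c2 * r2 * (gbest[d] - positions[i][d]))
                positions[i][d] += velocities[i][d]
            cost = sse(points, to_centroids(positions[i]))
            if cost < pbest_cost[i]:
                pbest[i], pbest_cost[i] = positions[i][:], cost
                if cost < gbest_cost:
                    gbest, gbest_cost = positions[i][:], cost
    return to_centroids(gbest)

def kmeans_map(point, centroids):
    """Map step: emit (index of nearest centroid, point)."""
    idx = min(range(len(centroids)),
              key=lambda j: sq_dist(point, centroids[j]))
    return idx, point

def kmeans_reduce(assigned):
    """Reduce step: average the points assigned to one centroid."""
    n = len(assigned)
    return tuple(sum(col) / n for col in zip(*assigned))

def kmeans(points, centroids, max_iters=20):
    """Iterate map and reduce until the centroids stop changing."""
    for _ in range(max_iters):
        groups = {}
        for p in points:
            idx, pt = kmeans_map(p, centroids)
            groups.setdefault(idx, []).append(pt)
        # A centroid with no assigned points keeps its previous position.
        new = [kmeans_reduce(groups[j]) if j in groups else centroids[j]
               for j in range(len(centroids))]
        if new == centroids:
            break
        centroids = new
    return centroids
```

A typical call would be `kmeans(points, pso_initial_centroids(points, k=2))`, where `points` is a list of equal-length numeric tuples. In a real Hadoop job the map step would run on each data split and the reduce step once per centroid index, with the driver repeating the job until the centroids converge.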

How to Cite

Chaturbhuj, K., & Chaudhary, G. (2016). Improved K-means clustering on Hadoop. International Journal on Recent and Innovation Trends in Computing and Communication, 4(4), 601–604. https://doi.org/10.17762/ijritcc.v4i4.2062

Issue

Vol. 4 No. 4 (2016): April (2016) Issue

Section

Articles
