Document Clustering with Map Reduce using Hadoop Framework

M. Satish, M. Ramakrishna Murty

doi:10.17762/ijritcc.v3i1.3829

PDF

Published: Jan 31, 2015

DOI: https://doi.org/10.17762/ijritcc.v3i1.3829

M. Satish, M. Ramakrishna Murty

Abstract

Big data is a collection of data sets. It is so enormous and complex that it becomes difficult to processes and analyse using normal database management tools or traditional data processing applications. Big data is having many challenges. The main problem of the big data is store and retrieve of the data from the search engines. Document data is also growing rapidly in the eon of internet. Analysing document data is very important for many applications. Document clustering is the one of the important technique to analyse the document data. It has many applications like organizing large document collection, finding similar documents, recommendation system, duplicate content detection, search optimization. This work is motivated by the reorganization of the need for a well efficient retrieve of the data from massive resources of data repository through the search engines. In this work mainly focused on document clustering for collection of documents in efficient manner using with MapReduce.
DOI: 10.17762/ijritcc2321-8169.150181

How to Cite

, M. S. M. R. M. (2015). Document Clustering with Map Reduce using Hadoop Framework. International Journal on Recent and Innovation Trends in Computing and Communication, 3(1), 409–413. https://doi.org/10.17762/ijritcc.v3i1.3829

Issue

Vol. 3 No. 1 (2015): January (2015) Issue

Section

Articles

Make a Submission

Announcements

Call for Papers

January 5, 2026

Call for Papers for the New Issue.
Last Date of Submission: August 31^th, 2026

Imp. Announcement

April 15, 2022

Dear Authors,
We are feeling proud congratulations to all the contributors of IJRITCC. Because The "International Journal on Recent and Innovation Trends in Computing and Communication" has been accepted for Scopus.

Like, Subscribe and Share This Video