A Survey on Data Deduplication

Shubhanshi Singhal, Naresh Kumar

doi:10.17762/ijritcc.v5i5.653

PDF

Published: May 31, 2017

DOI: https://doi.org/10.17762/ijritcc.v5i5.653

Shubhanshi Singhal, Naresh Kumar

Abstract

Now-a-days, the demand of data storage capacity is increasing drastically. Due to more demands of storage, the computer society is attracting toward cloud storage. Security of data and cost factors are important challenges in cloud storage. A duplicate file not only waste the storage, it also increases the access time. So the detection and removal of duplicate data is an essential task. Data deduplication, an efficient approach to data reduction, has gained increasing attention and popularity in large-scale storage systems. It eliminates redundant data at the file or subfile level and identifies duplicate content by its cryptographically secure hash signature. It is very tricky because neither duplicate files don?t have a common key nor they contain error. There are several approaches to identify and remove redundant data at file and chunk levels. In this paper, the background and key features of data deduplication is covered, then summarize and classify the data deduplication process according to the key workflow.

How to Cite

, S. S. N. K. (2017). A Survey on Data Deduplication. International Journal on Recent and Innovation Trends in Computing and Communication, 5(5), 1045–1052. https://doi.org/10.17762/ijritcc.v5i5.653

Issue

Vol. 5 No. 5 (2017): May (2017) Issue

Section

Articles

Make a Submission

Announcements

Call for Papers

January 5, 2026

Call for Papers for the New Issue.
Last Date of Submission: July 20^th, 2026

Imp. Announcement

April 15, 2022

Dear Authors,
We are feeling proud congratulations to all the contributors of IJRITCC. Because The "International Journal on Recent and Innovation Trends in Computing and Communication" has been accepted for Scopus.

Like, Subscribe and Share This Video