A Survey on Data Deduplication

Main Article Content

Shubhanshi Singhal, Naresh Kumar

Abstract

Now-a-days, the demand of data storage capacity is increasing drastically. Due to more demands of storage, the computer society is attracting toward cloud storage. Security of data and cost factors are important challenges in cloud storage. A duplicate file not only waste the storage, it also increases the access time. So the detection and removal of duplicate data is an essential task. Data deduplication, an efficient approach to data reduction, has gained increasing attention and popularity in large-scale storage systems. It eliminates redundant data at the file or subfile level and identifies duplicate content by its cryptographically secure hash signature. It is very tricky because neither duplicate files don?t have a common key nor they contain error. There are several approaches to identify and remove redundant data at file and chunk levels. In this paper, the background and key features of data deduplication is covered, then summarize and classify the data deduplication process according to the key workflow.

Article Details

How to Cite
, S. S. N. K. (2017). A Survey on Data Deduplication. International Journal on Recent and Innovation Trends in Computing and Communication, 5(5), 1045–1052. https://doi.org/10.17762/ijritcc.v5i5.653
Section
Articles