
Survey on De-Duplication Techniques at Public Cloud

Rajani Sajjan, Gayatri Chavan, Vijay R. Ghorpade

Section: Survey Paper, Product Type: Conference Paper
Volume 4, Issue 5, pp. 150-152, May 2016

Online published on May 31, 2016

Copyright © Rajani Sajjan, Gayatri Chavan, Vijay R. Ghorpade. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


How to Cite this Paper


IEEE Style Citation: Rajani Sajjan, Gayatri Chavan, Vijay R. Ghorpade, “Survey on De-Duplication Techniques at Public Cloud,” International Journal of Computer Sciences and Engineering, Vol.4, Issue.5, pp.150-152, 2016.

MLA Style Citation: Rajani Sajjan, Gayatri Chavan, Vijay R. Ghorpade "Survey on De-Duplication Techniques at Public Cloud." International Journal of Computer Sciences and Engineering 4.5 (2016): 150-152.

APA Style Citation: Rajani Sajjan, Gayatri Chavan, Vijay R. Ghorpade, (2016). Survey on De-Duplication Techniques at Public Cloud. International Journal of Computer Sciences and Engineering, 4(5), 150-152.

BibTex Style Citation:
author = { Rajani Sajjan, Gayatri Chavan, Vijay R. Ghorpade},
title = {Survey on De-Duplication Techniques at Public Cloud},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2016},
volume = {4},
Issue = {5},
month = {5},
year = {2016},
issn = {2347-2693},
pages = {150-152},
url = {},
publisher = {IJCSE, Indore, INDIA},

RIS Style Citation:
UR -
TI - Survey on De-Duplication Techniques at Public Cloud
T2 - International Journal of Computer Sciences and Engineering
AU - Rajani Sajjan, Gayatri Chavan, Vijay R. Ghorpade
PY - 2016
DA - 2016/05/31
SP - 150-152
IS - 5
VL - 4
SN - 2347-2693
ER -



Demand for data storage is increasing day by day, and industry analysis indicates that the volume of digital data is growing steadily. A large share of that data is redundant, so much of the available storage is used unnecessarily to keep identical copies. This survey paper introduces various de-duplication techniques for utilizing cloud storage systems efficiently.
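The cost of keeping identical copies can be illustrated with a minimal sketch of file-level de-duplication by content hashing. This is an illustrative assumption, not a technique taken from the surveyed work: the class `DedupStore`, its methods, and the sample file names are all hypothetical, and SHA-256 is chosen here only as a convenient fingerprint.

```python
import hashlib


class DedupStore:
    """Toy content-addressed store (hypothetical): identical payloads are kept once."""

    def __init__(self):
        self.blobs = {}   # digest -> bytes, one physical copy per unique content
        self.files = {}   # file name -> digest (logical reference to a blob)

    def put(self, name, data):
        # Fingerprint the content; equal content yields an equal digest.
        digest = hashlib.sha256(data).hexdigest()
        # Keep the bytes only if this content has not been stored before.
        self.blobs.setdefault(digest, data)
        self.files[name] = digest
        return digest

    def get(self, name):
        # Resolve the logical name to its single physical copy.
        return self.blobs[self.files[name]]

    def stored_bytes(self):
        # Physical storage actually consumed by unique content.
        return sum(len(b) for b in self.blobs.values())


store = DedupStore()
store.put("report_v1.txt", b"quarterly figures")
store.put("report_copy.txt", b"quarterly figures")   # duplicate content
store.put("notes.txt", b"meeting notes")
```

In this sketch three logical files are registered, but only two blobs occupy storage, because the duplicate upload resolves to an existing digest. A real cloud deployment would also have to address the concerns named in the keywords, such as confidentiality and authorization, which this sketch ignores.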

Keywords / Index Terms

Cloud Computing, De-Duplication, Cloud Storage, Data Availability, Data Integrity, Confidentiality, Authorization, Cloud Service Provider

