Open Access   Article Go Back

A Comparative Study on K-Means and Genetic Algorithm

P. Dheivanai1 , P. Sundari2

Section:Survey Paper, Product Type: Journal Paper
Volume-06 , Issue-11 , Page no. 189-195, Dec-2018

Online published on Dec 31, 2018

Copyright © P. Dheivanai, P. Sundari . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: P. Dheivanai, P. Sundari, “A Comparative Study on K-Means and Genetic Algorithm,” International Journal of Computer Sciences and Engineering, Vol.06, Issue.11, pp.189-195, 2018.

MLA Style Citation: P. Dheivanai, P. Sundari "A Comparative Study on K-Means and Genetic Algorithm." International Journal of Computer Sciences and Engineering 06.11 (2018): 189-195.

APA Style Citation: P. Dheivanai, P. Sundari, (2018). A Comparative Study on K-Means and Genetic Algorithm. International Journal of Computer Sciences and Engineering, 06(11), 189-195.

BibTex Style Citation:
@article{Dheivanai_2018,
author = {P. Dheivanai, P. Sundari},
title = {A Comparative Study on K-Means and Genetic Algorithm},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {12 2018},
volume = {06},
Issue = {11},
month = {12},
year = {2018},
issn = {2347-2693},
pages = {189-195},
url = {https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=569},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=569
TI - A Comparative Study on K-Means and Genetic Algorithm
T2 - International Journal of Computer Sciences and Engineering
AU - P. Dheivanai, P. Sundari
PY - 2018
DA - 2018/12/31
PB - IJCSE, Indore, INDIA
SP - 189-195
IS - 11
VL - 06
SN - 2347-2693
ER -

           

Abstract

Data mining is the process of analyze data from different perspectives and summarizing it into useful information. Clustering is a practical unsupervised data mining task that subdivides an input data set into a desired number of subgroups so that members will have high similarity and the member of different groups have large differences. K-means is a usually used partitioning based clustering technique that tries to find a user specified number of clusters (k), which are represent by their centroids, by minimizing the square error function. Although K-means is easy and can be used for a wide variety of data types, it is rather sensitive to initial positions of cluster centers. There are 2 simple approach to cluster center initialization i.e. either to select the initial values at random, or to select the first k samples of the data points. Both approach cause the algorithm to converge to sub optimal solutions. Genetic algorithm one of the usually used evolutionary algorithms performs global search to find the solution to a clustering problem. The techniques typically starts with a set of randomly generated individuals called the population and creates successive, latest generations of the population by genetic operations such as natural selection, crossover, and mutation. Each one chromosome of the population represent K no. of centroids. Steps of genetic algorithm are repeatedly applied for a no. of generations to search for suitable cluster centers in the feature space such that a similarity metric of the resultant clusters is optimized. K-means and genetic algorithm based data clustering have been compared in this paper on the basis of their functioning principle, advantage and disadvantage with proper example.

Key-Words / Index Term

Data mining, K-means, Genetic algorithm

References

[1] Nikita Jain,Vishal Srivastava, “Data Mining techniques : A survey paper” , International Journal of Research in Engineerning and Technology, pp. 116-119, 2013.
[2] M.S.B PhridviRaj, C.V. GuruRao, “Data Mining – Past present and future data streams,” Elsevier, pp. 256-264, 2013.
[3] K.Kameshwaran, K. Malarvizhi, “Survey on Clustering Techniques in Data Mining,” International Journal of Computer Science and Information Technologies, pp.2272-2276, 2014.
[4] Gunjan Verma, Vineeta Verma, “Role and Application of Genetic Algorithm in Data Mining,” International Journal of Computer Application, pp. 5-8, 2012.
[5] Sharaf Ansari,Sailendra Chetlur, Srikanth Prabhu, N. Gopalakrishna Kini, Govardhan Hegde, Yusuf Hyder, “An Overview of Clustering Analysis Techniques used in Data Mining ,” International Journal of Emerging Technology
[6] Aastha Joshi, Rajneet Kaur, “ A Review: Comparative Study of Various Clustering Techniques in Data Mining,” International Journal of Advanced Research in Computer Science and Software Engineering, pp.55-57,2013.
[7] Manoj Kumar, Mohammad Husian, Naveen Upreti, Deepti Gupta, Genetic Algorithm “: Review and Application,” International Journal of Information Technology and Knowledge Management, pp.451-454, 2010.
[8] L.E. Agustın-Blas, S. Salcedo-Sanz, S. Jimenez-Fernandez, L. Carro- Calvo, J. Del Ser, J.A. Portilla-Figueras K. Elissa, “A new grouping genetic algorithm for clustering problems,” Elsevier, pp.9695-9703, 2012.
[9] Honga Tzung-Pei, Chun-Hao Chenc, Feng-Shih Lin, “Using group genetic algorithm to improve performance of attribute clustering,” Elsevier, pp.1-8, 2015.
[10] Danial Gomes Ferrari, Leandro Numes de Castro, “ Clustering algorithm selection by meta-learning systems: A new distance based problems characterization and ranking combination methods,” Elsevier, pp.181-194, 2015.
[11] Rajashree Dash and Rasmita Dash, “Comparative analysis of K-means and Genetic algorithm based data clustering,” International Journal of Advanced Computer and Mathematical Sciences, pp.257-265, 2012.
[12] Edvin Aldana-Bobadhilla, Angel Kuri-Morales, “A Clustering based method on the maximum entropy principle,” Entropy Article, pp. 151-180, 2015.
[13] Kannuri Lahari, M. Ramakrishna Murty, and Suresh C. Satapathy, “Prediction based clustering using genetic algorithm and Learning Based Optimization Performance Analysis,” Advances in Intelligent Systems and Computing,” pp. 338, 2015.
[14] Rahila H. Sheikh, M. M.Raghuwanshi, Anil N. Jaiswal, “Genetic algorithm based clustering: A Survey,” IEEE, pp.314-319, 2008.
[15] K.Arun Prabha, R.Saranya, “Refinement of K-means clustering using Genetic algorithm,” Journal of Computer Application, pp. 256-261, 2011.
[16] M.Anusha and J.G.R.Sathiaseelan, “An Enhanced K-means Genetic Algorithms for Optimal Clustering”, IEEE, pp.580-584, 2014.
[17] M.Anusha and J.G.R.Sathiaseelan, “An Improved K-Means Genetic Algorithm for Multi-objective Optimization”, International Journal of Applied Engineering Research, pp. 228-231, 2015.