Open Access   Article Go Back

Evaluation of Clustering Algorithm in Data Mining

Arpit Agrawal1

Section:Research Paper, Product Type: Journal Paper
Volume-5 , Issue-9 , Page no. 268-273, Sep-2017

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v5i9.268273

Online published on Sep 30, 2017

Copyright © Arpit Agrawal . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Arpit Agrawal, “Evaluation of Clustering Algorithm in Data Mining,” International Journal of Computer Sciences and Engineering, Vol.5, Issue.9, pp.268-273, 2017.

MLA Style Citation: Arpit Agrawal "Evaluation of Clustering Algorithm in Data Mining." International Journal of Computer Sciences and Engineering 5.9 (2017): 268-273.

APA Style Citation: Arpit Agrawal, (2017). Evaluation of Clustering Algorithm in Data Mining. International Journal of Computer Sciences and Engineering, 5(9), 268-273.

BibTex Style Citation:
@article{Agrawal_2017,
author = {Arpit Agrawal},
title = {Evaluation of Clustering Algorithm in Data Mining},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {9 2017},
volume = {5},
Issue = {9},
month = {9},
year = {2017},
issn = {2347-2693},
pages = {268-273},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=5577},
doi = {https://doi.org/10.26438/ijcse/v5i9.268273}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v5i9.268273}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=5577
TI - Evaluation of Clustering Algorithm in Data Mining
T2 - International Journal of Computer Sciences and Engineering
AU - Arpit Agrawal
PY - 2017
DA - 2017/09/30
PB - IJCSE, Indore, INDIA
SP - 268-273
IS - 9
VL - 5
SN - 2347-2693
ER -

VIEWS PDF XML
1384 457 downloads 74 downloads
  
  
           

Abstract

Text mining is the use of data mining techniques to unstructured text in order to extract important and nontrivial knowledge. One of the key methods of text mining, or the unsupervised classification of related content into various categories, is text clustering. The performance of text clustering is being improved in this study. We looked on four areas of the text clustering algorithms: document representation, document similarity analysis, high dimension reduction, and parallelization. We suggest a collection of very effective text clustering techniques that focus on the special features of unstructured text databases. All of the suggested algorithms have undergone thorough performance studies. We contrasted these techniques with current text clustering algorithms in order to assess their performance.

Key-Words / Index Term

Cluster Algorithm, Data Mining, bisecting k-means, FIHC, CFWS find CFWMS

References

[1] Mohammed J. Zaki. Scalable algorithms for association mining. IEEE Trans. on Knowl. and Data Eng., 12(3):pp 372–390, 2000.
[2] P. Agrawal, O. Benjelloun, A. Das Sarma, C. Hayworth, S. Nabar, T. Sugihara, and J. Widom. "Trio: A system for data, uncertainty, and lineage". In Proc. Int. Conf. on Very Large Databases, 2006.
[3] R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proc. ACM SIGMOD Int. Conf. on Management of Data, Minneapolis, MN, 1994.
[4] T. Bernecker, H.-P. Kriegel, M. Renz, F. Verhein, and A. Züfle. Probabilistic frequent itemset mining in uncertain databases. In In Proc. 15th ACM SIGKDD Conf. on Knowledge Discovery and Data Mining, Paris, France, 2009.
[5] C. C. Aggrawal and P. S. Yu, Finding Generalized Projected Clusters in High Dimensional Spaces," Proc. of ACM SIGMOD Int`l Conf. on Management of Data, 2000, pp. 70{81.
[6] J. Allan, HARD Track Overview in TREC 2003 High Accuracy Retrieval from Documents,"Proc. of the 12th Text Retrieval Conference, 2003, pp. 24{37.
[7] M. R. Anderberg, Cluster Analysis for Applications, Academic Press, 1973.
[8] DPVG06] Nele Dexters, Paul W. Purdom, and Dirk Van Gucht. A probability analysis for candidate-based frequent itemset algorithms. In SAC ’06: Proceedings of the 2006 ACM symposium on Applied computing, New York, NY, USA, 2006. ACM, pp541–545.
[9] Gregory Buehrer, Srinivasan Parthasarathy, and Amol Ghoting. Out-ofcore frequent pattern mining on a commodity pc. In KDD ’06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, New York, NY, USA, 2006, pp 86–95.
[10] Toon Calders. Deducing bounds on the frequency of itemsets. In EDBT Workshop DTDM Database Techniques in Data Mining, 2002.