Open Access   Article Go Back

A Memory Efficient Implementation of Frequent Itemset Mining with Vertical Data Format Approach

P. Sumathi1 , S. Murugan2

Section:Research Paper, Product Type: Journal Paper
Volume-06 , Issue-11 , Page no. 152-157, Dec-2018

Online published on Dec 31, 2018

Copyright © P. Sumathi, S. Murugan . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: P. Sumathi, S. Murugan, “A Memory Efficient Implementation of Frequent Itemset Mining with Vertical Data Format Approach,” International Journal of Computer Sciences and Engineering, Vol.06, Issue.11, pp.152-157, 2018.

MLA Style Citation: P. Sumathi, S. Murugan "A Memory Efficient Implementation of Frequent Itemset Mining with Vertical Data Format Approach." International Journal of Computer Sciences and Engineering 06.11 (2018): 152-157.

APA Style Citation: P. Sumathi, S. Murugan, (2018). A Memory Efficient Implementation of Frequent Itemset Mining with Vertical Data Format Approach. International Journal of Computer Sciences and Engineering, 06(11), 152-157.

BibTex Style Citation:
@article{Sumathi_2018,
author = {P. Sumathi, S. Murugan},
title = {A Memory Efficient Implementation of Frequent Itemset Mining with Vertical Data Format Approach},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {12 2018},
volume = {06},
Issue = {11},
month = {12},
year = {2018},
issn = {2347-2693},
pages = {152-157},
url = {https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=561},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=561
TI - A Memory Efficient Implementation of Frequent Itemset Mining with Vertical Data Format Approach
T2 - International Journal of Computer Sciences and Engineering
AU - P. Sumathi, S. Murugan
PY - 2018
DA - 2018/12/31
PB - IJCSE, Indore, INDIA
SP - 152-157
IS - 11
VL - 06
SN - 2347-2693
ER -

           

Abstract

Data mining is the process of extracting the concealed information and rules from large databases. But the real world datasets are sparse, dirt and also contain hundreds of items. Frequent Pattern Mining (FPM) is one of the most intensive problems in discovering frequent itemsets from such datasets. Apriori is one of the premier and classical data mining algorithms for finding frequent patterns but it is not an optimized one. So over last two decades a remarkable variations and improvements were made to overcome the inefficiencies of Apriori algorithm such as FPGrowth, TreeProjection, Charm, LCM, Eclat and Direct Hashing and Pruning (DHP), RARM, ASPMS etc., In any case, a little enhancement in the algorithm improves the mining process considerably. Frequent item set mining with vertical data format has been proposed as an improvement over the basic Apriori which reduces the number of database scans and uses array storage structure. This research paper has proposed a space efficient implementation of finding frequent itemsets with vertical data format using jagged array and reduces the usage of memory by allocating exact memory. An experiment is done between the different implementation of vertical data format approaches viz., array, and jagged array representation. From the experiment it is proved that the proposed jagged array implementation method utilizes the memory effectively when compared with the traditional multidimensional array.

Key-Words / Index Term

Frequent Pattern Mining, Apriori, FPGrowth, Eclat, Vertical Data Format, Array, and Jagged Array

References

[1]. Liu, Y., Liao, W. K., Choudhary, A. N., & Li, J. (2008). Parallel Data Mining Algorithms for Association Rules and Clustering, In Intl. Conf. on Management of Data, pp.1-25.
[2]. Kumar, G. V., Sreedevi, M., & Kumar, N. P. (2012). Mining Regular Patterns in Data Streams Using Vertical Format. International Journal of Computer Science and Security (IJCSS), 6(2), pp.142-149.
[3]. Ravikiran, D., & Srinivasu, S. V. N. (2016). Regular Pattern Mining on Crime Data Set using Vertical Data Format. International Journal of Computer Applications, 143(13).
[4]. Singla, V. (2016). A Review: Frequent Pattern Mining Techniques in Static and Stream Data Environment. Indian Journal of Science and Technology, 9(45), pp.1-7.
[5]. Ishita, R., & Rathod, A. (2016). Frequent Itemset Mining in Data Mining: A Survey. International Journal of Computer Applications, 139(9).
[6]. Guo, Y. M., & Wang, Z. J. (2010, March). A vertical format algorithm for mining frequent item sets. In Advanced Computer Control (ICACC), 2010 2nd International Conference on (Vol. 4, pp. 11-13). IEEE.
[7]. Han, J., Kamber, M. Data Mining Concepts and Techniques, Morgan Kaufmann Publishers, 2006.
[8]. S.Sharmila, Dr. S.Vijayarani. (2017). Frequent Itemset Mining and Association Rule Generation using Enhanced Apriori and Enhanced Eclat Algorithms, International Journal of Innovative Research in Computer and Communication Engineering, 5(4), pp. 679- 6804.
[9]. Zhiyong Ma, Juncheng Yang, Taixia Zhang and Fan Liu. (2016). An Improved Eclat Algorithm for Mining Association Rules Based on Increased Search Strategy, International Journal of Database Theory and Application, 9(5), pp.251-266.
[10]. C.Ganesh, B.Sathiyabhama and T.Geetha. (2016). Fast Frequent Pattern Mining Using Vertical Data Format for Knowledge Discovery, International Journal of Emerging Research in Management &Technology, 5(5), pp.141-149.
[11]. Hosny M. Ibrahim, M.H. Marghny and Noha M.A. Abdelaziz. (2015). Fast Vertical Mining Using Boolean Algebra, International Journal of Advanced Computer Science and Applications, 6(1), pp.89-96.
[12]. Mohammed J. Zaki amd Karam Gouda. (2003), Fast Vertical Mining Using Diffsets SIGKDD ’03, ACM.