Open Access   Article Go Back

Infrequent Weighted Itemset Mining for Large Dataset

R.B.M. Sayyad1 , P.S. Yalagi2

  1. Department of CSE, Walchand Institute of Technology (Solapur University), Solapur, India.
  2. Department of CSE, Walchand Institute of Technology (Solapur University), Solapur, India.

Correspondence should be addressed to: sayyadriz@gmail.com.

Section:Research Paper, Product Type: Journal Paper
Volume-5 , Issue-6 , Page no. 149-153, Jun-2017

Online published on Jun 30, 2017

Copyright © R.B.M. Sayyad, P.S. Yalagi . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: R.B.M. Sayyad, P.S. Yalagi, “Infrequent Weighted Itemset Mining for Large Dataset,” International Journal of Computer Sciences and Engineering, Vol.5, Issue.6, pp.149-153, 2017.

MLA Style Citation: R.B.M. Sayyad, P.S. Yalagi "Infrequent Weighted Itemset Mining for Large Dataset." International Journal of Computer Sciences and Engineering 5.6 (2017): 149-153.

APA Style Citation: R.B.M. Sayyad, P.S. Yalagi, (2017). Infrequent Weighted Itemset Mining for Large Dataset. International Journal of Computer Sciences and Engineering, 5(6), 149-153.

BibTex Style Citation:
@article{Sayyad_2017,
author = {R.B.M. Sayyad, P.S. Yalagi},
title = {Infrequent Weighted Itemset Mining for Large Dataset},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {6 2017},
volume = {5},
Issue = {6},
month = {6},
year = {2017},
issn = {2347-2693},
pages = {149-153},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=1317},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=1317
TI - Infrequent Weighted Itemset Mining for Large Dataset
T2 - International Journal of Computer Sciences and Engineering
AU - R.B.M. Sayyad, P.S. Yalagi
PY - 2017
DA - 2017/06/30
PB - IJCSE, Indore, INDIA
SP - 149-153
IS - 6
VL - 5
SN - 2347-2693
ER -

VIEWS PDF XML
535 358 downloads 589 downloads
  
  
           

Abstract

Data mining is the process of analysing data from many different perspectives or dimensions, categorize it and finally summarize it into useful information. This information can be used to increase profits, cut costs, or both. Data mining software is used for analysing data. It allows users to analyse data from many different perspectives, categorize it, and summarize the relationships discovered. Specially, data mining is the way of extracting valuable correlations or patterns among many number of fields in large relational databases. Pattern mining has become an important task in data mining. Mining frequent and infrequent itemsets from a dataset is the most important field of data mining. Mining frequent itemset is very expensive when minimum support threshold is low, and when a minimum support threshold is high mining in frequent itemsets is highly expensive. The proposed system uses multiple level minimum supports to constrain infrequent itemsets by giving different minimum supports to itemsets with different length in order to mine a number of infrequent itemsets in an appropriate degree. In this paper, we are implementing the concept of infrequent weighted itemset mining based on Hadoop-MapReduce model, which can handle massive datasets in mining in frequent itemsets, in that we proposed two novel algorithms based on IWI Miner, IWI Miner to drive the IWI mining process. This paper emphasis on the issue of discovering those itemsets which occurs rarely in large dataset called infrequent weighted itemset (IWI) mining problem.

Key-Words / Index Term

Data Mining, frequent Itemset, Infrequent Itemset, Weighted Itemset, Hadoop , MapReduce

References

[1] Aruna J. Chamatkar and P.K. Butey , "Comparison on Different Data Mining Algorithms", International Journal of Computer Sciences and Engineering, Vol.2, Issue.10, pp.54-58, 2014.
[2] Akilandeswari. S and A.V.Senthil Kumar, "A Novel Low Utility Based Infrequent Weighted Itemset Mining Approach Using Frequent Pattern", International Journal of Computer Sciences and Engineering, Vol.3, Issue.7, pp.181-185, 2015.
[3] Jeffery Dean and Sanjay Ghemawat, “MapReduce: simplified data processing on large clusters”, Communications of the ACM, Vol. 51, No.1, 2008, pp. 107-113.
[4] Dong, Z Zheng, Z Niu and Q Jiam,” Mining infrequent itemset based on multiple level minimum supports”, 2nd Int. Conf. on Innovative Computing, Information Control, 2007.
[5] He Jiang, Xiumei Luan, Xiangjun Dong,” Mining Weighted Negative Association Rules from Infrequent Itemsets Based on Multiple Supports”, 978-0-7695-4792-3/12 $26.00 © 2012 IEEE 2012 International Conference on Industrial Control and Electronics Engineering.
[6] A. Gupta, A. Mittal, and A. Bhattacharya, “Minimally Infrequent Itemset Mining Using Pattern-Growth Paradigm and Residual Trees”, Proc. Int’l Conf. Management of Data (COMAD), pp. 57-68, 2011.
[7] T Ramakrishnudu, R B V Subramanyam,” Mining Interesting Infrequent Itemsets from Very Large Data based on MapReduce Framework”, I.J. Intelligent Systems and Applications, 2015, 07, 44-49.
[8] David J. Haglin and Anna M. Manning, “On Minimal Infrequent Itemset Mining”.
[9] K. Sun and F. Bai, “Mining Weighted Association Rules Without Preassigned Weights,” IEEE Trans. Knowledge and Data Eng., vol. 20, no. 4, pp. 489-495, Apr. 2008.
[10] Ling Zhou, Stephen Yau ∗,” Efficient association rule mining among both frequent and infrequent items”, Computers and Mathematics with Applications 54 (2007) 737–749.
[11] J.Jaya1, S.V.Hemalatha2,” A Survey of Frequent and Infrequent Weighted Itemset Mining Approaches”.
[12] He Jiang, Xiumei Luan, Xiangjun Dong,” Mining Weighted Negative Association Rules from Infrequent Itemsets Based on Multiple Supports”, 978-0-7695-4792-3/12 $26.00 © 2012 IEEE 2012 International Conference on Industrial Control and Electronics Engineering.
[13] Junfeng Ding, Stephen S.T. Yau, “TCOM, an innovative data structure for mining association rules among infrequent items”, Computers and Mathematics with Applications, Vol. 57, No. 2, January 2009, pp. 290-301.
[14] Guru Prasad M.S., Nagesh H.R., Swathi Prabhu, "An Efficient Approach to Optimize the Performance of Massive Small Files in Hadoop MapReduce Framework", International Journal of Computer Sciences and Engineering, Vol.5, Issue.6, pp.112-120, 2017.
[15] Nidhi Sethi and Pradeep Sharma, "Mining Frequent Pattern from Large Dynamic Database Using Compacting Data Sets", International Journal of Scientific Research in Computer Science and Engineering, Vol.1, Issue.3, pp.31-34, 2013.