Open Access   Article Go Back

Frequent Mining Techniques In Bigdata : Study

Muthamiz Selvi1 , P. Srivaramangai2

Section:Review Paper, Product Type: Journal Paper
Volume-07 , Issue-02 , Page no. 111-116, Jan-2019

Online published on Jan 31, 2019

Copyright © Muthamiz Selvi, P. Srivaramangai . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Muthamiz Selvi, P. Srivaramangai, “Frequent Mining Techniques In Bigdata : Study,” International Journal of Computer Sciences and Engineering, Vol.07, Issue.02, pp.111-116, 2019.

MLA Style Citation: Muthamiz Selvi, P. Srivaramangai "Frequent Mining Techniques In Bigdata : Study." International Journal of Computer Sciences and Engineering 07.02 (2019): 111-116.

APA Style Citation: Muthamiz Selvi, P. Srivaramangai, (2019). Frequent Mining Techniques In Bigdata : Study. International Journal of Computer Sciences and Engineering, 07(02), 111-116.

BibTex Style Citation:
@article{Selvi_2019,
author = {Muthamiz Selvi, P. Srivaramangai},
title = {Frequent Mining Techniques In Bigdata : Study},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {1 2019},
volume = {07},
Issue = {02},
month = {1},
year = {2019},
issn = {2347-2693},
pages = {111-116},
url = {https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=658},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=658
TI - Frequent Mining Techniques In Bigdata : Study
T2 - International Journal of Computer Sciences and Engineering
AU - Muthamiz Selvi, P. Srivaramangai
PY - 2019
DA - 2019/01/31
PB - IJCSE, Indore, INDIA
SP - 111-116
IS - 02
VL - 07
SN - 2347-2693
ER -

           

Abstract

Big data is a collection of large amount of data with various types of data and usable to be processed at much higher frequency. Frequent Itemset Mining is one of the classical data mining problems in most of the data mining applications in big data era. In data mining, association rule mining is key technique for discovering useful patterns from large collection of data. Frequent itemset mining is a famous step of association rule mining. Many efficient pattern mining algorithms have been discovered in the last two decades, yet most do not hold good for Big Dataset. In association rule mining (ARM) a Frequent Itemset Mining (FIM) is a well-known step. In last two decades, many efficient pattern mining algorithms have been discovered, up till now most do not hold good for Big Dataset. The Apriori, FP-growth and Eclat algorithms are the most famous algorithms which can be used for Frequent Pattern mining. However, these parallel mining algorithms lack features like automated parallelization, fine load balancing, and distribution of data on large clusters. To overcome these problems various parallelized approaches using Hadoop MapReduce model are developed to perform frequent itemsets mining from big data. This paper gives overall study about frequent pattern mining in big data.

Key-Words / Index Term

Big data, Pattern Mining, Frequent Itemset Mining, Data Mining, ItemSets

References

[1]. K. Mehmed. Data Mining: Concepts, Models, Methods, and Algorithms. John Wiley & Sons., 2003.
[2]. J. Han and M. Kamber. Data Mining: Concpets and Techniques. Academic Press, 2nd edition edition, 2006.
[3]. J. Vreeken. Making Pattern Mining Useful. PhD thesis, the Dutch Research School for Information and Knowledge Systems, Netherlands, 2009.
[4]. F. Hadzic, H. Tan, and T.S. Dillon. Mining of Data with Complex Structures, volume 333. Springer, 2010.
[5]. J. Han, H. Cheng, D. Xin, and X. Yan. Frequent pattern mining: Current status and future direction. Data Mining and Knowledge Discovery, 15:55–86, 2007.
[6]. I.H. Witten, E. Frank, and A.M. Hall. Data Mining: Practical Machine Learning Tools and Techniques. Elsevier, 3rd edition edition, 2011. [7] R. Kosala and H. Blockeel. Web mining research: A survey. SIGKDD Explorations, 2(1):1–15, 2000.
[7]. F. Medici, M.I. Hawa, A. Giorgini, A. Panelo, C.M. Solfelix, R.D.G. Leslie, and P. Pozzilli. Antibodies to GAD65 and a tyrosine phosphatase-like molecule IA-2ic in Filipino Type I diabetic patients. Diabetes Care, 22(9):1458–1461, 1999.
[8]. W. Shi, F.K. Ngok, and D.R. Zusman. Cell density regulates cellular reversal frequency in Myxococcus xanthus. In Proceedings of the National Academy of Sciences of the United States of America, volume 93(9), pages 4142–4146, 1996.
[9]. R. Agrawal, T. Imieinski, and A. Swami. Mining association rules between sets of items in large databases. In P. Buneman and S. Jajodia, editors, Proceedings of the ACM SIGMOD International Conference on the Management of Data, pages 207–216, Washington DC, 1993. ACM Press.
[10]. A. Jim´enez, F. Berzal, and J. Cubero. Frequent tree pattern mining: A survey. Intelligent Data Analysis, 14:603–622, 2002.
[11]. M.J. Zaki. Efficiently mining frequent trees in a forest: Algorithms and applications. IEEE Transactions on Knowledge and Data Engineering, 17(8):1021– 1035, 2005.
[12]. X. Yan and J. Han. gSpan: Graph-based substructure pattern mining. In Proceedings of 2002 International Conference on Data Mining (ICDM’02), pages 721–724, Maebashi, Japan, December 2002.
[13]. Y. Chi, R.R. Muntz, S. Nijssen, and J.N. Kok. Frequent subtree mining – an overview. Special Issue on Graph and Tree Mining, 66(1-2):161–198, 2005.
[14]. P. Cserk´uti, T. Levendovszky, and H. Charaf. Survey on subtree matching. In Proceedings of the International Conference on Intelligent Engineering Systems (INES 200), pages 216–221, London, September 2006.
[15]. R. Shamir and D. Tsur. Faster subtree isomorphism. Journal of Algorithms, 33:267–280, 1999.
[16]. A. Gupta and N. Nishimura. The complexity of subgraph isomorphism: Duality results for graphs of bournded path- and tree-width. Technical report, University of Waterloo, April 1995.
[17]. M.B. Miles and A.M. Huberman. Qualitative Data Analysis: An Expanded Sourcebook. SAGE Publications, Inc., 2nd edition edition, 1994.
[18]. A.N. Oppenheim. Questionnaire Design and Attitude Measurement. Pinter Publications, 1992.
[19]. B. Boukhatem, S. Kenai, A. Tagnit-Hamou, and M. Ghrici. Application of new information technology on concrete: An overivew. Journal of Civil Engineering and Management, 17(2):248–258, 2011.
[20]. D.L. Olson. Data mining in business services. Service Business, 1(3):181–193, 2007.
[21]. N.R.S. Raghavan. Data mining in e-commerce: A survey. Sadhana – Academy Proceedings in Engineering Sciences, 30(2-3):275–289, 2005.
[22]. C. Romero and S. Ventura. Educational data mining: A review of the state of the art. IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews, 40(6):601–618, 2010.
[23]. R. Law, R. Leung, and D. Buhalis. Information technology applicaions in hospitality and tourism: A review of publications from 2005 to 2007. Journal of Travel and Tourism Marketing, 26:599–23, 2009.
[24]. C.J. Date. An Introduction to Database Systems. Addison-Wesley, 6th edition edition, 1994.
[25]. R.B. Sher and R.J. Daverman. Handbook of Geometric Topology. Elsevier, North-Holland, 2002.
[26]. A. Ceglar and J.F. Roddick. Association mining. ACM Computing Survey, 38(2):5, 2006.
[27]. V.P. Magnini, E.D.Jr. Honeycutt, and S.K. Hodge. Data mining for hotel firms: Use and limitations. Cornell Hotel and Restaurant Administration Quarterly, 44(2):94–105, 2003.
[28]. G.G. Emel, C, . Takin, and ¨ O. Akat. Profiling a domestic tourism market by means of association rule mining. Anatolia, 18(2):334–342, 2007.
[29]. X. Wu, C. Zhang, and S. Zhang. Efficient mining of both positive and negative association rules. ACM Transactions on Information Systems, 22(3):381–405, 2004.
[30]. Y.-J. Tsay, T.-J. Hsu, and J.-R. Yu, “FIUT: A new method for mining frequent itemsets,” Inf. Sci., vol. 179, no. 11, pp. [20] 1724–1737, 2009.