Open Access   Article Go Back

Uncertain Big Data Strategical Miner

H.V. Sapte1 , S.S. Pallati2 , P.P. Pandit3 , A.S. Joshi4 , V. Jumb5

  1. Xavier Institute of Engineering, Mumbai University, Mumbai, India.
  2. Xavier Institute of Engineering, Mumbai University, Mumbai, India.
  3. Xavier Institute of Engineering, Mumbai University, Mumbai, India.
  4. Xavier Institute of Engineering, Mumbai University, Mumbai, India.
  5. Xavier Institute of Engineering, Mumbai University, Mumbai, India.

Correspondence should be addressed to: harish.sapte10@gmail.com.

Section:Research Paper, Product Type: Journal Paper
Volume-5 , Issue-6 , Page no. 237-243, Jun-2017

Online published on Jun 30, 2017

Copyright © H.V. Sapte, S.S. Pallati, P.P. Pandit, A.S. Joshi, V. Jumb . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: H.V. Sapte, S.S. Pallati, P.P. Pandit, A.S. Joshi, V. Jumb, “Uncertain Big Data Strategical Miner,” International Journal of Computer Sciences and Engineering, Vol.5, Issue.6, pp.237-243, 2017.

MLA Style Citation: H.V. Sapte, S.S. Pallati, P.P. Pandit, A.S. Joshi, V. Jumb "Uncertain Big Data Strategical Miner." International Journal of Computer Sciences and Engineering 5.6 (2017): 237-243.

APA Style Citation: H.V. Sapte, S.S. Pallati, P.P. Pandit, A.S. Joshi, V. Jumb, (2017). Uncertain Big Data Strategical Miner. International Journal of Computer Sciences and Engineering, 5(6), 237-243.

BibTex Style Citation:
@article{Sapte_2017,
author = {H.V. Sapte, S.S. Pallati, P.P. Pandit, A.S. Joshi, V. Jumb},
title = {Uncertain Big Data Strategical Miner},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {6 2017},
volume = {5},
Issue = {6},
month = {6},
year = {2017},
issn = {2347-2693},
pages = {237-243},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=1333},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=1333
TI - Uncertain Big Data Strategical Miner
T2 - International Journal of Computer Sciences and Engineering
AU - H.V. Sapte, S.S. Pallati, P.P. Pandit, A.S. Joshi, V. Jumb
PY - 2017
DA - 2017/06/30
PB - IJCSE, Indore, INDIA
SP - 237-243
IS - 6
VL - 5
SN - 2347-2693
ER -

VIEWS PDF XML
530 584 downloads 682 downloads
  
  
           

Abstract

There are many data mining algorithms which exist today for searching patterns from transactional databases. Most of them work only on precise data. But there are also situations in which these conventional algorithms fail, situations in which Data is uncertain in nature. Uncertain data can be explained as the one where items have probabilistic values associated with them. These probabilities express the likelihood of these items to be present in the transactions. In mining, the search tree produced is also one of the major factor of concern. The search space produced when dealing with uncertain data is much larger due to the presence of existential probabilities. This problem worsens when dealing with Big data. Considering all the above factors and concerns, an algorithm is specified and explained ahead. It allows users to express the interest in terms of constraints and uses the Map Reduce programming model to mine uncertain Big data for frequent patterns that satisfy the user-specified constraints. By using these user-specified constraints as inputs, the algorithm greatly reduces the search space for Big data mining of uncertain data, and returns only those patterns the users are interested in.

Key-Words / Index Term

Big data models and algorithms, Big data analytics, Uncertain data mining, Frequent pattern mining

References

[1] C.K.-S. Leung , R. K. MacKinnon, F. Jiang, “Reducing the Search Space for Big Data Mining for Interesting Pattern from Uncertain Data” , 2014 IEEE International Congress on Big Data, pp.315-322, 2014.
[2] J.V. Patel, K. J. Panchal, “A Modified Approach to Mine Frequent Patterns from Uncertain Data”, 2015 1st International Conference (NGCT), pp.612-615, 2015.
[3] C.K.-S. Leung, “Mining uncertain data”, WIREs Data Mining and Knowledge Discovery, Vol.1 ,Issue.4, pp.316–329, July-Aug. 2011.
[4] S. Madden, “From databases to big data,” IEEE Internet Computing, Vol.16, Issue.3, pp. 4–6, May–June 2012.
[5] Azzini, P. Ceravolo, “Consistent process mining over Big data triple stores”, IEEE Big Data Congress 2013, pp. 54–61, 2013.
[6] Ӧlmezoğullari, I. Ari, “Online association rule mining over fast data”, IEEE International Congress on Big Data 2013, pp.110–117, 2013.
[7] P. Agarwal, G. Shroff, P. Malhotra, “Approximate incremental big-data harmonization”, IEEE Big Data Congress 2013, pp.118–125, 2013.
[8] Yang , S. Fong, “Countering the concept-drift problem in big data using iOVFDT”, IEEE Big Data Congress 2013, pp.126–132, 2013.
[9] C.K.Chui, B.Kao, E.Hung, “Mining Frequent Itemsets from Uncertain Data”, LNCS 2007, pp.47-58, 2007.
[10] C.K.-S. Leung , F. Jiang, “Frequent itemset mining of uncertain data streams using the damped window model”, ACM SAC 2011, pp.950–955, 2011.
[11] C.K.-S. Leung , F. Jiang, “Frequent pattern mining from time-fading streams of uncertain data”, DaWaK 2011 (LNCS 6862), pp. 252-264, 2011.
[12] Y. Tong, L. Chen, Y. Cheng, P.S. Yu, “Mining frequent itemsets over uncertain databases”, PVLDB, Vol.5, Issue.11, pp.1650–1661, July 2012.
[13] C.K.-S. Leung, M.A.F. Mateo, D.A. Brajczuk, “A tree-based approach for frequent pattern mining from uncertain data”, PAKDD 2008 (LNAI 5012), pp. 653–661, 2008.
[14] C.K.-S. Leung , S.K. Tanbeer, “Fast tree-based mining of frequent itemsets from uncertain data”, DASFAA 2012 (LNCS 7238), pp. 272–287, 2012.
[15] C.K.-S. Leung, S.K. Tanbeer, “PUF-tree: A compact tree structure for frequent pattern mining of uncertain data”, PAKDD 2013 (LNCS 7818), pp.13–25, 2013.
[16] D.N. Goswami, Anshu Chaturvedi., C.S. Raghuvanshi,”An Algorithm for Frequent Pattern Mining Based On Apriori”, International Journal on Computer Science and Engineering(IJCSE), Vol.2, Issue.4, pp.942-947, 2010.
[17] J.Dean, S.Ghemawat, “MapReduce: simplified data processing on large clusters”, CACM, Vol.51, Issue.1, pp.107-113, Jan. 2008.
[18] M.Y.Lin, P.Y.Lee, S. C. Hsueh, “Apriori based frequent itemset mining algorithms on MapReduce”, ICUIMC 2012, art.76, 2012.