Efficient Algorithms for Mining Top-K High Utility Itemsets

Ameena Aiman, Raafiya Gulmeher

Open Access Article Go Back

Efficient Algorithms for Mining Top-K High Utility Itemsets

Ameena Aiman¹ , Raafiya Gulmeher²

Section:Research Paper, Product Type: Journal Paper
Volume-6 , Issue-7 , Page no. 1274-1280, Jul-2018

CrossRef-DOI: https://doi.org/10.26438/ijcse/v6i7.12741280

Online published on Jul 31, 2018

Copyright © Ameena Aiman, Raafiya Gulmeher . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at Google Scholar | DPI Digital Library

XML View

PDF Download

How to Cite this Paper

IEEE Citation
MLA Citation
APA Citation
BibTex Citation
RIS Citation

IEEE Citation

IEEE Style Citation: Ameena Aiman, Raafiya Gulmeher, “Efficient Algorithms for Mining Top-K High Utility Itemsets,” International Journal of Computer Sciences and Engineering, Vol.6, Issue.7, pp.1274-1280, 2018.

MLA Citation

MLA Style Citation: Ameena Aiman, Raafiya Gulmeher "Efficient Algorithms for Mining Top-K High Utility Itemsets." International Journal of Computer Sciences and Engineering 6.7 (2018): 1274-1280.

APA Citation

APA Style Citation: Ameena Aiman, Raafiya Gulmeher, (2018). Efficient Algorithms for Mining Top-K High Utility Itemsets. International Journal of Computer Sciences and Engineering, 6(7), 1274-1280.

BibTex Citation

BibTex Style Citation:
@article{Aiman_2018,
author = {Ameena Aiman, Raafiya Gulmeher},
title = {Efficient Algorithms for Mining Top-K High Utility Itemsets},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {7 2018},
volume = {6},
Issue = {7},
month = {7},
year = {2018},
issn = {2347-2693},
pages = {1274-1280},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=2599},
doi = {https://doi.org/10.26438/ijcse/v6i7.12741280}
publisher = {IJCSE, Indore, INDIA},
}

RIS Citation

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v6i7.12741280}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=2599
TI - Efficient Algorithms for Mining Top-K High Utility Itemsets
T2 - International Journal of Computer Sciences and Engineering
AU - Ameena Aiman, Raafiya Gulmeher
PY - 2018
DA - 2018/07/31
PB - IJCSE, Indore, INDIA
SP - 1274-1280
IS - 7
VL - 6
SN - 2347-2693
ER -

VIEWS	PDF	XML
689	537 downloads	320 downloads

Bar Line

Abstract

High utility itemsets (HUIs) mining is a developing topic in information mining, which alludes to finding all itemsets having a utility meeting a user-specified minimum utility threshold min_util. However, setting min_util appropriately is a difficult problem for users. Finding an appropriate minimum utility threshold by trial and error is a tedious process for users. If min_util is set too low, too many HUIs will be generated, which may cause the mining process to be very inefficient. On the other hand, if min_util is set too high, it is likely that no HUIs will be found. In this paper, we address the above issues by proposing a new framework for top-k high utility itemset mining, where k is the desired number of HUIs to be mined. Two types of efficient algorithms named TKU (mining Top-K Utility itemset) and TKO (mining Top-K utility itemset in One phase) are proposed for mining such itemset without the need to set min_util. We provide a structural comparison of the two algorithms with discussions on their advantages and limitations. Empirical evaluations on both real and synthetic datasets show that the performance of the proposed algorithms is close to that of the optimal case of state-of-the-art utility mining algorithms.

Key-Words / Index Term

ItemSets, Mining, High Utility, TKO, HUIs

References

[1] R. Agrawal and R. Srikant, “Fast algorithms for mining association rules,” in Proc. Int. Conf. Very Large Data Bases, 1994, pp. 487–499.
[2] C. Ahmed, S. Tanbeer, B. Jeong, and Y. Lee, “Efficient tree structures for high-utility pattern mining in incremental databases,” IEEE Trans. Knowl. Data Eng., vol. 21, no. 12, pp. 1708–1721, Dec. 2009.
[3] K. Chuang, J. Huang, and M. Chen, “Mining top-k frequent patterns in the presence of the memory constraint,” VLDB J., vol. 17, pp. 1321–1344, 2008.
[4] R. Chan, Q. Yang, and Y. Shen, “Mining high-utility itemsets,” in Proc. IEEE Int. Conf. Data Mining, 2003, pp. 19–26.
[5] P. Fournier-Viger and V. S. Tseng, “Mining top-k sequential rules,” in Proc. Int. Conf. Adv. Data Mining Appl., 2011, pp. 180–194.
[6] P. Fournier-Viger, C.Wu, and V. S. Tseng, “Mining top-k association rules,” in Proc. Int. Conf. Can. Conf. Adv. Artif. Intell., 2012, pp. 61–73.
[7] P. Fournier-Viger, C. Wu, and V. S. Tseng, “Novel concise representations of high utility itemsets using generator patterns,” in Proc. Int. Conf. Adv. Data Mining Appl. Lecture Notes Comput. Sci., 2014, vol. 8933, pp. 30–43.
[8] J. Han, J. Pei, and Y. Yin, “Mining frequent patterns without candidate generation,” in Proc. ACM SIGMOD Int. Conf. Manag. Data, 2000, pp. 1–12.
[9] J. Han, J. Wang, Y. Lu, and P. Tzvetkov, “Mining top-k frequent closed patterns without minimum support,” in Proc. IEEE Int. Conf. Data Mining, 2002, pp. 211–218.
[10] S. Krishnamoorthy, “Pruning strategies for mining high utility itemsets,” Expert Syst. Appl., vol. 42, no. 5, pp. 2371–2381, 2015.
[11] C. Lin, T. Hong, G. Lan, J. Wong, and W. Lin, “Efficient updating of discovered high-utility itemsets for transaction deletion in dynamic databases,” Adv. Eng. Informat., vol. 29, no. 1, pp. 16–27, 2015.
[12] G. Lan, T. Hong, V. S. Tseng, and S. Wang, “Applying the maximum utility measure in high utility sequential pattern mining,” Expert Syst. Appl., vol. 41, no. 11, pp. 5071–5081, 2014.
[13] Y. Liu, W. Liao, and A. Choudhary, “A fast high utility itemsets mining algorithm,” in Proc. Utility-Based Data Mining Workshop, 2005, pp. 90–99.
[14] M. Liu and J. Qu, “Mining high utility itemsets without candidate generation,” in Proc. ACM Int. Conf. Inf. Knowl. Manag., 2012, pp. 55–64.
[15] J. Liu, K. Wang, and B. Fung, “Direct discovery of high utility itemsets without candidate generation,” in Proc. IEEE Int. Conf. Data Mining, 2012, pp. 984–989.

Citations	8797
h-index	34
i10-index	152

Impact Factor :	3.802
ISSN :	2347-2693 (Online)