Efficient Algorithms for Mining Top-K High Utility Itemsets
Ameena Aiman1 , Raafiya Gulmeher2
Section:Research Paper, Product Type: Journal Paper
Volume-6 ,
Issue-7 , Page no. 1274-1280, Jul-2018
CrossRef-DOI: https://doi.org/10.26438/ijcse/v6i7.12741280
Online published on Jul 31, 2018
Copyright © Ameena Aiman, Raafiya Gulmeher . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
View this paper at Google Scholar | DPI Digital Library
How to Cite this Paper
- IEEE Citation
- MLA Citation
- APA Citation
- BibTex Citation
- RIS Citation
IEEE Style Citation: Ameena Aiman, Raafiya Gulmeher, “Efficient Algorithms for Mining Top-K High Utility Itemsets,” International Journal of Computer Sciences and Engineering, Vol.6, Issue.7, pp.1274-1280, 2018.
MLA Style Citation: Ameena Aiman, Raafiya Gulmeher "Efficient Algorithms for Mining Top-K High Utility Itemsets." International Journal of Computer Sciences and Engineering 6.7 (2018): 1274-1280.
APA Style Citation: Ameena Aiman, Raafiya Gulmeher, (2018). Efficient Algorithms for Mining Top-K High Utility Itemsets. International Journal of Computer Sciences and Engineering, 6(7), 1274-1280.
BibTex Style Citation:
@article{Aiman_2018,
author = {Ameena Aiman, Raafiya Gulmeher},
title = {Efficient Algorithms for Mining Top-K High Utility Itemsets},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {7 2018},
volume = {6},
Issue = {7},
month = {7},
year = {2018},
issn = {2347-2693},
pages = {1274-1280},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=2599},
doi = {https://doi.org/10.26438/ijcse/v6i7.12741280}
publisher = {IJCSE, Indore, INDIA},
}
RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v6i7.12741280}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=2599
TI - Efficient Algorithms for Mining Top-K High Utility Itemsets
T2 - International Journal of Computer Sciences and Engineering
AU - Ameena Aiman, Raafiya Gulmeher
PY - 2018
DA - 2018/07/31
PB - IJCSE, Indore, INDIA
SP - 1274-1280
IS - 7
VL - 6
SN - 2347-2693
ER -
VIEWS | XML | |
630 | 459 downloads | 293 downloads |
Abstract
High utility itemsets (HUIs) mining is a developing topic in information mining, which alludes to finding all itemsets having a utility meeting a user-specified minimum utility threshold min_util. However, setting min_util appropriately is a difficult problem for users. Finding an appropriate minimum utility threshold by trial and error is a tedious process for users. If min_util is set too low, too many HUIs will be generated, which may cause the mining process to be very inefficient. On the other hand, if min_util is set too high, it is likely that no HUIs will be found. In this paper, we address the above issues by proposing a new framework for top-k high utility itemset mining, where k is the desired number of HUIs to be mined. Two types of efficient algorithms named TKU (mining Top-K Utility itemset) and TKO (mining Top-K utility itemset in One phase) are proposed for mining such itemset without the need to set min_util. We provide a structural comparison of the two algorithms with discussions on their advantages and limitations. Empirical evaluations on both real and synthetic datasets show that the performance of the proposed algorithms is close to that of the optimal case of state-of-the-art utility mining algorithms.
Key-Words / Index Term
ItemSets, Mining, High Utility, TKO, HUIs
References
[1] R. Agrawal and R. Srikant, “Fast algorithms for mining association rules,” in Proc. Int. Conf. Very Large Data Bases, 1994, pp. 487–499.
[2] C. Ahmed, S. Tanbeer, B. Jeong, and Y. Lee, “Efficient tree structures for high-utility pattern mining in incremental databases,” IEEE Trans. Knowl. Data Eng., vol. 21, no. 12, pp. 1708–1721, Dec. 2009.
[3] K. Chuang, J. Huang, and M. Chen, “Mining top-k frequent patterns in the presence of the memory constraint,” VLDB J., vol. 17, pp. 1321–1344, 2008.
[4] R. Chan, Q. Yang, and Y. Shen, “Mining high-utility itemsets,” in Proc. IEEE Int. Conf. Data Mining, 2003, pp. 19–26.
[5] P. Fournier-Viger and V. S. Tseng, “Mining top-k sequential rules,” in Proc. Int. Conf. Adv. Data Mining Appl., 2011, pp. 180–194.
[6] P. Fournier-Viger, C.Wu, and V. S. Tseng, “Mining top-k association rules,” in Proc. Int. Conf. Can. Conf. Adv. Artif. Intell., 2012, pp. 61–73.
[7] P. Fournier-Viger, C. Wu, and V. S. Tseng, “Novel concise representations of high utility itemsets using generator patterns,” in Proc. Int. Conf. Adv. Data Mining Appl. Lecture Notes Comput. Sci., 2014, vol. 8933, pp. 30–43.
[8] J. Han, J. Pei, and Y. Yin, “Mining frequent patterns without candidate generation,” in Proc. ACM SIGMOD Int. Conf. Manag. Data, 2000, pp. 1–12.
[9] J. Han, J. Wang, Y. Lu, and P. Tzvetkov, “Mining top-k frequent closed patterns without minimum support,” in Proc. IEEE Int. Conf. Data Mining, 2002, pp. 211–218.
[10] S. Krishnamoorthy, “Pruning strategies for mining high utility itemsets,” Expert Syst. Appl., vol. 42, no. 5, pp. 2371–2381, 2015.
[11] C. Lin, T. Hong, G. Lan, J. Wong, and W. Lin, “Efficient updating of discovered high-utility itemsets for transaction deletion in dynamic databases,” Adv. Eng. Informat., vol. 29, no. 1, pp. 16–27, 2015.
[12] G. Lan, T. Hong, V. S. Tseng, and S. Wang, “Applying the maximum utility measure in high utility sequential pattern mining,” Expert Syst. Appl., vol. 41, no. 11, pp. 5071–5081, 2014.
[13] Y. Liu, W. Liao, and A. Choudhary, “A fast high utility itemsets mining algorithm,” in Proc. Utility-Based Data Mining Workshop, 2005, pp. 90–99.
[14] M. Liu and J. Qu, “Mining high utility itemsets without candidate generation,” in Proc. ACM Int. Conf. Inf. Knowl. Manag., 2012, pp. 55–64.
[15] J. Liu, K. Wang, and B. Fung, “Direct discovery of high utility itemsets without candidate generation,” in Proc. IEEE Int. Conf. Data Mining, 2012, pp. 984–989.