Journal of Northeastern University Natural Science, 2016, Vol. 37, Issue (1): 24-28. DOI: 10.12068/j.issn.1005-3026.2016.01.006

• Information & Control •

A Top-k High Utility Itemset Mining Method Based on the Index Utility

LIN Shu-kuan, WANG Xiao-cong, QIAO Jian-zhong, WANG Rui   

  1. School of Information Science & Engineering, Northeastern University, Shenyang 110819, China.
  • Received: 2014-10-31 Revised: 2014-10-31 Online: 2016-01-15 Published: 2016-01-08
  • Contact: LIN Shu-kuan

Abstract: Existing Top-k high utility itemset mining methods substitute the transaction utilities of itemsets for their real utilities in order to preserve the downward closure property. This overestimates itemset utilities, which weakens pruning and lowers mining efficiency. To solve this problem, the concept of the index utility is proposed. On this basis, a two-level index is built and pruned, which strengthens pruning and improves the efficiency of Top-k high utility itemset mining. In addition, a utility matrix is built to support fast calculation of itemset utilities, further improving mining efficiency. Experiments on different types of datasets validate the effectiveness and efficiency of the proposed method.
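
The overestimation described in the abstract can be illustrated with a small sketch. It is not the index utility method proposed in the paper (the abstract does not give those details). It is a brute-force illustration using the definitions commonly used in high utility itemset mining: the real utility of an itemset sums quantity times unit profit over its own items, while the transaction-utility bound sums the utility of every whole transaction that contains the itemset. The toy data and the names `PROFIT`, `TRANSACTIONS`, `utility`, `transaction_utility_bound`, and `top_k` are hypothetical and chosen only for illustration.

```python
from itertools import combinations

# Hypothetical toy data: unit profit per item, and item quantities per transaction.
PROFIT = {"a": 5, "b": 2, "c": 1}
TRANSACTIONS = [
    {"a": 1, "b": 2},
    {"b": 4, "c": 3},
    {"a": 2, "b": 1, "c": 1},
]


def utility(itemset):
    """Real utility: count only the items of `itemset` in transactions containing it."""
    total = 0
    for t in TRANSACTIONS:
        if all(item in t for item in itemset):
            total += sum(t[item] * PROFIT[item] for item in itemset)
    return total


def transaction_utility_bound(itemset):
    """Upper bound: count the whole transaction utility of each containing transaction."""
    total = 0
    for t in TRANSACTIONS:
        if all(item in t for item in itemset):
            total += sum(qty * PROFIT[item] for item, qty in t.items())
    return total


def top_k(k):
    """Brute-force Top-k by real utility (exponential; for tiny examples only)."""
    items = sorted(PROFIT)
    candidates = [c for r in range(1, len(items) + 1)
                  for c in combinations(items, r)]
    # Sort by descending utility, breaking ties by the itemset itself.
    ranked = sorted(candidates, key=lambda c: (-utility(c), c))
    return [(c, utility(c)) for c in ranked[:k]]


if __name__ == "__main__":
    for itemset in [("a",), ("c",), ("a", "b")]:
        print(itemset, utility(itemset), transaction_utility_bound(itemset))
    print(top_k(2))
```

In this toy data the itemset `("c",)` has a real utility of 4 but a transaction-utility bound of 24, which shows how loose the bound can be. The real utility is also not downward closed: `("a", "b")` has utility 21 while its subset `("a",)` has only 15, which is why an upper bound with the closure property is needed for pruning in the first place.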

Key words: itemset utility, index utility, Top-k high utility itemset, ending super itemset, utility matrix
