Journal of Northeastern University Natural Science ›› 2016, Vol. 37 ›› Issue (7): 931-936.DOI: 10.12068/j.issn.1005-3026.2016.07.005

• Information & Control • Previous Articles     Next Articles

Mining Full Weighted Maximal Frequent Itemsets Based on Sliding Window over Data Stream

WANG Shao-peng1, WEN Ying-you1,2, ZHAO Hong1,2   

  1. 1.School of Information Science & Engineering, Northeastern University, Shenyang 110819, China; 2.Key Laboratory of Medical Image Computing, Ministry of Education, Northeastern University, Shenyang 110819, China.
  • Received:2015-05-11 Revised:2015-05-11 Online:2016-07-15 Published:2016-07-13
  • Contact: WANG Shao-peng
  • About author:-
  • Supported by:
    -

Abstract: Aiming at the problem that none of current researches on the WMFI (weighted maximal frequent itemsets) over data stream emphasizes the WMFI mining on the condition that the frequent threshold is not equal with the weighted frequent threshold, the concept of FWMFI (full weighted maximal frequent itemsets) was firstly promoted in this work. In order to reduce redundant operations existing in the naive algorithm which is used to handle the FWMFI mining based on sliding window over data stream, the FWMFI-SW (FWMFI mining based on sliding window over data stream) algorithm was proposed. The mining optimization strategy was adopted based on the frequent character to reduce the unnecessary call about the MaxW optimization strategy in the naive algorithm. In addition, the edit distance ratio was taken as reconstruction judge function to decide whether the updated WMFP-SW-tree should be reconstructed as the window slides. The extensive experiments showed that the FWMFI-SW algorithm is effective , and outperforms the naive algorithm in running time.

Key words: data stream, sliding window, edit distance ratio, weighted maximal frequent itemsets, reconstruction judge function

CLC Number: