Journal of Northeastern University, 2008, Vol. 29, Issue (5): 657-660+676.

• Original Paper

PFT-based document fragmentation of XML streaming data

Huo, Huan (1); Han, Dong-Hong (1); Hui, Xiao-Yun (1); Wang, Guo-Ren (1)

  (1) School of Information Science and Engineering, Northeastern University, Shenyang 110004, China
  • Received: 2013-06-22 Revised: 2013-06-22 Online: 2008-05-15 Published: 2013-06-22
  • Contact: Huo, H.

Abstract: Unlike queries in conventional databases, queries over XML stream data are constrained not only by memory capacity but also by real-time processing requirements. Building on the Hole-Filler model, a path frequency tree (PFT) is defined from statistical information about queries on the XML data, and a sibling-based document fragmentation policy with its corresponding algorithm is derived from it. An alternative membership-based document fragmentation policy and its corresponding algorithm are then proposed. Both algorithms effectively improve the utilization and cohesion of XML fragments. Test results show that the PFT-based document fragmentation algorithms perform well in terms of query cost and other properties.
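
The abstract does not give the formal definition of the PFT, so the following is only a minimal sketch of one plausible reading: a tree of element names in which each node counts how many queries in a workload traverse it. The names `PFTNode`, `build_pft`, and `frequency`, the restriction to simple absolute paths such as `/site/people/person`, and the sample workload are all assumptions made for illustration; they are not taken from the paper, and the sketch does not reproduce either fragmentation algorithm.

```python
from dataclasses import dataclass, field


@dataclass
class PFTNode:
    """One element name on a path, plus the number of queries that traverse it."""
    tag: str
    freq: int = 0
    children: dict = field(default_factory=dict)


def split_path(path):
    """Split a simple absolute path like '/a/b/c' into its element names."""
    return [step for step in path.split("/") if step]


def build_pft(query_paths):
    """Build a path frequency tree from a workload of simple path queries.

    Assumption: every query is a plain child-axis path with no predicates,
    wildcards, or descendant steps.
    """
    root = PFTNode(tag="")  # virtual root above the document element
    for path in query_paths:
        node = root
        for tag in split_path(path):
            node = node.children.setdefault(tag, PFTNode(tag))
            node.freq += 1  # this query passes through the node
    return root


def frequency(root, path):
    """Return how many queries traverse the node at `path` (0 if absent)."""
    node = root
    for tag in split_path(path):
        node = node.children.get(tag)
        if node is None:
            return 0
    return node.freq


# Hypothetical query workload, used only to exercise the sketch.
queries = [
    "/site/people/person",
    "/site/people/person/name",
    "/site/regions",
]
pft = build_pft(queries)
```

Under this reading, a fragmentation policy could consult `frequency(pft, path)` to decide which subtrees are queried often enough to be shipped as separate fillers; how the paper actually uses the counts is not stated in the abstract.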
