Journal of Northeastern University (Natural Science) ›› 2017, Vol. 38 ›› Issue (2): 200-204. DOI: 10.12068/j.issn.1005-3026.2017.02.010

• Information and Control •

Research on the Similarity of Fuzzy XML Documents and Fuzzy DTD

ZHAO Zhen1,2, MA Zong-min1

  1. School of Computer Science & Engineering, Northeastern University, Shenyang 110819, China; 2. College of Information Science and Technology, Bohai University, Jinzhou 121013, China.
  • Received: 2015-09-24 Revised: 2015-09-24 Online: 2017-02-15 Published: 2017-03-03
  • Contact: ZHAO Zhen
  • About the authors: ZHAO Zhen (born 1977), male, from Jinzhou, Liaoning, doctoral candidate at Northeastern University; MA Zong-min (born 1965), male, from Jinxiang, Shandong, professor and doctoral supervisor at Northeastern University.
  • Supported by:
    National Natural Science Foundation of China (61370075); Program for New Century Excellent Talents in University, Ministry of Education of China (NCET-05-0288).

Abstract: In fuzzy extensible markup language (XML) data management, measuring the similarity between a fuzzy XML document and a fuzzy document type definition (DTD) is a key step in fuzzy XML data integration and fuzzy XML document clustering. To study this similarity, the fuzzy DTD tree is transformed by rules that mainly resolve the disjunctive constraints and cardinality constraints on elements and attributes: disjunctive normal form is converted into conjunctive normal form, and the number of repetitions of each element or attribute is made definite. A tree edit distance algorithm is then used to compare the fuzzy XML document tree with the set of transformed fuzzy DTD trees. Experiments verify the performance advantages of the proposed method.

Key words: fuzzy XML documents, document type definition (DTD), similarity, structure matching, data integration
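
The pipeline summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: the particle encoding (`"elem"`, `"choice"`, `"rep"`), the `max_rep` cap on unbounded repetition, and the normalization in `similarity` are hypothetical; fuzzy membership degrees are ignored; and a simple Selkow-style top-down edit distance stands in for whichever tree edit distance algorithm the paper uses. The sketch expands a DTD into the set of trees with no disjunction and fixed repetition counts, then scores a document against the closest one.

```python
from itertools import product

# A concrete tree is (label, children), where children is a tuple of trees.
# A DTD content particle (hypothetical encoding) is one of:
#   ("elem", name, [particles])   element with a sequence content model
#   ("choice", [particles])       disjunction: exactly one alternative
#   ("rep", particle, lo, hi)     cardinality; hi=None means unbounded

def expand(p, max_rep=2):
    """Return every concrete forest (tuple of trees) that particle p allows."""
    kind = p[0]
    if kind == "elem":
        _, name, parts = p
        options = [expand(q, max_rep) for q in parts]
        # One tree per combination of the children's alternatives.
        return [((name, sum(combo, ())),) for combo in product(*options)]
    if kind == "choice":
        # Each alternative becomes its own candidate (disjunction removed).
        return [forest for q in p[1] for forest in expand(q, max_rep)]
    if kind == "rep":
        _, q, lo, hi = p
        hi = max_rep if hi is None else hi  # assumed cap for * and +
        alts = expand(q, max_rep)
        return [sum(combo, ()) for n in range(lo, hi + 1)
                for combo in product(alts, repeat=n)]
    raise ValueError(f"unknown particle kind: {kind}")

def size(t):
    """Number of nodes in a concrete tree."""
    return 1 + sum(size(c) for c in t[1])

def selkow(a, b):
    """Top-down edit distance with unit costs: relabel a node, or
    insert/delete a whole subtree at a cost equal to its size."""
    xs, ys = a[1], b[1]
    d = [[0] * (len(ys) + 1) for _ in range(len(xs) + 1)]
    for i in range(1, len(xs) + 1):
        d[i][0] = d[i - 1][0] + size(xs[i - 1])
    for j in range(1, len(ys) + 1):
        d[0][j] = d[0][j - 1] + size(ys[j - 1])
    for i in range(1, len(xs) + 1):
        for j in range(1, len(ys) + 1):
            d[i][j] = min(d[i - 1][j] + size(xs[i - 1]),
                          d[i][j - 1] + size(ys[j - 1]),
                          d[i - 1][j - 1] + selkow(xs[i - 1], ys[j - 1]))
    return (0 if a[0] == b[0] else 1) + d[-1][-1]

def similarity(doc, dtd, max_rep=2):
    """Best normalized similarity between doc and any expanded DTD tree."""
    return max(1 - selkow(doc, f[0]) / (size(doc) + size(f[0]))
               for f in expand(dtd, max_rep))

# Illustrative DTD: book (title, (author | editor), chapter+)
dtd = ("elem", "book", [
    ("elem", "title", []),
    ("choice", [("elem", "author", []), ("elem", "editor", [])]),
    ("rep", ("elem", "chapter", []), 1, None),
])
doc = ("book", (("title", ()), ("author", ()),
                ("chapter", ()), ("chapter", ())))
print(len(expand(dtd)), similarity(doc, dtd))
```

Capping repetition keeps the expanded set finite, at the price of penalizing documents that repeat an element more often than `max_rep`; a larger cap trades accuracy for a candidate set that grows multiplicatively.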

CLC number: