东北大学学报(自然科学版) ›› 2011, Vol. 32 ›› Issue (12): 1700-1703.DOI: -

• 论著 • 上一篇    下一篇

多数据源上Top-k中间模式的产生算法

丁国辉;王国仁;赵相国;   

  1. 东北大学信息科学与工程学院;
  • 收稿日期:2013-06-19 修回日期:2013-06-19 发布日期:2013-04-04
  • 通讯作者: -
  • 作者简介:-
  • 基金资助:
    国家自然科学基金资助项目(60803026,61073063);;

Generating top-k mediated schemas for multiple data sources

Ding, Guo-Hui (1); Wang, Guo-Ren (1); Zhao, Xiang-Guo (1)   

  1. (1) School of Information Science and Engineering, Northeastern University, Shenyang 110819, China
  • Received:2013-06-19 Revised:2013-06-19 Published:2013-04-04
  • Contact: Ding, G.-H.
  • About author:-
  • Supported by:
    -

摘要: 模式集成在很多数据库相关领域起着关键作用,例如数据空间、数据仓库和电子商务等.提出一种自动的多个中间模式的产生方法.首先,引入概念图在抽象层次上表示待集成的多个源模式.其次,给出一种概念之间相似性的划分方法,每种划分方式表示一种源模式的集成策略.最后,利用模拟退火算法在候选中间模式空间中进行搜索,该算法能够自动地找到k个最好的候选中间模式.实验表明,提出的算法是有效的,并且具有较小的运行开销.

关键词: 模式集成, 源模式, 概念图, 相似性, 中间模式

Abstract: Schema integration is a critical step in many database applications, such as data space, data warehousing and electronic commerce, etc. This paper proposed an automatic approach to generate the mediated schemas over a set of source schemas. Firstly, the concept graph is presented to represent the source schemas for the unified representation. Secondly, the similarity between concepts is divided into intervals for the generation of the three merging strategies. Finally, the simulated annealing algorithm is employed to automatically generate the best k mediated schemas. Through extensive experiments, the results show that the algorithm proposed is effective and the running time is little.

中图分类号: