Journal of Northeastern University (Natural Science) ›› 2012, Vol. 33 ›› Issue (3): 332-335. DOI: -

• Articles •

Automatic acquisition method of ontology instances from Web tables

Cha, Song-Il (车成逸) (1); Ma, Zong-Min (马宗民) (1); Jiao, Xiao-Long (焦晓龙) (1)

  1. School of Information Science and Engineering, Northeastern University, Shenyang 110819, China
  • Received: 2013-06-19 Revised: 2013-06-19 Published: 2013-04-04
  • Contact: Cha, S.-I.
  • About author: -
  • Supported by:
    National Natural Science Foundation of China (60873010)

Abstract: Information in many domains is now presented in tabular form, so extracting ontologies from tables has attracted growing attention. To improve the accuracy of extracting ontology instances from Web tables, a method for acquiring the semantic classes of words based on semantic similarity is proposed. The method computes semantic similarity with an SVM-based approach, which improves the accuracy of similarity judgments and overcomes the limitations of earlier approaches that analyzed table structure by syntactic similarity. Finally, the performance of the method is evaluated experimentally. The results show that the method can effectively extract ontology instances from Web tables.

Key words: Web tables, ontology instances, semantic similarity, mutual information, support vector machine

CLC Number: