东北大学学报(自然科学版) ›› 2009, Vol. 30 ›› Issue (11): 1558-1561.DOI: -

• 论著 • 上一篇    下一篇

一种基于广义相似性的共调控基因聚类算法

赵宇海;乔百友;林天亮;王国仁;   

  1. 东北大学医学影像计算教育部重点实验室;东北大学信息科学与工程学院;东北大学计算中心;
  • 收稿日期:2013-06-22 修回日期:2013-06-22 出版日期:2009-11-15 发布日期:2013-06-22
  • 通讯作者: Zhao, Y.-H.
  • 作者简介:-
  • 基金资助:
    国家自然科学基金资助项目(60803026,60873011,60773219);;

A clustering algorithm based on generalized similarity for co-regulated genes

Zhao, Yu-Hai (1); Qiao, Bai-You (1); Lin, Tian-Liang (3); Wang, Guo-Ren (1)   

  1. (1) Key Laboratory of Medical Image Computing, Ministry of Education, Northeastern University, Shenyang 110004, China; (2) School of Information Science and Engineering, Northeastern University, Shenyang 110004, China; (3) Computer Center, Northeastern University, Shenyang 110004, China
  • Received:2013-06-22 Revised:2013-06-22 Online:2009-11-15 Published:2013-06-22
  • Contact: Zhao, Y.-H.
  • About author:-
  • Supported by:
    -

摘要: 针对共调控基因的特殊性质和现有共调控基因聚类算法存在的不足,提出了基于广义相似性的聚类模型g-Cluster.正负共调控基因因具有相同的编码而被聚集到同一个共调控基因簇中.进一步提出了一种基于树结构的聚类算法FBTD,采用先宽度优先后深度优先的搜索策略,挖掘所有符合条件的最大g-Cluster,同时应用了高效的削减规则和优化策略.将该算法用于真实数据集.理论分析和实验结果都表明,该算法是实用和有效的.

关键词: 共调控基因, 聚类, 模式相似性, 基因本体

Abstract: A novel clustering model, i.e., the g-Cluster, is developed on the basis of generalized similarity for the special properties and disadvantages of existing clustering algorithms of co-regulated genes. The positive and negative co-regulated genes in this model are integrated into the same cluster if and only if they are provided with the same code. Further, a tree-based clustering algorithm FBTD (first breadth then depth) is proposed, where the priorities in search strategy is that the breadth is taken first then the depth, to find out all the maximal g-Clusters with high-efficiency pruning rules and optimizing strategy performed simultaneously. Applying the FBTD algorithm to real datasets involving genes, both the theoretic and testing results showed that the algorithm is practically efficient.

中图分类号: