东北大学学报(自然科学版) ›› 2009, Vol. 30 ›› Issue (1): 50-53.DOI: -

• 论著 • 上一篇    下一篇

一种基于信息论的蛋白质数据库搜索鉴定算法

于长永;王国仁;毛克明;翟文丹;   

  1. 东北大学信息科学与工程学院;
  • 收稿日期:2013-06-22 修回日期:2013-06-22 出版日期:2009-01-15 发布日期:2013-06-22
  • 通讯作者: Yu, C.-Y.
  • 作者简介:-
  • 基金资助:
    国家自然科学基金资助项目(60573089)

A database search algorithm based on information theory for protein identification

Yu, Chang-Yong (1); Wang, Guo-Ren (1); Mao, Ke-Ming (1); Zhai, Wen-Dan (1)   

  1. (1) School of Information Science and Engineering, Northeastern University, Shenyang 110004, China
  • Received:2013-06-22 Revised:2013-06-22 Online:2009-01-15 Published:2013-06-22
  • Contact: Yu, C.-Y.
  • About author:-
  • Supported by:
    -

摘要: 为了有效地利用蛋白质串联质谱数据,提高蛋白质鉴定的准确性,提出了一种基于信息论的蛋白质数据库搜索鉴定算法——ITPIA(information theory based protein identification algorithm)算法.针对多肽串联质谱质量低、噪音多等问题,ITPIA算法利用了信息论中的熵理论提出了一种有效的实验串联质谱和多肽的理论质谱的匹配打分算法.该算法更大程度上从多肽串联质谱中获得蛋白质的结构信息.实验结果表明,ITPIA算法有效地提高了蛋白质鉴定的准确性.

关键词: 蛋白质鉴定, 串联质谱, 数据库搜索, 信息论, 匹配算法

Abstract: The ITPIA (information theory based protein identification algorithm) based on information theory is proposed to search protein database by use of polypeptide tandem mass spectra for more exact protein identification. To solve efficiently the problem of low-quality and noisy spectra, the entropy theory is introduced in the ITPIA with a scoring system built to measure the similarity between the experimental spectrum and the theoretical spectrum of polypeptide in the database. The algorithm thus acquires more information on protein structure via the polypeptide tandem mass spectra. Experimental results showed that ITPIA algorithm can effectively improve the exactness of protein identification.

中图分类号: