Journal of Northeastern University (Natural Science) ›› 2017, Vol. 38 ›› Issue (6): 798-803. DOI: 10.12068/j.issn.1005-3026.2017.06.008

• Information and Control •

Tumor Microarray Gene Expression Data Classification Based on Weighted Extreme Learning Machine

JIANG Lin-ying, YU Dong-hai, SHI Xin

  1. School of Software, Northeastern University, Shenyang 110169, China.
  • Received: 2015-12-30 Revised: 2015-12-30 Online: 2017-06-15 Published: 2017-06-11
  • Contact: JIANG Lin-ying
  • About author: JIANG Lin-ying (1972-), female, born in Longkou, Shandong; associate professor at Northeastern University.
  • Supported by:

    National Natural Science Foundation of China (61272176).

Abstract:

With the development of gene microarray technology, gene expression profiling has become an important method for identifying different types of cancer. Microarray gene expression data generally come from clinical trials, in which the class distribution of the samples is uncertain, so the expression data are often markedly imbalanced. In this paper, the weighted extreme learning machine (WELM) is used to classify imbalanced microarray gene expression data. To reduce the classification error caused by the imbalance, a weight is assigned to each sample to strengthen the influence of the minority class and weaken that of the majority class, thereby improving the accuracy of tumor classification. The experimental results show that the proposed method improves the recognition rate of the minority class and thus the overall performance of the classifier.

Key words: gene, microarray expression data, weighted extreme learning machine (WELM), imbalance, tumor classification
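
The per-sample weighting described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not state the weighting formula, so the inverse-class-size weight, the sigmoid hidden layer, the regularized least-squares solution, and the names `sample_weights`, `train_welm`, `predict_welm`, `n_hidden`, and `C` are all assumptions chosen for illustration.

```python
import numpy as np


def sample_weights(y):
    """Assumed scheme: each sample is weighted by 1 / (size of its class)."""
    _, y_idx, counts = np.unique(y, return_inverse=True, return_counts=True)
    return 1.0 / counts[y_idx]


def _hidden(X, W_in, b):
    # Sigmoid activations of the random hidden layer.
    return 1.0 / (1.0 + np.exp(-(X @ W_in + b)))


def train_welm(X, y, n_hidden=50, C=1.0, seed=0):
    """Fit a weighted ELM sketch; returns (W_in, b, beta, classes)."""
    rng = np.random.default_rng(seed)
    classes, y_idx = np.unique(y, return_inverse=True)
    n_samples = X.shape[0]
    w = sample_weights(y)

    # One-vs-all targets in {-1, +1}.
    T = -np.ones((n_samples, classes.size))
    T[np.arange(n_samples), y_idx] = 1.0

    # Hidden-layer parameters are drawn at random and never trained.
    W_in = rng.normal(size=(X.shape[1], n_hidden))
    b = rng.normal(size=n_hidden)
    H = _hidden(X, W_in, b)

    # Weighted ridge solution: beta = (I/C + H^T W H)^-1 H^T W T,
    # where W is the diagonal matrix of per-sample weights.
    HtW = H.T * w  # equivalent to H.T @ np.diag(w)
    beta = np.linalg.solve(np.eye(n_hidden) / C + HtW @ H, HtW @ T)
    return W_in, b, beta, classes


def predict_welm(model, X):
    """Predict the class whose output node has the largest score."""
    W_in, b, beta, classes = model
    return classes[np.argmax(_hidden(X, W_in, b) @ beta, axis=1)]
```

Under this assumed scheme, the weights within each class sum to 1, so a class with 5 samples contributes as much to the training loss as a class with 40. For expression data, `X` would be a samples-by-genes matrix and `y` the tumor labels; both are placeholders here, not data from the paper.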

CLC Number: