东北大学学报:自然科学版 ›› 2019, Vol. 40 ›› Issue (5): 609-613.DOI: 10.12068/j.issn.1005-3026.2019.05.001

• 信息与控制 •    下一篇

基于频率的Read Mapping种子选择算法

马海涛, 祁实, 于长永, 赵宇海   

  1. (东北大学 计算机科学与工程学院, 辽宁 沈阳110169)
  • 收稿日期:2018-03-26 修回日期:2018-03-26 出版日期:2019-05-15 发布日期:2019-05-17
  • 通讯作者: 马海涛
  • 作者简介:马海涛(1977-),男,河北昌黎人,东北大学讲师,博士.
  • 基金资助:
    国家自然科学基金资助项目(61772124, 61332014, 61401080, 61402087); 河北省自然科学基金资助项目(F2015501049); 河北省教育厅项目(QN2014339); 中央高校基本科研业务费专项资金资助项目(N150402002).

Frequency-Based Seed Selection Algorithm for Read Mapping

MA Hai-tao, QI Shi,YU Chang-yong, ZHAO Yu-hai   

  1. School of Computer Science & Engineering, Northeastern University, Shenyang 110169, China.
  • Received:2018-03-26 Revised:2018-03-26 Online:2019-05-15 Published:2019-05-17
  • Contact: MA Hai-tao
  • About author:-
  • Supported by:
    -

摘要: 选择具有最低频率的最优种子是一个复杂的计算问题,往往需要很长时间.提出了一种read的基于频率的合并种子选择算法(FMSS),该算法能够高效地选择接近最优的种子集合,可用于改善现有映射工具的性能.实验对比了平均种子选择方法和当前最优的种子选择策略(OSS,optimal seed solver),结果显示FMSS算法能够用很少的时间代价给出接近OSS的最优种子集合,这表明FMSS算法可集成到现有映射工具中用于处理更大规模的read mapping问题.

关键词: 种子频率, 读取映射, 平均种子, 频率合并, 最优种子

Abstract: The selection of the optimal seed(that is, the seed with the lowest frequency)is a complex calculation problem, which often takes a long time. A frequency-based merge seed selection(FMSS) algorithm is proposed, which can efficiently select the suboptimal set of seeds and improve the performance of existing mapping tools. In the experiment, FMSS was compared with the average seed selection method and the optimal seed solver(OSS). Experimental results show that FMSS can select the optimal set of seeds close to OSS, and the time cost of FMSS is far lower than that of the OSS algorithm. The FMSS algorithm is more suitable for seed selection in terms of time cost and seed selection quality.

Key words: seed frequency, read-mapping, average seed;frequency merge, optimal seed

中图分类号: