并行后缀树的构造及查询算法

doi:-

东北大学学报(自然科学版) ›› 2004, Vol. 25 ›› Issue (3): 231-234.DOI: -

并行后缀树的构造及查询算法

乔百友;葛健;王国仁;韩东红

东北大学信息科学与工程学院;东北大学信息科学与工程学院;东北大学信息科学与工程学院;东北大学信息科学与工程学院辽宁沈阳　110004

收稿日期:2013-06-24 修回日期:2013-06-24 出版日期:2004-03-15 发布日期:2013-06-24
通讯作者: Qiao, B.-Y.
作者简介:-
基金资助:
国家自然科学基金资助项目(60273079)

Parallel construction and inquiry algorithm of suffix trees

Qiao, Bai-You (1); Ge, Jian (1); Wang, Guo-Ren (1); Han, Dong-Hong (1)

(1) Sch. of Info. Sci. and Eng., Northeastern Univ., Shenyang 110004, China

Received:2013-06-24 Revised:2013-06-24 Online:2004-03-15 Published:2013-06-24
Contact: Qiao, B.-Y.
About author:-
Supported by:
-

摘要/Abstract

摘要： 针对生物信息领域中传统后缀树构造算法在时间和空间上的限制,从结构并行的角度提出了一种新颖的、适用于生物信息学应用的并行后缀树结构和相应的构造算法·该算法首先将给定字符串分成若干连续的片段,并在各个处理机上分别构造这些片段的后缀树,形成了一种分布于多个处理机上的并行后缀树结构·该并行算法不仅大大缩短了后缀树的构造时间,而且避免了主存大小的限制·经分析,其性能优于现有的任何一种并行算法·在此基础上,提出了一种高效的基于这种并行后缀树的字符串匹配算法,解决了传统后缀树的基本查询问题·

关键词: 后缀树, 并行构造, 字符串匹配, 生物序列, 生物信息学

Abstract: A parallel suffix tree constructing algorithm is proposed to get rid of traditional space/time restriction while using suffix trees in bioinformatics. In this algorithm, the given string is divided into several continuous substrings. Then, the suffix trees for every substring are constructed in parallel, thus forming a suffix tree structure distributed separately on several processors. This algorithm can not only reduce the time needed to construct suffix trees but also avoid the restriction of main memory. The performance analysis shows that this algorithm outperforms any of existing parallel algorithms. Based on such a suffix tree structure, an efficient pattern matching algorithm is also proposed for the inquiries about traditional suffix trees.

中图分类号:

乔百友;葛健;王国仁;韩东红. 并行后缀树的构造及查询算法[J]. 东北大学学报(自然科学版), 2004, 25(3): 231-234.

Qiao, Bai-You (1); Ge, Jian (1); Wang, Guo-Ren (1); Han, Dong-Hong (1) . Parallel construction and inquiry algorithm of suffix trees[J]. Journal of Northeastern University, 2004, 25(3): 231-234.

[1]	秦诗悦，周福才，柳璐. 基于后缀树的基因数据可搜索加密方法[J]. 东北大学学报:自然科学版, 2019, 40(4): 461-466.
[2]	宁一，王大志，江雪晨，张翠玲. 基于原子能量熵和CSM的配电线路故障分类方法[J]. 东北大学学报:自然科学版, 2017, 38(1): 1-5.
[3]	杨英华;李召;陈永禄;陈晓波;. 基于CVA-ICA与CSM的故障诊断方法[J]. 东北大学学报(自然科学版), 2012, 33(12): 1685-1689.
[4]	韩东红;王国仁;乔百友. XML路径表达式中公共子查询的优化技术[J]. 东北大学学报(自然科学版), 2005, 26(6): 535-537.

并行后缀树的构造及查询算法

Parallel construction and inquiry algorithm of suffix trees

RichHTML

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 4

编辑推荐

Metrics

本文评价