Multi-core Parallel Substring Matching Algorithm Using BWT (采用BWT的多核并行的子串匹配算法)

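The P-BWT exact matching algorithm supports only short query strings and runs only on a single processor. To address both limitations, this work proposes a multi-core parallel exact matching algorithm that supports queries of arbitrary length. The query procedure on the P-BWT index is modified: when a query string spans multiple data segments, the search is first performed on the last segment it matches, and the match is then verified on the preceding segments in turn. A multi-core parallel query algorithm is further proposed to reduce the number of iterations in the search and verification process. Experimental results show that the proposed algorithm completes substring matching tasks efficiently in parallel.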
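The search-then-verify idea in the abstract can be illustrated with a minimal sketch. This is not the authors' P-BWT implementation: the fixed-size segmentation, the naive per-segment BWT index (`SegmentIndex`), the helper names `ends_in_segment` and `find_all`, and the choice to parallelize over segments with a thread pool are all assumptions made for illustration. The text is assumed not to contain the sentinel character `"\0"`.

```python
from concurrent.futures import ThreadPoolExecutor


class SegmentIndex:
    """Toy BWT index over one segment (naive construction, illustration only)."""

    def __init__(self, text):
        s = text + "\0"  # sentinel, sorts before every other character
        self.sa = sorted(range(len(s)), key=lambda i: s[i:])
        self.bwt = [s[i - 1] for i in self.sa]
        self.c = {}  # c[ch] = number of characters in s smaller than ch
        total = 0
        for ch in sorted(set(s)):
            self.c[ch] = total
            total += s.count(ch)

    def _occ(self, ch, i):
        # Naive rank: occurrences of ch in bwt[:i].
        return self.bwt[:i].count(ch)

    def locate(self, pattern):
        """Backward search; returns sorted start offsets of pattern in the segment."""
        lo, hi = 0, len(self.bwt)
        for ch in reversed(pattern):
            if ch not in self.c:
                return []
            lo = self.c[ch] + self._occ(ch, lo)
            hi = self.c[ch] + self._occ(ch, hi)
            if lo >= hi:
                return []
        return sorted(self.sa[lo:hi])


def ends_in_segment(segments, indexes, j, query, block):
    """Global start offsets of occurrences of query whose last character lies in segment j."""
    hits = [j * block + p for p in indexes[j].locate(query)]  # fully inside segment j
    for m in range(1, len(query)):
        # Search step: the last m characters must be a prefix of segment j.
        if 0 not in indexes[j].locate(query[-m:]):
            continue
        # Verify step: check the remaining prefix on preceding segments, right to left.
        rest, k, start = query[:-m], j - 1, None
        while rest and k >= 0:
            seg = segments[k]
            if len(rest) <= len(seg):
                offset = len(seg) - len(rest)
                if offset in indexes[k].locate(rest):
                    start = k * block + offset
                break
            if not rest.endswith(seg):  # a whole segment must match exactly
                break
            rest, k = rest[:-len(seg)], k - 1
        if start is not None:
            hits.append(start)
    return hits


def find_all(text, query, block=4, workers=4):
    """All start offsets of query in text, one task per candidate last segment."""
    if not query:
        return []
    segments = [text[i:i + block] for i in range(0, len(text), block)]
    indexes = [SegmentIndex(s) for s in segments]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = pool.map(
            lambda j: ends_in_segment(segments, indexes, j, query, block),
            range(len(segments)),
        )
        return sorted(p for part in parts for p in part)
```

For example, `find_all("abracadabra", "racadab", block=4)` searches for a query longer than any single segment by matching its tail at the start of the last segment and verifying the rest on the two segments before it. In CPython, threads mainly illustrate the task decomposition; a process pool or a compiled language would be needed for real multi-core speedups.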

Bibliographic Details
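Published in Journal of Northeastern University (Natural Science) (东北大学学报(自然科学版)) Vol. 37; no. 5; pp. 624-628
Main Author WANG Jia-ying (王佳英); WANG Bin (王斌); LI Xiao-hua (李晓华); YANG Xiao-chun (杨晓春)
Format Journal Article
Language Chinese
Published School of Computer Science & Engineering, Northeastern University, Shenyang 110819, Liaoning, China, 2016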
Author WANG Jia-ying (王佳英); WANG Bin (王斌); LI Xiao-hua (李晓华); YANG Xiao-chun (杨晓春)
AuthorAffiliation School of Computer Science & Engineering, Northeastern University, Shenyang 110819, Liaoning, China
ClassificationCodes TP311.13
ContentType Journal Article
Copyright Copyright © Wanfang Data Co. Ltd. All Rights Reserved.
DOI 10.3969/j.issn.1005-3026.2016.05.004
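DatabaseName VIP Journal Resource Integration Service Platform (维普期刊资源整合服务平台)
China Science and Technology Journal Database, CALIS site (中文科技期刊数据库-CALIS站点)
China Science and Technology Journal Database, 7.0 platform (中文科技期刊数据库-7.0平台)
China Science and Technology Journal Database, mirror site (中文科技期刊数据库- 镜像站点)
Wanfang Data Journals - Hong Kong
WANFANG Data Centre
Wanfang Data Journals
China Online Journals (COJ)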
Discipline Engineering
DocumentTitleAlternate Multi-core Parallel Substring Matching Algorithm Using BWT
EndPage 628
ExternalDocumentID dbdxxb201605004
668879732
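GrantInformation_xml – fundername: National Natural Science Foundation of China; Specialized Research Fund for the Doctoral Program of Higher Education (Ministry of Education)
  funderid: (61322208, 61272178, 61129002, 61572122, 61532021); (20110042110028)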
ISSN 1005-3026
IngestDate Thu May 29 03:59:14 EDT 2025
Wed Feb 14 10:19:32 EST 2024
IsPeerReviewed false
IsScholarly true
Issue 5
Keywords multi-core
parallel
exact matching
full text index
BWT (Burrows-Wheeler transform)
Language Chinese
Notes To address the limitations that P-BWT (BWT: Burrows-Wheeler transform) supports only short queries and works only on a uniprocessor, a multi-core parallel exact matching algorithm is proposed that supports queries of any length. First, the search process on the P-BWT index is modified: when a query spans multiple data segments, it is searched first on the last segment and then verified on the other segments. Further, a parallel algorithm is proposed to reduce the number of iterations in the search-and-verify process. Finally, experimental results show that the proposed algorithm accomplishes the substring matching task efficiently in parallel.
BWT (Burrows-Wheeler transform); full text index; exact matching; parallel; multi-core
21-1344/T
WANG Jia-ying, WANG Bin, LI Xiao-hua, YANG Xiao-chun (School of Computer Science & Engineering, Northeastern University, Shenyang 110819, China)
PageCount 5
PublicationCentury 2000
PublicationDate 2016
PublicationDateYYYYMMDD 2016-01-01
PublicationDate_xml – year: 2016
  text: 2016
PublicationDecade 2010
PublicationTitle 东北大学学报(自然科学版)
PublicationTitleAlternate Journal of Northeastern University (Natural Science)
PublicationYear 2016
Publisher School of Computer Science & Engineering, Northeastern University, Shenyang 110819, Liaoning, China
StartPage 624
SubjectTerms BWT
full text index
multi-core
parallel
exact matching
Title Multi-core Parallel Substring Matching Algorithm Using BWT (采用BWT的多核并行的子串匹配算法)
URI http://lib.cqvip.com/qk/90188A/201605/668879732.html
https://d.wanfangdata.com.cn/periodical/dbdxxb201605004
Volume 37