Parallel Approximate Multi-Pattern Matching on Heterogeneous Cluster Systems

Cheng Zhong, Zeng Fan, Defu Su
{"title":"Parallel Approximate Multi-Pattern Matching on Heterogeneous Cluster Systems","authors":"Cheng Zhong, Zeng Fan, Defu Su","doi":"10.1109/PDCAT.2008.23","DOIUrl":null,"url":null,"abstract":"For the given multiple patterns and a text string, firstly, a perfect hash function is constructed, the patterns are transformed into the unique pairs of integer values in parallel by the perfect hash function, the corresponding integer values are stored in a global hash table, and a recursion expression for computing hash function value of the signatures of each sub-string of text is also proposed. Secondly, based on divisible load principle, a linear programming model for the optimal text distribution strategy is created and a parallel approximate multi-pattern matching algorithm allowing one error is presented on the heterogeneous cluster system which processors have different computing speeds and distinct communication capabilities and different memory sizes by taking into account computation and communication startup time and using the assigned processor distribution order. The experimental results on the cluster system of heterogeneous personal computers show that the presented parallel algorithm is averagely 25% faster than that one using the even text distribution strategy, and it obtains a nearly linear speedup and good scalability.","PeriodicalId":282779,"journal":{"name":"2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PDCAT.2008.23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

For the given multiple patterns and a text string, firstly, a perfect hash function is constructed, the patterns are transformed into the unique pairs of integer values in parallel by the perfect hash function, the corresponding integer values are stored in a global hash table, and a recursion expression for computing hash function value of the signatures of each sub-string of text is also proposed. Secondly, based on divisible load principle, a linear programming model for the optimal text distribution strategy is created and a parallel approximate multi-pattern matching algorithm allowing one error is presented on the heterogeneous cluster system which processors have different computing speeds and distinct communication capabilities and different memory sizes by taking into account computation and communication startup time and using the assigned processor distribution order. The experimental results on the cluster system of heterogeneous personal computers show that the presented parallel algorithm is averagely 25% faster than that one using the even text distribution strategy, and it obtains a nearly linear speedup and good scalability.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
异构集群系统的并行近似多模式匹配
对于给定的多个模式和一个文本字符串,首先构造一个完美哈希函数,通过完美哈希函数将模式并行转换为唯一的整数值对,将对应的整数值存储在全局哈希表中,并提出了计算文本各子字符串签名哈希函数值的递归表达式。其次,基于可分负载原理,建立了最优文本分发策略的线性规划模型,并在处理器计算速度和通信能力不同、内存大小不同的异构集群系统上,考虑计算和通信启动时间,采用指定的处理器分布顺序,提出了一种允许误差的并行近似多模式匹配算法。在异构个人计算机集群系统上的实验结果表明,本文提出的并行算法比采用均匀文本分布策略的并行算法平均快25%,并且获得了近似线性的加速和良好的可扩展性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Case Studies in Computer Network Measurement Advances in the ProGenGrid Workflow Management System Finding Interaction Partners Using Attitude-Based Decision Strategies Agent Migration and Communication in WSNs Portable Object Thermal Awareness: Modeling Intelligent Sensor Networks for Cool Store Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1