DNA sequence alignment: An assignment for OpenMP, MPI, and CUDA/OpenCL

Arturo Gonzalez-EscribanoUniversidad de Valladolid, Spain, Diego García-ÁlvarezUniversidad de Valladolid, Spain, Jesús CámaraUniversidad de Valladolid, Spain
{"title":"DNA sequence alignment: An assignment for OpenMP, MPI, and CUDA/OpenCL","authors":"Arturo Gonzalez-EscribanoUniversidad de Valladolid, Spain, Diego García-ÁlvarezUniversidad de Valladolid, Spain, Jesús CámaraUniversidad de Valladolid, Spain","doi":"arxiv-2409.06075","DOIUrl":null,"url":null,"abstract":"We present an assignment for a full Parallel Computing course. Since\n2017/2018, we have proposed a different problem each academic year to\nillustrate various methodologies for approaching the same computational problem\nusing different parallel programming models. They are designed to be\nparallelized using shared-memory programming with OpenMP, distributed-memory\nprogramming with MPI, and GPU programming with CUDA or OpenCL. The problem\nchosen for this year implements a brute-force solution for exact DNA sequence\nalignment of multiple patterns. The program searches for exact coincidences of\nmultiple nucleotide strings in a long DNA sequence. The sequential\nimplementation is designed to be clear and understandable to students while\noffering many opportunities for parallelization and optimization. This\nassignment addresses key concepts many students find difficult to apply in\npractical scenarios: race conditions, reductions, collective operations, and\npoint-to-point communications. It also covers the problem of parallel\ngeneration of pseudo-random sequences and strategies to notify and stop\nspeculative computations when matches are found. This assignment serves as an\nexercise that reinforces basic knowledge and prepares students for more complex\nparallel computing concepts and structures. It has been successfully\nimplemented as a practical assignment in a Parallel Computing course in the\nthird year of a Computer Engineering degree program. Supporting materials for\nthis and previous assignments in this series are publicly available.","PeriodicalId":501422,"journal":{"name":"arXiv - CS - Distributed, Parallel, and Cluster Computing","volume":"80 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Distributed, Parallel, and Cluster Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We present an assignment for a full Parallel Computing course. Since 2017/2018, we have proposed a different problem each academic year to illustrate various methodologies for approaching the same computational problem using different parallel programming models. They are designed to be parallelized using shared-memory programming with OpenMP, distributed-memory programming with MPI, and GPU programming with CUDA or OpenCL. The problem chosen for this year implements a brute-force solution for exact DNA sequence alignment of multiple patterns. The program searches for exact coincidences of multiple nucleotide strings in a long DNA sequence. The sequential implementation is designed to be clear and understandable to students while offering many opportunities for parallelization and optimization. This assignment addresses key concepts many students find difficult to apply in practical scenarios: race conditions, reductions, collective operations, and point-to-point communications. It also covers the problem of parallel generation of pseudo-random sequences and strategies to notify and stop speculative computations when matches are found. This assignment serves as an exercise that reinforces basic knowledge and prepares students for more complex parallel computing concepts and structures. It has been successfully implemented as a practical assignment in a Parallel Computing course in the third year of a Computer Engineering degree program. Supporting materials for this and previous assignments in this series are publicly available.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
DNA 序列比对:为 OpenMP、MPI 和 CUDA/OpenCL 分配任务
我们介绍了一门完整的并行计算课程的作业。自 2017/2018 学年以来,我们每学年都会提出一个不同的问题,以展示使用不同并行编程模型处理同一计算问题的各种方法。这些问题可以使用 OpenMP 进行共享内存编程,使用 MPI 进行分布式内存编程,使用 CUDA 或 OpenCL 进行 GPU 编程。今年选择的问题是实现多模式 DNA 序列精确配对的暴力解法。该程序搜索长 DNA 序列中多个核苷酸字符串的精确重合点。程序的顺序实现设计得清晰易懂,同时提供了许多并行化和优化的机会。本作业涉及许多学生认为难以在实际场景中应用的关键概念:竞赛条件、还原、集体操作和点对点通信。它还涉及伪随机序列的并行生成问题,以及在发现匹配时通知和停止累加计算的策略。本作业可作为强化基础知识的练习,为学生学习更全面的并行计算概念和结构做好准备。该作业作为计算机工程学位课程三年级并行计算课程的实践作业已成功实施。本系列作业及以前作业的辅助材料均可公开获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Massively parallel CMA-ES with increasing population Communication Lower Bounds and Optimal Algorithms for Symmetric Matrix Computations Energy Efficiency Support for Software Defined Networks: a Serverless Computing Approach CountChain: A Decentralized Oracle Network for Counting Systems Delay Analysis of EIP-4844
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1