PACHECO-BAUTISTA Daniel, CARREÑO-AGUILERA Ricardo, ALGREDO-BADILLO Ignacio, PATIÑO-ORTIZ Miguel
{"title":"HARDWARE ACCELERATION OF DNA READ ALIGNMENT PROGRAMS: CHALLENGES AND OPPORTUNITIES","authors":"PACHECO-BAUTISTA Daniel, CARREÑO-AGUILERA Ricardo, ALGREDO-BADILLO Ignacio, PATIÑO-ORTIZ Miguel","doi":"10.1142/s0218348x23500974","DOIUrl":null,"url":null,"abstract":"The alignment or mapping of Deoxyribonucleic Acid (DNA) reads produced by the new massively parallel sequencing machines is a fundamental initial step in the DNA analysis process. DNA alignment consists of ordering millions of short nucleotide sequences called reads, using a previously sequenced genome as a reference, to reconstruct the genetic code of a species. Even with the efforts made in the development of new multi-stage alignment programs, based on sophisticated algorithms and new filtering heuristics, the execution times remain limiting for the development of various applications such as epigenetics and genomic medicine. This paper presents an overview of recent developments in the acceleration of DNA alignment programs, with special emphasis on those based on hardware, in particular Graphics Processing Units (GPUs), Field Programmable Gate Arrays (FPGAs), and Processing-in-Memory (PIM) devices. Unlike most of the works found in the literature, which review only the proposals that gradually emerged in some specific acceleration technology, this work analyzes the contemporary state of the subject in a more comprehensive way, covering from the conception of the problem, the modern sequencing technologies and the analysis of the structure of the new alignment programs, to the most innovative software and hardware acceleration techniques. The foregoing allows to clearly define, at the end of the paper, the trends, challenges and opportunities that still prevail in the field. We hope that this work will serve as a guide for the development of new and more sophisticated DNA alignment systems.","PeriodicalId":55144,"journal":{"name":"Fractals-Complex Geometry Patterns and Scaling in Nature and Society","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fractals-Complex Geometry Patterns and Scaling in Nature and Society","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1142/s0218348x23500974","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
The alignment or mapping of Deoxyribonucleic Acid (DNA) reads produced by the new massively parallel sequencing machines is a fundamental initial step in the DNA analysis process. DNA alignment consists of ordering millions of short nucleotide sequences called reads, using a previously sequenced genome as a reference, to reconstruct the genetic code of a species. Even with the efforts made in the development of new multi-stage alignment programs, based on sophisticated algorithms and new filtering heuristics, the execution times remain limiting for the development of various applications such as epigenetics and genomic medicine. This paper presents an overview of recent developments in the acceleration of DNA alignment programs, with special emphasis on those based on hardware, in particular Graphics Processing Units (GPUs), Field Programmable Gate Arrays (FPGAs), and Processing-in-Memory (PIM) devices. Unlike most of the works found in the literature, which review only the proposals that gradually emerged in some specific acceleration technology, this work analyzes the contemporary state of the subject in a more comprehensive way, covering from the conception of the problem, the modern sequencing technologies and the analysis of the structure of the new alignment programs, to the most innovative software and hardware acceleration techniques. The foregoing allows to clearly define, at the end of the paper, the trends, challenges and opportunities that still prevail in the field. We hope that this work will serve as a guide for the development of new and more sophisticated DNA alignment systems.
期刊介绍:
The investigation of phenomena involving complex geometry, patterns and scaling has gone through a spectacular development and applications in the past decades. For this relatively short time, geometrical and/or temporal scaling have been shown to represent the common aspects of many processes occurring in an unusually diverse range of fields including physics, mathematics, biology, chemistry, economics, engineering and technology, and human behavior. As a rule, the complex nature of a phenomenon is manifested in the underlying intricate geometry which in most of the cases can be described in terms of objects with non-integer (fractal) dimension. In other cases, the distribution of events in time or various other quantities show specific scaling behavior, thus providing a better understanding of the relevant factors determining the given processes.
Using fractal geometry and scaling as a language in the related theoretical, numerical and experimental investigations, it has been possible to get a deeper insight into previously intractable problems. Among many others, a better understanding of growth phenomena, turbulence, iterative functions, colloidal aggregation, biological pattern formation, stock markets and inhomogeneous materials has emerged through the application of such concepts as scale invariance, self-affinity and multifractality.
The main challenge of the journal devoted exclusively to the above kinds of phenomena lies in its interdisciplinary nature; it is our commitment to bring together the most recent developments in these fields so that a fruitful interaction of various approaches and scientific views on complex spatial and temporal behaviors in both nature and society could take place.