Emad S. Hassan, Ahmed M. Dessouky, Hesham Fathi, Gerges M. Salama, Ahmed S. Oshaba, Atef El-Emary, Fathi E. Abd El‑Samie
{"title":"Improved Hybrid Approach for Enhancing Protein Coding Regions Identification in DNA Sequences","authors":"Emad S. Hassan, Ahmed M. Dessouky, Hesham Fathi, Gerges M. Salama, Ahmed S. Oshaba, Atef El-Emary, Fathi E. Abd El‑Samie","doi":"10.2174/0115748936287244240117065325","DOIUrl":null,"url":null,"abstract":"Introduction: Identifying and predicting protein-coding regions within DNA sequences play a pivotal role in genomic research. This paper introduces an approach for identifying proteincoding regions in DNA sequences, employing a hybrid methodology that combines a digital bandpass filter with wavelet transforms and various spectral estimation techniques to enhance exon prediction. Specifically, the Haar and Daubechies wavelet transforms are applied to improve the accuracy of protein-coding region (exon) prediction, enabling the extraction of intricate details that may be obscured in the original DNA sequences. background: The identification and prediction of protein-coding regions within DNA sequences play a pivotal role in genomic research. Methods: This research showcases the utility of Haar and Daubechies wavelet transforms, both nonparametric and parametric spectral estimation methods, and the deployment of a digital band pass filter for detecting peaks in exon regions. Additionally, the application of the Electron-Ion Interaction Potential (EIIP) method for converting symbolic DNA sequences into numerical values and the utilization of sum-of-sinusoids (SoS) mathematical models with optimized parameters further enrich the toolbox for DNA sequence analysis, ensuring the success of this proposed method in modeling DNA sequences optimally and accurately identifying genes. objective: Enhanced Protein-Coding Region Identification in DNA Sequences Using Wavelet Transforms Results: The outcomes of this approach showcase a substantial enhancement in identification accuracy for protein-coding regions. In terms of peak location detection, the application of Haar and Daubechies wavelet transforms enhances the accuracy of peak localization by approximately (0.01, 3-5 dB). When employing non-parametric and parametric spectral estimation techniques, there is an improvement in peak location by approximately (0.01, 4 dB) compared to the original signal. The proposed approach also achieves higher accuracy when compared with existing methods. method: hybrid methodology that combines a digital band-pass filter with wavelet transforms and various spectral estimation techniques to enhance exon prediction. Conclusion: These findings not only bridge gaps in DNA sequence analysis but also offer a promising pathway for advancing exonic region prediction and gene identification in genomics research. The hybrid methodology presented stands as a robust contribution to the evolving landscape of genomic analysis techniques. result: The results obtained through this proposed method demonstrate significantly improved identification accuracy. These findings offer a promising avenue for DNA sequence analysis, exonic region prediction, and gene identification.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":null,"pages":null},"PeriodicalIF":2.4000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2174/0115748936287244240117065325","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Identifying and predicting protein-coding regions within DNA sequences play a pivotal role in genomic research. This paper introduces an approach for identifying proteincoding regions in DNA sequences, employing a hybrid methodology that combines a digital bandpass filter with wavelet transforms and various spectral estimation techniques to enhance exon prediction. Specifically, the Haar and Daubechies wavelet transforms are applied to improve the accuracy of protein-coding region (exon) prediction, enabling the extraction of intricate details that may be obscured in the original DNA sequences. background: The identification and prediction of protein-coding regions within DNA sequences play a pivotal role in genomic research. Methods: This research showcases the utility of Haar and Daubechies wavelet transforms, both nonparametric and parametric spectral estimation methods, and the deployment of a digital band pass filter for detecting peaks in exon regions. Additionally, the application of the Electron-Ion Interaction Potential (EIIP) method for converting symbolic DNA sequences into numerical values and the utilization of sum-of-sinusoids (SoS) mathematical models with optimized parameters further enrich the toolbox for DNA sequence analysis, ensuring the success of this proposed method in modeling DNA sequences optimally and accurately identifying genes. objective: Enhanced Protein-Coding Region Identification in DNA Sequences Using Wavelet Transforms Results: The outcomes of this approach showcase a substantial enhancement in identification accuracy for protein-coding regions. In terms of peak location detection, the application of Haar and Daubechies wavelet transforms enhances the accuracy of peak localization by approximately (0.01, 3-5 dB). When employing non-parametric and parametric spectral estimation techniques, there is an improvement in peak location by approximately (0.01, 4 dB) compared to the original signal. The proposed approach also achieves higher accuracy when compared with existing methods. method: hybrid methodology that combines a digital band-pass filter with wavelet transforms and various spectral estimation techniques to enhance exon prediction. Conclusion: These findings not only bridge gaps in DNA sequence analysis but also offer a promising pathway for advancing exonic region prediction and gene identification in genomics research. The hybrid methodology presented stands as a robust contribution to the evolving landscape of genomic analysis techniques. result: The results obtained through this proposed method demonstrate significantly improved identification accuracy. These findings offer a promising avenue for DNA sequence analysis, exonic region prediction, and gene identification.
简介识别和预测 DNA 序列中的蛋白质编码区在基因组研究中起着举足轻重的作用。本文介绍了一种识别 DNA 序列中蛋白质编码区的方法,该方法采用了一种混合方法,将数字带通滤波器与小波变换和各种光谱估算技术相结合,以提高外显子预测能力。具体来说,该方法采用了哈尔和道贝奇斯小波变换来提高蛋白质编码区(外显子)预测的准确性,从而能够提取原始 DNA 序列中可能被掩盖的复杂细节:DNA 序列中蛋白质编码区的识别和预测在基因组研究中起着举足轻重的作用。方法:这项研究展示了哈尔和道贝奇斯小波变换、非参数和参数谱估计方法的实用性,以及数字带通滤波器在检测外显子区域峰值方面的应用。此外,应用电子-离子相互作用势(EIIP)方法将符号 DNA 序列转换为数值,以及利用具有优化参数的总和-正弦曲线(SoS)数学模型,进一步丰富了 DNA 序列分析工具箱,确保所提出的方法能够成功地对 DNA 序列进行优化建模并准确识别基因:利用小波变换加强 DNA 序列中蛋白质编码区的识别 结果:该方法的结果表明,蛋白质编码区的识别准确率大幅提高。在峰值位置检测方面,应用 Haar 和 Daubechies 小波变换可将峰值定位精度提高约 (0.01, 3-5 dB)。在采用非参数和参数频谱估计技术时,与原始信号相比,峰值定位精度提高了约 (0.01, 4 dB)。方法:将数字带通滤波器、小波变换和各种频谱估计技术相结合的混合方法,以提高外显子预测能力。结论:这些发现不仅弥补了 DNA 序列分析中的不足,还为基因组学研究中的外显子区域预测和基因鉴定提供了一条前景广阔的途径。所提出的混合方法是对不断发展的基因组分析技术的有力贡献:通过该方法获得的结果表明,识别的准确性显著提高。这些发现为 DNA 序列分析、外显子区域预测和基因鉴定提供了一条前景广阔的途径。
期刊介绍:
Current Bioinformatics aims to publish all the latest and outstanding developments in bioinformatics. Each issue contains a series of timely, in-depth/mini-reviews, research papers and guest edited thematic issues written by leaders in the field, covering a wide range of the integration of biology with computer and information science.
The journal focuses on advances in computational molecular/structural biology, encompassing areas such as computing in biomedicine and genomics, computational proteomics and systems biology, and metabolic pathway engineering. Developments in these fields have direct implications on key issues related to health care, medicine, genetic disorders, development of agricultural products, renewable energy, environmental protection, etc.