首页 > 最新文献

Bioinformatics and Biology Insights最新文献

英文 中文
A Literature Review: ECG-Based Models for Arrhythmia Diagnosis Using Artificial Intelligence Techniques. 文献综述:基于心电图的心律失常人工智能诊断模型。
IF 5.8 Q1 Mathematics Pub Date : 2023-01-01 DOI: 10.1177/11779322221149600
Abir Boulif, Bouchra Ananou, Mustapha Ouladsine, Stéphane Delliaux

In the health care and medical domain, it has been proven challenging to diagnose correctly many diseases with complicated and interferential symptoms, including arrhythmia. However, with the evolution of artificial intelligence (AI) techniques, the diagnosis and prognosis of arrhythmia became easier for the physicians and practitioners using only an electrocardiogram (ECG) examination. This review presents a synthesis of the studies conducted in the last 12 years to predict arrhythmia's occurrence by classifying automatically different heartbeat rhythms. From a variety of research academic databases, 40 studies were selected to analyze, among which 29 of them applied deep learning methods (72.5%), 9 of them addressed the problem with machine learning methods (22.5%), and 2 of them combined both deep learning and machine learning to predict arrhythmia (5%). Indeed, the use of AI for arrhythmia diagnosis is emerging in literature, although there are some challenging issues, such as the explicability of the Deep Learning methods and the computational resources needed to achieve high performance. However, with the continuous development of cloud platforms and quantum calculation for AI, we can achieve a breakthrough in arrhythmia diagnosis.

在卫生保健和医学领域,正确诊断许多具有复杂和干扰症状的疾病,包括心律失常,已被证明是具有挑战性的。然而,随着人工智能(AI)技术的发展,心律失常的诊断和预后对医生和从业人员来说变得更加容易,仅使用心电图(ECG)检查。本文综述了过去12年来通过自动分类不同的心跳节律来预测心律失常发生的综合研究。从各种研究学术数据库中选取40篇研究进行分析,其中应用深度学习方法的研究29篇(72.5%),利用机器学习方法解决问题的研究9篇(22.5%),结合深度学习和机器学习预测心律失常的研究2篇(5%)。事实上,人工智能在心律失常诊断中的应用正在文献中出现,尽管存在一些具有挑战性的问题,例如深度学习方法的可解释性和实现高性能所需的计算资源。但随着人工智能的云平台和量子计算的不断发展,我们可以在心律失常诊断方面取得突破。
{"title":"A Literature Review: ECG-Based Models for Arrhythmia Diagnosis Using Artificial Intelligence Techniques.","authors":"Abir Boulif,&nbsp;Bouchra Ananou,&nbsp;Mustapha Ouladsine,&nbsp;Stéphane Delliaux","doi":"10.1177/11779322221149600","DOIUrl":"https://doi.org/10.1177/11779322221149600","url":null,"abstract":"<p><p>In the health care and medical domain, it has been proven challenging to diagnose correctly many diseases with complicated and interferential symptoms, including arrhythmia. However, with the evolution of artificial intelligence (AI) techniques, the diagnosis and prognosis of arrhythmia became easier for the physicians and practitioners using only an electrocardiogram (ECG) examination. This review presents a synthesis of the studies conducted in the last 12 years to predict arrhythmia's occurrence by classifying automatically different heartbeat rhythms. From a variety of research academic databases, 40 studies were selected to analyze, among which 29 of them applied deep learning methods (72.5%), 9 of them addressed the problem with machine learning methods (22.5%), and 2 of them combined both deep learning and machine learning to predict arrhythmia (5%). Indeed, the use of AI for arrhythmia diagnosis is emerging in literature, although there are some challenging issues, such as the explicability of the Deep Learning methods and the computational resources needed to achieve high performance. However, with the continuous development of cloud platforms and quantum calculation for AI, we can achieve a breakthrough in arrhythmia diagnosis.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/2c/a1/10.1177_11779322221149600.PMC9926384.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9291423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Unraveling the Mechanism of Immunity and Inflammation Related to Molecular Signatures Crosstalk Among Obesity, T2D, and AD: Insights From Bioinformatics Approaches. 揭示肥胖、T2D和AD分子信号串扰的免疫和炎症机制:来自生物信息学方法的见解。
IF 5.8 Q1 Mathematics Pub Date : 2023-01-01 DOI: 10.1177/11779322231167977
Kumar Vishal, Piplu Bhuiyan, Junxia Qi, Yang Chen, Jubiao Zhang, Fen Yang, Juxue Li

Individuals with type 2 diabetes (T2D) and obesity have a higher risk of developing Alzheimer disease (AD), and increasing evidence indicates a link between impaired immune signaling pathways and the development of AD. However, the shared cellular mechanisms and molecular signatures among these 3 diseases remain unknown. The purpose of this study was to uncover similar molecular markers and pathways involved in obesity, T2D, and AD using bioinformatics and a network biology approach. First, we investigated the 3 RNA sequencing (RNA-seq) gene expression data sets and determined 224 commonly shared differentially expressed genes (DEGs) from obesity, T2D, and AD diseases. Gene ontology and pathway enrichment analyses revealed that mutual DEGs were mainly enriched with immune and inflammatory signaling pathways. In addition, we constructed a protein-protein interactions network for finding hub genes, which have not previously been identified as playing a critical role in these 3 diseases. Furthermore, the transcriptional factors and protein kinases regulating commonly shared DEGs among obesity, T2D, and AD were also identified. Finally, we suggested potential drug candidates as possible therapeutic interventions for 3 diseases. The results of this bioinformatics analysis provided a new understanding of the potential links between obesity, T2D, and AD pathologies.

患有2型糖尿病(T2D)和肥胖的个体患阿尔茨海默病(AD)的风险更高,越来越多的证据表明免疫信号通路受损与AD的发展之间存在联系。然而,这三种疾病共有的细胞机制和分子特征尚不清楚。本研究的目的是利用生物信息学和网络生物学方法揭示与肥胖、T2D和AD相关的类似分子标记和途径。首先,我们研究了3rna测序(RNA-seq)基因表达数据集,确定了肥胖、T2D和AD疾病共有的224个差异表达基因(DEGs)。基因本体和途径富集分析显示,相互deg主要富集免疫和炎症信号通路。此外,我们构建了蛋白质-蛋白质相互作用网络,以寻找先前未被确定在这3种疾病中发挥关键作用的枢纽基因。此外,还鉴定了肥胖、T2D和AD中共同调节deg的转录因子和蛋白激酶。最后,我们提出了潜在的候选药物作为3种疾病可能的治疗干预措施。这项生物信息学分析的结果为肥胖、T2D和AD病理之间的潜在联系提供了新的认识。
{"title":"Unraveling the Mechanism of Immunity and Inflammation Related to Molecular Signatures Crosstalk Among Obesity, T2D, and AD: Insights From Bioinformatics Approaches.","authors":"Kumar Vishal,&nbsp;Piplu Bhuiyan,&nbsp;Junxia Qi,&nbsp;Yang Chen,&nbsp;Jubiao Zhang,&nbsp;Fen Yang,&nbsp;Juxue Li","doi":"10.1177/11779322231167977","DOIUrl":"https://doi.org/10.1177/11779322231167977","url":null,"abstract":"<p><p>Individuals with type 2 diabetes (T2D) and obesity have a higher risk of developing Alzheimer disease (AD), and increasing evidence indicates a link between impaired immune signaling pathways and the development of AD. However, the shared cellular mechanisms and molecular signatures among these 3 diseases remain unknown. The purpose of this study was to uncover similar molecular markers and pathways involved in obesity, T2D, and AD using bioinformatics and a network biology approach. First, we investigated the 3 RNA sequencing (RNA-seq) gene expression data sets and determined 224 commonly shared differentially expressed genes (DEGs) from obesity, T2D, and AD diseases. Gene ontology and pathway enrichment analyses revealed that mutual DEGs were mainly enriched with immune and inflammatory signaling pathways. In addition, we constructed a protein-protein interactions network for finding hub genes, which have not previously been identified as playing a critical role in these 3 diseases. Furthermore, the transcriptional factors and protein kinases regulating commonly shared DEGs among obesity, T2D, and AD were also identified. Finally, we suggested potential drug candidates as possible therapeutic interventions for 3 diseases. The results of this bioinformatics analysis provided a new understanding of the potential links between obesity, T2D, and AD pathologies.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/c2/a4/10.1177_11779322231167977.PMC10134115.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9386555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Normalization of Large-Scale Transcriptome Data Using Heuristic Methods. 使用启发式方法的大规模转录组数据规范化。
IF 5.8 Q1 Mathematics Pub Date : 2023-01-01 DOI: 10.1177/11779322231160397
Arthur Yosef, Eli Shnaider, Moti Schneider, Michael Gurevich

In this study, we introduce an artificial intelligent method for addressing the batch effect of a transcriptome data. The method has several clear advantages in comparison with the alternative methods presently in use. Batch effect refers to the discrepancy in gene expression data series, measured under different conditions. While the data from the same batch (measurements performed under the same conditions) are compatible, combining various batches into 1 data set is problematic because of incompatible measurements. Therefore, it is necessary to perform correction of the combined data (normalization), before performing biological analysis. There are numerous methods attempting to correct data set for batch effect. These methods rely on various assumptions regarding the distribution of the measurements. Forcing the data elements into pre-supposed distribution can severely distort biological signals, thus leading to incorrect results and conclusions. As the discrepancy between the assumptions regarding the data distribution and the actual distribution is wider, the biases introduced by such "correction methods" are greater. We introduce a heuristic method to reduce batch effect. The method does not rely on any assumptions regarding the distribution and the behavior of data elements. Hence, it does not introduce any new biases in the process of correcting the batch effect. It strictly maintains the integrity of measurements within the original batches.

在这项研究中,我们介绍了一种人工智能方法来解决转录组数据的批量效应。与目前使用的替代方法相比,该方法有几个明显的优点。批效应是指在不同条件下测量的基因表达数据序列的差异。虽然来自同一批次(在相同条件下进行的测量)的数据是兼容的,但由于测量结果不兼容,将不同批次组合成一个数据集是有问题的。因此,在进行生物分析之前,有必要对组合数据进行校正(归一化)。有许多方法试图纠正数据集的批处理效果。这些方法依赖于关于测量分布的各种假设。强迫数据元素进入预先假定的分布会严重扭曲生物信号,从而导致不正确的结果和结论。由于对数据分布的假设与实际分布的差异越大,这种“校正方法”引入的偏差也越大。我们引入了一种启发式方法来减少批处理效应。该方法不依赖于关于数据元素的分布和行为的任何假设。因此,它不会在修正批效应的过程中引入任何新的偏差。严格保持原始批次测量的完整性。
{"title":"Normalization of Large-Scale Transcriptome Data Using Heuristic Methods.","authors":"Arthur Yosef,&nbsp;Eli Shnaider,&nbsp;Moti Schneider,&nbsp;Michael Gurevich","doi":"10.1177/11779322231160397","DOIUrl":"https://doi.org/10.1177/11779322231160397","url":null,"abstract":"<p><p>In this study, we introduce an artificial intelligent method for addressing the batch effect of a transcriptome data. The method has several clear advantages in comparison with the alternative methods presently in use. Batch effect refers to the discrepancy in gene expression data series, measured under different conditions. While the data from the same batch (measurements performed under the same conditions) are compatible, combining various batches into 1 data set is problematic because of incompatible measurements. Therefore, it is necessary to perform correction of the combined data (normalization), before performing biological analysis. There are numerous methods attempting to correct data set for batch effect. These methods rely on various assumptions regarding the distribution of the measurements. Forcing the data elements into pre-supposed distribution can severely distort biological signals, thus leading to incorrect results and conclusions. As the discrepancy between the assumptions regarding the data distribution and the actual distribution is wider, the biases introduced by such \"correction methods\" are greater. We introduce a heuristic method to reduce batch effect. The method does not rely on any assumptions regarding the distribution and the behavior of data elements. Hence, it does not introduce any new biases in the process of correcting the batch effect. It strictly maintains the integrity of measurements within the original batches.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/1a/e0/10.1177_11779322231160397.PMC10068970.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9612102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative In Silico Analysis and Functional Characterization of TANK-Binding Kinase 1-Binding Protein 1. TANK-Binding Kinase 1- binding Protein 1的比较硅分析与功能表征。
IF 5.8 Q1 Mathematics Pub Date : 2023-01-01 DOI: 10.1177/11779322231164828
Humaira Aziz Sawal, Shagufta Nighat, Tanzeela Safdar, Laiba Anees

Protein modelling plays a vital role in the drug discovery process. TANK-binding kinase 1-binding protein 1 is also called an adapter protein, which is encoded by gene TBK1 present in Homo sapiens. It is found in lungs, small intestine, leukocytes, heart, placenta, muscle, kidney, lower level of thymus, and brain. It has a number of protein-binding sites, to which TBK1 and IKBKE bind and perform different functions as immunomodulatory, antiproliferative, and antiviral innate immunity which release different types of interferons. Our study predicts the comparative model of 3-dimensional (3D) structure through different bioinformatics tools that will be helpful for further studies in future. The reactivity and stability of these proteins were evaluated physicochemically and through domain determination and prediction of secondary structure using bioinformatics methods such as ProtParam, Pfam, and SOPMA, respectively. Robetta, an ab initio approach, I-TASSER, and AlphaFold was used for 3D structure prediction, and the models were validated using the SAVESv6.0 (PROCHECK) server. Conclusively, the best 3D structure of TBK1-binding protein 1 was predicted using Robetta software. After unveiling the 3D structure of the novel protein, we concluded that this structure will help us to find out its role other than in antiviral innate immunity and by producing torsion in its 3D structure researchers will be able to detect either this protein is involved in any disease or not because according to previous studies it was not associated with any disease.

蛋白质建模在药物发现过程中起着至关重要的作用。TANK-binding kinase 1-binding protein 1也被称为适配蛋白(adapter protein),由智人体内存在的TBK1基因编码。见于肺、小肠、白细胞、心脏、胎盘、肌肉、肾脏、胸腺下层和大脑。它有许多蛋白质结合位点,TBK1和IKBKE结合并发挥不同的功能,如免疫调节、抗增殖和抗病毒先天免疫,释放不同类型的干扰素。我们的研究通过不同的生物信息学工具预测了三维(3D)结构的比较模型,这将有助于未来的进一步研究。这些蛋白的反应性和稳定性分别通过ProtParam、Pfam和SOPMA等生物信息学方法进行了物理化学评价,并通过结构域测定和二级结构预测进行了评价。采用Robetta、从头算法、I-TASSER和AlphaFold进行三维结构预测,并使用SAVESv6.0 (PROCHECK)服务器对模型进行验证。最后,利用Robetta软件预测tbk1结合蛋白1的最佳三维结构。在揭示了这种新蛋白的3D结构后,我们得出结论,这种结构将帮助我们找出它在抗病毒先天免疫之外的作用,通过在其3D结构中产生扭转,研究人员将能够检测出这种蛋白质是否与任何疾病有关,因为根据之前的研究,它与任何疾病无关。
{"title":"Comparative In Silico Analysis and Functional Characterization of TANK-Binding Kinase 1-Binding Protein 1.","authors":"Humaira Aziz Sawal,&nbsp;Shagufta Nighat,&nbsp;Tanzeela Safdar,&nbsp;Laiba Anees","doi":"10.1177/11779322231164828","DOIUrl":"https://doi.org/10.1177/11779322231164828","url":null,"abstract":"<p><p>Protein modelling plays a vital role in the drug discovery process. TANK-binding kinase 1-binding protein 1 is also called an adapter protein, which is encoded by gene <i>TBK1</i> present in <i>Homo sapiens.</i> It is found in lungs, small intestine, leukocytes, heart, placenta, muscle, kidney, lower level of thymus, and brain. It has a number of protein-binding sites, to which TBK1 and IKBKE bind and perform different functions as immunomodulatory, antiproliferative, and antiviral innate immunity which release different types of interferons. Our study predicts the comparative model of 3-dimensional (3D) structure through different bioinformatics tools that will be helpful for further studies in future. The reactivity and stability of these proteins were evaluated physicochemically and through domain determination and prediction of secondary structure using bioinformatics methods such as ProtParam, Pfam, and SOPMA, respectively. Robetta, an ab initio approach, I-TASSER, and AlphaFold was used for 3D structure prediction, and the models were validated using the SAVESv6.0 (PROCHECK) server. Conclusively, the best 3D structure of TBK1-binding protein 1 was predicted using Robetta software. After unveiling the 3D structure of the novel protein, we concluded that this structure will help us to find out its role other than in antiviral innate immunity and by producing torsion in its 3D structure researchers will be able to detect either this protein is involved in any disease or not because according to previous studies it was not associated with any disease.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/a2/f9/10.1177_11779322231164828.PMC10074619.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9641105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Targeted Gene Panel Sequencing Unveiled New Pathogenic Mutations in Patients With Breast Cancer. 靶向基因面板测序揭示了乳腺癌患者新的致病突变。
IF 5.8 Q1 Mathematics Pub Date : 2023-01-01 DOI: 10.1177/11779322231182054
Souad Kartti, El Mehdi Bouricha, Oumaima Zarrik, Youssef Aghlallou, Chaimaa Mounjid, Rachid ELJaoudi, Lahcen Belyamani, Azeddine Ibrahimi, Basma El Khannoussi

The increasing commercialization of new gene panels based on next-generation sequencing for clinical research has significantly improved our understanding of breast cancer genetics and has led to the discovery of new mutation variants. The study included 16 unselected Moroccan breast cancer patients tested with multi-gene panel (HEVA screen panel) using Illumina Miseq, followed by Sanger sequencing to validate the most relevant mutation. Mutational analysis revealed the presence of 13 mutations (11 single-nucleotide polymorphisms [SNPs] and 2 indels), and 6 of 11 identified SNPs were predicted as pathogenic. One of the 6 pathogenic mutations was c.7874G>C, a heterozygous SNP in HD-OB domain of BRCA2 gene, which led to the arginine to threonine change at codon 2625 of the protein. This work describes the first case of a patient with breast cancer harboring this pathogenic variant and analyzes its functional impact using molecular docking and molecular dynamics simulation. Further experimental investigations are needed to validate its pathogenicity and to verify its association with breast cancer.

基于下一代测序的临床研究新基因面板的日益商业化,大大提高了我们对乳腺癌遗传学的理解,并导致了新的突变变体的发现。该研究包括16名未选择的摩洛哥乳腺癌患者,使用Illumina Miseq进行多基因面板(HEVA筛选面板)测试,然后进行Sanger测序以验证最相关的突变。突变分析显示存在13个突变(11个单核苷酸多态性[SNPs]和2个indel),鉴定出的11个snp中有6个被预测为致病性。6个致病性突变之一是BRCA2基因HD-OB结构域的杂合SNP C . 7874g >C,导致该蛋白密码子2625处精氨酸向苏氨酸转变。本研究描述了第一例携带该致病变异的乳腺癌患者,并利用分子对接和分子动力学模拟分析了其功能影响。需要进一步的实验研究来证实其致病性并证实其与乳腺癌的关联。
{"title":"Targeted Gene Panel Sequencing Unveiled New Pathogenic Mutations in Patients With Breast Cancer.","authors":"Souad Kartti,&nbsp;El Mehdi Bouricha,&nbsp;Oumaima Zarrik,&nbsp;Youssef Aghlallou,&nbsp;Chaimaa Mounjid,&nbsp;Rachid ELJaoudi,&nbsp;Lahcen Belyamani,&nbsp;Azeddine Ibrahimi,&nbsp;Basma El Khannoussi","doi":"10.1177/11779322231182054","DOIUrl":"https://doi.org/10.1177/11779322231182054","url":null,"abstract":"<p><p>The increasing commercialization of new gene panels based on next-generation sequencing for clinical research has significantly improved our understanding of breast cancer genetics and has led to the discovery of new mutation variants. The study included 16 unselected Moroccan breast cancer patients tested with multi-gene panel (HEVA screen panel) using Illumina Miseq, followed by Sanger sequencing to validate the most relevant mutation. Mutational analysis revealed the presence of 13 mutations (11 single-nucleotide polymorphisms [SNPs] and 2 indels), and 6 of 11 identified SNPs were predicted as pathogenic. One of the 6 pathogenic mutations was c.7874G>C, a heterozygous SNP in HD-OB domain of BRCA2 gene, which led to the arginine to threonine change at codon 2625 of the protein. This work describes the first case of a patient with breast cancer harboring this pathogenic variant and analyzes its functional impact using molecular docking and molecular dynamics simulation. Further experimental investigations are needed to validate its pathogenicity and to verify its association with breast cancer.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/75/b4/10.1177_11779322231182054.PMC10291397.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10664466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Molecular Cloning and AlphaFold Modeling of Thyrotropin (ag-TSH) From the Amazonian Fish Pirarucu (Arapaima gigas). 亚马逊食人鱼促甲状腺素(ag-TSH)的分子克隆和AlphaFold建模。
IF 5.8 Q1 Mathematics Pub Date : 2023-01-01 DOI: 10.1177/11779322231154148
Renan Passos Freire, Jorge Enrique Hernandez-Gonzalez, Eliana Rosa Lima, Miriam Fussae Suzuki, João Ezequiel de Oliveira, Lucas Simon Torai, Paolo Bartolini, Carlos Roberto Jorge Soares

Arapaima gigas, known as Pirarucu in Brazil, is one of the largest freshwater fish in the world. Some individuals could reach 3 m in length and weight up to 200 kg. Due to extinction risks and its economic value, the species has been a focus for preservation and reproduction studies. Thyrotropin (TSH) is a glycoprotein hormone formed by 2 subunits α and β whose main activity is related to the synthesis of thyroid hormones (THs)-T3 and T4. In this work, we present a combination of bioinformatics tools to identify Arapaima gigas βTSH (ag-βTSH), modeling its molecular structure and express the recombinant heterodimer form in mammalian cells. Using the combination of computational biology, based on genome-related information, in silico molecular cloning and modeling led to confirm results of the ag-βTSH sequence by reverse transcriptase-polymerase chain reaction (RT-PCR) and transient expression in human embryonic kidney (HEK293F) cells. Molecular cloning of ag-βTSH retrieved 146 amino acids with a signal peptide of 21 amino acid residues and 6 disulfide bonds. The sequence has a similarity to 39 fish species, ranging between 43.1% and 81.6%, whose domains are extremely conserved, such as cystine knot motif and N-glycosylation site. The Arapaima gigas thyrotropin (ag-TSH) model, solved by AlphaFold, was used in molecular dynamics simulations with Scleropages formosus receptor, providing similar values of free energy ΔGbind and ΔGPMF in comparison with Homo sapiens model. The recombinant expression in HEK293F cells reached a yield of 25 mg/L, characterized via chromatographic and physical-chemical techniques. This work shows that other Arapaima gigas proteins could be studied in a similar way, using the combination of these techniques, recovering more information from its genome and improving the reproduction and preservation of this prehistoric fish.

巨骨舌鱼,在巴西被称为Pirarucu,是世界上最大的淡水鱼之一。有些个体可以达到3米长,体重可达200公斤。由于其灭绝风险和经济价值,该物种已成为保护和繁殖研究的重点。促甲状腺素(TSH)是一种由α和β 2个亚基组成的糖蛋白激素,其主要活性与甲状腺激素(THs)-T3和T4的合成有关。在这项工作中,我们提出了生物信息学工具的组合来鉴定巨骨舌鱼βTSH (ag-βTSH),建模其分子结构,并在哺乳动物细胞中表达重组异源二聚体形式。利用计算生物学相结合,基于基因组相关信息,通过逆转录聚合酶链反应(RT-PCR)和瞬时表达在人胚胎肾(HEK293F)细胞中的分子克隆和建模,证实了ag-βTSH序列。ag-βTSH的分子克隆获得了146个氨基酸,包含21个氨基酸残基和6个二硫键的信号肽。该序列与39种鱼类的相似度在43.1% ~ 81.6%之间,结构域高度保守,如胱氨酸结基和n -糖基化位点。利用AlphaFold求解的巨骨舌鱼(Arapaima gigas thyrotropin, ag-TSH)模型,对formosus硬化受体进行分子动力学模拟,得到与智人模型相似的自由能ΔGbind和ΔGPMF值。重组蛋白在HEK293F细胞中的表达量达到25 mg/L,通过色谱和理化技术进行了表征。这项工作表明,可以用类似的方式研究其他巨骨舌鱼的蛋白质,使用这些技术的组合,从其基因组中恢复更多信息,并改善这种史前鱼类的繁殖和保存。
{"title":"Molecular Cloning and AlphaFold Modeling of Thyrotropin (ag-TSH) From the Amazonian Fish Pirarucu (<i>Arapaima gigas</i>).","authors":"Renan Passos Freire,&nbsp;Jorge Enrique Hernandez-Gonzalez,&nbsp;Eliana Rosa Lima,&nbsp;Miriam Fussae Suzuki,&nbsp;João Ezequiel de Oliveira,&nbsp;Lucas Simon Torai,&nbsp;Paolo Bartolini,&nbsp;Carlos Roberto Jorge Soares","doi":"10.1177/11779322231154148","DOIUrl":"https://doi.org/10.1177/11779322231154148","url":null,"abstract":"<p><p><i>Arapaima gigas</i>, known as Pirarucu in Brazil, is one of the largest freshwater fish in the world. Some individuals could reach 3 m in length and weight up to 200 kg. Due to extinction risks and its economic value, the species has been a focus for preservation and reproduction studies. Thyrotropin (TSH) is a glycoprotein hormone formed by 2 subunits α and β whose main activity is related to the synthesis of thyroid hormones (THs)-T3 and T4. In this work, we present a combination of bioinformatics tools to identify <i>Arapaima gigas</i> βTSH (ag-βTSH), modeling its molecular structure and express the recombinant heterodimer form in mammalian cells. Using the combination of computational biology, based on genome-related information, in silico molecular cloning and modeling led to confirm results of the ag-βTSH sequence by reverse transcriptase-polymerase chain reaction (RT-PCR) and transient expression in human embryonic kidney (HEK293F) cells. Molecular cloning of ag-βTSH retrieved 146 amino acids with a signal peptide of 21 amino acid residues and 6 disulfide bonds. The sequence has a similarity to 39 fish species, ranging between 43.1% and 81.6%, whose domains are extremely conserved, such as cystine knot motif and N-glycosylation site. The <i>Arapaima gigas</i> thyrotropin (ag-TSH) model, solved by AlphaFold, was used in molecular dynamics simulations with <i>Scleropages formosus</i> receptor, providing similar values of free energy ΔG<sub>bind</sub> and ΔG<sub>PMF</sub> in comparison with <i>Homo sapiens</i> model. The recombinant expression in HEK293F cells reached a yield of 25 mg/L, characterized via chromatographic and physical-chemical techniques. This work shows that other <i>Arapaima gigas</i> proteins could be studied in a similar way, using the combination of these techniques, recovering more information from its genome and improving the reproduction and preservation of this prehistoric fish.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/a0/da/10.1177_11779322231154148.PMC9926385.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10798468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Plastid DNA Barcoding and RtActin cDNA Fragment Isolation of Reutealis Trisperma: A Promising Bioresource for Biodiesel Production. 三棱草质体DNA条形码及rtacn cDNA片段的分离——生物柴油生产的一种有前景的生物资源
IF 5.8 Q1 Mathematics Pub Date : 2023-01-01 DOI: 10.1177/11779322231182768
Nurul Jadid, Nur Laili Alfina Rosidah, Muhammad Rifqi Nur Ramadani, Indah Prasetyowati, Noor Nailis Sa'adah, Aulia Febrianti Widodo, Dwi Oktafitria

Reutealis trisperma belonging to the family Euphorbiaceae is currently used for biodiesel production, and rapid development in plant-based biofuel production has led to its increasing demand. However, massive utilization of bio-industrial plants has led to conservation issues. Moreover, genetic information on R trisperma is still limited, which is crucial for developmental, physiological, and molecular studies. Studying gene expression is essential to explain plant physiological processes. Nonetheless, this technique requires sensitive and precise measurement of messenger RNA (mRNA). In addition, the presence of internal control genes is important to avoid bias. Therefore, collecting and preserving genetic data for R trisperma is indispensable. In this study, we aimed to evaluate the application of plastid loci, rbcL, and matK, to the DNA barcode of R trisperma for use in conservation programs. In addition, we isolated and cloned the RtActin (RtACT) gene fragment for use in gene expression studies. Sequence information was analyzed in silico by comparison with other Euphorbiaceae plants. For actin fragment isolation, reverse-transcription polymerase chain reaction was used. Molecular cloning of RtActin was performed using the pTA2 plasmid before sequencing. We successfully isolated and cloned 592 and 840 bp of RtrbcL and RtmatK fragment genes, respectively. The RtrbcL barcoding marker, rather than the RtmatK plastidial marker, provided discriminative molecular phylogenetic data for R Trisperma. We also isolated 986 bp of RtACT gene fragments. Our phylogenetic analysis demonstrated that R trisperma is closely related to the Vernicia fordii Actin gene (97% identity). Our results suggest that RtrbcL could be further developed and used as a barcoding marker for R trisperma. Moreover, the RtACT gene could be further investigated for use in gene expression studies of plant.

作为大戟科植物的三种植物(Reutealis trisperma)目前被用于生物柴油的生产,植物基生物燃料生产的快速发展导致其需求不断增加。然而,生物工业工厂的大规模利用导致了保护问题。此外,三种植物的遗传信息仍然有限,这对发育、生理和分子研究至关重要。研究基因表达对解释植物生理过程至关重要。尽管如此,这项技术需要对信使RNA (mRNA)进行敏感和精确的测量。此外,内部控制基因的存在对于避免偏见也很重要。因此,收集和保存R三种植物的遗传数据是必不可少的。在这项研究中,我们的目的是评估质体位点rbcL和matK在R三种精子DNA条形码中的应用,以用于保护计划。此外,我们分离并克隆了rtacn (RtACT)基因片段,用于基因表达研究。通过与其他大戟科植物的比较,对序列信息进行了计算机分析。采用逆转录聚合酶链反应分离肌动蛋白片段。测序前利用pTA2质粒对rtacn进行分子克隆。我们成功分离并克隆了592 bp的RtrbcL和840 bp的RtmatK片段基因。RtrbcL条形码标记,而不是RtmatK质体标记,提供了RtrbcL分子系统发育的鉴别数据。我们还分离到了986 bp的RtACT基因片段。系统发育分析表明,R三种植物与fordii Vernicia Actin基因亲缘关系密切(同源性97%)。研究结果表明,RtrbcL可以进一步开发并作为RtrbcL的条形码标记物。此外,RtACT基因可进一步用于植物基因表达研究。
{"title":"Plastid DNA Barcoding and <i>RtActin</i> cDNA Fragment Isolation of <i>Reutealis Trisperma</i>: A Promising Bioresource for Biodiesel Production.","authors":"Nurul Jadid,&nbsp;Nur Laili Alfina Rosidah,&nbsp;Muhammad Rifqi Nur Ramadani,&nbsp;Indah Prasetyowati,&nbsp;Noor Nailis Sa'adah,&nbsp;Aulia Febrianti Widodo,&nbsp;Dwi Oktafitria","doi":"10.1177/11779322231182768","DOIUrl":"https://doi.org/10.1177/11779322231182768","url":null,"abstract":"<p><p><i>Reutealis trisperma</i> belonging to the family <i>Euphorbiaceae</i> is currently used for biodiesel production, and rapid development in plant-based biofuel production has led to its increasing demand. However, massive utilization of bio-industrial plants has led to conservation issues. Moreover, genetic information on <i>R trisperma</i> is still limited, which is crucial for developmental, physiological, and molecular studies. Studying gene expression is essential to explain plant physiological processes. Nonetheless, this technique requires sensitive and precise measurement of messenger RNA (mRNA). In addition, the presence of internal control genes is important to avoid bias. Therefore, collecting and preserving genetic data for <i>R trisperma</i> is indispensable. In this study, we aimed to evaluate the application of plastid loci, <i>rbcL</i>, and <i>matK</i>, to the DNA barcode of <i>R trisperma</i> for use in conservation programs. In addition, we isolated and cloned the <i>RtActin</i> (<i>RtACT</i>) gene fragment for use in gene expression studies. Sequence information was analyzed <i>in silico</i> by comparison with other <i>Euphorbiaceae</i> plants. For actin fragment isolation, reverse-transcription polymerase chain reaction was used. Molecular cloning of <i>RtActin</i> was performed using the pTA2 plasmid before sequencing. We successfully isolated and cloned 592 and 840 bp of <i>RtrbcL</i> and <i>RtmatK</i> fragment genes, respectively. The <i>RtrbcL</i> barcoding marker, rather than the <i>RtmatK</i> plastidial marker, provided discriminative molecular phylogenetic data for <i>R Trisperma</i>. We also isolated 986 bp of <i>RtACT</i> gene fragments. Our phylogenetic analysis demonstrated that <i>R trisperma</i> is closely related to the <i>Vernicia fordii Actin</i> gene (97% identity). Our results suggest that <i>RtrbcL</i> could be further developed and used as a barcoding marker for <i>R trisperma</i>. Moreover, the <i>RtACT</i> gene could be further investigated for use in gene expression studies of plant.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/8a/9c/10.1177_11779322231182768.PMC10286179.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10298531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Integrated Approach of Learning Genetic Networks From Genome-Wide Gene Expression Data Using Gaussian Graphical Model and Monte Carlo Method. 利用高斯图模型和蒙特卡罗方法从全基因组基因表达数据中学习遗传网络的集成方法。
IF 5.8 Q1 Mathematics Pub Date : 2023-01-01 DOI: 10.1177/11779322231152972
Haitao Zhao, Sujay Datta, Zhong-Hui Duan

Global genetic networks provide additional information for the analysis of human diseases, beyond the traditional analysis that focuses on single genes or local networks. The Gaussian graphical model (GGM) is widely applied to learn genetic networks because it defines an undirected graph decoding the conditional dependence between genes. Many algorithms based on the GGM have been proposed for learning genetic network structures. Because the number of gene variables is typically far more than the number of samples collected, and a real genetic network is typically sparse, the graphical lasso implementation of GGM becomes a popular tool for inferring the conditional interdependence among genes. However, graphical lasso, although showing good performance in low dimensional data sets, is computationally expensive and inefficient or even unable to work directly on genome-wide gene expression data sets. In this study, the method of Monte Carlo Gaussian graphical model (MCGGM) was proposed to learn global genetic networks of genes. This method uses a Monte Carlo approach to sample subnetworks from genome-wide gene expression data and graphical lasso to learn the structures of the subnetworks. The learned subnetworks are then integrated to approximate a global genetic network. The proposed method was evaluated with a relatively small real data set of RNA-seq expression levels. The results indicate the proposed method shows a strong ability of decoding the interactions with high conditional dependences among genes. The method was then applied to genome-wide data sets of RNA-seq expression levels. The gene interactions with high interdependence from the estimated global networks show that most of the predicted gene-gene interactions have been reported in the literatures playing important roles in different human cancers. Also, the results validate the ability and reliability of the proposed method to identify high conditional dependences among genes in large-scale data sets.

全球遗传网络为分析人类疾病提供了更多的信息,超出了传统的以单一基因或地方网络为重点的分析。高斯图模型(Gaussian graphical model, GGM)定义了一个解码基因间条件依赖关系的无向图,被广泛应用于遗传网络的学习。许多基于GGM的遗传网络学习算法已经被提出。由于基因变量的数量通常远远超过所收集的样本数量,并且真正的遗传网络通常是稀疏的,因此GGM的图形套索实现成为推断基因之间条件相互依赖的流行工具。然而,尽管图形套索在低维数据集上表现良好,但计算成本高,效率低,甚至无法直接处理全基因组基因表达数据集。本研究提出了蒙特卡罗高斯图形模型(MCGGM)的方法来学习基因的全局遗传网络。该方法使用蒙特卡罗方法从全基因组基因表达数据和图形套索中对子网络进行采样,以了解子网络的结构。然后将学习到的子网络集成到近似的全局遗传网络中。采用相对较小的RNA-seq表达水平真实数据集对所提出的方法进行了评估。结果表明,该方法具有较强的解码基因间高条件依赖性相互作用的能力。然后将该方法应用于RNA-seq表达水平的全基因组数据集。从估计的全球网络中,高度相互依赖的基因相互作用表明,大多数预测的基因-基因相互作用已经在文献中报道,在不同的人类癌症中发挥重要作用。此外,结果验证了所提出的方法在大规模数据集中识别基因之间高条件依赖性的能力和可靠性。
{"title":"An Integrated Approach of Learning Genetic Networks From Genome-Wide Gene Expression Data Using Gaussian Graphical Model and Monte Carlo Method.","authors":"Haitao Zhao,&nbsp;Sujay Datta,&nbsp;Zhong-Hui Duan","doi":"10.1177/11779322231152972","DOIUrl":"https://doi.org/10.1177/11779322231152972","url":null,"abstract":"<p><p>Global genetic networks provide additional information for the analysis of human diseases, beyond the traditional analysis that focuses on single genes or local networks. The Gaussian graphical model (GGM) is widely applied to learn genetic networks because it defines an undirected graph decoding the conditional dependence between genes. Many algorithms based on the GGM have been proposed for learning genetic network structures. Because the number of gene variables is typically far more than the number of samples collected, and a real genetic network is typically sparse, the graphical lasso implementation of GGM becomes a popular tool for inferring the conditional interdependence among genes. However, graphical lasso, although showing good performance in low dimensional data sets, is computationally expensive and inefficient or even unable to work directly on genome-wide gene expression data sets. In this study, the method of Monte Carlo Gaussian graphical model (MCGGM) was proposed to learn global genetic networks of genes. This method uses a Monte Carlo approach to sample subnetworks from genome-wide gene expression data and graphical lasso to learn the structures of the subnetworks. The learned subnetworks are then integrated to approximate a global genetic network. The proposed method was evaluated with a relatively small real data set of RNA-seq expression levels. The results indicate the proposed method shows a strong ability of decoding the interactions with high conditional dependences among genes. The method was then applied to genome-wide data sets of RNA-seq expression levels. The gene interactions with high interdependence from the estimated global networks show that most of the predicted gene-gene interactions have been reported in the literatures playing important roles in different human cancers. Also, the results validate the ability and reliability of the proposed method to identify high conditional dependences among genes in large-scale data sets.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/4e/ca/10.1177_11779322231152972.PMC9972065.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10823900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Identification of Conserved and Novel MicroRNAs with their Targets in Garden Pea (Pisum Sativum L.) Leaves by High-Throughput Sequencing. 豌豆中保守和新型microrna及其靶点的鉴定叶片高通量测序。
IF 5.8 Q1 Mathematics Pub Date : 2023-01-01 DOI: 10.1177/11779322231162777
Qurshid Hasan Khan

MicroRNAs (miRNAs) are single-stranded, endogenous, non-coding RNAs of 20-24 nucleotides that play a significant role in post-transcriptional gene regulation. Various conserved and novel miRNAs have been characterized, especially from the plant species whose genomes were well-characterized; however, information on miRNA in economically important plants such as pea (Pisum sativum L.) is limited. In this study, I have identified conserved and novel miRNA in garden pea plant leaves samples along with their targets by analyzing the next generation sequencing (NGS) data. The raw data obtained from NGS were processed and 1.38 million high-quality non-redundant reads were retained for analysis, this tremendous quantity of reads indicates a large and diverse small RNA population in pea leaves. After analyzing the deep sequencing data, 255 conserved and 11 novel miRNAs were identified in the garden pea leaves sample. Utilizing psRNATarget tool, the miRNA targets of conserved and novel miRNA were predicted. Further, the functional annotation of the miRNA targets were performed using blast2Go software and the target gene products were predicted. The miRNA target gene products along with GO_ID (Gene Ontology Identifier) were categorized into biological processes, cellular components, and molecular functions. The information obtained from this study will provide genomic resources that will help in understanding miRNA-mediated post-transcriptional gene regulation in garden peas.

MicroRNAs (miRNAs)是一种单链、内源性的非编码rna,长度为20-24个核苷酸,在转录后基因调控中发挥重要作用。各种保守的和新颖的mirna已经被鉴定,特别是来自基因组已被充分鉴定的植物物种;然而,对具有重要经济意义的植物如豌豆(Pisum sativum L.)的miRNA信息有限。在这项研究中,我通过分析下一代测序(NGS)数据,在豌豆植物叶片样品及其靶标中鉴定出保守的和新的miRNA。对NGS获得的原始数据进行处理,保留了138万个高质量的非冗余reads用于分析,这一庞大的reads数量表明豌豆叶片中存在大量多样的小RNA群体。通过对深测序数据的分析,在豌豆叶片样品中鉴定出255个保守的mirna和11个新的mirna。利用psrnatarte工具,对保守miRNA和新miRNA的靶点进行预测。此外,使用blast2Go软件对miRNA靶点进行功能注释,并预测靶基因产物。miRNA靶基因产物连同GO_ID(基因本体标识符)被分类为生物过程、细胞组分和分子功能。本研究获得的信息将为了解豌豆mirna介导的转录后基因调控提供基因组资源。
{"title":"Identification of Conserved and Novel MicroRNAs with their Targets in Garden Pea (<i>Pisum Sativum</i> L.) Leaves by High-Throughput Sequencing.","authors":"Qurshid Hasan Khan","doi":"10.1177/11779322231162777","DOIUrl":"https://doi.org/10.1177/11779322231162777","url":null,"abstract":"<p><p>MicroRNAs (miRNAs) are single-stranded, endogenous, non-coding RNAs of 20-24 nucleotides that play a significant role in post-transcriptional gene regulation. Various conserved and novel miRNAs have been characterized, especially from the plant species whose genomes were well-characterized; however, information on miRNA in economically important plants such as pea (<i>Pisum sativum</i> L.) is limited. In this study, I have identified conserved and novel miRNA in garden pea plant leaves samples along with their targets by analyzing the next generation sequencing (NGS) data. The raw data obtained from NGS were processed and 1.38 million high-quality non-redundant reads were retained for analysis, this tremendous quantity of reads indicates a large and diverse small RNA population in pea leaves. After analyzing the deep sequencing data, 255 conserved and 11 novel miRNAs were identified in the garden pea leaves sample. Utilizing psRNATarget tool, the miRNA targets of conserved and novel miRNA were predicted. Further, the functional annotation of the miRNA targets were performed using blast2Go software and the target gene products were predicted. The miRNA target gene products along with GO_ID (Gene Ontology Identifier) were categorized into biological processes, cellular components, and molecular functions. The information obtained from this study will provide genomic resources that will help in understanding miRNA-mediated post-transcriptional gene regulation in garden peas.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/cb/37/10.1177_11779322231162777.PMC10068972.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9263176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Functional Analysis of Hypothetical Proteins of Vibrio parahaemolyticus Reveals the Presence of Virulence Factors and Growth-Related Enzymes With Therapeutic Potential. 副溶血性弧菌假想蛋白的功能分析揭示了具有治疗潜力的毒力因子和生长相关酶的存在。
IF 5.8 Q1 Mathematics Pub Date : 2022-11-09 eCollection Date: 2022-01-01 DOI: 10.1177/11779322221136002
Sazzad Shahrear, Maliha Afroj Zinnia, Md Rabi Us Sany, Abul Bashar Mir Md Khademul Islam

Vibrio parahaemolyticus, an aquatic pathogen, is a major concern in the shrimp aquaculture industry. Several strains of this pathogen are responsible for causing acute hepatopancreatic necrosis disease as well as other serious illness, both of which result in severe economic losses. The genome sequence of two pathogenic strains of V. parahaemolyticus, MSR16 and MSR17, isolated from Bangladesh, have been reported to gain a better understanding of their diversity and virulence. However, the prevalence of hypothetical proteins (HPs) makes it challenging to obtain a comprehensive understanding of the pathogenesis of V. parahaemolyticus. The aim of the present study is to provide a functional annotation of the HPs to elucidate their role in pathogenesis employing several in silico tools. The exploration of protein domains and families, similarity searches against proteins with known function, gene ontology enrichment, along with protein-protein interaction analysis of the HPs led to the functional assignment with a high level of confidence for 656 proteins out of a pool of 2631 proteins. The in silico approach used in this study was important for accurately assigning function to HPs and inferring interactions with proteins with previously described functions. The HPs with function predicted were categorized into various groups such as enzymes involved in small-compound biosynthesis pathway, iron binding proteins, antibiotics resistance proteins, and other proteins. Several proteins with potential druggability were identified among them. In addition, the HPs were investigated in search of virulent factors, which led to the identification of proteins that have the potential to be exploited as vaccine candidate. The findings of the study will be effective in gaining a better understanding of the molecular mechanisms of bacterial pathogenesis. They may also provide an insight into the process of evaluating promising targets for the development of drugs and vaccines against V. parahaemolyticus.

副溶血性弧菌是一种水生病原体,是对虾养殖业关注的主要问题。该病原体的一些菌株可引起急性肝胰腺坏死病和其他严重疾病,这两种疾病都会造成严重的经济损失。据报道,从孟加拉国分离的两种副溶血性弧菌致病性菌株MSR16和MSR17的基因组序列有助于更好地了解它们的多样性和毒力。然而,假设蛋白(HPs)的流行使得对副溶血性弧菌发病机制的全面了解具有挑战性。本研究的目的是提供HPs的功能注释,以阐明它们在几种计算机工具中的发病机制中的作用。对蛋白质结构域和家族的探索,对已知功能的蛋白质的相似性搜索,基因本体的富集,以及蛋白质-蛋白质相互作用的hp分析,使2631个蛋白质池中的656个蛋白质的功能分配具有高水平的置信度。本研究中使用的计算机方法对于准确分配hp的功能和推断与先前描述功能的蛋白质的相互作用非常重要。预测功能的hp可分为参与小化合物生物合成途径的酶、铁结合蛋白、抗生素耐药蛋白和其他蛋白等。其中鉴定出几种具有潜在药物作用的蛋白。此外,还对hp进行了研究,以寻找毒力因子,从而鉴定出有可能被开发为候选疫苗的蛋白质。该研究结果将有助于更好地了解细菌致病的分子机制。它们还可能为开发抗副溶血性弧菌的药物和疫苗评估有希望的靶点的过程提供见解。
{"title":"Functional Analysis of Hypothetical Proteins of <i>Vibrio parahaemolyticus</i> Reveals the Presence of Virulence Factors and Growth-Related Enzymes With Therapeutic Potential.","authors":"Sazzad Shahrear,&nbsp;Maliha Afroj Zinnia,&nbsp;Md Rabi Us Sany,&nbsp;Abul Bashar Mir Md Khademul Islam","doi":"10.1177/11779322221136002","DOIUrl":"https://doi.org/10.1177/11779322221136002","url":null,"abstract":"<p><p><i>Vibrio parahaemolyticus</i>, an aquatic pathogen, is a major concern in the shrimp aquaculture industry. Several strains of this pathogen are responsible for causing acute hepatopancreatic necrosis disease as well as other serious illness, both of which result in severe economic losses. The genome sequence of two pathogenic strains of <i>V. parahaemolyticus</i>, MSR16 and MSR17, isolated from Bangladesh, have been reported to gain a better understanding of their diversity and virulence. However, the prevalence of hypothetical proteins (HPs) makes it challenging to obtain a comprehensive understanding of the pathogenesis of <i>V. parahaemolyticus</i>. The aim of the present study is to provide a functional annotation of the HPs to elucidate their role in pathogenesis employing several in silico tools. The exploration of protein domains and families, similarity searches against proteins with known function, gene ontology enrichment, along with protein-protein interaction analysis of the HPs led to the functional assignment with a high level of confidence for 656 proteins out of a pool of 2631 proteins. The in silico approach used in this study was important for accurately assigning function to HPs and inferring interactions with proteins with previously described functions. The HPs with function predicted were categorized into various groups such as enzymes involved in small-compound biosynthesis pathway, iron binding proteins, antibiotics resistance proteins, and other proteins. Several proteins with potential druggability were identified among them. In addition, the HPs were investigated in search of virulent factors, which led to the identification of proteins that have the potential to be exploited as vaccine candidate. The findings of the study will be effective in gaining a better understanding of the molecular mechanisms of bacterial pathogenesis. They may also provide an insight into the process of evaluating promising targets for the development of drugs and vaccines against <i>V. parahaemolyticus</i>.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2022-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/d5/47/10.1177_11779322221136002.PMC9661560.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40468611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Bioinformatics and Biology Insights
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1