首页 > 最新文献

Computational Biology and Chemistry最新文献

英文 中文
Targeting cyclin-dependent kinase 2 CDK2: Insights from molecular docking and dynamics simulation – A systematic computational approach to discover novel cancer therapeutics 靶向细胞周期蛋白依赖性激酶 2 CDK2:分子对接和动力学模拟的启示--发现新型癌症疗法的系统计算方法。
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-06-25 DOI: 10.1016/j.compbiolchem.2024.108134
Bharath Kumar Chagaleti , Shantha Kumar B. , Anjana G.V. , Rajakrishnan Rajagopal , Ahmed Alfarhan , Jesu Arockiaraj , Kathiravan Muthu Kumaradoss , S. Karthick Raja Namasivayam

Global public health is confronted with significant challenges due to the prevalence of cancer and the emergence of treatment resistance. This work focuses on the identification of cyclin-dependent kinase 2 (CDK2) through a systematic computational approach to discover novel cancer therapeutics. A ligand-based pharmacophore model was initially developed using a training set of seven potent CDK2 inhibitors. The obtained most robust model was characterized by three features: one donor (|Don|) and two acceptors (|Acc|). Screening this model against the ZINC database resulted in identifying 108 hits, which underwent further molecular docking studies. The docking results indicated binding affinity, with energy values ranging from −6.59 kcal mol⁻¹ to −7.40 kcal mol⁻¹ compared to the standard Roscovitine. The top 10 compounds (Z1-Z10) selected from the docking data were further screened for ADMET profiling, ensuring their compliance with pharmacokinetic and toxicological criteria. The top 3 compounds (Z1-Z3) chosen from the docking were subjected to Density Functional Theory (DFT) studies. They revealed significant variations in electronic properties, providing insights into the reactivity, stability, and polarity of these compounds. Molecular dynamics simulations confirmed the stability of the ligand-protein complexes, with acceptable RMSD and RMSF values. Specifically, compound Z1 demonstrated stability, around 2.4 Å, and maintained throughout the 100 ns simulation period with minimal conformational changes, stable RMSD, and consistent protein-ligand interactions.

由于癌症的流行和抗药性的出现,全球公共卫生面临着重大挑战。这项工作的重点是通过系统的计算方法识别细胞周期蛋白依赖性激酶2(CDK2),从而发现新型癌症疗法。最初,利用七种强效 CDK2 抑制剂的训练集开发了基于配体的药效模型。获得的最稳健模型有三个特征:一个供体(|Don|)和两个受体(|Acc|)。根据 ZINC 数据库对该模型进行筛选后,确定了 108 个命中物,并对其进行了进一步的分子对接研究。对接结果表明,与标准的 Roscovitine 相比,其结合亲和力的能量值从 -6.59 kcal mol-¹ 到 -7.40 kcal mol-¹不等。从对接数据中选出的前 10 个化合物(Z1-Z10)进一步进行了 ADMET 分析筛选,确保它们符合药代动力学和毒理学标准。对从对接中选出的前 3 个化合物(Z1-Z3)进行了密度泛函理论(DFT)研究。这些研究揭示了电子特性的显著变化,为深入了解这些化合物的反应性、稳定性和极性提供了依据。分子动力学模拟证实了配体-蛋白质复合物的稳定性,其 RMSD 和 RMSF 值均可接受。具体来说,化合物 Z1 表现出了约 2.4 Å 的稳定性,并在 100 ns 模拟期间保持了最小的构象变化、稳定的 RMSD 和一致的蛋白质-配体相互作用。
{"title":"Targeting cyclin-dependent kinase 2 CDK2: Insights from molecular docking and dynamics simulation – A systematic computational approach to discover novel cancer therapeutics","authors":"Bharath Kumar Chagaleti ,&nbsp;Shantha Kumar B. ,&nbsp;Anjana G.V. ,&nbsp;Rajakrishnan Rajagopal ,&nbsp;Ahmed Alfarhan ,&nbsp;Jesu Arockiaraj ,&nbsp;Kathiravan Muthu Kumaradoss ,&nbsp;S. Karthick Raja Namasivayam","doi":"10.1016/j.compbiolchem.2024.108134","DOIUrl":"10.1016/j.compbiolchem.2024.108134","url":null,"abstract":"<div><p>Global public health is confronted with significant challenges due to the prevalence of cancer and the emergence of treatment resistance. This work focuses on the identification of cyclin-dependent kinase 2 (CDK2) through a systematic computational approach to discover novel cancer therapeutics. A ligand-based pharmacophore model was initially developed using a training set of seven potent CDK2 inhibitors. The obtained most robust model was characterized by three features: one donor (|Don|) and two acceptors (|Acc|). Screening this model against the ZINC database resulted in identifying 108 hits, which underwent further molecular docking studies. The docking results indicated binding affinity, with energy values ranging from −6.59 kcal mol⁻¹ to −7.40 kcal mol⁻¹ compared to the standard Roscovitine. The top 10 compounds (Z1-Z10) selected from the docking data were further screened for ADMET profiling, ensuring their compliance with pharmacokinetic and toxicological criteria. The top 3 compounds (Z1-Z3) chosen from the docking were subjected to Density Functional Theory (DFT) studies. They revealed significant variations in electronic properties, providing insights into the reactivity, stability, and polarity of these compounds. Molecular dynamics simulations confirmed the stability of the ligand-protein complexes, with acceptable RMSD and RMSF values. Specifically, compound Z1 demonstrated stability, around 2.4 Å, and maintained throughout the 100 ns simulation period with minimal conformational changes, stable RMSD, and consistent protein-ligand interactions.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141536174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-throughput virtual screening of Streptomyces spp. metabolites as antiviral inhibitors against the Nipah virus matrix protein 高通量虚拟筛选链霉菌代谢物作为尼帕病毒基质蛋白的抗病毒抑制剂。
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-06-25 DOI: 10.1016/j.compbiolchem.2024.108133
Mark Andrian B. Macalalad , Nyzar Mabeth O. Odchimar , Fredmoore L. Orosco

Nipah virus (NiV) remains a significant global concern due to its impact on both the agricultural industry and human health, resulting in substantial economic and health consequences. Currently, there is no cure or commercially available vaccine for the virus. Therefore, it is crucial to prioritize the discovery of new and effective treatment options to prevent its continued spread. Streptomyces spp. are rich sources of metabolites known for their bioactivity against certain diseases; however, their potential as antiviral drugs against the Nipah virus remain unexplored. In this study, 6524 Streptomyces spp. metabolites were screened through in silico methods for their inhibitory effects against the Nipah virus matrix (NiV-M) protein, which assists in virion assembly of Nipah virus. Different computer-aided tools were utilized to carry out the virtual screening process: ADMET profiling revealed 913 compounds with excellent safety and efficacy profiles, molecular docking predicted the binding poses and associated docking scores of the ligands in their respective targets, MD simulations confirmed the binding stability of the top ten highest-scoring ligands in a 100 ns all-atom simulation, PCA elucidated simulation convergence, and MMPB(GB)SA calculations estimated the binding energies of the final candidate compounds and determined the key residues crucial for complex formation. Using in silico methods, we identified six metabolites targeting the main substrate-binding site and five targeting the dimerization site that exhibited excellent stability and strong binding affinity. We recommend testing these compounds in the next stages of drug development to confirm their effectiveness as therapeutic agents against Nipah virus.

尼帕病毒(Nipah virus,NiV)对农业和人类健康都有影响,造成了严重的经济和健康后果,因此仍然是全球关注的一个重大问题。目前,该病毒尚无治愈方法或商业化疫苗。因此,必须优先发现新的有效治疗方案,以防止其继续传播。链霉菌属是代谢产物的丰富来源,它们对某些疾病具有生物活性;然而,它们作为抗尼帕病毒药物的潜力仍有待开发。在这项研究中,通过硅学方法筛选了 6524 种链霉菌代谢物,以确定它们对尼帕病毒基质蛋白(NiV-M)的抑制作用。在虚拟筛选过程中使用了不同的计算机辅助工具:ADMET分析揭示了913种安全性和有效性极佳的化合物;分子对接预测了配体在各自靶标中的结合位置和相关对接得分;MD模拟证实了在100 ns全原子模拟中得分最高的前十种配体的结合稳定性;PCA阐明了模拟的收敛性;MMPB(GB)SA计算估算了最终候选化合物的结合能,并确定了形成复合物的关键残基。通过使用硅学方法,我们确定了 6 个靶向主要底物结合位点的代谢物和 5 个靶向二聚化位点的代谢物,它们都表现出极佳的稳定性和很强的结合亲和力。我们建议在下一阶段的药物开发中测试这些化合物,以确认它们作为尼帕病毒治疗剂的有效性。
{"title":"High-throughput virtual screening of Streptomyces spp. metabolites as antiviral inhibitors against the Nipah virus matrix protein","authors":"Mark Andrian B. Macalalad ,&nbsp;Nyzar Mabeth O. Odchimar ,&nbsp;Fredmoore L. Orosco","doi":"10.1016/j.compbiolchem.2024.108133","DOIUrl":"10.1016/j.compbiolchem.2024.108133","url":null,"abstract":"<div><p>Nipah virus (NiV) remains a significant global concern due to its impact on both the agricultural industry and human health, resulting in substantial economic and health consequences. Currently, there is no cure or commercially available vaccine for the virus. Therefore, it is crucial to prioritize the discovery of new and effective treatment options to prevent its continued spread. <em>Streptomyces</em> spp. are rich sources of metabolites known for their bioactivity against certain diseases; however, their potential as antiviral drugs against the Nipah virus remain unexplored. In this study, 6524 <em>Streptomyces</em> spp. metabolites were screened through in silico methods for their inhibitory effects against the Nipah virus matrix (NiV-M) protein, which assists in virion assembly of Nipah virus. Different computer-aided tools were utilized to carry out the virtual screening process: ADMET profiling revealed 913 compounds with excellent safety and efficacy profiles, molecular docking predicted the binding poses and associated docking scores of the ligands in their respective targets, MD simulations confirmed the binding stability of the top ten highest-scoring ligands in a 100 ns all-atom simulation, PCA elucidated simulation convergence, and MMPB(GB)SA calculations estimated the binding energies of the final candidate compounds and determined the key residues crucial for complex formation. Using <em>in silico</em> methods, we identified six metabolites targeting the main substrate-binding site and five targeting the dimerization site that exhibited excellent stability and strong binding affinity. We recommend testing these compounds in the next stages of drug development to confirm their effectiveness as therapeutic agents against Nipah virus.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141539090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the binding dynamics of covalent inhibitors within active site of PLpro in SARS-CoV-2 探索共价抑制剂在 SARS-CoV-2 PLpro 活性位点的结合动力学
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-06-23 DOI: 10.1016/j.compbiolchem.2024.108132
Deepesh Kumar Patel, Harish Kumar, M. Elizabeth Sobhia

In the global fight against the COVID-19 pandemic caused by the highly transmissible SARS-CoV-2 virus, the search for potent medications is paramount. With a focused investigation on the SARS-CoV-2 papain-like protease (PLpro) as a promising therapeutic target due to its pivotal role in viral replication and immune modulation, the catalytic triad of PLpro comprising Cys111, His272, and Asp286, highlights Cys111 as an intriguing nucleophilic center for potential covalent bonds with ligands. The detailed analysis of the binding site unveils crucial interactions with both hydrophobic and polar residues, demonstrating the structural insights of the cavity and deepening our understanding of its molecular landscape. The sequence of PLpro among variants of concern (Alpha, Beta, Gamma, Delta and Omicron) and the recent variant of interest, JN.1, remains conserved with no mutations at the active site. Moreover, a thorough exploration of apo, non-covalently bound, and covalently bound PLpro conformations exposes significant conformational changes in loop regions, offering invaluable insights into the intricate dynamics of ligand-protein complex formation. Employing strategic in silico medication repurposing, this study swiftly identifies potential molecules for target inhibition. Within the domain of covalent docking studies and molecular dynamics, using reported inhibitors and clinically tested molecules elucidate the formation of stable covalent bonds with the cysteine residue, laying a robust foundation for potential therapeutic applications. These details not only deepen our comprehension of PLpro inhibition but also play a pivotal role in shaping the dynamic landscape of COVID-19 treatment strategies.

在全球抗击由传染性极强的 SARS-CoV-2 病毒引起的 COVID-19 大流行的斗争中,寻找有效的药物至关重要。由于 SARS-CoV-2 木瓜蛋白酶(PLpro)在病毒复制和免疫调节中起着关键作用,因此它是一个很有希望的治疗靶点。PLpro 的催化三元组包括 Cys111、His272 和 Asp286,其中 Cys111 是一个令人感兴趣的亲核中心,可与配体形成潜在的共价键。对结合位点的详细分析揭示了与疏水和极性残基的关键相互作用,展示了空腔的结构洞察力,加深了我们对其分子景观的理解。PLpro 的序列在相关变体(Alpha、Beta、Gamma、Delta 和 Omicron)和最近的相关变体 JN.1 中保持不变,活性位点没有发生突变。此外,对apo、非共价结合和共价结合PLpro构象的深入研究揭示了环区的显著构象变化,为了解配体-蛋白复合物形成的复杂动态提供了宝贵的见解。这项研究利用战略性的硅学药物再利用技术,迅速确定了潜在的目标抑制分子。在共价对接研究和分子动力学领域,利用已报道的抑制剂和临床测试分子阐明了与半胱氨酸残基形成稳定共价键的过程,为潜在的治疗应用奠定了坚实的基础。这些细节不仅加深了我们对PLpro抑制作用的理解,而且在塑造COVID-19治疗策略的动态景观方面发挥了关键作用。
{"title":"Exploring the binding dynamics of covalent inhibitors within active site of PLpro in SARS-CoV-2","authors":"Deepesh Kumar Patel,&nbsp;Harish Kumar,&nbsp;M. Elizabeth Sobhia","doi":"10.1016/j.compbiolchem.2024.108132","DOIUrl":"https://doi.org/10.1016/j.compbiolchem.2024.108132","url":null,"abstract":"<div><p>In the global fight against the COVID-19 pandemic caused by the highly transmissible SARS-CoV-2 virus, the search for potent medications is paramount. With a focused investigation on the SARS-CoV-2 papain-like protease (PL<sup>pro</sup>) as a promising therapeutic target due to its pivotal role in viral replication and immune modulation, the catalytic triad of PL<sup>pro</sup> comprising Cys111, His272, and Asp286, highlights Cys111 as an intriguing nucleophilic center for potential covalent bonds with ligands. The detailed analysis of the binding site unveils crucial interactions with both hydrophobic and polar residues, demonstrating the structural insights of the cavity and deepening our understanding of its molecular landscape. The sequence of PL<sup>pro</sup> among variants of concern (Alpha, Beta, Gamma, Delta and Omicron) and the recent variant of interest, JN.1, remains conserved with no mutations at the active site. Moreover, a thorough exploration of apo, non-covalently bound, and covalently bound PL<sup>pro</sup> conformations exposes significant conformational changes in loop regions, offering invaluable insights into the intricate dynamics of ligand-protein complex formation. Employing strategic in silico medication repurposing, this study swiftly identifies potential molecules for target inhibition. Within the domain of covalent docking studies and molecular dynamics, using reported inhibitors and clinically tested molecules elucidate the formation of stable covalent bonds with the cysteine residue, laying a robust foundation for potential therapeutic applications. These details not only deepen our comprehension of PL<sup>pro</sup> inhibition but also play a pivotal role in shaping the dynamic landscape of COVID-19 treatment strategies.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141485428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Single-cell transcriptome sequencing revealed the metabolic changes and microenvironment changes of cardiomyocytes induced by diabetes 单细胞转录组测序揭示了糖尿病诱导的心肌细胞代谢变化和微环境变化。
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-06-21 DOI: 10.1016/j.compbiolchem.2024.108136
Weiyu Zhou, Haiqiao Yu, Shuang Yan

Purpose

Diabetes is a chronic metabolic disorder characterized by elevated blood glucose levels. This study aimed to analyze the changes underlying heterogeneities and communication properties of CMs in diabetes mellitus (DM).

Methods

GSE213337 dataset was retrieved from NCBI Gene Expression Omnibus, containing the single-cell RNA sequencing data of hearts from the control and streptozotocin-induced diabetic mice. GSEA and GSVA were used to explore the function enrichment of DEGs in CM. Cell communication analysis was carried out to study the altered signals and significant ligand-receptor interactions.

Results

Seventeen cell types were identified between DM and the controls. The increasing ratio of CM suggested the occurrence of diabetes induces potential pathological changes of CM proliferation. A total of 1144 DEGs were identified in CM. GSEA and GSVA analysis indicated the enhancing lipid metabolism involving in DM. The results of cell communication analysis suggested that high glucose activated the ability of CM receiving fibroblast and LEC, while inhibited the capacity of receiving ECC and pericyte. Furthermore, GAS and ANGPTL were significantly decreased under DM, which was consistent with the results of GSEA and GSVA. Finally, the ligand-receptor interactions such as vegfc-vegfr2, angptl1 were changes in CM.

Conclusions

The CM showed the significant heterogeneities in DM, which played an important role in myocardial fibrosis induce by hyperglycemia.

目的:糖尿病是一种以血糖水平升高为特征的慢性代谢性疾病。本研究旨在分析糖尿病(DM)中 CMs 的异质性和通讯特性的基本变化:GSE213337 数据集来自 NCBI 基因表达总库,包含对照组和链脲佐菌素诱导的糖尿病小鼠心脏的单细胞 RNA 测序数据。利用GSEA和GSVA探讨了DEGs在CM中的功能富集。通过细胞通讯分析研究了改变的信号和重要的配体-受体相互作用:结果:在 DM 和对照组中发现了 17 种细胞类型。结果:DM和对照组之间共发现了17种细胞类型,CM比例的增加表明糖尿病的发生诱发了CM增殖的潜在病理变化。在 CM 中共鉴定出 1144 个 DEGs。GSEA和GSVA分析表明,DM患者的脂质代谢增强。细胞通讯分析结果表明,高糖激活了CM接受成纤维细胞和LEC的能力,而抑制了接受ECC和周细胞的能力。此外,在DM条件下,GAS和ANGPTL明显下降,这与GSEA和GSVA的结果一致。最后,CM中的配体-受体相互作用,如vegfc-vegfr2、angptl1发生了变化:结论:CM在DM中表现出明显的异质性,在高血糖诱导的心肌纤维化中起着重要作用。
{"title":"Single-cell transcriptome sequencing revealed the metabolic changes and microenvironment changes of cardiomyocytes induced by diabetes","authors":"Weiyu Zhou,&nbsp;Haiqiao Yu,&nbsp;Shuang Yan","doi":"10.1016/j.compbiolchem.2024.108136","DOIUrl":"10.1016/j.compbiolchem.2024.108136","url":null,"abstract":"<div><h3>Purpose</h3><p>Diabetes is a chronic metabolic disorder characterized by elevated blood glucose levels. This study aimed to analyze the changes underlying heterogeneities and communication properties of CMs in diabetes mellitus (DM).</p></div><div><h3>Methods</h3><p>GSE213337 dataset was retrieved from NCBI Gene Expression Omnibus, containing the single-cell RNA sequencing data of hearts from the control and streptozotocin-induced diabetic mice. GSEA and GSVA were used to explore the function enrichment of DEGs in CM. Cell communication analysis was carried out to study the altered signals and significant ligand-receptor interactions.</p></div><div><h3>Results</h3><p>Seventeen cell types were identified between DM and the controls. The increasing ratio of CM suggested the occurrence of diabetes induces potential pathological changes of CM proliferation. A total of 1144 DEGs were identified in CM. GSEA and GSVA analysis indicated the enhancing lipid metabolism involving in DM. The results of cell communication analysis suggested that high glucose activated the ability of CM receiving fibroblast and LEC, while inhibited the capacity of receiving ECC and pericyte. Furthermore, GAS and ANGPTL were significantly decreased under DM, which was consistent with the results of GSEA and GSVA. Finally, the ligand-receptor interactions such as vegfc-vegfr2, angptl1 were changes in CM.</p></div><div><h3>Conclusions</h3><p>The CM showed the significant heterogeneities in DM, which played an important role in myocardial fibrosis induce by hyperglycemia.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141461246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A top-down approach for studying the in-silico effect of the novel phytocompound tribulusamide B on the inhibition of Nipah virus transmission through targeting fusion glycoprotein and matrix protein 自上而下研究新型植物化合物蒺藜酰胺 B通过靶向融合糖蛋白和基质蛋白抑制尼帕病毒传播的作用。
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-06-21 DOI: 10.1016/j.compbiolchem.2024.108135
Deblina Rababi, Anish Nag

The proteins of Nipah virus ascribe to its lifecycle and are crucial to infections caused by the virus. In the absence of approved therapeutics, these proteins can be considered as drug targets. This study examined the potential of fifty-three (53) natural compounds to inhibit Nipah virus fusion glycoprotein (NiV F) and matrix protein (NiV M) in silico. The molecular docking experiment, supported by the principal component analysis (PCA), showed that out of all the phytochemicals considered, Tribulusamide B had the highest inhibitory potential against the target proteins NiV F and NiV M (-9.21 and −8.66 kcal mol−1, respectively), when compared to the control drug, Ribavirin (-7.01 and −6.52 kcal mol−1, respectively). Furthermore, it was found that Tribulusamide B pharmacophores, namely, hydrogen donors, acceptors, aromatic and hydrophobic groups, contributed towards the effective residual interactions with the target proteins. The molecular dynamic simulation further validated the results of the docking studies and concluded that Tribulusamide B formed a stable complex with the target proteins. The data obtained from MM-PBSA study further explained that the phytochemical could strongly bind with NiV F (-31.26 kJ mol−1) and NiV M (-40.26 kJ mol−1) proteins in comparison with the control drug Ribavirin (-13.12 and −13.94 kJ mol−1, respectively). Finally, the results indicated that Tribulusamide B, a common inhibitor effective against multiple proteins, can be considered a potential therapeutic entity in treating the Nipah virus infection.

尼帕病毒的蛋白质与病毒的生命周期有关,对病毒引起的感染至关重要。在没有获批治疗药物的情况下,这些蛋白质可被视为药物靶点。本研究对 53 种天然化合物抑制尼帕病毒融合糖蛋白(NiV F)和基质蛋白(NiV M)的潜力进行了硅学研究。主成分分析(PCA)支持的分子对接实验表明,与对照药物利巴韦林(分别为-7.01和-6.52 kcal mol-1)相比,在所有考虑的植物化学物质中,刺蒺藜酰胺B对目标蛋白NiV F和NiV M的抑制潜力最高(分别为-9.21和-8.66 kcal mol-1)。此外,研究还发现,刺蒺藜酰胺 B 的药效团,即氢供体、受体、芳香基团和疏水基团,有助于与目标蛋白质进行有效的剩余相互作用。分子动力学模拟进一步验证了对接研究的结果,认为刺蒺藜酰胺 B 与目标蛋白质形成了稳定的复合物。MM-PBSA 研究获得的数据进一步说明,与对照药物利巴韦林(分别为 -13.12 和 -13.94 kJ mol-1)相比,刺蒺藜酰胺 B 能与 NiV F 蛋白(-31.26 kJ mol-1)和 NiV M 蛋白(-40.26 kJ mol-1)强结合。最后,研究结果表明,刺蒺藜酰胺 B 是一种常见的抑制剂,能有效抑制多种蛋白质,可被视为治疗尼帕病毒感染的一种潜在疗法。
{"title":"A top-down approach for studying the in-silico effect of the novel phytocompound tribulusamide B on the inhibition of Nipah virus transmission through targeting fusion glycoprotein and matrix protein","authors":"Deblina Rababi,&nbsp;Anish Nag","doi":"10.1016/j.compbiolchem.2024.108135","DOIUrl":"10.1016/j.compbiolchem.2024.108135","url":null,"abstract":"<div><p>The proteins of Nipah virus ascribe to its lifecycle and are crucial to infections caused by the virus. In the absence of approved therapeutics, these proteins can be considered as drug targets. This study examined the potential of fifty-three (53) natural compounds to inhibit Nipah virus fusion glycoprotein (NiV F) and matrix protein (NiV M) <em>in silico</em>. The molecular docking experiment, supported by the principal component analysis (PCA), showed that out of all the phytochemicals considered, Tribulusamide B had the highest inhibitory potential against the target proteins NiV F and NiV M (-9.21 and −8.66 kcal mol<sup>−1</sup>, respectively), when compared to the control drug, Ribavirin (-7.01 and −6.52 kcal mol<sup>−1</sup>, respectively). Furthermore, it was found that Tribulusamide B pharmacophores, namely, hydrogen donors, acceptors, aromatic and hydrophobic groups, contributed towards the effective residual interactions with the target proteins. The molecular dynamic simulation further validated the results of the docking studies and concluded that Tribulusamide B formed a stable complex with the target proteins. The data obtained from MM-PBSA study further explained that the phytochemical could strongly bind with NiV F (-31.26 kJ mol<sup>−1</sup>) and NiV M (-40.26 kJ mol<sup>−1</sup>) proteins in comparison with the control drug Ribavirin (-13.12 and −13.94 kJ mol<sup>−1</sup>, respectively). Finally, the results indicated that Tribulusamide B, a common inhibitor effective against multiple proteins, can be considered a potential therapeutic entity in treating the Nipah virus infection.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141473640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SB-Net: Synergizing CNN and LSTM networks for uncovering retrosynthetic pathways in organic synthesis SB-Net:协同 CNN 和 LSTM 网络发现有机合成中的逆合成途径
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-06-15 DOI: 10.1016/j.compbiolchem.2024.108130
Bilal Ahmad Mir , Hilal Tayara , Kil To Chong

Retrosynthesis is vital in synthesizing target products, guiding reaction pathway design crucial for drug and material discovery. Current models often neglect multi-scale feature extraction, limiting efficacy in leveraging molecular descriptors. Our proposed SB-Net model, a deep-learning architecture tailored for retrosynthesis prediction, addresses this gap. SB-Net combines CNN and Bi-LSTM architectures, excelling in capturing multi-scale molecular features. It integrates parallel branches for processing one-hot encoded descriptors and ECFP, merging through dense layers. Experimental results demonstrate SB-Net’s superiority, achieving 73.6 % top-1 and 94.6 % top-10 accuracy on USPTO-50k data. Versatility is validated on MetaNetX, with rates of 52.8 % top-1, 74.3 % top-3, 79.8 % top-5, and 83.5 % top-10. SB-Net’s success in bioretrosynthesis prediction tasks indicates its efficacy. This research advances computational chemistry, offering a robust deep-learning model for retrosynthesis prediction. With implications for drug discovery and synthesis planning, SB-Net promises innovative and efficient pathways.

逆合成对于合成目标产物至关重要,它能指导对药物和材料发现至关重要的反应途径设计。目前的模型往往忽视多尺度特征提取,限制了利用分子描述符的功效。我们提出的 SB-Net 模型是专为逆合成预测定制的深度学习架构,它弥补了这一不足。SB-Net 结合了 CNN 和 Bi-LSTM 架构,在捕捉多尺度分子特征方面表现出色。它集成了处理单次编码描述符和 ECFP 的并行分支,并通过密集层进行合并。实验结果证明了 SB-Net 的优越性,它在 USPTO-50k 数据上的准确率达到了 73.6% top-1 和 94.6% top-10。在 MetaNetX 上,SB-Net 的多功能性得到了验证,top-1、top-3、top-5 和 top-10 的准确率分别为 52.8%、74.3%、79.8% 和 83.5%。SB-Net 在生物合成预测任务中的成功表明了它的功效。这项研究推动了计算化学的发展,为逆合成预测提供了一个强大的深度学习模型。SB-Net 对药物发现和合成规划具有重要意义,有望成为创新和高效的途径。
{"title":"SB-Net: Synergizing CNN and LSTM networks for uncovering retrosynthetic pathways in organic synthesis","authors":"Bilal Ahmad Mir ,&nbsp;Hilal Tayara ,&nbsp;Kil To Chong","doi":"10.1016/j.compbiolchem.2024.108130","DOIUrl":"10.1016/j.compbiolchem.2024.108130","url":null,"abstract":"<div><p>Retrosynthesis is vital in synthesizing target products, guiding reaction pathway design crucial for drug and material discovery. Current models often neglect multi-scale feature extraction, limiting efficacy in leveraging molecular descriptors. Our proposed SB-Net model, a deep-learning architecture tailored for retrosynthesis prediction, addresses this gap. SB-Net combines CNN and Bi-LSTM architectures, excelling in capturing multi-scale molecular features. It integrates parallel branches for processing one-hot encoded descriptors and ECFP, merging through dense layers. Experimental results demonstrate SB-Net’s superiority, achieving 73.6 % top-1 and 94.6 % top-10 accuracy on USPTO-50k data. Versatility is validated on MetaNetX, with rates of 52.8 % top-1, 74.3 % top-3, 79.8 % top-5, and 83.5 % top-10. SB-Net’s success in bioretrosynthesis prediction tasks indicates its efficacy. This research advances computational chemistry, offering a robust deep-learning model for retrosynthesis prediction. With implications for drug discovery and synthesis planning, SB-Net promises innovative and efficient pathways.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141405993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of DEMETER-like DNA demethylase gene family in citrus and their role in drought stress-adaptive responses 柑橘中 DEMETER 样 DNA 去甲基化酶基因家族的鉴定及其在干旱胁迫适应性反应中的作用
IF 2.6 4区 生物学 Q1 Mathematics Pub Date : 2024-06-12 DOI: 10.1016/j.compbiolchem.2024.108128
Gláucia C.B. Silva, Luciana R. Camillo, Dalma B. Santos, Maurício S. Amorim, Luana P. Gonçalves, Ana C.O. Barbosa, Dílson S. Rocha Junior, Grazielle M. Alcântara, Marcio G.C. Costa

DEMETER-Like DNA demethylases (DMLs) are epigenetic regulators of many developmental and biological processes in plants. No comprehensive information about the DML gene family in citrus is available to date. Here, a total of three DML genes in the genomes of Citrus sinensis (named CsDML1–3) and C. clementina (named CcDML1–3) were identified and analyzed. They encode hydrophilic and relatively large proteins, with prediction of nuclear localization, containing the conserved domains and motifs typical of plant DMLs. Protein interaction network analysis suggested that they interact primarily with proteins related to the maintenance of DNA methylation and remodeling of chromatin. Analysis of their promoter regions led to the identification of several cis-acting regulatory elements involved in stress response, including drought, heat and cold stresses. The presence of several miRNA targets and potential phosphorylation sites suggest that their expression is also regulated at post-transcriptional and post-translational levels. RNA-Seq data and quantitative real-time PCR analysis showed a low and drought-regulated gene expression of the citrus DMLs in different plant tissues. CsDML1 and CsDML3 were also differentially regulated by deficit irrigation in fruits at different developmental stages, with a positive and significant correlation found between CsDML1 and PHYTOENE SYNTHASE (PSY) and between CsDML3 and ATP CITRATE LYASEs (ACLs) and ZETA-CAROTENE DESATURASE (ZDS) gene expression. These results indicate that the citrus DMLs are potentially functional enzymes involved in developmental processes and drought stress-adaptive responses, providing a useful reference for further investigation of their functions and applications on the citrus improvement.

DEMETER-Like DNA 去甲基酶(DMLs)是植物许多发育和生物过程的表观遗传调节因子。迄今为止,还没有关于柑橘中 DML 基因家族的全面信息。本文鉴定并分析了柑橘属(Citrus sinensis,命名为 CsDML1-3)和柑橘属(C. clementina,命名为 CcDML1-3)基因组中的三个 DML 基因。它们编码亲水性和相对较大的蛋白质,预测会进行核定位,含有植物 DML 的典型保守结构域和基序。蛋白质相互作用网络分析表明,它们主要与维持 DNA 甲基化和重塑染色质相关的蛋白质相互作用。对它们的启动子区域进行分析后,发现了几个参与胁迫响应的顺式作用调控元件,包括干旱、热和冷胁迫。几个 miRNA 靶点和潜在磷酸化位点的存在表明,它们的表达也受到转录后和翻译后水平的调控。RNA-Seq 数据和定量实时 PCR 分析表明,柑橘 DMLs 在不同植物组织中的基因表达量较低,且受干旱调控。CsDML1和CsDML3在不同发育阶段的果实中也受到缺水灌溉的不同调控,其中CsDML1与芳香烃合成酶(PSY)、CsDML3与ATP柠檬酸裂解酶(ACLs)和ZETA-CAROTENE DESATURASE(ZDS)基因表达之间存在显著的正相关。这些结果表明,柑橘 DMLs 是参与发育过程和干旱胁迫适应反应的潜在功能酶,为进一步研究其功能和在柑橘改良中的应用提供了有益的参考。
{"title":"Identification of DEMETER-like DNA demethylase gene family in citrus and their role in drought stress-adaptive responses","authors":"Gláucia C.B. Silva,&nbsp;Luciana R. Camillo,&nbsp;Dalma B. Santos,&nbsp;Maurício S. Amorim,&nbsp;Luana P. Gonçalves,&nbsp;Ana C.O. Barbosa,&nbsp;Dílson S. Rocha Junior,&nbsp;Grazielle M. Alcântara,&nbsp;Marcio G.C. Costa","doi":"10.1016/j.compbiolchem.2024.108128","DOIUrl":"10.1016/j.compbiolchem.2024.108128","url":null,"abstract":"<div><p>DEMETER-Like DNA demethylases (DMLs) are epigenetic regulators of many developmental and biological processes in plants. No comprehensive information about the <em>DML</em> gene family in citrus is available to date. Here, a total of three <em>DML</em> genes in the genomes of <em>Citrus sinensis</em> (named <em>CsDML1–3</em>) and <em>C. clementina</em> (named <em>CcDML1–3</em>) were identified and analyzed. They encode hydrophilic and relatively large proteins, with prediction of nuclear localization, containing the conserved domains and motifs typical of plant DMLs. Protein interaction network analysis suggested that they interact primarily with proteins related to the maintenance of DNA methylation and remodeling of chromatin. Analysis of their promoter regions led to the identification of several <em>cis</em>-acting regulatory elements involved in stress response, including drought, heat and cold stresses. The presence of several miRNA targets and potential phosphorylation sites suggest that their expression is also regulated at post-transcriptional and post-translational levels. RNA-Seq data and quantitative real-time PCR analysis showed a low and drought-regulated gene expression of the citrus <em>DML</em>s in different plant tissues. <em>CsDML1</em> and <em>CsDML3</em> were also differentially regulated by deficit irrigation in fruits at different developmental stages, with a positive and significant correlation found between <em>CsDML1</em> and <em>PHYTOENE SYNTHASE</em> (<em>PSY</em>) and between <em>CsDML3</em> and <em>ATP CITRATE LYASE</em>s (<em>ACL</em>s) and <em>ZETA-CAROTENE DESATURASE</em> (<em>ZDS</em>) gene expression. These results indicate that the citrus DMLs are potentially functional enzymes involved in developmental processes and drought stress-adaptive responses, providing a useful reference for further investigation of their functions and applications on the citrus improvement.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141399969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
STPDA: Leveraging spatial-temporal patterns for downstream analysis in spatial transcriptomic data STPDA:利用时空模式对空间转录组数据进行下游分析
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-06-11 DOI: 10.1016/j.compbiolchem.2024.108127
Mingguang Shi , Xudong Cheng , Yulong Dai

Spatial transcriptomics, a groundbreaking field in cellular biology, faces the challenge of effectively deciphering complex spatial-temporal gene expression patterns. Traditional data analysis methods often fail to capture the intricate nuances of this data, limiting the depth of understanding in spatial distribution and gene interactions. In response, we present Spatial-Temporal Patterns for Downstream Analysis (STPDA), a sophisticated computational framework tailored for spatial transcriptomic data analysis. STPDA leverages high-resolution mapping to bridge the gap between genomics and histopathology, offering a comprehensive perspective on the spatial dynamics of gene expression within tissues. This approach enables a view of cellular function and organization, marking a paradigm shift in our comprehension of biological systems. By employing Autoregressive Moving Average (ARMA) and Long Short-Term Memory (LSTM) models, STPDA effectively deciphers both global and local spatio-temporal dynamics in cellular environments. This integration of spatial-temporal patterns for downstream analysis offers a transformative approach to spatial transcriptomics data analysis. STPDA excels in various single-cell analytical tasks, including the identification of ligand-receptor interactions and cell type classification. Its ability to harness spatial-temporal patterns not only matches but frequently surpasses the performance of existing state-of-the-art methods. To ensure widespread usability and impact, we have encapsulated STPDA in a scalable and accessible Python package, addressing single-cell tasks through advanced spatial-temporal pattern analysis. This development promises to enhance our understanding of cellular biology, offering novel insights and therapeutic strategies, and represents a substantial advancement in the field of spatial transcriptomics.

空间转录组学是细胞生物学的一个突破性领域,它面临着有效破译复杂的时空基因表达模式的挑战。传统的数据分析方法往往无法捕捉到这些数据错综复杂的细微差别,从而限制了对空间分布和基因相互作用的深入理解。为此,我们提出了用于下游分析的空间-时间模式(STPDA),这是一个为空间转录组数据分析量身定制的复杂计算框架。STPDA 利用高分辨率图谱架起了基因组学与组织病理学之间的桥梁,为组织内基因表达的空间动态提供了一个全面的视角。通过这种方法可以了解细胞的功能和组织,标志着我们对生物系统理解的范式转变。通过采用自回归移动平均(ARMA)和长短期记忆(LSTM)模型,STPDA 能有效解读细胞环境中的全局和局部时空动态。这种整合时空模式的下游分析为空间转录组学数据分析提供了一种变革性方法。STPDA 擅长各种单细胞分析任务,包括配体-受体相互作用的鉴定和细胞类型分类。它利用空间-时间模式的能力不仅能与现有的先进方法相媲美,而且经常超越它们。为了确保广泛的可用性和影响力,我们将 STPDA 封装在一个可扩展且易于访问的 Python 软件包中,通过先进的时空模式分析来解决单细胞任务。这项开发有望增强我们对细胞生物学的理解,提供新的见解和治疗策略,是空间转录组学领域的一大进步。
{"title":"STPDA: Leveraging spatial-temporal patterns for downstream analysis in spatial transcriptomic data","authors":"Mingguang Shi ,&nbsp;Xudong Cheng ,&nbsp;Yulong Dai","doi":"10.1016/j.compbiolchem.2024.108127","DOIUrl":"https://doi.org/10.1016/j.compbiolchem.2024.108127","url":null,"abstract":"<div><p>Spatial transcriptomics, a groundbreaking field in cellular biology, faces the challenge of effectively deciphering complex spatial-temporal gene expression patterns. Traditional data analysis methods often fail to capture the intricate nuances of this data, limiting the depth of understanding in spatial distribution and gene interactions. In response, we present Spatial-Temporal Patterns for Downstream Analysis (STPDA), a sophisticated computational framework tailored for spatial transcriptomic data analysis. STPDA leverages high-resolution mapping to bridge the gap between genomics and histopathology, offering a comprehensive perspective on the spatial dynamics of gene expression within tissues. This approach enables a view of cellular function and organization, marking a paradigm shift in our comprehension of biological systems. By employing Autoregressive Moving Average (ARMA) and Long Short-Term Memory (LSTM) models, STPDA effectively deciphers both global and local spatio-temporal dynamics in cellular environments. This integration of spatial-temporal patterns for downstream analysis offers a transformative approach to spatial transcriptomics data analysis. STPDA excels in various single-cell analytical tasks, including the identification of ligand-receptor interactions and cell type classification. Its ability to harness spatial-temporal patterns not only matches but frequently surpasses the performance of existing state-of-the-art methods. To ensure widespread usability and impact, we have encapsulated STPDA in a scalable and accessible Python package, addressing single-cell tasks through advanced spatial-temporal pattern analysis. This development promises to enhance our understanding of cellular biology, offering novel insights and therapeutic strategies, and represents a substantial advancement in the field of spatial transcriptomics.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141314371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spectrum of rare and common mitochondrial DNA variations from 1029 whole genomes of self-declared healthy individuals from India 来自印度 1029 个自称健康人的全基因组的罕见和常见线粒体 DNA 变异谱
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-06-10 DOI: 10.1016/j.compbiolchem.2024.108118
Vishu Gupta , Bani Jolly , Rahul C. Bhoyar , Mohit Kumar Divakar , Abhinav Jain , Anushree Mishra , Vigneshwar Senthivel , Mohamed Imran , Vinod Scaria , Sridhar Sivasubbu

Mitochondrial disorders are a class of heterogeneous disorders caused by genetic variations in the mitochondrial genome (mtDNA) as well as the nuclear genome. The spectrum of mtDNA variants remains unexplored in the Indian population. In the present study, we have cataloged 2689 high confidence single nucleotide variants, small insertions and deletions in mtDNA in 1029 healthy Indian individuals. We found a major proportion (76.5 %) of the variants being rare (AF<=0.005) in the studied population. Intriguingly, we found two ‘confirmed’ pathogenic variants (m.1555 A>G and m.14484 T>C) with a frequency of ∼1 in 250 individuals in our dataset. The high carrier frequency underscores the need for screening of the mtDNA pathogenic mutations in newborns in India. Interestingly, our analysis also revealed 202 variants in our dataset which have been ‘reported’ in disease cases as per the MITOMAP database. Additionally, we found the frequency of haplogroup M (52.2 %) to be the highest among all the 18 top-level haplogroups found in our dataset. In comparison to the global population datasets, 20 unique mtDNA variants are found in the Indian population. We hope the whole genome sequencing based compendium of mtDNA variants along with their allele frequencies and heteroplasmy levels in the Indian population will drive additional genome scale studies for mtDNA. Furthermore, the identification of clinically relevant variants in our dataset will aid in better clinical interpretation of the variants in mitochondrial disorders.

线粒体疾病是由线粒体基因组(mtDNA)和核基因组的遗传变异引起的一类异质性疾病。印度人群中的 mtDNA 变异谱仍未得到研究。在本研究中,我们对 1029 名健康印度人的 2689 个高置信度单核苷酸变异、线粒体 DNA 中的小插入和缺失进行了编目。我们发现,在所研究的人群中,大部分变异(76.5%)是罕见的(AF<=0.005)。有趣的是,我们在数据集中发现了两个 "确认 "的致病变体(m.1555 A>G 和 m.14484 T>C),在 250 个个体中的频率为 1∼1。高频率的携带者突显了对印度新生儿进行 mtDNA 致病突变筛查的必要性。有趣的是,我们的分析还发现,在我们的数据集中,有 202 个变异在 MITOMAP 数据库的疾病病例中被 "报告 "过。此外,我们还发现单倍群 M 的频率(52.2%)是我们数据集中所有 18 个顶级单倍群中最高的。与全球人口数据集相比,印度人口中发现了 20 个独特的 mtDNA 变异。我们希望基于全基因组测序的印度人群 mtDNA 变异及其等位基因频率和异源水平汇编能推动更多的 mtDNA 基因组规模研究。此外,我们数据集中与临床相关的变异的鉴定将有助于更好地对线粒体疾病中的变异进行临床解释。
{"title":"Spectrum of rare and common mitochondrial DNA variations from 1029 whole genomes of self-declared healthy individuals from India","authors":"Vishu Gupta ,&nbsp;Bani Jolly ,&nbsp;Rahul C. Bhoyar ,&nbsp;Mohit Kumar Divakar ,&nbsp;Abhinav Jain ,&nbsp;Anushree Mishra ,&nbsp;Vigneshwar Senthivel ,&nbsp;Mohamed Imran ,&nbsp;Vinod Scaria ,&nbsp;Sridhar Sivasubbu","doi":"10.1016/j.compbiolchem.2024.108118","DOIUrl":"https://doi.org/10.1016/j.compbiolchem.2024.108118","url":null,"abstract":"<div><p>Mitochondrial disorders are a class of heterogeneous disorders caused by genetic variations in the mitochondrial genome (mtDNA) as well as the nuclear genome. The spectrum of mtDNA variants remains unexplored in the Indian population. In the present study, we have cataloged 2689 high confidence single nucleotide variants, small insertions and deletions in mtDNA in 1029 healthy Indian individuals. We found a major proportion (76.5 %) of the variants being rare (AF&lt;=0.005) in the studied population. Intriguingly, we found two ‘confirmed’ pathogenic variants (m.1555 A&gt;G and m.14484 T&gt;C) with a frequency of ∼1 in 250 individuals in our dataset. The high carrier frequency underscores the need for screening of the mtDNA pathogenic mutations in newborns in India. Interestingly, our analysis also revealed 202 variants in our dataset which have been ‘reported’ in disease cases as per the MITOMAP database. Additionally, we found the frequency of haplogroup M (52.2 %) to be the highest among all the 18 top-level haplogroups found in our dataset. In comparison to the global population datasets, 20 unique mtDNA variants are found in the Indian population. We hope the whole genome sequencing based compendium of mtDNA variants along with their allele frequencies and heteroplasmy levels in the Indian population will drive additional genome scale studies for mtDNA. Furthermore, the identification of clinically relevant variants in our dataset will aid in better clinical interpretation of the variants in mitochondrial disorders.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141324763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ACDMBI: A deep learning model based on community division and multi-source biological information fusion predicts essential proteins ACDMBI:基于群落划分和多源生物信息融合的深度学习模型预测必需蛋白质
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-06-06 DOI: 10.1016/j.compbiolchem.2024.108115
Pengli Lu, Jialong Tian

Accurately identifying essential proteins is vital for drug research and disease diagnosis. Traditional centrality methods and machine learning approaches often face challenges in accurately discerning essential proteins, primarily relying on information derived from protein-protein interaction (PPI) networks. Despite attempts by some researchers to integrate biological data and PPI networks for predicting essential proteins, designing effective integration methods remains a challenge. In response to these challenges, this paper presents the ACDMBI model, specifically designed to overcome the aforementioned issues. ACDMBI is comprised of two key modules: feature extraction and classification. In terms of capturing relevant information, we draw insights from three distinct data sources. Initially, structural features of proteins are extracted from the PPI network through community division. Subsequently, these features are further optimized using Graph Convolutional Networks (GCN) and Graph Attention Networks (GAT). Moving forward, protein features are extracted from gene expression data utilizing Bidirectional Long Short-Term Memory networks (BiLSTM) and a multi-head self-attention mechanism. Finally, protein features are derived by mapping subcellular localization data to a one-dimensional vector and processing it through fully connected layers. In the classification phase, we integrate features extracted from three different data sources, crafting a multi-layer deep neural network (DNN) for protein classification prediction. Experimental results on brewing yeast data showcase the ACDMBI model’s superior performance, with AUC reaching 0.9533 and AUPR reaching 0.9153. Ablation experiments further reveal that the effective integration of features from diverse biological information significantly boosts the model’s performance.

准确识别必需蛋白质对药物研究和疾病诊断至关重要。传统的中心性方法和机器学习方法主要依赖于从蛋白质-蛋白质相互作用(PPI)网络中获得的信息,在准确识别必需蛋白质方面常常面临挑战。尽管一些研究人员尝试整合生物数据和 PPI 网络来预测必需蛋白,但设计有效的整合方法仍然是一个挑战。为了应对这些挑战,本文提出了 ACDMBI 模型,专门用于克服上述问题。ACDMBI 由两个关键模块组成:特征提取和分类。在捕捉相关信息方面,我们从三个不同的数据源中汲取灵感。首先,通过群落划分从 PPI 网络中提取蛋白质的结构特征。随后,使用图卷积网络(GCN)和图注意网络(GAT)进一步优化这些特征。接着,利用双向长短期记忆网络(BiLSTM)和多头自注意机制从基因表达数据中提取蛋白质特征。最后,通过将亚细胞定位数据映射到一维向量并通过全连接层进行处理,得出蛋白质特征。在分类阶段,我们整合了从三种不同数据源提取的特征,精心设计了一个用于蛋白质分类预测的多层深度神经网络(DNN)。酿酒酵母数据的实验结果表明,ACDMBI 模型性能优越,AUC 达到 0.9533,AUPR 达到 0.9153。消融实验进一步表明,有效整合来自不同生物信息的特征大大提高了模型的性能。
{"title":"ACDMBI: A deep learning model based on community division and multi-source biological information fusion predicts essential proteins","authors":"Pengli Lu,&nbsp;Jialong Tian","doi":"10.1016/j.compbiolchem.2024.108115","DOIUrl":"https://doi.org/10.1016/j.compbiolchem.2024.108115","url":null,"abstract":"<div><p>Accurately identifying essential proteins is vital for drug research and disease diagnosis. Traditional centrality methods and machine learning approaches often face challenges in accurately discerning essential proteins, primarily relying on information derived from protein-protein interaction (PPI) networks. Despite attempts by some researchers to integrate biological data and PPI networks for predicting essential proteins, designing effective integration methods remains a challenge. In response to these challenges, this paper presents the ACDMBI model, specifically designed to overcome the aforementioned issues. ACDMBI is comprised of two key modules: feature extraction and classification. In terms of capturing relevant information, we draw insights from three distinct data sources. Initially, structural features of proteins are extracted from the PPI network through community division. Subsequently, these features are further optimized using Graph Convolutional Networks (GCN) and Graph Attention Networks (GAT). Moving forward, protein features are extracted from gene expression data utilizing Bidirectional Long Short-Term Memory networks (BiLSTM) and a multi-head self-attention mechanism. Finally, protein features are derived by mapping subcellular localization data to a one-dimensional vector and processing it through fully connected layers. In the classification phase, we integrate features extracted from three different data sources, crafting a multi-layer deep neural network (DNN) for protein classification prediction. Experimental results on brewing yeast data showcase the ACDMBI model’s superior performance, with AUC reaching 0.9533 and AUPR reaching 0.9153. Ablation experiments further reveal that the effective integration of features from diverse biological information significantly boosts the model’s performance.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141307907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computational Biology and Chemistry
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1