首页 > 最新文献

Computational Biology and Chemistry最新文献

英文 中文
From driver genes to gene families: A computational analysis of oncogenic mutations and ubiquitination anomalies in hepatocellular carcinoma 从驱动基因到基因家族:肝细胞癌致癌突变和泛素化异常的计算分析
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-06-06 DOI: 10.1016/j.compbiolchem.2024.108119
Meng Wang, Xinyue Yan, Yanan Dong, Xiaoqin Li, Bin Gao

Hepatocellular carcinoma (HCC) is a widespread primary liver cancer with a high fatality rate. Despite several genes with oncogenic effects in HCC have been identified, many remain undiscovered. In this study, we conducted a comprehensive computational analysis to explore the involvement of genes within the same families as known driver genes in HCC. Specifically, we expanded the concept beyond single-gene mutations to encompass gene families sharing homologous structures, integrating various omics data to comprehensively understand gene abnormalities in cancer. Our analysis identified 74 domains with an enriched mutation burden, 404 domain mutation hotspots, and 233 dysregulated driver genes. We observed that specific low-frequency somatic mutations may contribute to HCC occurrence, potentially overlooked by single-gene algorithms. Furthermore, we systematically analyzed how abnormalities in the ubiquitinated proteasome system (UPS) impact HCC, finding that abnormal genes in E3, E2, DUB families, and Degron genes often result in HCC by affecting the stability of oncogenic or tumor suppressor proteins. In conclusion, expanding the exploration of driver genes to include gene families with homologous structures emerges as a promising strategy for uncovering additional oncogenic alterations in HCC.

肝细胞癌(HCC)是一种广泛存在的原发性肝癌,致死率很高。尽管已经发现了几个对 HCC 有致癌作用的基因,但仍有许多基因尚未被发现。在这项研究中,我们进行了全面的计算分析,以探索与已知驱动基因同族的基因参与 HCC 的情况。具体来说,我们将概念从单个基因突变扩展到了共享同源结构的基因家族,整合了各种omics数据,以全面了解癌症中的基因异常。我们的分析确定了 74 个具有丰富突变负担的结构域、404 个结构域突变热点和 233 个调控失调的驱动基因。我们观察到,特定的低频体细胞突变可能会导致 HCC 的发生,而单基因算法可能会忽略这些突变。此外,我们还系统分析了泛素化蛋白酶体系统(UPS)的异常如何影响 HCC,发现 E3、E2、DUB 家族和 Degron 基因的异常往往会影响致癌蛋白或抑癌基因的稳定性,从而导致 HCC。总之,扩大对驱动基因的探索范围,将具有同源结构的基因家族包括在内,是发现 HCC 中更多致癌改变的一种有前途的策略。
{"title":"From driver genes to gene families: A computational analysis of oncogenic mutations and ubiquitination anomalies in hepatocellular carcinoma","authors":"Meng Wang,&nbsp;Xinyue Yan,&nbsp;Yanan Dong,&nbsp;Xiaoqin Li,&nbsp;Bin Gao","doi":"10.1016/j.compbiolchem.2024.108119","DOIUrl":"https://doi.org/10.1016/j.compbiolchem.2024.108119","url":null,"abstract":"<div><p>Hepatocellular carcinoma (HCC) is a widespread primary liver cancer with a high fatality rate. Despite several genes with oncogenic effects in HCC have been identified, many remain undiscovered. In this study, we conducted a comprehensive computational analysis to explore the involvement of genes within the same families as known driver genes in HCC. Specifically, we expanded the concept beyond single-gene mutations to encompass gene families sharing homologous structures, integrating various omics data to comprehensively understand gene abnormalities in cancer. Our analysis identified 74 domains with an enriched mutation burden, 404 domain mutation hotspots, and 233 dysregulated driver genes. We observed that specific low-frequency somatic mutations may contribute to HCC occurrence, potentially overlooked by single-gene algorithms. Furthermore, we systematically analyzed how abnormalities in the ubiquitinated proteasome system (UPS) impact HCC, finding that abnormal genes in E3, E2, DUB families, and Degron genes often result in HCC by affecting the stability of oncogenic or tumor suppressor proteins. In conclusion, expanding the exploration of driver genes to include gene families with homologous structures emerges as a promising strategy for uncovering additional oncogenic alterations in HCC.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141291963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of a novel representation of drug 3D structures and enhancement of the TSR-based method for probing drug and target interactions 开发新型药物三维结构表示法,改进基于 TSR 的药物与靶点相互作用探测方法
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-06-04 DOI: 10.1016/j.compbiolchem.2024.108117
Tarikul I. Milon , Yuhong Wang , Ryan L. Fontenot , Poorya Khajouie , Francois Villinger , Vijay Raghavan , Wu Xu

Understanding the mechanisms underlying interactions between drugs and target proteins is critical for drug discovery. In our earlier studies, we introduced the Triangular Spatial Relationship (TSR)-based algorithm, which enables the representation of a protein’s 3D structure as a vector of integers (TSR keys). These TSR keys correspond to substructures of the 3D structure of a protein and are computed based on the triangles constructed by all possible triples of Cα atoms within the protein. In this study, we report on a new TSR-based algorithm for probing drug and target interactions. Specifically, we have extended the previous algorithm in three novel directions: TSR keys for representing the 3D structure of a drug or a ligand, cross TSR keys between drugs and their targets and intra-residual TSR keys for phosphorylated amino acids. The outcomes illustrate the key contributions as follows: (i) The TSR-based method, which uses the TSR keys as features, is unique in its capability to interpret hierarchical relationships of drugs as well as drug - target complexes using common and specific TSR keys. (ii) The method can distinguish not only the binding sites from the rest of the protein structures, but also the binding sites of primary targets from those of off-targets. (iii) The method has the potential to correlate the 3D structures of drugs with their functions. (iv) Representation of 3D structures by TSR keys has its unique advantage in terms of ease of making searching for similar substructures across structure datasets easier. In summary, this study presents a novel computational methodology, with significant advantages, for providing insights into the mechanism underlying drug and target interactions.

了解药物与靶蛋白之间的相互作用机制对于药物发现至关重要。在早期的研究中,我们引入了基于三角形空间关系(TSR)的算法,该算法能将蛋白质的三维结构表示为整数向量(TSR 键)。这些 TSR 键对应于蛋白质三维结构的子结构,是根据蛋白质中所有可能的 Cα 原子三元组构建的三角形计算得出的。在本研究中,我们报告了一种基于 TSR 的新算法,用于探测药物与靶点的相互作用。具体来说,我们在三个新方向上扩展了之前的算法:代表药物或配体三维结构的 TSR 键、药物与其靶标之间的交叉 TSR 键以及磷酸化氨基酸的残留内 TSR 键。这些成果说明了以下主要贡献:(i) 基于 TSR 的方法使用 TSR 键作为特征,其独特之处在于能够使用常见和特定的 TSR 键解释药物的层次关系以及药物-靶标复合物。(ii) 该方法不仅能将结合位点与蛋白质结构的其他部分区分开来,还能将主要靶标的结合位点与非靶标的结合位点区分开来。(iii) 该方法有可能将药物的三维结构与其功能联系起来。(iv) 用 TSR 键表示三维结构有其独特的优势,可以更容易地在结构数据集中搜索相似的子结构。总之,本研究提出了一种具有显著优势的新型计算方法,有助于深入了解药物与靶点相互作用的内在机制。
{"title":"Development of a novel representation of drug 3D structures and enhancement of the TSR-based method for probing drug and target interactions","authors":"Tarikul I. Milon ,&nbsp;Yuhong Wang ,&nbsp;Ryan L. Fontenot ,&nbsp;Poorya Khajouie ,&nbsp;Francois Villinger ,&nbsp;Vijay Raghavan ,&nbsp;Wu Xu","doi":"10.1016/j.compbiolchem.2024.108117","DOIUrl":"https://doi.org/10.1016/j.compbiolchem.2024.108117","url":null,"abstract":"<div><p>Understanding the mechanisms underlying interactions between drugs and target proteins is critical for drug discovery. In our earlier studies, we introduced the Triangular Spatial Relationship (TSR)-based algorithm, which enables the representation of a protein’s 3D structure as a vector of integers (TSR keys). These TSR keys correspond to substructures of the 3D structure of a protein and are computed based on the triangles constructed by all possible triples of C<sub>α</sub> atoms within the protein. In this study, we report on a new TSR-based algorithm for probing drug and target interactions. Specifically, we have extended the previous algorithm in three novel directions: TSR keys for representing the 3D structure of a drug or a ligand, cross TSR keys between drugs and their targets and intra-residual TSR keys for phosphorylated amino acids. The outcomes illustrate the key contributions as follows: (i) The TSR-based method, which uses the TSR keys as features, is unique in its capability to interpret hierarchical relationships of drugs as well as drug - target complexes using common and specific TSR keys. (ii) The method can distinguish not only the binding sites from the rest of the protein structures, but also the binding sites of primary targets from those of off-targets. (iii) The method has the potential to correlate the 3D structures of drugs with their functions. (iv) Representation of 3D structures by TSR keys has its unique advantage in terms of ease of making searching for similar substructures across structure datasets easier. In summary, this study presents a novel computational methodology, with significant advantages, for providing insights into the mechanism underlying drug and target interactions.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141291964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating solubility, stability, and inclusion complexation of oxyresveratrol with various β-cyclodextrin derivatives using advanced computational techniques and experimental validation 利用先进的计算技术和实验验证评估氧白藜芦醇与各种 β-环糊精衍生物的溶解性、稳定性和包合络合作用
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-06-01 DOI: 10.1016/j.compbiolchem.2024.108111
Saba Ali , Aamir Aman , Kowit Hengphasatporn , Lipika Oopkaew , Bunyaporn Todee , Ryo Fujiki , Ryuhei Harada , Yasuteru Shigeta , Kuakarun Krusong , Kiattawee Choowongkomon , Warinthorn Chavasiri , Peter Wolschann , Panupong Mahalapbutr , Thanyada Rungrotmongkol

Oxyresveratrol (OXY), a natural stilbenoid in mulberry fruits, is known for its diverse pharmacological properties. However, its clinical use is hindered by low water solubility and limited bioavailability. In the present study, the inclusion complexes of OXY with β-cyclodextrin (βCD) and its three analogs, dimethyl-β-cyclodextrin (DMβCD), hydroxypropyl-β-cyclodextrin (HPβCD) and sulfobutylether-β-cyclodextrin (SBEβCD), were investigated using in silico and in vitro studies. Molecular docking revealed two binding orientations of OXY, namely, 4′,6′-dihydroxyphenyl (A-form) and 5,7-benzenediol ring (B-form). Molecular Dynamics simulations suggested the formation of inclusion complexes with βCDs through two distinct orientations, with OXY/SBEβCD exhibiting maximum atom contacts and the lowest solvent-exposed area in the hydrophobic cavity. These results corresponded well with the highest binding affinity observed in OXY/SBEβCD when assessed using the MM/GBSA method. Beyond traditional simulation methods, Ligand-binding Parallel Cascade Selection Molecular Dynamics method was employed to investigate how the drug enters and accommodates within the hydrophobic cavity. The in silico results aligned with stability constants: SBEβCD (2060 M−1), HPβCD (1860 M−1), DMβCD (1700 M−1), and βCD (1420 M−1). All complexes exhibited a 1:1 binding mode (AL type), with SBEβCD enhancing OXY solubility (25-fold). SEM micrographs, DSC thermograms, FT-IR and 1H NMR spectra confirm the inclusion complex formation, revealing novel surface morphologies, distinctive thermal behaviors, and new peaks. Notably, the inhibitory impact on the proliferation of breast cancer cell lines, MCF-7, exhibited by inclusion complexes particularly OXY/DMβCD, OXY/HPβCD, and OXY/SBEβCD were markedly superior compared to that of OXY alone.

氧白藜芦醇(OXY)是桑葚果实中的一种天然硬脂类化合物,因其多种药理特性而闻名。然而,由于水溶性低和生物利用度有限,其临床应用受到阻碍。本研究采用硅学和体外实验研究了 OXY 与 β-环糊精(βCD)及其三种类似物(二甲基-β-环糊精(DMβCD)、羟丙基-β-环糊精(HPβCD)和磺丁基-β-环糊精(SBEβCD))的包合复合物。分子对接显示了 OXY 的两种结合方向,即 4′,6′-二羟基苯基(A-形式)和 5,7- 苯二醇环(B-形式)。分子动力学模拟表明,OXY/SBEβCD 通过两种不同的取向与 βCD 形成包合物复合物,OXY/SBEβCD 在疏水腔中表现出最大的原子接触和最小的溶剂暴露面积。这些结果与使用 MM/GBSA 方法评估 OXY/SBEβCD 时观察到的最高结合亲和力十分吻合。除了传统的模拟方法外,还采用了配体结合平行级联选择分子动力学方法来研究药物如何进入疏水空腔并与之相容。硅学结果与稳定性常数一致:SBEβCD(2060 M-1)、HPβCD(1860 M-1)、DMβCD(1700 M-1)和 βCD(1420 M-1)。所有复合物都表现出 1:1 的结合模式(AL 型),其中 SBEβCD 可提高 OXY 的溶解度(25 倍)。扫描电镜显微照片、DSC 热图、傅立叶变换红外光谱和 1H NMR 光谱证实了包合物的形成,揭示了新的表面形态、独特的热行为和新的峰值。值得注意的是,包合物(尤其是 OXY/DMβCD、OXY/HPβCD 和 OXY/SBEβCD)对乳腺癌细胞株 MCF-7 增殖的抑制作用明显优于单独的 OXY。
{"title":"Evaluating solubility, stability, and inclusion complexation of oxyresveratrol with various β-cyclodextrin derivatives using advanced computational techniques and experimental validation","authors":"Saba Ali ,&nbsp;Aamir Aman ,&nbsp;Kowit Hengphasatporn ,&nbsp;Lipika Oopkaew ,&nbsp;Bunyaporn Todee ,&nbsp;Ryo Fujiki ,&nbsp;Ryuhei Harada ,&nbsp;Yasuteru Shigeta ,&nbsp;Kuakarun Krusong ,&nbsp;Kiattawee Choowongkomon ,&nbsp;Warinthorn Chavasiri ,&nbsp;Peter Wolschann ,&nbsp;Panupong Mahalapbutr ,&nbsp;Thanyada Rungrotmongkol","doi":"10.1016/j.compbiolchem.2024.108111","DOIUrl":"10.1016/j.compbiolchem.2024.108111","url":null,"abstract":"<div><p>Oxyresveratrol (OXY), a natural stilbenoid in mulberry fruits, is known for its diverse pharmacological properties. However, its clinical use is hindered by low water solubility and limited bioavailability. In the present study, the inclusion complexes of OXY with β-cyclodextrin (βCD) and its three analogs, dimethyl-β-cyclodextrin (DMβCD), hydroxypropyl-β-cyclodextrin (HPβCD) and sulfobutylether-β-cyclodextrin (SBEβCD), were investigated using <em>in silico</em> and <em>in vitro</em> studies. Molecular docking revealed two binding orientations of OXY, namely, 4′,6′-dihydroxyphenyl (A-form) and 5,7-benzenediol ring (B-form). Molecular Dynamics simulations suggested the formation of inclusion complexes with βCDs through two distinct orientations, with OXY/SBEβCD exhibiting maximum atom contacts and the lowest solvent-exposed area in the hydrophobic cavity. These results corresponded well with the highest binding affinity observed in OXY/SBEβCD when assessed using the MM/GBSA method. Beyond traditional simulation methods, Ligand-binding Parallel Cascade Selection Molecular Dynamics method was employed to investigate how the drug enters and accommodates within the hydrophobic cavity. The <em>in silico</em> results aligned with stability constants: SBEβCD (2060 M<sup>−1</sup>), HPβCD (1860 M<sup>−1</sup>), DMβCD (1700 M<sup>−1</sup>), and βCD (1420 M<sup>−1</sup>). All complexes exhibited a 1:1 binding mode (A<sub>L</sub> type), with SBEβCD enhancing OXY solubility (25-fold). SEM micrographs, DSC thermograms, FT-IR and <sup>1</sup>H NMR spectra confirm the inclusion complex formation, revealing novel surface morphologies, distinctive thermal behaviors, and new peaks. Notably, the inhibitory impact on the proliferation of breast cancer cell lines, MCF-7, exhibited by inclusion complexes particularly OXY/DMβCD, OXY/HPβCD, and OXY/SBEβCD were markedly superior compared to that of OXY alone.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141232803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of viral families and hosts of single-stranded RNA viruses based on K-Mer coding from phylogenetic gene sequences 根据系统发育基因序列的 K-Mer 编码预测单链 RNA 病毒的病毒科和宿主
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-05-31 DOI: 10.1016/j.compbiolchem.2024.108114
Bahar Çi̇ftçi̇ , Ramazan Teki̇n

There are billions of virus species worldwide, and viruses, the smallest parasitic entities, pose a serious threat. Therefore, fighting associated disorders requires an understanding of the genetic structure of viruses. Considering the wide diversity and rapid evolution of viruses, there is a critical need to quickly and accurately classify viral species and their potential hosts to better understand transmission dynamics, facilitating the development of targeted therapies. Recognizing this, this study has investigated the classes of RNA viruses based on their genomic sequences using Machine Learning (ML) and Deep Learning (DL) models. The PhyVirus dataset, consisting of pathogenic Single-stranded RNA viruses of Baltimore group four (+ssRNA) and five (-ssRNA) with different hosts and species, was analyzed. The dataset containing viral gene sequences was analyzed using the K-Mer coding technique, which is based on base words of various lengths. The study used classical ML algorithms (Random Forest, Gradient Boosting and Extra Trees) and the Fully Connected Deep Neural Network, a Deep Learning algorithm, to predict viral families and hosts. Detailed analyses were performed on the classifier performance in scenarios with different train-test ratios and different word lengths (k-values) for K-Mer. The observed results show that Fully Connected Deep Neural Network has a high success rate of 99.60 % in predicting virus families. In predicting virus hosts, the Extra Trees classifier achieved the highest success rate of 81.53 %. This study is considered to be the first classification study in the literature on this dataset, which has a very large family and host diversity consisting of gene sequences of Single-stranded RNA viruses. Our detailed investigations on how varying word lengths based on K-Mer coding in gene sequences affect the classification into viral families and hosts make this study particularly valuable. This study shows that ML and DL methods have the potential to produce valuable results in phylogenetic studies. In addition, the results and high-performance values show that these methods can be successfully used in regenerative applications of gene sequences or in studies such as the elimination of losses in gene sequences.

全世界有数十亿种病毒,病毒作为最小的寄生实体,构成了严重威胁。因此,防治相关疾病需要了解病毒的基因结构。考虑到病毒的广泛多样性和快速进化,亟需快速准确地对病毒种类及其潜在宿主进行分类,以便更好地了解传播动态,促进靶向疗法的开发。有鉴于此,本研究利用机器学习(ML)和深度学习(DL)模型,根据 RNA 病毒的基因组序列研究了病毒的类别。研究分析了由不同宿主和物种的巴尔的摩第四组(+ssRNA)和第五组(-ssRNA)致病性单链 RNA 病毒组成的 PhyVirus 数据集。包含病毒基因序列的数据集采用 K-Mer 编码技术进行分析,该技术基于不同长度的碱基词。研究使用了经典的 ML 算法(随机森林、梯度提升和额外树)和全连接深度神经网络(一种深度学习算法)来预测病毒家族和宿主。对 K-Mer 不同训练测试比和不同词长(k 值)情况下的分类器性能进行了详细分析。观察结果表明,全连接深度神经网络预测病毒家族的成功率高达 99.60%。在预测病毒宿主方面,额外树分类器的成功率最高,达到 81.53%。该数据集由单链 RNA 病毒的基因序列组成,具有非常大的科属和宿主多样性。我们对基于基因序列中 K-Mer 编码的不同字长如何影响病毒科和宿主分类进行了详细调查,这使得这项研究尤为宝贵。这项研究表明,ML 和 DL 方法有可能在系统发育研究中产生有价值的结果。此外,研究结果和高性能值还表明,这些方法可以成功地用于基因序列的再生应用或消除基因序列损失等研究。
{"title":"Prediction of viral families and hosts of single-stranded RNA viruses based on K-Mer coding from phylogenetic gene sequences","authors":"Bahar Çi̇ftçi̇ ,&nbsp;Ramazan Teki̇n","doi":"10.1016/j.compbiolchem.2024.108114","DOIUrl":"https://doi.org/10.1016/j.compbiolchem.2024.108114","url":null,"abstract":"<div><p>There are billions of virus species worldwide, and viruses, the smallest parasitic entities, pose a serious threat. Therefore, fighting associated disorders requires an understanding of the genetic structure of viruses. Considering the wide diversity and rapid evolution of viruses, there is a critical need to quickly and accurately classify viral species and their potential hosts to better understand transmission dynamics, facilitating the development of targeted therapies. Recognizing this, this study has investigated the classes of RNA viruses based on their genomic sequences using Machine Learning (ML) and Deep Learning (DL) models. The PhyVirus dataset, consisting of pathogenic Single-stranded RNA viruses of Baltimore group four (+ssRNA) and five (-ssRNA) with different hosts and species, was analyzed. The dataset containing viral gene sequences was analyzed using the K-Mer coding technique, which is based on base words of various lengths. The study used classical ML algorithms (Random Forest, Gradient Boosting and Extra Trees) and the Fully Connected Deep Neural Network, a Deep Learning algorithm, to predict viral families and hosts. Detailed analyses were performed on the classifier performance in scenarios with different train-test ratios and different word lengths (k-values) for K-Mer. The observed results show that Fully Connected Deep Neural Network has a high success rate of 99.60 % in predicting virus families. In predicting virus hosts, the Extra Trees classifier achieved the highest success rate of 81.53 %. This study is considered to be the first classification study in the literature on this dataset, which has a very large family and host diversity consisting of gene sequences of Single-stranded RNA viruses. Our detailed investigations on how varying word lengths based on K-Mer coding in gene sequences affect the classification into viral families and hosts make this study particularly valuable. This study shows that ML and DL methods have the potential to produce valuable results in phylogenetic studies. In addition, the results and high-performance values show that these methods can be successfully used in regenerative applications of gene sequences or in studies such as the elimination of losses in gene sequences.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141291931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
UmamiPreDL: Deep learning model for umami taste prediction of peptides using BERT and CNN UmamiPreDL:使用 BERT 和 CNN 预测多肽鲜味的深度学习模型。
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-05-29 DOI: 10.1016/j.compbiolchem.2024.108116
Arun Pandiyan Indiran , Humaira Fatima , Sampriti Chattopadhyay , Sureshkumar Ramadoss , Yashwanth Radhakrishnan

Taste is crucial in driving food choice and preference. Umami is one of the basic tastes defined by characteristic deliciousness and mouthfulness that it imparts to foods. Identification of ingredients to enhance umami taste is of significant value to food industry. Various models have been shown to predict umami taste using feature encodings derived from traditional molecular descriptors such as amphiphilic pseudo-amino acid composition, dipeptide composition, and composition-transition-distribution. Highest reported accuracy of 90.5 % was recently achieved through novel model architecture. Here, we propose use of biological sequence transformers such as ProtBert and ESM2, trained on the Uniref databases, as the feature encoders block. With combination of 2 encoders and 2 classifiers, 4 model architectures were developed. Among the 4 models, ProtBert-CNN model outperformed other models with accuracy of 95 % on 5-fold cross validation data and 94 % on independent data.

口味对于食物的选择和偏好至关重要。"鲜味 "是基本口味之一,它赋予食物特有的美味和口感。对食品工业来说,确定能增强鲜味的配料具有重要价值。研究表明,各种模型都能利用从两亲假氨基酸组成、二肽组成和组成-过渡-分布等传统分子描述符中提取的特征编码来预测鲜味。最近,通过新颖的模型结构,报告的最高准确率达到了 90.5%。在此,我们建议使用在 Uniref 数据库上训练的生物序列转换器(如 ProtBert 和 ESM2)作为特征编码器模块。结合 2 个编码器和 2 个分类器,我们开发出了 4 种模型架构。在这 4 个模型中,ProtBert-CNN 模型在 5 倍交叉验证数据上的准确率为 95%,在独立数据上的准确率为 94%,表现优于其他模型。
{"title":"UmamiPreDL: Deep learning model for umami taste prediction of peptides using BERT and CNN","authors":"Arun Pandiyan Indiran ,&nbsp;Humaira Fatima ,&nbsp;Sampriti Chattopadhyay ,&nbsp;Sureshkumar Ramadoss ,&nbsp;Yashwanth Radhakrishnan","doi":"10.1016/j.compbiolchem.2024.108116","DOIUrl":"10.1016/j.compbiolchem.2024.108116","url":null,"abstract":"<div><p>Taste is crucial in driving food choice and preference. Umami is one of the basic tastes defined by characteristic deliciousness and mouthfulness that it imparts to foods. Identification of ingredients to enhance umami taste is of significant value to food industry. Various models have been shown to predict umami taste using feature encodings derived from traditional molecular descriptors such as amphiphilic pseudo-amino acid composition, dipeptide composition, and composition-transition-distribution. Highest reported accuracy of 90.5 % was recently achieved through novel model architecture. Here, we propose use of biological sequence transformers such as ProtBert and ESM2, trained on the Uniref databases, as the feature encoders block. With combination of 2 encoders and 2 classifiers, 4 model architectures were developed. Among the 4 models, ProtBert-CNN model outperformed other models with accuracy of 95 % on 5-fold cross validation data and 94 % on independent data.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141186564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Planning biosynthetic pathways of target molecules based on metabolic reaction prediction and AND-OR tree search 基于代谢反应预测和 AND-OR 树搜索规划目标分子的生物合成路径
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-05-29 DOI: 10.1016/j.compbiolchem.2024.108106
Xiaolei Zhang, Juan Liu, Feng Yang, Qiang Zhang, Zhihui Yang, Hayat Ali Shah

Bioretrosynthesis problem is to predict synthetic routes using substrates for given natural products (NPs). However, the huge number of metabolic reactions leads to a combinatorial explosion of searching space, which is high time-consuming and costly. Here, we propose a framework called BioRetro to predict bioretrosynthesis pathways using a one-step bioretrosynthesis network, termed HybridMLP combined with AND-OR tree heuristic search. The HybridMLP predicts precursors that will produce the target NPs, while the AND-OR tree generates the iterative multi-step biosynthetic pathways. The one-step bioretrosynthesis prediction experiments are conducted on MetaNetX dataset by using HybridMLP, which achieves 46.5%, 74.6%, 81.6% in terms of the top-1, top-5, top-10 accuracies. The great performance demonstrates the effectiveness of HybridMLP in one-step bioretrosynthesis. Besides, the evaluation of two benchmark datasets reveals that BioRetro can significantly improve the speed and success rate in predicting biosynthesis pathways. In addition, the BioRetro is further shown to find the synthetic pathway of compounds, such as ginsenoside F1 with the same substrates as reported but different enzymes, which may be the novel potential enzyme to have better catalytic performance.

生物合成问题是利用给定天然产物(NPs)的底物预测合成路线。然而,大量的代谢反应会导致搜索空间的组合爆炸,既费时又费钱。在此,我们提出了一个名为 BioRetro 的框架,利用一步生物合成网络预测生物合成途径,称为 HybridMLP 结合 AND-OR 树启发式搜索。HybridMLP 预测将产生目标 NP 的前体,而 AND-OR 树则生成迭代的多步骤生物合成途径。利用 HybridMLP 在 MetaNetX 数据集上进行了一步生物合成预测实验,结果表明,前 1 名、前 5 名、前 10 名的准确率分别为 46.5%、74.6%、81.6%。出色的性能证明了 HybridMLP 在一步法生物反合成中的有效性。此外,对两个基准数据集的评估表明,BioRetro 可以显著提高预测生物合成途径的速度和成功率。此外,BioRetro 还进一步证明了它能找到人参皂苷 F1 等化合物的合成途径,其底物与所报道的相同,但酶却不同,这可能是具有更好催化性能的新型潜在酶。
{"title":"Planning biosynthetic pathways of target molecules based on metabolic reaction prediction and AND-OR tree search","authors":"Xiaolei Zhang,&nbsp;Juan Liu,&nbsp;Feng Yang,&nbsp;Qiang Zhang,&nbsp;Zhihui Yang,&nbsp;Hayat Ali Shah","doi":"10.1016/j.compbiolchem.2024.108106","DOIUrl":"https://doi.org/10.1016/j.compbiolchem.2024.108106","url":null,"abstract":"<div><p>Bioretrosynthesis problem is to predict synthetic routes using substrates for given natural products (NPs). However, the huge number of metabolic reactions leads to a combinatorial explosion of searching space, which is high time-consuming and costly. Here, we propose a framework called BioRetro to predict bioretrosynthesis pathways using a one-step bioretrosynthesis network, termed HybridMLP combined with AND-OR tree heuristic search. The HybridMLP predicts precursors that will produce the target NPs, while the AND-OR tree generates the iterative multi-step biosynthetic pathways. The one-step bioretrosynthesis prediction experiments are conducted on MetaNetX dataset by using HybridMLP, which achieves 46.5%, 74.6%, 81.6% in terms of the top-1, top-5, top-10 accuracies. The great performance demonstrates the effectiveness of HybridMLP in one-step bioretrosynthesis. Besides, the evaluation of two benchmark datasets reveals that BioRetro can significantly improve the speed and success rate in predicting biosynthesis pathways. In addition, the BioRetro is further shown to find the synthetic pathway of compounds, such as ginsenoside F1 with the same substrates as reported but different enzymes, which may be the novel potential enzyme to have better catalytic performance.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141243737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating classic AI and agriculture: A novel model for predicting insecticide-likeness to enhance efficiency in insecticide development 将经典人工智能与农业相结合:预测杀虫剂亲和性以提高杀虫剂开发效率的新型模型
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-05-27 DOI: 10.1016/j.compbiolchem.2024.108113
Jia-Lin Cui , Hua Li , Qi He , Bin-Yan Jin , Zhe Liu , Xiao-Ming Zhang , Li Zhang

The integration of artificial intelligence (AI) into smart agriculture boosts production and management efficiency, facilitating sustainable agricultural development. In intensive agricultural management, adopting eco-friendly and effective pesticides is crucial to promote green agricultural practices. However, exploring new insecticides species is a difficult and time-consuming task that involves significant risks. Enhancing compound druggability in the lead discovery phase could considerably shorten the discovery cycle, accelerating insecticides research and development. The Insecticide Activity Prediction (IAPred) model, a novel classic artificial intelligence-based method for evaluating the potential insecticidal activity of unknown functional compounds, is introduced in this study. The IAPred model utilized 27 insecticide-likeness features from PaDEL descriptors and employed an ensemble of Support Vector Machine (SVM) and Random Forest (RF) algorithms using the hard-vote mechanism, achieving an accuracy rate of 86 %. Notably, the IAPred model outperforms current models by accurately predicting the efficacy of novel insecticides such as nicofluprole, overcoming the limitations inherent in existing insecticide structures. Our research presents a practical approach for discovering and optimizing novel insecticide lead compounds quickly and efficiently.

将人工智能(AI)融入智慧农业,可以提高生产和管理效率,促进农业可持续发展。在集约化农业管理中,采用环保、高效的杀虫剂对于促进绿色农业实践至关重要。然而,探索新的杀虫剂品种是一项艰巨而耗时的任务,而且涉及重大风险。在先导发现阶段提高化合物的可药性可大大缩短发现周期,加快杀虫剂的研发。杀虫活性预测(IAPred)模型是一种基于人工智能的新型经典方法,用于评估未知功能化合物的潜在杀虫活性。IAPred 模型利用了 PaDEL 描述符中的 27 个杀虫相似性特征,并采用了支持向量机(SVM)和随机森林(RF)算法的集合,使用了硬投票机制,准确率达到了 86%。值得注意的是,IAPred 模型准确预测了新型杀虫剂(如烟碱氟虫腈等)的药效,克服了现有杀虫剂结构固有的局限性,优于现有模型。我们的研究为快速高效地发现和优化新型杀虫剂先导化合物提供了一种实用方法。
{"title":"Integrating classic AI and agriculture: A novel model for predicting insecticide-likeness to enhance efficiency in insecticide development","authors":"Jia-Lin Cui ,&nbsp;Hua Li ,&nbsp;Qi He ,&nbsp;Bin-Yan Jin ,&nbsp;Zhe Liu ,&nbsp;Xiao-Ming Zhang ,&nbsp;Li Zhang","doi":"10.1016/j.compbiolchem.2024.108113","DOIUrl":"https://doi.org/10.1016/j.compbiolchem.2024.108113","url":null,"abstract":"<div><p>The integration of artificial intelligence (AI) into smart agriculture boosts production and management efficiency, facilitating sustainable agricultural development. In intensive agricultural management, adopting eco-friendly and effective pesticides is crucial to promote green agricultural practices. However, exploring new insecticides species is a difficult and time-consuming task that involves significant risks. Enhancing compound druggability in the lead discovery phase could considerably shorten the discovery cycle, accelerating insecticides research and development. The Insecticide Activity Prediction (IAPred) model, a novel classic artificial intelligence-based method for evaluating the potential insecticidal activity of unknown functional compounds, is introduced in this study. The IAPred model utilized 27 insecticide-likeness features from PaDEL descriptors and employed an ensemble of Support Vector Machine (SVM) and Random Forest (RF) algorithms using the hard-vote mechanism, achieving an accuracy rate of 86 %. Notably, the IAPred model outperforms current models by accurately predicting the efficacy of novel insecticides such as nicofluprole, overcoming the limitations inherent in existing insecticide structures. Our research presents a practical approach for discovering and optimizing novel insecticide lead compounds quickly and efficiently.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141291962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computational therapeutic repurposing of tavaborole targeting arginase-1 for venous leg ulcer 以精氨酸酶-1 为靶点的他伐波罗治疗静脉性腿部溃疡的计算疗法再定位
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-05-27 DOI: 10.1016/j.compbiolchem.2024.108112
Naveen Kumar V, T. Tamilanban

Venous leg ulcers (VLUs) pose a growing healthcare challenge due to aging, obesity, and sedentary lifestyles. Despite various treatments available, addressing the complex nature of VLUs remains difficult. In this context, this study investigates repurposing boronated drugs to inhibit arginase 1 activity for VLU treatment. The molecular docking study conducted by Schrodinger GLIDE targeted the binuclear manganese cluster of arginase 1 enzyme (2PHO). Further, the ligand-protein complex was subjected to molecular dynamic studies at 500 ns in Gromacs-2019.4. Trajectory analysis was performed using the GROMACS simulation package of protein RMSD, RMSF, RG, SASA, and H-Bond. The docking study revealed intriguing results where the tavaborole showed a better docking score (-3.957 Kcal/mol) compared to the substrate L-arginine (-3.379 Kcal/mol) and standard L-norvaline (-3.141 Kcal/mol). Tavaborole interaction with aspartic acid ultimately suggests that the drug molecule binds to the catalytic site of arginase 1, potentially influencing the enzyme's function. The dynamics study revealed the compounds' stability and compactness of the protein throughout the simulation. The RMSD, RMSF, SASA, RG, inter and intra H-bond, PCA, FEL, and MMBSA studies affirmed the ligand-protein and protein complex flexibility, compactness, binding energy, van der waals energy, and solvation dynamics. These results revealed the stability and the interaction of the ligand with the catalytic site of arginase 1 enzyme, triggering the study towards the VLU treatment.

由于老龄化、肥胖和久坐不动的生活方式,腿部静脉溃疡(VLU)对医疗保健构成了日益严峻的挑战。尽管有各种治疗方法,但解决静脉性腿部溃疡的复杂性仍然困难重重。在这种情况下,本研究调查了将抑制精氨酸酶 1 活性的硼化药物重新用于治疗 VLU 的情况。利用 Schrodinger GLIDE 进行的分子对接研究以精氨酸酶 1 的双核锰簇(2PHO)为目标。此外,还在 Gromacs-2019.4 中对配体-蛋白质复合物进行了 500 ns 的分子动力学研究。使用 GROMACS 仿真软件包对蛋白质的 RMSD、RMSF、RG、SASA 和 H-Bond 进行了轨迹分析。对接研究揭示了耐人寻味的结果,与底物 L-精氨酸(-3.379 Kcal/mol)和标准 L-正缬氨酸(-3.141 Kcal/mol)相比,塔伐伯罗显示出更好的对接得分(-3.957 Kcal/mol)。Tavaborole 与天冬氨酸的相互作用最终表明,药物分子与精氨酸酶 1 的催化位点结合,可能会影响酶的功能。动力学研究揭示了化合物在整个模拟过程中的稳定性和蛋白质的紧密性。RMSD、RMSF、SASA、RG、H-键间和H-键内、PCA、FEL和MMBSA研究证实了配体-蛋白质和蛋白质复合物的灵活性、紧密性、结合能、范德华能和溶解动力学。这些结果表明了配体的稳定性和与精氨酸酶 1 酶催化位点的相互作用,从而引发了对 VLU 处理的研究。
{"title":"Computational therapeutic repurposing of tavaborole targeting arginase-1 for venous leg ulcer","authors":"Naveen Kumar V,&nbsp;T. Tamilanban","doi":"10.1016/j.compbiolchem.2024.108112","DOIUrl":"https://doi.org/10.1016/j.compbiolchem.2024.108112","url":null,"abstract":"<div><p>Venous leg ulcers (VLUs) pose a growing healthcare challenge due to aging, obesity, and sedentary lifestyles. Despite various treatments available, addressing the complex nature of VLUs remains difficult. In this context, this study investigates repurposing boronated drugs to inhibit arginase 1 activity for VLU treatment. The molecular docking study conducted by Schrodinger GLIDE targeted the binuclear manganese cluster of arginase 1 enzyme (2PHO). Further, the ligand-protein complex was subjected to molecular dynamic studies at 500 ns in Gromacs-2019.4. Trajectory analysis was performed using the GROMACS simulation package of protein RMSD, RMSF, RG, SASA, and H-Bond. The docking study revealed intriguing results where the tavaborole showed a better docking score (-3.957 Kcal/mol) compared to the substrate L-arginine (-3.379 Kcal/mol) and standard L-norvaline (-3.141 Kcal/mol). Tavaborole interaction with aspartic acid ultimately suggests that the drug molecule binds to the catalytic site of arginase 1, potentially influencing the enzyme's function. The dynamics study revealed the compounds' stability and compactness of the protein throughout the simulation. The RMSD, RMSF, SASA, RG, inter and intra H-bond, PCA, FEL, and MMBSA studies affirmed the ligand-protein and protein complex flexibility, compactness, binding energy, van der waals energy, and solvation dynamics. These results revealed the stability and the interaction of the ligand with the catalytic site of arginase 1 enzyme, triggering the study towards the VLU treatment.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141249921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge enhanced attention aggregation network for medicine recommendation 用于药品推荐的知识增强型注意力聚合网络
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-05-23 DOI: 10.1016/j.compbiolchem.2024.108099
Jiedong Wei , Yijia Zhang , Xingwang Li , Mingyu Lu , Hongfei Lin

The combination of deep learning and the medical field has recently achieved great success, particularly in recommending medicine for patients. However, patients’ clinical records often contain repeated medical information that can significantly impact their health condition. Most existing methods for modeling longitudinal patient information overlook the impact of individual diagnoses and procedures on the patient’s health, resulting in insufficient patient representation and limited accuracy of medicine recommendations. Therefore, we propose a medicine recommendation model called KEAN, which is based on an attention aggregation network and enhanced graph convolution. Specifically, KEAN can aggregate individual diagnoses and procedures in patient visits to capture significant features that affect patients’ diseases. We further incorporate medicine knowledge from complex medicine combinations, reduce drug–drug interactions (DDIs), and recommend medicines that are beneficial to patients’ health. The experimental results on the MIMIC-III dataset demonstrate that our model outperforms existing advanced methods, which highlights the effectiveness of the proposed method.

最近,深度学习与医疗领域的结合取得了巨大成功,尤其是在为患者推荐药物方面。然而,患者的临床记录往往包含重复的医疗信息,这些信息会对患者的健康状况产生重大影响。现有的大多数患者纵向信息建模方法都忽略了单个诊断和手术对患者健康的影响,导致患者代表性不足,药品推荐的准确性有限。因此,我们提出了基于注意力聚合网络和增强图卷积的医药推荐模型 KEAN。具体来说,KEAN 可以聚合患者就诊时的单个诊断和治疗过程,从而捕捉影响患者疾病的重要特征。我们还进一步从复杂的药物组合中纳入药物知识,减少药物间的相互作用(DDI),并推荐对患者健康有益的药物。在 MIMIC-III 数据集上的实验结果表明,我们的模型优于现有的先进方法,这凸显了所提方法的有效性。
{"title":"Knowledge enhanced attention aggregation network for medicine recommendation","authors":"Jiedong Wei ,&nbsp;Yijia Zhang ,&nbsp;Xingwang Li ,&nbsp;Mingyu Lu ,&nbsp;Hongfei Lin","doi":"10.1016/j.compbiolchem.2024.108099","DOIUrl":"10.1016/j.compbiolchem.2024.108099","url":null,"abstract":"<div><p>The combination of deep learning and the medical field has recently achieved great success, particularly in recommending medicine for patients. However, patients’ clinical records often contain repeated medical information that can significantly impact their health condition. Most existing methods for modeling longitudinal patient information overlook the impact of individual diagnoses and procedures on the patient’s health, resulting in insufficient patient representation and limited accuracy of medicine recommendations. Therefore, we propose a medicine recommendation model called KEAN, which is based on an attention aggregation network and enhanced graph convolution. Specifically, KEAN can aggregate individual diagnoses and procedures in patient visits to capture significant features that affect patients’ diseases. We further incorporate medicine knowledge from complex medicine combinations, reduce drug–drug interactions (DDIs), and recommend medicines that are beneficial to patients’ health. The experimental results on the MIMIC-III dataset demonstrate that our model outperforms existing advanced methods, which highlights the effectiveness of the proposed method.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141137592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross vision transformer with enhanced Growth Optimizer for breast cancer detection in IoMT environment 交叉视觉转换器与增强型增长优化器,用于物联网环境下的乳腺癌检测
IF 3.1 4区 生物学 Q1 Mathematics Pub Date : 2024-05-22 DOI: 10.1016/j.compbiolchem.2024.108110
Mohamed Abd Elaziz , Abdelghani Dahou , Ahmad O. Aseeri , Ahmed A. Ewees , Mohammed A.A. Al-qaness , Rehab Ali Ibrahim

The recent advances in artificial intelligence modern approaches can play vital roles in the Internet of Medical Things (IoMT). Automatic diagnosis is one of the most important topics in the IoMT, including cancer diagnosis. Breast cancer is one of the top causes of death among women. Accurate diagnosis and early detection of breast cancer can improve the survival rate of patients. Deep learning models have demonstrated outstanding potential in accurately detecting and diagnosing breast cancer. This paper proposes a novel technology for breast cancer detection using CrossViT as the deep learning model and an enhanced version of the Growth Optimizer algorithm (MGO) as the feature selection method. CrossVit is a hybrid deep learning model that combines the strengths of both convolutional neural networks (CNNs) and transformers. The MGO is a meta-heuristic algorithm that selects the most relevant features from a large pool of features to enhance the performance of the model. The developed approach was evaluated on three publicly available breast cancer datasets and achieved competitive performance compared to other state-of-the-art methods. The results show that the combination of CrossViT and the MGO can effectively identify the most informative features for breast cancer detection, potentially assisting clinicians in making accurate diagnoses and improving patient outcomes. The MGO algorithm improves accuracy by approximately 1.59% on INbreast, 5.00% on MIAS, and 0.79% on MiniDDSM compared to other methods on each respective dataset. The developed approach can also be utilized to improve the Quality of Service (QoS) in the healthcare system as a deployable IoT-based intelligent solution or a decision-making assistance service, enhancing the efficiency and precision of the diagnosis.

人工智能现代方法的最新进展可在医疗物联网(IoMT)中发挥重要作用。自动诊断是 IoMT 最重要的主题之一,其中包括癌症诊断。乳腺癌是妇女死亡的首要原因之一。准确诊断和早期发现乳腺癌可以提高患者的生存率。深度学习模型在准确检测和诊断乳腺癌方面表现出了突出的潜力。本文以 CrossViT 作为深度学习模型,以增强版增长优化算法(MGO)作为特征选择方法,提出了一种新型乳腺癌检测技术。CrossVit 是一种混合深度学习模型,结合了卷积神经网络(CNN)和变换器的优势。MGO 是一种元启发式算法,可从大量特征库中选择最相关的特征,以提高模型的性能。所开发的方法在三个公开的乳腺癌数据集上进行了评估,与其他最先进的方法相比,取得了具有竞争力的性能。结果表明,CrossViT 和 MGO 的结合能有效识别乳腺癌检测中信息量最大的特征,从而帮助临床医生做出准确诊断并改善患者预后。与其他方法相比,MGO 算法在 INbreast、MIAS 和 MiniDDSM 数据集上的准确率分别提高了约 1.59%、5.00% 和 0.79%。所开发的方法还可用于改善医疗系统的服务质量(QoS),作为一种可部署的基于物联网的智能解决方案或决策辅助服务,提高诊断的效率和精确度。
{"title":"Cross vision transformer with enhanced Growth Optimizer for breast cancer detection in IoMT environment","authors":"Mohamed Abd Elaziz ,&nbsp;Abdelghani Dahou ,&nbsp;Ahmad O. Aseeri ,&nbsp;Ahmed A. Ewees ,&nbsp;Mohammed A.A. Al-qaness ,&nbsp;Rehab Ali Ibrahim","doi":"10.1016/j.compbiolchem.2024.108110","DOIUrl":"10.1016/j.compbiolchem.2024.108110","url":null,"abstract":"<div><p>The recent advances in artificial intelligence modern approaches can play vital roles in the Internet of Medical Things (IoMT). Automatic diagnosis is one of the most important topics in the IoMT, including cancer diagnosis. Breast cancer is one of the top causes of death among women. Accurate diagnosis and early detection of breast cancer can improve the survival rate of patients. Deep learning models have demonstrated outstanding potential in accurately detecting and diagnosing breast cancer. This paper proposes a novel technology for breast cancer detection using CrossViT as the deep learning model and an enhanced version of the Growth Optimizer algorithm (MGO) as the feature selection method. CrossVit is a hybrid deep learning model that combines the strengths of both convolutional neural networks (CNNs) and transformers. The MGO is a meta-heuristic algorithm that selects the most relevant features from a large pool of features to enhance the performance of the model. The developed approach was evaluated on three publicly available breast cancer datasets and achieved competitive performance compared to other state-of-the-art methods. The results show that the combination of CrossViT and the MGO can effectively identify the most informative features for breast cancer detection, potentially assisting clinicians in making accurate diagnoses and improving patient outcomes. The MGO algorithm improves accuracy by approximately 1.59% on INbreast, 5.00% on MIAS, and 0.79% on MiniDDSM compared to other methods on each respective dataset. The developed approach can also be utilized to improve the Quality of Service (QoS) in the healthcare system as a deployable IoT-based intelligent solution or a decision-making assistance service, enhancing the efficiency and precision of the diagnosis.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141143989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computational Biology and Chemistry
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1