首页 > 最新文献

Computational Biology and Chemistry最新文献

英文 中文
Cancer detection with various classification models: A comprehensive feature analysis using HMM to extract a nucleotide pattern 利用各种分类模型检测癌症:利用 HMM 提取核苷酸模式的综合特征分析
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-09-21 DOI: 10.1016/j.compbiolchem.2024.108215
Vijay Kalal, Brajesh Kumar Jha
This work presents a novel feature extraction method for identifying complex patterns in genomic sequences by employing the Hidden Markov Model (HMM). In this study, we use HMM to identify gene nucleotide patterns that are specific to malignant and non-malignant cells. Crucial genetic components DNA and RNA are involved in many biological processes that impact both healthy and malignant cells. Early patient identification is essential to successful cancer diagnosis and therapy. Varying nucleotide patterns indicate different cellular responses, which are important to understanding the molecular causes of cancer and associated disorders. We present a detailed study of nucleotide patterns in whole raw nucleotide sequences with variations in both protein sequence (CDS) and non-protein sequence (NCDS) in both malignant and non-malignant cells. Nucleotide prediction has been achieved while computational expenses are reduced by using the proposed HMM for feature extraction and selection. The classification models implemented in this work for cancer detection are Gradient-Boosted Decision Trees (GBDT), Random Forests (RF), Decision Trees (DT), and Support Vector Machines (SVM) with kernels. The suggested classification model's accuracy and 10-fold cross-validation have been validated via comprehensive case studies. The results reveal that DT and ensemble learning techniques significantly differentiate between malignant and non-malignant DNA sequences. SVM with suitable kernels improves cancer detection accuracy significantly. Combining feature reduction approaches with nucleotide pattern classifiers based on Hidden Markov models improves performance and ensures reliable cancer detection.
本研究提出了一种新颖的特征提取方法,利用隐马尔可夫模型(HMM)识别基因组序列中的复杂模式。在这项研究中,我们使用 HMM 来识别恶性和非恶性细胞特有的基因核苷酸模式。DNA 和 RNA 这两种重要的基因成分参与了许多影响健康细胞和恶性细胞的生物过程。早期识别病人对成功诊断和治疗癌症至关重要。不同的核苷酸模式表示不同的细胞反应,这对了解癌症和相关疾病的分子原因非常重要。我们详细研究了恶性和非恶性细胞中蛋白质序列(CDS)和非蛋白质序列(NCDS)变化的整个原始核苷酸序列中的核苷酸模式。通过使用提议的 HMM 进行特征提取和选择,实现了核苷酸预测,同时减少了计算费用。这项工作中用于癌症检测的分类模型包括梯度提升决策树(GBDT)、随机森林(RF)、决策树(DT)和带内核的支持向量机(SVM)。通过全面的案例研究,验证了所建议的分类模型的准确性和 10 倍交叉验证。结果表明,DT 和集合学习技术能明显区分恶性和非恶性 DNA 序列。具有合适内核的 SVM 能显著提高癌症检测的准确性。将减少特征的方法与基于隐马尔可夫模型的核苷酸模式分类器相结合,可提高性能并确保可靠的癌症检测。
{"title":"Cancer detection with various classification models: A comprehensive feature analysis using HMM to extract a nucleotide pattern","authors":"Vijay Kalal,&nbsp;Brajesh Kumar Jha","doi":"10.1016/j.compbiolchem.2024.108215","DOIUrl":"10.1016/j.compbiolchem.2024.108215","url":null,"abstract":"<div><div>This work presents a novel feature extraction method for identifying complex patterns in genomic sequences by employing the Hidden Markov Model (HMM). In this study, we use HMM to identify gene nucleotide patterns that are specific to malignant and non-malignant cells. Crucial genetic components DNA and RNA are involved in many biological processes that impact both healthy and malignant cells. Early patient identification is essential to successful cancer diagnosis and therapy. Varying nucleotide patterns indicate different cellular responses, which are important to understanding the molecular causes of cancer and associated disorders. We present a detailed study of nucleotide patterns in whole raw nucleotide sequences with variations in both protein sequence (CDS) and non-protein sequence (NCDS) in both malignant and non-malignant cells. Nucleotide prediction has been achieved while computational expenses are reduced by using the proposed HMM for feature extraction and selection. The classification models implemented in this work for cancer detection are Gradient-Boosted Decision Trees (GBDT), Random Forests (RF), Decision Trees (DT), and Support Vector Machines (SVM) with kernels. The suggested classification model's accuracy and 10-fold cross-validation have been validated via comprehensive case studies. The results reveal that DT and ensemble learning techniques significantly differentiate between malignant and non-malignant DNA sequences. SVM with suitable kernels improves cancer detection accuracy significantly. Combining feature reduction approaches with nucleotide pattern classifiers based on Hidden Markov models improves performance and ensures reliable cancer detection.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108215"},"PeriodicalIF":2.6,"publicationDate":"2024-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142395948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using a dual immunoinformatics and bioinformatics approach to design a novel and effective multi-epitope vaccine against human torovirus disease 利用免疫信息学和生物信息学双重方法设计新型、有效的多表位人类托罗病毒病疫苗
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-09-19 DOI: 10.1016/j.compbiolchem.2024.108213
Sajjad Ahmad , Syed Shujait Ali , Arshad Iqbal , Shahid Ali , Zahid Hussain , Ishaq Khan , Hayat Khan
Human Torovirus (HToV), a member of the Coronaviridae family, causes severe enteric diseases with no specific medication available. To develop novel preventative measures, we employed immunoinformatics techniques to design a multi-epitope-based subunit vaccine (HToV-MEV) triggering diverse immune responses. We selected non-allergenic, non-toxic, and antigenic epitopes from structural polyproteins, joined them with suitable linkers, and added an adjuvant 50S ribosomal L7/L12 peptide. The vaccine's solubility score of 0.903678 and physiochemical properties were found effective. Molecular dynamics simulations and free energy calculations revealed strong binding affinity for Toll-like receptor 3 (TLR-3), with a lowest energy score of −303.88, indicating high affinity. In-silico cloning and codon optimization showed significant production potential in E. coli. Immune simulations predicted a human immunological response. Our results are promising, but subsequent in vivo research is recommended. The HToV-MEV vaccine design demonstrates potential for preventing HToV-related diseases, and further investigation is warranted to explore its therapeutic applications.
人类托罗病毒(HToV)是冠状病毒科的一种病毒,可引起严重的肠道疾病,目前尚无特效药物。为了开发新的预防措施,我们采用免疫信息学技术设计了一种基于多表位的亚单位疫苗(HToV-MEV),可引发多种免疫反应。我们从结构多蛋白中选择了非过敏性、无毒性和抗原性表位,用合适的连接体将它们连接起来,并添加了佐剂 50S 核糖体 L7/L12 肽。该疫苗的溶解度得分为 0.903678,其理化性质也十分有效。分子动力学模拟和自由能计算显示,该疫苗与 Toll 样受体 3(TLR-3)有很强的结合亲和力,最低能量得分为 -303.88,表明该疫苗具有很高的亲和力。在大肠杆菌中进行的内嵌克隆和密码子优化显示了巨大的生产潜力。免疫模拟预测了人类的免疫反应。我们的结果很有希望,但建议进行后续的体内研究。HToV-MEV疫苗设计展示了预防HToV相关疾病的潜力,我们有必要进一步研究其治疗应用。
{"title":"Using a dual immunoinformatics and bioinformatics approach to design a novel and effective multi-epitope vaccine against human torovirus disease","authors":"Sajjad Ahmad ,&nbsp;Syed Shujait Ali ,&nbsp;Arshad Iqbal ,&nbsp;Shahid Ali ,&nbsp;Zahid Hussain ,&nbsp;Ishaq Khan ,&nbsp;Hayat Khan","doi":"10.1016/j.compbiolchem.2024.108213","DOIUrl":"10.1016/j.compbiolchem.2024.108213","url":null,"abstract":"<div><div>Human Torovirus (HToV), a member of the Coronaviridae family, causes severe enteric diseases with no specific medication available. To develop novel preventative measures, we employed immunoinformatics techniques to design a multi-epitope-based subunit vaccine (HToV-MEV) triggering diverse immune responses. We selected non-allergenic, non-toxic, and antigenic epitopes from structural polyproteins, joined them with suitable linkers, and added an adjuvant 50S ribosomal L7/L12 peptide. The vaccine's solubility score of 0.903678 and physiochemical properties were found effective. Molecular dynamics simulations and free energy calculations revealed strong binding affinity for Toll-like receptor 3 (TLR-3), with a lowest energy score of −303.88, indicating high affinity. In-silico cloning and codon optimization showed significant production potential in <em>E. coli</em>. Immune simulations predicted a human immunological response. Our results are promising, but subsequent in vivo research is recommended. The HToV-MEV vaccine design demonstrates potential for preventing HToV-related diseases, and further investigation is warranted to explore its therapeutic applications.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108213"},"PeriodicalIF":2.6,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142318630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design, synthesis, in-vitro, in-silico, DFT and POM studies of a novel family of sulfonamides as potent anti-triple-negative breast cancer agents 作为强效抗三阴性乳腺癌药物的新型磺酰胺家族的设计、合成、体外、实验室、DFT 和 POM 研究
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-09-19 DOI: 10.1016/j.compbiolchem.2024.108214
Nassima Saghdani , Abdelmoula El Abbouchi , Nabil El Brahmi , Abderrazak Idir , Khadija Otmane Rachedi , Malika Berredjem , Rachid Haloui , Souad Elkhattabi , Hassan Ait Mouse , Taibi Ben Hadda , Mostapha Bousmina , Abdelmajid Zyad , Saïd El Kazzouli

In this study, a new family of ethacrynic acid-sulfonamides and indazole-sulfonamides was synthesized and tested in vitro against MDA-MB-468 triple-negative breast cancer cells and PBMCs human peripheral blood mononuclear cells, using the MTT (3-[4,5-dimethylthiazol-2-yl]-2,5 diphenyl tetrazolium bromide) assay. The aim of this research is to discover novel compounds with potential therapeutic effects on breast cancer. The antiproliferative activity of these compounds showed a significant dose-dependent activity, with IC50 values ranging between 2.83 and 7.52 µM. The lead compounds 8 and 9 displayed similar IC50 values to paclitaxel with 2.83, 3.84 and 2.72 µM, respectively. This highlights the novelty and potential of these compounds as alternatives to current treatments. The binding properties of 8, 9, and paclitaxel with the active sites of the PARP1(Poly(ADP-ribose) polymérase 1) and EGFR (Epidermal growth factor receptor) proteins were analyzed by molecular docking methods showing, for PARP1 protein, binding affinities of −9.8 Kcal /mol, −10 Kcal /mol, and −9.4 Kcal /mol, respectively. While their binding affinities for EGFR protein are −7.5 Kcal/mol, −7.2 Kcal/mol and −6.9 Kcal/mol, respectively. Moreover, drug-likeness and ADMET (Absorption–distribution–metabolism–excretion–toxicity) analyses demonstrated that both molecules are orally bioavailable and have good pharmacokinetic and non-toxic profiles. DFT (Density functional theory) was also carried out on both compounds 8 and 9 additionally to POM (Petra/Osiris/Molinspiration) studies on all compounds. The outcomes of this study suggest that compounds 8 and 9 are promising candidates for further development as therapeutic agents against triple-negative breast cancer

本研究合成了一系列新的乙酰丙酸磺酰胺类化合物和吲唑磺酰胺类化合物,并利用 MTT(3-[4,5-二甲基噻唑-2-基]-2,5-二苯基四唑溴化物)测定法对 MDA-MB-468 三阴性乳腺癌细胞和 PBMCs 人类外周血单核细胞进行了体外测试。这项研究的目的是发现对乳腺癌有潜在治疗作用的新型化合物。这些化合物的抗增殖活性表现出明显的剂量依赖性,IC50 值介于 2.83 和 7.52 µM 之间。先导化合物 8 和 9 的 IC50 值与紫杉醇相似,分别为 2.83、3.84 和 2.72 µM。这凸显了这些化合物的新颖性和替代现有疗法的潜力。分子对接法分析了 8、9 和紫杉醇与 PARP1(Poly(ADP-ribose) polymérase 1)和 EGFR(表皮生长因子受体)蛋白活性位点的结合特性,结果显示,它们与 PARP1 蛋白的结合亲和力分别为 -9.8 Kcal /mol、-10 Kcal /mol 和 -9.4 Kcal /mol。而它们与表皮生长因子受体蛋白的结合亲和力分别为-7.5 Kcal/mol、-7.2 Kcal/mol和-6.9 Kcal/mol。此外,药物相似性和 ADMET(吸收-分布-代谢-排泄-毒性)分析表明,这两种分子都具有良好的口服生物利用度、药代动力学特征和无毒性特征。除了对所有化合物进行 POM(Petra/Osiris/Molinspiration)研究外,还对化合物 8 和 9 进行了 DFT(密度泛函理论)研究。研究结果表明,化合物 8 和 9 有希望进一步发展成为治疗三阴性乳腺癌的药物。
{"title":"Design, synthesis, in-vitro, in-silico, DFT and POM studies of a novel family of sulfonamides as potent anti-triple-negative breast cancer agents","authors":"Nassima Saghdani ,&nbsp;Abdelmoula El Abbouchi ,&nbsp;Nabil El Brahmi ,&nbsp;Abderrazak Idir ,&nbsp;Khadija Otmane Rachedi ,&nbsp;Malika Berredjem ,&nbsp;Rachid Haloui ,&nbsp;Souad Elkhattabi ,&nbsp;Hassan Ait Mouse ,&nbsp;Taibi Ben Hadda ,&nbsp;Mostapha Bousmina ,&nbsp;Abdelmajid Zyad ,&nbsp;Saïd El Kazzouli","doi":"10.1016/j.compbiolchem.2024.108214","DOIUrl":"10.1016/j.compbiolchem.2024.108214","url":null,"abstract":"<div><p>In this study, a new family of ethacrynic acid-sulfonamides and indazole-sulfonamides was synthesized and tested <em>in vitro</em> against MDA-MB-468 triple-negative breast cancer cells and PBMCs human peripheral blood mononuclear cells, using the MTT (3-[4,5-dimethylthiazol-2-yl]-2,5 diphenyl tetrazolium bromide) assay. The aim of this research is to discover novel compounds with potential therapeutic effects on breast cancer. The antiproliferative activity of these compounds showed a significant dose-dependent activity, with IC<sub>50</sub> values ranging between 2.83 and 7.52 µM. The lead compounds <strong>8</strong> and <strong>9</strong> displayed similar IC<sub>50</sub> values to paclitaxel with 2.83, 3.84 and 2.72 µM, respectively. This highlights the novelty and potential of these compounds as alternatives to current treatments. The binding properties of <strong>8</strong>, <strong>9,</strong> and paclitaxel with the active sites of the PARP1(Poly(ADP-ribose) polymérase 1) and EGFR (Epidermal growth factor receptor) proteins were analyzed by molecular docking methods showing, for PARP1 protein, binding affinities of −9.8 Kcal /mol, −10 Kcal /mol, and −9.4 Kcal /mol, respectively. While their binding affinities for EGFR protein are −7.5 Kcal/mol, −7.2 Kcal/mol and −6.9 Kcal/mol, respectively. Moreover, drug-likeness and ADMET (Absorption–distribution–metabolism–excretion–toxicity) analyses demonstrated that both molecules are orally bioavailable and have good pharmacokinetic and non-toxic profiles. DFT (Density functional theory) was also carried out on both compounds <strong>8</strong> and <strong>9</strong> additionally to POM (Petra/Osiris/Molinspiration) studies on all compounds. The outcomes of this study suggest that compounds <strong>8</strong> and <strong>9</strong> are promising candidates for further development as therapeutic agents against triple-negative breast cancer</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108214"},"PeriodicalIF":2.6,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142270712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reliable estrogen-related prognostic signature for uterine corpus endometrial carcinoma 子宫体子宫内膜癌与雌激素相关的可靠预后特征
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-09-16 DOI: 10.1016/j.compbiolchem.2024.108216
Mojuan Li , Shuai Wang , Hao Huang , Li Li

Background

Uterine corpus endometrial carcinoma (UCEC) is a predominant gynecological malignancy worldwide. Overdosed estrogen exposure has been widely known as a crucial risk factor for UCEC patients. The purpose of this work is to explore crucial estrogen-related genes (ERGs) in UCEC.

Methods

UCEC scRNA-seq data, bulk RNA data, and ERGs were obtained from GEO, TCGA, and Molecular Signature Database, respectively. Differential expression analysis and cross analysis determined the candidate genes, and optimal genes in risk score were obtained after univariate Cox regression analysis, LASSO Cox regression analysis, and multivariate Cox regression analysis. The functional information was revealed by GO, KEGG, and GSVA enrichment analyses. CCK8 assay was used to detect the drug sensitivity.

Results

After cross analysis of the differentially expressed genes and the 8734 ERGs, 86 differentially expressed ERGs were identified in UCEC, which were significantly enriched in some immune related pathways and microbiota related pathways. Of them, the most optimal 8 ERGs were obtained to build prognostic risk score, including GAL, PHGDH, SLC7A2, HNMT, CLU, AREG, MACC1, and HMGA1. The risk score could reliably predict patient prognosis, and high-risk patients had worse prognosis. Higher HMGA1 gene expression exhibited higher sensitivity to Osimertinib.

Conclusions

Predictive risk score based on 8 ERGs exhibited excellent prognostic value in UCEC patients, and high-risk patients had inferior survival. UCEC patients with distinct prognoses showed different tumor immune microenvironment.
背景子宫内膜癌(UCEC)是全球最主要的妇科恶性肿瘤。众所周知,过量雌激素暴露是 UCEC 患者的一个重要风险因素。方法UCEC的scRNA-seq数据、大量RNA数据和ERGs分别来自GEO、TCGA和分子特征数据库。通过差异表达分析和交叉分析确定候选基因,并经过单变量 Cox 回归分析、LASSO Cox 回归分析和多变量 Cox 回归分析得出风险评分的最佳基因。GO、KEGG和GSVA富集分析揭示了功能信息。结果对差异表达基因和8734个ERGs进行交叉分析后,在UCEC中发现了86个差异表达的ERGs,它们在一些免疫相关通路和微生物群相关通路中显著富集。其中,GAL、PHGDH、SLC7A2、HNMT、CLU、AREG、MACC1和HMGA1等8个ERG被用于建立预后风险评分。该风险评分能可靠地预测患者的预后,高风险患者的预后较差。结论基于 8 种 ERGs 的预测性风险评分对 UCEC 患者的预后有很好的预测价值,高危患者的生存率较低。不同预后的UCEC患者表现出不同的肿瘤免疫微环境。
{"title":"Reliable estrogen-related prognostic signature for uterine corpus endometrial carcinoma","authors":"Mojuan Li ,&nbsp;Shuai Wang ,&nbsp;Hao Huang ,&nbsp;Li Li","doi":"10.1016/j.compbiolchem.2024.108216","DOIUrl":"10.1016/j.compbiolchem.2024.108216","url":null,"abstract":"<div><h3>Background</h3><div>Uterine corpus endometrial carcinoma (UCEC) is a predominant gynecological malignancy worldwide. Overdosed estrogen exposure has been widely known as a crucial risk factor for UCEC patients. The purpose of this work is to explore crucial estrogen-related genes (ERGs) in UCEC.</div></div><div><h3>Methods</h3><div>UCEC scRNA-seq data, bulk RNA data, and ERGs were obtained from GEO, TCGA, and Molecular Signature Database, respectively. Differential expression analysis and cross analysis determined the candidate genes, and optimal genes in risk score were obtained after univariate Cox regression analysis, LASSO Cox regression analysis, and multivariate Cox regression analysis. The functional information was revealed by GO, KEGG, and GSVA enrichment analyses. CCK8 assay was used to detect the drug sensitivity.</div></div><div><h3>Results</h3><div>After cross analysis of the differentially expressed genes and the 8734 ERGs, 86 differentially expressed ERGs were identified in UCEC, which were significantly enriched in some immune related pathways and microbiota related pathways. Of them, the most optimal 8 ERGs were obtained to build prognostic risk score, including GAL, PHGDH, SLC7A2, HNMT, CLU, AREG, MACC1, and HMGA1. The risk score could reliably predict patient prognosis, and high-risk patients had worse prognosis. Higher HMGA1 gene expression exhibited higher sensitivity to Osimertinib.</div></div><div><h3>Conclusions</h3><div>Predictive risk score based on 8 ERGs exhibited excellent prognostic value in UCEC patients, and high-risk patients had inferior survival. UCEC patients with distinct prognoses showed different tumor immune microenvironment.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108216"},"PeriodicalIF":2.6,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142318449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ILYCROsite: Identification of lysine crotonylation sites based on FCM-GRNN undersampling technique ILYCROsite:基于 FCM-GRNN 欠采样技术的赖氨酸巴豆酰化位点鉴定
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-09-13 DOI: 10.1016/j.compbiolchem.2024.108212
Yun Zuo , Minquan Wan , Yang Shen , Xinheng Wang , Wenying He , Yue Bi , Xiangrong Liu , Zhaohong Deng

Protein lysine crotonylation is an important post-translational modification that regulates various cellular activities. For example, histone crotonylation affects chromatin structure and promotes histone replacement. Identification and understanding of lysine crotonylation sites is crucial in the field of protein research. However, due to the increasing amount of non-histone crotonylation sites, existing classifiers based on traditional machine learning may encounter performance limitations. In order to address this problem, a novel deep learning-based model for identifying crotonylation sites is presented in this study, given the unique advantages of deep learning techniques for sequence data analysis. In this study, an MLP-Attention-based model was developed for the identification of crotonylation sites. Firstly, three feature extraction strategies, namely Amino Acid Composition, K-mer, and Distance-based residue features extraction strategy, were used to encode crotonylated and non-crotonylated sequences. Then, in order to balance the training dataset, the FCM-GRNN undersampling algorithm combining fuzzy clustering and generalized neural network approaches was introduced. Finally, to improve the effectiveness of crotonylation site identification, we explored various classification algorithms, and based on the relevant experimental performance comparisons, the multilayer perceptron (MLP) combined with the superimposed self-attention mechanism was finally selected to construct the prediction model ILYCROsite. The results obtained from independent testing and five-fold cross-validation demonstrated that the model proposed in this study, ILYCROsite, had excellent performance. Notably, on the independent test set, ILYCROsite achieves an AUC value of 87.93 %, which is significantly better than the existing state-of-the-art models. In addition, SHAP (Shapley Additive exPlanations) values were used to analyze the importance of features and their impact on model predictions. Meanwhile, in order to facilitate researchers to use the prediction model constructed in this study, we developed a prediction program to identify the crotonylation sites in a given protein sequence. The data and code for this program are available at: https://github.com/wmqskr/ILYCROsite.

蛋白质赖氨酸巴豆酰化是一种重要的翻译后修饰,可调节各种细胞活动。例如,组蛋白巴豆酰化会影响染色质结构并促进组蛋白替换。鉴定和了解赖氨酸巴豆酰化位点在蛋白质研究领域至关重要。然而,由于非组蛋白巴豆酰化位点的数量不断增加,现有的基于传统机器学习的分类器可能会遇到性能限制。鉴于深度学习技术在序列数据分析方面的独特优势,本研究提出了一种基于深度学习的新型巴豆酰化位点识别模型,以解决这一问题。本研究开发了一种基于 MLP-Attention 的模型来识别巴豆酰化位点。首先,使用三种特征提取策略,即氨基酸组成、K-mer和基于距离的残基特征提取策略,对巴豆化和非巴豆化序列进行编码。然后,为了平衡训练数据集,引入了结合模糊聚类和广义神经网络方法的 FCM-GRNN 欠采样算法。最后,为了提高巴豆酰化位点识别的有效性,我们探索了多种分类算法,并在相关实验性能比较的基础上,最终选择了多层感知器(MLP)结合叠加自注意机制来构建预测模型 ILYCROsite。独立测试和五倍交叉验证的结果表明,本研究提出的模型 ILYCROsite 具有优异的性能。值得注意的是,在独立测试集上,ILYCROsite 的 AUC 值达到了 87.93 %,明显优于现有的先进模型。此外,SHAP(Shapley Additive exPlanations)值用于分析特征的重要性及其对模型预测的影响。同时,为了方便研究人员使用本研究构建的预测模型,我们开发了一个预测程序来识别给定蛋白质序列中的巴豆酰化位点。该程序的数据和代码见:https://github.com/wmqskr/ILYCROsite。
{"title":"ILYCROsite: Identification of lysine crotonylation sites based on FCM-GRNN undersampling technique","authors":"Yun Zuo ,&nbsp;Minquan Wan ,&nbsp;Yang Shen ,&nbsp;Xinheng Wang ,&nbsp;Wenying He ,&nbsp;Yue Bi ,&nbsp;Xiangrong Liu ,&nbsp;Zhaohong Deng","doi":"10.1016/j.compbiolchem.2024.108212","DOIUrl":"10.1016/j.compbiolchem.2024.108212","url":null,"abstract":"<div><p>Protein lysine crotonylation is an important post-translational modification that regulates various cellular activities. For example, histone crotonylation affects chromatin structure and promotes histone replacement. Identification and understanding of lysine crotonylation sites is crucial in the field of protein research. However, due to the increasing amount of non-histone crotonylation sites, existing classifiers based on traditional machine learning may encounter performance limitations. In order to address this problem, a novel deep learning-based model for identifying crotonylation sites is presented in this study, given the unique advantages of deep learning techniques for sequence data analysis. In this study, an MLP-Attention-based model was developed for the identification of crotonylation sites. Firstly, three feature extraction strategies, namely Amino Acid Composition, K-mer, and Distance-based residue features extraction strategy, were used to encode crotonylated and non-crotonylated sequences. Then, in order to balance the training dataset, the FCM-GRNN undersampling algorithm combining fuzzy clustering and generalized neural network approaches was introduced. Finally, to improve the effectiveness of crotonylation site identification, we explored various classification algorithms, and based on the relevant experimental performance comparisons, the multilayer perceptron (MLP) combined with the superimposed self-attention mechanism was finally selected to construct the prediction model ILYCROsite. The results obtained from independent testing and five-fold cross-validation demonstrated that the model proposed in this study, ILYCROsite, had excellent performance. Notably, on the independent test set, ILYCROsite achieves an AUC value of 87.93 %, which is significantly better than the existing state-of-the-art models. In addition, SHAP (Shapley Additive exPlanations) values were used to analyze the importance of features and their impact on model predictions. Meanwhile, in order to facilitate researchers to use the prediction model constructed in this study, we developed a prediction program to identify the crotonylation sites in a given protein sequence. The data and code for this program are available at: <span><span>https://github.com/wmqskr/ILYCROsite</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108212"},"PeriodicalIF":2.6,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142230084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive bioinformatic analysis of the role of TGF-β1-stimulated activating transcription factor 3 by non-coding RNAs during breast cancer progression 对非编码 RNA 在乳腺癌进展过程中对 TGF-β1 刺激的活化转录因子 3 的作用进行全面的生物信息学分析
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-09-12 DOI: 10.1016/j.compbiolchem.2024.108208
Iyyappan Saranya, Dilipkumar Preetha, Sasi Nivruthi, Nagarajan Selvamurugan

A potent growth inhibitor for normal mammary epithelial cells is transforming growth factor beta 1 (TGF-β1). When breast tissues lose the anti-proliferative activity of this factor, invasion and bone metastases increase. Human breast cancer (hBC) cells express more activating transcription factor 3 (ATF3) when exposed to TGF-β1, and this transcription factor is essential for BC development and bone metastases. Non-coding RNAs (ncRNAs), including circular RNAs (circRNAs) and microRNAs (miRNAs), have emerged as key regulators controlling several cellular processes. In hBC cells, TGF-β1 stimulated the expression of hsa-miR-4653–5p that putatively targets ATF3. Bioinformatics analysis predicted that hsa-miR-4653–5p targets several key signaling components and transcription factors, including NFKB1, STAT1, STAT3, NOTCH1, JUN, TCF3, p300, NRF2, SUMO2, and NANOG, suggesting the diversified role of hsa-miR-4653–5p under physiological and pathological conditions. Despite the high abundance of hsa-miR-4653–5p in hBC cells, the ATF3 level remained elevated, indicating other ncRNAs could inhibit hsa-miR-4653–5p’s activity. In silico analysis identified several circRNAs having the binding sites for hsa-miR-4653–5p, indicating the sponging activity of circRNAs towards hsa-miR-4653–5p. The study's findings suggest that TGF-β1 regulates circRNAs and hsa-miR-4653–5p, which in turn affects ATF3 expression, thus influencing BC progression and bone metastasis. Therefore, focusing on the TGF-β1/circRNAs/hsa-miR-4653–5p/ATF3 network could lead to new ways of diagnosing and treating BC.

转化生长因子β1(TGF-β1)是正常乳腺上皮细胞的一种强效生长抑制因子。当乳腺组织失去这种因子的抗增殖活性时,侵袭和骨转移就会增加。人类乳腺癌(hBC)细胞暴露于 TGF-β1 时会表达更多的活化转录因子 3(ATF3),这种转录因子对 BC 的发展和骨转移至关重要。非编码 RNA(ncRNA),包括环状 RNA(circRNA)和微 RNA(miRNA),已成为控制多种细胞过程的关键调控因子。在 hBC 细胞中,TGF-β1 刺激了 hsa-miR-4653-5p 的表达,而 hsa-miR-4653-5p 可能是 ATF3 的靶标。生物信息学分析预测,hsa-miR-4653-5p靶向多个关键信号转导元件和转录因子,包括NFKB1、STAT1、STAT3、NOTCH1、JUN、TCF3、p300、NRF2、SUMO2和NANOG,这表明hsa-miR-4653-5p在生理和病理条件下发挥着多样化的作用。尽管hsa-miR-4653-5p在hBC细胞中含量很高,但ATF3水平仍然升高,这表明其他ncRNA可以抑制hsa-miR-4653-5p的活性。硅学分析发现了几个与hsa-miR-4653-5p有结合位点的circRNA,这表明circRNA对hsa-miR-4653-5p具有海绵活性。研究结果表明,TGF-β1可调控circRNAs和hsa-miR-4653-5p,进而影响ATF3的表达,从而影响BC的进展和骨转移。因此,关注TGF-β1/circRNAs/hsa-miR-4653-5p/ATF3网络可为诊断和治疗BC提供新方法。
{"title":"A comprehensive bioinformatic analysis of the role of TGF-β1-stimulated activating transcription factor 3 by non-coding RNAs during breast cancer progression","authors":"Iyyappan Saranya,&nbsp;Dilipkumar Preetha,&nbsp;Sasi Nivruthi,&nbsp;Nagarajan Selvamurugan","doi":"10.1016/j.compbiolchem.2024.108208","DOIUrl":"10.1016/j.compbiolchem.2024.108208","url":null,"abstract":"<div><p>A potent growth inhibitor for normal mammary epithelial cells is transforming growth factor beta 1 (TGF-β1). When breast tissues lose the anti-proliferative activity of this factor, invasion and bone metastases increase. Human breast cancer (hBC) cells express more activating transcription factor 3 (ATF3) when exposed to TGF-β1, and this transcription factor is essential for BC development and bone metastases. Non-coding RNAs (ncRNAs), including circular RNAs (circRNAs) and microRNAs (miRNAs), have emerged as key regulators controlling several cellular processes. In hBC cells, TGF-β1 stimulated the expression of hsa-miR-4653–5p that putatively targets ATF3. Bioinformatics analysis predicted that hsa-miR-4653–5p targets several key signaling components and transcription factors, including NFKB1, STAT1, STAT3, NOTCH1, JUN, TCF3, p300, NRF2, SUMO2, and NANOG, suggesting the diversified role of hsa-miR-4653–5p under physiological and pathological conditions. Despite the high abundance of hsa-miR-4653–5p in hBC cells, the ATF3 level remained elevated, indicating other ncRNAs could inhibit hsa-miR-4653–5p’s activity. <em>In silico</em> analysis identified several circRNAs having the binding sites for hsa-miR-4653–5p, indicating the sponging activity of circRNAs towards hsa-miR-4653–5p. The study's findings suggest that TGF-β1 regulates circRNAs and hsa-miR-4653–5p, which in turn affects ATF3 expression, thus influencing BC progression and bone metastasis. Therefore, focusing on the TGF-β1/circRNAs/hsa-miR-4653–5p/ATF3 network could lead to new ways of diagnosing and treating BC.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108208"},"PeriodicalIF":2.6,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142230082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unveiling therapeutic biomarkers and druggable targets in ALS: An integrative microarray analysis, molecular docking, and structural dynamic studies 揭示 ALS 的治疗生物标志物和药物靶点:微阵列分析、分子对接和结构动力学综合研究
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-09-12 DOI: 10.1016/j.compbiolchem.2024.108211
Deboral Eshak, Mohanapriya Arumugam

Amyotrophic lateral sclerosis (ALS), commonly known as Lou Gehrig's disease, is a debilitating neurodegenerative disorder characterized by the progressive degeneration of nerve cells in the brain and spinal cord. Despite extensive research, its precise etiology remains elusive, and early diagnosis is challenging due to the absence of specific tests. This study aimed to identify potential blood-based biomarkers for early ALS detection and monitoring using datasets from whole blood samples (GSE112680) and oligodendrocytes, astrocytes, and fibroblasts (GSE87385) obtained from the NCBI-GEO repository. Through bioinformatics analysis, including protein-protein interactions and molecular pathway analyses, we identified differentially expressed genes (DEGs) associated with ALS. Notably, ALS2, ADH7, ALDH8A1, ALDH3B1, ABHD2, ABHD17B, ABHD12, ABHD13, PGAM2, AURKB, ANAPC11, VAPA, UNC45B, and TNNT2 emerged as top-ranked DEGs, implicated in drug metabolism, protein depalmytilation, and the AKT/mTOR signaling pathways. Among these, AurKB established as a potential therapeutic biomarker with relevance to various neurological conditions. Consequently, AurKB was selected for identifying potential therapeutic molecules and utilized for in silico structural characterization studies. Exploration of the IMPATT database led to the discovery of a lead compound similar to Fostamatinib, currently used for AurKB. Initial molecular docking and MMGBSA-based binding energy analysis were followed by molecular dynamics simulation (MDS) and free energy landscape (FEL) analysis to validate the ligand's binding efficacy and understand dynamic processes within the biological system. The identified potential biomarkers and lead molecule provide novel insights into the correlation between blood cell transcripts and ALS pathology, paving the way for blood-based diagnostic tools for early ALS detection and ongoing disease monitoring.

肌萎缩性脊髓侧索硬化症(ALS)俗称卢伽雷氏病,是一种使人衰弱的神经退行性疾病,其特征是大脑和脊髓中的神经细胞逐渐退化。尽管进行了广泛的研究,但其确切的病因仍然难以捉摸,而且由于缺乏特定的检测方法,早期诊断也很困难。本研究旨在利用从 NCBI-GEO 数据库中获得的全血样本(GSE112680)和少突胶质细胞、星形胶质细胞和成纤维细胞(GSE87385)数据集,鉴定用于早期 ALS 检测和监测的潜在血液生物标志物。通过生物信息学分析,包括蛋白-蛋白相互作用和分子通路分析,我们确定了与 ALS 相关的差异表达基因(DEGs)。值得注意的是,ALS2、ADH7、ALDH8A1、ALDH3B1、ABHD2、ABHD17B、ABHD12、ABHD13、PGAM2、AURKB、ANAPC11、VAPA、UNC45B 和 TNNT2 成为排名靠前的 DEGs,它们与药物代谢、蛋白质脱钙和 AKT/mTOR 信号通路有关。其中,AurKB 是一个潜在的治疗生物标志物,与各种神经疾病相关。因此,AurKB 被选中用于鉴定潜在的治疗分子,并被用于硅结构特征研究。通过探索 IMPATT 数据库,发现了一种与目前用于 AurKB 的 Fostamatinib 相似的先导化合物。初步的分子对接和基于 MMGBSA 的结合能分析之后,进行了分子动力学模拟(MDS)和自由能谱(FEL)分析,以验证配体的结合效能并了解生物系统内的动态过程。已确定的潜在生物标志物和先导分子为了解血细胞转录本与 ALS 病理学之间的相关性提供了新的视角,为基于血液的诊断工具铺平了道路,从而可用于 ALS 的早期检测和持续疾病监测。
{"title":"Unveiling therapeutic biomarkers and druggable targets in ALS: An integrative microarray analysis, molecular docking, and structural dynamic studies","authors":"Deboral Eshak,&nbsp;Mohanapriya Arumugam","doi":"10.1016/j.compbiolchem.2024.108211","DOIUrl":"10.1016/j.compbiolchem.2024.108211","url":null,"abstract":"<div><p>Amyotrophic lateral sclerosis (ALS), commonly known as Lou Gehrig's disease, is a debilitating neurodegenerative disorder characterized by the progressive degeneration of nerve cells in the brain and spinal cord. Despite extensive research, its precise etiology remains elusive, and early diagnosis is challenging due to the absence of specific tests. This study aimed to identify potential blood-based biomarkers for early ALS detection and monitoring using datasets from whole blood samples (GSE112680) and oligodendrocytes, astrocytes, and fibroblasts (GSE87385) obtained from the NCBI-GEO repository. Through bioinformatics analysis, including protein-protein interactions and molecular pathway analyses, we identified differentially expressed genes (DEGs) associated with ALS. Notably, ALS2, ADH7, ALDH8A1, ALDH3B1, ABHD2, ABHD17B, ABHD12, ABHD13, PGAM2, AURKB, ANAPC11, VAPA, UNC45B, and TNNT2 emerged as top-ranked DEGs, implicated in drug metabolism, protein depalmytilation, and the AKT/mTOR signaling pathways. Among these, AurKB established as a potential therapeutic biomarker with relevance to various neurological conditions. Consequently, AurKB was selected for identifying potential therapeutic molecules and utilized for <em>in silico</em> structural characterization studies. Exploration of the IMPATT database led to the discovery of a lead compound similar to Fostamatinib, currently used for AurKB. Initial molecular docking and MMGBSA-based binding energy analysis were followed by molecular dynamics simulation (MDS) and free energy landscape (FEL) analysis to validate the ligand's binding efficacy and understand dynamic processes within the biological system. The identified potential biomarkers and lead molecule provide novel insights into the correlation between blood cell transcripts and ALS pathology, paving the way for blood-based diagnostic tools for early ALS detection and ongoing disease monitoring.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108211"},"PeriodicalIF":2.6,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142243425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accurately identifying positive and negative regulation of apoptosis using fusion features and machine learning methods 利用融合特征和机器学习方法准确识别细胞凋亡的正向和负向调控
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-09-11 DOI: 10.1016/j.compbiolchem.2024.108207
Cheng-Yan Wu , Zhi-Xue Xu , Nan Li , Dan-Yang Qi , Zhi-Hong Hao , Hong-Ye Wu , Ru Gao , Yan-Ting Jin

Apoptotic proteins play a crucial role in the apoptosis process, ensuring a balance between cell proliferation and death. Thus, further elucidating the regulatory mechanisms of apoptosis will enhance our understanding of their functions. However, the development of computational methods to accurately identify positive and negative regulation of apoptosis remains a significant challenge. This work proposes a machine learning model based on multi-feature fusion to effectively identify the roles of positive and negative regulation of apoptosis. Initially, we constructed a reliable benchmark dataset containing 200 positive regulation of apoptosis and 241 negative regulation of apoptosis proteins. Subsequently, we developed a classifier that combines the support vector machine (SVM) with pseudo composition of k-spaced amino acid pairs (PseCKSAAP), composition transition distribution (CTD), dipeptide deviation from expected mean (DDE), and PSSM-composition to identify these proteins. Analysis of variance (ANOVA) was employed to select optimized features that could yield the maximum prediction performance. Evaluating the proposed model on independent data revealed and achieved an accuracy of 0.781 with an AUROC of 0.837, demonstrating our model's potent capabilities.

凋亡蛋白在细胞凋亡过程中发挥着至关重要的作用,确保了细胞增殖和死亡之间的平衡。因此,进一步阐明细胞凋亡的调控机制将增进我们对其功能的了解。然而,开发计算方法以准确识别细胞凋亡的正负调控仍是一项重大挑战。本研究提出了一种基于多特征融合的机器学习模型,以有效识别细胞凋亡正负调控的作用。首先,我们构建了一个可靠的基准数据集,其中包含 200 个凋亡正调控蛋白和 241 个凋亡负调控蛋白。随后,我们开发了一种分类器,将支持向量机(SVM)与k间隔氨基酸对的伪组成(PseCKSAAP)、组成转换分布(CTD)、二肽与预期平均值的偏差(DDE)和PSSM-组成相结合来识别这些蛋白质。采用方差分析(ANOVA)来选择能产生最大预测性能的优化特征。在独立数据上对所提出的模型进行评估后发现,该模型的准确率达到了 0.781,AUROC 为 0.837,这证明了我们模型的强大功能。
{"title":"Accurately identifying positive and negative regulation of apoptosis using fusion features and machine learning methods","authors":"Cheng-Yan Wu ,&nbsp;Zhi-Xue Xu ,&nbsp;Nan Li ,&nbsp;Dan-Yang Qi ,&nbsp;Zhi-Hong Hao ,&nbsp;Hong-Ye Wu ,&nbsp;Ru Gao ,&nbsp;Yan-Ting Jin","doi":"10.1016/j.compbiolchem.2024.108207","DOIUrl":"10.1016/j.compbiolchem.2024.108207","url":null,"abstract":"<div><p>Apoptotic proteins play a crucial role in the apoptosis process, ensuring a balance between cell proliferation and death. Thus, further elucidating the regulatory mechanisms of apoptosis will enhance our understanding of their functions. However, the development of computational methods to accurately identify positive and negative regulation of apoptosis remains a significant challenge. This work proposes a machine learning model based on multi-feature fusion to effectively identify the roles of positive and negative regulation of apoptosis. Initially, we constructed a reliable benchmark dataset containing 200 positive regulation of apoptosis and 241 negative regulation of apoptosis proteins. Subsequently, we developed a classifier that combines the support vector machine (SVM) with pseudo composition of <em>k</em>-spaced amino acid pairs (PseCKSAAP), composition transition distribution (CTD), dipeptide deviation from expected mean (DDE), and PSSM-composition to identify these proteins. Analysis of variance (ANOVA) was employed to select optimized features that could yield the maximum prediction performance. Evaluating the proposed model on independent data revealed and achieved an accuracy of 0.781 with an AUROC of 0.837, demonstrating our model's potent capabilities.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108207"},"PeriodicalIF":2.6,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142168672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Molecular descriptors and in silico studies of 4-((5-(decylthio)-4-methyl-4n-1,2,4-triazol-3-yl)methyl)morpholine as a potential drug for the treatment of fungal pathologies 4-((5-(癸硫基)-4-甲基-4n-1,2,4-三唑-3-基)甲基)吗啉作为治疗真菌病症的潜在药物的分子描述符和硅学研究
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-09-11 DOI: 10.1016/j.compbiolchem.2024.108206
Ohloblina Myroslava , Alireza Poustforoosh , Bushuieva Inna , Volodymyr Parchenko , Burak Tüzün , Bogdan Gutyj

The article explores the polypharmacological profiling of 4-((5-(decylthio)-4-methyl-4H-1,2,4-triazole-3-yl)methyl)morpholine as a potential antimicrobial agent. The study utilized 15148 electronic pharmacophore models of organisms, ranked by the Tversky index. Detailed analysis revealed classical bonding patterns with selected enzymes, identifying key amino acid residues involved in complex formation. Protein target prediction was conducted through various stages using the Galaxy web service, including ligand structure creation, pharmacophore alignment, and target ranking. The activities of the molecules against 1G6C, 2W6O, 3G7F, 3OWU, 4IVR, and 4TZT proteins were compared. Docking studies with PyMOL and Discovery Studio Visualizer revealed binding to thymidine kinase, thiamine phosphate synthase, and biotin carboxylase with promising binding affinities. These interactions suggest potential antibacterial and antiviral effects, warranting further virtual screening and in-depth studies for the development of effective antimicrobial drugs. Calculations of the molecules were made with the gaussian package program. Calculations were made on the 6-31++g** basis set at B3LYP, HF, and M062X levels with Gaussian software. Afterwards, the 0–100 ns interaction of the molecule with the highest activity was examined.

文章探讨了作为潜在抗菌剂的 4-[(5-(癸硫基)-4-甲基-4H-1,2,4-三唑-3-基)甲基]吗啉的多药理学特征。这项研究利用了 15148 个生物电子药理模型,并按照 Tversky 指数进行了排序。详细分析揭示了与选定酶的经典结合模式,确定了参与复合物形成的关键氨基酸残基。蛋白质靶标预测是通过银河网络服务的各个阶段进行的,包括配体结构创建、药源比对和靶标排序。比较了分子对 1G6C、2W6O、3G7F、3OWU、4IVR 和 4TZT 蛋白的活性。利用 PyMOL 和 Discovery Studio Visualizer 进行的对接研究显示,这些分子与胸苷激酶、磷酸硫胺素合成酶和生物素羧化酶的结合亲和力良好。这些相互作用表明它们具有潜在的抗菌和抗病毒作用,值得进一步进行虚拟筛选和深入研究,以开发有效的抗菌药物。分子的计算采用高斯软件包程序。计算是在 B3LYP、HF 和 M062X 水平的 6-31++g** 基础集上用高斯软件进行的。随后,研究了活性最高的分子的 0-100 ns 相互作用。
{"title":"Molecular descriptors and in silico studies of 4-((5-(decylthio)-4-methyl-4n-1,2,4-triazol-3-yl)methyl)morpholine as a potential drug for the treatment of fungal pathologies","authors":"Ohloblina Myroslava ,&nbsp;Alireza Poustforoosh ,&nbsp;Bushuieva Inna ,&nbsp;Volodymyr Parchenko ,&nbsp;Burak Tüzün ,&nbsp;Bogdan Gutyj","doi":"10.1016/j.compbiolchem.2024.108206","DOIUrl":"10.1016/j.compbiolchem.2024.108206","url":null,"abstract":"<div><p>The article explores the polypharmacological profiling of 4-((5-(decylthio)-4-methyl-4H-1,2,4-triazole-3-yl)methyl)morpholine as a potential antimicrobial agent. The study utilized 15148 electronic pharmacophore models of organisms, ranked by the Tversky index. Detailed analysis revealed classical bonding patterns with selected enzymes, identifying key amino acid residues involved in complex formation. Protein target prediction was conducted through various stages using the Galaxy web service, including ligand structure creation, pharmacophore alignment, and target ranking. The activities of the molecules against 1G6C, 2W6O, 3G7F, 3OWU, 4IVR, and 4TZT proteins were compared. Docking studies with PyMOL and Discovery Studio Visualizer revealed binding to thymidine kinase, thiamine phosphate synthase, and biotin carboxylase with promising binding affinities. These interactions suggest potential antibacterial and antiviral effects, warranting further virtual screening and in-depth studies for the development of effective antimicrobial drugs. Calculations of the molecules were made with the gaussian package program. Calculations were made on the 6-31++g** basis set at B3LYP, HF, and M062X levels with Gaussian software. Afterwards, the 0–100 ns interaction of the molecule with the highest activity was examined.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108206"},"PeriodicalIF":2.6,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142168673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An open source in silico workflow to assist in the design of fusion proteins 协助设计融合蛋白的开源硅学工作流程
IF 2.6 4区 生物学 Q2 BIOLOGY Pub Date : 2024-09-10 DOI: 10.1016/j.compbiolchem.2024.108209
C.J. Lalaurie , C. Zhang , S.M. Liu , K.A. Bunting , P.A. Dalby

Fusion proteins have the potential to become the new norm for targeted therapeutic treatments. Highly specific payload delivery can be achieved by combining custom targeting moieties, such as VHH domains, with active parts of proteins that have a particular activity not naturally targeted to the intended cells. Conversely, novel drug products may make use of the highly specific targeting properties of naturally occurring proteins and combine them with custom payloads. When designing such a product, there is rarely a known structure for the final construct which makes it difficult to assess molecular behaviour that may ultimately impact therapeutic outcome. Considering the time and cost of expressing a construct, optimising the purification procedure, obtaining sufficient quantities for biophysical characterisation, and performing structural studies in vitro, there is an enormous benefit to conduct in silico studies ahead of wet lab work.

By following a repeatable, streamlined, and fast workflow of molecular dynamics assessment, it is possible to eliminate low-performing candidates from costly experimental work. There are, however, many aspects to consider when designing a novel fusion protein and it is crucial not to overlook some elements. In this work, we suggest a set of user-friendly, open-source methods which can be used to screen fusion protein candidates from the sequence alone. We used the light chain and translocation domain of botulinum toxin A (BoNT/A) fused with a selected VHH domain, termed here LC-HN-VHH, as a case study for a general approach to designing, modelling, and simulating fusion proteins. Its behaviour in silico correlated well with initial in vitro work, with SEC HPLC showing multiple protein states in solution and a dynamic protein shifting between these states over time without loss of material.

融合蛋白有可能成为靶向治疗的新标准。通过将定制的靶向分子(如 VHH 结构域)与蛋白质的活性部分相结合,可以实现高特异性的有效载荷传递,而这些蛋白质具有的特殊活性并非天然针对目标细胞。相反,新型药物产品可以利用天然蛋白质的高度特异性靶向特性,并将其与定制的有效载荷相结合。在设计此类产品时,最终构建物很少有已知的结构,因此很难评估可能最终影响治疗效果的分子行为。考虑到表达构建体、优化纯化程序、获得足够数量的生物物理特征以及在体外进行结构研究所需的时间和成本,在湿实验室工作之前进行硅学研究大有裨益。然而,在设计新型融合蛋白时需要考虑很多方面,因此不忽略某些因素至关重要。在这项工作中,我们提出了一套用户友好的开源方法,可用于仅从序列筛选候选融合蛋白。我们将肉毒杆菌毒素 A(BoNT/A)的轻链和易位结构域与一个选定的 VHH 结构域融合(在此称为 LC-HN-VHH),作为设计、建模和模拟融合蛋白一般方法的案例研究。它在硅学中的表现与最初的体外研究结果有很好的相关性,SEC HPLC 显示溶液中有多种蛋白质状态,而且随着时间的推移,蛋白质会在这些状态之间发生动态变化,而不会丢失物质。
{"title":"An open source in silico workflow to assist in the design of fusion proteins","authors":"C.J. Lalaurie ,&nbsp;C. Zhang ,&nbsp;S.M. Liu ,&nbsp;K.A. Bunting ,&nbsp;P.A. Dalby","doi":"10.1016/j.compbiolchem.2024.108209","DOIUrl":"10.1016/j.compbiolchem.2024.108209","url":null,"abstract":"<div><p>Fusion proteins have the potential to become the new norm for targeted therapeutic treatments. Highly specific payload delivery can be achieved by combining custom targeting moieties, such as V<sub>HH</sub> domains, with active parts of proteins that have a particular activity not naturally targeted to the intended cells. Conversely, novel drug products may make use of the highly specific targeting properties of naturally occurring proteins and combine them with custom payloads. When designing such a product, there is rarely a known structure for the final construct which makes it difficult to assess molecular behaviour that may ultimately impact therapeutic outcome. Considering the time and cost of expressing a construct, optimising the purification procedure, obtaining sufficient quantities for biophysical characterisation, and performing structural studies <em>in vitro</em>, there is an enormous benefit to conduct <em>in silico</em> studies ahead of wet lab work.</p><p>By following a repeatable, streamlined, and fast workflow of molecular dynamics assessment, it is possible to eliminate low-performing candidates from costly experimental work. There are, however, many aspects to consider when designing a novel fusion protein and it is crucial not to overlook some elements. In this work, we suggest a set of user-friendly, open-source methods which can be used to screen fusion protein candidates from the sequence alone. We used the light chain and translocation domain of botulinum toxin A (BoNT/A) fused with a selected V<sub>HH</sub> domain, termed here LC-H<sub>N</sub>-V<sub>HH</sub>, as a case study for a general approach to designing, modelling, and simulating fusion proteins. Its behaviour <em>in silico</em> correlated well with initial <em>in vitro</em> work, with SEC HPLC showing multiple protein states in solution and a dynamic protein shifting between these states over time without loss of material.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108209"},"PeriodicalIF":2.6,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S147692712400197X/pdfft?md5=6a2955eabc805b598902561119497014&pid=1-s2.0-S147692712400197X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142164680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computational Biology and Chemistry
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1