Si-Rui Xiao, Yao-Kun Zhang, Kai-Yu Liu, Yu-Xiang Huang, Rong Liu
{"title":"PNBACE: an ensemble algorithm to predict the effects of mutations on protein-nucleic acid binding affinity","authors":"Si-Rui Xiao, Yao-Kun Zhang, Kai-Yu Liu, Yu-Xiang Huang, Rong Liu","doi":"10.1186/s12915-024-02006-9","DOIUrl":null,"url":null,"abstract":"Mutations occurring in nucleic acids or proteins may affect the binding affinities of protein-nucleic acid interactions. Although many efforts have been devoted to the impact of protein mutations, few computational studies have addressed the effect of nucleic acid mutations and explored whether the identical methodology could be applied to the prediction of binding affinity changes caused by these two mutation types. Here, we developed a generalized algorithm named PNBACE for both DNA and protein mutations. We first demonstrated that DNA mutations could induce varying degrees of changes in binding affinity from multiple perspectives. We then designed a group of energy-based topological features based on different energy networks, which were combined with our previous partition-based energy features to construct individual prediction models through feature selections. Furthermore, we created an ensemble model by integrating the outputs of individual models using a differential evolution algorithm. In addition to predicting the impact of single-point mutations, PNBACE could predict the influence of multiple-point mutations and identify mutations significantly reducing binding affinities. Extensive comparisons indicated that PNBACE largely performed better than existing methods on both regression and classification tasks. PNBACE is an effective method for estimating the binding affinity changes of protein-nucleic acid complexes induced by DNA or protein mutations, therefore improving our understanding of the interactions between proteins and DNA/RNA.","PeriodicalId":9339,"journal":{"name":"BMC Biology","volume":null,"pages":null},"PeriodicalIF":4.4000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12915-024-02006-9","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Mutations occurring in nucleic acids or proteins may affect the binding affinities of protein-nucleic acid interactions. Although many efforts have been devoted to the impact of protein mutations, few computational studies have addressed the effect of nucleic acid mutations and explored whether the identical methodology could be applied to the prediction of binding affinity changes caused by these two mutation types. Here, we developed a generalized algorithm named PNBACE for both DNA and protein mutations. We first demonstrated that DNA mutations could induce varying degrees of changes in binding affinity from multiple perspectives. We then designed a group of energy-based topological features based on different energy networks, which were combined with our previous partition-based energy features to construct individual prediction models through feature selections. Furthermore, we created an ensemble model by integrating the outputs of individual models using a differential evolution algorithm. In addition to predicting the impact of single-point mutations, PNBACE could predict the influence of multiple-point mutations and identify mutations significantly reducing binding affinities. Extensive comparisons indicated that PNBACE largely performed better than existing methods on both regression and classification tasks. PNBACE is an effective method for estimating the binding affinity changes of protein-nucleic acid complexes induced by DNA or protein mutations, therefore improving our understanding of the interactions between proteins and DNA/RNA.
核酸或蛋白质中发生的突变可能会影响蛋白质-核酸相互作用的结合亲和力。尽管很多人致力于研究蛋白质突变的影响,但很少有计算研究涉及核酸突变的影响,并探讨是否可以将相同的方法应用于预测这两种突变类型引起的结合亲和力变化。在此,我们开发了一种名为 PNBACE 的通用算法,同时适用于 DNA 和蛋白质突变。我们首先从多个角度证明了 DNA 突变会引起不同程度的结合亲和力变化。然后,我们根据不同的能量网络设计了一组基于能量的拓扑特征,并将其与之前基于分区的能量特征相结合,通过特征选择构建了个体预测模型。此外,我们还利用差分进化算法整合了各个模型的输出结果,从而创建了一个集合模型。除了预测单点突变的影响外,PNBACE 还能预测多点突变的影响,并识别显著降低结合亲和力的突变。广泛的比较表明,在回归和分类任务中,PNBACE 的表现在很大程度上优于现有方法。PNBACE 是一种估算 DNA 或蛋白质突变引起的蛋白质-核酸复合物结合亲和力变化的有效方法,从而提高了我们对蛋白质与 DNA/RNA 之间相互作用的理解。
期刊介绍:
BMC Biology is a broad scope journal covering all areas of biology. Our content includes research articles, new methods and tools. BMC Biology also publishes reviews, Q&A, and commentaries.