首页 > 最新文献

Proteins-Structure Function and Bioinformatics最新文献

英文 中文
Cryo-EM Analysis in CASP16. CASP16的低温电镜分析。
IF 2.8 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-01 Epub Date: 2025-12-11 DOI: 10.1002/prot.70099
Thomas Mulvaney, Andriy Kryshtafovych, Maya Topf

Since CASP13, experimentalists have been encouraged to provide their cryo-EM data along with the derived atomic models to the CASP organizers to aid assessment. In CASP16, 38 cryo-EM datasets were provided for assessment, which represented most cryo-EM targets. The corresponding targets typically comprised a single derived atomic structure; however, that model may be only one of several valid conformations. Flexibility often manifests as low-resolution regions in a cryo-EM reconstruction, particularly in RNA but often also in protein complexes. We show that local resolution in the reconstruction correlates well with the root-mean-square fluctuations (RMSF) of residues of accurate CASP predictions. The correlation between the local resolution and pLDDT was less clear, especially when mobile domains were present. When the resolution allowed, assessment of features such as sidechains, using our variant of SMOC with local fragment alignment, indicated that even high-quality predictions have room for improvement; on the other hand, some predictions fitted the density better in specific regions, indicating modeling discrepancies in the target. In one extreme case, a submitted target had regions of low-resolution that limited unambiguous model building. In such cases, part of the target structure is essentially a prediction itself, with implications for the assessment. Experimental data remain essential for model-free assessment of predictions and offer unique analyses such as comparisons to local resolution and thus flexibility.

自CASP13以来,实验者被鼓励向CASP组织者提供他们的低温电镜数据以及衍生的原子模型,以帮助评估。在CASP16中,提供了38个冷冻电镜数据集用于评估,这些数据集代表了大多数冷冻电镜目标。相应的靶通常包括单一派生的原子结构;然而,该模型可能只是几种有效构造中的一种。在低温电镜重建中,柔韧性通常表现为低分辨率区域,特别是在RNA中,但通常也在蛋白质复合体中。我们表明,重建中的局部分辨率与准确的CASP预测残差的均方根波动(RMSF)有很好的相关性。局部分辨率和pLDDT之间的相关性不太清楚,特别是当存在移动域时。在分辨率允许的情况下,使用我们的带有局部片段比对的SMOC变体对侧链等特征进行评估,表明即使是高质量的预测也有改进的空间;另一方面,一些预测在特定区域更适合密度,这表明建模目标存在差异。在一种极端情况下,提交的目标具有低分辨率区域,限制了明确的模型构建。在这种情况下,目标结构的一部分本质上是预测本身,对评估有影响。实验数据对于无模型的预测评估仍然至关重要,并提供独特的分析,例如与当地分辨率的比较,从而提供灵活性。
{"title":"Cryo-EM Analysis in CASP16.","authors":"Thomas Mulvaney, Andriy Kryshtafovych, Maya Topf","doi":"10.1002/prot.70099","DOIUrl":"10.1002/prot.70099","url":null,"abstract":"<p><p>Since CASP13, experimentalists have been encouraged to provide their cryo-EM data along with the derived atomic models to the CASP organizers to aid assessment. In CASP16, 38 cryo-EM datasets were provided for assessment, which represented most cryo-EM targets. The corresponding targets typically comprised a single derived atomic structure; however, that model may be only one of several valid conformations. Flexibility often manifests as low-resolution regions in a cryo-EM reconstruction, particularly in RNA but often also in protein complexes. We show that local resolution in the reconstruction correlates well with the root-mean-square fluctuations (RMSF) of residues of accurate CASP predictions. The correlation between the local resolution and pLDDT was less clear, especially when mobile domains were present. When the resolution allowed, assessment of features such as sidechains, using our variant of SMOC with local fragment alignment, indicated that even high-quality predictions have room for improvement; on the other hand, some predictions fitted the density better in specific regions, indicating modeling discrepancies in the target. In one extreme case, a submitted target had regions of low-resolution that limited unambiguous model building. In such cases, part of the target structure is essentially a prediction itself, with implications for the assessment. Experimental data remain essential for model-free assessment of predictions and offer unique analyses such as comparisons to local resolution and thus flexibility.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"447-459"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750033/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Highlights of Model Quality Assessment in CASP16. CASP16中模型质量评估的重点。
IF 2.8 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-01 Epub Date: 2025-08-14 DOI: 10.1002/prot.70035
Alisia Fadini, Recep Adiyaman, Shaima N Alhaddad, Behnosh Behzadi, Jianlin Cheng, Xinyue Cui, Nicholas S Edmunds, Lydia Freddolino, Ahmet G Genc, Fang Liang, Dong Liu, Jian Liu, Quancheng Liu, Liam J McGuffin, Pawan Neupane, Chunxiang Peng, David R Shortle, Meng Sun, Haodong Wang, Qiqige Wuyun, Guijun Zhang, Xuanfeng Zhao, Wei Zheng, Randy J Read

Model quality assessment (MQA) remains a critical component of structural bioinformatics for both structure predictors and experimentalists seeking to use predictions for downstream applications. In CASP16, the Evaluation of Model Accuracy (EMA) category featured both global and local quality estimation for multimeric assemblies (QMODE1 and QMODE2), as well as a novel QMODE3 challenge-requiring predictors to identify the best five models from thousands generated by MassiveFold. This paper presents detailed results from several leading CASP16 EMA methods, highlighting the strengths and limitations of the approaches.

模型质量评估(MQA)仍然是结构生物信息学的一个重要组成部分,无论是结构预测者还是寻求将预测用于下游应用的实验者。在CASP16中,模型精度评估(EMA)类别具有多聚体组装(QMODE1和QMODE2)的全局和局部质量估计,以及新的QMODE3挑战-要求预测器从MassiveFold生成的数千个模型中识别出最佳的五个模型。本文介绍了几种领先的CASP16 EMA方法的详细结果,突出了这些方法的优势和局限性。
{"title":"Highlights of Model Quality Assessment in CASP16.","authors":"Alisia Fadini, Recep Adiyaman, Shaima N Alhaddad, Behnosh Behzadi, Jianlin Cheng, Xinyue Cui, Nicholas S Edmunds, Lydia Freddolino, Ahmet G Genc, Fang Liang, Dong Liu, Jian Liu, Quancheng Liu, Liam J McGuffin, Pawan Neupane, Chunxiang Peng, David R Shortle, Meng Sun, Haodong Wang, Qiqige Wuyun, Guijun Zhang, Xuanfeng Zhao, Wei Zheng, Randy J Read","doi":"10.1002/prot.70035","DOIUrl":"10.1002/prot.70035","url":null,"abstract":"<p><p>Model quality assessment (MQA) remains a critical component of structural bioinformatics for both structure predictors and experimentalists seeking to use predictions for downstream applications. In CASP16, the Evaluation of Model Accuracy (EMA) category featured both global and local quality estimation for multimeric assemblies (QMODE1 and QMODE2), as well as a novel QMODE3 challenge-requiring predictors to identify the best five models from thousands generated by MassiveFold. This paper presents detailed results from several leading CASP16 EMA methods, highlighting the strengths and limitations of the approaches.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"314-329"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750036/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144857136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving AlphaFold2- and AlphaFold3-Based Protein Complex Structure Prediction With MULTICOM4 in CASP16. 利用MULTICOM4改进CASP16中基于AlphaFold2和alphafold3的蛋白复合体结构预测。
IF 2.8 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-01 Epub Date: 2025-06-02 DOI: 10.1002/prot.26850
Jian Liu, Pawan Neupane, Jianlin Cheng

With AlphaFold achieving high-accuracy tertiary structure prediction for most single-chain proteins (monomers), the next major challenge in protein structure prediction is to accurately model multichain protein complexes (multimers). We developed MULTICOM4, the latest version of the MULTICOM system, to improve protein complex structure prediction by integrating transformer-based AlphaFold2, diffusion model-based AlphaFold3, and our in-house techniques. These include protein complex stoichiometry prediction, diverse multiple sequence alignment (MSA) generation leveraging both sequence and structure comparison, modeling exception handling, and deep learning-based protein model quality assessment. MULTICOM4 was blindly evaluated in the 16th Critical Assessment of Techniques for Protein Structure Prediction (CASP16) in 2024. In Phase 0 of CASP16, where stoichiometry information was unavailable, MULTICOM predictors performed best, with MULTICOM_human achieving a TM-score of 0.752 and a DockQ score of 0.584 for top-ranked predictions on average. In Phase 1 of CASP16, with stoichiometry information provided, MULTICOM_human remained among the top predictors, attaining a TM-score of 0.797 and a DockQ score of 0.558 on average. The CASP16 results demonstrate that integrating complementary AlphaFold2 and AlphaFold3 with enhanced MSA inputs, comprehensive model ranking, exception handling, and accurate stoichiometry prediction can effectively improve protein complex structure prediction.

随着AlphaFold对大多数单链蛋白(单体)实现高精度三级结构预测,蛋白质结构预测的下一个主要挑战是准确建模多链蛋白复合物(多聚体)。我们开发了MULTICOM系统的最新版本MULTICOM4,通过集成基于变压器的AlphaFold2,基于扩散模型的AlphaFold3和我们的内部技术来改进蛋白质复合物结构预测。这些包括蛋白质复杂化学计量预测、利用序列和结构比较的多种多序列比对(MSA)生成、建模异常处理以及基于深度学习的蛋白质模型质量评估。MULTICOM4在2024年第16届蛋白质结构预测技术关键评估(CASP16)中被盲目评价。在CASP16的0期,化学计量学信息不可用,MULTICOM预测器表现最好,MULTICOM_human的平均tm评分为0.752,DockQ评分为0.584。在提供了化学计量学信息的CASP16 1期中,MULTICOM_human仍然是最重要的预测因子之一,tm评分平均为0.797,DockQ评分平均为0.558。CASP16结果表明,将互补的AlphaFold2和AlphaFold3与增强的MSA输入、全面的模型排序、异常处理和精确的化学计量预测相结合,可以有效地提高蛋白质复合物结构的预测。
{"title":"Improving AlphaFold2- and AlphaFold3-Based Protein Complex Structure Prediction With MULTICOM4 in CASP16.","authors":"Jian Liu, Pawan Neupane, Jianlin Cheng","doi":"10.1002/prot.26850","DOIUrl":"10.1002/prot.26850","url":null,"abstract":"<p><p>With AlphaFold achieving high-accuracy tertiary structure prediction for most single-chain proteins (monomers), the next major challenge in protein structure prediction is to accurately model multichain protein complexes (multimers). We developed MULTICOM4, the latest version of the MULTICOM system, to improve protein complex structure prediction by integrating transformer-based AlphaFold2, diffusion model-based AlphaFold3, and our in-house techniques. These include protein complex stoichiometry prediction, diverse multiple sequence alignment (MSA) generation leveraging both sequence and structure comparison, modeling exception handling, and deep learning-based protein model quality assessment. MULTICOM4 was blindly evaluated in the 16th Critical Assessment of Techniques for Protein Structure Prediction (CASP16) in 2024. In Phase 0 of CASP16, where stoichiometry information was unavailable, MULTICOM predictors performed best, with MULTICOM_human achieving a TM-score of 0.752 and a DockQ score of 0.584 for top-ranked predictions on average. In Phase 1 of CASP16, with stoichiometry information provided, MULTICOM_human remained among the top predictors, attaining a TM-score of 0.797 and a DockQ score of 0.558 on average. The CASP16 results demonstrate that integrating complementary AlphaFold2 and AlphaFold3 with enhanced MSA inputs, comprehensive model ranking, exception handling, and accurate stoichiometry prediction can effectively improve protein complex structure prediction.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"131-141"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12354153/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144200947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ligand Binding Prediction on Pharmaceutical and Nucleic Acid Targets by the CoDock Group in CASP16. CASP16中CoDock基团对药物和核酸靶标的配体结合预测。
IF 2.8 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-01 Epub Date: 2025-08-01 DOI: 10.1002/prot.70032
Ren Kong, Zunyun Jiang, Xufeng Lu, Liangxu Xie, Shan Chang

Ligand binding prediction is a critical component of structure-based drug design, gaining prominence in Critical Assessment of protein Structure Prediction (CASP) since its introduction in CASP15. In CASP16, the challenges expanded to include protein-ligand and nucleic acid-ligand binding predictions, alongside binding affinity ranking, posing greater computational and methodological demands. This study presents a sophisticated prediction strategy combining template-based docking, multiple receptor conformations, and AI-driven scoring to address these challenges. For protein-ligand systems (L1000-L4000), we leveraged structural templates from PDB, ligand similarity analysis, and tools like CoDock-Ligand and AutoDock Vina to predict binding poses. Key successes included accurate predictions for targets like SARS-CoV-2 Mpro (L4000) and Autotaxin (L3000), though challenges persisted with binding site flexibility and pose ranking. The prediction of ligand pose achieved satisfactory results, with more than 66% of the distribution having RMSD less than 3 Å. Nucleic acid-ligand predictions (e.g., ZTP riboswitch) yielded mixed results, highlighting limitations in RNA/DNA structural accuracy. Affinity prediction employed diverse methods, with machine learning-based SVR_Conjoint outperforming physics-based approaches (Kendall's Tau = 0.43). Our strategy demonstrated robustness in CASP16, yet underscored the need for advancements in handling conformational dynamics and scoring accuracy. This work provides a framework for future ligand binding prediction efforts in computational drug discovery.

配体结合预测是基于结构的药物设计的关键组成部分,自CASP15引入以来,在蛋白质结构预测的关键评估(CASP)中获得了突出地位。在CASP16中,挑战扩大到包括蛋白质-配体和核酸-配体结合预测,以及结合亲和力排序,提出了更多的计算和方法要求。该研究提出了一种复杂的预测策略,结合了基于模板的对接、多种受体构象和人工智能驱动的评分来应对这些挑战。对于蛋白质-配体系统(L1000-L4000),我们利用PDB中的结构模板、配体相似性分析以及CoDock-Ligand和AutoDock Vina等工具来预测结合姿势。关键的成功包括准确预测SARS-CoV-2 Mpro (L4000)和Autotaxin (L3000)等靶标,尽管在结合位点灵活性和位姿排序方面仍然存在挑战。配体位姿的预测结果令人满意,超过66%的分布RMSD小于3 Å。核酸配体预测(例如,ZTP核糖开关)产生了不同的结果,突出了RNA/DNA结构准确性的局限性。亲和预测采用了多种方法,基于机器学习的svr_joint优于基于物理的方法(Kendall’s Tau = 0.43)。我们的策略证明了CASP16的稳健性,但强调了在处理构象动力学和评分准确性方面的进步。这项工作为未来计算药物发现中的配体结合预测工作提供了一个框架。
{"title":"Ligand Binding Prediction on Pharmaceutical and Nucleic Acid Targets by the CoDock Group in CASP16.","authors":"Ren Kong, Zunyun Jiang, Xufeng Lu, Liangxu Xie, Shan Chang","doi":"10.1002/prot.70032","DOIUrl":"10.1002/prot.70032","url":null,"abstract":"<p><p>Ligand binding prediction is a critical component of structure-based drug design, gaining prominence in Critical Assessment of protein Structure Prediction (CASP) since its introduction in CASP15. In CASP16, the challenges expanded to include protein-ligand and nucleic acid-ligand binding predictions, alongside binding affinity ranking, posing greater computational and methodological demands. This study presents a sophisticated prediction strategy combining template-based docking, multiple receptor conformations, and AI-driven scoring to address these challenges. For protein-ligand systems (L1000-L4000), we leveraged structural templates from PDB, ligand similarity analysis, and tools like CoDock-Ligand and AutoDock Vina to predict binding poses. Key successes included accurate predictions for targets like SARS-CoV-2 Mpro (L4000) and Autotaxin (L3000), though challenges persisted with binding site flexibility and pose ranking. The prediction of ligand pose achieved satisfactory results, with more than 66% of the distribution having RMSD less than 3 Å. Nucleic acid-ligand predictions (e.g., ZTP riboswitch) yielded mixed results, highlighting limitations in RNA/DNA structural accuracy. Affinity prediction employed diverse methods, with machine learning-based SVR_Conjoint outperforming physics-based approaches (Kendall's Tau = 0.43). Our strategy demonstrated robustness in CASP16, yet underscored the need for advancements in handling conformational dynamics and scoring accuracy. This work provides a framework for future ligand binding prediction efforts in computational drug discovery.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"276-285"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144765848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AlphaFold3 at CASP16. CASP16上的AlphaFold3。
IF 2.8 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-01 Epub Date: 2025-08-25 DOI: 10.1002/prot.70044
Arne Elofsson

The CASP16 experiment provided the first opportunity to benchmark AlphaFold3. In contrast to AlphaFold2, AlphaFold3 can predict the structure of non-protein molecules. According to the benchmark presented by the developers, it is expected to perform slightly better than AlphaFold2 for proteins. In this study, we assess the performance of AlphaFold3 using both automatic server submissions (AF3-server) and manual predictions from the Elofsson group (Elofsson). All predictions were generated via the AlphaFold3 web server, with manual interventions applied to large targets and ligands. Compared to AlphaFold2-based methods, we found that AlphaFold3 performs slightly better for protein complexes. However, when massive sampling is applied to AlphaFold2, the difference disappears. It was also noted that, according to the official ranking from CASP, the AF3-server performs better than AlphaFold2 for easier targets, but not for harder targets. Furthermore, the performance of the AF3-server is comparable to the best methods when considering the top-ranked predictions, but slightly behind when examining the best among the five submitted models. Here, there exist targets where AF3-server, the top-ranked method, is worse than lower-ranked models, indicating that a venue for progress could be to develop better strategies for identifying the best out of the generated models. When using AF3-server to predict the stoichiometry of larger protein complexes, the accuracy is limited, especially for heteromeric targets. When analyzing the predictions including nucleic acids, it was found that, in general, the accuracy is relatively low. However, the AF3-server performance was not far behind that of the top-ranked method. In summary, AF3-server offers a user-friendly tool that provides predictions comparable to state-of-the-art methods in all categories of CASP.

CASP16实验首次提供了对AlphaFold3进行基准测试的机会。与AlphaFold2相比,AlphaFold3可以预测非蛋白分子的结构。根据开发人员提供的基准测试,预计它在蛋白质方面的表现将略好于AlphaFold2。在本研究中,我们使用自动服务器提交(AF3-server)和Elofsson组(Elofsson)的手动预测来评估AlphaFold3的性能。所有的预测都是通过AlphaFold3网络服务器生成的,人工干预应用于大的目标和配体。与基于alphafold2的方法相比,我们发现AlphaFold3对蛋白质复合物的表现略好。然而,当对AlphaFold2进行大规模采样时,这种差异就消失了。还指出,根据CASP的官方排名,af3服务器在容易的目标上比AlphaFold2表现得更好,但在更难的目标上却没有。此外,在考虑排名靠前的预测时,af3服务器的性能与最佳方法相当,但在检查五个提交的模型中的最佳模型时略落后。在这里,存在AF3-server(排名靠前的方法)比排名靠后的模型更差的目标,这表明可以开发更好的策略来识别生成的模型中的最佳模型。当使用AF3-server预测较大蛋白质复合物的化学计量时,准确性是有限的,特别是对于异质靶标。在对包括核酸在内的预测进行分析时,发现总体上准确率较低。但是,af3服务器的性能与排名第一的方法相差无几。总之,AF3-server提供了一个用户友好的工具,可以提供与所有类别CASP中最先进的方法相媲美的预测。
{"title":"AlphaFold3 at CASP16.","authors":"Arne Elofsson","doi":"10.1002/prot.70044","DOIUrl":"10.1002/prot.70044","url":null,"abstract":"<p><p>The CASP16 experiment provided the first opportunity to benchmark AlphaFold3. In contrast to AlphaFold2, AlphaFold3 can predict the structure of non-protein molecules. According to the benchmark presented by the developers, it is expected to perform slightly better than AlphaFold2 for proteins. In this study, we assess the performance of AlphaFold3 using both automatic server submissions (AF3-server) and manual predictions from the Elofsson group (Elofsson). All predictions were generated via the AlphaFold3 web server, with manual interventions applied to large targets and ligands. Compared to AlphaFold2-based methods, we found that AlphaFold3 performs slightly better for protein complexes. However, when massive sampling is applied to AlphaFold2, the difference disappears. It was also noted that, according to the official ranking from CASP, the AF3-server performs better than AlphaFold2 for easier targets, but not for harder targets. Furthermore, the performance of the AF3-server is comparable to the best methods when considering the top-ranked predictions, but slightly behind when examining the best among the five submitted models. Here, there exist targets where AF3-server, the top-ranked method, is worse than lower-ranked models, indicating that a venue for progress could be to develop better strategies for identifying the best out of the generated models. When using AF3-server to predict the stoichiometry of larger protein complexes, the accuracy is limited, especially for heteromeric targets. When analyzing the predictions including nucleic acids, it was found that, in general, the accuracy is relatively low. However, the AF3-server performance was not far behind that of the top-ranked method. In summary, AF3-server offers a user-friendly tool that provides predictions comparable to state-of-the-art methods in all categories of CASP.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"154-166"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750027/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144980277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Protein-Ligand Structure Prediction by Template-Guided Ensemble Docking Strategy. 基于模板引导集成对接策略的蛋白质配体结构预测。
IF 2.8 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-01 Epub Date: 2025-10-06 DOI: 10.1002/prot.70063
Keqiong Zhang, Qilong Wu, Sheng-You Huang

In the 15th Critical Assessment of Techniques for Structure Prediction (CASP15), the category of protein-ligand complexes was introduced to advance the development of protein-ligand structure prediction techniques. CASP16 further expanded this category by introducing four sets of pharmaceutical targets as super-targets. Each super-target consists of multiple protein-ligand complexes involving the same protein but different ligands. Given the outstanding performance of template-based methods in CASP15, we employed a template-guided ensemble docking strategy for ligand (LG) tasks in CASP16. MODELER, AlphaFold3, and AlphaFold-Multimer were used to generate structural ensembles for each target protein. Then, we searched the Protein Data Bank (PDB) for reliable template complexes based on sequence identity, ligand similarity, and maximum common substructure (MCS) coverage score. If templates were identified, we used LSalign to perform ligand 3D alignment. For targets without a template, XDock and MDock were used to predict the binding poses. Finally, a knowledge-based scoring function, ITScore, was employed for energy evaluation. It is shown that our method performed well in the CASP16's LG tasks, ranking 4th out of 38 participating teams.

在第15届结构预测技术关键评估(CASP15)中,引入了蛋白质-配体复合物的类别,推动了蛋白质-配体结构预测技术的发展。CASP16通过引入四组药物靶点作为超级靶点进一步扩展了这一类别。每个超级靶标由多个蛋白质-配体复合物组成,涉及相同的蛋白质但不同的配体。鉴于基于模板的方法在CASP15中的出色表现,我们采用模板引导的集成对接策略来完成CASP16中的配体(LG)任务。使用MODELER、AlphaFold3和AlphaFold-Multimer生成每个目标蛋白的结构集合。然后,我们根据序列一致性、配体相似性和最大共同亚结构(MCS)覆盖评分在蛋白质数据库(PDB)中搜索可靠的模板配合物。如果模板被识别,我们使用LSalign进行配体3D对齐。对于没有模板的目标,使用XDock和MDock预测结合姿态。最后,采用基于知识的评分函数ITScore进行能量评价。结果表明,我们的方法在CASP16的LG任务中表现良好,在38个参赛队中排名第4。
{"title":"Protein-Ligand Structure Prediction by Template-Guided Ensemble Docking Strategy.","authors":"Keqiong Zhang, Qilong Wu, Sheng-You Huang","doi":"10.1002/prot.70063","DOIUrl":"10.1002/prot.70063","url":null,"abstract":"<p><p>In the 15th Critical Assessment of Techniques for Structure Prediction (CASP15), the category of protein-ligand complexes was introduced to advance the development of protein-ligand structure prediction techniques. CASP16 further expanded this category by introducing four sets of pharmaceutical targets as super-targets. Each super-target consists of multiple protein-ligand complexes involving the same protein but different ligands. Given the outstanding performance of template-based methods in CASP15, we employed a template-guided ensemble docking strategy for ligand (LG) tasks in CASP16. MODELER, AlphaFold3, and AlphaFold-Multimer were used to generate structural ensembles for each target protein. Then, we searched the Protein Data Bank (PDB) for reliable template complexes based on sequence identity, ligand similarity, and maximum common substructure (MCS) coverage score. If templates were identified, we used LSalign to perform ligand 3D alignment. For targets without a template, XDock and MDock were used to predict the binding poses. Finally, a knowledge-based scoring function, ITScore, was employed for energy evaluation. It is shown that our method performed well in the CASP16's LG tasks, ranking 4th out of 38 participating teams.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"267-275"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145234256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Beyond Single Chains: Benchmarking Macromolecular Complex Prediction Methods With the Continuous Automated Model EvaluatiOn (CAMEO). 超越单链:用连续自动模型评估(CAMEO)对标大分子复合物预测方法。
IF 2.8 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-01 Epub Date: 2025-09-28 DOI: 10.1002/prot.70060
Xavier Robin, Peter Škrinjar, Andrew M Waterhouse, Gabriel Studer, Gerardo Tauriello, Janani Durairaj, Torsten Schwede

Independent, blind assessment of structure prediction methods is essential for establishing state-of-the-art performance, identifying limitations, and guiding future developments. The Continuous Automated Model EvaluatiOn (CAMEO) platform provides weekly, automated benchmarking of structure prediction servers, complementing the biennial Critical Assessment of Structure Prediction (CASP) experiments.

独立的、盲目的结构预测方法评估对于建立最先进的性能、识别局限性和指导未来发展至关重要。连续自动化模型评估(CAMEO)平台每周提供结构预测服务器的自动基准测试,补充了两年一次的结构预测关键评估(CASP)实验。
{"title":"Beyond Single Chains: Benchmarking Macromolecular Complex Prediction Methods With the Continuous Automated Model EvaluatiOn (CAMEO).","authors":"Xavier Robin, Peter Škrinjar, Andrew M Waterhouse, Gabriel Studer, Gerardo Tauriello, Janani Durairaj, Torsten Schwede","doi":"10.1002/prot.70060","DOIUrl":"10.1002/prot.70060","url":null,"abstract":"<p><p>Independent, blind assessment of structure prediction methods is essential for establishing state-of-the-art performance, identifying limitations, and guiding future developments. The Continuous Automated Model EvaluatiOn (CAMEO) platform provides weekly, automated benchmarking of structure prediction servers, complementing the biennial Critical Assessment of Structure Prediction (CASP) experiments.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"403-413"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750032/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145187574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling Protein-Protein and Protein-Ligand Interactions by the ClusPro Team in CASP16. ClusPro团队在CASP16中模拟蛋白质-蛋白质和蛋白质-配体相互作用。
IF 2.8 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-01 Epub Date: 2025-10-20 DOI: 10.1002/prot.70066
Ryota Ashizawa, Sergei Kotelnikov, Omeir Khan, Stan Xiaogang Li, Ernest Glukhov, Xin Cao, Maria Lazou, Ayse Bekar-Cesaretli, Derara Hailegeorgis, Veranika Averkava, Yimin Zhu, George Jones, Hao Yu, Dmytro Kalitin, Darya Stepanenko, Kushal Koirala, Taras Patsahan, Dmitri Beglov, Mark Lukin, Diane Joseph-McCarthy, Carlos Simmerling, Alexander Tropsha, Evangelos Coutsias, Ken A Dill, Dzmitry Padhorny, Sandor Vajda, Dima Kozakov

In the CASP16 experiment, our team employed hybrid computational strategies to predict both protein-protein and protein-ligand complex structures. For protein-protein docking, we combined physics-based sampling-using ClusPro FFT docking and molecular dynamics-with AlphaFold (AF)-based sampling, followed by AF-based refinement. Our method produced numerous high-accuracy complex models, including cases where AF alone failed, underscoring the critical role of physics-based sampling alongside deep learning-based refinement. For protein-ligand docking, we integrated the ClusPro LigTBM template-based approach with a machine learning-based confidence model for rescoring. The method preserves conserved interaction fragments derived from homologous complexes, followed by local resampling using physics-based sampling and a diffusion model. Our template-based strategy achieved a mean lDDT-PLI of 0.69 across 233 targets, which was highly competitive. These results demonstrate that combining physics-based modeling with AI-driven refinement can significantly enhance the accuracy of both protein-protein and protein-ligand structure predictions.

在CASP16实验中,我们的团队采用混合计算策略来预测蛋白质-蛋白质和蛋白质-配体复合物结构。对于蛋白质-蛋白质对接,我们将基于物理的采样(使用ClusPro FFT对接和分子动力学)与基于AlphaFold (AF)的采样结合起来,然后进行基于AF的细化。我们的方法产生了许多高精度的复杂模型,包括单独AF失败的情况,强调了基于物理的采样和基于深度学习的改进的关键作用。对于蛋白质配体对接,我们将基于ClusPro lightbm模板的方法与基于机器学习的置信度模型集成在一起进行评分。该方法保留来自同源配合物的保守相互作用片段,然后使用基于物理的采样和扩散模型进行局部重采样。我们基于模板的策略在233个目标中实现了0.69的平均lDDT-PLI,这是非常具有竞争力的。这些结果表明,将基于物理的建模与人工智能驱动的细化相结合,可以显著提高蛋白质-蛋白质和蛋白质-配体结构预测的准确性。
{"title":"Modeling Protein-Protein and Protein-Ligand Interactions by the ClusPro Team in CASP16.","authors":"Ryota Ashizawa, Sergei Kotelnikov, Omeir Khan, Stan Xiaogang Li, Ernest Glukhov, Xin Cao, Maria Lazou, Ayse Bekar-Cesaretli, Derara Hailegeorgis, Veranika Averkava, Yimin Zhu, George Jones, Hao Yu, Dmytro Kalitin, Darya Stepanenko, Kushal Koirala, Taras Patsahan, Dmitri Beglov, Mark Lukin, Diane Joseph-McCarthy, Carlos Simmerling, Alexander Tropsha, Evangelos Coutsias, Ken A Dill, Dzmitry Padhorny, Sandor Vajda, Dima Kozakov","doi":"10.1002/prot.70066","DOIUrl":"10.1002/prot.70066","url":null,"abstract":"<p><p>In the CASP16 experiment, our team employed hybrid computational strategies to predict both protein-protein and protein-ligand complex structures. For protein-protein docking, we combined physics-based sampling-using ClusPro FFT docking and molecular dynamics-with AlphaFold (AF)-based sampling, followed by AF-based refinement. Our method produced numerous high-accuracy complex models, including cases where AF alone failed, underscoring the critical role of physics-based sampling alongside deep learning-based refinement. For protein-ligand docking, we integrated the ClusPro LigTBM template-based approach with a machine learning-based confidence model for rescoring. The method preserves conserved interaction fragments derived from homologous complexes, followed by local resampling using physics-based sampling and a diffusion model. Our template-based strategy achieved a mean lDDT-PLI of 0.69 across 233 targets, which was highly competitive. These results demonstrate that combining physics-based modeling with AI-driven refinement can significantly enhance the accuracy of both protein-protein and protein-ligand structure predictions.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"183-191"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750026/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145338256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling Alternative Conformational States in CASP16. CASP16中不同构象态的建模。
IF 2.8 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-01 Epub Date: 2025-10-28 DOI: 10.1002/prot.70065
Namita Dube, Theresa A Ramelot, Tiburon L Benavides, Yuanpeng J Huang, John Moult, Andriy Kryshtafovych, Gaetano T Montelione

The CASP16 Ensemble Prediction experiment assessed advances in methods for modeling proteins, nucleic acids, and their complexes in multiple conformational states. Targets included systems with experimental structures determined in two or three states, evaluated by direct comparison to experimental coordinates, as well as domain-linker-domain (D-L-D) targets assessed against statistical models generated from NMR and SAXS data. This paper focuses on the former class of multi-state targets. Ten ensembles were released as community challenges, including ligand-induced conformational changes, protein-DNA complexes, a trimeric protein, a stem-loop RNA, and multiple oligomeric states of a single RNA. For five targets, some groups produced reasonably accurate models of both reference states (best TM-score > 0.75). However, with the exception of one protein-ligand complex (T1214), where an apo structure was available as a template, predictors generally failed to capture key structural details distinguishing the states. Overall, accuracy was significantly lower than for single-state targets in other CASP experiments. The most successful approaches generated multiple AlphaFold2 models using enhanced multiple sequence alignments and sampling protocols, followed by model quality-based selection. Although the AlphaFold3 server performed well on several targets, individual groups outperformed it in specific cases. By contrast, predictions for one protein-DNA complex, three RNA targets, and multiple oligomeric RNA states consistently fell short (TM-score < 0.75). These results highlight both progress and persistent challenges in multi-state prediction. Despite recent advances, accurate modeling of conformational ensembles, particularly RNA and large multimeric assemblies, remains an important frontier for structural biology.

CASP16集合预测实验评估了多种构象状态下蛋白质、核酸及其复合物建模方法的进展。目标包括具有两种或三种状态的实验结构的系统,通过与实验坐标的直接比较进行评估,以及根据NMR和SAXS数据生成的统计模型评估的域连接域(D-L-D)目标。本文主要研究前一类多状态目标。作为群落挑战释放了10个集成,包括配体诱导的构象变化,蛋白质- dna复合物,三聚体蛋白,茎环RNA和单个RNA的多个低聚态。对于5个目标,一些小组产生了相当准确的两种参考状态模型(最佳tm得分为0.75)。然而,除了一种蛋白质-配体复合物(T1214),其中载脂蛋白结构可作为模板,预测器通常无法捕获区分状态的关键结构细节。总体而言,准确度明显低于其他CASP实验中的单状态目标。最成功的方法是使用增强的多序列比对和采样协议生成多个AlphaFold2模型,然后是基于模型质量的选择。虽然AlphaFold3服务器在几个目标上表现良好,但在特定情况下,个别组的表现优于它。相比之下,对一种蛋白质- dna复合物、三种RNA靶标和多种寡聚RNA状态的预测一直低于tm评分
{"title":"Modeling Alternative Conformational States in CASP16.","authors":"Namita Dube, Theresa A Ramelot, Tiburon L Benavides, Yuanpeng J Huang, John Moult, Andriy Kryshtafovych, Gaetano T Montelione","doi":"10.1002/prot.70065","DOIUrl":"10.1002/prot.70065","url":null,"abstract":"<p><p>The CASP16 Ensemble Prediction experiment assessed advances in methods for modeling proteins, nucleic acids, and their complexes in multiple conformational states. Targets included systems with experimental structures determined in two or three states, evaluated by direct comparison to experimental coordinates, as well as domain-linker-domain (D-L-D) targets assessed against statistical models generated from NMR and SAXS data. This paper focuses on the former class of multi-state targets. Ten ensembles were released as community challenges, including ligand-induced conformational changes, protein-DNA complexes, a trimeric protein, a stem-loop RNA, and multiple oligomeric states of a single RNA. For five targets, some groups produced reasonably accurate models of both reference states (best TM-score > 0.75). However, with the exception of one protein-ligand complex (T1214), where an apo structure was available as a template, predictors generally failed to capture key structural details distinguishing the states. Overall, accuracy was significantly lower than for single-state targets in other CASP experiments. The most successful approaches generated multiple AlphaFold2 models using enhanced multiple sequence alignments and sampling protocols, followed by model quality-based selection. Although the AlphaFold3 server performed well on several targets, individual groups outperformed it in specific cases. By contrast, predictions for one protein-DNA complex, three RNA targets, and multiple oligomeric RNA states consistently fell short (TM-score < 0.75). These results highlight both progress and persistent challenges in multi-state prediction. Despite recent advances, accurate modeling of conformational ensembles, particularly RNA and large multimeric assemblies, remains an important frontier for structural biology.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"330-347"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145379883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Blind Prediction of Complex Water and Ion Ensembles Around RNA in CASP16. CASP16中RNA周围水离子复合物的盲预测
IF 2.8 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-01 Epub Date: 2025-11-08 DOI: 10.1002/prot.70079
Rachael C Kretsch, Elisa Posani, Eugene F Baulin, Janusz M Bujnicki, Giovanni Bussi, Thomas E Cheatham, Shi-Jie Chen, Arne Elofsson, Masoud Amiri Farsani, Olivia N Fisher, M Michael Gromiha, Ayush Gupta, Michiaki Hamada, K Harini, Gang Hu, David Huang, Junichi Iwakiri, Anika Jain, Yuki Kagaya, Daisuke Kihara, Sebastian Kmiecik, Sowmya Ramaswamy Krishnan, Ikuo Kurisaki, Olivier Languin-Cattoën, Jun Li, Shanshan Li, Karim Malekzadeh, Tsukasa Nakamura, Wentao Ni, Chandran Nithin, Michael Z Palo, Joon Hong Park, Smita P Pilla, Simón Poblete, Fabrizio Pucci, Pranav Punuru, Anouka Saha, Kengo Sato, Ambuj Srivastava, Genki Terashi, Emilia Tugolukova, Jacob Verburgt, Qiqige Wuyun, Gül H Zerze, Kaiming Zhang, Sicheng Zhang, Wei Zheng, Yuanzhe Zhou, Wah Chiu, David A Case, Rhiju Das

Biomolecules rely on water and ions for stable folding, but these interactions are often transient, dynamic, or disordered and thus hidden from experiments and evaluation challenges that represent biomolecules as single, ordered structures. Here, we compare blindly predicted ensembles of water and ion structure to the cryo-EM densities observed around the Tetrahymena ribozyme at 2.2-2.3 Å resolution, collected through target R1260 in the CASP16 competition. Twenty-six groups participated in this solvation "cryo-ensemble" prediction challenge, submitting over 350 million atoms in total, offering the first opportunity to compare blind predictions of dynamic solvent shell ensembles to cryo-EM density. Predicted atomic ensembles were converted to density through local alignment and these densities were compared to the cryo-EM densities using Pearson correlation, Spearman correlation, mutual information, and precision-recall curves. These predictions show that an ensemble representation is able to capture information of transient or dynamic water and ions better than traditional atomic models, but there remains a large accuracy gap to the performance ceiling set by experimental uncertainty. Overall, molecular dynamics approaches best matched the cryo-EM density, with blind predictions from bussilab_plain_md, SoutheRNA, bussilab_replex, coogs2, and coogs3 outperforming the baseline molecular dynamics prediction. This study indicates that simulations of water and ions can be quantitatively evaluated with cryo-EM maps. We propose that further community-wide blind challenges can drive and evaluate progress in modeling water, ions, and other previously hidden components of biomolecular systems.

生物分子依赖于水和离子进行稳定的折叠,但这些相互作用通常是短暂的、动态的或无序的,因此隐藏在代表生物分子单一有序结构的实验和评估挑战中。在这里,我们将盲目预测的水和离子结构集合与在2.2-2.3 Å分辨率下观察到的四膜核酶周围的低温电镜密度进行了比较,这些密度是通过CASP16竞争中的靶标R1260收集的。26个小组参加了这次溶剂化“低温系综”预测挑战,总共提交了超过3.5亿个原子,首次提供了将动态溶剂壳系综的盲目预测与低温em密度进行比较的机会。通过局部比对将预测的原子系综转换为密度,并使用Pearson相关、Spearman相关、互信息和精确召回曲线将这些密度与cryo-EM密度进行比较。这些预测表明,与传统的原子模型相比,集合表示能够更好地捕获瞬态或动态水和离子的信息,但与实验不确定性设定的性能上限相比,仍然存在很大的精度差距。总的来说,分子动力学方法与低温电子显微镜密度最匹配,bussilab_plain_md、SoutheRNA、bussilab_replex、coogs2和coogs3的盲预测优于基线分子动力学预测。这项研究表明,水和离子的模拟可以定量评估与低温电镜图。我们建议进一步的社区范围内的盲挑战可以推动和评估水,离子和其他以前隐藏的生物分子系统成分的建模进展。
{"title":"Blind Prediction of Complex Water and Ion Ensembles Around RNA in CASP16.","authors":"Rachael C Kretsch, Elisa Posani, Eugene F Baulin, Janusz M Bujnicki, Giovanni Bussi, Thomas E Cheatham, Shi-Jie Chen, Arne Elofsson, Masoud Amiri Farsani, Olivia N Fisher, M Michael Gromiha, Ayush Gupta, Michiaki Hamada, K Harini, Gang Hu, David Huang, Junichi Iwakiri, Anika Jain, Yuki Kagaya, Daisuke Kihara, Sebastian Kmiecik, Sowmya Ramaswamy Krishnan, Ikuo Kurisaki, Olivier Languin-Cattoën, Jun Li, Shanshan Li, Karim Malekzadeh, Tsukasa Nakamura, Wentao Ni, Chandran Nithin, Michael Z Palo, Joon Hong Park, Smita P Pilla, Simón Poblete, Fabrizio Pucci, Pranav Punuru, Anouka Saha, Kengo Sato, Ambuj Srivastava, Genki Terashi, Emilia Tugolukova, Jacob Verburgt, Qiqige Wuyun, Gül H Zerze, Kaiming Zhang, Sicheng Zhang, Wei Zheng, Yuanzhe Zhou, Wah Chiu, David A Case, Rhiju Das","doi":"10.1002/prot.70079","DOIUrl":"10.1002/prot.70079","url":null,"abstract":"<p><p>Biomolecules rely on water and ions for stable folding, but these interactions are often transient, dynamic, or disordered and thus hidden from experiments and evaluation challenges that represent biomolecules as single, ordered structures. Here, we compare blindly predicted ensembles of water and ion structure to the cryo-EM densities observed around the Tetrahymena ribozyme at 2.2-2.3 Å resolution, collected through target R1260 in the CASP16 competition. Twenty-six groups participated in this solvation \"cryo-ensemble\" prediction challenge, submitting over 350 million atoms in total, offering the first opportunity to compare blind predictions of dynamic solvent shell ensembles to cryo-EM density. Predicted atomic ensembles were converted to density through local alignment and these densities were compared to the cryo-EM densities using Pearson correlation, Spearman correlation, mutual information, and precision-recall curves. These predictions show that an ensemble representation is able to capture information of transient or dynamic water and ions better than traditional atomic models, but there remains a large accuracy gap to the performance ceiling set by experimental uncertainty. Overall, molecular dynamics approaches best matched the cryo-EM density, with blind predictions from bussilab_plain_md, SoutheRNA, bussilab_replex, coogs2, and coogs3 outperforming the baseline molecular dynamics prediction. This study indicates that simulations of water and ions can be quantitatively evaluated with cryo-EM maps. We propose that further community-wide blind challenges can drive and evaluate progress in modeling water, ions, and other previously hidden components of biomolecular systems.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"381-402"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145472539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proteins-Structure Function and Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1