Proteins-Structure Function and Bioinformatics最新文献_第6页

Blind Prediction of Complex Water and Ion Ensembles Around RNA in CASP16. CASP16中RNA周围水离子复合物的盲预测

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics

Pub Date : 2026-01-01 Epub Date: 2025-11-08 DOI: 10.1002/prot.70079

Rachael C Kretsch, Elisa Posani, Eugene F Baulin, Janusz M Bujnicki, Giovanni Bussi, Thomas E Cheatham, Shi-Jie Chen, Arne Elofsson, Masoud Amiri Farsani, Olivia N Fisher, M Michael Gromiha, Ayush Gupta, Michiaki Hamada, K Harini, Gang Hu, David Huang, Junichi Iwakiri, Anika Jain, Yuki Kagaya, Daisuke Kihara, Sebastian Kmiecik, Sowmya Ramaswamy Krishnan, Ikuo Kurisaki, Olivier Languin-Cattoën, Jun Li, Shanshan Li, Karim Malekzadeh, Tsukasa Nakamura, Wentao Ni, Chandran Nithin, Michael Z Palo, Joon Hong Park, Smita P Pilla, Simón Poblete, Fabrizio Pucci, Pranav Punuru, Anouka Saha, Kengo Sato, Ambuj Srivastava, Genki Terashi, Emilia Tugolukova, Jacob Verburgt, Qiqige Wuyun, Gül H Zerze, Kaiming Zhang, Sicheng Zhang, Wei Zheng, Yuanzhe Zhou, Wah Chiu, David A Case, Rhiju Das

Biomolecules rely on water and ions for stable folding, but these interactions are often transient, dynamic, or disordered and thus hidden from experiments and evaluation challenges that represent biomolecules as single, ordered structures. Here, we compare blindly predicted ensembles of water and ion structure to the cryo-EM densities observed around the Tetrahymena ribozyme at 2.2-2.3 Å resolution, collected through target R1260 in the CASP16 competition. Twenty-six groups participated in this solvation "cryo-ensemble" prediction challenge, submitting over 350 million atoms in total, offering the first opportunity to compare blind predictions of dynamic solvent shell ensembles to cryo-EM density. Predicted atomic ensembles were converted to density through local alignment and these densities were compared to the cryo-EM densities using Pearson correlation, Spearman correlation, mutual information, and precision-recall curves. These predictions show that an ensemble representation is able to capture information of transient or dynamic water and ions better than traditional atomic models, but there remains a large accuracy gap to the performance ceiling set by experimental uncertainty. Overall, molecular dynamics approaches best matched the cryo-EM density, with blind predictions from bussilab_plain_md, SoutheRNA, bussilab_replex, coogs2, and coogs3 outperforming the baseline molecular dynamics prediction. This study indicates that simulations of water and ions can be quantitatively evaluated with cryo-EM maps. We propose that further community-wide blind challenges can drive and evaluate progress in modeling water, ions, and other previously hidden components of biomolecular systems.

生物分子依赖于水和离子进行稳定的折叠，但这些相互作用通常是短暂的、动态的或无序的，因此隐藏在代表生物分子单一有序结构的实验和评估挑战中。在这里，我们将盲目预测的水和离子结构集合与在2.2-2.3 Å分辨率下观察到的四膜核酶周围的低温电镜密度进行了比较，这些密度是通过CASP16竞争中的靶标R1260收集的。26个小组参加了这次溶剂化“低温系综”预测挑战，总共提交了超过3.5亿个原子，首次提供了将动态溶剂壳系综的盲目预测与低温em密度进行比较的机会。通过局部比对将预测的原子系综转换为密度，并使用Pearson相关、Spearman相关、互信息和精确召回曲线将这些密度与cryo-EM密度进行比较。这些预测表明，与传统的原子模型相比，集合表示能够更好地捕获瞬态或动态水和离子的信息，但与实验不确定性设定的性能上限相比，仍然存在很大的精度差距。总的来说，分子动力学方法与低温电子显微镜密度最匹配，bussilab_plain_md、SoutheRNA、bussilab_replex、coogs2和coogs3的盲预测优于基线分子动力学预测。这项研究表明，水和离子的模拟可以定量评估与低温电镜图。我们建议进一步的社区范围内的盲挑战可以推动和评估水，离子和其他以前隐藏的生物分子系统成分的建模进展。

{"title":"Blind Prediction of Complex Water and Ion Ensembles Around RNA in CASP16.","authors":"Rachael C Kretsch, Elisa Posani, Eugene F Baulin, Janusz M Bujnicki, Giovanni Bussi, Thomas E Cheatham, Shi-Jie Chen, Arne Elofsson, Masoud Amiri Farsani, Olivia N Fisher, M Michael Gromiha, Ayush Gupta, Michiaki Hamada, K Harini, Gang Hu, David Huang, Junichi Iwakiri, Anika Jain, Yuki Kagaya, Daisuke Kihara, Sebastian Kmiecik, Sowmya Ramaswamy Krishnan, Ikuo Kurisaki, Olivier Languin-Cattoën, Jun Li, Shanshan Li, Karim Malekzadeh, Tsukasa Nakamura, Wentao Ni, Chandran Nithin, Michael Z Palo, Joon Hong Park, Smita P Pilla, Simón Poblete, Fabrizio Pucci, Pranav Punuru, Anouka Saha, Kengo Sato, Ambuj Srivastava, Genki Terashi, Emilia Tugolukova, Jacob Verburgt, Qiqige Wuyun, Gül H Zerze, Kaiming Zhang, Sicheng Zhang, Wei Zheng, Yuanzhe Zhou, Wah Chiu, David A Case, Rhiju Das","doi":"10.1002/prot.70079","DOIUrl":"10.1002/prot.70079","url":null,"abstract":"Biomolecules rely on water and ions for stable folding, but these interactions are often transient, dynamic, or disordered and thus hidden from experiments and evaluation challenges that represent biomolecules as single, ordered structures. Here, we compare blindly predicted ensembles of water and ion structure to the cryo-EM densities observed around the Tetrahymena ribozyme at 2.2-2.3 Å resolution, collected through target R1260 in the CASP16 competition. Twenty-six groups participated in this solvation \"cryo-ensemble\" prediction challenge, submitting over 350 million atoms in total, offering the first opportunity to compare blind predictions of dynamic solvent shell ensembles to cryo-EM density. Predicted atomic ensembles were converted to density through local alignment and these densities were compared to the cryo-EM densities using Pearson correlation, Spearman correlation, mutual information, and precision-recall curves. These predictions show that an ensemble representation is able to capture information of transient or dynamic water and ions better than traditional atomic models, but there remains a large accuracy gap to the performance ceiling set by experimental uncertainty. Overall, molecular dynamics approaches best matched the cryo-EM density, with blind predictions from bussilab_plain_md, SoutheRNA, bussilab_replex, coogs2, and coogs3 outperforming the baseline molecular dynamics prediction. This study indicates that simulations of water and ions can be quantitatively evaluated with cryo-EM maps. We propose that further community-wide blind challenges can drive and evaluate progress in modeling water, ions, and other previously hidden components of biomolecular systems.","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"381-402"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145472539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Modeling Alternative Conformational States in CASP16. CASP16中不同构象态的建模。

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics

Pub Date : 2026-01-01 Epub Date: 2025-10-28 DOI: 10.1002/prot.70065

Namita Dube, Theresa A Ramelot, Tiburon L Benavides, Yuanpeng J Huang, John Moult, Andriy Kryshtafovych, Gaetano T Montelione

The CASP16 Ensemble Prediction experiment assessed advances in methods for modeling proteins, nucleic acids, and their complexes in multiple conformational states. Targets included systems with experimental structures determined in two or three states, evaluated by direct comparison to experimental coordinates, as well as domain-linker-domain (D-L-D) targets assessed against statistical models generated from NMR and SAXS data. This paper focuses on the former class of multi-state targets. Ten ensembles were released as community challenges, including ligand-induced conformational changes, protein-DNA complexes, a trimeric protein, a stem-loop RNA, and multiple oligomeric states of a single RNA. For five targets, some groups produced reasonably accurate models of both reference states (best TM-score > 0.75). However, with the exception of one protein-ligand complex (T1214), where an apo structure was available as a template, predictors generally failed to capture key structural details distinguishing the states. Overall, accuracy was significantly lower than for single-state targets in other CASP experiments. The most successful approaches generated multiple AlphaFold2 models using enhanced multiple sequence alignments and sampling protocols, followed by model quality-based selection. Although the AlphaFold3 server performed well on several targets, individual groups outperformed it in specific cases. By contrast, predictions for one protein-DNA complex, three RNA targets, and multiple oligomeric RNA states consistently fell short (TM-score < 0.75). These results highlight both progress and persistent challenges in multi-state prediction. Despite recent advances, accurate modeling of conformational ensembles, particularly RNA and large multimeric assemblies, remains an important frontier for structural biology.

CASP16集合预测实验评估了多种构象状态下蛋白质、核酸及其复合物建模方法的进展。目标包括具有两种或三种状态的实验结构的系统，通过与实验坐标的直接比较进行评估，以及根据NMR和SAXS数据生成的统计模型评估的域连接域（D-L-D）目标。本文主要研究前一类多状态目标。作为群落挑战释放了10个集成，包括配体诱导的构象变化，蛋白质- dna复合物，三聚体蛋白，茎环RNA和单个RNA的多个低聚态。对于5个目标，一些小组产生了相当准确的两种参考状态模型（最佳tm得分为0.75）。然而，除了一种蛋白质-配体复合物（T1214），其中载脂蛋白结构可作为模板，预测器通常无法捕获区分状态的关键结构细节。总体而言，准确度明显低于其他CASP实验中的单状态目标。最成功的方法是使用增强的多序列比对和采样协议生成多个AlphaFold2模型，然后是基于模型质量的选择。虽然AlphaFold3服务器在几个目标上表现良好，但在特定情况下，个别组的表现优于它。相比之下，对一种蛋白质- dna复合物、三种RNA靶标和多种寡聚RNA状态的预测一直低于tm评分

{"title":"Modeling Alternative Conformational States in CASP16.","authors":"Namita Dube, Theresa A Ramelot, Tiburon L Benavides, Yuanpeng J Huang, John Moult, Andriy Kryshtafovych, Gaetano T Montelione","doi":"10.1002/prot.70065","DOIUrl":"10.1002/prot.70065","url":null,"abstract":"The CASP16 Ensemble Prediction experiment assessed advances in methods for modeling proteins, nucleic acids, and their complexes in multiple conformational states. Targets included systems with experimental structures determined in two or three states, evaluated by direct comparison to experimental coordinates, as well as domain-linker-domain (D-L-D) targets assessed against statistical models generated from NMR and SAXS data. This paper focuses on the former class of multi-state targets. Ten ensembles were released as community challenges, including ligand-induced conformational changes, protein-DNA complexes, a trimeric protein, a stem-loop RNA, and multiple oligomeric states of a single RNA. For five targets, some groups produced reasonably accurate models of both reference states (best TM-score > 0.75). However, with the exception of one protein-ligand complex (T1214), where an apo structure was available as a template, predictors generally failed to capture key structural details distinguishing the states. Overall, accuracy was significantly lower than for single-state targets in other CASP experiments. The most successful approaches generated multiple AlphaFold2 models using enhanced multiple sequence alignments and sampling protocols, followed by model quality-based selection. Although the AlphaFold3 server performed well on several targets, individual groups outperformed it in specific cases. By contrast, predictions for one protein-DNA complex, three RNA targets, and multiple oligomeric RNA states consistently fell short (TM-score < 0.75). These results highlight both progress and persistent challenges in multi-state prediction. Despite recent advances, accurate modeling of conformational ensembles, particularly RNA and large multimeric assemblies, remains an important frontier for structural biology.","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"330-347"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12901538/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145379883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Engaging the Community: CASP Special Interest Groups. 参与社区：CASP特别兴趣小组。

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics

Pub Date : 2026-01-01 Epub Date: 2025-04-30 DOI: 10.1002/prot.26833

Arne Elofsson, Rachael C Kretsch, Marcin Magnus, Gaetano T Montelione

The Critical Assessment of Structure Prediction (CASP) brings together a diverse group of scientists, from deep learning experts to NMR specialists, all aimed at developing accurate prediction algorithms that can effectively characterize the structural aspects of biomolecules relevant to their functions. Engagement within the CASP community has traditionally been limited to the prediction season and the conference, with limited discourse in the 1.5 years between CASP seasons. CASP special interest groups (SIGs) were established in 2023 to encourage continuous dialogue within the community. The online seminar series has drawn global participation from across disciplines and career stages. This has facilitated cross-disciplinary discussions fostering collaborations. The archives of these seminars have become a vital learning tool for newcomers to the field, lowering the barrier to entry.

结构预测关键评估（CASP）汇集了不同的科学家群体，从深度学习专家到核磁共振专家，所有这些都旨在开发准确的预测算法，可以有效地表征与其功能相关的生物分子的结构方面。传统上，CASP社区的参与仅限于预测季节和会议，在CASP季节之间的1.5年里，讨论有限。CASP特别兴趣小组（SIGs）成立于2023年，旨在鼓励社区内的持续对话。该在线系列研讨会吸引了来自全球各个学科和职业阶段的参与者。这促进了跨学科讨论，促进了合作。这些研讨会的档案已成为该领域新手的重要学习工具，降低了进入门槛。

引用次数: 0

Enhancing RNA 3D Structure Prediction in CASP16: Integrating Physics-Based Modeling With Machine Learning for Improved Predictions. 在CASP16中增强RNA 3D结构预测：将基于物理的建模与机器学习集成以改进预测。

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics

Pub Date : 2026-01-01 Epub Date: 2025-06-09 DOI: 10.1002/prot.26856

Sicheng Zhang, Jun Li, Yuanzhe Zhou, Shi-Jie Chen

During the 16th Critical Assessment of Structure Prediction (CASP16), the Vfold team participated in the two RNA categories: RNA Monomers and RNA Multimers. The Vfold RNA structure prediction method is hierarchical and hybrid, incorporating physics-based models (Vfold2D and VfoldMCPX) for 2D structure prediction, template-based and molecular dynamics simulation-based models (Vfold-Pipeline, IsRNA and RNAJP) for 3D structure prediction. Additionally, Vfold integrates knowledge from templates and the state-of-the-art machine learning model AlphaFold3 into our physics-based models. This integration enhances the prediction accuracy. Here we describe the Vfold approach in CASP16 using selected targets and show how the integration of traditional structure prediction methods with machine learning models can improve RNA structure prediction accuracy.

在第16届结构预测关键评估（CASP16）期间，Vfold团队参与了RNA单体和RNA多聚体这两个RNA类别的测试。Vfold RNA结构预测方法是分层混合的，结合基于物理模型（Vfold2D和VfoldMCPX）进行二维结构预测，基于模板和基于分子动力学模拟模型（Vfold- pipeline、IsRNA和RNAJP）进行三维结构预测。此外，Vfold将模板中的知识和最先进的机器学习模型AlphaFold3集成到我们基于物理的模型中。这种集成提高了预测的准确性。在这里，我们使用选定的靶点描述了CASP16中的Vfold方法，并展示了传统结构预测方法与机器学习模型的集成如何提高RNA结构预测的准确性。

引用次数: 0

CASP16 Protein Monomer Structure Prediction Assessment. CASP16蛋白单体结构预测评估。

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics

Pub Date : 2026-01-01 Epub Date: 2025-08-17 DOI: 10.1002/prot.70031

Rongqing Yuan, Jing Zhang, Andriy Kryshtafovych, R Dustin Schaeffer, Jian Zhou, Qian Cong, Nick V Grishin

The assessment of monomer targets in the Critical Assessment of Structure Prediction Round 16 (CASP16) underscores that the problem of single-domain protein fold prediction is nearly solved-no target folds were incorrectly predicted across all Evaluation Units. However, challenges remain in accurately modeling truncated sequences, irregular secondary structures, and interaction-induced conformational changes. The release of AlphaFold3 (AF3) during CASP16, and its effective integration by many groups, demonstrated its superiority over AlphaFold2 (AF2), particularly in confidence estimation and model selection. Additional improvements in multiple sequence alignments (MSAs) and fragment-based prediction, that is, selecting the optimal fragment of the full sequence for modeling, also contributed to enhanced prediction accuracy. The top three groups-all from the Yang lab-consistently outperformed others across CASP16 monomer targets, reflecting their robust modeling pipelines and successful adoption of AF3. CASP16 also introduced three new challenges: Phase 0, in which stoichiometry was withheld; Phase 2, which supplied ~8000 MassiveFold models per target to test model selection strategies; and Model 6, which limited predictors to using MSAs provided by the organizers. While we evaluated group performance in these additional challenges, the insights gained were limited due to low participation and caveats in the design of experiments. We suggest improvements for the organization of these challenges and encourage broader engagement from the prediction community. The progress in monomer modeling from CASP15 to CASP16 was subtle, but more groups in CASP16 were able to outperform ColabFold, reflecting the community's improved ability in optimizing AF2 and the growing adoption of AF3. We anticipate that the recent release of the AF3 source code will stimulate future progress through user-driven optimization and innovations in model architecture. Finally, model ranking remains a persistent weakness across most groups, highlighting a critical area for future development.

在结构预测关键评估第16轮（CASP16）中对单体靶标的评估强调了单域蛋白折叠预测的问题几乎得到了解决-所有评估单元中没有错误预测目标折叠。然而，在精确建模截断序列、不规则二级结构和相互作用引起的构象变化方面仍然存在挑战。在CASP16期间，AlphaFold3 （AF3）的释放，以及它被许多组有效整合，证明了它比AlphaFold2 （AF2）的优势，特别是在置信度估计和模型选择方面。在多序列比对（msa）和基于片段的预测方面的其他改进，即选择完整序列的最佳片段进行建模，也有助于提高预测精度。来自Yang实验室的前三组在CASP16单体靶标上的表现始终优于其他组，这反映了他们强大的建模管道和AF3的成功采用。CASP16还引入了三个新的挑战：第0阶段，化学计量学被保留；第二阶段，为每个目标提供约8000个MassiveFold模型，以测试模型选择策略；模型6，它限制了预测者使用组织者提供的msa。虽然我们在这些额外的挑战中评估了小组的表现，但由于实验设计中的参与率低和注意事项，所获得的见解有限。我们建议改进这些挑战的组织，并鼓励预测社区更广泛的参与。从CASP15到CASP16的单体建模的进展是微妙的，但CASP16中的更多组能够优于ColabFold，这反映了社区优化AF2的能力提高以及AF3的越来越多的采用。我们期望最近发布的AF3源代码将通过用户驱动的优化和模型架构的创新来促进未来的发展。最后，在大多数群体中，模型排名仍然是一个持续的弱点，这突出了未来发展的一个关键领域。

{"title":"CASP16 Protein Monomer Structure Prediction Assessment.","authors":"Rongqing Yuan, Jing Zhang, Andriy Kryshtafovych, R Dustin Schaeffer, Jian Zhou, Qian Cong, Nick V Grishin","doi":"10.1002/prot.70031","DOIUrl":"10.1002/prot.70031","url":null,"abstract":"The assessment of monomer targets in the Critical Assessment of Structure Prediction Round 16 (CASP16) underscores that the problem of single-domain protein fold prediction is nearly solved-no target folds were incorrectly predicted across all Evaluation Units. However, challenges remain in accurately modeling truncated sequences, irregular secondary structures, and interaction-induced conformational changes. The release of AlphaFold3 (AF3) during CASP16, and its effective integration by many groups, demonstrated its superiority over AlphaFold2 (AF2), particularly in confidence estimation and model selection. Additional improvements in multiple sequence alignments (MSAs) and fragment-based prediction, that is, selecting the optimal fragment of the full sequence for modeling, also contributed to enhanced prediction accuracy. The top three groups-all from the Yang lab-consistently outperformed others across CASP16 monomer targets, reflecting their robust modeling pipelines and successful adoption of AF3. CASP16 also introduced three new challenges: Phase 0, in which stoichiometry was withheld; Phase 2, which supplied ~8000 MassiveFold models per target to test model selection strategies; and Model 6, which limited predictors to using MSAs provided by the organizers. While we evaluated group performance in these additional challenges, the insights gained were limited due to low participation and caveats in the design of experiments. We suggest improvements for the organization of these challenges and encourage broader engagement from the prediction community. The progress in monomer modeling from CASP15 to CASP16 was subtle, but more groups in CASP16 were able to outperform ColabFold, reflecting the community's improved ability in optimizing AF2 and the growing adoption of AF3. We anticipate that the recent release of the AF3 source code will stimulate future progress through user-driven optimization and innovations in model architecture. Finally, model ranking remains a persistent weakness across most groups, highlighting a critical area for future development.","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"86-105"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750037/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144876980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Model Quality Assessment for CASP16. CASP16模型质量评价。

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics

Pub Date : 2026-01-01 Epub Date: 2025-08-22 DOI: 10.1002/prot.70037

Alisia Fadini, Gabriel Studer, Randy J Read

The CASP16 evaluation of model accuracy (EMA) experiment assessed the ability of predictors to estimate the accuracy of predicted models, with a particular emphasis on multimeric assemblies. Expanding on the CASP15 framework, CASP16 introduced a new evaluation mode (QMODE3) focused on selecting high-quality models from large-scale AlphaFold2-derived model pools generated by MassiveFold. Three primary evaluation tasks were therefore conducted: QMODE1 assessed global structure accuracy, QMODE2 focused on the accuracy of interface residues, and QMODE3 tested model selection performance. Predictors were evaluated using a diverse set of OpenStructure-based metrics, and a novel penalty-based ranking scheme was developed for QMODE3 to handle score interdependence and varying prediction quality distributions. Additionally, we explored the accuracy and utility of predicted local confidence measures now made available on a per-atom basis by methods that invoke AlphaFold3. Results showed that methods incorporating AlphaFold3-derived features-particularly per-atom pLDDT-performed best in estimating local accuracy and in utility for experimental structure solution. For QMODE3, performance varied significantly across monomeric, homomeric, and heteromeric target categories and underscored the ongoing challenge of evaluating complex assemblies.

CASP16模型准确性评估（EMA）实验评估了预测者估计预测模型准确性的能力，特别强调了多聚体组装。在CASP15框架的基础上，CASP16引入了一种新的评估模式（QMODE3），侧重于从MassiveFold生成的大规模alphafold2衍生模型池中选择高质量的模型。因此进行了三个主要的评估任务：QMODE1评估全局结构精度，QMODE2侧重于界面残留物的精度，QMODE3测试模型选择性能。使用一组不同的基于openstructure的指标对预测器进行评估，并为QMODE3开发了一种新的基于惩罚的排名方案，以处理分数相互依赖和不同的预测质量分布。此外，我们还探讨了预测的局部置信度度量的准确性和实用性，这些度量现在可以通过调用AlphaFold3的方法在每个原子的基础上获得。结果表明，结合alphafold3衍生特征的方法-特别是每个原子plddt -在估计局部精度和实验结构解决方案的实用性方面表现最好。对于QMODE3来说，性能在单体、同质和异质目标类别之间变化很大，并且强调了评估复杂组件的持续挑战。

{"title":"Model Quality Assessment for CASP16.","authors":"Alisia Fadini, Gabriel Studer, Randy J Read","doi":"10.1002/prot.70037","DOIUrl":"10.1002/prot.70037","url":null,"abstract":"The CASP16 evaluation of model accuracy (EMA) experiment assessed the ability of predictors to estimate the accuracy of predicted models, with a particular emphasis on multimeric assemblies. Expanding on the CASP15 framework, CASP16 introduced a new evaluation mode (QMODE3) focused on selecting high-quality models from large-scale AlphaFold2-derived model pools generated by MassiveFold. Three primary evaluation tasks were therefore conducted: QMODE1 assessed global structure accuracy, QMODE2 focused on the accuracy of interface residues, and QMODE3 tested model selection performance. Predictors were evaluated using a diverse set of OpenStructure-based metrics, and a novel penalty-based ranking scheme was developed for QMODE3 to handle score interdependence and varying prediction quality distributions. Additionally, we explored the accuracy and utility of predicted local confidence measures now made available on a per-atom basis by methods that invoke AlphaFold3. Results showed that methods incorporating AlphaFold3-derived features-particularly per-atom pLDDT-performed best in estimating local accuracy and in utility for experimental structure solution. For QMODE3, performance varied significantly across monomeric, homomeric, and heteromeric target categories and underscored the ongoing challenge of evaluating complex assemblies.","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"302-313"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750031/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144980309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Updates to the CASP Infrastructure in 2024. 2024年CASP基础设施的更新。

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics

Pub Date : 2026-01-01 Epub Date: 2025-09-01 DOI: 10.1002/prot.70042

Andriy Kryshtafovych, Maciej Milostan, Marc F Lensink, Sameer Velankar, Alexandre M J J Bonvin, John Moult, Krzysztof Fidelis

CASP (critical assessment of structure prediction) conducts community experiments to determine the state of the art in calculating macromolecular structures. The CASP data management system is continually evolving to address the changing needs of the experiments. For CASP16, we expanded the infrastructure to enable data handling of newly introduced categories and fully support pilot categories introduced in CASP15. This technical note also documents the integration of the CASP and CAPRI (Critical Assessment of PRedicted Interactions) systems.

CASP（结构预测的关键评估）进行社区实验，以确定计算大分子结构的最新技术。CASP数据管理系统不断发展，以满足不断变化的实验需求。对于CASP16，我们扩展了基础设施，使其能够处理新引入的类别，并完全支持CASP15中引入的试点类别。该技术说明还记录了CASP和CAPRI（预测相互作用的关键评估）系统的集成。

引用次数: 0

Accurate Biomolecular Structure Prediction in CASP16 With Optimized Inputs to State-Of-The-Art Predictors. 基于最先进预测器优化输入的CASP16精确生物分子结构预测

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics

Pub Date : 2026-01-01 Epub Date: 2025-08-05 DOI: 10.1002/prot.70030

Wenkai Wang, Yuxian Luo, Zhenling Peng, Jianyi Yang

Biomolecular structure prediction has reached an unprecedented level of accuracy, partly attributed to the use of advanced deep learning algorithms. We participated in the CASP16 experiments across the categories of protein domains, protein multimers, and RNA monomers, achieving official rankings of first, second, and fourth (top for server groups), respectively. We hypothesized that by leveraging state-of-the-art structure predictors such as AlphaFold2, AlphaFold3, trRosettaX2, and trRosettaRNA2, accurate structure predictions could be achieved through careful optimization of input information. For protein structure prediction, we enhanced the input sequences by removing intrinsically disordered regions, a simple yet effective approach that yielded accurate models for protein domains. However, fewer than 25% of the protein multimers were predicted with high quality. In RNA structure prediction, optimizing the secondary structure input for trRosettaRNA2 resulted in more accurate predictions than AlphaFold3. In summary, our prediction results in CASP16 indicate that protein domain structure prediction has achieved high accuracy. However, predicting protein multimers and RNA structures remains challenging, and we anticipate new advancements in these areas in the coming years.

生物分子结构预测的准确性达到了前所未有的水平，部分原因是使用了先进的深度学习算法。我们参与了CASP16蛋白结构域、蛋白多聚体和RNA单体的实验，分别获得了官方排名第一、第二和第四（服务器组排名第一）。我们假设，通过利用最先进的结构预测器，如AlphaFold2、AlphaFold3、trRosettaX2和trRosettaRNA2，可以通过仔细优化输入信息来实现准确的结构预测。对于蛋白质结构预测，我们通过去除内在无序区域来增强输入序列，这是一种简单而有效的方法，可以产生准确的蛋白质结构域模型。然而，不到25%的蛋白多聚体被预测为高质量。在RNA结构预测中，优化trRosettaRNA2的二级结构输入比AlphaFold3的预测更准确。综上所述，我们在CASP16上的预测结果表明，蛋白质结构域的预测达到了较高的准确性。然而，预测蛋白质多聚体和RNA结构仍然具有挑战性，我们预计在未来几年这些领域将取得新的进展。

{"title":"Accurate Biomolecular Structure Prediction in CASP16 With Optimized Inputs to State-Of-The-Art Predictors.","authors":"Wenkai Wang, Yuxian Luo, Zhenling Peng, Jianyi Yang","doi":"10.1002/prot.70030","DOIUrl":"10.1002/prot.70030","url":null,"abstract":"Biomolecular structure prediction has reached an unprecedented level of accuracy, partly attributed to the use of advanced deep learning algorithms. We participated in the CASP16 experiments across the categories of protein domains, protein multimers, and RNA monomers, achieving official rankings of first, second, and fourth (top for server groups), respectively. We hypothesized that by leveraging state-of-the-art structure predictors such as AlphaFold2, AlphaFold3, trRosettaX2, and trRosettaRNA2, accurate structure predictions could be achieved through careful optimization of input information. For protein structure prediction, we enhanced the input sequences by removing intrinsically disordered regions, a simple yet effective approach that yielded accurate models for protein domains. However, fewer than 25% of the protein multimers were predicted with high quality. In RNA structure prediction, optimizing the secondary structure input for trRosettaRNA2 resulted in more accurate predictions than AlphaFold3. In summary, our prediction results in CASP16 indicate that protein domain structure prediction has achieved high accuracy. However, predicting protein multimers and RNA structures remains challenging, and we anticipate new advancements in these areas in the coming years.","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"142-153"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144786043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Graph_RG: Dominating CASP16's Small Molecule Affinity Prediction Subcategory-A Pose-Free Framework for Billion-Scale Virtual Screening. Graph_RG：支配CASP16小分子亲和预测亚类-十亿尺度虚拟筛选的无姿态框架。

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics

Pub Date : 2026-01-01 Epub Date: 2025-06-20 DOI: 10.1002/prot.70010

Haiping Zhang

Protein-ligand interaction prediction is pivotal in early-stage drug development, enabling large-scale virtual screening, drug optimization, and reverse target searching. In this work, we present Graph_RG, our top-performing model in the CASP16 small molecule track's protein-ligand affinity prediction category, achieving a N-weighted Kendall's Tau of 0.42-significantly outperforming other submissions (second-best: 0.36). Beyond accuracy, Graph_RG is noncomplex dependent, hence exhibits exceptional computational efficiency, operating > 100 000× faster than conformation-search dependent prediction methods, thus enabling billion- to 10-billion-scale screening on standard servers. We further discuss the potential improvements for Graph_RG, including dataset optimization, atomic vector representation enhancements, and model architecture upgrades. We also introduce the potential broader applications in large-scale drug screening, reverse target identification, and GPCR-specific drug discovery. We also point out the development of an interactive web platform hosting Graph_RG and its derivative models to enhance accessibility. By integrating community feedback and iterative model refinement, this initiative bridges the gap between AI-driven predictions and practical drug discovery, fostering advancements in both computational methodologies and biomedical applications.

蛋白质-配体相互作用预测在早期药物开发中至关重要，可以实现大规模的虚拟筛选、药物优化和反向靶标搜索。在这项工作中，我们提出了Graph_RG，这是我们在CASP16小分子轨道的蛋白质配体亲和预测类别中表现最好的模型，实现了0.42的n加权Kendall's Tau，显著优于其他提交的模型（第二好：0.36）。除了准确性之外，Graph_RG是非复杂依赖的，因此表现出卓越的计算效率，运行速度比构象搜索依赖的预测方法快100万倍，因此可以在标准服务器上进行十亿到100亿规模的筛选。我们进一步讨论了Graph_RG的潜在改进，包括数据集优化、原子向量表示增强和模型架构升级。我们还介绍了在大规模药物筛选，反向靶标鉴定和gpcr特异性药物发现方面的潜在更广泛的应用。我们还指出了托管Graph_RG及其衍生模型的交互式web平台的开发，以增强可访问性。通过整合社区反馈和迭代模型改进，该计划弥合了人工智能驱动的预测与实际药物发现之间的差距，促进了计算方法和生物医学应用的进步。

{"title":"Graph_RG: Dominating CASP16's Small Molecule Affinity Prediction Subcategory-A Pose-Free Framework for Billion-Scale Virtual Screening.","authors":"Haiping Zhang","doi":"10.1002/prot.70010","DOIUrl":"10.1002/prot.70010","url":null,"abstract":"Protein-ligand interaction prediction is pivotal in early-stage drug development, enabling large-scale virtual screening, drug optimization, and reverse target searching. In this work, we present Graph_RG, our top-performing model in the CASP16 small molecule track's protein-ligand affinity prediction category, achieving a N-weighted Kendall's Tau of 0.42-significantly outperforming other submissions (second-best: 0.36). Beyond accuracy, Graph_RG is noncomplex dependent, hence exhibits exceptional computational efficiency, operating > 100 000× faster than conformation-search dependent prediction methods, thus enabling billion- to 10-billion-scale screening on standard servers. We further discuss the potential improvements for Graph_RG, including dataset optimization, atomic vector representation enhancements, and model architecture upgrades. We also introduce the potential broader applications in large-scale drug screening, reverse target identification, and GPCR-specific drug discovery. We also point out the development of an interactive web platform hosting Graph_RG and its derivative models to enhance accessibility. By integrating community feedback and iterative model refinement, this initiative bridges the gap between AI-driven predictions and practical drug discovery, fostering advancements in both computational methodologies and biomedical applications.","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"286-294"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144334487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Structure Modeling Protocols for Protein Multimer and RNA in CASP16 With Enhanced MSAs, Model Ranking, and Deep Learning. CASP16蛋白多聚体和RNA的结构建模协议与增强的msa，模型排序和深度学习。

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics

Pub Date : 2026-01-01 Epub Date: 2025-08-01 DOI: 10.1002/prot.70033

Yuki Kagaya, Tsukasa Nakamura, Jacob Verburgt, Anika Jain, Genki Terashi, Pranav Punuru, Emilia Tugolukova, Joon Hong Park, Anouka Saha, David Huang, Daisuke Kihara

We present the methods and results of our protein complex and RNA structure predictions at CASP16. Our approach integrated multiple state-of-the-art deep learning models with a consensus-based scoring method. To enhance the depth of multiple sequence alignments (MSAs), we employed a large metagenomic sequence database. Model ranking was performed with a state-of-the-art consensus ranking method, to which we added more scoring terms. These predictions were further refined manually based on literature evidence. For RNA, we adopted an ensemble approach that incorporated multiple state-of-the-art methods, centered around our NuFold framework. As a result, our KiharaLab group ranked first in protein complex prediction and third in RNA structure prediction. A detailed analysis of targets that significantly differed from those of other groups highlighted both the strengths of our MSA and scoring strategies, as well as areas requiring further improvement.

我们介绍了CASP16蛋白复合物和RNA结构预测的方法和结果。我们的方法将多个最先进的深度学习模型与基于共识的评分方法集成在一起。为了提高多序列比对（msa）的深度，我们使用了一个大型宏基因组序列数据库。模型排名是用最先进的共识排名方法进行的，我们增加了更多的评分项。这些预测是在文献证据的基础上进一步人工完善的。对于RNA，我们采用了一种集成方法，结合了多种最先进的方法，以NuFold框架为中心。因此，我们的KiharaLab小组在蛋白质复合物预测方面排名第一，在RNA结构预测方面排名第三。对与其他组显著不同的目标进行了详细分析，突出了我们的MSA和评分策略的优势，以及需要进一步改进的领域。

{"title":"Structure Modeling Protocols for Protein Multimer and RNA in CASP16 With Enhanced MSAs, Model Ranking, and Deep Learning.","authors":"Yuki Kagaya, Tsukasa Nakamura, Jacob Verburgt, Anika Jain, Genki Terashi, Pranav Punuru, Emilia Tugolukova, Joon Hong Park, Anouka Saha, David Huang, Daisuke Kihara","doi":"10.1002/prot.70033","DOIUrl":"10.1002/prot.70033","url":null,"abstract":"We present the methods and results of our protein complex and RNA structure predictions at CASP16. Our approach integrated multiple state-of-the-art deep learning models with a consensus-based scoring method. To enhance the depth of multiple sequence alignments (MSAs), we employed a large metagenomic sequence database. Model ranking was performed with a state-of-the-art consensus ranking method, to which we added more scoring terms. These predictions were further refined manually based on literature evidence. For RNA, we adopted an ensemble approach that incorporated multiple state-of-the-art methods, centered around our NuFold framework. As a result, our KiharaLab group ranked first in protein complex prediction and third in RNA structure prediction. A detailed analysis of targets that significantly differed from those of other groups highlighted both the strengths of our MSA and scoring strategies, as well as areas requiring further improvement.","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"167-182"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12321240/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144765849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0