Friedreich's ataxia (FRDA) is a neurodegenerative disorder caused by frataxin (FXN) deficiency, where protein replacement therapy is hampered by the inherent instability and aggregation propensity of wild-type (WT) FXN. The structural flexibility of Loop-1 (residues 115-123), a critical region within the acidic ridge, represents a key determinant of protein stability. This study introduces a computational pipeline integrating evolutionary conservation analysis (ConSurf) with diffusion-based de novo design (RFdiffusion) to redesign both the backbone and sequence of Loop-1. Through systematic filtration of 1000 ProteinMPNN-generated variants using aggregation propensity screening (AGGRESCAN) and 450 ns of molecular dynamics (MD) simulations, four lead candidates were identified. Design_188 (EERVGGREI) demonstrated optimal performance with 2.3-fold improvement in aggregation resistance (Na4vSS: -53.8 vs. -23.5 for WT), superior structural stability (RMSD: 0.486 nm), reduced conformational diversity (62.3% dominant cluster occupancy), and 93% retention of ISCU binding capacity (ΔΔG: +6.4 kcal/mol). Experimental validation through 15N NMR relaxation analysis confirmed computational predictions, with Design_188 exhibiting uniform backbone rigidification (S2 = 0.81-0.95) and strong MD-NMR correlation (Pearson r = 0.675, p = 0.003). SEC-MALS analysis demonstrated near-complete monomeric behavior (> 98% monomer content) compared to WT's heterogeneous oligomerization (68% monomer, 32% oligomers), directly confirming the predicted anti-aggregation properties. K-means clustering analysis revealed an inverse relationship between conformational heterogeneity and stability, while correlation analysis identified a fundamental trade-off between aggregation resistance and structural stability (r = -0.82, p < 0.01). This work establishes a generalizable framework for therapeutic protein engineering where backbone redesign enables conformational ensemble modulation beyond the limitations of sequence optimization alone.
弗里德赖希共济失调(FRDA)是一种由frataxin (FXN)缺乏引起的神经退行性疾病,其中蛋白质替代治疗受到野生型(WT) FXN固有的不稳定性和聚集倾向的阻碍。环-1(残基115-123)是酸性脊内的一个关键区域,其结构灵活性是蛋白质稳定性的关键决定因素。本研究引入了一种结合进化守恒分析(ConSurf)和基于扩散的从头设计(RFdiffusion)的计算管道来重新设计Loop-1的主干和序列。通过使用聚集倾向筛选(侵略者)和450 ns分子动力学(MD)模拟对1000个proteinmpnn生成的变异进行系统过滤,确定了4个主要候选基因。Design_188 (EERVGGREI)表现出最佳性能,其聚集阻力提高了2.3倍(Na4vSS: -53.8 vs. WT -23.5),优越的结构稳定性(RMSD: 0.486 nm),降低了构像多样性(62.3%的优势簇占用),并保持了93%的ISCU结合能力(ΔΔG: +6.4 kcal/mol)。通过15N NMR松弛分析进行的实验验证证实了计算预测,Design_188表现出均匀的骨干硬化(S2 = 0.81-0.95)和强的MD-NMR相关性(Pearson r = 0.675, p = 0.003)。SEC-MALS分析显示,与WT的非均相寡聚(68%单体,32%寡聚)相比,其单体行为接近完全(> 98%单体含量),直接证实了预测的抗聚集性能。K-means聚类分析揭示了构象异质性和结构稳定性之间的反比关系,而相关分析发现了聚集阻力和结构稳定性之间的基本权衡(r = -0.82, p
{"title":"Integration of Evolutionary Analysis With RFdiffusion for De Novo Design of Aggregation-Resistant Frataxin.","authors":"Kevser Kübra Kırboğa, Ecir Uğur Küçüksille","doi":"10.1002/prot.70114","DOIUrl":"https://doi.org/10.1002/prot.70114","url":null,"abstract":"<p><p>Friedreich's ataxia (FRDA) is a neurodegenerative disorder caused by frataxin (FXN) deficiency, where protein replacement therapy is hampered by the inherent instability and aggregation propensity of wild-type (WT) FXN. The structural flexibility of Loop-1 (residues 115-123), a critical region within the acidic ridge, represents a key determinant of protein stability. This study introduces a computational pipeline integrating evolutionary conservation analysis (ConSurf) with diffusion-based de novo design (RFdiffusion) to redesign both the backbone and sequence of Loop-1. Through systematic filtration of 1000 ProteinMPNN-generated variants using aggregation propensity screening (AGGRESCAN) and 450 ns of molecular dynamics (MD) simulations, four lead candidates were identified. Design_188 (EERVGGREI) demonstrated optimal performance with 2.3-fold improvement in aggregation resistance (Na4vSS: -53.8 vs. -23.5 for WT), superior structural stability (RMSD: 0.486 nm), reduced conformational diversity (62.3% dominant cluster occupancy), and 93% retention of ISCU binding capacity (ΔΔG: +6.4 kcal/mol). Experimental validation through <sup>15</sup>N NMR relaxation analysis confirmed computational predictions, with Design_188 exhibiting uniform backbone rigidification (S<sup>2</sup> = 0.81-0.95) and strong MD-NMR correlation (Pearson r = 0.675, p = 0.003). SEC-MALS analysis demonstrated near-complete monomeric behavior (> 98% monomer content) compared to WT's heterogeneous oligomerization (68% monomer, 32% oligomers), directly confirming the predicted anti-aggregation properties. K-means clustering analysis revealed an inverse relationship between conformational heterogeneity and stability, while correlation analysis identified a fundamental trade-off between aggregation resistance and structural stability (r = -0.82, p < 0.01). This work establishes a generalizable framework for therapeutic protein engineering where backbone redesign enables conformational ensemble modulation beyond the limitations of sequence optimization alone.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146013610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Samuel A Barker, Porter K Ellis, Andrew Hammer, Sean J Johnson, Nicholas E Dickenson
Yersinia pestis was responsible for the Black Plague, one of the worst epidemiological disasters in recorded history. Today, Y. pestis, Y. enterocolitica, and Y. pseudotuberculosis remain clinically relevant human pathogens. Each of these pathogenic Yersinia species relies on a Type Three Secretion System (T3SS) for virulence, with the ATPase YscN playing a critical role in T3SS function. T3SS ATPases are responsible for powering apparatus formation and effector protein secretion through ATP hydrolysis. This study provides an extensive enzymatic characterization of recombinant YscN under several conditions, including variable pH and temperature, substrate and protein concentrations, and in the presence of putative inhibitors. Thermal stability data, assessed by circular dichroism, demonstrate that YscN exhibits increased stability in alkaline conditions, coinciding with greatest ATPase activity. Further, we report the first high-resolution crystal structure of YscN and leverage homology data to model an oligomeric active site. Mutational analysis of a predicted active site residue confirms oligomerization as necessary for YscN ATPase activity and corroborates our oligomeric model and enzyme concentration-dependent specific activity. Interestingly, however, AUC analysis reveals that the purified YscN predominantly exists as a monomer, despite oligomerization-dependent active site formation. Thus, we propose that transient oligomeric interactions support the observed ATP hydrolysis. Together, these data uncover structural and environmental impacts on YscN activity that may support the highly specialized Yersinia pathogenic lifecycle and leverage its role in virulence in search of pan-effective small molecule T3SS ATPase inhibitors.
{"title":"Structural and Biophysical Characterization of the Yersinia Type Three Secretion System ATPase YscN.","authors":"Samuel A Barker, Porter K Ellis, Andrew Hammer, Sean J Johnson, Nicholas E Dickenson","doi":"10.1002/prot.70112","DOIUrl":"10.1002/prot.70112","url":null,"abstract":"<p><p>Yersinia pestis was responsible for the Black Plague, one of the worst epidemiological disasters in recorded history. Today, Y. pestis, Y. enterocolitica, and Y. pseudotuberculosis remain clinically relevant human pathogens. Each of these pathogenic Yersinia species relies on a Type Three Secretion System (T3SS) for virulence, with the ATPase YscN playing a critical role in T3SS function. T3SS ATPases are responsible for powering apparatus formation and effector protein secretion through ATP hydrolysis. This study provides an extensive enzymatic characterization of recombinant YscN under several conditions, including variable pH and temperature, substrate and protein concentrations, and in the presence of putative inhibitors. Thermal stability data, assessed by circular dichroism, demonstrate that YscN exhibits increased stability in alkaline conditions, coinciding with greatest ATPase activity. Further, we report the first high-resolution crystal structure of YscN and leverage homology data to model an oligomeric active site. Mutational analysis of a predicted active site residue confirms oligomerization as necessary for YscN ATPase activity and corroborates our oligomeric model and enzyme concentration-dependent specific activity. Interestingly, however, AUC analysis reveals that the purified YscN predominantly exists as a monomer, despite oligomerization-dependent active site formation. Thus, we propose that transient oligomeric interactions support the observed ATP hydrolysis. Together, these data uncover structural and environmental impacts on YscN activity that may support the highly specialized Yersinia pathogenic lifecycle and leverage its role in virulence in search of pan-effective small molecule T3SS ATPase inhibitors.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12863218/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146004844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A X Quintana-Armas, E Flores-Hernández, C Cardona-Félix, E Rudiño-Piñera
We present the first x-ray crystallographic structural evidence of an archaeal DNA ligase showing the AMP covalent adduct together with further cofactor hydrolysis, capturing a transient intermediary in the first step of the ligation reaction, triggered by the pyrophosphate hydrolysis. Our crystallographic models of Thermococcus gammatolerans DNA ligase (LigTgam), coupled with bioinformatic analysis of at least 28 crystallographic structures from ATP- and NAD+-dependent DNA ligases, highlight the central role of domain mobility. Notably, elevated B-values are consistently observed in key catalytic and binding regions, suggesting a link between structural flexibility and enzymatic efficiency. Remarkably, this pattern of high B-values is conserved in replicative ligases, including bacterial Lig A, indicating a broader evolutionary relevance. These fluctuations emphasize the importance of conformational adaptability in accommodating substrate DNA and facilitating catalytic steps, including adenylation and phosphodiester bond formation. In this work, we delve deeper into this dynamic behavior, providing evidence of its critical role in ligase function.
{"title":"Snapshots of Motion: A Novel Structural Intermediate Reveals Conserved Dynamics in Archaeal DNA Ligases.","authors":"A X Quintana-Armas, E Flores-Hernández, C Cardona-Félix, E Rudiño-Piñera","doi":"10.1002/prot.70116","DOIUrl":"10.1002/prot.70116","url":null,"abstract":"<p><p>We present the first x-ray crystallographic structural evidence of an archaeal DNA ligase showing the AMP covalent adduct together with further cofactor hydrolysis, capturing a transient intermediary in the first step of the ligation reaction, triggered by the pyrophosphate hydrolysis. Our crystallographic models of Thermococcus gammatolerans DNA ligase (LigTgam), coupled with bioinformatic analysis of at least 28 crystallographic structures from ATP- and NAD<sup>+</sup>-dependent DNA ligases, highlight the central role of domain mobility. Notably, elevated B-values are consistently observed in key catalytic and binding regions, suggesting a link between structural flexibility and enzymatic efficiency. Remarkably, this pattern of high B-values is conserved in replicative ligases, including bacterial Lig A, indicating a broader evolutionary relevance. These fluctuations emphasize the importance of conformational adaptability in accommodating substrate DNA and facilitating catalytic steps, including adenylation and phosphodiester bond formation. In this work, we delve deeper into this dynamic behavior, providing evidence of its critical role in ligase function.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146013518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The rise of flaviviral diseases, including West Nile virus (WNV), presents a growing threat to global public health and underscores the urgent need for new therapeutic strategies. The non-structural protein 3 helicase (NS3h) of the Orthoflavivirus genus, including WNV, is essential for viral replication and a promising antiviral target. Previously [Roy et al., Nucleic Acids Research, 52 (13), 2024, 7447-7464], we showed that the motif VI loop (VIL) in WNV NS3h functions as a nucleotide valve, regulating ADP affinity during hydrolysis. In this study, we uncover an ATP-dependent coupling between nucleotide affinity at motif VIL and RNA affinity at motifs IVa and V, suggesting a coordinated mechanism of ssRNA translocation. Using microsecond-scale all-atom molecular dynamics simulations of hydrolysis-cycle intermediates, we find that key VIL residues (R461, R464) correlate strongly with RNA phosphate affinity of motif V. Structural analyses reveal an ATP-sensitive interaction between E413 (motif V) and R461 (motif VIL) that modulates the conformation of the motif V 310-helix, thereby influencing RNA binding. This dynamic interaction is lost in catalytically deficient VIL mutants, which have been experimentally shown to impair hydrolysis and attenuate viral replication. These findings provide mechanistic insights into NS3h function and identify new opportunities for structure-based antiviral design.
包括西尼罗河病毒(WNV)在内的黄病毒疾病的增加对全球公共卫生构成了日益严重的威胁,并强调迫切需要新的治疗策略。包括西尼罗河病毒在内的正黄病毒属的非结构蛋白3解旋酶(NS3h)是病毒复制所必需的,也是一个有前景的抗病毒靶点。先前[Roy et al., Nucleic Acids Research, 52(13), 2024, 7447-7464],我们发现WNV NS3h的motif VI loop (VIL)作为核苷酸阀,在水解过程中调节ADP的亲和力。在这项研究中,我们揭示了基序VIL上的核苷酸亲和力与基序IVa和V上的RNA亲和力之间的atp依赖性偶联,提示了ssRNA易位的协调机制。利用微秒级水解循环中间体的全原子分子动力学模拟,我们发现关键的VIL残基(R461, R464)与基序V的RNA磷酸盐亲和力密切相关。结构分析显示E413(基序V)和R461(基序VIL)之间的atp敏感相互作用调节基序V 310-螺旋的构象,从而影响RNA结合。这种动态相互作用在催化缺陷的VIL突变体中丢失,实验表明这种突变体会损害水解和减弱病毒复制。这些发现提供了NS3h功能的机制见解,并为基于结构的抗病毒设计提供了新的机会。
{"title":"Allosteric Regulation of RNA Affinity by Motif V-VI Coupling in West Nile Virus NS3 Helicase.","authors":"Priti Roy, Martin McCullagh","doi":"10.1002/prot.70113","DOIUrl":"10.1002/prot.70113","url":null,"abstract":"<p><p>The rise of flaviviral diseases, including West Nile virus (WNV), presents a growing threat to global public health and underscores the urgent need for new therapeutic strategies. The non-structural protein 3 helicase (NS3h) of the Orthoflavivirus genus, including WNV, is essential for viral replication and a promising antiviral target. Previously [Roy et al., Nucleic Acids Research, 52 (13), 2024, 7447-7464], we showed that the motif VI loop (VIL) in WNV NS3h functions as a nucleotide valve, regulating ADP affinity during hydrolysis. In this study, we uncover an ATP-dependent coupling between nucleotide affinity at motif VIL and RNA affinity at motifs IVa and V, suggesting a coordinated mechanism of ssRNA translocation. Using microsecond-scale all-atom molecular dynamics simulations of hydrolysis-cycle intermediates, we find that key VIL residues (R461, R464) correlate strongly with RNA phosphate affinity of motif V. Structural analyses reveal an ATP-sensitive interaction between E413 (motif V) and R461 (motif VIL) that modulates the conformation of the motif V 3<sub>10</sub>-helix, thereby influencing RNA binding. This dynamic interaction is lost in catalytically deficient VIL mutants, which have been experimentally shown to impair hydrolysis and attenuate viral replication. These findings provide mechanistic insights into NS3h function and identify new opportunities for structure-based antiviral design.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12882695/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146013573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GnRH1 binds to its receptor GnRH1R to stimulate release of FSH and LH. Earlier NMR analysis had reported several possible conformers of GnRH1; however, the biologically active conformation of GnRH1 is not identified so far. Here, molecular docking of different NMR conformers of GnRH1 to GnRH1R is performed. Based on: (a) residues of GnRH1R interacting with antagonist elagolix (as ligand-binding pocket), (b) intermolecular hydrogen bonds (for specificity of interaction), and (c) total intermolecular non-covalent interactions (for stability of interaction), one NMR conformation, having an asymmetric U-turn reverse coil structure with a beta strand comprised of residues Gly6 and Leu7, is identified as the bioactive conformation of GnRH1. Further, the identified bioactive NMR conformation of GnRH1 is used to explain in vivo GnRH1-neutralizing ability of monoclonal antibody (mAb) F1D3C5 and lack of neutralization by another mAb E2D2H12. In mice, F1D3C5 completely blocks estrus cycle, while E2D2H12, despite having a relatively higher affinity for GnRH1 in ELISA, does not alter the estrus cycle. Molecular docking of the identified bioactive NMR conformation of GnRH1 to homology models of scFv attributes in vivo neutralizing ability of F1D3C5 to structure-specific recognition of GnRH1. The bioactive conformation of GnRH1 identified here could guide co-crystallization studies, design of analogs and GnRH1 vaccination efforts.
{"title":"Identifying Bioactive Conformation of GnRH1 Based on Molecular Docking of NMR Conformers to GnRH1R and mAbs.","authors":"Madhavi Latha Yadav Bangaru, Anjali Anoop Karande, Nidhanapati Karanam Raghavendra","doi":"10.1002/prot.70111","DOIUrl":"https://doi.org/10.1002/prot.70111","url":null,"abstract":"<p><p>GnRH1 binds to its receptor GnRH1R to stimulate release of FSH and LH. Earlier NMR analysis had reported several possible conformers of GnRH1; however, the biologically active conformation of GnRH1 is not identified so far. Here, molecular docking of different NMR conformers of GnRH1 to GnRH1R is performed. Based on: (a) residues of GnRH1R interacting with antagonist elagolix (as ligand-binding pocket), (b) intermolecular hydrogen bonds (for specificity of interaction), and (c) total intermolecular non-covalent interactions (for stability of interaction), one NMR conformation, having an asymmetric U-turn reverse coil structure with a beta strand comprised of residues Gly6 and Leu7, is identified as the bioactive conformation of GnRH1. Further, the identified bioactive NMR conformation of GnRH1 is used to explain in vivo GnRH1-neutralizing ability of monoclonal antibody (mAb) F1D3C5 and lack of neutralization by another mAb E2D2H12. In mice, F1D3C5 completely blocks estrus cycle, while E2D2H12, despite having a relatively higher affinity for GnRH1 in ELISA, does not alter the estrus cycle. Molecular docking of the identified bioactive NMR conformation of GnRH1 to homology models of scFv attributes in vivo neutralizing ability of F1D3C5 to structure-specific recognition of GnRH1. The bioactive conformation of GnRH1 identified here could guide co-crystallization studies, design of analogs and GnRH1 vaccination efforts.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145913910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lijun Liu, Elizabeth K Harmon, Justin K Craig, Huili Yao, Kevin P Battaile, David K Johnson, Sandhya Subramanian, Wesley C Van Voorhis, Mario Rivera, Scott Lovell
Iron homeostasis in various pathogenic bacteria is regulated by bacterioferritins (Bfr) which function to store Fe3+ and release Fe2+ as needed for metabolic processes. The Bfr structure consists of 18 kDa subunits in which dimer pairs bind a heme molecule and are assembled into a highly symmetrical 24-meric spherical structure with an internal core diameter of approximately 80 Å. Release of iron is facilitated by the binding of a 7 kDa [2Fe-2S] ferredoxin (Bfd) to specific sites on the surface of Bfr which transfers electrons to the core thereby reducing the stored Fe3+ to Fe2+ for mobilization. The crystal structures of Bfr from Brucella abortus (Ba) in the apo and iron bound forms are presented and compared with those from Acinetobacter baumannii (Ab) and Pseudomonas aeruginosa (Pa). Additionally, models of the Bfr:Bfd complexes for Ba and Ab are provided and compared with the Pa complex. Finally, compounds known to target the Bfr:Bfd interaction in Pa were docked to the Ba and Ab structures which provided insight regarding the potential binding mode and inhibitory mechanism.
{"title":"Structural Analysis and Inhibitor Modeling of Bacterioferritin From Brucella abortus.","authors":"Lijun Liu, Elizabeth K Harmon, Justin K Craig, Huili Yao, Kevin P Battaile, David K Johnson, Sandhya Subramanian, Wesley C Van Voorhis, Mario Rivera, Scott Lovell","doi":"10.1002/prot.70109","DOIUrl":"10.1002/prot.70109","url":null,"abstract":"<p><p>Iron homeostasis in various pathogenic bacteria is regulated by bacterioferritins (Bfr) which function to store Fe<sup>3+</sup> and release Fe<sup>2+</sup> as needed for metabolic processes. The Bfr structure consists of 18 kDa subunits in which dimer pairs bind a heme molecule and are assembled into a highly symmetrical 24-meric spherical structure with an internal core diameter of approximately 80 Å. Release of iron is facilitated by the binding of a 7 kDa [2Fe-2S] ferredoxin (Bfd) to specific sites on the surface of Bfr which transfers electrons to the core thereby reducing the stored Fe<sup>3+</sup> to Fe<sup>2+</sup> for mobilization. The crystal structures of Bfr from Brucella abortus (Ba) in the apo and iron bound forms are presented and compared with those from Acinetobacter baumannii (Ab) and Pseudomonas aeruginosa (Pa). Additionally, models of the Bfr:Bfd complexes for Ba and Ab are provided and compared with the Pa complex. Finally, compounds known to target the Bfr:Bfd interaction in Pa were docked to the Ba and Ab structures which provided insight regarding the potential binding mode and inhibitory mechanism.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145893518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-01Epub Date: 2025-10-03DOI: 10.1002/prot.70053
Andreas Tosstorff, Markus G Rudolph, Jörg Benz, Bernd Kuhn, Christian Kramer, May Sharpe, Chia-Ying Huang, Alexander Metz, Julien Hazemann, Daniel Ritz, Aengus Mac Sweeney, Michael K Gilson
This paper presents the experimental protein-ligand datasets used as benchmarks in the CASP 16 blind prediction experiment-the first CASP round to incorporate targets from pharmaceutical discovery projects. We have assembled and characterized protein-ligand complexes for four proteins that are known or candidate drug targets: human chymase, human cathepsin G, human autotaxin, and the SARS-CoV-2 main protease. The collection encompasses over 200 co-crystal structures at resolutions better than 2.7 Å, paired with binding affinity measurements for approximately 160 compounds covering a broad affinity range (nanomolar to high micromolar). These data enabled the CASP16 pose-prediction and affinity-prediction challenges. Many systems feature potentially challenging characteristics, including chymase's electropositive surface and acidic ligands, which require proper handling of titratable ligand groups; autotaxin complexes with and without zinc coordination; and a SARS-CoV-2 protease crystal form exhibiting an unusually open active site conformation. We describe the experimental approaches-from protein production and crystallization to binding assay development-that yielded these reference data. Contributed by scientists at F. Hoffmann-La Roche and Idorsia Pharmaceuticals, these datasets represent actual drug discovery projects and therefore provide a realistic testbed for assessing how computational methods perform on pharmaceutically relevant targets. An accompanying paper in the present special journal issue provides a comprehensive assessment of the pose and affinity predictions for these pharmaceutical protein-ligand systems.
{"title":"The CASP 16 Experimental Protein-Ligand Datasets.","authors":"Andreas Tosstorff, Markus G Rudolph, Jörg Benz, Bernd Kuhn, Christian Kramer, May Sharpe, Chia-Ying Huang, Alexander Metz, Julien Hazemann, Daniel Ritz, Aengus Mac Sweeney, Michael K Gilson","doi":"10.1002/prot.70053","DOIUrl":"10.1002/prot.70053","url":null,"abstract":"<p><p>This paper presents the experimental protein-ligand datasets used as benchmarks in the CASP 16 blind prediction experiment-the first CASP round to incorporate targets from pharmaceutical discovery projects. We have assembled and characterized protein-ligand complexes for four proteins that are known or candidate drug targets: human chymase, human cathepsin G, human autotaxin, and the SARS-CoV-2 main protease. The collection encompasses over 200 co-crystal structures at resolutions better than 2.7 Å, paired with binding affinity measurements for approximately 160 compounds covering a broad affinity range (nanomolar to high micromolar). These data enabled the CASP16 pose-prediction and affinity-prediction challenges. Many systems feature potentially challenging characteristics, including chymase's electropositive surface and acidic ligands, which require proper handling of titratable ligand groups; autotaxin complexes with and without zinc coordination; and a SARS-CoV-2 protease crystal form exhibiting an unusually open active site conformation. We describe the experimental approaches-from protein production and crystallization to binding assay development-that yielded these reference data. Contributed by scientists at F. Hoffmann-La Roche and Idorsia Pharmaceuticals, these datasets represent actual drug discovery projects and therefore provide a realistic testbed for assessing how computational methods perform on pharmaceutically relevant targets. An accompanying paper in the present special journal issue provides a comprehensive assessment of the pose and affinity predictions for these pharmaceutical protein-ligand systems.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"79-85"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145214496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-01Epub Date: 2025-04-08DOI: 10.1002/prot.26827
Alex Morehead, Jian Liu, Pawan Neupane, Nabin Giri, Jianlin Cheng
Predicting the structure of ligands bound to proteins is a foundational problem in modern biotechnology and drug discovery, yet little is known about how to combine the predictions of protein-ligand structure (poses) produced by the latest deep learning methods to identify the best poses and how to accurately estimate the binding affinity between a protein target and a list of ligand candidates. Further, a blind benchmarking and assessment of protein-ligand structure and binding affinity prediction is necessary to ensure it generalizes well to new settings. Towards this end, we introduce MULTICOM_ligand, a deep learning-based protein-ligand structure and binding affinity prediction ensemble featuring structural consensus ranking for unsupervised pose ranking and a new deep generative flow matching model for joint structure and binding affinity prediction. Notably, MULTICOM_ligand ranked among the top-5 ligand prediction methods in both protein-ligand structure prediction and binding affinity prediction in the 16th Critical Assessment of Techniques for Structure Prediction (CASP16), demonstrating its efficacy and utility for real-world drug discovery efforts. The source code for MULTICOM_ligand is freely available on GitHub.
{"title":"Protein-Ligand Structure and Affinity Prediction in CASP16 Using a Geometric Deep Learning Ensemble and Flow Matching.","authors":"Alex Morehead, Jian Liu, Pawan Neupane, Nabin Giri, Jianlin Cheng","doi":"10.1002/prot.26827","DOIUrl":"10.1002/prot.26827","url":null,"abstract":"<p><p>Predicting the structure of ligands bound to proteins is a foundational problem in modern biotechnology and drug discovery, yet little is known about how to combine the predictions of protein-ligand structure (poses) produced by the latest deep learning methods to identify the best poses and how to accurately estimate the binding affinity between a protein target and a list of ligand candidates. Further, a blind benchmarking and assessment of protein-ligand structure and binding affinity prediction is necessary to ensure it generalizes well to new settings. Towards this end, we introduce MULTICOM_ligand, a deep learning-based protein-ligand structure and binding affinity prediction ensemble featuring structural consensus ranking for unsupervised pose ranking and a new deep generative flow matching model for joint structure and binding affinity prediction. Notably, MULTICOM_ligand ranked among the top-5 ligand prediction methods in both protein-ligand structure prediction and binding affinity prediction in the 16th Critical Assessment of Techniques for Structure Prediction (CASP16), demonstrating its efficacy and utility for real-world drug discovery efforts. The source code for MULTICOM_ligand is freely available on GitHub.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"295-301"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12353915/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143804833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-01Epub Date: 2025-10-15DOI: 10.1002/prot.70078
Luciano A Abriata, Matteo Dal Peraro
The 16th Critical Assessment of Structure Prediction benchmarked advancements in biomolecular modeling, particularly in the context of AlphaFold 2 and 3 systems. Protein monomer and domain prediction is largely solved, with barely any space for further improvements at the backbone level except for very specific details, irregular secondary structures, and mutational effects that remain challenging to predict. For protein assemblies, AF-based methods, especially when expertly guided or enhanced by servers like those from the Yang, Zheng/Zhang, and Cheng lab, show progress, though complex topologies and in particular antibody-antigen interactions are still difficult. Notably, a priori knowledge of stoichiometry significantly aids assembly prediction. Protein-ligand co-folding with AF3 demonstrated strong potential for pose prediction, outperforming many participants and some dedicated docking tools in baseline tests, but several caveats hold as discussed. Ligand affinity prediction is totally unreliable. Nucleic acid structure prediction lags considerably, heavily relying on 3D templates and expert human intervention, even AF3 showing substantial limitations. Overall, on all fronts, AF3's modeling capabilities are at or close to the state of the art; additionally, it shows slight improvements over AF2 and more detailed confidence metrics than it. We guide users on tool selection, realistic accuracy expectations, and persistent challenges, emphasizing the critical role of confidence metrics in interpreting AI-generated models.
{"title":"Practical Outcomes From CASP16 for Users in Need of Biomolecular Structure Prediction.","authors":"Luciano A Abriata, Matteo Dal Peraro","doi":"10.1002/prot.70078","DOIUrl":"10.1002/prot.70078","url":null,"abstract":"<p><p>The 16th Critical Assessment of Structure Prediction benchmarked advancements in biomolecular modeling, particularly in the context of AlphaFold 2 and 3 systems. Protein monomer and domain prediction is largely solved, with barely any space for further improvements at the backbone level except for very specific details, irregular secondary structures, and mutational effects that remain challenging to predict. For protein assemblies, AF-based methods, especially when expertly guided or enhanced by servers like those from the Yang, Zheng/Zhang, and Cheng lab, show progress, though complex topologies and in particular antibody-antigen interactions are still difficult. Notably, a priori knowledge of stoichiometry significantly aids assembly prediction. Protein-ligand co-folding with AF3 demonstrated strong potential for pose prediction, outperforming many participants and some dedicated docking tools in baseline tests, but several caveats hold as discussed. Ligand affinity prediction is totally unreliable. Nucleic acid structure prediction lags considerably, heavily relying on 3D templates and expert human intervention, even AF3 showing substantial limitations. Overall, on all fronts, AF3's modeling capabilities are at or close to the state of the art; additionally, it shows slight improvements over AF2 and more detailed confidence metrics than it. We guide users on tool selection, realistic accuracy expectations, and persistent challenges, emphasizing the critical role of confidence metrics in interpreting AI-generated models.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"435-446"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750028/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145294492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-01Epub Date: 2025-10-31DOI: 10.1002/prot.70068
Jing Zhang, Rongqing Yuan, Andriy Kryshtafovych, Jimin Pei, Rachael C Kretsch, R Dustin Schaeffer, Jian Zhou, Rhiju Das, Nick V Grishin, Qian Cong
The assessment of oligomer targets in the Critical Assessment of Structure Prediction Round 16 (CASP16) suggests that complex structure prediction remains an unsolved challenge. Even the leading groups can only predict slightly more than half of the targets to high accuracy. Most CASP16 groups relied on AlphaFold-Multimer (AFM) or AlphaFold3 (AF3) as their core modeling engines. By optimizing input MSAs, refining modeling constructs (using partial rather than full sequences), and employing massive model sampling and selection, top-performing groups were able to significantly outperform the default AFM/AF3 predictions. CASP16 also introduced two additional challenges: Phase 0, which required predictions without stoichiometry information, and Phase 2, which provided participants with thousands of models generated by MassiveFold (MF) to enable large-scale sampling for resource-limited groups. Across all phases, the MULTICOM series and Kiharalab emerged as top performers based on the quality of their best models. However, these groups did not have a strong advantage in model ranking, and thus their lead over other teams, such as Yang-Multimer and kozakovvajda, was less pronounced when evaluating only the first submitted models. Compared to CASP15, CASP16 showed moderate overall improvement, likely driven by the release of AF3 and the extensive model sampling employed by top groups. Several notable trends highlight frontiers for future development. First, the kozakovvajda group significantly outperformed others on antibody-antigen targets, achieving over a 60% success rate without relying on AFM or AF3 as their primary modeling framework, suggesting that alternative approaches may offer promising solutions for these difficult targets. Second, model ranking and selection continue to be major bottlenecks. The PEZYFoldings group demonstrated a notable advantage in selecting their best models as first models, suggesting that their pipeline for model ranking may offer important insights for the field. Finally, the Phase 0 experiment indicated moderate success in stoichiometry prediction; however, stoichiometry prediction remains challenging for high-order assemblies and targets that differ from available homologous templates. Overall, CASP16 demonstrated steady progress in multimer prediction while emphasizing the need for more effective model ranking strategies, improved stoichiometry prediction, and new modeling methods that extend beyond the current AF-based paradigm.
{"title":"Assessment of Protein Complex Predictions in CASP16: Are We Making Progress?","authors":"Jing Zhang, Rongqing Yuan, Andriy Kryshtafovych, Jimin Pei, Rachael C Kretsch, R Dustin Schaeffer, Jian Zhou, Rhiju Das, Nick V Grishin, Qian Cong","doi":"10.1002/prot.70068","DOIUrl":"10.1002/prot.70068","url":null,"abstract":"<p><p>The assessment of oligomer targets in the Critical Assessment of Structure Prediction Round 16 (CASP16) suggests that complex structure prediction remains an unsolved challenge. Even the leading groups can only predict slightly more than half of the targets to high accuracy. Most CASP16 groups relied on AlphaFold-Multimer (AFM) or AlphaFold3 (AF3) as their core modeling engines. By optimizing input MSAs, refining modeling constructs (using partial rather than full sequences), and employing massive model sampling and selection, top-performing groups were able to significantly outperform the default AFM/AF3 predictions. CASP16 also introduced two additional challenges: Phase 0, which required predictions without stoichiometry information, and Phase 2, which provided participants with thousands of models generated by MassiveFold (MF) to enable large-scale sampling for resource-limited groups. Across all phases, the MULTICOM series and Kiharalab emerged as top performers based on the quality of their best models. However, these groups did not have a strong advantage in model ranking, and thus their lead over other teams, such as Yang-Multimer and kozakovvajda, was less pronounced when evaluating only the first submitted models. Compared to CASP15, CASP16 showed moderate overall improvement, likely driven by the release of AF3 and the extensive model sampling employed by top groups. Several notable trends highlight frontiers for future development. First, the kozakovvajda group significantly outperformed others on antibody-antigen targets, achieving over a 60% success rate without relying on AFM or AF3 as their primary modeling framework, suggesting that alternative approaches may offer promising solutions for these difficult targets. Second, model ranking and selection continue to be major bottlenecks. The PEZYFoldings group demonstrated a notable advantage in selecting their best models as first models, suggesting that their pipeline for model ranking may offer important insights for the field. Finally, the Phase 0 experiment indicated moderate success in stoichiometry prediction; however, stoichiometry prediction remains challenging for high-order assemblies and targets that differ from available homologous templates. Overall, CASP16 demonstrated steady progress in multimer prediction while emphasizing the need for more effective model ranking strategies, improved stoichiometry prediction, and new modeling methods that extend beyond the current AF-based paradigm.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"106-130"},"PeriodicalIF":2.8,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750043/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145423594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}