首页 > 最新文献

Acta Crystallographica. Section D, Structural Biology最新文献

英文 中文
Off-target structural insights: ArnA and AcrB in bacterial membrane-protein cryo-EM analysis. 脱靶结构洞察:细菌膜蛋白冷冻电镜分析中的ArnA和AcrB。
IF 3.8 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-10-01 Epub Date: 2025-09-10 DOI: 10.1107/S2059798325007089
Mehmet Caliseki, Ufuk Borucu, Sathish K N Yadav, Christiane Schaffitzel, Burak Veli Kabasakal

Membrane-protein quality control in Escherichia coli involves coordinated actions of the AAA+ protease FtsH, the insertase YidC and the regulatory complex HflKC. These systems maintain proteostasis by facilitating membrane-protein insertion, folding and degradation. To gain structural insights into a putative complex formed by FtsH and YidC, we performed single-particle cryogenic electron microscopy on detergent-solubilized membrane samples, from which FtsH and YidC were purified using Ni-NTA affinity and size-exclusion chromatography. Although SDS-PAGE analysis indicated high purity of these proteins, cryo-EM data sets unexpectedly yielded high-resolution structures of ArnA and AcrB at 4.0 and 2.9 Å resolution, respectively. ArnA is a bifunctional enzyme involved in lipid A modification and polymyxin resistance, while AcrB is a multidrug efflux transporter of the AcrAB-TolC system. ArnA and AcrB, known Ni-NTA purification contaminants, were also consistently detected by mass spectrometry in Strep-Tactin affinity-purified samples, validating their presence independently of affinity-tag selection. ArnA, which is typically cytoplasmic, was consistently found in membrane-isolated samples, indicating an association with membrane components. Only 2D class averages corresponding to the cytoplasmic AAA+ domain of FtsH were observed; neither side views of full-length FtsH nor densities corresponding to an intact FtsH-YidC complex could be identified, due to the conformational flexibility of the FtsH complex and its transient interaction with YidC, which limited particle alignment and stable classification in cryo-EM data sets. Two-dimensional class averages revealed additional particles resembling GroEL and cytochrome bo3 oxidase. These results underscore the utility of cryo-EM in uncovering off-target yet structurally well defined complexes, which may reflect physiologically relevant interactions or purification biases during membrane-protein overexpression.

大肠杆菌的膜蛋白质量控制涉及AAA+蛋白酶FtsH、插入酶YidC和调控复合体HflKC的协调作用。这些系统通过促进膜蛋白的插入、折叠和降解来维持蛋白质稳态。为了深入了解FtsH和YidC可能形成的复合物的结构,我们对洗涤剂溶解的膜样品进行了单颗粒低温电子显微镜观察,使用Ni-NTA亲和层析和尺寸排除层析纯化了FtsH和YidC。虽然SDS-PAGE分析表明这些蛋白的纯度很高,但冷冻电镜数据集意外地获得了ArnA和AcrB的高分辨率结构,分别为4.0和2.9 Å分辨率。ArnA是参与脂质a修饰和多粘菌素耐药的双功能酶,而AcrB是acrabb - tolc系统的多药物外排转运体。已知的Ni-NTA纯化污染物ArnA和AcrB也在strep - tacn亲和纯化的样品中通过质谱法一致检测到,验证了它们独立于亲和标签选择的存在。ArnA是典型的细胞质,在膜分离样品中一致发现,表明与膜组分有关。仅观察到FtsH胞质AAA+结构域对应的2D类平均值;由于FtsH配合物的构象灵活性及其与YidC的瞬态相互作用,限制了低温电镜数据集中的粒子排列和稳定分类,因此既不能识别全长FtsH的侧面视图,也不能识别完整FtsH-YidC配合物对应的密度。二维类平均显示额外的颗粒类似于GroEL和细胞色素bo3氧化酶。这些结果强调了冷冻电镜在揭示脱靶但结构明确的复合物方面的效用,这些复合物可能反映了膜蛋白过表达过程中生理相关的相互作用或纯化偏差。
{"title":"Off-target structural insights: ArnA and AcrB in bacterial membrane-protein cryo-EM analysis.","authors":"Mehmet Caliseki, Ufuk Borucu, Sathish K N Yadav, Christiane Schaffitzel, Burak Veli Kabasakal","doi":"10.1107/S2059798325007089","DOIUrl":"10.1107/S2059798325007089","url":null,"abstract":"<p><p>Membrane-protein quality control in Escherichia coli involves coordinated actions of the AAA+ protease FtsH, the insertase YidC and the regulatory complex HflKC. These systems maintain proteostasis by facilitating membrane-protein insertion, folding and degradation. To gain structural insights into a putative complex formed by FtsH and YidC, we performed single-particle cryogenic electron microscopy on detergent-solubilized membrane samples, from which FtsH and YidC were purified using Ni-NTA affinity and size-exclusion chromatography. Although SDS-PAGE analysis indicated high purity of these proteins, cryo-EM data sets unexpectedly yielded high-resolution structures of ArnA and AcrB at 4.0 and 2.9 Å resolution, respectively. ArnA is a bifunctional enzyme involved in lipid A modification and polymyxin resistance, while AcrB is a multidrug efflux transporter of the AcrAB-TolC system. ArnA and AcrB, known Ni-NTA purification contaminants, were also consistently detected by mass spectrometry in Strep-Tactin affinity-purified samples, validating their presence independently of affinity-tag selection. ArnA, which is typically cytoplasmic, was consistently found in membrane-isolated samples, indicating an association with membrane components. Only 2D class averages corresponding to the cytoplasmic AAA+ domain of FtsH were observed; neither side views of full-length FtsH nor densities corresponding to an intact FtsH-YidC complex could be identified, due to the conformational flexibility of the FtsH complex and its transient interaction with YidC, which limited particle alignment and stable classification in cryo-EM data sets. Two-dimensional class averages revealed additional particles resembling GroEL and cytochrome bo<sub>3</sub> oxidase. These results underscore the utility of cryo-EM in uncovering off-target yet structurally well defined complexes, which may reflect physiologically relevant interactions or purification biases during membrane-protein overexpression.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":" ","pages":"545-557"},"PeriodicalIF":3.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12485490/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Validation of helical symmetry parameters in the EMDB. 在EMDB中验证螺旋对称参数。
IF 3.8 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-10-01 Epub Date: 2025-09-04 DOI: 10.1107/S2059798325007260
Daoyi Li, María Muñoz Pérez, Xiaoqi Zhang, Jiaqing Li, Wen Jiang

Helical symmetry is a structural feature of many biological assemblies, including cytoskeletons, viruses and pathological amyloid fibrils. The helical parameters twist and rise are unique metadata for helical structures. With the increasing number of helical structures being resolved through cryo-EM and deposited in the EMDB, there is a growing possibility of errors in the metadata associated with these entries. During our cryo-EM analysis of protein amyloids and the development of helical analysis tools, we realized that many deposited helical parameters appear to be inconsistent with the associated density maps. Here, we have developed a comprehensive validation process that examines the consistency of these parameters by combining high-throughput computational evaluation with manual verification. Multiple errors were identified and corrected for ∼14% of the total entries, including missing parameters, swapped twist and rise values, incorrect sign of twist angles, partial symmetries and bona fide errors. Our validation code, workflow and the validated parameters are publicly available.

螺旋对称是许多生物组件的结构特征,包括细胞骨架、病毒和病理性淀粉样原纤维。螺旋参数twist和rise是螺旋结构的独特元数据。随着越来越多的螺旋结构通过低温电镜解析并沉积在EMDB中,与这些条目相关的元数据中出现错误的可能性越来越大。在我们对蛋白质淀粉样蛋白的冷冻电镜分析和螺旋分析工具的开发过程中,我们意识到许多沉积的螺旋参数似乎与相关的密度图不一致。在这里,我们开发了一个全面的验证过程,通过结合高通量计算评估和手动验证来检查这些参数的一致性。发现并纠正了约14%的总输入的多个错误,包括缺少参数、扭曲和上升值交换、扭曲角符号不正确、部分对称和真实错误。我们的验证代码、工作流和验证参数都是公开可用的。
{"title":"Validation of helical symmetry parameters in the EMDB.","authors":"Daoyi Li, María Muñoz Pérez, Xiaoqi Zhang, Jiaqing Li, Wen Jiang","doi":"10.1107/S2059798325007260","DOIUrl":"10.1107/S2059798325007260","url":null,"abstract":"<p><p>Helical symmetry is a structural feature of many biological assemblies, including cytoskeletons, viruses and pathological amyloid fibrils. The helical parameters twist and rise are unique metadata for helical structures. With the increasing number of helical structures being resolved through cryo-EM and deposited in the EMDB, there is a growing possibility of errors in the metadata associated with these entries. During our cryo-EM analysis of protein amyloids and the development of helical analysis tools, we realized that many deposited helical parameters appear to be inconsistent with the associated density maps. Here, we have developed a comprehensive validation process that examines the consistency of these parameters by combining high-throughput computational evaluation with manual verification. Multiple errors were identified and corrected for ∼14% of the total entries, including missing parameters, swapped twist and rise values, incorrect sign of twist angles, partial symmetries and bona fide errors. Our validation code, workflow and the validated parameters are publicly available.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":" ","pages":"527-534"},"PeriodicalIF":3.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12485488/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144991244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding domain swapping in the c-Src SH3 domain through hinge-loop mutagenesis. 通过铰链环诱变了解c-Src SH3结构域交换。
IF 3.8 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-09-01 Epub Date: 2025-08-27 DOI: 10.1107/S2059798325006977
M Carmen Salinas-Garcia, Marina Plaza-Garrido, Jose C Martinez, Ana Camara-Artigas

The c-Src SH3 domain is one of the best-characterized modular domains from a biophysical and structural point of view. This SH3 domain displays noncanonical alternative folding, forming 3D domain-swapped oligomers and amyloid fibrils. These features make this small protein an ideal model for studying these phenomena. Residues in the regions that favour unfolding of the monomer and those in the hinge loop have been deeply studied in proteins undergoing 3D domain swapping. To study the role of these residues in the unfolding of the c-Src SH3 domain, we have constructed several chimeric proteins by interchanging residues in the RT and n-Src loops between the c-Src SH3 and Abl SH3 domains. The RT (the region between β1 and β2) and n-Src (the region between β2 and β3) loops create two sides of the shallow hydrophobic groove where proline-rich motif sequences bind to the SH3 domain. In addition to the structural information, we have performed a biophysical characterization of these chimeric constructs. The c-Src SH3 domain bearing the loops of the Abl SH3 shows minor changes in stability. Interestingly, these replacements do not prevent the formation of domain-swapped dimers. However, the interchange of one or two loops within the Abl SH3 domain produces a noticeable reduction in its stability but does not promote the formation of 3D domain-swapped oligomers. Thus, our results indicate that although the composition of the hinge loop is likely to play a role in the interchange of structural elements to form the intertwined dimers, it is not the sole driving force in their formation.

从生物物理和结构的角度来看,c-Src SH3结构域是最具特征的模块化结构域之一。这个SH3结构域显示非规范的替代折叠,形成3D结构域交换的低聚物和淀粉样原纤维。这些特征使这种小蛋白质成为研究这些现象的理想模型。在进行三维结构域交换的蛋白质中,对有利于单体展开的区域和铰链环中的残基进行了深入研究。为了研究这些残基在c-Src SH3结构域展开中的作用,我们通过在c-Src SH3和Abl SH3结构域之间交换RT和n-Src环中的残基构建了几种嵌合蛋白。RT (β1和β2之间的区域)和n-Src (β2和β3之间的区域)环形成浅疏水槽的两侧,富含脯氨酸的基序序列与SH3结构域结合。除了结构信息外,我们还对这些嵌合结构进行了生物物理表征。承载Abl SH3环的c-Src SH3结构域在稳定性上表现出微小的变化。有趣的是,这些替换并不能阻止结构域交换二聚体的形成。然而,Abl SH3结构域内一个或两个环的交换会显著降低其稳定性,但不会促进3D结构域交换低聚物的形成。因此,我们的研究结果表明,虽然铰链环的组成可能在结构元素的交换中发挥作用,形成相互缠绕的二聚体,但它不是形成二聚体的唯一驱动力。
{"title":"Understanding domain swapping in the c-Src SH3 domain through hinge-loop mutagenesis.","authors":"M Carmen Salinas-Garcia, Marina Plaza-Garrido, Jose C Martinez, Ana Camara-Artigas","doi":"10.1107/S2059798325006977","DOIUrl":"https://doi.org/10.1107/S2059798325006977","url":null,"abstract":"<p><p>The c-Src SH3 domain is one of the best-characterized modular domains from a biophysical and structural point of view. This SH3 domain displays noncanonical alternative folding, forming 3D domain-swapped oligomers and amyloid fibrils. These features make this small protein an ideal model for studying these phenomena. Residues in the regions that favour unfolding of the monomer and those in the hinge loop have been deeply studied in proteins undergoing 3D domain swapping. To study the role of these residues in the unfolding of the c-Src SH3 domain, we have constructed several chimeric proteins by interchanging residues in the RT and n-Src loops between the c-Src SH3 and Abl SH3 domains. The RT (the region between β1 and β2) and n-Src (the region between β2 and β3) loops create two sides of the shallow hydrophobic groove where proline-rich motif sequences bind to the SH3 domain. In addition to the structural information, we have performed a biophysical characterization of these chimeric constructs. The c-Src SH3 domain bearing the loops of the Abl SH3 shows minor changes in stability. Interestingly, these replacements do not prevent the formation of domain-swapped dimers. However, the interchange of one or two loops within the Abl SH3 domain produces a noticeable reduction in its stability but does not promote the formation of 3D domain-swapped oligomers. Thus, our results indicate that although the composition of the hinge loop is likely to play a role in the interchange of structural elements to form the intertwined dimers, it is not the sole driving force in their formation.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":"81 Pt 9","pages":"492-510"},"PeriodicalIF":3.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12400191/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144938514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Crystal structures of 40- and 71-substitution variants of hydroxynitrile lyase from rubber tree. 橡胶树羟基腈裂解酶40-和71-取代变异体的晶体结构。
IF 3.8 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-09-01 Epub Date: 2025-08-27 DOI: 10.1107/S2059798325007065
Colin T Pierce, Panhavuth Tan, Lauren R Greenberg, Meghan E Walsh, Ke Shi, Alana H Nguyen, Elyssa L Meixner, Sharad Sarak, Hideki Aihara, Robert L Evans, Romas J Kazlauskas

Hydroxynitrile lyase from Hevea brasiliensis (HbHNL) and the esterase SABP2 from Nicotiana tabacum share the α/β-hydrolase fold, a Ser-His-Asp catalytic triad and 44% sequence identity, yet catalyze different reactions. Prior studies showed that three active-site substitutions in HbHNL conferred weak esterase activity. To investigate how regions beyond the active site influence catalytic efficiency and active-site geometry, we engineered HbHNL variants with increasing numbers of substitutions to match SABP2. Variant HNL16 has all amino acids within 6.5 Å of the active site identical to SABP2, HNL40 those within 10 Å and HNL71 those within 14 Å. HNL16 exhibited poor esterase activity, whereas both HNL40 and HNL71 showed efficient esterase catalysis, demonstrating that residues beyond the immediate active site are critical for functional conversion. X-ray structures of HNL40 and HNL71 reveal a progressive shift in backbone positions toward those of SABP2, with r.m.s.d. values of 0.51 Å (HNL40) and 0.41 Å (HNL71) over the Cα atoms, and even smaller r.m.s.d.s within the active-site region. Both HNL40 and HNL71 show a restored oxyanion hole and an additional tunnel connecting the active site to the protein surface. This work demonstrates the essential role of distant, indirectly acting residues to catalysis in α/β-hydrolase enzymes.

巴西橡胶树(Hevea brasiliensis, HbHNL)和烟草(Nicotiana tabacum)酯酶SABP2具有α/β-水解酶折叠、Ser-His-Asp催化三元组和44%的序列同源性,但催化的反应不同。先前的研究表明,HbHNL的三个活性位点替换导致酯酶活性较弱。为了研究活性位点以外的区域如何影响催化效率和活性位点的几何形状,我们设计了HbHNL变体,增加了取代数量,以匹配SABP2。变体HNL16与SABP2活性位点6.5 Å以内的氨基酸全部相同,HNL40与SABP2活性位点10 Å以内的氨基酸全部相同,HNL71与SABP2活性位点14 Å以内的氨基酸全部相同。HNL16表现出较差的酯酶活性,而HNL40和HNL71都表现出有效的酯酶催化作用,这表明直接活性位点以外的残基对功能转化至关重要。HNL40和HNL71的x射线结构显示主链位置向SABP2的主链位置逐渐偏移,其Cα原子上的r.m.s.d.值分别为0.51 Å (HNL40)和0.41 Å (HNL71),活性位区域内的r.m.s.d.值更小。HNL40和HNL71都显示了一个修复的氧阴离子空洞和一个连接活性位点到蛋白质表面的额外通道。这项工作证明了远端间接作用残基对α/β-水解酶催化的重要作用。
{"title":"Crystal structures of 40- and 71-substitution variants of hydroxynitrile lyase from rubber tree.","authors":"Colin T Pierce, Panhavuth Tan, Lauren R Greenberg, Meghan E Walsh, Ke Shi, Alana H Nguyen, Elyssa L Meixner, Sharad Sarak, Hideki Aihara, Robert L Evans, Romas J Kazlauskas","doi":"10.1107/S2059798325007065","DOIUrl":"10.1107/S2059798325007065","url":null,"abstract":"<p><p>Hydroxynitrile lyase from Hevea brasiliensis (HbHNL) and the esterase SABP2 from Nicotiana tabacum share the α/β-hydrolase fold, a Ser-His-Asp catalytic triad and 44% sequence identity, yet catalyze different reactions. Prior studies showed that three active-site substitutions in HbHNL conferred weak esterase activity. To investigate how regions beyond the active site influence catalytic efficiency and active-site geometry, we engineered HbHNL variants with increasing numbers of substitutions to match SABP2. Variant HNL16 has all amino acids within 6.5 Å of the active site identical to SABP2, HNL40 those within 10 Å and HNL71 those within 14 Å. HNL16 exhibited poor esterase activity, whereas both HNL40 and HNL71 showed efficient esterase catalysis, demonstrating that residues beyond the immediate active site are critical for functional conversion. X-ray structures of HNL40 and HNL71 reveal a progressive shift in backbone positions toward those of SABP2, with r.m.s.d. values of 0.51 Å (HNL40) and 0.41 Å (HNL71) over the C<sup>α</sup> atoms, and even smaller r.m.s.d.s within the active-site region. Both HNL40 and HNL71 show a restored oxyanion hole and an additional tunnel connecting the active site to the protein surface. This work demonstrates the essential role of distant, indirectly acting residues to catalysis in α/β-hydrolase enzymes.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":"81 Pt 9","pages":"511-523"},"PeriodicalIF":3.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12400190/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144938503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rolf Peter Hilgenfeld (1954-2025). 罗尔夫·彼得·希尔根菲尔德(1954-2025)。
IF 3.8 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-08-01 Epub Date: 2025-07-28 DOI: 10.1107/S2059798325006424
Jeroen R Mesters

Rolf Hilgenfeld lived a life in the fast lane, devoted entirely to science, in which he absorbed a wealth of knowledge, conscientiously created new knowledge and passionately passed it on to his students.

罗尔夫·希尔根菲尔德生活在快车道上,完全致力于科学,他吸收了丰富的知识,认真创造新知识,并热情地将其传授给他的学生。
{"title":"Rolf Peter Hilgenfeld (1954-2025).","authors":"Jeroen R Mesters","doi":"10.1107/S2059798325006424","DOIUrl":"https://doi.org/10.1107/S2059798325006424","url":null,"abstract":"<p><p>Rolf Hilgenfeld lived a life in the fast lane, devoted entirely to science, in which he absorbed a wealth of knowledge, conscientiously created new knowledge and passionately passed it on to his students.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":"81 Pt 8","pages":"469-472"},"PeriodicalIF":3.8,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144758919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Q-score as a reliability measure for protein, nucleic acid and small-molecule atomic coordinate models derived from 3DEM maps. Q-score作为蛋白质、核酸和小分子原子坐标模型的可靠性测量,来源于3DEM地图。
IF 3.8 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-08-01 Epub Date: 2025-07-14 DOI: 10.1107/S2059798325005923
Grigore Pintilie, Chenghua Shao, Zhe Wang, Brian P Hudson, Justin W Flatt, Michael F Schmid, Kyle L Morris, Stephen K Burley, Wah Chiu

Atomic coordinate models are important for the interpretation of 3D maps produced with cryoEM and cryoET (3D electron microscopy; 3DEM). In addition to visual inspection of such maps and models, quantitative metrics can inform about the reliability of the atomic coordinates, in particular how well the model is supported by the experimentally determined 3DEM map. A recently introduced metric, Q-score, was shown to correlate well with the reported resolution of the map for well fitted models. Here, we present new statistical analyses of Q-score based on its application to ∼10 000 maps and models archived in the EMDB (Electron Microscopy Data Bank) and PDB (Protein Data Bank). Further, we introduce two new metrics based on Q-score to represent each map and model relative to all entries in the EMDB and those with similar resolution. We explore through illustrative examples of proteins, nucleic acids and small molecules how Q-scores can indicate whether the atomic coordinates are well fitted to 3DEM maps and also whether some parts of a map may be poorly resolved due to factors such as molecular flexibility, radiation damage and/or conformational heterogeneity. These examples and statistical analyses provide a basis for how Q-scores can be interpreted effectively in order to evaluate 3DEM maps and atomic coordinate models prior to publication and archiving.

原子坐标模型对于用cryoEM和cryoET (3D电子显微镜;3 dem)。除了对这些地图和模型进行目视检查外,定量指标还可以告知原子坐标的可靠性,特别是实验确定的3DEM地图对模型的支持程度。最近引入的一个度量,Q-score,被证明与良好拟合模型的地图分辨率有很好的相关性。在这里,我们基于Q-score在EMDB(电子显微镜数据库)和PDB(蛋白质数据库)中存档的约10,000个地图和模型中的应用,提出了新的Q-score统计分析。此外,我们引入了两个基于Q-score的新指标,以表示相对于EMDB中的所有条目和具有相似分辨率的条目的每个映射和模型。我们通过蛋白质,核酸和小分子的说明性示例探索q分数如何指示原子坐标是否与3DEM图很好地拟合,以及由于分子灵活性,辐射损伤和/或构象异质性等因素,地图的某些部分是否可能无法很好地解决。这些例子和统计分析为如何有效地解释q分数提供了基础,以便在出版和存档之前评估3DEM地图和原子坐标模型。
{"title":"Q-score as a reliability measure for protein, nucleic acid and small-molecule atomic coordinate models derived from 3DEM maps.","authors":"Grigore Pintilie, Chenghua Shao, Zhe Wang, Brian P Hudson, Justin W Flatt, Michael F Schmid, Kyle L Morris, Stephen K Burley, Wah Chiu","doi":"10.1107/S2059798325005923","DOIUrl":"10.1107/S2059798325005923","url":null,"abstract":"<p><p>Atomic coordinate models are important for the interpretation of 3D maps produced with cryoEM and cryoET (3D electron microscopy; 3DEM). In addition to visual inspection of such maps and models, quantitative metrics can inform about the reliability of the atomic coordinates, in particular how well the model is supported by the experimentally determined 3DEM map. A recently introduced metric, Q-score, was shown to correlate well with the reported resolution of the map for well fitted models. Here, we present new statistical analyses of Q-score based on its application to ∼10 000 maps and models archived in the EMDB (Electron Microscopy Data Bank) and PDB (Protein Data Bank). Further, we introduce two new metrics based on Q-score to represent each map and model relative to all entries in the EMDB and those with similar resolution. We explore through illustrative examples of proteins, nucleic acids and small molecules how Q-scores can indicate whether the atomic coordinates are well fitted to 3DEM maps and also whether some parts of a map may be poorly resolved due to factors such as molecular flexibility, radiation damage and/or conformational heterogeneity. These examples and statistical analyses provide a basis for how Q-scores can be interpreted effectively in order to evaluate 3DEM maps and atomic coordinate models prior to publication and archiving.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":" ","pages":"410-422"},"PeriodicalIF":3.8,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12315582/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144625260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient high-resolution refinement in cryo-EM with stochastic gradient descent. 低温电镜随机梯度下降的高效高分辨率细化。
IF 2.6 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-07-01 Epub Date: 2025-06-23 DOI: 10.1107/S205979832500511X
Bogdan Toader, Marcus A Brubaker, Roy R Lederman

Electron cryo-microscopy (cryo-EM) is an imaging technique that is widely used in structural biology to determine the three-dimensional structure of biological molecules from noisy two-dimensional projections with unknown orientations. As the typical pipeline involves processing large amounts of data, efficient algorithms are crucial for fast and reliable results. The stochastic gradient descent (SGD) algorithm has been used to improve the speed of ab initio reconstruction, which results in an initial, low-resolution estimation of the volume representing the molecule of interest, but has yet to be applied successfully in the high-resolution regime, where expectation-maximization algorithms achieve state-of-the-art results, at a high computational cost. In this article, we investigate the conditioning of the optimization problem and show that the large condition number prevents the successful application of gradient descent-based methods at high resolution. Our results include a theoretical analysis of the condition number of the optimization problem in a simplified setting where the individual projection directions are known, an algorithm based on computing a diagonal preconditioner using Hutchinson's diagonal estimator and numerical experiments showing the improvement in the convergence speed when using the estimated preconditioner with SGD. The preconditioned SGD approach can potentially enable a simple and unified approach to ab initio reconstruction and high-resolution refinement with faster convergence speed and higher flexibility, and our results are a promising step in this direction.

电子冷冻显微镜(cryo-EM)是一种广泛应用于结构生物学的成像技术,用于从具有未知方向的嘈杂二维投影中确定生物分子的三维结构。由于典型的管道涉及处理大量数据,因此高效的算法对于快速可靠的结果至关重要。随机梯度下降(SGD)算法已被用于提高从头算重建的速度,这导致对代表感兴趣分子的体积的初始、低分辨率估计,但尚未成功应用于高分辨率领域,其中期望最大化算法以高计算成本实现了最先进的结果。在本文中,我们研究了优化问题的条件,并表明大的条件数阻碍了基于梯度下降的方法在高分辨率下的成功应用。我们的结果包括在已知单个投影方向的简化设置下对优化问题条件数的理论分析,基于使用Hutchinson对角估计器计算对角预条件的算法和数值实验,表明使用SGD估计预条件时收敛速度的提高。预条件SGD方法可以实现一种简单统一的从头开始重建和高分辨率细化方法,具有更快的收敛速度和更高的灵活性,我们的研究结果是朝着这个方向迈出的有希望的一步。
{"title":"Efficient high-resolution refinement in cryo-EM with stochastic gradient descent.","authors":"Bogdan Toader, Marcus A Brubaker, Roy R Lederman","doi":"10.1107/S205979832500511X","DOIUrl":"10.1107/S205979832500511X","url":null,"abstract":"<p><p>Electron cryo-microscopy (cryo-EM) is an imaging technique that is widely used in structural biology to determine the three-dimensional structure of biological molecules from noisy two-dimensional projections with unknown orientations. As the typical pipeline involves processing large amounts of data, efficient algorithms are crucial for fast and reliable results. The stochastic gradient descent (SGD) algorithm has been used to improve the speed of ab initio reconstruction, which results in an initial, low-resolution estimation of the volume representing the molecule of interest, but has yet to be applied successfully in the high-resolution regime, where expectation-maximization algorithms achieve state-of-the-art results, at a high computational cost. In this article, we investigate the conditioning of the optimization problem and show that the large condition number prevents the successful application of gradient descent-based methods at high resolution. Our results include a theoretical analysis of the condition number of the optimization problem in a simplified setting where the individual projection directions are known, an algorithm based on computing a diagonal preconditioner using Hutchinson's diagonal estimator and numerical experiments showing the improvement in the convergence speed when using the estimated preconditioner with SGD. The preconditioned SGD approach can potentially enable a simple and unified approach to ab initio reconstruction and high-resolution refinement with faster convergence speed and higher flexibility, and our results are a promising step in this direction.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":" ","pages":"327-343"},"PeriodicalIF":2.6,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12216675/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144473730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structural dynamics of IDR interactions in human SFPQ and implications for liquid-liquid phase separation. 人SFPQ中IDR相互作用的结构动力学及其对液-液相分离的意义。
IF 2.6 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-07-01 Epub Date: 2025-06-27 DOI: 10.1107/S2059798325005303
Heidar J Koning, Valerie Lai, Ashish Sethi, Shatabdi Chakraborty, Ching Seng Ang, Archa H Fox, Anthony P Duff, Andrew E Whitten, Andrew C Marshall, Charles S Bond

The proteins SFPQ (splicing factor proline- and glutamine-rich) and NONO (non-POU domain-containing octamer-binding protein) are members of the Drosophila behaviour/human splicing (DBHS) protein family, sharing 76% sequence identity in their conserved DBHS domain. These proteins are critical for elements of pre- and post-transcriptional regulation in mammals and are primarily located in paraspeckles: ribonucleoprotein bodies templated by NEAT1 long noncoding RNA. Regions that are structured and predicted to be disordered (IDRs) in DBHS proteins facilitate various interactions, including dimerization, polymerization, nucleic acid binding and liquid-liquid phase separation, all of which have consequences for cell health, the pathology of some neurological diseases and cancer. To date, very limited structural work has been carried out on characterizing the IDRs of the DBHS proteins, largely due to their predicted disordered nature and the fact that this is often a bottleneck for conventional structural techniques. This is a problem worth addressing, as the IDRs have been shown to be critical to the material state of the protein as well as its function. In this study, we used small-angle X-ray scattering (SAXS) and small-angle neutron scattering (SANS), together with lysine cross-linking mass spectrometry (XL-MS), to investigate the regions of SFPQ flanking the structured DBHS domain and the possibility of dimer partner exchange of full-length proteins. Our results demonstrate experimentally that the N- and C-terminal regions on either side of the folded DBHS domain are long, disordered and flexible in solution. Realistic modelling of disordered chains to fit the scattering data and the compaction of the different protein variants suggests that it is physically possible for the IDRs to be close enough to interact. The mass-spectrometry data additionally indicate that the C-terminal IDR can potentially interact with the folded DBHS domain and also shares some conformational space with the N-terminal IDR. Our small-angle neutron scattering (SANS) experiments reveal that full-length SFPQ is capable of swapping dimer partners with itself, which has implications for our understanding of the combinatorial dimerization of DBHS proteins within cells. Our study provides insight into possible interactions between different IDRs either in cis or in trans and how these may relate to protein function, and the possible impact of mutations in these regions. The dynamic dimer partner exchange of a full-length protein inferred from this study is a phenomenon that is integral to the function of DBHS proteins, allowing changes in gene-regulatory activity by altering levels of the various heterodimers or homodimers.

SFPQ蛋白(剪接因子脯氨酸和谷氨酰胺丰富)和NONO蛋白(非pou结构域含八聚体结合蛋白)是果蝇行为/人类剪接(DBHS)蛋白家族的成员,在其保守的DBHS结构域具有76%的序列一致性。这些蛋白在哺乳动物的转录前和转录后调控中起着至关重要的作用,主要位于旁斑:由NEAT1长链非编码RNA模板化的核糖核蛋白小体。DBHS蛋白中的结构和预测紊乱(IDRs)区域促进各种相互作用,包括二聚化、聚合、核酸结合和液-液相分离,所有这些都对细胞健康、一些神经系统疾病和癌症的病理产生影响。迄今为止,在表征DBHS蛋白的idr方面进行的结构工作非常有限,这主要是由于它们预测的无序性质,而且这通常是传统结构技术的瓶颈。这是一个值得解决的问题,因为idr已被证明对蛋白质的物质状态及其功能至关重要。在这项研究中,我们利用小角x射线散射(SAXS)和小角中子散射(SANS),结合赖氨酸交联质谱(XL-MS),研究了SFPQ在DBHS结构域两侧的区域以及全长蛋白二聚体伴侣交换的可能性。实验结果表明,折叠DBHS结构域两侧的N端和c端在溶液中是长、无序和灵活的。对无序链进行现实建模以拟合散射数据和不同蛋白质变体的压实表明,idr在物理上可能足够接近以相互作用。质谱数据还表明,c端IDR可能与折叠的DBHS结构域相互作用,并与n端IDR共享一些构象空间。我们的小角中子散射(SANS)实验表明,全长SFPQ能够与自身交换二聚体伙伴,这对我们理解DBHS蛋白在细胞内的组合二聚化具有重要意义。我们的研究深入了解了顺式或反式中不同idr之间可能的相互作用,以及这些相互作用如何与蛋白质功能相关,以及这些区域突变可能产生的影响。从本研究推断,全长蛋白的动态二聚体伴侣交换是DBHS蛋白功能不可或缺的现象,允许通过改变各种异源二聚体或同型二聚体的水平来改变基因调控活性。
{"title":"Structural dynamics of IDR interactions in human SFPQ and implications for liquid-liquid phase separation.","authors":"Heidar J Koning, Valerie Lai, Ashish Sethi, Shatabdi Chakraborty, Ching Seng Ang, Archa H Fox, Anthony P Duff, Andrew E Whitten, Andrew C Marshall, Charles S Bond","doi":"10.1107/S2059798325005303","DOIUrl":"10.1107/S2059798325005303","url":null,"abstract":"<p><p>The proteins SFPQ (splicing factor proline- and glutamine-rich) and NONO (non-POU domain-containing octamer-binding protein) are members of the Drosophila behaviour/human splicing (DBHS) protein family, sharing 76% sequence identity in their conserved DBHS domain. These proteins are critical for elements of pre- and post-transcriptional regulation in mammals and are primarily located in paraspeckles: ribonucleoprotein bodies templated by NEAT1 long noncoding RNA. Regions that are structured and predicted to be disordered (IDRs) in DBHS proteins facilitate various interactions, including dimerization, polymerization, nucleic acid binding and liquid-liquid phase separation, all of which have consequences for cell health, the pathology of some neurological diseases and cancer. To date, very limited structural work has been carried out on characterizing the IDRs of the DBHS proteins, largely due to their predicted disordered nature and the fact that this is often a bottleneck for conventional structural techniques. This is a problem worth addressing, as the IDRs have been shown to be critical to the material state of the protein as well as its function. In this study, we used small-angle X-ray scattering (SAXS) and small-angle neutron scattering (SANS), together with lysine cross-linking mass spectrometry (XL-MS), to investigate the regions of SFPQ flanking the structured DBHS domain and the possibility of dimer partner exchange of full-length proteins. Our results demonstrate experimentally that the N- and C-terminal regions on either side of the folded DBHS domain are long, disordered and flexible in solution. Realistic modelling of disordered chains to fit the scattering data and the compaction of the different protein variants suggests that it is physically possible for the IDRs to be close enough to interact. The mass-spectrometry data additionally indicate that the C-terminal IDR can potentially interact with the folded DBHS domain and also shares some conformational space with the N-terminal IDR. Our small-angle neutron scattering (SANS) experiments reveal that full-length SFPQ is capable of swapping dimer partners with itself, which has implications for our understanding of the combinatorial dimerization of DBHS proteins within cells. Our study provides insight into possible interactions between different IDRs either in cis or in trans and how these may relate to protein function, and the possible impact of mutations in these regions. The dynamic dimer partner exchange of a full-length protein inferred from this study is a phenomenon that is integral to the function of DBHS proteins, allowing changes in gene-regulatory activity by altering levels of the various heterodimers or homodimers.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":" ","pages":"357-379"},"PeriodicalIF":2.6,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12216677/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144504448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Crystal structure of coagulation factor XII N-terminal domains 1-5. 凝血因子XII n端结构域1-5的晶体结构。
IF 2.6 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-07-01 Epub Date: 2025-06-27 DOI: 10.1107/S2059798325005297
Muhammad Saleem, Chan Li, Bubacarr G Kaira, Alexander K Brown, Monika Pathak, Shabir Najmudin, Nathan Cowieson, Ingrid Dreveny, Clare Wilson, Aleksandr Shamanaev, David Gailani, Stephanie A Smith, James H Morrissey, Helen Philippou, Jonas Emsley

Factor XIIa (FXIIa) is generated from its zymogen factor XII (FXII) by contact with polyanions such as inorganic polyphosphates. FXIIa cleaves the substrates prekallikrein and factor XI, triggering inflammatory cascades and plasma coagulation. From the N-terminus, FXII has fibronectin type II (FnII), epidermal growth factor-1 (EGF1), fibronectin type I (FnI), EGF2 and kringle domains. The N-terminal domains of FXII mediate polyanion and Zn2+ binding. To understand how ligand binding to polyanions and Zn2+ is coordinated across multiple domains, we determined the crystal structure of recombinant FXII domains 1-5 (FXIIHC5) to 3.4 Å resolution. A separate crystal structure of the isolated FXII FnII domain at 1.2 Å resolution revealed two bound Zn2+ ions. In FXIIHC5 a head-to-tail interaction is formed between the FnII and kringle domains, co-localizing the lysine-binding sites of the kringle domain and the cation-binding site of the FnII domain. Two FXIIHC5 monomers interlock, burying a large surface area of 2067 Å2, such that two kringle domains point outwards separated by a distance of 20 Å. The polyanion-binding site in the EGF1 domain is localized onto a plane together with the FnII and FnI domains. Using native mass spectrometry, we detected a major FXIIHC5 monomer peak and a minor dimer peak. Small-angle X-ray scattering and gel-filtration chromatography revealed the presence of monomers and dimers in solution. These FXII N-terminal domain structures provide a holistic framework to understand how the mosaic domain structure of FXII assembles diverse ligand-binding sites in three dimensions.

因子XIIa (FXIIa)是由其酶原因子XII (FXII)与多阴离子(如无机多磷酸盐)接触而产生的。FXIIa切割底物prekallikin和因子XI,引发炎症级联反应和血浆凝固。从n端来看,FXII具有纤维连接蛋白II型(FnII)、表皮生长因子-1 (EGF1)、纤维连接蛋白I型(FnI)、EGF2和kringle结构域。FXII的n端结构域介导多阴离子和Zn2+的结合。为了了解配体与多阴离子和Zn2+的结合是如何跨多个结构域协调的,我们以3.4 Å分辨率确定了重组FXII结构域1-5 (FXIIHC5)的晶体结构。在1.2 Å分辨率下,分离的FXII FnII畴的单独晶体结构显示了两个结合的Zn2+离子。在FXIIHC5中,FnII和kringle结构域之间形成了头尾相互作用,kringle结构域的赖氨酸结合位点和FnII结构域的阳离子结合位点共定位。两个FXIIHC5单体互锁,掩埋了2067 Å2的大表面积,使得两个kringle域指向外,相距20 Å。EGF1结构域的多阴离子结合位点与FnII和FnI结构域一起定位在一个平面上。使用天然质谱法,我们检测到一个主要的FXIIHC5单体峰和一个次要的二聚体峰。小角x射线散射和凝胶过滤色谱显示溶液中存在单体和二聚体。这些FXII n端结构域结构为了解FXII的镶嵌结构域结构如何在三维上组装不同的配体结合位点提供了一个整体框架。
{"title":"Crystal structure of coagulation factor XII N-terminal domains 1-5.","authors":"Muhammad Saleem, Chan Li, Bubacarr G Kaira, Alexander K Brown, Monika Pathak, Shabir Najmudin, Nathan Cowieson, Ingrid Dreveny, Clare Wilson, Aleksandr Shamanaev, David Gailani, Stephanie A Smith, James H Morrissey, Helen Philippou, Jonas Emsley","doi":"10.1107/S2059798325005297","DOIUrl":"10.1107/S2059798325005297","url":null,"abstract":"<p><p>Factor XIIa (FXIIa) is generated from its zymogen factor XII (FXII) by contact with polyanions such as inorganic polyphosphates. FXIIa cleaves the substrates prekallikrein and factor XI, triggering inflammatory cascades and plasma coagulation. From the N-terminus, FXII has fibronectin type II (FnII), epidermal growth factor-1 (EGF1), fibronectin type I (FnI), EGF2 and kringle domains. The N-terminal domains of FXII mediate polyanion and Zn<sup>2+</sup> binding. To understand how ligand binding to polyanions and Zn<sup>2+</sup> is coordinated across multiple domains, we determined the crystal structure of recombinant FXII domains 1-5 (FXII<sup>HC5</sup>) to 3.4 Å resolution. A separate crystal structure of the isolated FXII FnII domain at 1.2 Å resolution revealed two bound Zn<sup>2+</sup> ions. In FXII<sup>HC5</sup> a head-to-tail interaction is formed between the FnII and kringle domains, co-localizing the lysine-binding sites of the kringle domain and the cation-binding site of the FnII domain. Two FXII<sup>HC5</sup> monomers interlock, burying a large surface area of 2067 Å<sup>2</sup>, such that two kringle domains point outwards separated by a distance of 20 Å. The polyanion-binding site in the EGF1 domain is localized onto a plane together with the FnII and FnI domains. Using native mass spectrometry, we detected a major FXII<sup>HC5</sup> monomer peak and a minor dimer peak. Small-angle X-ray scattering and gel-filtration chromatography revealed the presence of monomers and dimers in solution. These FXII N-terminal domain structures provide a holistic framework to understand how the mosaic domain structure of FXII assembles diverse ligand-binding sites in three dimensions.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":" ","pages":"380-393"},"PeriodicalIF":2.6,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12216678/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144504447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced intensity-based clustering of isomorphous multi-crystal data sets in the presence of subtle variations. 增强的基于强度的同构多晶数据集在存在细微变化的情况下聚类。
IF 2.6 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-06-01 Epub Date: 2025-05-29 DOI: 10.1107/S2059798325004589
Amy J Thompson, James Beilsten-Edmands, Cicely Tam, Juan Sanchez-Weatherby, James Sandy, Halina Mikolajek, Danny Axford, Sofia Jaho, Michael A Hough, Graeme Winter

Multi-crystal processing of X-ray diffraction data has become highly automated to keep pace with the current high-throughput capabilities afforded by beamlines. A significant challenge, however, is the automated clustering of such data based on subtle differences such as ligand binding or conformational shifts. Intensity-based hierarchical clustering has been shown to be a viable method of identifying such subtle structural differences, but the interpretation of the resulting dendrograms is difficult to automate. Using isomorphous crystals of bovine, porcine and human insulin, the existing clustering methods in the multi-crystal processing software xia2.multiplex were validated and their limits were tested. It was determined that weighting the pairwise correlation coefficient calculations with the intensity uncertainties was required for accurate calculation of the pairwise correlation coefficient matrix (correlation clustering) and dimension optimization was required when expressing this matrix as a set of coordinates representing data sets (cosine-angle clustering). Finally, the introduction of the OPTICS spatial density-based clustering algorithm into DIALS allowed the automatic output of species-pure clusters of bovine, porcine and human insulin data sets.

x射线衍射数据的多晶处理已经高度自动化,以跟上当前光束线提供的高通量能力。然而,一个重大的挑战是基于诸如配体结合或构象转移等细微差异的数据的自动聚类。基于强度的分层聚类已被证明是识别这种细微结构差异的可行方法,但对所产生的树状图的解释很难自动化。利用牛、猪和人胰岛素的同构晶体,对现有的多晶处理软件霞2中的聚类方法进行了分析。对多路复合进行了验证,并对其极限进行了测试。确定对两两相关系数计算与强度不确定性进行加权,才能准确计算两两相关系数矩阵(相关聚类),将两两相关系数矩阵表示为代表数据集的坐标集(余弦角聚类)时,需要进行维数优化。最后,将OPTICS基于空间密度的聚类算法引入DIALS,可以自动输出牛、猪和人胰岛素数据集的物种纯聚类。
{"title":"Enhanced intensity-based clustering of isomorphous multi-crystal data sets in the presence of subtle variations.","authors":"Amy J Thompson, James Beilsten-Edmands, Cicely Tam, Juan Sanchez-Weatherby, James Sandy, Halina Mikolajek, Danny Axford, Sofia Jaho, Michael A Hough, Graeme Winter","doi":"10.1107/S2059798325004589","DOIUrl":"10.1107/S2059798325004589","url":null,"abstract":"<p><p>Multi-crystal processing of X-ray diffraction data has become highly automated to keep pace with the current high-throughput capabilities afforded by beamlines. A significant challenge, however, is the automated clustering of such data based on subtle differences such as ligand binding or conformational shifts. Intensity-based hierarchical clustering has been shown to be a viable method of identifying such subtle structural differences, but the interpretation of the resulting dendrograms is difficult to automate. Using isomorphous crystals of bovine, porcine and human insulin, the existing clustering methods in the multi-crystal processing software xia2.multiplex were validated and their limits were tested. It was determined that weighting the pairwise correlation coefficient calculations with the intensity uncertainties was required for accurate calculation of the pairwise correlation coefficient matrix (correlation clustering) and dimension optimization was required when expressing this matrix as a set of coordinates representing data sets (cosine-angle clustering). Finally, the introduction of the OPTICS spatial density-based clustering algorithm into DIALS allowed the automatic output of species-pure clusters of bovine, porcine and human insulin data sets.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":" ","pages":"278-290"},"PeriodicalIF":2.6,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12128884/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144172350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Acta Crystallographica. Section D, Structural Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1