Pub Date : 2025-02-26DOI: 10.1007/s10858-025-00460-3
Daniel Braun, Clemens Kauffmann, Andreas Beier, Irene Ceccolini, Olga O. Lebedenko, Nikolai R. Skrynnikov, Robert Konrat
Structurally diverse ensembles of intrinsically disordered proteins or regions are difficult to determine, because experimental observables usually report a conformational average. Therefore, in order to infer the underlying distribution, a set of experiments that measure different aspects of the system is necessary. In principle, there exists a set of cross-correlated relaxation (CCR) rates that report on protein backbone geometry in a complementary way. However, CCR rates are hard to interpret, because geometric information is encoded in an ambiguous way and they present themselves as a convolute of both structure and dynamics. Despite these challenges, CCR rates analyzed within a suitable statistical framework are able to identify conformations in structured proteins. In the context of disordered proteins, we find that this approach has to be adjusted to account for local dynamics via including an additional CCR rate. The results of this study show that CCR rates can be used to characterize structure propensities also in disordered proteins. Instead of using an experimental reference structure, we employed computational spectroscopy to calculate CCR rates from molecular dynamics (MD) simulations and subsequently compared the results to conformations as observed directly in the MD trajectory.
{"title":"Local structure propensities in disordered proteins from cross-correlated NMR spin relaxation","authors":"Daniel Braun, Clemens Kauffmann, Andreas Beier, Irene Ceccolini, Olga O. Lebedenko, Nikolai R. Skrynnikov, Robert Konrat","doi":"10.1007/s10858-025-00460-3","DOIUrl":"10.1007/s10858-025-00460-3","url":null,"abstract":"<div><p>Structurally diverse ensembles of intrinsically disordered proteins or regions are difficult to determine, because experimental observables usually report a conformational average. Therefore, in order to infer the underlying distribution, a set of experiments that measure different aspects of the system is necessary. In principle, there exists a set of cross-correlated relaxation (CCR) rates that report on protein backbone geometry in a complementary way. However, CCR rates are hard to interpret, because geometric information is encoded in an ambiguous way and they present themselves as a convolute of both structure and dynamics. Despite these challenges, CCR rates analyzed within a suitable statistical framework are able to identify conformations in structured proteins. In the context of disordered proteins, we find that this approach has to be adjusted to account for local dynamics via including an additional CCR rate. The results of this study show that CCR rates can be used to characterize structure propensities also in disordered proteins. Instead of using an experimental reference structure, we employed computational spectroscopy to calculate CCR rates from molecular dynamics (MD) simulations and subsequently compared the results to conformations as observed directly in the MD trajectory.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"79 2","pages":"115 - 127"},"PeriodicalIF":1.3,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10858-025-00460-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143514313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-17DOI: 10.1007/s10858-024-00457-4
Louis-Philippe Picard, Dmitry Pichugin, Shuya Kate Huang, Motasem Suleiman, R. Scott Prosser
Large proteins and dilute spin systems within a deuterated background are often characterized by long proton (1H) spin-lattice relaxation times (T1), which directly impacts the recycle delay and hence, the total experimental time. Dioxygen (O2) is a well-known paramagnetic species whose short electronic spin-lattice relaxation time (7.5 ps) contributes to effective spin-lattice relaxation of high gamma nuclei. Oxygen’s chemical potential and high diffusivity also allows it to access both the protein exterior and much of the (hydrophobic) interior of the protein. Consequently, at O2 partial pressures of ~ 10 bar, 1H and 19F spin-lattice relaxation rates (R1) typically reach 3–5 Hz (versus rates of 0.7-1.0 Hz without oxygen) with comparable line-broadening in protein NMR spectra. Using fluoroacetate dehalogenase (FAcD) a soluble 35 kDa homodimeric enzyme, a nanodisc-stabilized G protein-coupled receptor (A2AR), and bovine serum albumin (BSA) as test cases, a 3-fold savings in time was achieved in acquiring 1H-15 N HSQC and 19F NMR spectra, after oxygenation at 9 bar for 24 h. Additional spin-diffusion effects are anticipated to contribute to uniform 1H spin-lattice relaxation for both solvent-exposed and buried protons, as demonstrated by T1 relaxation analysis of amides in 15N-labeled FAcD. Finally, we show that in protein samples dissolved oxygen pre-equilibrated at 9 bar (pO2) is largely retained in solution at 20° C or lower, using a standard NMR tube for a period of 3–4 days, thus avoiding the use of specialized apparatus or high-pressure NMR tubes in the spectrometer. The convenience of being able to add or remove the quenching species, while avoiding any complex apparatus in the NMR experiment, makes this a practical tool for both 19F, 1H-13 C, and 1H-15 N NMR studies of proteins.
在氚化背景下的大型蛋白质和稀释自旋系统通常具有质子(1H)自旋晶格弛豫时间(T1)长的特点,这直接影响到循环延迟,进而影响到总的实验时间。二氧(O2)是一种著名的顺磁性物质,其较短的电子自旋-晶格弛豫时间(7.5 ps)有助于高伽马核的有效自旋-晶格弛豫。氧气的化学势和高扩散性也使其能够进入蛋白质的外部和大部分(疏水性)内部。因此,在氧气分压约为 10 bar 时,1H 和 19F 自旋晶格弛豫速率 (R1) 通常达到 3-5 Hz(相比之下,无氧时的速率为 0.7-1.0 Hz),蛋白质 NMR 光谱中的线宽相当。使用氟乙酸脱卤酶(FAcD)(一种 35 kDa 的可溶性同源二聚体酶)、纳米盘稳定的 G 蛋白偶联受体(A2AR)和牛血清白蛋白(BSA)作为测试案例,在 9 bar 下充氧 24 小时后,获取 1H-15 N HSQC 和 19F NMR 图谱的时间节省了 3 倍。正如 15N 标记的 FAcD 中酰胺的 T1 弛豫分析所证明的那样,预计其他自旋扩散效应也会促进溶剂暴露和埋藏质子的 1H 自旋晶格均匀弛豫。最后,我们使用标准 NMR 管证明,在蛋白质样品中,在 9 巴(pO2)下预平衡的溶解氧可在 20° C 或更低温度的溶液中保留 3-4 天,从而避免了在光谱仪中使用专用仪器或高压 NMR 管。既能方便地添加或移除淬灭物种,又能避免在 NMR 实验中使用任何复杂的仪器,这使其成为蛋白质 19F、1H-13 C 和 1H-15 N NMR 研究的实用工具。
{"title":"Reducing experimental time through spin-lattice relaxation enhancement via dissolved oxygen","authors":"Louis-Philippe Picard, Dmitry Pichugin, Shuya Kate Huang, Motasem Suleiman, R. Scott Prosser","doi":"10.1007/s10858-024-00457-4","DOIUrl":"10.1007/s10858-024-00457-4","url":null,"abstract":"<div><p>Large proteins and dilute spin systems within a deuterated background are often characterized by long proton (<sup>1</sup>H) spin-lattice relaxation times (T<sub>1</sub>), which directly impacts the recycle delay and hence, the total experimental time. Dioxygen (O<sub>2</sub>) is a well-known paramagnetic species whose short electronic spin-lattice relaxation time (7.5 ps) contributes to effective spin-lattice relaxation of high gamma nuclei. Oxygen’s chemical potential and high diffusivity also allows it to access both the protein exterior and much of the (hydrophobic) interior of the protein. Consequently, at O<sub>2</sub> partial pressures of ~ 10 bar, <sup>1</sup>H and <sup>19</sup>F spin-lattice relaxation rates (R<sub>1</sub>) typically reach 3–5 Hz (versus rates of 0.7-1.0 Hz without oxygen) with comparable line-broadening in protein NMR spectra. Using fluoroacetate dehalogenase (FAcD) a soluble 35 kDa homodimeric enzyme, a nanodisc-stabilized G protein-coupled receptor (A<sub>2A</sub>R), and bovine serum albumin (BSA) as test cases, a 3-fold savings in time was achieved in acquiring <sup>1</sup>H-<sup>15</sup> N HSQC and <sup>19</sup>F NMR spectra, after oxygenation at 9 bar for 24 h. Additional spin-diffusion effects are anticipated to contribute to uniform <sup>1</sup>H spin-lattice relaxation for both solvent-exposed and buried protons, as demonstrated by T<sub>1</sub> relaxation analysis of amides in <sup>15</sup>N-labeled FAcD. Finally, we show that in protein samples dissolved oxygen pre-equilibrated at 9 bar (pO<sub>2</sub>) is largely retained in solution at 20° C or lower, using a standard NMR tube for a period of 3–4 days, thus avoiding the use of specialized apparatus or high-pressure NMR tubes in the spectrometer. The convenience of being able to add or remove the quenching species, while avoiding any complex apparatus in the NMR experiment, makes this a practical tool for both <sup>19</sup>F, <sup>1</sup>H-<sup>13</sup> C, and <sup>1</sup>H-<sup>15</sup> N NMR studies of proteins.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"79 2","pages":"67 - 78"},"PeriodicalIF":1.3,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143439688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-29DOI: 10.1007/s10858-024-00455-6
Tata Gopinath, Alyssa Kraft, Kyungsoo Shin, Nicholas A. Wood, Francesca M. Marassi
The NMR signals from protein sidechains are rich in information about intra- and inter-molecular interactions, but their detection can be complicated due to spectral overlap as well as conformational and hydrogen exchange. In this work, we demonstrate a protocol for multi-dimensional solid-state NMR spectral editing of signals from basic sidechains based on Hadamard matrix encoding. The Hadamard method acquires multi-dimensional experiments in such a way that both the backbone and under-sampled sidechain signals can be decoded for unambiguous editing in the 15N spectral frequency dimension. All multi-dimensional 15N-edited solid-state NMR experiments can be acquired using this strategy, thereby accelerating the acquisition of spectra spanning broad frequency bandwidth. Application of these methods to the ferritin nanocage, reveals signals from N atoms from His, Arg, Lys and Trp sidechains, as well as their tightly bound, ordered water molecules. The Hadamard approach adds to the arsenal of spectroscopic approaches for protein NMR signal detection.
{"title":"Solid state NMR spectral editing of histidine, arginine and lysine using Hadamard encoding","authors":"Tata Gopinath, Alyssa Kraft, Kyungsoo Shin, Nicholas A. Wood, Francesca M. Marassi","doi":"10.1007/s10858-024-00455-6","DOIUrl":"10.1007/s10858-024-00455-6","url":null,"abstract":"<div><p>The NMR signals from protein sidechains are rich in information about intra- and inter-molecular interactions, but their detection can be complicated due to spectral overlap as well as conformational and hydrogen exchange. In this work, we demonstrate a protocol for multi-dimensional solid-state NMR spectral editing of signals from basic sidechains based on Hadamard matrix encoding. The Hadamard method acquires multi-dimensional experiments in such a way that both the backbone and under-sampled sidechain signals can be decoded for unambiguous editing in the <sup>15</sup>N spectral frequency dimension. All multi-dimensional <sup>15</sup>N-edited solid-state NMR experiments can be acquired using this strategy, thereby accelerating the acquisition of spectra spanning broad frequency bandwidth. Application of these methods to the ferritin nanocage, reveals signals from N atoms from His, Arg, Lys and Trp sidechains, as well as their tightly bound, ordered water molecules. The Hadamard approach adds to the arsenal of spectroscopic approaches for protein NMR signal detection.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"79 1","pages":"35 - 45"},"PeriodicalIF":1.3,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10858-024-00455-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143062996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-22DOI: 10.1007/s10858-024-00453-8
Maria Anna Rodella, Robert Schneider, Rainer Kümmerle, Isabella C. Felli, Roberta Pierattelli
Intrinsically disordered proteins and protein regions are central to many biological processes but difficult to characterize at atomic resolution. Nuclear magnetic resonance is particularly well-suited for providing structural and dynamical information on intrinsically disordered proteins, but existing NMR methodologies need to be constantly refined to provide greater sensitivity and resolution, particularly to capitalise on the potential of high magnetic fields to investigate large proteins. In this paper, we describe how 15N-detected 2D NMR experiments can be optimised for better performance. We show that using selective aliphatic 1H decoupling in N-TROSY type experiments results in significant increases in sensitivity and resolution for a prototypical intrinsically disordered protein, α-synuclein, as well as for a heterogeneous intrinsically disordered region of a large multidomain protein, CBP-ID4. We also investigated the performance of incorporating longitudinal relaxation enhancement in N-TROSY experiments, both with and without aliphatic 1H decoupling, and discussed the findings in light of the available information for the two systems.
{"title":"15N-detected TROSY for 1H-15N heteronuclear correlation to study intrinsically disordered proteins: strategies to increase spectral quality","authors":"Maria Anna Rodella, Robert Schneider, Rainer Kümmerle, Isabella C. Felli, Roberta Pierattelli","doi":"10.1007/s10858-024-00453-8","DOIUrl":"10.1007/s10858-024-00453-8","url":null,"abstract":"<div><p>Intrinsically disordered proteins and protein regions are central to many biological processes but difficult to characterize at atomic resolution. Nuclear magnetic resonance is particularly well-suited for providing structural and dynamical information on intrinsically disordered proteins, but existing NMR methodologies need to be constantly refined to provide greater sensitivity and resolution, particularly to capitalise on the potential of high magnetic fields to investigate large proteins. In this paper, we describe how <sup>15</sup>N-detected 2D NMR experiments can be optimised for better performance. We show that using selective aliphatic <sup>1</sup>H decoupling in N-TROSY type experiments results in significant increases in sensitivity and resolution for a prototypical intrinsically disordered protein, α-synuclein, as well as for a heterogeneous intrinsically disordered region of a large multidomain protein, CBP-ID4. We also investigated the performance of incorporating longitudinal relaxation enhancement in N-TROSY experiments, both with and without aliphatic <sup>1</sup>H decoupling, and discussed the findings in light of the available information for the two systems.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"79 1","pages":"15 - 24"},"PeriodicalIF":1.3,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10858-024-00453-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142998175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-22DOI: 10.1007/s10858-024-00454-7
Carl Öster, Veniamin Chevelkov, Adam Lange
Chemical shift assignments of large membrane proteins by solid-state NMR experiments are challenging. Recent advancements in sensitivity-enhanced pulse sequences, have made it feasible to acquire 1H-detected 4D spectra of these challenging protein samples within reasonable timeframes. However, obtaining unambiguous assignments remains difficult without access to side-chain chemical shifts. Drawing inspiration from sensitivity-enhanced TOCSY experiments in solution NMR, we have explored the potential of 13C- 13C TOCSY mixing as a viable option for triple sensitivity-enhanced 4D experiments aimed at side-chain assignments in solid-state NMR. Through simulations and experimental trials, we have identified optimal conditions to achieve uniform transfer efficiency for both transverse components and to minimize undesired cross-transfers. Our experiments, conducted on the 30 kDa membrane protein GlpG embedded in E. coli liposomes, have demonstrated enhanced sensitivity compared to the most effective dipolar and J-coupling-based 13C- 13C mixing sequences. Notably, a non-uniformly sampled 4D hCXCANH spectrum with exceptionally high sensitivity was obtained in just a few days using a 600 MHz spectrometer equipped with a 1.3 mm probe operating at a magic angle spinning rate of 55 kHz.
{"title":"Evaluation of TOCSY mixing for sensitivity-enhancement in solid-state NMR and application of 4D experiments for side-chain assignments of the full-length 30 kDa membrane protein GlpG","authors":"Carl Öster, Veniamin Chevelkov, Adam Lange","doi":"10.1007/s10858-024-00454-7","DOIUrl":"10.1007/s10858-024-00454-7","url":null,"abstract":"<div><p>Chemical shift assignments of large membrane proteins by solid-state NMR experiments are challenging. Recent advancements in sensitivity-enhanced pulse sequences, have made it feasible to acquire <sup>1</sup>H-detected 4D spectra of these challenging protein samples within reasonable timeframes. However, obtaining unambiguous assignments remains difficult without access to side-chain chemical shifts. Drawing inspiration from sensitivity-enhanced TOCSY experiments in solution NMR, we have explored the potential of <sup>13</sup>C- <sup>13</sup>C TOCSY mixing as a viable option for triple sensitivity-enhanced 4D experiments aimed at side-chain assignments in solid-state NMR. Through simulations and experimental trials, we have identified optimal conditions to achieve uniform transfer efficiency for both transverse components and to minimize undesired cross-transfers. Our experiments, conducted on the 30 kDa membrane protein GlpG embedded in <i>E. coli</i> liposomes, have demonstrated enhanced sensitivity compared to the most effective dipolar and J-coupling-based <sup>13</sup>C- <sup>13</sup>C mixing sequences. Notably, a non-uniformly sampled 4D hCXCANH spectrum with exceptionally high sensitivity was obtained in just a few days using a 600 MHz spectrometer equipped with a 1.3 mm probe operating at a magic angle spinning rate of 55 kHz.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"79 1","pages":"25 - 34"},"PeriodicalIF":1.3,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10858-024-00454-7.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142998176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-11DOI: 10.1007/s10858-024-00456-5
Yang Shen, Marshall J. Smith, John M. Louis, Ad Bax
Inclusion of residual dipolar couplings (RDCs) during the early rounds of protein structure determination requires use of a floating alignment tensor or knowledge of the alignment tensor strength and rhombicity. For proteins with interdomain motion, such analysis can falsely hide the presence of domain dynamics. We demonstrate for three proteins, maltotriose-ligated maltose binding protein (MBP), Ca2+-ligated calmodulin, and a monomeric N-terminal deletion mutant of the SARS-CoV-2 Main Protease, MPro, that good alignment tensor estimates of their domains can be obtained from RDCs measured for residues that are identified as α-helical based on their chemical shifts. The program, Helix-Fit, fits the RDCs to idealized α-helical coordinates, often yielding a comparable or better alignment tensor estimate than fitting to the actual high-resolution X-ray helix coordinates. The 13 helices of ligated MBP all show very similar alignment tensors, indicative of a high degree of order relative to one another. By contrast, while for monomeric MPro the alignment strengths of the five helices in the C-terminal helical domain (residues 200–306) are very similar, pointing to a well-ordered domain, the single α-helix Y54-I59 in the N-terminal catalytic domain (residues 10–185) aligns considerably weaker. This result indicates the presence of large amplitude motions of either Y54-I59 or of the entire N-terminal domain relative to the C-terminal domain, contrasting with the high degree of order seen in the native homodimeric structure.
{"title":"Alpha-helices as alignment reporters in residual dipolar coupling analysis of proteins","authors":"Yang Shen, Marshall J. Smith, John M. Louis, Ad Bax","doi":"10.1007/s10858-024-00456-5","DOIUrl":"10.1007/s10858-024-00456-5","url":null,"abstract":"<div><p>Inclusion of residual dipolar couplings (RDCs) during the early rounds of protein structure determination requires use of a floating alignment tensor or knowledge of the alignment tensor strength and rhombicity. For proteins with interdomain motion, such analysis can falsely hide the presence of domain dynamics. We demonstrate for three proteins, maltotriose-ligated maltose binding protein (MBP), Ca<sup>2+</sup>-ligated calmodulin, and a monomeric N-terminal deletion mutant of the SARS-CoV-2 Main Protease, MPro, that good alignment tensor estimates of their domains can be obtained from RDCs measured for residues that are identified as α-helical based on their chemical shifts. The program, Helix-Fit, fits the RDCs to idealized α-helical coordinates, often yielding a comparable or better alignment tensor estimate than fitting to the actual high-resolution X-ray helix coordinates. The 13 helices of ligated MBP all show very similar alignment tensors, indicative of a high degree of order relative to one another. By contrast, while for monomeric MPro the alignment strengths of the five helices in the C-terminal helical domain (residues 200–306) are very similar, pointing to a well-ordered domain, the single α-helix Y54-I59 in the N-terminal catalytic domain (residues 10–185) aligns considerably weaker. This result indicates the presence of large amplitude motions of either Y54-I59 or of the entire N-terminal domain relative to the C-terminal domain, contrasting with the high degree of order seen in the native homodimeric structure.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"79 1","pages":"47 - 57"},"PeriodicalIF":1.3,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10858-024-00456-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142805914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-07DOI: 10.1007/s10858-024-00452-9
Paulina Putko, Javier Agustin Romero, Christian F. Pantoja, Markus Zweckstetter, Krzysztof Kazimierczuk, Anna Zawadzka-Kazimierczuk
The resonance assignment of large intrinsically disordered proteins (IDPs) is difficult due to the low dispersion of chemical shifts (CSs). Luckily, CSs are often specific for certain residue types, which makes the task easier. Our recent work showed that the CS-based spin-system classification can be improved by applying a linear discriminant analysis (LDA). In this paper, we extend a set of classification parameters by adding temperature coefficients (TCs), i.e., rates of change of chemical shifts with temperature. As demonstrated previously by other groups, the TCs in IDPs depend on a residue type, although the relation is often too complex to be predicted theoretically. Thus, we propose an approach based on experimental data; CSs and TCs values of residues assigned using conventional methods serve as a training set for LDA, which then classifies the remaining resonances. The method is demonstrated on a large fragment (1-239) of highly disordered protein Tau. We noticed that adding TCs to sets of chemical shifts significantly improves the recognition efficiency. For example, it allows distinguishing between lysine and glutamic acid, as well as valine and isoleucine residues based on ({{text {H}}^{text {N}}}), N, ({{text {C}}_alpha }) and C(^{prime }) data. Moreover, adding TCs to CSs of ({{text {H}}^{text {N}}}), N, ({{text {C}}_alpha }), and C(^{prime }) is more beneficial than adding ({{text {C}}_beta }) CSs. Our program for LDA analysis is available at https://github.com/gugumatz/LDA-Temp-Coeff.
由于化学位移(CSs)的低分散性,大的内在无序蛋白(IDPs)的共振分配是困难的。幸运的是,CSs通常是特定于某些残留类型的,这使得任务更容易。我们最近的工作表明,利用线性判别分析(LDA)可以改进基于cs的自旋系统分类。在本文中,我们通过添加温度系数(TCs)扩展了一组分类参数,即化学位移随温度的变化率。正如先前其他研究小组所证明的那样,流离失所者的tc取决于剩余类型,尽管这种关系往往过于复杂,无法从理论上预测。因此,我们提出了一种基于实验数据的方法;使用传统方法分配的残差的CSs和tc值作为LDA的训练集,然后LDA对剩余的共振进行分类。该方法在高度无序的Tau蛋白的大片段(1-239)上得到了验证。我们注意到,将tc添加到化学位移集合中可以显著提高识别效率。例如,它可以根据H N, N, C α和C′数据区分赖氨酸和谷氨酸,以及缬氨酸和异亮氨酸残基。此外,在含有H N、N、C α和C′的碳水化合物中添加tc比添加C β碳水化合物更有利。我们的LDA分析程序可在https://github.com/gugumatz/LDA-Temp-Coeff上获得。
{"title":"Using temperature coefficients to support resonance assignment of intrinsically disordered proteins","authors":"Paulina Putko, Javier Agustin Romero, Christian F. Pantoja, Markus Zweckstetter, Krzysztof Kazimierczuk, Anna Zawadzka-Kazimierczuk","doi":"10.1007/s10858-024-00452-9","DOIUrl":"10.1007/s10858-024-00452-9","url":null,"abstract":"<div><p>The resonance assignment of large intrinsically disordered proteins (IDPs) is difficult due to the low dispersion of chemical shifts (CSs). Luckily, CSs are often specific for certain residue types, which makes the task easier. Our recent work showed that the CS-based spin-system classification can be improved by applying a linear discriminant analysis (LDA). In this paper, we extend a set of classification parameters by adding temperature coefficients (TCs), i.e., rates of change of chemical shifts with temperature. As demonstrated previously by other groups, the TCs in IDPs depend on a residue type, although the relation is often too complex to be predicted theoretically. Thus, we propose an approach based on experimental data; CSs and TCs values of residues assigned using conventional methods serve as a training set for LDA, which then classifies the remaining resonances. The method is demonstrated on a large fragment (1-239) of highly disordered protein Tau. We noticed that adding TCs to sets of chemical shifts significantly improves the recognition efficiency. For example, it allows distinguishing between lysine and glutamic acid, as well as valine and isoleucine residues based on <span>({{text {H}}^{text {N}}})</span>, N, <span>({{text {C}}_alpha })</span> and C<span>(^{prime })</span> data. Moreover, adding TCs to CSs of <span>({{text {H}}^{text {N}}})</span>, N, <span>({{text {C}}_alpha })</span>, and C<span>(^{prime })</span> is more beneficial than adding <span>({{text {C}}_beta })</span> CSs. Our program for LDA analysis is available at https://github.com/gugumatz/LDA-Temp-Coeff.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"79 1","pages":"59 - 65"},"PeriodicalIF":1.3,"publicationDate":"2024-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10858-024-00452-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142790829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-20DOI: 10.1007/s10858-024-00451-w
Victoria A. Higman, Eliza Płoskoń, Gary S. Thompson, Geerten W. Vuister
Artificial intelligence (AI) models are revolutionising scientific data analysis but are reliant on large training data sets. While artificial training data can be used in the context of NMR processing and data analysis methods, relating NMR parameters back to protein sequence and structure requires experimental data. In this perspective we examine what the biological NMR community needs to do, in order to store and share its data better so that we can make effective use of AI methods to further our understanding of biological molecules. We argue, first, that the community should be depositing much more of its experimental data. In particular, we should be depositing more spectra and dynamics data. Second, the NMR data deposited needs to capture the full information content required to be able to use and validate it adequately. The NMR Exchange Format (NEF) was designed several years ago to do this. The widespread adoption of NEF combined with a new proposal for dynamics data specifications come at the right time for the community to expand its deposition of data. Third, we highlight the importance of expanding and safeguarding our experimental data repository, the Biological Magnetic Resonance Data Bank (BMRB), not only in the interests of NMR spectroscopists, but biological scientists more widely. With this article we invite others in the biological NMR community to champion increased (possibly mandatory) data deposition, to get involved in designing new NEF specifications, and to advocate on behalf of the BMRB within the wider scientific community.
{"title":"Perspective: on the importance of extensive, high-quality and reliable deposition of biomolecular NMR data in the age of artificial intelligence","authors":"Victoria A. Higman, Eliza Płoskoń, Gary S. Thompson, Geerten W. Vuister","doi":"10.1007/s10858-024-00451-w","DOIUrl":"10.1007/s10858-024-00451-w","url":null,"abstract":"<div><p>Artificial intelligence (AI) models are revolutionising scientific data analysis but are reliant on large training data sets. While artificial training data can be used in the context of NMR processing and data analysis methods, relating NMR parameters back to protein sequence and structure requires experimental data. In this perspective we examine what the biological NMR community needs to do, in order to store and share its data better so that we can make effective use of AI methods to further our understanding of biological molecules. We argue, first, that the community should be depositing much more of its experimental data. In particular, we should be depositing more spectra and dynamics data. Second, the NMR data deposited needs to capture the full information content required to be able to use and validate it adequately. The NMR Exchange Format (NEF) was designed several years ago to do this. The widespread adoption of NEF combined with a new proposal for dynamics data specifications come at the right time for the community to expand its deposition of data. Third, we highlight the importance of expanding and safeguarding our experimental data repository, the Biological Magnetic Resonance Data Bank (BMRB), not only in the interests of NMR spectroscopists, but biological scientists more widely. With this article we invite others in the biological NMR community to champion increased (possibly mandatory) data deposition, to get involved in designing new NEF specifications, and to advocate on behalf of the BMRB within the wider scientific community.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"78 4","pages":"193 - 197"},"PeriodicalIF":1.3,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10858-024-00451-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142455362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-16DOI: 10.1007/s10858-024-00450-x
Md Khushnood Alam, R. Aishwarya Bhuvaneshwari, Ishita Sengupta
The recent application of 19F NMR in the study of biomolecular structure and dynamics has made it a potentially attractive probe to complement traditional 15N/13C labelled probes for backbone and sidechain dynamics, albeit with some complications. The utility of 15N relaxation rates of rigid backbone amide groups to determine the rotational diffusion tensor of proteins is well established. Here we show that the measured 19F relaxation rates of two buried and possibly immobile 19F labelled tryptophan sidechains for the multidomain protein RfaH, in its closed conformation, are in reasonable agreement with the calculated values, only when anisotropic rotational diffusion of the protein is considered. While the sparsity of 19F relaxation data from a limited number of probes precludes the experimental determination of the rotational diffusion tensor here, these results demonstrate the influence of rotational diffusion anisotropy of proteins on 19F NMR relaxation of rigid tryptophan sidechains, while adding to the expanding literature of 19F NMR relaxation data sets in biomolecules.
{"title":"19F NMR relaxation of buried tryptophan side chains suggest anisotropic rotational diffusion of the protein RfaH","authors":"Md Khushnood Alam, R. Aishwarya Bhuvaneshwari, Ishita Sengupta","doi":"10.1007/s10858-024-00450-x","DOIUrl":"10.1007/s10858-024-00450-x","url":null,"abstract":"<div><p>The recent application of <sup>19</sup>F NMR in the study of biomolecular structure and dynamics has made it a potentially attractive probe to complement traditional <sup>15</sup>N/<sup>13</sup>C labelled probes for backbone and sidechain dynamics, albeit with some complications. The utility of <sup>15</sup>N relaxation rates of rigid backbone amide groups to determine the rotational diffusion tensor of proteins is well established. Here we show that the measured <sup>19</sup>F relaxation rates of two buried and possibly immobile <sup>19</sup>F labelled tryptophan sidechains for the multidomain protein RfaH, in its closed conformation, are in reasonable agreement with the calculated values, only when anisotropic rotational diffusion of the protein is considered. While the sparsity of <sup>19</sup>F relaxation data from a limited number of probes precludes the experimental determination of the rotational diffusion tensor here, these results demonstrate the influence of rotational diffusion anisotropy of proteins on <sup>19</sup>F NMR relaxation of rigid tryptophan sidechains, while adding to the expanding literature of <sup>19</sup>F NMR relaxation data sets in biomolecules.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"78 4","pages":"265 - 273"},"PeriodicalIF":1.3,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142455361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-31DOI: 10.1007/s10858-024-00449-4
Vladlena Kharchenko, Samah Al-Harthi, Andrzej Ejchart, Łukasz Jaremko
The dynamics of the backbone and side-chains of protein are routinely studied by interpreting experimentally determined 15N spin relaxation rates. R1(15N), the longitudinal relaxation rate, reports on fast motions and encodes, together with the transverse relaxation R2, structural information about the shape of the molecule and the orientation of the amide bond vectors in the internal diffusion frame. Determining error-free 15N longitudinal relaxation rates remains a challenge for small, disordered, and medium-sized proteins. Here, we show that mono-exponential fitting is sufficient, with no statistical preference for bi-exponential fitting up to 800 MHz. A detailed comparison of the TROSY and HSQC techniques at medium and high fields showed no statistically significant differences. The least error-prone DD/CSA interference removal technique is the selective inversion of amide signals while avoiding water resonance. The exchange of amide with solvent deuterons appears to affect the rate R1 of solvent-exposed amides in all fields tested and in each DD/CSA interference removal technique in a statistically significant manner. In summary, the most accurate R1(15N) rates in proteins are achieved by selective amide inversion, without the addition of D2O. Importantly, at high magnetic fields stronger than 800 MHz, when non-mono-exponential decay is involved, it is advisable to consider elimination of the shortest delays (typically up to 0.32 s) or bi-exponential fitting.
{"title":"Pitfalls in measurements of R1 relaxation rates of protein backbone 15N nuclei","authors":"Vladlena Kharchenko, Samah Al-Harthi, Andrzej Ejchart, Łukasz Jaremko","doi":"10.1007/s10858-024-00449-4","DOIUrl":"10.1007/s10858-024-00449-4","url":null,"abstract":"<div><p>The dynamics of the backbone and side-chains of protein are routinely studied by interpreting experimentally determined <sup>15</sup>N spin relaxation rates. <i>R</i><sub>1</sub>(<sup>15</sup>N), the longitudinal relaxation rate, reports on fast motions and encodes, together with the transverse relaxation <i>R</i><sub>2</sub>, structural information about the shape of the molecule and the orientation of the amide bond vectors in the internal diffusion frame. Determining error-free <sup>15</sup>N longitudinal relaxation rates remains a challenge for small, disordered, and medium-sized proteins. Here, we show that mono-exponential fitting is sufficient, with no statistical preference for bi-exponential fitting up to 800 MHz. A detailed comparison of the TROSY and HSQC techniques at medium and high fields showed no statistically significant differences. The least error-prone DD/CSA interference removal technique is the selective inversion of amide signals while avoiding water resonance. The exchange of amide with solvent deuterons appears to affect the rate <i>R</i><sub>1</sub> of solvent-exposed amides in all fields tested and in each DD/CSA interference removal technique in a statistically significant manner. In summary, the most accurate <i>R</i><sub>1</sub>(<sup>15</sup>N) rates in proteins are achieved by selective amide inversion, without the addition of D<sub>2</sub>O. Importantly, at high magnetic fields stronger than 800 MHz, when non-mono-exponential decay is involved, it is advisable to consider elimination of the shortest delays (typically up to 0.32 s) or bi-exponential fitting.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"79 1","pages":"1 - 14"},"PeriodicalIF":1.3,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10858-024-00449-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142103198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}