Embedding AI in the Protein Crystallography Workflow

Q3 Physics and Astronomy Synchrotron Radiation News Pub Date : 2022-10-07 DOI:10.1080/08940886.2022.2114723
Richard J. Gildea, C. Orr, N. Paterson, D. Hall
{"title":"Embedding AI in the Protein Crystallography Workflow","authors":"Richard J. Gildea, C. Orr, N. Paterson, D. Hall","doi":"10.1080/08940886.2022.2114723","DOIUrl":null,"url":null,"abstract":"Historically, solving the structure of a protein required deep knowledge of crystallography and the ability to produce protein crystals of suitable quality to generate high-quality diffraction data. Over the years, as beamline optics, end-stations, detectors, and data collection strategies have improved, it has become more feasible to extract highquality diffraction data from ever smaller or less perfect protein crystals and from very large arrays of crystals for techniques such as serial synchrotron crystallography and fragment-based drug discovery. At Diamond, these improvements have been coupled with highly integrated automated pipelines for data reduction and structure solution using techniques such as molecular replacement and experimental phasing. This has led to the dichotomy, and benefits, of being able to do increasingly challenging experiments requiring deep crystallographic knowledge with facility staff support at the same time as lowering the barrier to entry where automated structure solution tools of the facility perform this task for those scientists with less experience. This enables users to focus on the science rather than the process. Diamond Light Source, the UK’s national synchrotron, has a suite of instruments dedicated to solving the 3D structure of large biological molecules, including seven macromolecular crystallography (MX) beamlines. Solved 3D structures are deposited into the publicly available Protein Data Bank (PDB) and the depositions are released on a weekly basis. In 2020, following 13 years of operation, Diamond hit the milestone of 10,000 structures deposited in the PDB. Two years on, this number is now more than 12,000. Thanks to decades of work across the world, there is an ocean of information in the PDB that serves as an invaluable reference when solving the structures of new proteins.","PeriodicalId":39020,"journal":{"name":"Synchrotron Radiation News","volume":"35 1","pages":"51 - 54"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Synchrotron Radiation News","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/08940886.2022.2114723","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Physics and Astronomy","Score":null,"Total":0}
引用次数: 1

Abstract

Historically, solving the structure of a protein required deep knowledge of crystallography and the ability to produce protein crystals of suitable quality to generate high-quality diffraction data. Over the years, as beamline optics, end-stations, detectors, and data collection strategies have improved, it has become more feasible to extract highquality diffraction data from ever smaller or less perfect protein crystals and from very large arrays of crystals for techniques such as serial synchrotron crystallography and fragment-based drug discovery. At Diamond, these improvements have been coupled with highly integrated automated pipelines for data reduction and structure solution using techniques such as molecular replacement and experimental phasing. This has led to the dichotomy, and benefits, of being able to do increasingly challenging experiments requiring deep crystallographic knowledge with facility staff support at the same time as lowering the barrier to entry where automated structure solution tools of the facility perform this task for those scientists with less experience. This enables users to focus on the science rather than the process. Diamond Light Source, the UK’s national synchrotron, has a suite of instruments dedicated to solving the 3D structure of large biological molecules, including seven macromolecular crystallography (MX) beamlines. Solved 3D structures are deposited into the publicly available Protein Data Bank (PDB) and the depositions are released on a weekly basis. In 2020, following 13 years of operation, Diamond hit the milestone of 10,000 structures deposited in the PDB. Two years on, this number is now more than 12,000. Thanks to decades of work across the world, there is an ocean of information in the PDB that serves as an invaluable reference when solving the structures of new proteins.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
将人工智能嵌入蛋白质晶体学工作流程
从历史上看,解决蛋白质的结构需要深入的晶体学知识和生产合适质量的蛋白质晶体以生成高质量衍射数据的能力。多年来,随着光束线光学、终端站、探测器和数据收集策略的改进,从越来越小或不太完美的蛋白质晶体和非常大的晶体阵列中提取高质量衍射数据变得更加可行,用于串行同步加速器晶体学和基于碎片的药物发现等技术。在Diamond,这些改进与高度集成的自动化管道相结合,用于使用分子替换和实验定相等技术进行数据简化和结构解决方案。这导致了二分法和好处,即能够在设施工作人员的支持下进行越来越具有挑战性的实验,需要深入的晶体学知识,同时降低了设施的自动化结构解决方案工具为经验较少的科学家执行这项任务的门槛。这使得用户能够专注于科学而不是过程。英国国家同步加速器钻石光源拥有一套专门用于解决大生物分子三维结构的仪器,包括七条大分子晶体学(MX)光束线。解算的3D结构被存入公开可用的蛋白质数据库(PDB),并且沉积物每周发布一次。2020年,经过13年的运营,Diamond达到了在PDB中沉积10000个结构的里程碑。两年过去了,这个数字现在已经超过1.2万。由于世界各地几十年的工作,PDB中有大量的信息,在解决新蛋白质的结构时可以作为宝贵的参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Synchrotron Radiation News
Synchrotron Radiation News Physics and Astronomy-Nuclear and High Energy Physics
CiteScore
1.30
自引率
0.00%
发文量
46
期刊最新文献
Overview over Surface Sensitive X-Ray Techniques Soft Matter Surface and Interfacial Dynamics by Grazing-Incidence X-Ray Photon Correlation Spectroscopy Bragg Coherent Imaging of Fluctuating Domains Ultrafast Photoemission Spectroscopy and Photon Engineering Techniques Unveiled at NSRRC Taiwan Liquid Interface Science at ChemMatCARS
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1