冷冻电子显微镜单粒子分析中的偏差、方差、过拟合、金标准和共识。

IF 0.7 Q3 ANTHROPOLOGY Museum Anthropology Pub Date : 2022-04-01 Epub Date: 2022-03-16 DOI:10.1107/S2059798322001978
C O S Sorzano, A Jiménez-Moreno, D Maluenda, M Martínez, E Ramírez-Aportela, J Krieger, R Melero, A Cuervo, J Conesa, J Filipovic, P Conesa, L Del Caño, Y C Fonseca, J Jiménez-de la Morena, P Losana, R Sánchez-García, D Strelak, E Fernández-Giménez, F P de Isidro-Gómez, D Herreros, J L Vilas, R Marabini, J M Carazo
{"title":"冷冻电子显微镜单粒子分析中的偏差、方差、过拟合、金标准和共识。","authors":"C O S Sorzano, A Jiménez-Moreno, D Maluenda, M Martínez, E Ramírez-Aportela, J Krieger, R Melero, A Cuervo, J Conesa, J Filipovic, P Conesa, L Del Caño, Y C Fonseca, J Jiménez-de la Morena, P Losana, R Sánchez-García, D Strelak, E Fernández-Giménez, F P de Isidro-Gómez, D Herreros, J L Vilas, R Marabini, J M Carazo","doi":"10.1107/S2059798322001978","DOIUrl":null,"url":null,"abstract":"<p><p>Cryo-electron microscopy (cryoEM) has become a well established technique to elucidate the 3D structures of biological macromolecules. Projection images from thousands of macromolecules that are assumed to be structurally identical are combined into a single 3D map representing the Coulomb potential of the macromolecule under study. This article discusses possible caveats along the image-processing path and how to avoid them to obtain a reliable 3D structure. Some of these problems are very well known in the community. These may be referred to as sample-related (such as specimen denaturation at interfaces or non-uniform projection geometry leading to underrepresented projection directions). The rest are related to the algorithms used. While some have been discussed in depth in the literature, such as the use of an incorrect initial volume, others have received much less attention. However, they are fundamental in any data-analysis approach. Chiefly among them, instabilities in estimating many of the key parameters that are required for a correct 3D reconstruction that occur all along the processing workflow are referred to, which may significantly affect the reliability of the whole process. In the field, the term overfitting has been coined to refer to some particular kinds of artifacts. It is argued that overfitting is a statistical bias in key parameter-estimation steps in the 3D reconstruction process, including intrinsic algorithmic bias. It is also shown that common tools (Fourier shell correlation) and strategies (gold standard) that are normally used to detect or prevent overfitting do not fully protect against it. Alternatively, it is proposed that detecting the bias that leads to overfitting is much easier when addressed at the level of parameter estimation, rather than detecting it once the particle images have been combined into a 3D map. Comparing the results from multiple algorithms (or at least, independent executions of the same algorithm) can detect parameter bias. These multiple executions could then be averaged to give a lower variance estimate of the underlying parameters.</p>","PeriodicalId":43404,"journal":{"name":"Museum Anthropology","volume":"29 1","pages":"410-423"},"PeriodicalIF":0.7000,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8972802/pdf/","citationCount":"0","resultStr":"{\"title\":\"On bias, variance, overfitting, gold standard and consensus in single-particle analysis by cryo-electron microscopy.\",\"authors\":\"C O S Sorzano, A Jiménez-Moreno, D Maluenda, M Martínez, E Ramírez-Aportela, J Krieger, R Melero, A Cuervo, J Conesa, J Filipovic, P Conesa, L Del Caño, Y C Fonseca, J Jiménez-de la Morena, P Losana, R Sánchez-García, D Strelak, E Fernández-Giménez, F P de Isidro-Gómez, D Herreros, J L Vilas, R Marabini, J M Carazo\",\"doi\":\"10.1107/S2059798322001978\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Cryo-electron microscopy (cryoEM) has become a well established technique to elucidate the 3D structures of biological macromolecules. Projection images from thousands of macromolecules that are assumed to be structurally identical are combined into a single 3D map representing the Coulomb potential of the macromolecule under study. This article discusses possible caveats along the image-processing path and how to avoid them to obtain a reliable 3D structure. Some of these problems are very well known in the community. These may be referred to as sample-related (such as specimen denaturation at interfaces or non-uniform projection geometry leading to underrepresented projection directions). The rest are related to the algorithms used. While some have been discussed in depth in the literature, such as the use of an incorrect initial volume, others have received much less attention. However, they are fundamental in any data-analysis approach. Chiefly among them, instabilities in estimating many of the key parameters that are required for a correct 3D reconstruction that occur all along the processing workflow are referred to, which may significantly affect the reliability of the whole process. In the field, the term overfitting has been coined to refer to some particular kinds of artifacts. It is argued that overfitting is a statistical bias in key parameter-estimation steps in the 3D reconstruction process, including intrinsic algorithmic bias. It is also shown that common tools (Fourier shell correlation) and strategies (gold standard) that are normally used to detect or prevent overfitting do not fully protect against it. Alternatively, it is proposed that detecting the bias that leads to overfitting is much easier when addressed at the level of parameter estimation, rather than detecting it once the particle images have been combined into a 3D map. Comparing the results from multiple algorithms (or at least, independent executions of the same algorithm) can detect parameter bias. These multiple executions could then be averaged to give a lower variance estimate of the underlying parameters.</p>\",\"PeriodicalId\":43404,\"journal\":{\"name\":\"Museum Anthropology\",\"volume\":\"29 1\",\"pages\":\"410-423\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2022-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8972802/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Museum Anthropology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1107/S2059798322001978\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2022/3/16 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"ANTHROPOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Museum Anthropology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1107/S2059798322001978","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/3/16 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"ANTHROPOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

冷冻电子显微镜(cryoEM)已成为阐明生物大分子三维结构的成熟技术。成千上万个假定结构相同的大分子的投影图像被组合成一张三维图,代表了所研究大分子的库仑势。本文将讨论图像处理过程中可能出现的问题,以及如何避免这些问题以获得可靠的三维结构。其中一些问题在业界众所周知。这些问题可能与样本有关(如界面处的样本变性或投影几何形状不均匀导致投影方向代表性不足)。其余问题则与所使用的算法有关。虽然文献中已经对一些问题进行了深入讨论,如使用不正确的初始体积,但其他问题受到的关注要少得多。然而,它们是任何数据分析方法的基础。其中主要是指在整个处理工作流程中,在估计正确的三维重建所需的许多关键参数时出现的不稳定性,这可能会严重影响整个过程的可靠性。在这一领域,"过拟合"(overfitting)一词被用来指代某些特定类型的伪影。本文认为,过拟合是三维重建过程中关键参数估计步骤的统计偏差,包括内在算法偏差。研究还表明,通常用于检测或防止过拟合的常用工具(傅里叶壳相关性)和策略(黄金标准)并不能完全防止过拟合。另外,研究还提出,如果在参数估计阶段就能发现导致过拟合的偏差,而不是在将粒子图像组合成三维地图后才发现,则会容易得多。比较多个算法(或至少是同一算法的独立执行)的结果可以发现参数偏差。然后,可以对这些多次执行的结果进行平均,以得出基础参数的较低方差估计值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
On bias, variance, overfitting, gold standard and consensus in single-particle analysis by cryo-electron microscopy.

Cryo-electron microscopy (cryoEM) has become a well established technique to elucidate the 3D structures of biological macromolecules. Projection images from thousands of macromolecules that are assumed to be structurally identical are combined into a single 3D map representing the Coulomb potential of the macromolecule under study. This article discusses possible caveats along the image-processing path and how to avoid them to obtain a reliable 3D structure. Some of these problems are very well known in the community. These may be referred to as sample-related (such as specimen denaturation at interfaces or non-uniform projection geometry leading to underrepresented projection directions). The rest are related to the algorithms used. While some have been discussed in depth in the literature, such as the use of an incorrect initial volume, others have received much less attention. However, they are fundamental in any data-analysis approach. Chiefly among them, instabilities in estimating many of the key parameters that are required for a correct 3D reconstruction that occur all along the processing workflow are referred to, which may significantly affect the reliability of the whole process. In the field, the term overfitting has been coined to refer to some particular kinds of artifacts. It is argued that overfitting is a statistical bias in key parameter-estimation steps in the 3D reconstruction process, including intrinsic algorithmic bias. It is also shown that common tools (Fourier shell correlation) and strategies (gold standard) that are normally used to detect or prevent overfitting do not fully protect against it. Alternatively, it is proposed that detecting the bias that leads to overfitting is much easier when addressed at the level of parameter estimation, rather than detecting it once the particle images have been combined into a 3D map. Comparing the results from multiple algorithms (or at least, independent executions of the same algorithm) can detect parameter bias. These multiple executions could then be averaged to give a lower variance estimate of the underlying parameters.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Museum Anthropology
Museum Anthropology ANTHROPOLOGY-
CiteScore
0.80
自引率
75.00%
发文量
23
期刊介绍: Museum Anthropology seeks to be a leading voice for scholarly research on the collection, interpretation, and representation of the material world. Through critical articles, provocative commentaries, and thoughtful reviews, this peer-reviewed journal aspires to cultivate vibrant dialogues that reflect the global and transdisciplinary work of museums. Situated at the intersection of practice and theory, Museum Anthropology advances our knowledge of the ways in which material objects are intertwined with living histories of cultural display, economics, socio-politics, law, memory, ethics, colonialism, conservation, and public education.
期刊最新文献
Silencing the past: Power and the production of history By Michel‐RolphTrouillot, Boston: Beacon Press. 1995 Issue Information The “saint” of Livingstonia: Assembling, memorializing, and representation of missionary paraphernalia at the Stone House Museum in Malawi Diversity and philanthropy at African American museums: Black Renaissance By Patricia A. Banks, London and New York: Routledge Taylor & Francis Group. First Published 2019 by Routledge. pp. 212. ISBN: 9780367730093 (pbk), ISBN: 9780815349648 (hbk), ISBN: 9781351164368 (ebk) An artists' reflection of her First Civil Rights Tour
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1