Limitations of Protein Structure Prediction Algorithms in Therapeutic Protein Development

Sarfaraz K. Niazi, Zamara Mariam, R. Z. Paracha
{"title":"Limitations of Protein Structure Prediction Algorithms in Therapeutic Protein Development","authors":"Sarfaraz K. Niazi, Zamara Mariam, R. Z. Paracha","doi":"10.3390/biomedinformatics4010007","DOIUrl":null,"url":null,"abstract":"The three-dimensional protein structure is pivotal in comprehending biological phenomena. It directly governs protein function and hence aids in drug discovery. The development of protein prediction algorithms, such as AlphaFold2, ESMFold, and trRosetta, has given much hope in expediting protein-based therapeutic discovery. Though no study has reported a conclusive application of these algorithms, the efforts continue with much optimism. We intended to test the application of these algorithms in rank-ordering therapeutic proteins for their instability during the pre-translational modification stages, as may be predicted according to the confidence of the structure predicted by these algorithms. The selected molecules were based on a harmonized category of licensed therapeutic proteins; out of the 204 licensed products, 188 that were not conjugated were chosen for analysis, resulting in a lack of correlation between the confidence scores and structural or protein properties. It is crucial to note here that the predictive accuracy of these algorithms is contingent upon the presence of the known structure of the protein in the accessible database. Consequently, our conclusion emphasizes that these algorithms primarily replicate information derived from existing structures. While our findings caution against relying on these algorithms for drug discovery purposes, we acknowledge the need for a nuanced interpretation. Considering their limitations and recognizing that their utility may be constrained to scenarios where known structures are available is important. Hence, caution is advised when applying these algorithms to characterize various attributes of therapeutic proteins without the support of adequate structural information. It is worth noting that the two main algorithms, AlfphaFold2 and ESMFold, also showed a 72% correlation in their scores, pointing to similar limitations. While much progress has been made in computational sciences, the Levinthal paradox remains unsolved.","PeriodicalId":72394,"journal":{"name":"BioMedInformatics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BioMedInformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/biomedinformatics4010007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The three-dimensional protein structure is pivotal in comprehending biological phenomena. It directly governs protein function and hence aids in drug discovery. The development of protein prediction algorithms, such as AlphaFold2, ESMFold, and trRosetta, has given much hope in expediting protein-based therapeutic discovery. Though no study has reported a conclusive application of these algorithms, the efforts continue with much optimism. We intended to test the application of these algorithms in rank-ordering therapeutic proteins for their instability during the pre-translational modification stages, as may be predicted according to the confidence of the structure predicted by these algorithms. The selected molecules were based on a harmonized category of licensed therapeutic proteins; out of the 204 licensed products, 188 that were not conjugated were chosen for analysis, resulting in a lack of correlation between the confidence scores and structural or protein properties. It is crucial to note here that the predictive accuracy of these algorithms is contingent upon the presence of the known structure of the protein in the accessible database. Consequently, our conclusion emphasizes that these algorithms primarily replicate information derived from existing structures. While our findings caution against relying on these algorithms for drug discovery purposes, we acknowledge the need for a nuanced interpretation. Considering their limitations and recognizing that their utility may be constrained to scenarios where known structures are available is important. Hence, caution is advised when applying these algorithms to characterize various attributes of therapeutic proteins without the support of adequate structural information. It is worth noting that the two main algorithms, AlfphaFold2 and ESMFold, also showed a 72% correlation in their scores, pointing to similar limitations. While much progress has been made in computational sciences, the Levinthal paradox remains unsolved.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
治疗性蛋白质开发中蛋白质结构预测算法的局限性
蛋白质的三维结构对于理解生物现象至关重要。它直接决定着蛋白质的功能,因此有助于药物发现。蛋白质预测算法的发展,如 AlphaFold2、ESMFold 和 trRosetta,为加快基于蛋白质的治疗发现带来了很大希望。虽然还没有研究报告称这些算法得到了最终应用,但人们仍对其前景充满信心。我们打算测试这些算法在对治疗用蛋白质进行排序时的应用情况,以确定这些蛋白质在翻译前修饰阶段的不稳定性。所选分子基于已获许可的治疗蛋白质统一类别;在 204 种已获许可的产品中,有 188 种未共轭,因此可信度评分与结构或蛋白质特性之间缺乏相关性。在此必须指出的是,这些算法的预测准确性取决于可访问数据库中是否存在已知的蛋白质结构。因此,我们的结论强调,这些算法主要是复制从现有结构中获得的信息。虽然我们的研究结果提醒人们不要依赖这些算法来发现药物,但我们也承认有必要对其进行细致的解释。考虑到这些算法的局限性,并认识到它们的实用性可能仅限于已知结构可用的情况,这一点非常重要。因此,在没有足够结构信息支持的情况下,应用这些算法来描述治疗蛋白质的各种属性时,建议谨慎行事。值得注意的是,AlfphaFold2 和 ESMFold 这两种主要算法的得分也显示出 72% 的相关性,这也说明了类似的局限性。虽然计算科学取得了很大进展,但勒文塔尔悖论仍未得到解决。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.70
自引率
0.00%
发文量
0
期刊最新文献
Cinco de Bio: A Low-Code Platform for Domain-Specific Workflows for Biomedical Imaging Research Approaches to Extracting Patterns of Service Utilization for Patients with Complex Conditions: Graph Community Detection vs. Natural Language Processing Clustering Replies to Queries in Gynecologic Oncology by Bard, Bing and the Google Assistant Should AI-Powered Whole-Genome Sequencing Be Used Routinely for Personalized Decision Support in Surgical Oncology—A Scoping Review Transfer-Learning Approach for Enhanced Brain Tumor Classification in MRI Imaging
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1