Interpretable and explainable predictive machine learning models for data-driven protein engineering.

IF 12.1 1区 工程技术 Q1 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Biotechnology advances Pub Date : 2024-12-05 DOI:10.1016/j.biotechadv.2024.108495
David Medina-Ortiz, Ashkan Khalifeh, Hoda Anvari-Kazemabad, Mehdi D Davari
{"title":"Interpretable and explainable predictive machine learning models for data-driven protein engineering.","authors":"David Medina-Ortiz, Ashkan Khalifeh, Hoda Anvari-Kazemabad, Mehdi D Davari","doi":"10.1016/j.biotechadv.2024.108495","DOIUrl":null,"url":null,"abstract":"<p><p>Protein engineering through directed evolution and (semi)rational design has become a powerful approach for optimizing and enhancing proteins with desired properties. The integration of artificial intelligence methods has further accelerated protein engineering process by enabling the development of predictive models based on data-driven strategies. However, the lack of interpretability and transparency in these models limits their trustworthiness and applicability in real-world scenarios. Explainable Artificial Intelligence addresses these challenges by providing insights into the decision-making processes of machine learning models, enhancing their reliability and interpretability. Explainable strategies has been successfully applied in various biotechnology fields, including drug discovery, genomics, and medicine, yet its application in protein engineering remains underexplored. The incorporation of explainable strategies in protein engineering holds significant potential, as it can guide protein design by revealing how predictive models function, benefiting approaches such as machine learning-assisted directed evolution. This perspective work explores the principles and methodologies of explainable artificial intelligence, highlighting its relevance in biotechnology and its potential to enhance protein design. Additionally, three theoretical pipelines integrating predictive models with explainable strategies are proposed, focusing on their advantages, disadvantages, and technical requirements. Finally, the remaining challenges of explainable artificial intelligence in protein engineering and future directions for its development as a support tool for traditional protein engineering methodologies are discussed.</p>","PeriodicalId":8946,"journal":{"name":"Biotechnology advances","volume":" ","pages":"108495"},"PeriodicalIF":12.1000,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biotechnology advances","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.biotechadv.2024.108495","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Protein engineering through directed evolution and (semi)rational design has become a powerful approach for optimizing and enhancing proteins with desired properties. The integration of artificial intelligence methods has further accelerated protein engineering process by enabling the development of predictive models based on data-driven strategies. However, the lack of interpretability and transparency in these models limits their trustworthiness and applicability in real-world scenarios. Explainable Artificial Intelligence addresses these challenges by providing insights into the decision-making processes of machine learning models, enhancing their reliability and interpretability. Explainable strategies has been successfully applied in various biotechnology fields, including drug discovery, genomics, and medicine, yet its application in protein engineering remains underexplored. The incorporation of explainable strategies in protein engineering holds significant potential, as it can guide protein design by revealing how predictive models function, benefiting approaches such as machine learning-assisted directed evolution. This perspective work explores the principles and methodologies of explainable artificial intelligence, highlighting its relevance in biotechnology and its potential to enhance protein design. Additionally, three theoretical pipelines integrating predictive models with explainable strategies are proposed, focusing on their advantages, disadvantages, and technical requirements. Finally, the remaining challenges of explainable artificial intelligence in protein engineering and future directions for its development as a support tool for traditional protein engineering methodologies are discussed.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Biotechnology advances
Biotechnology advances 工程技术-生物工程与应用微生物
CiteScore
25.50
自引率
2.50%
发文量
167
审稿时长
37 days
期刊介绍: Biotechnology Advances is a comprehensive review journal that covers all aspects of the multidisciplinary field of biotechnology. The journal focuses on biotechnology principles and their applications in various industries, agriculture, medicine, environmental concerns, and regulatory issues. It publishes authoritative articles that highlight current developments and future trends in the field of biotechnology. The journal invites submissions of manuscripts that are relevant and appropriate. It targets a wide audience, including scientists, engineers, students, instructors, researchers, practitioners, managers, governments, and other stakeholders in the field. Additionally, special issues are published based on selected presentations from recent relevant conferences in collaboration with the organizations hosting those conferences.
期刊最新文献
The role of the proteosurfaceome and exoproteome in bacterial coaggregation. Direct cloning strategies for large genomic fragments: A review. From discovery to application: Enabling technology-based optimizing carbonyl reductases biocatalysis for active pharmaceutical ingredient synthesis. Engineering microbial cell factories by multiplexed spatiotemporal control of cellular metabolism: Advances, challenges, and future perspectives. Interpretable and explainable predictive machine learning models for data-driven protein engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1