David Medina-Ortiz, Ashkan Khalifeh, Hoda Anvari-Kazemabad, Mehdi D Davari
{"title":"Interpretable and explainable predictive machine learning models for data-driven protein engineering.","authors":"David Medina-Ortiz, Ashkan Khalifeh, Hoda Anvari-Kazemabad, Mehdi D Davari","doi":"10.1016/j.biotechadv.2024.108495","DOIUrl":null,"url":null,"abstract":"<p><p>Protein engineering through directed evolution and (semi)rational design has become a powerful approach for optimizing and enhancing proteins with desired properties. The integration of artificial intelligence methods has further accelerated protein engineering process by enabling the development of predictive models based on data-driven strategies. However, the lack of interpretability and transparency in these models limits their trustworthiness and applicability in real-world scenarios. Explainable Artificial Intelligence addresses these challenges by providing insights into the decision-making processes of machine learning models, enhancing their reliability and interpretability. Explainable strategies has been successfully applied in various biotechnology fields, including drug discovery, genomics, and medicine, yet its application in protein engineering remains underexplored. The incorporation of explainable strategies in protein engineering holds significant potential, as it can guide protein design by revealing how predictive models function, benefiting approaches such as machine learning-assisted directed evolution. This perspective work explores the principles and methodologies of explainable artificial intelligence, highlighting its relevance in biotechnology and its potential to enhance protein design. Additionally, three theoretical pipelines integrating predictive models with explainable strategies are proposed, focusing on their advantages, disadvantages, and technical requirements. Finally, the remaining challenges of explainable artificial intelligence in protein engineering and future directions for its development as a support tool for traditional protein engineering methodologies are discussed.</p>","PeriodicalId":8946,"journal":{"name":"Biotechnology advances","volume":" ","pages":"108495"},"PeriodicalIF":12.1000,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biotechnology advances","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.biotechadv.2024.108495","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Protein engineering through directed evolution and (semi)rational design has become a powerful approach for optimizing and enhancing proteins with desired properties. The integration of artificial intelligence methods has further accelerated protein engineering process by enabling the development of predictive models based on data-driven strategies. However, the lack of interpretability and transparency in these models limits their trustworthiness and applicability in real-world scenarios. Explainable Artificial Intelligence addresses these challenges by providing insights into the decision-making processes of machine learning models, enhancing their reliability and interpretability. Explainable strategies has been successfully applied in various biotechnology fields, including drug discovery, genomics, and medicine, yet its application in protein engineering remains underexplored. The incorporation of explainable strategies in protein engineering holds significant potential, as it can guide protein design by revealing how predictive models function, benefiting approaches such as machine learning-assisted directed evolution. This perspective work explores the principles and methodologies of explainable artificial intelligence, highlighting its relevance in biotechnology and its potential to enhance protein design. Additionally, three theoretical pipelines integrating predictive models with explainable strategies are proposed, focusing on their advantages, disadvantages, and technical requirements. Finally, the remaining challenges of explainable artificial intelligence in protein engineering and future directions for its development as a support tool for traditional protein engineering methodologies are discussed.
期刊介绍:
Biotechnology Advances is a comprehensive review journal that covers all aspects of the multidisciplinary field of biotechnology. The journal focuses on biotechnology principles and their applications in various industries, agriculture, medicine, environmental concerns, and regulatory issues. It publishes authoritative articles that highlight current developments and future trends in the field of biotechnology. The journal invites submissions of manuscripts that are relevant and appropriate. It targets a wide audience, including scientists, engineers, students, instructors, researchers, practitioners, managers, governments, and other stakeholders in the field. Additionally, special issues are published based on selected presentations from recent relevant conferences in collaboration with the organizations hosting those conferences.