Integrating Large-Scale Protein Structure Prediction into Human Genetics Research
Miguel Correa Marrero, Jürgen Jänes, Delora Baptista, Pedro Beltrao
Annual Review of Genomics and Human Genetics. Published 2024-04-15. DOI: 10.1146/annurev-genom-120622-020615
Abstract
The last five years have seen impressive progress in deep learning models applied to protein research. Most notably, sequence-based structure predictions have seen transformative gains in the form of AlphaFold2 and related approaches. Millions of missense protein variants in the human population lack annotations, and these computational methods are a valuable means to prioritize variants for further analysis. Here, we review the recent progress in deep learning models applied to the prediction of protein structure and protein variants, with particular emphasis on their implications for human genetics and health. Improved prediction of protein structures facilitates annotations of the impact of variants on protein stability, protein–protein interaction interfaces, and small-molecule binding pockets. Moreover, it contributes to the study of host–pathogen interactions and the characterization of protein function. As genome sequencing in large cohorts becomes increasingly prevalent, we believe that better integration of state-of-the-art protein informatics technologies into human genetics research is of paramount importance.
About the journal:
Since its inception in 2000, the Annual Review of Genomics and Human Genetics has been dedicated to showcasing significant developments in genomics as they pertain to human genetics and the human genome. The journal emphasizes genomic technology, genome structure and function, genetic modification, human variation and population genetics, human evolution, and various aspects of human genetic diseases, including individualized medicine.