首页 > 最新文献

Genomics, Proteomics & Bioinformatics最新文献

英文 中文
T2T-YAO: A Telomere-to-telomere Assembled Diploid Reference Genome for Han Chinese. T2T-YAO:从端粒到端粒的汉族二倍体参考基因组。
IF 11.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2023-12-01 Epub Date: 2023-08-16 DOI: 10.1016/j.gpb.2023.08.001
Yukun He, Yanan Chu, Shuming Guo, Jiang Hu, Ran Li, Yali Zheng, Xinqian Ma, Zhenglin Du, Lili Zhao, Wenyi Yu, Jianbo Xue, Wenjie Bian, Feifei Yang, Xi Chen, Pingan Zhang, Rihan Wu, Yifan Ma, Changjun Shao, Jing Chen, Jian Wang, Jiwei Li, Jing Wu, Xiaoyi Hu, Qiuyue Long, Mingzheng Jiang, Hongli Ye, Shixu Song, Guangyao Li, Yue Wei, Yu Xu, Yanliang Ma, Yanwen Chen, Keqiang Wang, Jing Bao, Wen Xi, Fang Wang, Wentao Ni, Moqin Zhang, Yan Yu, Shengnan Li, Yu Kang, Zhancheng Gao

Since its initial release in 2001, the human reference genome has undergone continuous improvement in quality, and the recently released telomere-to-telomere (T2T) version - T2T-CHM13 - reaches its highest level of continuity and accuracy after 20 years of effort by working on a simplified, nearly homozygous genome of a hydatidiform mole cell line. Here, to provide an authentic complete diploid human genome reference for the Han Chinese, the largest population in the world, we assembled the genome of a male Han Chinese individual, T2T-YAO, which includes T2T assemblies of all the 22 + X + M and 22 + Y chromosomes in both haploids. The quality of T2T-YAO is much better than those of all currently available diploid assemblies, and its haploid version, T2T-YAO-hp, generated by selecting the better assembly for each autosome, reaches the top quality of fewer than one error per 29.5 Mb, even higher than that of T2T-CHM13. Derived from an individual living in the aboriginal region of the Han population, T2T-YAO shows clear ancestry and potential genetic continuity from the ancient ancestors. Each haplotype of T2T-YAO possesses ∼ 330-Mb exclusive sequences, ∼ 3100 unique genes, and tens of thousands of nucleotide and structural variations as compared with CHM13, highlighting the necessity of a population-stratified reference genome. The construction of T2T-YAO, an accurate and authentic representative of the Chinese population, would enable precise delineation of genomic variations and advance our understandings in the hereditability of diseases and phenotypes, especially within the context of the unique variations of the Chinese population.

自2001年首次发布以来,人类参考基因组的质量不断提高,最近发布的端粒到端粒(T2T)版本--T2T-CHM13--经过20年的努力,在简化的、近乎同源的水滴形痣细胞系基因组的基础上,达到了连续性和准确性的最高水平。在这里,为了给世界上人口最多的汉族提供一个真实完整的二倍体人类基因组参考,我们组装了一个汉族男性个体的基因组 T2T-YAO,其中包括单倍体中所有 22 + X + M 和 22 + Y 染色体的 T2T 组装。T2T-YAO 的质量远远优于目前所有的二倍体基因组,其单倍体版本 T2T-YAO-hp 是通过为每个常染色体选择更好的基因组而生成的,达到了每 29.5 Mb 只有不到一个错误的最高质量,甚至高于 T2T-CHM13 的质量。T2T-YAO 源自一个生活在汉族原住民地区的个体,显示出与远古祖先清晰的祖先关系和潜在的遗传连续性。与 CHM13 相比,T2T-YAO 的每个单倍型拥有 330-Mb 的独有序列,3100 个独特基因,以及数以万计的核苷酸和结构变异,凸显了人群分层参考基因组的必要性。T2T-YAO是一个真正准确和真实的中国人群代表,它的构建将有助于精确划分基因组变异,推进我们对疾病和表型遗传性的理解,尤其是在中国人群独特变异的背景下。
{"title":"T2T-YAO: A Telomere-to-telomere Assembled Diploid Reference Genome for Han Chinese.","authors":"Yukun He, Yanan Chu, Shuming Guo, Jiang Hu, Ran Li, Yali Zheng, Xinqian Ma, Zhenglin Du, Lili Zhao, Wenyi Yu, Jianbo Xue, Wenjie Bian, Feifei Yang, Xi Chen, Pingan Zhang, Rihan Wu, Yifan Ma, Changjun Shao, Jing Chen, Jian Wang, Jiwei Li, Jing Wu, Xiaoyi Hu, Qiuyue Long, Mingzheng Jiang, Hongli Ye, Shixu Song, Guangyao Li, Yue Wei, Yu Xu, Yanliang Ma, Yanwen Chen, Keqiang Wang, Jing Bao, Wen Xi, Fang Wang, Wentao Ni, Moqin Zhang, Yan Yu, Shengnan Li, Yu Kang, Zhancheng Gao","doi":"10.1016/j.gpb.2023.08.001","DOIUrl":"10.1016/j.gpb.2023.08.001","url":null,"abstract":"<p><p>Since its initial release in 2001, the human reference genome has undergone continuous improvement in quality, and the recently released telomere-to-telomere (T2T) version - T2T-CHM13 - reaches its highest level of continuity and accuracy after 20 years of effort by working on a simplified, nearly homozygous genome of a hydatidiform mole cell line. Here, to provide an authentic complete diploid human genome reference for the Han Chinese, the largest population in the world, we assembled the genome of a male Han Chinese individual, T2T-YAO, which includes T2T assemblies of all the 22 + X + M and 22 + Y chromosomes in both haploids. The quality of T2T-YAO is much better than those of all currently available diploid assemblies, and its haploid version, T2T-YAO-hp, generated by selecting the better assembly for each autosome, reaches the top quality of fewer than one error per 29.5 Mb, even higher than that of T2T-CHM13. Derived from an individual living in the aboriginal region of the Han population, T2T-YAO shows clear ancestry and potential genetic continuity from the ancient ancestors. Each haplotype of T2T-YAO possesses ∼ 330-Mb exclusive sequences, ∼ 3100 unique genes, and tens of thousands of nucleotide and structural variations as compared with CHM13, highlighting the necessity of a population-stratified reference genome. The construction of T2T-YAO, an accurate and authentic representative of the Chinese population, would enable precise delineation of genomic variations and advance our understandings in the hereditability of diseases and phenotypes, especially within the context of the unique variations of the Chinese population.</p>","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":" ","pages":"1085-1100"},"PeriodicalIF":11.5,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11082261/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10023539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Proteome Landscape of Human Placentas for Monochorionic Twins with Selective Intrauterine Growth Restriction. 选择性宫内生长受限的单绒毛膜双胎人类胎盘的蛋白质组图谱
IF 11.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2023-12-01 Epub Date: 2023-04-29 DOI: 10.1016/j.gpb.2023.03.002
Xin-Lu Meng, Peng-Bo Yuan, Xue-Ju Wang, Jing Hang, Xiao-Ming Shi, Yang-Yu Zhao, Yuan Wei

In perinatal medicine, intrauterine growth restriction (IUGR) is one of the greatest challenges. The etiology of IUGR is multifactorial, but most cases are thought to arise from placental insufficiency. However, identifying the placental cause of IUGR can be difficult due to numerous confounding factors. Selective IUGR (sIUGR) would be a good model to investigate how impaired placentation affects fetal development, as the growth discordance between monochorionic twins cannot be explained by confounding genetic or maternal factors. Herein, we constructed and analyzed the placental proteomic profiles of IUGR twins and normal cotwins. Specifically, we identified a total of 5481 proteins, of which 233 were differentially expressed (57 up-regulated and 176 down-regulated) in IUGR twins. Bioinformatics analysis indicates that these differentially expressed proteins (DEPs) are mainly associated with cardiovascular system development and function, organismal survival, and organismal development. Notably, 34 DEPs are significantly enriched in angiogenesis, and diminished placental angiogenesis in IUGR twins has been further elaborately confirmed. Moreover, we found decreased expression of metadherin (MTDH) in the placentas of IUGR twins and demonstrated that MTDH contributes to placental angiogenesis and fetal growth in vitro. Collectively, our findings reveal the comprehensive proteomic signatures of placentas for sIUGR twins, and the DEPs identified may provide in-depth insights into the pathogenesis of placental dysfunction and subsequent impaired fetal growth.

在围产医学中,宫内生长受限(IUGR)是最大的挑战之一。IUGR 的病因是多方面的,但大多数病例被认为是由胎盘功能不全引起的。然而,由于混杂因素较多,确定 IUGR 的胎盘病因可能比较困难。选择性 IUGR(sIUGR)是研究胎盘功能受损如何影响胎儿发育的一个很好的模型,因为单绒毛膜双胎之间的生长不一致无法用遗传或母体因素来解释。在此,我们构建并分析了IUGR双胞胎和正常同胎双胞胎的胎盘蛋白质组图谱。具体来说,我们共鉴定出5481个蛋白质,其中233个蛋白质在IUGR双胞胎中存在差异表达(57个上调,176个下调)。生物信息学分析表明,这些差异表达蛋白(DEPs)主要与心血管系统发育和功能、机体存活和机体发育有关。值得注意的是,34 个 DEPs 在血管生成中明显富集,而 IUGR 双胞胎胎盘血管生成的减少也得到了进一步的详细证实。此外,我们还发现IUGR双胞胎胎盘中的偏粘连蛋白(MTDH)表达减少,并证明MTDH有助于胎盘血管生成和胎儿体外生长。总之,我们的研究结果揭示了sIUGR双胞胎胎盘的综合蛋白质组特征,所发现的DEPs可能会为胎盘功能障碍及随后的胎儿生长受损的发病机制提供深入的见解。
{"title":"The Proteome Landscape of Human Placentas for Monochorionic Twins with Selective Intrauterine Growth Restriction.","authors":"Xin-Lu Meng, Peng-Bo Yuan, Xue-Ju Wang, Jing Hang, Xiao-Ming Shi, Yang-Yu Zhao, Yuan Wei","doi":"10.1016/j.gpb.2023.03.002","DOIUrl":"10.1016/j.gpb.2023.03.002","url":null,"abstract":"<p><p>In perinatal medicine, intrauterine growth restriction (IUGR) is one of the greatest challenges. The etiology of IUGR is multifactorial, but most cases are thought to arise from placental insufficiency. However, identifying the placental cause of IUGR can be difficult due to numerous confounding factors. Selective IUGR (sIUGR) would be a good model to investigate how impaired placentation affects fetal development, as the growth discordance between monochorionic twins cannot be explained by confounding genetic or maternal factors. Herein, we constructed and analyzed the placental proteomic profiles of IUGR twins and normal cotwins. Specifically, we identified a total of 5481 proteins, of which 233 were differentially expressed (57 up-regulated and 176 down-regulated) in IUGR twins. Bioinformatics analysis indicates that these differentially expressed proteins (DEPs) are mainly associated with cardiovascular system development and function, organismal survival, and organismal development. Notably, 34 DEPs are significantly enriched in angiogenesis, and diminished placental angiogenesis in IUGR twins has been further elaborately confirmed. Moreover, we found decreased expression of metadherin (MTDH) in the placentas of IUGR twins and demonstrated that MTDH contributes to placental angiogenesis and fetal growth in vitro. Collectively, our findings reveal the comprehensive proteomic signatures of placentas for sIUGR twins, and the DEPs identified may provide in-depth insights into the pathogenesis of placental dysfunction and subsequent impaired fetal growth.</p>","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":" ","pages":"1246-1259"},"PeriodicalIF":11.5,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11082409/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9375205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comprehensive Characterization and Global Transcriptome Analysis of Human Fetal Liver Terminal Erythropoiesis. 人类胎儿肝脏末期红细胞生成的全面特征描述和全局转录组分析
IF 11.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2023-12-01 Epub Date: 2023-08-30 DOI: 10.1016/j.gpb.2023.07.001
Yongshuai Han, Shihui Wang, Yaomei Wang, Yumin Huang, Chengjie Gao, Xinhua Guo, Lixiang Chen, Huizhi Zhao, Xiuli An

The fetal liver (FL) is the key erythropoietic organ during fetal development, but knowledge on human FL erythropoiesis is very limited. In this study, we sorted primary erythroblasts from FL cells and performed RNA sequencing (RNA-seq) analyses. We found that temporal gene expression patterns reflected changes in function during primary human FL terminal erythropoiesis. Notably, the expression of genes enriched in proteolysis and autophagy was up-regulated in orthochromatic erythroblasts (OrthoEs), suggesting the involvement of these pathways in enucleation. We also performed RNA-seq of in vitro cultured erythroblasts derived from FL CD34+ cells. Comparison of transcriptomes between the primary and cultured erythroblasts revealed significant differences, indicating impacts of the culture system on gene expression. Notably, the expression of lipid metabolism-related genes was increased in cultured erythroblasts. We further immortalized erythroid cell lines from FL and cord blood (CB) CD34+ cells (FL-iEry and CB-iEry, respectively). FL-iEry and CB-iEry were immortalized at the proerythroblast stage and can be induced to differentiate into OrthoEs, but their enucleation ability was very low. Comparison of the transcriptomes between OrthoEs with and without enucleation capability revealed the down-regulation of pathways involved in chromatin organization and mitophagy in OrthoEs without enucleation capacity, indicating that defects in chromatin organization and mitophagy contribute to the inability of OrthoEs to enucleate. Additionally, the expression of HBE1, HBZ, and HBG2 was up-regulated in FL-iEry compared with CB-iEry, and such up-regulation was accompanied by down-regulated expression of BCL11A and up-regulated expression of LIN28B and IGF2BP1. Our study provides new insights into human FL erythropoiesis and rich resources for future studies.

胎儿肝脏(FL)是胎儿发育过程中关键的红细胞生成器官,但有关人类FL红细胞生成的知识却非常有限。在这项研究中,我们从FL细胞中分拣出原代红细胞,并进行了RNA测序(RNA-seq)分析。我们发现,时间基因表达模式反映了原代人类 FL 终末红细胞生成过程中的功能变化。值得注意的是,在正色红细胞(OrthoEs)中,富含蛋白分解和自噬的基因表达上调,这表明这些通路参与了去核过程。我们还对从FL CD34+细胞中提取的体外培养红细胞进行了RNA-seq分析。比较原代红细胞和培养红细胞的转录组发现了显著差异,表明培养系统对基因表达有影响。值得注意的是,脂质代谢相关基因的表达在培养红细胞中有所增加。我们进一步从 FL 和脐带血(CB)CD34+细胞中永生化了红细胞系(分别为 FL-iEry 和 CB-iEry)。FL-iEry和CB-iEry在原红细胞阶段永生,可诱导分化成OrthoEs,但它们的去核能力很低。比较具有和不具有去核能力的OrthoEs的转录组发现,在不具有去核能力的OrthoEs中,参与染色质组织和有丝分裂的通路下调,这表明染色质组织和有丝分裂的缺陷导致OrthoEs不能去核。此外,与 CB-iEry 相比,FL-iEry 中 HBE1、HBZ 和 HBG2 的表达上调,这种上调伴随着 BCL11A 的表达下调以及 LIN28B 和 IGF2BP1 的表达上调。我们的研究为人类 FL 红细胞生成提供了新的见解,并为今后的研究提供了丰富的资源。
{"title":"Comprehensive Characterization and Global Transcriptome Analysis of Human Fetal Liver Terminal Erythropoiesis.","authors":"Yongshuai Han, Shihui Wang, Yaomei Wang, Yumin Huang, Chengjie Gao, Xinhua Guo, Lixiang Chen, Huizhi Zhao, Xiuli An","doi":"10.1016/j.gpb.2023.07.001","DOIUrl":"10.1016/j.gpb.2023.07.001","url":null,"abstract":"<p><p>The fetal liver (FL) is the key erythropoietic organ during fetal development, but knowledge on human FL erythropoiesis is very limited. In this study, we sorted primary erythroblasts from FL cells and performed RNA sequencing (RNA-seq) analyses. We found that temporal gene expression patterns reflected changes in function during primary human FL terminal erythropoiesis. Notably, the expression of genes enriched in proteolysis and autophagy was up-regulated in orthochromatic erythroblasts (OrthoEs), suggesting the involvement of these pathways in enucleation. We also performed RNA-seq of in vitro cultured erythroblasts derived from FL CD34<sup>+</sup> cells. Comparison of transcriptomes between the primary and cultured erythroblasts revealed significant differences, indicating impacts of the culture system on gene expression. Notably, the expression of lipid metabolism-related genes was increased in cultured erythroblasts. We further immortalized erythroid cell lines from FL and cord blood (CB) CD34<sup>+</sup> cells (FL-iEry and CB-iEry, respectively). FL-iEry and CB-iEry were immortalized at the proerythroblast stage and can be induced to differentiate into OrthoEs, but their enucleation ability was very low. Comparison of the transcriptomes between OrthoEs with and without enucleation capability revealed the down-regulation of pathways involved in chromatin organization and mitophagy in OrthoEs without enucleation capacity, indicating that defects in chromatin organization and mitophagy contribute to the inability of OrthoEs to enucleate. Additionally, the expression of HBE1, HBZ, and HBG2 was up-regulated in FL-iEry compared with CB-iEry, and such up-regulation was accompanied by down-regulated expression of BCL11A and up-regulated expression of LIN28B and IGF2BP1. Our study provides new insights into human FL erythropoiesis and rich resources for future studies.</p>","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":" ","pages":"1117-1132"},"PeriodicalIF":11.5,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11082260/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10135733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Superior Fidelity and Distinct Editing Outcomes of SaCas9 Compared with SpCas9 in Genome Editing. 与 SpCas9 相比,SaCas9 在基因组编辑中具有更高的保真度和不同的编辑结果。
IF 11.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2023-12-01 Epub Date: 2022-12-20 DOI: 10.1016/j.gpb.2022.12.003
Zhi-Xue Yang, Ya-Wen Fu, Juan-Juan Zhao, Feng Zhang, Si-Ang Li, Mei Zhao, Wei Wen, Lei Zhang, Tao Cheng, Jian-Ping Zhang, Xiao-Bing Zhang

A series of clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR associated protein 9 (Cas9) systems have been engineered for genome editing. The most widely used Cas9 is SpCas9 from Streptococcus pyogenes and SaCas9 from Staphylococcus aureus. However, a comparison of their detailed gene editing outcomes is still lacking. By characterizing the editing outcomes of 11 sites in human induced pluripotent stem cells (iPSCs) and K562 cells, we found that SaCas9 could edit the genome with greater efficiencies than SpCas9. We also compared the effects of spacer lengths of single-guide RNAs (sgRNAs; 18-21 nt for SpCas9 and 19-23 nt for SaCas9) and found that the optimal spacer lengths were 20 nt and 21 nt for SpCas9 and SaCas9, respectively. However, the optimal spacer length for a particular sgRNA was 18-21 nt for SpCas9 and 21-22 nt for SaCas9. Furthermore, SpCas9 exhibited a more substantial bias than SaCas9 for nonhomologous end-joining (NHEJ) +1 insertion at the fourth nucleotide upstream of the protospacer adjacent motif (PAM), indicating a characteristic of a staggered cut. Accordingly, editing with SaCas9 led to higher efficiencies of NHEJ-mediated double-stranded oligodeoxynucleotide (dsODN) insertion or homology-directed repair (HDR)-mediated adeno-associated virus serotype 6 (AAV6) donor knock-in. Finally, GUIDE-seq analysis revealed that SaCas9 exhibited significantly reduced off-target effects compared with SpCas9. Our work indicates the superior performance of SaCas9 to SpCas9 in transgene integration-based therapeutic gene editing and the necessity to identify the optimal spacer length to achieve desired editing results.

目前已设计出一系列用于基因组编辑的簇状规则间隔短回文重复序列(CRISPR)-CRISPR 相关蛋白 9(Cas9)系统。使用最广泛的 Cas9 是来自化脓性链球菌的 SpCas9 和来自金黄色葡萄球菌的 SaCas9。然而,目前仍缺乏对其基因编辑结果的详细比较。通过分析人类诱导多能干细胞(iPSC)和 K562 细胞中 11 个位点的编辑结果,我们发现 SaCas9 比 SpCas9 能更有效地编辑基因组。我们还比较了单导RNA(sgRNA,SpCas9为18-21 nt,SaCas9为19-23 nt)的间隔长度,发现SpCas9和SaCas9的最佳间隔长度分别为20 nt和21 nt。然而,对于 SpCas9 和 SaCas9 而言,特定引导 RNA 的最佳间隔长度分别为 18-21 nt 或 21-22 nt。此外,与 SaCas9 相比,SpCas9 在非同源末端连接(NHEJ)+1 插入原间隔邻接基序(PAM)上游第四个核苷酸时表现出更大的偏差,这是交错切割的特征。因此,用SaCas9进行编辑可提高NHEJ介导的双链寡核苷酸(dsODN)插入或腺病毒血清型6(AAV6)供体介导的同源定向修复(HDR)的基因敲入效率。最后,GUIDE-seq分析显示,与SpCas9相比,SaCas9的脱靶效应明显降低。我们的工作表明,在基于转基因整合的治疗性基因编辑中,SaCas9的性能优于SpCas9,而且有必要确定最佳间隔长度,以实现理想的编辑效果。
{"title":"Superior Fidelity and Distinct Editing Outcomes of SaCas9 Compared with SpCas9 in Genome Editing.","authors":"Zhi-Xue Yang, Ya-Wen Fu, Juan-Juan Zhao, Feng Zhang, Si-Ang Li, Mei Zhao, Wei Wen, Lei Zhang, Tao Cheng, Jian-Ping Zhang, Xiao-Bing Zhang","doi":"10.1016/j.gpb.2022.12.003","DOIUrl":"10.1016/j.gpb.2022.12.003","url":null,"abstract":"<p><p>A series of clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR associated protein 9 (Cas9) systems have been engineered for genome editing. The most widely used Cas9 is SpCas9 from Streptococcus pyogenes and SaCas9 from Staphylococcus aureus. However, a comparison of their detailed gene editing outcomes is still lacking. By characterizing the editing outcomes of 11 sites in human induced pluripotent stem cells (iPSCs) and K562 cells, we found that SaCas9 could edit the genome with greater efficiencies than SpCas9. We also compared the effects of spacer lengths of single-guide RNAs (sgRNAs; 18-21 nt for SpCas9 and 19-23 nt for SaCas9) and found that the optimal spacer lengths were 20 nt and 21 nt for SpCas9 and SaCas9, respectively. However, the optimal spacer length for a particular sgRNA was 18-21 nt for SpCas9 and 21-22 nt for SaCas9. Furthermore, SpCas9 exhibited a more substantial bias than SaCas9 for nonhomologous end-joining (NHEJ) +1 insertion at the fourth nucleotide upstream of the protospacer adjacent motif (PAM), indicating a characteristic of a staggered cut. Accordingly, editing with SaCas9 led to higher efficiencies of NHEJ-mediated double-stranded oligodeoxynucleotide (dsODN) insertion or homology-directed repair (HDR)-mediated adeno-associated virus serotype 6 (AAV6) donor knock-in. Finally, GUIDE-seq analysis revealed that SaCas9 exhibited significantly reduced off-target effects compared with SpCas9. Our work indicates the superior performance of SaCas9 to SpCas9 in transgene integration-based therapeutic gene editing and the necessity to identify the optimal spacer length to achieve desired editing results.</p>","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":" ","pages":"1206-1220"},"PeriodicalIF":11.5,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11082263/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10419418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Novel Time-dependent Multi-omics Integration in Sepsis-associated Liver Dysfunction. 脓毒症相关肝功能障碍中的新型时间依赖性多组学整合技术
IF 11.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2023-12-01 Epub Date: 2023-04-20 DOI: 10.1016/j.gpb.2023.04.002
Ann-Yae Na, Hyojin Lee, Eun Ki Min, Sanjita Paudel, So Young Choi, HyunChae Sim, Kwang-Hyeon Liu, Ki-Tae Kim, Jong-Sup Bae, Sangkyu Lee

The recently developed technologies that allow the analysis of each single omics have provided an unbiased insight into ongoing disease processes. However, it remains challenging to specify the study design for the subsequent integration strategies that can associate sepsis pathophysiology and clinical outcomes. Here, we conducted a time-dependent multi-omics integration (TDMI) in a sepsis-associated liver dysfunction (SALD) model. We successfully deduced the relation of the Toll-like receptor 4 (TLR4) pathway with SALD. Although TLR4 is a critical factor in sepsis progression, it is not specified in single-omics analyses but only in the TDMI analysis. This finding indicates that the TDMI-based approach is more advantageous than single-omics analyses in terms of exploring the underlying pathophysiological mechanism of SALD. Furthermore, TDMI-based approach can be an ideal paradigm for insightful biological interpretations of multi-omics datasets that will potentially reveal novel insights into basic biology, health, and diseases, thus allowing the identification of promising candidates for therapeutic strategies.

最近开发的技术可以对每一个单一的全息图进行分析,从而提供了对正在发生的疾病过程的无偏见的洞察力。然而,为后续整合策略指定研究设计以将脓毒症病理生理学和临床结果联系起来仍具有挑战性。在这里,我们在脓毒症相关肝功能异常(SALD)模型中进行了时间依赖性多组学整合(TDMI)。我们成功地推断出了toll样受体4(TLR4)通路与SALD的关系。虽然 TLR4 是脓毒症进展的关键因素,但它在单体组学结果中并不明确,而仅在 TDMI 分析中明确。这一结果表明,在探索这种疾病的潜在病理生理机制方面,基于 TDMI 的方法比单组学分析更具优势。此外,这种方法还是对多组学数据集进行深入生物学解读的理想范例,有可能揭示基础生物学、健康和疾病方面的新见解,从而确定有希望的候选治疗策略。
{"title":"Novel Time-dependent Multi-omics Integration in Sepsis-associated Liver Dysfunction.","authors":"Ann-Yae Na, Hyojin Lee, Eun Ki Min, Sanjita Paudel, So Young Choi, HyunChae Sim, Kwang-Hyeon Liu, Ki-Tae Kim, Jong-Sup Bae, Sangkyu Lee","doi":"10.1016/j.gpb.2023.04.002","DOIUrl":"10.1016/j.gpb.2023.04.002","url":null,"abstract":"<p><p>The recently developed technologies that allow the analysis of each single omics have provided an unbiased insight into ongoing disease processes. However, it remains challenging to specify the study design for the subsequent integration strategies that can associate sepsis pathophysiology and clinical outcomes. Here, we conducted a time-dependent multi-omics integration (TDMI) in a sepsis-associated liver dysfunction (SALD) model. We successfully deduced the relation of the Toll-like receptor 4 (TLR4) pathway with SALD. Although TLR4 is a critical factor in sepsis progression, it is not specified in single-omics analyses but only in the TDMI analysis. This finding indicates that the TDMI-based approach is more advantageous than single-omics analyses in terms of exploring the underlying pathophysiological mechanism of SALD. Furthermore, TDMI-based approach can be an ideal paradigm for insightful biological interpretations of multi-omics datasets that will potentially reveal novel insights into basic biology, health, and diseases, thus allowing the identification of promising candidates for therapeutic strategies.</p>","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":" ","pages":"1101-1116"},"PeriodicalIF":11.5,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11082264/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9422024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Omics Views of Mechanisms for Cell Fate Determination in Early Mammalian Development. Omics 对哺乳动物早期发育中细胞命运决定机制的看法。
IF 11.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2023-10-01 Epub Date: 2023-04-17 DOI: 10.1016/j.gpb.2023.03.001
Lin-Fang Ju, Heng-Ji Xu, Yun-Gui Yang, Ying Yang

During mammalian preimplantation development, a totipotent zygote undergoes several cell cleavages and two rounds of cell fate determination, ultimately forming a mature blastocyst. Along with compaction, the establishment of apicobasal cell polarity breaks the symmetry of an embryo and guides subsequent cell fate choice. Although the lineage segregation of the inner cell mass (ICM) and trophectoderm (TE) is the first symbol of cell differentiation, several molecules have been shown to bias the early cell fate through their inter-cellular variations at much earlier stages, including the 2- and 4-cell stages. The underlying mechanisms of early cell fate determination have long been an important research topic. In this review, we summarize the molecular events that occur during early embryogenesis, as well as the current understanding of their regulatory roles in cell fate decisions. Moreover, as powerful tools for early embryogenesis research, single-cell omics techniques have been applied to both mouse and human preimplantation embryos and have contributed to the discovery of cell fate regulators. Here, we summarize their applications in the research of preimplantation embryos, and provide new insights and perspectives on cell fate regulation.

在哺乳动物胚胎植入前的发育过程中,一个全能的合子要经历数次细胞裂解和两轮细胞命运决定,最终形成一个成熟的囊胚。在压实的同时,顶基底细胞极性的建立打破了胚胎的对称性,并指导后续的细胞命运选择。虽然内细胞团(ICM)和滋养外胚层(TE)的细胞系分离是细胞分化的第一个标志,但在更早的阶段,包括 2 细胞和 4 细胞阶段,有几种分子已被证明能通过其细胞间的变化偏向早期细胞命运。长期以来,早期细胞命运决定的内在机制一直是一个重要的研究课题。在这篇综述中,我们总结了早期胚胎发生过程中发生的分子事件,以及目前对它们在细胞命运决定中的调控作用的理解。此外,作为早期胚胎发生研究的有力工具,单细胞全息技术已被应用于小鼠和人类植入前胚胎,并为发现细胞命运调控因子做出了贡献。在此,我们总结了这些技术在植入前胚胎研究中的应用,并为细胞命运调控提供了新的见解和视角。
{"title":"Omics Views of Mechanisms for Cell Fate Determination in Early Mammalian Development.","authors":"Lin-Fang Ju, Heng-Ji Xu, Yun-Gui Yang, Ying Yang","doi":"10.1016/j.gpb.2023.03.001","DOIUrl":"10.1016/j.gpb.2023.03.001","url":null,"abstract":"<p><p>During mammalian preimplantation development, a totipotent zygote undergoes several cell cleavages and two rounds of cell fate determination, ultimately forming a mature blastocyst. Along with compaction, the establishment of apicobasal cell polarity breaks the symmetry of an embryo and guides subsequent cell fate choice. Although the lineage segregation of the inner cell mass (ICM) and trophectoderm (TE) is the first symbol of cell differentiation, several molecules have been shown to bias the early cell fate through their inter-cellular variations at much earlier stages, including the 2- and 4-cell stages. The underlying mechanisms of early cell fate determination have long been an important research topic. In this review, we summarize the molecular events that occur during early embryogenesis, as well as the current understanding of their regulatory roles in cell fate decisions. Moreover, as powerful tools for early embryogenesis research, single-cell omics techniques have been applied to both mouse and human preimplantation embryos and have contributed to the discovery of cell fate regulators. Here, we summarize their applications in the research of preimplantation embryos, and provide new insights and perspectives on cell fate regulation.</p>","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":" ","pages":"950-961"},"PeriodicalIF":11.5,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10928378/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10101436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Revolutionizing Antibody Discovery: An Innovative AI Model for Generating Robust Libraries. 革命性的抗体发现:一种用于生成鲁棒库的创新AI模型。
IF 11.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2023-10-01 Epub Date: 2023-06-22 DOI: 10.1016/j.gpb.2023.06.001
Yaojun Wang, Shiwei Sun
{"title":"Revolutionizing Antibody Discovery: An Innovative AI Model for Generating Robust Libraries.","authors":"Yaojun Wang, Shiwei Sun","doi":"10.1016/j.gpb.2023.06.001","DOIUrl":"10.1016/j.gpb.2023.06.001","url":null,"abstract":"","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":" ","pages":"910-912"},"PeriodicalIF":11.5,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10928364/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9671806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Patient Assessment and Therapy Planning Based on Homologous Recombination Repair Deficiency. 基于同源重组修复缺陷的患者评估和治疗计划
IF 11.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2023-10-01 Epub Date: 2023-02-14 DOI: 10.1016/j.gpb.2023.02.004
Wenbin Li, Lin Gao, Xin Yi, Shuangfeng Shi, Jie Huang, Leming Shi, Xiaoyan Zhou, Lingying Wu, Jianming Ying

Defects in genes involved in the DNA damage response cause homologous recombination repair deficiency (HRD). HRD is found in a subgroup of cancer patients for several tumor types, and it has a clinical relevance to cancer prevention and therapies. Accumulating evidence has identified HRD as a biomarker for assessing the therapeutic response of tumor cells to poly(ADP-ribose) polymerase inhibitors and platinum-based chemotherapies. Nevertheless, the biology of HRD is complex, and its applications and the benefits of different HRD biomarker assays are controversial. This is primarily due to inconsistencies in HRD assessments and definitions (gene-level tests, genomic scars, mutational signatures, or a combination of these methods) and difficulties in assessing the contribution of each genomic event. Therefore, we aim to review the biological rationale and clinical evidence of HRD as a biomarker. This review provides a blueprint for the standardization and harmonization of HRD assessments.

参与 DNA 损伤反应的基因缺陷会导致同源重组修复缺陷(HRD)。HRD在多种肿瘤类型的癌症患者中都有发现,在癌症预防和治疗中具有临床意义。越来越多的证据表明,HRD 是评估肿瘤细胞对聚(ADP 核糖)聚合酶抑制剂和铂类化疗药物治疗反应的生物标志物。然而,HRD 的生物学特性十分复杂,其应用和不同 HRD 生物标志物检测方法的益处也存在争议。这主要是由于 HRD 评估和定义(基因水平测试、基因组疤痕、突变特征或这些方法的组合)不一致,以及难以评估每个基因组事件的贡献。因此,我们旨在回顾将 HRD 作为生物标记物的生物学原理和临床证据。这篇综述为 HRD 评估的标准化和统一化提供了一个蓝图。
{"title":"Patient Assessment and Therapy Planning Based on Homologous Recombination Repair Deficiency.","authors":"Wenbin Li, Lin Gao, Xin Yi, Shuangfeng Shi, Jie Huang, Leming Shi, Xiaoyan Zhou, Lingying Wu, Jianming Ying","doi":"10.1016/j.gpb.2023.02.004","DOIUrl":"10.1016/j.gpb.2023.02.004","url":null,"abstract":"<p><p>Defects in genes involved in the DNA damage response cause homologous recombination repair deficiency (HRD). HRD is found in a subgroup of cancer patients for several tumor types, and it has a clinical relevance to cancer prevention and therapies. Accumulating evidence has identified HRD as a biomarker for assessing the therapeutic response of tumor cells to poly(ADP-ribose) polymerase inhibitors and platinum-based chemotherapies. Nevertheless, the biology of HRD is complex, and its applications and the benefits of different HRD biomarker assays are controversial. This is primarily due to inconsistencies in HRD assessments and definitions (gene-level tests, genomic scars, mutational signatures, or a combination of these methods) and difficulties in assessing the contribution of each genomic event. Therefore, we aim to review the biological rationale and clinical evidence of HRD as a biomarker. This review provides a blueprint for the standardization and harmonization of HRD assessments.</p>","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":" ","pages":"962-975"},"PeriodicalIF":11.5,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10928375/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10737665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AB-Gen: Antibody Library Design with Generative Pre-trained Transformer and Deep Reinforcement Learning. AB-Gen:利用生成式预训练变换器和深度强化学习设计抗体库。
IF 11.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2023-10-01 Epub Date: 2023-06-24 DOI: 10.1016/j.gpb.2023.03.004
Xiaopeng Xu, Tiantian Xu, Juexiao Zhou, Xingyu Liao, Ruochi Zhang, Yu Wang, Lu Zhang, Xin Gao

Antibody leads must fulfill multiple desirable properties to be clinical candidates. Primarily due to the low throughput in the experimental procedure, the need for such multi-property optimization causes the bottleneck in preclinical antibody discovery and development, because addressing one issue usually causes another. We developed a reinforcement learning (RL) method, named AB-Gen, for antibody library design using a generative pre-trained transformer (GPT) as the policy network of the RL agent. We showed that this model can learn the antibody space of heavy chain complementarity determining region 3 (CDRH3) and generate sequences with similar property distributions. Besides, when using human epidermal growth factor receptor-2 (HER2) as the target, the agent model of AB-Gen was able to generate novel CDRH3 sequences that fulfill multi-property constraints. Totally, 509 generated sequences were able to pass all property filters, and three highly conserved residues were identified. The importance of these residues was further demonstrated by molecular dynamics simulations, consolidating that the agent model was capable of grasping important information in this complex optimization task. Overall, the AB-Gen method is able to design novel antibody sequences with an improved success rate than the traditional propose-then-filter approach. It has the potential to be used in practical antibody design, thus empowering the antibody discovery and development process. The source code of AB-Gen is freely available at Zenodo (https://doi.org/10.5281/zenodo.7657016) and BioCode (https://ngdc.cncb.ac.cn/biocode/tools/BT007341).

抗体先导物必须满足多种理想特性才能成为临床候选物。主要由于实验过程的吞吐量较低,这种多属性优化的需求造成了临床前抗体发现和开发的瓶颈,因为解决一个问题通常会引发另一个问题。我们开发了一种用于抗体库设计的强化学习(RL)方法,命名为 AB-Gen,使用生成式预训练变换器(GPT)作为 RL 代理的策略网络。我们的研究表明,该模型可以学习重链互补决定区 3(CDRH3)的抗体空间,并生成具有相似性质分布的序列。此外,当使用人表皮生长因子受体-2(HER2)作为靶点时,AB-Gen 的代理模型能够生成满足多属性约束的新型 CDRH3 序列。总共有 509 个生成的序列能够通过所有属性筛选,并确定了三个高度保守的残基。分子动力学模拟进一步证明了这些残基的重要性,从而巩固了代理模型能够在这项复杂的优化任务中掌握重要信息。总之,与传统的 "提出-然后过滤 "方法相比,AB-Gen 方法能够提高设计新型抗体序列的成功率。它有望用于实际的抗体设计,从而促进抗体的发现和开发过程。AB-Gen 的源代码可在 Zenodo (https://doi.org/10.5281/zenodo.7657016) 和 BioCode (https://ngdc.cncb.ac.cn/biocode/tools/BT007341) 免费获取。
{"title":"AB-Gen: Antibody Library Design with Generative Pre-trained Transformer and Deep Reinforcement Learning.","authors":"Xiaopeng Xu, Tiantian Xu, Juexiao Zhou, Xingyu Liao, Ruochi Zhang, Yu Wang, Lu Zhang, Xin Gao","doi":"10.1016/j.gpb.2023.03.004","DOIUrl":"10.1016/j.gpb.2023.03.004","url":null,"abstract":"<p><p>Antibody leads must fulfill multiple desirable properties to be clinical candidates. Primarily due to the low throughput in the experimental procedure, the need for such multi-property optimization causes the bottleneck in preclinical antibody discovery and development, because addressing one issue usually causes another. We developed a reinforcement learning (RL) method, named AB-Gen, for antibody library design using a generative pre-trained transformer (GPT) as the policy network of the RL agent. We showed that this model can learn the antibody space of heavy chain complementarity determining region 3 (CDRH3) and generate sequences with similar property distributions. Besides, when using human epidermal growth factor receptor-2 (HER2) as the target, the agent model of AB-Gen was able to generate novel CDRH3 sequences that fulfill multi-property constraints. Totally, 509 generated sequences were able to pass all property filters, and three highly conserved residues were identified. The importance of these residues was further demonstrated by molecular dynamics simulations, consolidating that the agent model was capable of grasping important information in this complex optimization task. Overall, the AB-Gen method is able to design novel antibody sequences with an improved success rate than the traditional propose-then-filter approach. It has the potential to be used in practical antibody design, thus empowering the antibody discovery and development process. The source code of AB-Gen is freely available at Zenodo (https://doi.org/10.5281/zenodo.7657016) and BioCode (https://ngdc.cncb.ac.cn/biocode/tools/BT007341).</p>","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":" ","pages":"1043-1053"},"PeriodicalIF":11.5,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10928431/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10045398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms. 蛋白质结构预测:挑战、进展和研究范式的转变。
IF 11.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2023-10-01 Epub Date: 2023-03-30 DOI: 10.1016/j.gpb.2022.11.014
Bin Huang, Lupeng Kong, Chao Wang, Fusong Ju, Qi Zhang, Jianwei Zhu, Tiansu Gong, Haicang Zhang, Chungong Yu, Wei-Mou Zheng, Dongbo Bu

Protein structure prediction is an interdisciplinary research topic that has attracted researchers from multiple fields, including biochemistry, medicine, physics, mathematics, and computer science. These researchers adopt various research paradigms to attack the same structure prediction problem: biochemists and physicists attempt to reveal the principles governing protein folding; mathematicians, especially statisticians, usually start from assuming a probability distribution of protein structures given a target sequence and then find the most likely structure, while computer scientists formulate protein structure prediction as an optimization problem - finding the structural conformation with the lowest energy or minimizing the difference between predicted structure and native structure. These research paradigms fall into the two statistical modeling cultures proposed by Leo Breiman, namely, data modeling and algorithmic modeling. Recently, we have also witnessed the great success of deep learning in protein structure prediction. In this review, we present a survey of the efforts for protein structure prediction. We compare the research paradigms adopted by researchers from different fields, with an emphasis on the shift of research paradigms in the era of deep learning. In short, the algorithmic modeling techniques, especially deep neural networks, have considerably improved the accuracy of protein structure prediction; however, theories interpreting the neural networks and knowledge on protein folding are still highly desired.

蛋白质结构预测是一个跨学科研究课题,吸引了来自生物化学、医学、物理学、数学和计算机科学等多个领域的研究人员。这些研究人员采用不同的研究范式来解决相同的结构预测问题:生物化学家和物理学家试图揭示蛋白质折叠的原理;数学家,尤其是统计学家,通常从假设目标序列中蛋白质结构的概率分布出发,然后找出最可能的结构;而计算机科学家则将蛋白质结构预测表述为一个优化问题--寻找能量最低的结构构象,或将预测结构与原生结构之间的差异最小化。这些研究范式属于 L. Breiman 提出的两种统计建模文化,即数据建模和算法建模。最近,我们也见证了深度学习在蛋白质结构预测方面的巨大成功。在这篇综述中,我们对蛋白质结构预测方面的工作进行了调查。我们比较了不同领域研究人员所采用的研究范式,重点关注深度学习时代研究范式的转变。总之,算法建模技术,尤其是深度神经网络,大大提高了蛋白质结构预测的准确性;然而,解释神经网络的理论和蛋白质折叠方面的知识仍是亟待解决的问题。
{"title":"Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms.","authors":"Bin Huang, Lupeng Kong, Chao Wang, Fusong Ju, Qi Zhang, Jianwei Zhu, Tiansu Gong, Haicang Zhang, Chungong Yu, Wei-Mou Zheng, Dongbo Bu","doi":"10.1016/j.gpb.2022.11.014","DOIUrl":"10.1016/j.gpb.2022.11.014","url":null,"abstract":"<p><p>Protein structure prediction is an interdisciplinary research topic that has attracted researchers from multiple fields, including biochemistry, medicine, physics, mathematics, and computer science. These researchers adopt various research paradigms to attack the same structure prediction problem: biochemists and physicists attempt to reveal the principles governing protein folding; mathematicians, especially statisticians, usually start from assuming a probability distribution of protein structures given a target sequence and then find the most likely structure, while computer scientists formulate protein structure prediction as an optimization problem - finding the structural conformation with the lowest energy or minimizing the difference between predicted structure and native structure. These research paradigms fall into the two statistical modeling cultures proposed by Leo Breiman, namely, data modeling and algorithmic modeling. Recently, we have also witnessed the great success of deep learning in protein structure prediction. In this review, we present a survey of the efforts for protein structure prediction. We compare the research paradigms adopted by researchers from different fields, with an emphasis on the shift of research paradigms in the era of deep learning. In short, the algorithmic modeling techniques, especially deep neural networks, have considerably improved the accuracy of protein structure prediction; however, theories interpreting the neural networks and knowledge on protein folding are still highly desired.</p>","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":" ","pages":"913-925"},"PeriodicalIF":11.5,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10928435/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9593946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genomics, Proteomics & Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1