首页 > 最新文献

BMC Genomics最新文献

英文 中文
Alternative splicing perspective to prey preference of environmentally friendly biological agent Cryptolaemus montrouzieri. 从替代剪接角度看环境友好型生物病原体隐翅虫对猎物的偏好。
IF 3.5 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-10-15 DOI: 10.1186/s12864-024-10870-6
Yuqi Liu, Xinhui Xia, Wenxu Ren, Xiyao Hong, Xuefei Tang, Hong Pang, Yuchen Yang

Background: Cryptolaemus montrouzieri (Coccinellidae) is widely utilized as biological control agents in modern agriculture. A comprehensive understanding of its food preference can help guide mass rearing and safety management during field application of pest control. Although some studies have paid attentions to the impacts of prey shift on C. montrouzieri, little is known regarding the role of post-transcriptional regulations in its acclimation to unnatural preys.

Results: We performed a genome-wide investigation on alternative splicing dynamics in C. montrouzieri in response to the predation transition from natural prey to unnatural ones. When feeding on undesired diets, 402-764 genes were differentially alternative spliced in C. montrouzieri. It is noteworthy that the majority of these genes (> 87%) were not differentially expressed, and these differentially spliced genes regulated distinct biological processes from differentially expressed genes, such as organ development and morphogenesis, locomotory behavior, and homeostasis processes. These suggested the functionally nonredendant role of alternative splicing in modulating physiological and metabolic responses of C. montrouzieri to the shift to undesired preys. In addition, the individuals feeding on aphids were subject to a lower level of changes in splicing than other alternative diets, which might be because of the similar chemical and microbial compositions. Our study further suggested a putative coupling of alternative splicing and nonsense-mediated decay (AS-NMD), which may play an important role in fine-tuning the protein repertoire of C. montrouzieri, and promoting its acclimation to predation changes.

Conclusion: These findings highlight the key role of alternative splicing in modulating the acclimation of ladybirds to prey shift and provide new genetic clues for the future application of ladybirds in biocontrol.

背景:隐翅虫(Coccinellidae)在现代农业中被广泛用作生物防治剂。全面了解隐翅虫的食物偏好有助于指导大规模饲养和田间害虫防治过程中的安全管理。虽然一些研究关注了猎物转移对 C. montrouzieri 的影响,但对转录后调控在其适应非自然猎物过程中的作用却知之甚少:结果:我们在全基因组范围内调查了 C. montrouzieri 在从天然猎物向非天然猎物捕食转变过程中的替代剪接动态。当捕食不受欢迎的食物时,C. montrouzieri体内有402-764个基因发生了不同程度的替代剪接。值得注意的是,这些基因中的大部分(> 87%)没有差异表达,而且这些差异剪接基因调控着与差异表达基因不同的生物学过程,如器官发育和形态发生、运动行为和体内平衡过程。这些结果表明,替代剪接在调节 C. montrouzieri 转食不受欢迎的猎物时的生理和新陈代谢反应中起着功能上不可替代的作用。此外,与其他替代食物相比,以蚜虫为食的个体受到的剪接变化程度较低,这可能是因为它们的化学成分和微生物成分相似。我们的研究进一步表明,替代剪接和无义介导衰变(AS-NMD)之间可能存在一种假定的耦合关系,这种耦合关系可能在微调C. montrouzieri的蛋白质库、促进其适应捕食变化方面发挥重要作用:这些发现凸显了替代剪接在调节瓢虫对猎物变化的适应性中的关键作用,并为未来瓢虫在生物防治中的应用提供了新的遗传线索。
{"title":"Alternative splicing perspective to prey preference of environmentally friendly biological agent Cryptolaemus montrouzieri.","authors":"Yuqi Liu, Xinhui Xia, Wenxu Ren, Xiyao Hong, Xuefei Tang, Hong Pang, Yuchen Yang","doi":"10.1186/s12864-024-10870-6","DOIUrl":"https://doi.org/10.1186/s12864-024-10870-6","url":null,"abstract":"<p><strong>Background: </strong>Cryptolaemus montrouzieri (Coccinellidae) is widely utilized as biological control agents in modern agriculture. A comprehensive understanding of its food preference can help guide mass rearing and safety management during field application of pest control. Although some studies have paid attentions to the impacts of prey shift on C. montrouzieri, little is known regarding the role of post-transcriptional regulations in its acclimation to unnatural preys.</p><p><strong>Results: </strong>We performed a genome-wide investigation on alternative splicing dynamics in C. montrouzieri in response to the predation transition from natural prey to unnatural ones. When feeding on undesired diets, 402-764 genes were differentially alternative spliced in C. montrouzieri. It is noteworthy that the majority of these genes (> 87%) were not differentially expressed, and these differentially spliced genes regulated distinct biological processes from differentially expressed genes, such as organ development and morphogenesis, locomotory behavior, and homeostasis processes. These suggested the functionally nonredendant role of alternative splicing in modulating physiological and metabolic responses of C. montrouzieri to the shift to undesired preys. In addition, the individuals feeding on aphids were subject to a lower level of changes in splicing than other alternative diets, which might be because of the similar chemical and microbial compositions. Our study further suggested a putative coupling of alternative splicing and nonsense-mediated decay (AS-NMD), which may play an important role in fine-tuning the protein repertoire of C. montrouzieri, and promoting its acclimation to predation changes.</p><p><strong>Conclusion: </strong>These findings highlight the key role of alternative splicing in modulating the acclimation of ladybirds to prey shift and provide new genetic clues for the future application of ladybirds in biocontrol.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11481726/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142457450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comprehensive identification of GASA genes in sunflower and expression profiling in response to drought. 向日葵中 GASA 基因的全面鉴定以及对干旱反应的表达谱分析。
IF 3.5 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-10-14 DOI: 10.1186/s12864-024-10860-8
Muhammad Asad Ullah, Muhammad Awais Ahmed, Latifa AlHusnain, Muhammad Abu Bakar Zia, Muneera D F AlKahtani, Kotb A Attia, Mohammed Hawash

Drought stress poses a critical threat to global crop yields and sustainable agriculture. The GASA genes are recognized for their pivotal role in stress tolerance and plant growth, but little is known about how they function in sunflowers. The investigation aimed to identify and elucidate the role of HaGASA genes in conferring sunflowers with drought tolerance. Twenty-seven different HaGASA gene family members were found in this study that were inconsistently located across eleven sunflower chromosomes. Phylogeny analysis revealed that the sunflower HaGASA genes were divided into five subgroups by comparing GASA genes with those from Arabidopsis, peanut, and soybean, with members within each subgroup displaying similar conserved motifs and gene structures. In-silico evaluation of cis-regulatory elements indicated the existence of specific elements associated with stress-responsiveness being the most abundant, followed by hormone, light, and growth-responsive elements. Transcriptomic data from the NCBI database was utilized to assess the HaGASA genes expression profile in different sunflower varieties under drought conditions. The HaGASA genes expression across ten sunflower genotypes under drought stress, revealed 14 differentially expressed HaGASA genes, implying their active role in the plant's stress response. The expression in different organs revealed that HaGASA2, HaGASA11, HaGASA17, HaGASA19, HaGASA21 and HaGASA26 displayed maximum expression in the stem. Our findings implicate HaGASA genes in mediating sunflower growth maintenance and adaptation to abiotic stress, particularly drought. The findings, taken together, provided a basic understanding of the structure and potential functions of HaGASA genes, setting the framework for further functional investigations into their roles in drought stress mitigation and crop improvement strategies.

干旱胁迫对全球作物产量和可持续农业构成严重威胁。GASA 基因在胁迫耐受性和植物生长中发挥着关键作用,但人们对它们在向日葵中的功能知之甚少。这项研究旨在鉴定和阐明 HaGASA 基因在赋予向日葵耐旱性方面的作用。这项研究发现了 27 个不同的 HaGASA 基因家族成员,它们在 11 条向日葵染色体上的位置并不一致。系统进化分析表明,通过将向日葵的 HaGASA 基因与拟南芥、花生和大豆的 GASA 基因进行比较,向日葵的 HaGASA 基因被分为五个亚群,每个亚群中的成员都具有相似的保守基序和基因结构。对顺式调控元件的内部评估表明,与胁迫响应相关的特定元件最多,其次是激素、光和生长响应元件。利用 NCBI 数据库中的转录组数据评估了不同向日葵品种在干旱条件下的 HaGASA 基因表达谱。干旱胁迫条件下 10 个向日葵基因型的 HaGASA 基因表达情况显示,有 14 个 HaGASA 基因有差异表达,这意味着它们在植物的胁迫响应中发挥了积极作用。在不同器官中的表达显示,HaGASA2、HaGASA11、HaGASA17、HaGASA19、HaGASA21 和 HaGASA26 在茎中的表达量最大。我们的研究结果表明,HaGASA 基因介导向日葵的生长维持和对非生物胁迫(尤其是干旱)的适应。综合上述研究结果,我们对 HaGASA 基因的结构和潜在功能有了基本的了解,为进一步研究其在缓解干旱胁迫和作物改良战略中的作用奠定了基础。
{"title":"Comprehensive identification of GASA genes in sunflower and expression profiling in response to drought.","authors":"Muhammad Asad Ullah, Muhammad Awais Ahmed, Latifa AlHusnain, Muhammad Abu Bakar Zia, Muneera D F AlKahtani, Kotb A Attia, Mohammed Hawash","doi":"10.1186/s12864-024-10860-8","DOIUrl":"https://doi.org/10.1186/s12864-024-10860-8","url":null,"abstract":"<p><p>Drought stress poses a critical threat to global crop yields and sustainable agriculture. The GASA genes are recognized for their pivotal role in stress tolerance and plant growth, but little is known about how they function in sunflowers. The investigation aimed to identify and elucidate the role of HaGASA genes in conferring sunflowers with drought tolerance. Twenty-seven different HaGASA gene family members were found in this study that were inconsistently located across eleven sunflower chromosomes. Phylogeny analysis revealed that the sunflower HaGASA genes were divided into five subgroups by comparing GASA genes with those from Arabidopsis, peanut, and soybean, with members within each subgroup displaying similar conserved motifs and gene structures. In-silico evaluation of cis-regulatory elements indicated the existence of specific elements associated with stress-responsiveness being the most abundant, followed by hormone, light, and growth-responsive elements. Transcriptomic data from the NCBI database was utilized to assess the HaGASA genes expression profile in different sunflower varieties under drought conditions. The HaGASA genes expression across ten sunflower genotypes under drought stress, revealed 14 differentially expressed HaGASA genes, implying their active role in the plant's stress response. The expression in different organs revealed that HaGASA2, HaGASA11, HaGASA17, HaGASA19, HaGASA21 and HaGASA26 displayed maximum expression in the stem. Our findings implicate HaGASA genes in mediating sunflower growth maintenance and adaptation to abiotic stress, particularly drought. The findings, taken together, provided a basic understanding of the structure and potential functions of HaGASA genes, setting the framework for further functional investigations into their roles in drought stress mitigation and crop improvement strategies.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11472593/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142457463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning classification of archaea and bacteria identifies novel predictive genomic features. 古细菌和细菌的机器学习分类确定了新的预测性基因组特征。
IF 3.5 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-10-14 DOI: 10.1186/s12864-024-10832-y
Tania Bobbo, Filippo Biscarini, Sachithra K Yaddehige, Leonardo Alberghini, Davide Rigoni, Nicoletta Bianchi, Cristian Taccioli

Background: Archaea and Bacteria are distinct domains of life that are adapted to a variety of ecological niches. Several genome-based methods have been developed for their accurate classification, yet many aspects of the specific genomic features that determine these differences are not fully understood. In this study, we used publicly available whole-genome sequences from bacteria ( N = 2546 ) and archaea ( N = 109 ). From these, a set of genomic features (nucleotide frequencies and proportions, coding sequences (CDS), non-coding, ribosomal and transfer RNA genes (ncRNA, rRNA, tRNA), Chargaff's, topological entropy and Shannon's entropy scores) was extracted and used as input data to develop machine learning models for the classification of archaea and bacteria.

Results: The classification accuracy ranged from 0.993 (Random Forest) to 0.998 (Neural Networks). Over the four models, only 11 examples were misclassified, especially those belonging to the minority class (Archaea). From variable importance, tRNA topological and Shannon's entropy, nucleotide frequencies in tRNA, rRNA and ncRNA, CDS, tRNA and rRNA Chargaff's scores have emerged as the top discriminating factors. In particular, tRNA entropy (both topological and Shannon's) was the most important genomic feature for classification, pointing at the complex interactions between the genetic code, tRNAs and the translational machinery.

Conclusions: tRNA, rRNA and ncRNA genes emerged as the key genomic elements that underpin the classification of archaea and bacteria. In particular, higher nucleotide diversity was found in tRNA from bacteria compared to archaea. The analysis of the few classification errors reflects the complex phylogenetic relationships between bacteria, archaea and eukaryotes.

背景:古细菌和细菌是适应各种生态位的不同生命领域。目前已开发出几种基于基因组的方法来对它们进行准确分类,但人们对决定这些差异的特定基因组特征的许多方面还不完全了解。在这项研究中,我们使用了可公开获得的细菌(N = 2546)和古菌(N = 109)的全基因组序列。从中提取了一组基因组特征(核苷酸频率和比例、编码序列(CDS)、非编码、核糖体和转运RNA基因(ncRNA、rRNA、tRNA)、Chargaff熵、拓扑熵和香农熵分数),并将其作为输入数据,用于开发古细菌和细菌分类的机器学习模型:分类准确率从 0.993(随机森林)到 0.998(神经网络)不等。在四个模型中,只有 11 个例子被错误分类,尤其是属于少数类(古菌)的例子。从变量重要性来看,tRNA 的拓扑熵和香农熵、tRNA、rRNA 和 ncRNA 中的核苷酸频率、CDS、tRNA 和 rRNA 的 Chargaff 分数成为最重要的判别因素。结论:tRNA、rRNA 和 ncRNA 基因是支持古细菌和细菌分类的关键基因组元素。与古细菌相比,细菌的 tRNA 的核苷酸多样性更高。对少数分类错误的分析反映了细菌、古菌和真核生物之间复杂的系统发育关系。
{"title":"Machine learning classification of archaea and bacteria identifies novel predictive genomic features.","authors":"Tania Bobbo, Filippo Biscarini, Sachithra K Yaddehige, Leonardo Alberghini, Davide Rigoni, Nicoletta Bianchi, Cristian Taccioli","doi":"10.1186/s12864-024-10832-y","DOIUrl":"https://doi.org/10.1186/s12864-024-10832-y","url":null,"abstract":"<p><strong>Background: </strong>Archaea and Bacteria are distinct domains of life that are adapted to a variety of ecological niches. Several genome-based methods have been developed for their accurate classification, yet many aspects of the specific genomic features that determine these differences are not fully understood. In this study, we used publicly available whole-genome sequences from bacteria ( <math><mrow><mi>N</mi> <mo>=</mo> <mn>2546</mn></mrow> </math> ) and archaea ( <math><mrow><mi>N</mi> <mo>=</mo> <mn>109</mn></mrow> </math> ). From these, a set of genomic features (nucleotide frequencies and proportions, coding sequences (CDS), non-coding, ribosomal and transfer RNA genes (ncRNA, rRNA, tRNA), Chargaff's, topological entropy and Shannon's entropy scores) was extracted and used as input data to develop machine learning models for the classification of archaea and bacteria.</p><p><strong>Results: </strong>The classification accuracy ranged from 0.993 (Random Forest) to 0.998 (Neural Networks). Over the four models, only 11 examples were misclassified, especially those belonging to the minority class (Archaea). From variable importance, tRNA topological and Shannon's entropy, nucleotide frequencies in tRNA, rRNA and ncRNA, CDS, tRNA and rRNA Chargaff's scores have emerged as the top discriminating factors. In particular, tRNA entropy (both topological and Shannon's) was the most important genomic feature for classification, pointing at the complex interactions between the genetic code, tRNAs and the translational machinery.</p><p><strong>Conclusions: </strong>tRNA, rRNA and ncRNA genes emerged as the key genomic elements that underpin the classification of archaea and bacteria. In particular, higher nucleotide diversity was found in tRNA from bacteria compared to archaea. The analysis of the few classification errors reflects the complex phylogenetic relationships between bacteria, archaea and eukaryotes.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11472548/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142457493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Developmental and validation of a novel small and high-efficient panel of microhaplotypes for forensic genetics by the next generation sequencing. 利用新一代测序技术开发和验证用于法医遗传学的新型小型高效微单型面板。
IF 3.5 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-10-14 DOI: 10.1186/s12864-024-10880-4
Changyun Gu, Weipeng Huo, Xiaolan Huang, Li Chen, Shunyi Tian, Qianchong Ran, Zheng Ren, Qiyan Wang, Meiqing Yang, Jingyan Ji, Yubo Liu, Min Zhong, Kang Wang, Danlu Song, Jiang Huang, Hongling Zhang, Xiaoye Jin

Background: In the domain of forensic science, the application of kinship identification and mixture deconvolution techniques are of critical importance, providing robust scientific evidence for the resolution of complex cases. Microhaplotypes, as the emerging class of genetic markers, have been widely studied in forensics due to their high polymorphisms and excellent stability.

Results and discussion: In this research, a novel and high-efficient panel integrating 33 microhaplotype loci along with a sex-determining locus was developed by the next generation sequencing technology. In addition, we also assessed its forensic utility and delved into its capacity for kinship analysis and mixture deconvolution. The average effective number of alleles (Ae) of the 33 microhaplotype loci in the Guizhou Han population was 6.06, and the Ae values of 30 loci were greater than 5. The cumulative power of discrimination and cumulative power of exclusion values of the novel panel in the Guizhou Han population were 1-5.6 × 10- 43 and 1-1.6 × 10- 15, respectively. In the simulated kinship analysis, the panel could effectively distinguish between parent-child, full-sibling, half-sibling, grandfather-grandson, aunt-nephew and unrelated individuals, but uncertainty rates clearly increased when distinguishing between first cousins and unrelated individuals. For the mixtures, the novel panel had demonstrated excellent performance in estimating the number of contributors of mixtures with 1 to 5 contributors in combination with the machine learning methods.

Conclusions: In summary, we have developed a small and high-efficient panel for forensic genetics, which could provide novel insights into forensic complex kinships testing and mixture deconvolution.

背景:在法医学领域,亲缘关系鉴定和混合物解旋技术的应用至关重要,可为复杂案件的侦破提供有力的科学证据。微单体型作为一类新兴的遗传标记,因其高多态性和出色的稳定性,在法医学中得到了广泛的研究:在这项研究中,我们利用新一代测序技术开发了一个新颖高效的小组,该小组整合了 33 个微单体型位点和一个性别决定位点。此外,我们还评估了它在法医方面的实用性,并深入研究了它在亲缘关系分析和混合解构方面的能力。贵州汉族 33 个微单型位点的平均有效等位基因数(Ae)为 6.06,其中 30 个位点的 Ae 值大于 5。贵州汉族人群中新型面板的累积区分度和累积排除度分别为 1-5.6 × 10- 43 和 1-1.6 × 10- 15。在模拟亲缘关系分析中,面板能有效区分亲子、全同胞、半同胞、祖孙、姑侄和非亲缘关系个体,但在区分嫡表亲和非亲缘关系个体时,不确定率明显增加。在混合物方面,新型面板与机器学习方法相结合,在估计 1 至 5 个贡献者的混合物贡献者数量方面表现出色:总之,我们为法医遗传学开发了一个小型高效面板,可为法医复杂亲缘关系测试和混合物解卷积提供新的见解。
{"title":"Developmental and validation of a novel small and high-efficient panel of microhaplotypes for forensic genetics by the next generation sequencing.","authors":"Changyun Gu, Weipeng Huo, Xiaolan Huang, Li Chen, Shunyi Tian, Qianchong Ran, Zheng Ren, Qiyan Wang, Meiqing Yang, Jingyan Ji, Yubo Liu, Min Zhong, Kang Wang, Danlu Song, Jiang Huang, Hongling Zhang, Xiaoye Jin","doi":"10.1186/s12864-024-10880-4","DOIUrl":"https://doi.org/10.1186/s12864-024-10880-4","url":null,"abstract":"<p><strong>Background: </strong>In the domain of forensic science, the application of kinship identification and mixture deconvolution techniques are of critical importance, providing robust scientific evidence for the resolution of complex cases. Microhaplotypes, as the emerging class of genetic markers, have been widely studied in forensics due to their high polymorphisms and excellent stability.</p><p><strong>Results and discussion: </strong>In this research, a novel and high-efficient panel integrating 33 microhaplotype loci along with a sex-determining locus was developed by the next generation sequencing technology. In addition, we also assessed its forensic utility and delved into its capacity for kinship analysis and mixture deconvolution. The average effective number of alleles (Ae) of the 33 microhaplotype loci in the Guizhou Han population was 6.06, and the Ae values of 30 loci were greater than 5. The cumulative power of discrimination and cumulative power of exclusion values of the novel panel in the Guizhou Han population were 1-5.6 × 10<sup>- 43</sup> and 1-1.6 × 10<sup>- 15</sup>, respectively. In the simulated kinship analysis, the panel could effectively distinguish between parent-child, full-sibling, half-sibling, grandfather-grandson, aunt-nephew and unrelated individuals, but uncertainty rates clearly increased when distinguishing between first cousins and unrelated individuals. For the mixtures, the novel panel had demonstrated excellent performance in estimating the number of contributors of mixtures with 1 to 5 contributors in combination with the machine learning methods.</p><p><strong>Conclusions: </strong>In summary, we have developed a small and high-efficient panel for forensic genetics, which could provide novel insights into forensic complex kinships testing and mixture deconvolution.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11475632/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142457465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Microsatellite density landscapes illustrate short tandem repeats aggregation in the complete reference human genome. 微卫星密度图显示了完整参考人类基因组中短串联重复序列的聚集情况。
IF 3.5 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-10-14 DOI: 10.1186/s12864-024-10843-9
Yun Xia, Douyue Li, Tingyi Chen, Saichao Pan, Hanrou Huang, Wenxiang Zhang, Yulin Liang, Yongzhuo Fu, Zhuli Peng, Hongxi Zhang, Liang Zhang, Shan Peng, Ruixue Shi, Xingxin He, Siqian Zhou, Weili Jiao, Xiangyan Zhao, Xiaolong Wu, Lan Zhou, Jingyu Zhou, Qingjian Ouyang, You Tian, Xiaoping Jiang, Yi Zhou, Shiying Tang, Junxiong Shen, Kazusato Ohshima, Zhongyang Tan

Background: Microsatellites are increasingly realized to have biological significance in human genome and health in past decades, the assembled complete reference sequence of human genome T2T-CHM13 brought great help for a comprehensive study of short tandem repeats in the human genome.

Results: Microsatellites density landscapes of all 24 chromosomes were built here for the first complete reference sequence of human genome T2T-CHM13. These landscapes showed that short tandem repeats (STRs) are prone to aggregate characteristically to form a large number of STRs density peaks. We classified 8,823 High Microsatellites Density Peaks (HMDPs), 35,257 Middle Microsatellites Density Peaks (MMDPs) and 199, 649 Low Microsatellites Density Peaks (LMDPs) on the 24 chromosomes; and also classified the motif types of every microsatellites density peak. These STRs density aggregation peaks are mainly composing of a single motif, and AT is the most dominant motif, followed by AATGG and CCATT motifs. And 514 genomic regions were characterized by microsatellite density feature in the full T2T-CHM13 genome.

Conclusions: These landscape maps exhibited that microsatellites aggregate in many genomic positions to form a large number of microsatellite density peaks with composing of mainly single motif type in the complete reference genome, indicating that the local microsatellites density varies enormously along the every chromosome of T2T-CHM13.

背景:过去几十年来,人们越来越认识到微卫星在人类基因组和健康中的生物学意义,人类基因组T2T-CHM13完整参考序列的组装为全面研究人类基因组中的短串联重复序列提供了巨大帮助:结果:在人类基因组T2T-CHM13的第一个完整参考序列中,建立了全部24条染色体的微卫星密度图谱。这些图谱显示,短串联重复序列(STR)容易聚集形成大量的STR密度峰。我们对 24 条染色体上的 8823 个高微卫星密度峰(HMDP)、35257 个中微卫星密度峰(MMDP)和 199649 个低微卫星密度峰(LMDP)进行了分类,并对每个微卫星密度峰的主题类型进行了分类。这些 STRs 密度聚集峰主要由单一图案构成,其中 AT 是最主要的图案,其次是 AATGG 和 CCATT 图案。T2T-CHM13全基因组中有514个基因组区域具有微卫星密度特征:这些图谱显示,在完整的参考基因组中,微卫星在许多基因组位置聚集,形成了大量的微卫星密度峰,且主要由单一的图案类型组成,这表明在T2T-CHM13的每条染色体上,局部微卫星的密度差异巨大。
{"title":"Microsatellite density landscapes illustrate short tandem repeats aggregation in the complete reference human genome.","authors":"Yun Xia, Douyue Li, Tingyi Chen, Saichao Pan, Hanrou Huang, Wenxiang Zhang, Yulin Liang, Yongzhuo Fu, Zhuli Peng, Hongxi Zhang, Liang Zhang, Shan Peng, Ruixue Shi, Xingxin He, Siqian Zhou, Weili Jiao, Xiangyan Zhao, Xiaolong Wu, Lan Zhou, Jingyu Zhou, Qingjian Ouyang, You Tian, Xiaoping Jiang, Yi Zhou, Shiying Tang, Junxiong Shen, Kazusato Ohshima, Zhongyang Tan","doi":"10.1186/s12864-024-10843-9","DOIUrl":"https://doi.org/10.1186/s12864-024-10843-9","url":null,"abstract":"<p><strong>Background: </strong>Microsatellites are increasingly realized to have biological significance in human genome and health in past decades, the assembled complete reference sequence of human genome T2T-CHM13 brought great help for a comprehensive study of short tandem repeats in the human genome.</p><p><strong>Results: </strong>Microsatellites density landscapes of all 24 chromosomes were built here for the first complete reference sequence of human genome T2T-CHM13. These landscapes showed that short tandem repeats (STRs) are prone to aggregate characteristically to form a large number of STRs density peaks. We classified 8,823 High Microsatellites Density Peaks (HMDPs), 35,257 Middle Microsatellites Density Peaks (MMDPs) and 199, 649 Low Microsatellites Density Peaks (LMDPs) on the 24 chromosomes; and also classified the motif types of every microsatellites density peak. These STRs density aggregation peaks are mainly composing of a single motif, and AT is the most dominant motif, followed by AATGG and CCATT motifs. And 514 genomic regions were characterized by microsatellite density feature in the full T2T-CHM13 genome.</p><p><strong>Conclusions: </strong>These landscape maps exhibited that microsatellites aggregate in many genomic positions to form a large number of microsatellite density peaks with composing of mainly single motif type in the complete reference genome, indicating that the local microsatellites density varies enormously along the every chromosome of T2T-CHM13.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11477012/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142457495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genome-wide identification, expression analysis of the R2R3-MYB gene family and their potential roles under cold stress in Prunus sibirica. 西伯利亚李子 R2R3-MYB 基因家族的全基因组鉴定、表达分析及其在冷胁迫下的潜在作用。
IF 3.5 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-10-14 DOI: 10.1186/s12864-024-10868-0
Xin Zhao, Shipeng Wang, Hongrui Zhang, Shengjun Dong, Jianhua Chen, Yongqiang Sun, Yueyuan Zhang, Quangang Liu

Background: The R2R3-MYB transcription factors in plants participate in various physiological and biochemical processes and responds to various external stimuli. Prunus sibirica (known as Siberian apricot) is a drupe tree species that produces extremely high nutritional value kernels. However, it is susceptiblility to frost damage during the flowering period, results in a marked reduction in kernel yield.

Results: In this study, the MYB gene family of P. sibirica (PsMYB) was systematically analyzed, and 116 R2R3-MYB genes that were distributed unevenly over eight chromosomes were ultimately screened. Phylogenetic analysis divided these 116 genes into 30 subgroups. We discovered that 37 PsMYBs had cold stress-responsive promoters, and six PsMYBs were annotated to be associated with cold response. Intraspecific homology analysis identified segmental duplication as the primary gene amplification mechanism, and homology analysis of the PsMYB genes with those of five other species revealed phylogenetic relationships with Rosaceae species. Protein interaction studies revealed collaborative regulation of the PsMYB proteins with Arabidopsis protein, and transcriptome analysis identified PsMYB genes that were highly expressed at low temperatures. Additionally, the expression levels of 22 PsMYBs in different tissue parts of P. sibirica and under different low-temperature stress conditions were evaluated using quantitative real-time PCR, with the results verifying that PsMYBs are specifically expressed in different plant parts and may be involved in the growth and development of P. sibirica species. Genes upregulated after exposure to low-temperature stress and likely involved in cold response were identified.

Conclusion: This study lays a foundation for understanding the molecular biology of PsMYBs in P. sibirica and provides a theoretical basis for the future study of transgenic lines with cold resistance during the flowering period of this tree.

背景:植物中的 R2R3-MYB 转录因子参与各种生理和生化过程,并对各种外部刺激做出反应。西伯利亚杏(Prunus sibirica,又称西伯利亚杏)是一种核果类树种,果核营养价值极高。然而,它在开花期易受冻害,导致果仁产量明显下降:结果:本研究系统分析了西伯利亚莓的 MYB 基因家族(PsMYB),最终筛选出 116 个 R2R3-MYB 基因,这些基因不均匀地分布在 8 条染色体上。系统进化分析将这 116 个基因分为 30 个亚群。我们发现 37 个 PsMYB 具有冷胁迫响应启动子,6 个 PsMYB 被注释为与冷响应相关。种内同源性分析发现,片段复制是主要的基因扩增机制,PsMYB基因与其他五个物种基因的同源性分析揭示了与蔷薇科物种的系统发育关系。蛋白质相互作用研究表明,PsMYB 蛋白与拟南芥蛋白具有协同调控作用,转录组分析发现了在低温条件下高表达的 PsMYB 基因。此外,利用实时定量 PCR 技术评估了 22 个 PsMYB 在西伯利亚豹不同组织部位和不同低温胁迫条件下的表达水平,结果验证了 PsMYB 在植物不同部位的特异性表达,可能参与了西伯利亚豹的生长发育。研究还发现了暴露于低温胁迫后上调的基因,这些基因可能参与了低温响应:本研究为了解西伯利亚红豆杉(PsMYBs)的分子生物学奠定了基础,并为今后研究该树种花期抗寒性转基因品系提供了理论依据。
{"title":"Genome-wide identification, expression analysis of the R2R3-MYB gene family and their potential roles under cold stress in Prunus sibirica.","authors":"Xin Zhao, Shipeng Wang, Hongrui Zhang, Shengjun Dong, Jianhua Chen, Yongqiang Sun, Yueyuan Zhang, Quangang Liu","doi":"10.1186/s12864-024-10868-0","DOIUrl":"https://doi.org/10.1186/s12864-024-10868-0","url":null,"abstract":"<p><strong>Background: </strong>The R2R3-MYB transcription factors in plants participate in various physiological and biochemical processes and responds to various external stimuli. Prunus sibirica (known as Siberian apricot) is a drupe tree species that produces extremely high nutritional value kernels. However, it is susceptiblility to frost damage during the flowering period, results in a marked reduction in kernel yield.</p><p><strong>Results: </strong>In this study, the MYB gene family of P. sibirica (PsMYB) was systematically analyzed, and 116 R2R3-MYB genes that were distributed unevenly over eight chromosomes were ultimately screened. Phylogenetic analysis divided these 116 genes into 30 subgroups. We discovered that 37 PsMYBs had cold stress-responsive promoters, and six PsMYBs were annotated to be associated with cold response. Intraspecific homology analysis identified segmental duplication as the primary gene amplification mechanism, and homology analysis of the PsMYB genes with those of five other species revealed phylogenetic relationships with Rosaceae species. Protein interaction studies revealed collaborative regulation of the PsMYB proteins with Arabidopsis protein, and transcriptome analysis identified PsMYB genes that were highly expressed at low temperatures. Additionally, the expression levels of 22 PsMYBs in different tissue parts of P. sibirica and under different low-temperature stress conditions were evaluated using quantitative real-time PCR, with the results verifying that PsMYBs are specifically expressed in different plant parts and may be involved in the growth and development of P. sibirica species. Genes upregulated after exposure to low-temperature stress and likely involved in cold response were identified.</p><p><strong>Conclusion: </strong>This study lays a foundation for understanding the molecular biology of PsMYBs in P. sibirica and provides a theoretical basis for the future study of transgenic lines with cold resistance during the flowering period of this tree.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11472476/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142457478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metagenomic assemblies tend to break around antibiotic resistance genes. 元基因组的组合往往会在抗生素抗性基因周围发生断裂。
IF 3.5 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-10-14 DOI: 10.1186/s12864-024-10876-0
Anna Abramova, Antti Karkman, Johan Bengtsson-Palme

Background: Assembly of metagenomic samples can provide essential information about the mobility potential and taxonomic origin of antibiotic resistance genes (ARGs) and inform interventions to prevent further spread of resistant bacteria. However, similar to other conserved regions, such as ribosomal RNA genes and mobile genetic elements, almost identical ARGs typically occur in multiple genomic contexts across different species, representing a considerable challenge for the assembly process. Usually, this results in many fragmented contigs of unclear origin, complicating the risk assessment of ARG detections. To systematically investigate the impact of this issue on detection, quantification and contextualization of ARGs, we evaluated the performance of different assembly approaches, including genomic-, metagenomic- and transcriptomic-specialized assemblers. We quantified recovery and accuracy rates of each tool for ARGs both from in silico spiked metagenomic samples as well as real samples sequenced using both long- and short-read sequencing technologies.

Results: The results revealed that none of the investigated tools can accurately capture genomic contexts present in samples of high complexity. The transcriptomic assembler Trinity showed a better performance in terms of reconstructing longer and fewer contigs matching unique genomic contexts, which can be beneficial for deciphering the taxonomic origin of ARGs. The currently commonly used metagenomic assembly tools metaSPAdes and MEGAHIT were able to identify the ARG repertoire but failed to fully recover the diversity of genomic contexts present in a sample. On top of that, in a complex scenario MEGAHIT produced very short contigs, which can lead to considerable underestimation of the resistome in a given sample.

Conclusions: Our study shows that metaSPAdes and Trinity would be the preferable tools in terms of accuracy to recover correct genomic contexts around ARGs in metagenomic samples characterized by uneven coverages. Overall, the inability of assemblers to reconstruct long ARG-containing contigs has impacts on ARG quantification, suggesting that directly mapping reads to an ARG database should be performed as a complementary strategy to get accurate ARG abundance and diversity measures.

背景:元基因组样本的组装可以提供有关抗生素耐药基因(ARGs)的迁移潜力和分类起源的重要信息,并为防止耐药细菌进一步扩散的干预措施提供依据。然而,与核糖体 RNA 基因和移动遗传因子等其他保守区域类似,几乎相同的 ARGs 通常出现在不同物种的多个基因组环境中,这对组装过程是一个相当大的挑战。这通常会导致许多来源不明的片段等位基因,从而使 ARG 检测的风险评估复杂化。为了系统地研究这一问题对 ARGs 的检测、量化和背景化的影响,我们评估了不同组装方法的性能,包括基因组、元基因组和转录组专用组装器。我们量化了每种工具对ARGs的回收率和准确率,这些ARGs既来自硅学添加的元基因组样本,也来自使用长、短线程测序技术测序的真实样本:结果表明,所研究的工具都无法准确捕捉高复杂性样本中的基因组背景。转录组组装工具 Trinity 在重建与独特基因组背景相匹配的较长、较少的等位基因方面表现较好,这有利于破译 ARGs 的分类起源。目前常用的元基因组组装工具 metaSPAdes 和 MEGAHIT 能够识别 ARG 基因库,但不能完全恢复样本中基因组上下文的多样性。此外,在复杂的情况下,MEGAHIT 产生的等位基因非常短,这可能会导致严重低估特定样本中的抗性基因组:我们的研究表明,在覆盖率不均匀的元基因组样本中,要恢复 ARGs 周围正确的基因组上下文,就准确性而言,metaSPAdes 和 Trinity 是更可取的工具。总之,组装器无法重建含 ARG 的长片段会影响 ARG 定量,这表明应将直接将读数映射到 ARG 数据库作为一种补充策略,以获得准确的 ARG 丰度和多样性测量结果。
{"title":"Metagenomic assemblies tend to break around antibiotic resistance genes.","authors":"Anna Abramova, Antti Karkman, Johan Bengtsson-Palme","doi":"10.1186/s12864-024-10876-0","DOIUrl":"https://doi.org/10.1186/s12864-024-10876-0","url":null,"abstract":"<p><strong>Background: </strong>Assembly of metagenomic samples can provide essential information about the mobility potential and taxonomic origin of antibiotic resistance genes (ARGs) and inform interventions to prevent further spread of resistant bacteria. However, similar to other conserved regions, such as ribosomal RNA genes and mobile genetic elements, almost identical ARGs typically occur in multiple genomic contexts across different species, representing a considerable challenge for the assembly process. Usually, this results in many fragmented contigs of unclear origin, complicating the risk assessment of ARG detections. To systematically investigate the impact of this issue on detection, quantification and contextualization of ARGs, we evaluated the performance of different assembly approaches, including genomic-, metagenomic- and transcriptomic-specialized assemblers. We quantified recovery and accuracy rates of each tool for ARGs both from in silico spiked metagenomic samples as well as real samples sequenced using both long- and short-read sequencing technologies.</p><p><strong>Results: </strong>The results revealed that none of the investigated tools can accurately capture genomic contexts present in samples of high complexity. The transcriptomic assembler Trinity showed a better performance in terms of reconstructing longer and fewer contigs matching unique genomic contexts, which can be beneficial for deciphering the taxonomic origin of ARGs. The currently commonly used metagenomic assembly tools metaSPAdes and MEGAHIT were able to identify the ARG repertoire but failed to fully recover the diversity of genomic contexts present in a sample. On top of that, in a complex scenario MEGAHIT produced very short contigs, which can lead to considerable underestimation of the resistome in a given sample.</p><p><strong>Conclusions: </strong>Our study shows that metaSPAdes and Trinity would be the preferable tools in terms of accuracy to recover correct genomic contexts around ARGs in metagenomic samples characterized by uneven coverages. Overall, the inability of assemblers to reconstruct long ARG-containing contigs has impacts on ARG quantification, suggesting that directly mapping reads to an ARG database should be performed as a complementary strategy to get accurate ARG abundance and diversity measures.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11479545/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142457494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Under the name of "Lua": revisiting genetic heterogeneity and population ancestry of Austroasiatic speakers in northern Thailand through genomic analysis. 以 "Lua "为名:通过基因组分析重新审视泰国北部讲奥斯特罗西亚语的人的遗传异质性和人口祖先。
IF 3.5 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-10-14 DOI: 10.1186/s12864-024-10865-3
Jatupol Kampuansai, Tanapon Seetaraso, Maneesawan Dansawan, Suwapat Sathupak, Wibhu Kutanan, Metawee Srikummool, Angkana Inta

Background: Austroasiatic (AA)-speaking populations in northern Thailand are of significant interest due to their status as indigenous descendants and their location at the crossroads of AA prehistoric distribution across Southern China, the Indian Subcontinent, and Mainland Southeast Asia. However, the complexity of ethnic identification can result in inaccuracies regarding the origin and migration history of these populations. To address this, we have conducted a genome-wide SNP analysis of 89 individuals from two Lavue and three Lwa-endonym populations. We then combined our outcomes with previously published data to elucidate the genetic diversity and clustering of AA groups in northern Thailand.

Results: Our findings align with existing linguistic classifications, revealing different genetic compositions among the three branches of the Mon-Khmer subfamily within the AA family: Monic, Khmuic, and Palaungic. Although the term "Lua" ethnicity is confusingly used to identify ethnic groups belonging to both Khmuic and Palaungic branches, our genomic data indicate that the Khmuic-speaking Lua living on the eastern side of the region are relatively distant from the Palaungic-speaking Lavue and Lwa populations living on the western side. The Lavue populations, primarily inhabiting mountainous areas, exhibit a genetic makeup unique to the AA family, with a close genetic relationship to the Karenic subgroup of the Sino-Tibetan language family. Conversely, the Lwa and Blang populations, residing in lowland river valleys, display genetic signatures resulting from admixture with Tai-Kadai-speaking ethnic groups.

Conclusion: Utilizing genome-wide SNP markers, our findings indicate genetic heterogeneity among the Lua, Lavue, and Lwa ethnic groups. The intricate interplay of genetics, cultural heritage, and historical influences has shaped these ethnic communities. Our study underscores the importance of accurate ethnic classifications, emphasizing the use of self-identified endonyms, names created and used by the ethnic groups themselves. This approach respects the AA communities in northern Thailand and acknowledges their significant contributions to advancing our understanding of genetic anthropology.

背景:泰国北部讲奥斯特罗西亚语(AA)的人群由于其原住民后裔的身份,以及地处中国南部、印度次大陆和东南亚大陆 AA 史前分布的十字路口而备受关注。然而,民族识别的复杂性可能会导致这些人群的起源和迁徙历史不准确。为了解决这个问题,我们对来自两个拉乌人和三个泸瓦人的 89 个个体进行了全基因组 SNP 分析。然后,我们将分析结果与之前公布的数据相结合,阐明了泰国北部 AA 群体的遗传多样性和聚类:结果:我们的研究结果与现有的语言分类一致,揭示了 AA 族中孟高棉亚族三个分支之间不同的遗传组成:莫尼克族、高棉族和帕拉昂族。尽管 "Lua "这个词被混淆地用于识别属于高棉语和巴朗吉语两个分支的族群,但我们的基因组数据表明,生活在该地区东部、讲高棉语的 Lua 族群与生活在西部、讲巴朗吉语的 Lavue 和 Lwa 族群之间的距离相对较远。拉乌人主要居住在山区,他们的基因构成是 AA 族所独有的,与汉藏语系的克伦语亚族有着密切的遗传关系。相反,居住在低地河谷地区的卢瓦族和布朗族则显示出与讲傣语的民族混血的遗传特征:结论:利用全基因组 SNP 标记,我们的研究结果表明 Lua、Lavue 和 Lwa 族群之间存在遗传异质性。遗传学、文化遗产和历史影响的复杂相互作用塑造了这些族群。我们的研究强调了准确的种族分类的重要性,强调使用自我认同的本地名称,即由族群自己创建和使用的名称。这种方法尊重了泰国北部的 AA 族群,并肯定了他们为促进我们对遗传人类学的理解所做出的重要贡献。
{"title":"Under the name of \"Lua\": revisiting genetic heterogeneity and population ancestry of Austroasiatic speakers in northern Thailand through genomic analysis.","authors":"Jatupol Kampuansai, Tanapon Seetaraso, Maneesawan Dansawan, Suwapat Sathupak, Wibhu Kutanan, Metawee Srikummool, Angkana Inta","doi":"10.1186/s12864-024-10865-3","DOIUrl":"10.1186/s12864-024-10865-3","url":null,"abstract":"<p><strong>Background: </strong>Austroasiatic (AA)-speaking populations in northern Thailand are of significant interest due to their status as indigenous descendants and their location at the crossroads of AA prehistoric distribution across Southern China, the Indian Subcontinent, and Mainland Southeast Asia. However, the complexity of ethnic identification can result in inaccuracies regarding the origin and migration history of these populations. To address this, we have conducted a genome-wide SNP analysis of 89 individuals from two Lavue and three Lwa-endonym populations. We then combined our outcomes with previously published data to elucidate the genetic diversity and clustering of AA groups in northern Thailand.</p><p><strong>Results: </strong>Our findings align with existing linguistic classifications, revealing different genetic compositions among the three branches of the Mon-Khmer subfamily within the AA family: Monic, Khmuic, and Palaungic. Although the term \"Lua\" ethnicity is confusingly used to identify ethnic groups belonging to both Khmuic and Palaungic branches, our genomic data indicate that the Khmuic-speaking Lua living on the eastern side of the region are relatively distant from the Palaungic-speaking Lavue and Lwa populations living on the western side. The Lavue populations, primarily inhabiting mountainous areas, exhibit a genetic makeup unique to the AA family, with a close genetic relationship to the Karenic subgroup of the Sino-Tibetan language family. Conversely, the Lwa and Blang populations, residing in lowland river valleys, display genetic signatures resulting from admixture with Tai-Kadai-speaking ethnic groups.</p><p><strong>Conclusion: </strong>Utilizing genome-wide SNP markers, our findings indicate genetic heterogeneity among the Lua, Lavue, and Lwa ethnic groups. The intricate interplay of genetics, cultural heritage, and historical influences has shaped these ethnic communities. Our study underscores the importance of accurate ethnic classifications, emphasizing the use of self-identified endonyms, names created and used by the ethnic groups themselves. This approach respects the AA communities in northern Thailand and acknowledges their significant contributions to advancing our understanding of genetic anthropology.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11472482/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142457512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying transcription factors with cell-type specific DNA binding signatures. 识别具有细胞类型特异性 DNA 结合特征的转录因子
IF 3.5 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-10-14 DOI: 10.1186/s12864-024-10859-1
Aseel Awdeh, Marcel Turcotte, Theodore J Perkins

Background: Transcription factors (TFs) bind to different parts of the genome in different types of cells, but it is usually assumed that the inherent DNA-binding preferences of a TF are invariant to cell type. Yet, there are several known examples of TFs that switch their DNA-binding preferences in different cell types, and yet more examples of other mechanisms, such as steric hindrance or cooperative binding, that may result in a "DNA signature" of differential binding.

Results: To survey this phenomenon systematically, we developed a deep learning method we call SigTFB (Signatures of TF Binding) to detect and quantify cell-type specificity in a TF's known genomic binding sites. We used ENCODE ChIP-seq data to conduct a wide scale investigation of 169 distinct TFs in up to 14 distinct cell types. SigTFB detected statistically significant DNA binding signatures in approximately two-thirds of TFs, far more than might have been expected from the relatively sparse evidence in prior literature. We found that the presence or absence of a cell-type specific DNA binding signature is distinct from, and indeed largely uncorrelated to, the degree of overlap between ChIP-seq peaks in different cell types, and tended to arise by two mechanisms: using established motifs in different frequencies, and by selective inclusion of motifs for distint TFs.

Conclusions: While recent results have highlighted cell state features such as chromatin accessibility and gene expression in predicting TF binding, our results emphasize that, for some TFs, the DNA sequences of the binding sites contain substantial cell-type specific motifs.

背景:转录因子(TF)在不同类型的细胞中与基因组的不同部分结合,但人们通常认为TF固有的DNA结合偏好对细胞类型是不变的。然而,有几个已知的例子表明,TF 在不同细胞类型中会改变其 DNA 结合偏好,还有更多例子表明,其他机制(如立体阻碍或合作结合)可能会导致不同结合的 "DNA 签名":为了系统地研究这一现象,我们开发了一种深度学习方法,称为SigTFB(TF结合特征),用于检测和量化TF已知基因组结合位点的细胞类型特异性。我们使用 ENCODE ChIP-seq 数据对多达 14 种不同细胞类型中的 169 种不同 TF 进行了大规模调查。SigTFB 在大约三分之二的 TFs 中检测到了具有统计学意义的 DNA 结合特征,远远超出了之前文献中相对稀少的证据的预期。我们发现,细胞类型特异性 DNA 结合特征的存在或不存在与不同细胞类型中 ChIP-seq 峰之间的重叠程度不同,实际上也基本不相关,而且往往是通过两种机制产生的:以不同的频率使用既定的基调,以及选择性地包含不同 TF 的基调:尽管最近的研究结果强调了细胞状态特征,如染色质可及性和基因表达在预测TF结合中的作用,但我们的研究结果强调,对于某些TFs来说,结合位点的DNA序列包含大量细胞类型特异性基序。
{"title":"Identifying transcription factors with cell-type specific DNA binding signatures.","authors":"Aseel Awdeh, Marcel Turcotte, Theodore J Perkins","doi":"10.1186/s12864-024-10859-1","DOIUrl":"https://doi.org/10.1186/s12864-024-10859-1","url":null,"abstract":"<p><strong>Background: </strong>Transcription factors (TFs) bind to different parts of the genome in different types of cells, but it is usually assumed that the inherent DNA-binding preferences of a TF are invariant to cell type. Yet, there are several known examples of TFs that switch their DNA-binding preferences in different cell types, and yet more examples of other mechanisms, such as steric hindrance or cooperative binding, that may result in a \"DNA signature\" of differential binding.</p><p><strong>Results: </strong>To survey this phenomenon systematically, we developed a deep learning method we call SigTFB (Signatures of TF Binding) to detect and quantify cell-type specificity in a TF's known genomic binding sites. We used ENCODE ChIP-seq data to conduct a wide scale investigation of 169 distinct TFs in up to 14 distinct cell types. SigTFB detected statistically significant DNA binding signatures in approximately two-thirds of TFs, far more than might have been expected from the relatively sparse evidence in prior literature. We found that the presence or absence of a cell-type specific DNA binding signature is distinct from, and indeed largely uncorrelated to, the degree of overlap between ChIP-seq peaks in different cell types, and tended to arise by two mechanisms: using established motifs in different frequencies, and by selective inclusion of motifs for distint TFs.</p><p><strong>Conclusions: </strong>While recent results have highlighted cell state features such as chromatin accessibility and gene expression in predicting TF binding, our results emphasize that, for some TFs, the DNA sequences of the binding sites contain substantial cell-type specific motifs.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11472444/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142457482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genome-wide identification and analysis of anthocyanin synthesis-related R2R3-MYB genes in Fragaria pentaphylla. 在全基因组范围内鉴定和分析五倍子花青素合成相关的 R2R3-MYB 基因。
IF 3.5 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-10-13 DOI: 10.1186/s12864-024-10882-2
Liangmu Xie, Yinuo Wang, Yutian Tao, Luxi Chen, Hanyang Lin, Zhechen Qi, Junmin Li

Background: MYB transcription factors regulate anthocyanin biosynthesis across numerous plant species. However, comprehensive genome-wide investigations regarding the R2R3-MYB gene family and its involvement in regulating anthocyanin biosynthesis in the red and white fruit color morphs of Fragaria pentaphylla remain scarce.

Results: A total of 101 FpR2R3-MYB genes were identified from the F. pentaphylla genome and were divided into 34 subgroups based on phylogenetic analysis. Gene structure (exon/intron) and protein motifs were particularly conserved among the FpR2R3-MYB genes, especially members within the same subgroup. The FpR2R3-MYB genes were distributed over seven F. pentaphylla chromosomes. Analysis of gene duplication events revealed five pairs of tandem duplication genes and 16 pairs of segmental duplication genes, suggesting that segmental duplications are the major pattern for expansion of the FpR2R3-MYB gene family expansion in F. pentaphylla. Cis-regulatory elements of the FpR2R3-MYB promoters were involved in cellular development, phytohormones, environmental stress and photoresponse. Based on the analysis of the FpR2R3-MYB gene family and transcriptome sequencing (RNA-seq) data, FpMYB9 was identified as a key transcription factor involved in the regulation of anthocyanin synthesis in F. pentaphylla fruits. The expression of FpMYB9 increases significantly during the ripening stage of red fruits, as confirmed by reverse transcription quantitative real-time PCR. In addition, subcellular localization experiments further confirmed the nuclear presence of FpMYB9, supporting its role as a transcription factor involved in anthocyanin biosynthesis.

Conclusion: Our results showed that the FpR2R3-MYB genes are highly conserved and play important roles in the anthocyanin biosynthesis in F. pentaphylla fruits. Our results also provide a compelling basis for further understanding of the regulatory mechanism underlying the role of FpMYB9 in anthocyanin formation in F. pentaphylla fruits.

背景:MYB 转录因子调节许多植物物种的花青素生物合成。然而,有关 R2R3-MYB 基因家族及其参与调控五色番茄红白果实花青素生物合成的全基因组综合研究仍然很少:结果:从五味子基因组中共鉴定出101个FpR2R3-MYB基因,并根据系统发育分析将其分为34个亚群。基因结构(外显子/内含子)和蛋白质基序在 FpR2R3-MYB 基因中特别保守,尤其是同一亚群中的成员。FpR2R3-MYB 基因分布在 F. pentaphylla 的 7 条染色体上。对基因复制事件的分析表明,有5对串联复制基因和16对片段复制基因,这表明片段复制是五倍子蛙FpR2R3-MYB基因家族扩展的主要模式。FpR2R3-MYB启动子的顺式调节元件涉及细胞发育、植物激素、环境胁迫和光反应。根据对 FpR2R3-MYB 基因家族和转录组测序(RNA-seq)数据的分析,发现 FpMYB9 是参与调控五倍子果实花青素合成的关键转录因子。反转录定量实时聚合酶链式反应(reverse transcription quantitative real-time PCR)证实,FpMYB9 的表达在红果成熟期显著增加。此外,亚细胞定位实验进一步证实了 FpMYB9 在核内的存在,支持其作为转录因子参与花青素生物合成的作用:我们的研究结果表明,FpR2R3-MYB 基因高度保守,在五倍子果实的花青素生物合成过程中发挥着重要作用。我们的研究结果还为进一步了解 FpMYB9 在五倍子果实花青素形成过程中的调控机制提供了令人信服的依据。
{"title":"Genome-wide identification and analysis of anthocyanin synthesis-related R2R3-MYB genes in Fragaria pentaphylla.","authors":"Liangmu Xie, Yinuo Wang, Yutian Tao, Luxi Chen, Hanyang Lin, Zhechen Qi, Junmin Li","doi":"10.1186/s12864-024-10882-2","DOIUrl":"https://doi.org/10.1186/s12864-024-10882-2","url":null,"abstract":"<p><strong>Background: </strong>MYB transcription factors regulate anthocyanin biosynthesis across numerous plant species. However, comprehensive genome-wide investigations regarding the R2R3-MYB gene family and its involvement in regulating anthocyanin biosynthesis in the red and white fruit color morphs of Fragaria pentaphylla remain scarce.</p><p><strong>Results: </strong>A total of 101 FpR2R3-MYB genes were identified from the F. pentaphylla genome and were divided into 34 subgroups based on phylogenetic analysis. Gene structure (exon/intron) and protein motifs were particularly conserved among the FpR2R3-MYB genes, especially members within the same subgroup. The FpR2R3-MYB genes were distributed over seven F. pentaphylla chromosomes. Analysis of gene duplication events revealed five pairs of tandem duplication genes and 16 pairs of segmental duplication genes, suggesting that segmental duplications are the major pattern for expansion of the FpR2R3-MYB gene family expansion in F. pentaphylla. Cis-regulatory elements of the FpR2R3-MYB promoters were involved in cellular development, phytohormones, environmental stress and photoresponse. Based on the analysis of the FpR2R3-MYB gene family and transcriptome sequencing (RNA-seq) data, FpMYB9 was identified as a key transcription factor involved in the regulation of anthocyanin synthesis in F. pentaphylla fruits. The expression of FpMYB9 increases significantly during the ripening stage of red fruits, as confirmed by reverse transcription quantitative real-time PCR. In addition, subcellular localization experiments further confirmed the nuclear presence of FpMYB9, supporting its role as a transcription factor involved in anthocyanin biosynthesis.</p><p><strong>Conclusion: </strong>Our results showed that the FpR2R3-MYB genes are highly conserved and play important roles in the anthocyanin biosynthesis in F. pentaphylla fruits. Our results also provide a compelling basis for further understanding of the regulatory mechanism underlying the role of FpMYB9 in anthocyanin formation in F. pentaphylla fruits.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11472487/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142457476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
BMC Genomics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1