首页 > 最新文献

Briefings in Functional Genomics最新文献

英文 中文
A review of artificial intelligence-based brain age estimation and its applications for related diseases. 基于人工智能的脑年龄估计及其在相关疾病中的应用综述。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae042
Mohamed Azzam, Ziyang Xu, Ruobing Liu, Lie Li, Kah Meng Soh, Kishore B Challagundla, Shibiao Wan, Jieqiong Wang

The study of brain age has emerged over the past decade, aiming to estimate a person's age based on brain imaging scans. Ideally, predicted brain age should match chronological age in healthy individuals. However, brain structure and function change in the presence of brain-related diseases. Consequently, brain age also changes in affected individuals, making the brain age gap (BAG)-the difference between brain age and chronological age-a potential biomarker for brain health, early screening, and identifying age-related cognitive decline and disorders. With the recent successes of artificial intelligence in healthcare, it is essential to track the latest advancements and highlight promising directions. This review paper presents recent machine learning techniques used in brain age estimation (BAE) studies. Typically, BAE models involve developing a machine learning regression model to capture age-related variations in brain structure from imaging scans of healthy individuals and automatically predict brain age for new subjects. The process also involves estimating BAG as a measure of brain health. While we discuss recent clinical applications of BAE methods, we also review studies of biological age that can be integrated into BAE research. Finally, we point out the current limitations of BAE's studies.

脑年龄研究是在过去十年间兴起的,旨在根据脑成像扫描来估算一个人的年龄。理想情况下,预测的脑年龄应与健康人的实际年龄相符。然而,脑部结构和功能会因脑部相关疾病而发生变化。因此,受影响个体的脑年龄也会发生变化,这就使得脑年龄差距(BAG)--脑年龄与实际年龄之间的差值--成为大脑健康、早期筛查以及识别与年龄相关的认知衰退和失调的潜在生物标志物。最近,人工智能在医疗保健领域取得了巨大成功,因此有必要跟踪最新进展并强调有前景的发展方向。本综述论文介绍了最近用于脑年龄估计(BAE)研究的机器学习技术。通常,BAE 模型涉及开发一个机器学习回归模型,以便从健康人的成像扫描中捕捉大脑结构中与年龄相关的变化,并自动预测新受试者的脑年龄。这一过程还包括估算作为大脑健康度量的 BAG。在讨论 BAE 方法的最新临床应用的同时,我们还回顾了可纳入 BAE 研究的生物年龄研究。最后,我们指出了 BAE 研究目前存在的局限性。
{"title":"A review of artificial intelligence-based brain age estimation and its applications for related diseases.","authors":"Mohamed Azzam, Ziyang Xu, Ruobing Liu, Lie Li, Kah Meng Soh, Kishore B Challagundla, Shibiao Wan, Jieqiong Wang","doi":"10.1093/bfgp/elae042","DOIUrl":"10.1093/bfgp/elae042","url":null,"abstract":"<p><p>The study of brain age has emerged over the past decade, aiming to estimate a person's age based on brain imaging scans. Ideally, predicted brain age should match chronological age in healthy individuals. However, brain structure and function change in the presence of brain-related diseases. Consequently, brain age also changes in affected individuals, making the brain age gap (BAG)-the difference between brain age and chronological age-a potential biomarker for brain health, early screening, and identifying age-related cognitive decline and disorders. With the recent successes of artificial intelligence in healthcare, it is essential to track the latest advancements and highlight promising directions. This review paper presents recent machine learning techniques used in brain age estimation (BAE) studies. Typically, BAE models involve developing a machine learning regression model to capture age-related variations in brain structure from imaging scans of healthy individuals and automatically predict brain age for new subjects. The process also involves estimating BAG as a measure of brain health. While we discuss recent clinical applications of BAE methods, we also review studies of biological age that can be integrated into BAE research. Finally, we point out the current limitations of BAE's studies.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735757/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142481472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Crosstalk between genomic variants and DNA methylation in FLT3 mutant acute myeloid leukemia. FLT3突变型急性髓性白血病中基因组变异与DNA甲基化之间的相互关系
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae028
Bac Dao, Van Ngu Trinh, Huy V Nguyen, Hoa L Nguyen, Thuc Duy Le, Phuc Loi Luu

Acute myeloid leukemia (AML) is a type of blood cancer with diverse genetic variations and DNA methylation alterations. By studying the interaction of gene mutations, expression, and DNA methylation, we aimed to gain valuable insights into the processes that lead to block differentiation in AML. We analyzed TCGA-LAML data (173 samples) with RNA sequencing and DNA methylation arrays, comparing FLT3 mutant (48) and wild-type (125) cases. We conducted differential gene expression analysis using cBioPortal, identified DNA methylation differences with ChAMP tool, and correlated them with gene expression changes. Gene set enrichment analysis (g:Profiler) revealed significant biological processes and pathways. ShinyGo and GeneCards were used to find potential transcription factors and their binding sites among significant genes. We found significant differentially expressed genes (DEGs) negatively correlated with their most significant methylation probes (Pearson correlation coefficient of -0.49, P-value <0.001) between FLT3 mutant and wild-type groups. Moreover, our exploration of 450 k CpG sites uncovered a global hypo-methylated status in 168 DEGs. Notably, these methylation changes were enriched in the promoter regions of Homebox superfamily gene, which are crucial in transcriptional-regulating pathways in blood cancer. Furthermore, in FLT3 mutant AML patient samples, we observed overexpress of WT1, a transcription factor known to bind homeobox gene family. This finding suggests a potential mechanism by which WT1 recruits TET2 to demethylate specific genomic regions. Integrating gene expression and DNA methylation analyses shed light on the impact of FLT3 mutations on cancer cell development and differentiation, supporting a two-hit model in AML. This research advances understanding of AML and fosters targeted therapeutic strategy development.

急性髓性白血病(AML)是一种具有多种基因变异和DNA甲基化改变的血癌。通过研究基因突变、表达和DNA甲基化之间的相互作用,我们旨在获得有关导致急性髓细胞白血病分化受阻过程的宝贵见解。我们用RNA测序和DNA甲基化阵列分析了TCGA-LAML数据(173个样本),比较了FLT3突变型(48个)和野生型(125个)病例。我们使用 cBioPortal 进行了差异基因表达分析,使用 ChAMP 工具确定了 DNA 甲基化差异,并将其与基因表达变化相关联。基因组富集分析(g:Profiler)揭示了重要的生物过程和通路。我们使用 ShinyGo 和 GeneCards 寻找重要基因中的潜在转录因子及其结合位点。我们发现重要的差异表达基因(DEGs)与其最重要的甲基化探针呈负相关(Pearson 相关系数为 -0.49,P-value 为
{"title":"Crosstalk between genomic variants and DNA methylation in FLT3 mutant acute myeloid leukemia.","authors":"Bac Dao, Van Ngu Trinh, Huy V Nguyen, Hoa L Nguyen, Thuc Duy Le, Phuc Loi Luu","doi":"10.1093/bfgp/elae028","DOIUrl":"10.1093/bfgp/elae028","url":null,"abstract":"<p><p>Acute myeloid leukemia (AML) is a type of blood cancer with diverse genetic variations and DNA methylation alterations. By studying the interaction of gene mutations, expression, and DNA methylation, we aimed to gain valuable insights into the processes that lead to block differentiation in AML. We analyzed TCGA-LAML data (173 samples) with RNA sequencing and DNA methylation arrays, comparing FLT3 mutant (48) and wild-type (125) cases. We conducted differential gene expression analysis using cBioPortal, identified DNA methylation differences with ChAMP tool, and correlated them with gene expression changes. Gene set enrichment analysis (g:Profiler) revealed significant biological processes and pathways. ShinyGo and GeneCards were used to find potential transcription factors and their binding sites among significant genes. We found significant differentially expressed genes (DEGs) negatively correlated with their most significant methylation probes (Pearson correlation coefficient of -0.49, P-value <0.001) between FLT3 mutant and wild-type groups. Moreover, our exploration of 450 k CpG sites uncovered a global hypo-methylated status in 168 DEGs. Notably, these methylation changes were enriched in the promoter regions of Homebox superfamily gene, which are crucial in transcriptional-regulating pathways in blood cancer. Furthermore, in FLT3 mutant AML patient samples, we observed overexpress of WT1, a transcription factor known to bind homeobox gene family. This finding suggests a potential mechanism by which WT1 recruits TET2 to demethylate specific genomic regions. Integrating gene expression and DNA methylation analyses shed light on the impact of FLT3 mutations on cancer cell development and differentiation, supporting a two-hit model in AML. This research advances understanding of AML and fosters targeted therapeutic strategy development.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735749/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141472885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Functional genomics in the era of cancer immunotherapy: challenges and clinical implications. 癌症免疫治疗时代的功能基因组学:挑战和临床意义。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae053
{"title":"Correction to: Functional genomics in the era of cancer immunotherapy: challenges and clinical implications.","authors":"","doi":"10.1093/bfgp/elae053","DOIUrl":"https://doi.org/10.1093/bfgp/elae053","url":null,"abstract":"","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11742188/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143016869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the impact of N4-acetylcytidine modification in RNA on non-neoplastic disease: unveiling its role in pathogenesis and therapeutic opportunities. 探索 RNA 中 N4-乙酰胞嘧啶修饰对非肿瘤性疾病的影响:揭示其在发病机制中的作用和治疗机会。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae020
Keyu Wan, Tiantian Nie, Wenhao Ouyang, Yunjing Xiong, Jing Bian, Ying Huang, Li Ling, Zhenjun Huang, Xianhua Zhu

RNA modifications include not only methylation modifications, such as m6A, but also acetylation modifications, which constitute a complex interaction involving "writers," "readers," and "erasers" that play crucial roles in growth, genetics, and disease. N4-acetylcytidine (ac4C) is an ancient and highly conserved RNA modification that plays a profound role in the pathogenesis of a wide range of diseases. This review provides insights into the functional impact of ac4C modifications in disease and introduces new perspectives for disease treatment. These studies provide important insights into the biological functions of post-transcriptional RNA modifications and their potential roles in disease mechanisms, offering new perspectives and strategies for disease treatment.

RNA 修饰不仅包括甲基化修饰(如 m6A),还包括乙酰化修饰,它们构成了一种复杂的相互作用,涉及 "写者"、"读者 "和 "擦除者",在生长、遗传和疾病中发挥着至关重要的作用。N4-乙酰胞苷(ac4C)是一种古老而高度保守的 RNA 修饰,在多种疾病的发病机制中发挥着深远的作用。本综述深入探讨了 ac4C 修饰在疾病中的功能性影响,并为疾病治疗提供了新的视角。这些研究为了解转录后 RNA 修饰的生物学功能及其在疾病机制中的潜在作用提供了重要见解,为疾病治疗提供了新的视角和策略。
{"title":"Exploring the impact of N4-acetylcytidine modification in RNA on non-neoplastic disease: unveiling its role in pathogenesis and therapeutic opportunities.","authors":"Keyu Wan, Tiantian Nie, Wenhao Ouyang, Yunjing Xiong, Jing Bian, Ying Huang, Li Ling, Zhenjun Huang, Xianhua Zhu","doi":"10.1093/bfgp/elae020","DOIUrl":"10.1093/bfgp/elae020","url":null,"abstract":"<p><p>RNA modifications include not only methylation modifications, such as m6A, but also acetylation modifications, which constitute a complex interaction involving \"writers,\" \"readers,\" and \"erasers\" that play crucial roles in growth, genetics, and disease. N4-acetylcytidine (ac4C) is an ancient and highly conserved RNA modification that plays a profound role in the pathogenesis of a wide range of diseases. This review provides insights into the functional impact of ac4C modifications in disease and introduces new perspectives for disease treatment. These studies provide important insights into the biological functions of post-transcriptional RNA modifications and their potential roles in disease mechanisms, offering new perspectives and strategies for disease treatment.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735739/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141263641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A lossless reference-free sequence compression algorithm leveraging grammatical, statistical, and substitution rules. 利用语法、统计和替换规则的无损无引用序列压缩算法。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae050
Subhankar Roy, Dilip Kumar Maity, Anirban Mukhopadhyay

Deoxyribonucleic acid (DNA) or ribonucleic acid (RNA) sequence compressors for novel species frequently face challenges when processing wide-scale raw, FASTA, or multi-FASTA structured data. For years, molecular sequence databases have favored the widely used general-purpose Gzip and Zstd compressors. The absence of sequence-specific characteristics in these encoders results in subpar performance, and their use depends on time-consuming parameter adjustments. To address these limitations, in this article, we propose a reference-free, lossless sequence compressor called GraSS (Grammatical, Statistical, and Substitution Rule-Based). GraSS compresses sequences more effectively by taking advantage of certain characteristics seen in DNA and RNA sequences. It supports various formats, including raw, FASTA, and multi-FASTA, commonly found in GenBank DNA and RNA files. We evaluate GraSS's performance using ten benchmark DNA sequences with reduced number of repeats, two highly repetitive RNA sequences, and fifteen raw DNA sequences. Test results indicate that the weighted average compression ratios (WACR) for DNA and RNA sequences are 4.5 and 19.6, respectively. Additionally, the entire DNA sequence corpus has a total compression time (TCT) of 246.8 seconds (s). These results demonstrate that the proposed compression method performs better than several advanced algorithms specifically designed to handle various levels of sequence redundancy. The decompression times, memory usage, and CPU usage are also very competitive. Contact:  anirban@klyuniv.ac.in.

用于新物种的脱氧核糖核酸(DNA)或核糖核酸(RNA)序列压缩器在处理大规模的原始、FASTA或多FASTA结构化数据时经常面临挑战。多年来,分子序列数据库一直青睐于广泛使用的通用Gzip和Zstd压缩器。在这些编码器中缺乏序列特定的特性导致性能低于标准,并且它们的使用依赖于耗时的参数调整。为了解决这些限制,在本文中,我们提出了一个无引用的无损序列压缩器,称为GraSS(基于语法、统计和替换规则)。GraSS通过利用DNA和RNA序列中的某些特征更有效地压缩序列。它支持各种格式,包括原始,FASTA和多FASTA,常见于GenBank DNA和RNA文件。我们使用10个重复次数减少的基准DNA序列、两个高度重复的RNA序列和15个原始DNA序列来评估GraSS的性能。测试结果表明,DNA和RNA序列的加权平均压缩比(WACR)分别为4.5和19.6。此外,整个DNA序列语料库的总压缩时间(TCT)为246.8秒(s)。这些结果表明,所提出的压缩方法比专门设计用于处理不同级别序列冗余的几种高级算法性能更好。解压时间、内存使用和CPU使用也非常有竞争力。联系:anirban@klyuniv.ac.in。
{"title":"A lossless reference-free sequence compression algorithm leveraging grammatical, statistical, and substitution rules.","authors":"Subhankar Roy, Dilip Kumar Maity, Anirban Mukhopadhyay","doi":"10.1093/bfgp/elae050","DOIUrl":"10.1093/bfgp/elae050","url":null,"abstract":"<p><p>Deoxyribonucleic acid (DNA) or ribonucleic acid (RNA) sequence compressors for novel species frequently face challenges when processing wide-scale raw, FASTA, or multi-FASTA structured data. For years, molecular sequence databases have favored the widely used general-purpose Gzip and Zstd compressors. The absence of sequence-specific characteristics in these encoders results in subpar performance, and their use depends on time-consuming parameter adjustments. To address these limitations, in this article, we propose a reference-free, lossless sequence compressor called GraSS (Grammatical, Statistical, and Substitution Rule-Based). GraSS compresses sequences more effectively by taking advantage of certain characteristics seen in DNA and RNA sequences. It supports various formats, including raw, FASTA, and multi-FASTA, commonly found in GenBank DNA and RNA files. We evaluate GraSS's performance using ten benchmark DNA sequences with reduced number of repeats, two highly repetitive RNA sequences, and fifteen raw DNA sequences. Test results indicate that the weighted average compression ratios (WACR) for DNA and RNA sequences are 4.5 and 19.6, respectively. Additionally, the entire DNA sequence corpus has a total compression time (TCT) of 246.8 seconds (s). These results demonstrate that the proposed compression method performs better than several advanced algorithms specifically designed to handle various levels of sequence redundancy. The decompression times, memory usage, and CPU usage are also very competitive. Contact:  anirban@klyuniv.ac.in.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735755/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142959201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DeepMEns: an ensemble model for predicting sgRNA on-target activity based on multiple features. DeepMEns:基于多种特征预测 sgRNA 靶向活性的集合模型。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae043
Shumei Ding, Jia Zheng, Cangzhi Jia

The CRISPR/Cas9 system developed from Streptococcus pyogenes (SpCas9) has high potential in gene editing. However, its successful application is hindered by the considerable variability in target efficiencies across different single guide RNAs (sgRNAs). Although several deep learning models have been created to predict sgRNA on-target activity, the intrinsic mechanisms of these models are difficult to explain, and there is still scope for improvement in prediction performance. To overcome these issues, we propose an ensemble interpretable model termed DeepMEns based on deep learning to predict sgRNA on-target activity. By using five different training and validation datasets, we constructed five sub-regressors, each comprising three parts. The first part uses one-hot encoding, wherein 0-1 representation of the secondary structure is used as the input to the convolutional neural network (CNN) with Transformer encoder. The second part uses the DNA shape feature matrix as the input to the CNN with Transformer encoder. The third part uses positional encoding feature matrices as the proposed input into a long short-term memory network with an attention mechanism. These three parts are concatenated through the flattened layer, and the final prediction result is the average of the five sub-regressors. Extensive benchmarking experiments indicated that DeepMEns achieved the highest Spearman correlation coefficient for 6 of 10 independent test datasets as compared to previous predictors, this finding confirmed that DeepMEns can accomplish state-of-the-art performance. Moreover, the ablation analysis also indicated that the ensemble strategy may improve the performance of the prediction model.

从化脓性链球菌(SpCas9)中开发的 CRISPR/Cas9 系统在基因编辑方面具有很大的潜力。然而,不同的单导RNA(sgRNA)在靶标效率上存在很大差异,这阻碍了它的成功应用。虽然已经创建了几个深度学习模型来预测 sgRNA 的靶上活性,但这些模型的内在机制难以解释,预测性能仍有改进的余地。为了克服这些问题,我们提出了一种基于深度学习的集合可解释模型,称为 DeepMEns,用于预测 sgRNA 靶向活性。通过使用五个不同的训练和验证数据集,我们构建了五个子回归器,每个子回归器由三部分组成。第一部分使用单次编码,其中二级结构的 0-1 表示被用作带有 Transformer 编码器的卷积神经网络(CNN)的输入。第二部分使用 DNA 形状特征矩阵作为带变换器编码器的卷积神经网络的输入。第三部分使用位置编码特征矩阵作为具有注意力机制的长短期记忆网络的拟议输入。这三个部分通过扁平化层进行串联,最终预测结果是五个子回归器的平均值。广泛的基准测试实验表明,在 10 个独立测试数据集中,DeepMEns 有 6 个数据集的斯皮尔曼相关系数与之前的预测器相比最高,这一结果证实了 DeepMEns 可以达到最先进的性能。此外,消融分析还表明,集合策略可以提高预测模型的性能。
{"title":"DeepMEns: an ensemble model for predicting sgRNA on-target activity based on multiple features.","authors":"Shumei Ding, Jia Zheng, Cangzhi Jia","doi":"10.1093/bfgp/elae043","DOIUrl":"10.1093/bfgp/elae043","url":null,"abstract":"<p><p>The CRISPR/Cas9 system developed from Streptococcus pyogenes (SpCas9) has high potential in gene editing. However, its successful application is hindered by the considerable variability in target efficiencies across different single guide RNAs (sgRNAs). Although several deep learning models have been created to predict sgRNA on-target activity, the intrinsic mechanisms of these models are difficult to explain, and there is still scope for improvement in prediction performance. To overcome these issues, we propose an ensemble interpretable model termed DeepMEns based on deep learning to predict sgRNA on-target activity. By using five different training and validation datasets, we constructed five sub-regressors, each comprising three parts. The first part uses one-hot encoding, wherein 0-1 representation of the secondary structure is used as the input to the convolutional neural network (CNN) with Transformer encoder. The second part uses the DNA shape feature matrix as the input to the CNN with Transformer encoder. The third part uses positional encoding feature matrices as the proposed input into a long short-term memory network with an attention mechanism. These three parts are concatenated through the flattened layer, and the final prediction result is the average of the five sub-regressors. Extensive benchmarking experiments indicated that DeepMEns achieved the highest Spearman correlation coefficient for 6 of 10 independent test datasets as compared to previous predictors, this finding confirmed that DeepMEns can accomplish state-of-the-art performance. Moreover, the ablation analysis also indicated that the ensemble strategy may improve the performance of the prediction model.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735754/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142630918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive review of approaches for spatial domain recognition of spatial transcriptomes. 空间转录组的空间域识别方法综述。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/bfgp/elae040
Ziyi Wang, Aoyun Geng, Hao Duan, Feifei Cui, Quan Zou, Zilong Zhang

In current bioinformatics research, spatial transcriptomics (ST) as a rapidly evolving technology is gradually receiving widespread attention from researchers. Spatial domains are regions where gene expression and histology are consistent in space, and detecting spatial domains can better understand the organization and functional distribution of tissues. Spatial domain recognition is a fundamental step in the process of ST data interpretation, which is also a major challenge in ST analysis. Therefore, developing more accurate, efficient, and general spatial domain recognition methods has become an important and urgent research direction. This article aims to review the current status and progress of spatial domain recognition research, explore the advantages and limitations of existing methods, and provide suggestions and directions for future tool development.

在当前的生物信息学研究中,空间转录组学(ST)作为一种快速发展的技术正逐渐受到研究人员的广泛关注。空间域是基因表达和组织学在空间上一致的区域,检测空间域可以更好地了解组织的组织和功能分布。空间域识别是 ST 数据解读过程中的基础步骤,也是 ST 分析中的一大挑战。因此,开发更准确、高效、通用的空间域识别方法已成为一个重要而紧迫的研究方向。本文旨在回顾空间域识别研究的现状和进展,探讨现有方法的优势和局限,并为未来工具的开发提供建议和方向。
{"title":"A comprehensive review of approaches for spatial domain recognition of spatial transcriptomes.","authors":"Ziyi Wang, Aoyun Geng, Hao Duan, Feifei Cui, Quan Zou, Zilong Zhang","doi":"10.1093/bfgp/elae040","DOIUrl":"10.1093/bfgp/elae040","url":null,"abstract":"<p><p>In current bioinformatics research, spatial transcriptomics (ST) as a rapidly evolving technology is gradually receiving widespread attention from researchers. Spatial domains are regions where gene expression and histology are consistent in space, and detecting spatial domains can better understand the organization and functional distribution of tissues. Spatial domain recognition is a fundamental step in the process of ST data interpretation, which is also a major challenge in ST analysis. Therefore, developing more accurate, efficient, and general spatial domain recognition methods has become an important and urgent research direction. This article aims to review the current status and progress of spatial domain recognition research, explore the advantages and limitations of existing methods, and provide suggestions and directions for future tool development.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"702-712"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142481471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AMLdb: a comprehensive multi-omics platform to identify biomarkers and drug targets for acute myeloid leukemia. AMLdb:鉴定急性髓性白血病生物标志物和药物靶点的综合性多组学平台。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/bfgp/elae024
Keerthana Vinod Kumar, Ambuj Kumar, Kavita Kundal, Avik Sengupta, Kunjulakshmi R, Subashani Singh, Bhanu Teja Korra, Simran Sharma, Vandana Suresh, Mayilaadumveettil Nishana, Rahul Kumar

Acute myeloid leukemia (AML) is one of the leading leukemic malignancies in adults. The heterogeneity of the disease makes the diagnosis and treatment extremely difficult. With the advent of next-generation sequencing (NGS) technologies, exploration at the molecular level for the identification of biomarkers and drug targets has been the focus for the researchers to come up with novel therapies for better prognosis and survival outcomes of AML patients. However, the huge amount of data from NGS platforms requires a comprehensive AML platform to streamline literature mining efforts and save time. To facilitate this, we developed AMLdb, an interactive multi-omics platform that allows users to query, visualize, retrieve, and analyse AML related multi-omics data. AMLdb contains 86 datasets for gene expression profiles, 15 datasets for methylation profiles, CRISPR-Cas9 knockout screens of 26 AML cell lines, sensitivity of 26 AML cell lines to 288 drugs, mutations in 41 unique genes in 23 AML cell lines, and information on 41 experimentally validated biomarkers. In this study, we have reported five genes, i.e. CBFB, ENO1, IMPDH2, SEPHS2, and MYH9 identified via our analysis using AMLdb. ENO1 is uniquely identified gene which requires further investigation as a novel potential target while other reported genes have been previously confirmed as targets through experimental studies. Top of form we believe that these findings utilizing AMLdb can make it an invaluable resource to accelerate the development of effective therapies for AML and assisting the research community in advancing their understanding of AML pathogenesis. AMLdb is freely available at https://project.iith.ac.in/cgntlab/amldb.

急性髓性白血病(AML)是成人主要的白血病恶性肿瘤之一。这种疾病的异质性给诊断和治疗带来了极大的困难。随着下一代测序(NGS)技术的出现,在分子水平上探索生物标志物和药物靶点已成为研究人员的工作重点,以便提出新的疗法,改善急性髓细胞白血病患者的预后和生存状况。然而,来自 NGS 平台的海量数据需要一个全面的 AML 平台来简化文献挖掘工作并节省时间。为此,我们开发了一个交互式多组学平台 AMLdb,允许用户查询、可视化、检索和分析 AML 相关的多组学数据。AMLdb 包含 86 个基因表达谱数据集、15 个甲基化谱数据集、26 个 AML 细胞系的 CRISPR-Cas9 基因敲除筛选、26 个 AML 细胞系对 288 种药物的敏感性、23 个 AML 细胞系中 41 个独特基因的突变以及 41 个实验验证生物标志物的信息。在本研究中,我们报告了通过 AMLdb 分析发现的五个基因,即 CBFB、ENO1、IMPDH2、SEPHS2 和 MYH9。ENO1是唯一被发现的基因,作为一个新的潜在靶点还需要进一步研究,而其他报告的基因之前已通过实验研究证实为靶点。最重要的是,我们相信利用 AMLdb 的这些发现可以使其成为加快开发急性髓细胞性白血病有效疗法的宝贵资源,并帮助研究界加深对急性髓细胞性白血病发病机制的了解。AMLdb 可在 https://project.iith.ac.in/cgntlab/amldb 免费获取。
{"title":"AMLdb: a comprehensive multi-omics platform to identify biomarkers and drug targets for acute myeloid leukemia.","authors":"Keerthana Vinod Kumar, Ambuj Kumar, Kavita Kundal, Avik Sengupta, Kunjulakshmi R, Subashani Singh, Bhanu Teja Korra, Simran Sharma, Vandana Suresh, Mayilaadumveettil Nishana, Rahul Kumar","doi":"10.1093/bfgp/elae024","DOIUrl":"10.1093/bfgp/elae024","url":null,"abstract":"<p><p>Acute myeloid leukemia (AML) is one of the leading leukemic malignancies in adults. The heterogeneity of the disease makes the diagnosis and treatment extremely difficult. With the advent of next-generation sequencing (NGS) technologies, exploration at the molecular level for the identification of biomarkers and drug targets has been the focus for the researchers to come up with novel therapies for better prognosis and survival outcomes of AML patients. However, the huge amount of data from NGS platforms requires a comprehensive AML platform to streamline literature mining efforts and save time. To facilitate this, we developed AMLdb, an interactive multi-omics platform that allows users to query, visualize, retrieve, and analyse AML related multi-omics data. AMLdb contains 86 datasets for gene expression profiles, 15 datasets for methylation profiles, CRISPR-Cas9 knockout screens of 26 AML cell lines, sensitivity of 26 AML cell lines to 288 drugs, mutations in 41 unique genes in 23 AML cell lines, and information on 41 experimentally validated biomarkers. In this study, we have reported five genes, i.e. CBFB, ENO1, IMPDH2, SEPHS2, and MYH9 identified via our analysis using AMLdb. ENO1 is uniquely identified gene which requires further investigation as a novel potential target while other reported genes have been previously confirmed as targets through experimental studies. Top of form we believe that these findings utilizing AMLdb can make it an invaluable resource to accelerate the development of effective therapies for AML and assisting the research community in advancing their understanding of AML pathogenesis. AMLdb is freely available at https://project.iith.ac.in/cgntlab/amldb.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"798-805"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141307484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advances in integrating single-cell sequencing data to unravel the mechanism of ferroptosis in cancer. 整合单细胞测序数据以揭示癌症中铁凋亡机制的进展。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/bfgp/elae025
Zhaolan Du, Yi Shi, Jianjun Tan

Ferroptosis, a commonly observed type of programmed cell death caused by abnormal metabolic and biochemical mechanisms, is frequently triggered by cellular stress. The occurrence of ferroptosis is predominantly linked to pathophysiological conditions due to the substantial impact of various metabolic pathways, including fatty acid metabolism and iron regulation, on cellular reactions to lipid peroxidation and ferroptosis. This mode of cell death serves as a fundamental factor in the development of numerous diseases, thereby presenting a range of therapeutic targets. Single-cell sequencing technology provides insights into the cellular and molecular characteristics of individual cells, as opposed to bulk sequencing, which provides data in a more generalized manner. Single-cell sequencing has found extensive application in the field of cancer research. This paper reviews the progress made in ferroptosis-associated cancer research using single-cell sequencing, including ferroptosis-associated pathways, immune checkpoints, biomarkers, and the identification of cell clusters associated with ferroptosis in tumors. In general, the utilization of single-cell sequencing technology has the potential to contribute significantly to the investigation of the mechanistic regulatory pathways linked to ferroptosis. Moreover, it can shed light on the intricate connection between ferroptosis and cancer. This technology holds great promise in advancing tumor-wide diagnosis, targeted therapy, and prognosis prediction.

铁中毒是一种常见的由异常代谢和生化机制引起的程序性细胞死亡,经常由细胞压力引发。由于各种代谢途径(包括脂肪酸代谢和铁调节)对脂质过氧化和铁中毒的细胞反应有重大影响,铁中毒的发生主要与病理生理条件有关。这种细胞死亡模式是多种疾病发生发展的基本因素,从而提供了一系列治疗靶点。单细胞测序技术能深入了解单个细胞的细胞和分子特征,而批量测序技术则能以更概括的方式提供数据。单细胞测序技术已在癌症研究领域得到广泛应用。本文回顾了利用单细胞测序技术在铁突变相关癌症研究中取得的进展,包括铁突变相关通路、免疫检查点、生物标记物以及肿瘤中铁突变相关细胞群的鉴定。总的来说,利用单细胞测序技术有可能大大有助于研究与铁凋亡相关的机理调控途径。此外,它还能揭示铁突变与癌症之间错综复杂的联系。这项技术在推进肿瘤诊断、靶向治疗和预后预测方面大有可为。
{"title":"Advances in integrating single-cell sequencing data to unravel the mechanism of ferroptosis in cancer.","authors":"Zhaolan Du, Yi Shi, Jianjun Tan","doi":"10.1093/bfgp/elae025","DOIUrl":"10.1093/bfgp/elae025","url":null,"abstract":"<p><p>Ferroptosis, a commonly observed type of programmed cell death caused by abnormal metabolic and biochemical mechanisms, is frequently triggered by cellular stress. The occurrence of ferroptosis is predominantly linked to pathophysiological conditions due to the substantial impact of various metabolic pathways, including fatty acid metabolism and iron regulation, on cellular reactions to lipid peroxidation and ferroptosis. This mode of cell death serves as a fundamental factor in the development of numerous diseases, thereby presenting a range of therapeutic targets. Single-cell sequencing technology provides insights into the cellular and molecular characteristics of individual cells, as opposed to bulk sequencing, which provides data in a more generalized manner. Single-cell sequencing has found extensive application in the field of cancer research. This paper reviews the progress made in ferroptosis-associated cancer research using single-cell sequencing, including ferroptosis-associated pathways, immune checkpoints, biomarkers, and the identification of cell clusters associated with ferroptosis in tumors. In general, the utilization of single-cell sequencing technology has the potential to contribute significantly to the investigation of the mechanistic regulatory pathways linked to ferroptosis. Moreover, it can shed light on the intricate connection between ferroptosis and cancer. This technology holds great promise in advancing tumor-wide diagnosis, targeted therapy, and prognosis prediction.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"713-725"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141319002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing novel isoform discovery: leveraging nanopore long-read sequencing and machine learning approaches. 加强新型同工酶的发现:利用纳米孔长读数测序和机器学习方法。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/bfgp/elae031
Kristina Santucci, Yuning Cheng, Si-Mei Xu, Michael Janitz

Long-read sequencing technologies can capture entire RNA transcripts in a single sequencing read, reducing the ambiguity in constructing and quantifying transcript models in comparison to more common and earlier methods, such as short-read sequencing. Recent improvements in the accuracy of long-read sequencing technologies have expanded the scope for novel splice isoform detection and have also enabled a far more accurate reconstruction of complex splicing patterns and transcriptomes. Additionally, the incorporation and advancements of machine learning and deep learning algorithms in bioinformatic software have significantly improved the reliability of long-read sequencing transcriptomic studies. However, there is a lack of consensus on what bioinformatic tools and pipelines produce the most precise and consistent results. Thus, this review aims to discuss and compare the performance of available methods for novel isoform discovery with long-read sequencing technologies, with 25 tools being presented. Furthermore, this review intends to demonstrate the need for developing standard analytical pipelines, tools, and transcript model conventions for novel isoform discovery and transcriptomic studies.

与短线程测序等更常见和更早的方法相比,长线程测序技术可以在单个测序读数中捕获整个 RNA 转录本,从而减少了构建和量化转录本模型的模糊性。最近,长读程测序技术的准确性有所提高,扩大了新型剪接异构体的检测范围,也能更准确地重建复杂的剪接模式和转录组。此外,机器学习和深度学习算法在生物信息学软件中的应用和进步也大大提高了长片段测序转录组研究的可靠性。然而,对于什么样的生物信息学工具和管道能产生最精确、最一致的结果还缺乏共识。因此,本综述旨在讨论和比较利用长线程测序技术发现新型同工酶的现有方法的性能,共介绍了 25 种工具。此外,本综述还旨在说明有必要为新型同工酶发现和转录组研究开发标准的分析管道、工具和转录本模型惯例。
{"title":"Enhancing novel isoform discovery: leveraging nanopore long-read sequencing and machine learning approaches.","authors":"Kristina Santucci, Yuning Cheng, Si-Mei Xu, Michael Janitz","doi":"10.1093/bfgp/elae031","DOIUrl":"10.1093/bfgp/elae031","url":null,"abstract":"<p><p>Long-read sequencing technologies can capture entire RNA transcripts in a single sequencing read, reducing the ambiguity in constructing and quantifying transcript models in comparison to more common and earlier methods, such as short-read sequencing. Recent improvements in the accuracy of long-read sequencing technologies have expanded the scope for novel splice isoform detection and have also enabled a far more accurate reconstruction of complex splicing patterns and transcriptomes. Additionally, the incorporation and advancements of machine learning and deep learning algorithms in bioinformatic software have significantly improved the reliability of long-read sequencing transcriptomic studies. However, there is a lack of consensus on what bioinformatic tools and pipelines produce the most precise and consistent results. Thus, this review aims to discuss and compare the performance of available methods for novel isoform discovery with long-read sequencing technologies, with 25 tools being presented. Furthermore, this review intends to demonstrate the need for developing standard analytical pipelines, tools, and transcript model conventions for novel isoform discovery and transcriptomic studies.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"683-694"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142001414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Briefings in Functional Genomics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1