首页 > 最新文献

Briefings in Functional Genomics最新文献

英文 中文
pyRforest: a comprehensive R package for genomic data analysis featuring scikit-learn Random Forests in R. pyRforest:用于基因组数据分析的综合性 R 软件包,采用 R 中的 scikit-learn 随机森林技术。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae038
Tyler Kolisnik, Faeze Keshavarz-Rahaghi, Rachel V Purcell, Adam N H Smith, Olin K Silander

Random Forest models are widely used in genomic data analysis and can offer insights into complex biological mechanisms, particularly when features influence the target in interactive, nonlinear, or nonadditive ways. Currently, some of the most efficient Random Forest methods in terms of computational speed are implemented in Python. However, many biologists use R for genomic data analysis, as R offers a unified platform for performing additional statistical analysis and visualization. Here, we present an R package, pyRforest, which integrates Python scikit-learn "RandomForestClassifier" algorithms into the R environment. pyRforest inherits the efficient memory management and parallelization of Python, and is optimized for classification tasks on large genomic datasets, such as those from RNA-seq. pyRforest offers several additional capabilities, including a novel rank-based permutation method for biomarker identification. This method can be used to estimate and visualize P-values for individual features, allowing the researcher to identify a subset of features for which there is robust statistical evidence of an effect. In addition, pyRforest includes methods for the calculation and visualization of SHapley Additive exPlanations values. Finally, pyRforest includes support for comprehensive downstream analysis for gene ontology and pathway enrichment. pyRforest thus improves the implementation and interpretability of Random Forest models for genomic data analysis by merging the strengths of Python with R. pyRforest can be downloaded at: https://www.github.com/tkolisnik/pyRforest with an associated vignette at https://github.com/tkolisnik/pyRforest/blob/main/vignettes/pyRforest-vignette.pdf.

随机森林模型被广泛应用于基因组数据分析,并能深入揭示复杂的生物机制,尤其是当特征以交互、非线性或非相加的方式影响目标时。目前,一些计算速度最快的随机森林方法是用 Python 实现的。然而,许多生物学家使用 R 进行基因组数据分析,因为 R 提供了一个统一的平台来执行额外的统计分析和可视化。pyRforest 继承了 Python 的高效内存管理和并行化功能,并针对大型基因组数据集(如 RNA-seq 数据集)上的分类任务进行了优化。这种方法可用于估算和直观显示单个特征的 P 值,使研究人员能够识别出有可靠统计证据表明存在效应的特征子集。此外,pyRforest 还包括 SHapley Additive exPlanations 值的计算和可视化方法。pyRforest 结合了 Python 和 R 的优势,从而改进了用于基因组数据分析的随机森林模型的实现和可解释性。pyRforest 的下载地址为:https://www.github.com/tkolisnik/pyRforest,相关的 vignette 下载地址为:https://github.com/tkolisnik/pyRforest/blob/main/vignettes/pyRforest-vignette.pdf。
{"title":"pyRforest: a comprehensive R package for genomic data analysis featuring scikit-learn Random Forests in R.","authors":"Tyler Kolisnik, Faeze Keshavarz-Rahaghi, Rachel V Purcell, Adam N H Smith, Olin K Silander","doi":"10.1093/bfgp/elae038","DOIUrl":"10.1093/bfgp/elae038","url":null,"abstract":"<p><p>Random Forest models are widely used in genomic data analysis and can offer insights into complex biological mechanisms, particularly when features influence the target in interactive, nonlinear, or nonadditive ways. Currently, some of the most efficient Random Forest methods in terms of computational speed are implemented in Python. However, many biologists use R for genomic data analysis, as R offers a unified platform for performing additional statistical analysis and visualization. Here, we present an R package, pyRforest, which integrates Python scikit-learn \"RandomForestClassifier\" algorithms into the R environment. pyRforest inherits the efficient memory management and parallelization of Python, and is optimized for classification tasks on large genomic datasets, such as those from RNA-seq. pyRforest offers several additional capabilities, including a novel rank-based permutation method for biomarker identification. This method can be used to estimate and visualize P-values for individual features, allowing the researcher to identify a subset of features for which there is robust statistical evidence of an effect. In addition, pyRforest includes methods for the calculation and visualization of SHapley Additive exPlanations values. Finally, pyRforest includes support for comprehensive downstream analysis for gene ontology and pathway enrichment. pyRforest thus improves the implementation and interpretability of Random Forest models for genomic data analysis by merging the strengths of Python with R. pyRforest can be downloaded at: https://www.github.com/tkolisnik/pyRforest with an associated vignette at https://github.com/tkolisnik/pyRforest/blob/main/vignettes/pyRforest-vignette.pdf.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735746/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142382536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computational inference of Rhizobium phaseoli transcriptional regulatory network predicts Transcription Factors involved in nodulation. 相根瘤菌转录调控网络预测结瘤相关转录因子的计算推断。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elaf020
Ericka M Hernandez-Benitez, Esperanza Martínez-Romero, José Luis Aguirre-Noyola, Jose Arcadio Farias-Rico, Daniela Ledezma-Tejeida

Growth of the common bean plant Phaseolus vulgaris is tightly linked to its symbiotic relationship with diverse rhizobial species, particularly Rhizobium phaseoli, an alphaproteobacterium that forms root nodules and provides high levels of nitrogen to the plant. Molecular cross-talk is known to happen through plant-derived metabolites, but only flavonoids have been identified as nodulation signals, which act through the activation of the NodD Transcription Factor (TF). The identification of signals that mediate nodulation via TFs can aid in the rational design of biofertilizers that promote plant-bacteria symbiosis. Here, we identified 57 TFs in the R. phaseoli genome through sequence conservation from Escherichia coli, and predicted a transcriptional regulatory network comprising 16 TFs, and 1,371 target genes. We identified the regulatory interactions relevant to nodulation via transcriptome analysis, and hypothesize that PuuR is a TF involved in nodulation, potentially acting via its known binding metabolite putrescine. Sequence and structural evidence predict a model where putrescine acts as a signaling metabolite in nodulation via the TF PuuR, and the regulation of the nodI gene.

普通豆类植物Phaseolus vulgaris的生长与其与多种根瘤菌的共生关系密切相关,特别是相根瘤菌,一种形成根瘤并为植物提供高水平氮的甲变形菌。分子间的相互作用是通过植物衍生的代谢物发生的,但只有黄酮类化合物被确定为结瘤信号,它通过NodD转录因子(TF)的激活起作用。识别通过TFs介导结瘤的信号有助于合理设计促进植物-细菌共生的生物肥料。本研究通过大肠杆菌的序列保守鉴定了相大肠杆菌基因组中的57个tf,并预测了一个由16个tf和1371个靶基因组成的转录调控网络。我们通过转录组分析确定了与结瘤相关的调节相互作用,并假设PuuR是一种参与结瘤的TF,可能通过其已知的结合代谢物腐胺起作用。序列和结构证据预测了腐胺通过TF PuuR和nodI基因调控在结瘤过程中作为信号代谢产物的模型。
{"title":"Computational inference of Rhizobium phaseoli transcriptional regulatory network predicts Transcription Factors involved in nodulation.","authors":"Ericka M Hernandez-Benitez, Esperanza Martínez-Romero, José Luis Aguirre-Noyola, Jose Arcadio Farias-Rico, Daniela Ledezma-Tejeida","doi":"10.1093/bfgp/elaf020","DOIUrl":"10.1093/bfgp/elaf020","url":null,"abstract":"<p><p>Growth of the common bean plant Phaseolus vulgaris is tightly linked to its symbiotic relationship with diverse rhizobial species, particularly Rhizobium phaseoli, an alphaproteobacterium that forms root nodules and provides high levels of nitrogen to the plant. Molecular cross-talk is known to happen through plant-derived metabolites, but only flavonoids have been identified as nodulation signals, which act through the activation of the NodD Transcription Factor (TF). The identification of signals that mediate nodulation via TFs can aid in the rational design of biofertilizers that promote plant-bacteria symbiosis. Here, we identified 57 TFs in the R. phaseoli genome through sequence conservation from Escherichia coli, and predicted a transcriptional regulatory network comprising 16 TFs, and 1,371 target genes. We identified the regulatory interactions relevant to nodulation via transcriptome analysis, and hypothesize that PuuR is a TF involved in nodulation, potentially acting via its known binding metabolite putrescine. Sequence and structural evidence predict a model where putrescine acts as a signaling metabolite in nodulation via the TF PuuR, and the regulation of the nodI gene.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12645834/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145607605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VirusImmu: a novel ensemble machine learning approach for viral immunogenicity prediction. 病毒免疫:一种用于病毒免疫原性预测的新型集成机器学习方法。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elaf008
Jing Li, Zhongpeng Zhao, ChengZheng Tai, Ting Sun, Lingyun Tan, Xinyu Li, Wei He, HongJun Li, Jing Zhang

The viruses threats provoke concerns regarding their sustained epidemic transmission, making the development of vaccines particularly important. In the prolonged and costly process of vaccine development, the most important initial step is to identify protective immunogens. Machine learning (ML) approaches are productive in analyzing big data such as microbial proteomes, and can remarkably reduce the cost of experimental work in developing novel vaccine candidates. We intensively evaluated the B cell epitope immunogenicity prediction power of eight commonly-used ML methods by random sampling cross validation on a large dataset consisting of known viral immunogens and non-immunogens we manually curated from the public domain. Extreme Gradient Boosting, K Nearest Neighbours, and Random Forest) showed the strongest predictive power. We then proposed a novel soft-voting based ensemble approach (VirusImmu), which demonstrated a powerful and stable capability for viral immunogenicity prediction across the test set and external test set irrespective of protein sequence length. VirusImmu was successfully applied to facilitate identifying linear B cell epitopes against African Swine Fever Virus as confirmed by indirect ELISA in vitro. In short, VirusImmu exhibited tremendous potentials in predicting immunogenicity of viral protein segments. It is freely accessible at https://github.com/zhangjbig/VirusImmu.

这些病毒的威胁引起人们对其持续流行传播的关切,因此研制疫苗尤为重要。在疫苗开发的漫长和昂贵的过程中,最重要的第一步是确定保护性免疫原。机器学习(ML)方法在分析微生物蛋白质组等大数据方面具有生产力,并且可以显着降低开发新型候选疫苗的实验工作成本。我们通过随机抽样交叉验证,集中评估了八种常用ML方法的B细胞表位免疫原性预测能力,该方法由我们手动从公共领域收集的已知病毒免疫原和非免疫原组成。极端梯度增强、K近邻和随机森林)显示出最强的预测能力。然后,我们提出了一种新的基于软投票的集成方法(virusimmune),该方法在测试集和外部测试集上显示出强大而稳定的病毒免疫原性预测能力,而不考虑蛋白质序列长度。通过间接ELISA验证了virusimmune对非洲猪瘟病毒线性B细胞抗原表位的体外鉴定。总之,virusimmune在预测病毒蛋白片段的免疫原性方面显示出巨大的潜力。它可以在https://github.com/zhangjbig/VirusImmu上免费访问。
{"title":"VirusImmu: a novel ensemble machine learning approach for viral immunogenicity prediction.","authors":"Jing Li, Zhongpeng Zhao, ChengZheng Tai, Ting Sun, Lingyun Tan, Xinyu Li, Wei He, HongJun Li, Jing Zhang","doi":"10.1093/bfgp/elaf008","DOIUrl":"https://doi.org/10.1093/bfgp/elaf008","url":null,"abstract":"<p><p>The viruses threats provoke concerns regarding their sustained epidemic transmission, making the development of vaccines particularly important. In the prolonged and costly process of vaccine development, the most important initial step is to identify protective immunogens. Machine learning (ML) approaches are productive in analyzing big data such as microbial proteomes, and can remarkably reduce the cost of experimental work in developing novel vaccine candidates. We intensively evaluated the B cell epitope immunogenicity prediction power of eight commonly-used ML methods by random sampling cross validation on a large dataset consisting of known viral immunogens and non-immunogens we manually curated from the public domain. Extreme Gradient Boosting, K Nearest Neighbours, and Random Forest) showed the strongest predictive power. We then proposed a novel soft-voting based ensemble approach (VirusImmu), which demonstrated a powerful and stable capability for viral immunogenicity prediction across the test set and external test set irrespective of protein sequence length. VirusImmu was successfully applied to facilitate identifying linear B cell epitopes against African Swine Fever Virus as confirmed by indirect ELISA in vitro. In short, VirusImmu exhibited tremendous potentials in predicting immunogenicity of viral protein segments. It is freely accessible at https://github.com/zhangjbig/VirusImmu.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12051847/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144053127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
STLBRF: an improved random forest algorithm based on standardized-threshold for feature screening of gene expression data. STLBRF:基于标准化阈值的改进型随机森林算法,用于基因表达数据的特征筛选。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae048
Huini Feng, Ying Ju, Xiaofeng Yin, Wenshi Qiu, Xu Zhang

When the traditional random forest (RF) algorithm is used to select feature elements in biostatistical data, a large amount of noise data and parameters can affect the importance of the selected feature elements, making the control of feature selection difficult. Therefore, it is a challenge for the traditional RF algorithm to preserve the accuracy of algorithm results in the presence of noise data. Generally, directly removing noise data can result in significant bias in the results. In this study, we develop a new algorithm, standardized threshold, and loops based random forest (STLBRF), and apply it to the field of gene expression data for feature gene selection. This algorithm, based on the traditional RF algorithm, combines backward elimination and K-fold cross-validation to construct a cyclic system and set a standardized threshold: error increment. The algorithm overcomes the shortcomings of existing gene selection methods. We compare ridge regression, lasso regression, elastic net regression, the traditional RF algorithm, and our improved RF algorithm using three real gene expression datasets and conducting a quantitative analysis. To ensure the reliability of the results, we validate the effectiveness of the genes selected by these methods using the Random Forest classifier. The results indicate that, compared to other methods, the STLBRF algorithm achieves not only higher effectiveness in feature gene selection but also better control over the number of selected genes. Our method offers reliable technical support for feature expression analysis and research on biomarker selection.

传统的随机森林(random forest, RF)算法在生物统计数据中选择特征元素时,大量的噪声数据和参数会影响所选特征元素的重要性,给特征选择的控制带来困难。因此,传统的射频算法在存在噪声数据的情况下如何保持算法结果的准确性是一个挑战。通常,直接去除噪声数据会导致结果出现明显偏差。在本研究中,我们开发了一种新的算法,标准化阈值和基于循环的随机森林(STLBRF),并将其应用于基因表达数据领域的特征基因选择。该算法在传统射频算法的基础上,结合反向消除和K-fold交叉验证构建循环系统,并设置标准化阈值:误差增量。该算法克服了现有基因选择方法的不足。我们比较了岭回归、lasso回归、弹性网回归、传统的射频算法和改进的射频算法,并使用三个真实的基因表达数据集进行了定量分析。为了确保结果的可靠性,我们使用随机森林分类器验证了这些方法选择的基因的有效性。结果表明,与其他方法相比,STLBRF算法不仅在特征基因选择方面具有更高的有效性,而且对选择的基因数量也有更好的控制。该方法为特征表达分析和生物标志物选择研究提供了可靠的技术支持。
{"title":"STLBRF: an improved random forest algorithm based on standardized-threshold for feature screening of gene expression data.","authors":"Huini Feng, Ying Ju, Xiaofeng Yin, Wenshi Qiu, Xu Zhang","doi":"10.1093/bfgp/elae048","DOIUrl":"10.1093/bfgp/elae048","url":null,"abstract":"<p><p>When the traditional random forest (RF) algorithm is used to select feature elements in biostatistical data, a large amount of noise data and parameters can affect the importance of the selected feature elements, making the control of feature selection difficult. Therefore, it is a challenge for the traditional RF algorithm to preserve the accuracy of algorithm results in the presence of noise data. Generally, directly removing noise data can result in significant bias in the results. In this study, we develop a new algorithm, standardized threshold, and loops based random forest (STLBRF), and apply it to the field of gene expression data for feature gene selection. This algorithm, based on the traditional RF algorithm, combines backward elimination and K-fold cross-validation to construct a cyclic system and set a standardized threshold: error increment. The algorithm overcomes the shortcomings of existing gene selection methods. We compare ridge regression, lasso regression, elastic net regression, the traditional RF algorithm, and our improved RF algorithm using three real gene expression datasets and conducting a quantitative analysis. To ensure the reliability of the results, we validate the effectiveness of the genes selected by these methods using the Random Forest classifier. The results indicate that, compared to other methods, the STLBRF algorithm achieves not only higher effectiveness in feature gene selection but also better control over the number of selected genes. Our method offers reliable technical support for feature expression analysis and research on biomarker selection.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735748/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142907874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive review of deep learning-based approaches for drug-drug interaction prediction. 基于深度学习的药物-药物相互作用预测方法综述。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae052
Yan Xia, An Xiong, Zilong Zhang, Quan Zou, Feifei Cui

Deep learning models have made significant progress in the biomedical field, particularly in the prediction of drug-drug interactions (DDIs). DDIs are pharmacodynamic reactions between two or more drugs in the body, which may lead to adverse effects and are of great significance for drug development and clinical research. However, predicting DDI through traditional clinical trials and experiments is not only costly but also time-consuming. When utilizing advanced Artificial Intelligence (AI) and deep learning techniques, both developers and users face multiple challenges, including the problem of acquiring and encoding data, as well as the difficulty of designing computational methods. In this paper, we review a variety of DDI prediction methods, including similarity-based, network-based, and integration-based approaches, to provide an up-to-date and easy-to-understand guide for researchers in different fields. Additionally, we provide an in-depth analysis of widely used molecular representations and a systematic exposition of the theoretical framework of models used to extract features from graph data.

深度学习模型在生物医学领域取得了重大进展,特别是在药物-药物相互作用(ddi)的预测方面。ddi是体内两种或两种以上药物之间的药效学反应,可能导致不良反应,对药物开发和临床研究具有重要意义。然而,通过传统的临床试验和实验预测DDI不仅成本高,而且耗时长。在利用先进的人工智能(AI)和深度学习技术时,开发人员和用户都面临着多重挑战,包括获取和编码数据的问题,以及设计计算方法的困难。在本文中,我们回顾了各种DDI预测方法,包括基于相似性、基于网络和基于集成的方法,为不同领域的研究人员提供一个最新的、易于理解的指南。此外,我们对广泛使用的分子表示进行了深入分析,并系统地阐述了用于从图数据中提取特征的模型的理论框架。
{"title":"A comprehensive review of deep learning-based approaches for drug-drug interaction prediction.","authors":"Yan Xia, An Xiong, Zilong Zhang, Quan Zou, Feifei Cui","doi":"10.1093/bfgp/elae052","DOIUrl":"10.1093/bfgp/elae052","url":null,"abstract":"<p><p>Deep learning models have made significant progress in the biomedical field, particularly in the prediction of drug-drug interactions (DDIs). DDIs are pharmacodynamic reactions between two or more drugs in the body, which may lead to adverse effects and are of great significance for drug development and clinical research. However, predicting DDI through traditional clinical trials and experiments is not only costly but also time-consuming. When utilizing advanced Artificial Intelligence (AI) and deep learning techniques, both developers and users face multiple challenges, including the problem of acquiring and encoding data, as well as the difficulty of designing computational methods. In this paper, we review a variety of DDI prediction methods, including similarity-based, network-based, and integration-based approaches, to provide an up-to-date and easy-to-understand guide for researchers in different fields. Additionally, we provide an in-depth analysis of widely used molecular representations and a systematic exposition of the theoretical framework of models used to extract features from graph data.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11847217/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143484633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
m6A RNA modification pathway: orchestrating fibrotic mechanisms across multiple organs. m6A RNA修饰途径:协调多器官纤维化机制。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae051
Xiangfei Huang, Zilu Yu, Juan Tian, Tao Chen, Aiping Wei, Chao Mei, Shibiao Chen, Yong Li

Organ fibrosis, a common consequence of chronic tissue injury, presents a significant health challenge. Recent research has revealed the regulatory role of N6-methyladenosine (m6A) RNA modification in fibrosis of various organs, including the lung, liver, kidney, and heart. In this comprehensive review, we summarize the latest findings on the mechanisms and functions of m6A modification in organ fibrosis. By highlighting the potential of m6A modification as a therapeutic target, our goal is to encourage further research in this emerging field and support advancements in the clinical treatment of organ fibrosis.

器官纤维化是慢性组织损伤的常见后果,对健康提出了重大挑战。最近的研究揭示了n6 -甲基腺苷(m6A) RNA修饰在包括肺、肝、肾和心脏在内的各种器官纤维化中的调节作用。在这篇综述中,我们对m6A修饰在器官纤维化中的机制和功能的最新发现进行了综述。通过强调m6A修饰作为治疗靶点的潜力,我们的目标是鼓励在这一新兴领域的进一步研究,并支持器官纤维化临床治疗的进步。
{"title":"m6A RNA modification pathway: orchestrating fibrotic mechanisms across multiple organs.","authors":"Xiangfei Huang, Zilu Yu, Juan Tian, Tao Chen, Aiping Wei, Chao Mei, Shibiao Chen, Yong Li","doi":"10.1093/bfgp/elae051","DOIUrl":"10.1093/bfgp/elae051","url":null,"abstract":"<p><p>Organ fibrosis, a common consequence of chronic tissue injury, presents a significant health challenge. Recent research has revealed the regulatory role of N6-methyladenosine (m6A) RNA modification in fibrosis of various organs, including the lung, liver, kidney, and heart. In this comprehensive review, we summarize the latest findings on the mechanisms and functions of m6A modification in organ fibrosis. By highlighting the potential of m6A modification as a therapeutic target, our goal is to encourage further research in this emerging field and support advancements in the clinical treatment of organ fibrosis.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735750/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142933353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
INTS7 modulates cell proliferation and apoptosis via promoting cell cycle progression in lung adenocarcinoma. 在肺腺癌中,INTS7通过促进细胞周期进程调节细胞增殖和凋亡。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elaf014
Yaming Liu, Tengfei Huang, Dehua Zeng, Meiqing Zhang, Duohuan Lian, Shunkai Zhou, Mengmeng Chen, Zhiyong Zeng, Huizhong Li

The evolutionarily conserved Integrator complex, which is composed of over 10 subunits, orchestrates diverse RNA-processing events such as 3'-end maturation of small nuclear RNAs (snRNAs), transcription termination of RNA Polymerase II, and DNA damage response signaling pathways; however, the functional roles of individual Integrator complex subunits in lung adenocarcinoma (LUAD) remain poorly characterized, and this study aimed to systematically investigate the potential oncogenic functions and prognostic values of these subunits in LUAD. To achieve this goal, the expression profiles of Integrator complex subunits were profiled using transcriptomic data from the The Cancer Genome Atlas (TCGA) database, survival analyses (including Kaplan-Meier and Cox regression models) were performed to evaluate the correlations between subunit expression levels and patient survival outcomes (overall survival (OS) and disease-free survival (DFS)), co-expression network analysis was conducted to annotate the potential biological functions of key subunits, and functional validation was performed using CCK-8 assays and flow cytometry to assess the impact of INTS7 depletion on cell proliferation and cycle progression in LUAD cell lines. The findings of this study showed that Integrator complex subunits were significantly overexpressed in LUAD tissues compared to normal lung parenchyma; among these subunits, INTS7 expression was most strongly associated with shortened OS and DFS, indicating its pivotal role in LUAD pathogenesis, while bioinformatics analyses revealed that INTS7 is involved in regulating critical biological processes including cell cycle progression, transcriptional regulation, and RNA metabolism, and loss-of-function experiments demonstrated that genetic silencing of INTS7 significantly inhibited cell proliferation and induced cell cycle arrest in LUAD cells. Ultimately, this study provides the first evidence that INTS7, a core component of the Integrator complex, serves as a functional and prognostic regulator in LUAD, highlighting its potential as a therapeutic target for this malignancy.

进化上保守的Integrator复合体由超过10个亚基组成,协调各种RNA加工事件,如小核RNA (snrna)的3'端成熟,RNA聚合酶II的转录终止和DNA损伤反应信号通路;然而,单个整合子复合物亚基在肺腺癌(LUAD)中的功能作用仍然知之甚少,本研究旨在系统地研究这些亚基在LUAD中的潜在致癌功能和预后价值。为了实现这一目标,利用来自癌症基因组图谱(TCGA)数据库的转录组学数据对Integrator复合体亚基的表达谱进行了分析,并进行了生存分析(包括Kaplan-Meier和Cox回归模型),以评估亚基表达水平与患者生存结果(总生存期(OS)和无病生存期(DFS))之间的相关性。通过共表达网络分析来注释关键亚基的潜在生物学功能,并使用CCK-8检测和流式细胞术进行功能验证,以评估INTS7缺失对LUAD细胞系细胞增殖和周期进展的影响。本研究结果表明,与正常肺实质相比,整合子复合体亚基在LUAD组织中显著过表达;在这些亚基中,INTS7的表达与缩短OS和DFS的相关性最强,表明其在LUAD发病机制中起关键作用,而生物信息学分析显示,INTS7参与调节关键的生物过程,包括细胞周期进程、转录调节和RNA代谢,功能缺失实验表明,基因沉默INTS7可显著抑制LUAD细胞增殖并诱导细胞周期阻滞。最终,本研究提供了第一个证据,证明INTS7作为Integrator复合体的核心成分,在LUAD中发挥功能和预后调节作用,突出了其作为这种恶性肿瘤治疗靶点的潜力。
{"title":"INTS7 modulates cell proliferation and apoptosis via promoting cell cycle progression in lung adenocarcinoma.","authors":"Yaming Liu, Tengfei Huang, Dehua Zeng, Meiqing Zhang, Duohuan Lian, Shunkai Zhou, Mengmeng Chen, Zhiyong Zeng, Huizhong Li","doi":"10.1093/bfgp/elaf014","DOIUrl":"https://doi.org/10.1093/bfgp/elaf014","url":null,"abstract":"<p><p>The evolutionarily conserved Integrator complex, which is composed of over 10 subunits, orchestrates diverse RNA-processing events such as 3'-end maturation of small nuclear RNAs (snRNAs), transcription termination of RNA Polymerase II, and DNA damage response signaling pathways; however, the functional roles of individual Integrator complex subunits in lung adenocarcinoma (LUAD) remain poorly characterized, and this study aimed to systematically investigate the potential oncogenic functions and prognostic values of these subunits in LUAD. To achieve this goal, the expression profiles of Integrator complex subunits were profiled using transcriptomic data from the The Cancer Genome Atlas (TCGA) database, survival analyses (including Kaplan-Meier and Cox regression models) were performed to evaluate the correlations between subunit expression levels and patient survival outcomes (overall survival (OS) and disease-free survival (DFS)), co-expression network analysis was conducted to annotate the potential biological functions of key subunits, and functional validation was performed using CCK-8 assays and flow cytometry to assess the impact of INTS7 depletion on cell proliferation and cycle progression in LUAD cell lines. The findings of this study showed that Integrator complex subunits were significantly overexpressed in LUAD tissues compared to normal lung parenchyma; among these subunits, INTS7 expression was most strongly associated with shortened OS and DFS, indicating its pivotal role in LUAD pathogenesis, while bioinformatics analyses revealed that INTS7 is involved in regulating critical biological processes including cell cycle progression, transcriptional regulation, and RNA metabolism, and loss-of-function experiments demonstrated that genetic silencing of INTS7 significantly inhibited cell proliferation and induced cell cycle arrest in LUAD cells. Ultimately, this study provides the first evidence that INTS7, a core component of the Integrator complex, serves as a functional and prognostic regulator in LUAD, highlighting its potential as a therapeutic target for this malignancy.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12393144/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144979564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of computational algorithms for single-cell RNA-seq and ATAC-seq in neurodegenerative diseases. 单细胞 RNA-seq 和 ATAC-seq 计算算法在神经退行性疾病中的应用。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elae044
Hwisoo Choi, Hyeonkyu Kim, Hoebin Chung, Dong-Sung Lee, Junil Kim

Recent advancements in single-cell technologies, including single-cell RNA sequencing (scRNA-seq) and Assay for Transposase-Accessible Chromatin using sequencing (scATAC-seq), have greatly improved our insight into the epigenomic landscapes across various biological contexts and diseases. This paper reviews key computational tools and machine learning approaches that integrate scRNA-seq and scATAC-seq data to facilitate the alignment of transcriptomic data with chromatin accessibility profiles. Applying these integrated single-cell technologies in neurodegenerative diseases, such as Alzheimer's disease and Parkinson's disease, reveals how changes in chromatin accessibility and gene expression can illuminate pathogenic mechanisms and identify potential therapeutic targets. Despite facing challenges like data sparsity and computational demands, ongoing enhancements in scATAC-seq and scRNA-seq technologies, along with better analytical methods, continue to expand their applications. These advancements promise to revolutionize our approach to medical research and clinical diagnostics, offering a comprehensive view of cellular function and disease pathology.

单细胞技术的最新进展,包括单细胞RNA测序(scRNA-seq)和转座酶可及染色质测序(scATAC-seq),大大提高了我们对各种生物背景和疾病的表观基因组景观的洞察力。本文综述了整合 scRNA-seq 和 scATAC-seq 数据的关键计算工具和机器学习方法,以促进转录组数据与染色质可及性图谱的配准。在阿尔茨海默病和帕金森病等神经退行性疾病中应用这些集成单细胞技术,揭示了染色质可及性和基因表达的变化如何阐明致病机制并确定潜在的治疗靶点。尽管面临数据稀缺和计算需求等挑战,scATAC-seq 和 scRNA-seq 技术的不断改进以及更好的分析方法仍在继续扩大其应用范围。这些进步有望彻底改变我们的医学研究和临床诊断方法,为细胞功能和疾病病理提供一个全面的视角。
{"title":"Application of computational algorithms for single-cell RNA-seq and ATAC-seq in neurodegenerative diseases.","authors":"Hwisoo Choi, Hyeonkyu Kim, Hoebin Chung, Dong-Sung Lee, Junil Kim","doi":"10.1093/bfgp/elae044","DOIUrl":"10.1093/bfgp/elae044","url":null,"abstract":"<p><p>Recent advancements in single-cell technologies, including single-cell RNA sequencing (scRNA-seq) and Assay for Transposase-Accessible Chromatin using sequencing (scATAC-seq), have greatly improved our insight into the epigenomic landscapes across various biological contexts and diseases. This paper reviews key computational tools and machine learning approaches that integrate scRNA-seq and scATAC-seq data to facilitate the alignment of transcriptomic data with chromatin accessibility profiles. Applying these integrated single-cell technologies in neurodegenerative diseases, such as Alzheimer's disease and Parkinson's disease, reveals how changes in chromatin accessibility and gene expression can illuminate pathogenic mechanisms and identify potential therapeutic targets. Despite facing challenges like data sparsity and computational demands, ongoing enhancements in scATAC-seq and scRNA-seq technologies, along with better analytical methods, continue to expand their applications. These advancements promise to revolutionize our approach to medical research and clinical diagnostics, offering a comprehensive view of cellular function and disease pathology.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735751/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142584958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: A dynamic model of gene activation in response to hypoxia accounting for both HIF-1 and HIF-2. 更正:HIF-1和HIF-2对缺氧反应的基因激活动态模型。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elaf028
{"title":"Correction to: A dynamic model of gene activation in response to hypoxia accounting for both HIF-1 and HIF-2.","authors":"","doi":"10.1093/bfgp/elaf028","DOIUrl":"10.1093/bfgp/elaf028","url":null,"abstract":"","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12723799/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145812409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integration of single cell multiomics data by deep transfer hypergraph neural network. 基于深度传递超图神经网络的单细胞多组学数据集成。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2025-01-15 DOI: 10.1093/bfgp/elaf009
Yulong Kan, Zhongxiao Zhang, Yingjie Wang, Yunjing Qi, Haoxin Chang, Weihao Wang, Zheng Zhang, Quanhong Liu, Xiaoran Shi

Multi-omics characterization of individual cells offers remarkable potential for analyzing the dynamics and relationships of gene regulatory states across millions of cells. How to integrate multimodal data is an open problem, existing integration methods struggle with accuracy and modality-specific biological variation retention. In this paper, we present scHyper (scalable, interpretable machine learning for single cell integration), a low-code and data-efficient deep transfer model designed for integrating paired and unpaired single-cell multimodal data. We benchmark scHyper against datasets from different multimodal data. ScHyper learns a low-dimensional representation and aligns the covariance matrices of the measured modalities, achieving high accuracy even with large scale atlas-level datasets with low memory and computational time across different cell lines, shedding light on regulatory relationships between different types of omics. Altogether, we show that scHyper is a versatile and robust tool for cell-type label transfer and integration from multimodal single-cell datasets.

单个细胞的多组学表征为分析数百万细胞中基因调控状态的动态和关系提供了显着的潜力。如何集成多模态数据是一个开放的问题,现有的集成方法在准确性和模态特异性生物变异保留方面存在问题。在本文中,我们提出了scHyper(可扩展,可解释的单细胞集成机器学习),这是一种低代码和数据高效的深度传输模型,旨在集成成对和非成对的单细胞多模态数据。我们对来自不同多模态数据集的scHyper进行基准测试。ScHyper学习低维表示并对齐测量模式的协方差矩阵,即使使用跨不同细胞系的低内存和计算时间的大规模图谱级数据集也能实现高精度,从而揭示不同类型组学之间的调节关系。总之,我们表明scHyper是一个多功能和强大的工具,用于多模态单细胞数据集的细胞类型标签转移和集成。
{"title":"Integration of single cell multiomics data by deep transfer hypergraph neural network.","authors":"Yulong Kan, Zhongxiao Zhang, Yingjie Wang, Yunjing Qi, Haoxin Chang, Weihao Wang, Zheng Zhang, Quanhong Liu, Xiaoran Shi","doi":"10.1093/bfgp/elaf009","DOIUrl":"10.1093/bfgp/elaf009","url":null,"abstract":"<p><p>Multi-omics characterization of individual cells offers remarkable potential for analyzing the dynamics and relationships of gene regulatory states across millions of cells. How to integrate multimodal data is an open problem, existing integration methods struggle with accuracy and modality-specific biological variation retention. In this paper, we present scHyper (scalable, interpretable machine learning for single cell integration), a low-code and data-efficient deep transfer model designed for integrating paired and unpaired single-cell multimodal data. We benchmark scHyper against datasets from different multimodal data. ScHyper learns a low-dimensional representation and aligns the covariance matrices of the measured modalities, achieving high accuracy even with large scale atlas-level datasets with low memory and computational time across different cell lines, shedding light on regulatory relationships between different types of omics. Altogether, we show that scHyper is a versatile and robust tool for cell-type label transfer and integration from multimodal single-cell datasets.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12397973/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144979613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Briefings in Functional Genomics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1