High-throughput gene expression data have been extensively generated and utilized in biological mechanism investigations, biomarker detection, disease diagnosis and prognosis. These applications encompass not only bulk transcriptome, but also single cell RNA-seq data. However, extracting reliable biological information from transcriptome data remains challenging due to the constrains of Compositional Data Analysis. Current data preprocessing methods, including dataset normalization and batch effect correction, are insufficient to address these issues and improve data quality for downstream analysis. Alternatively, qualification methods focusing on the relative order of gene expression (ROGER) are more informative than the quantification methods that rely on gene expression abundance. The Pairwise Analysis of Gene expression method is an enhancement of ROGER, designed for data integration in either sample space or feature space. In this review, we summarize the methods applied to transcriptome data analysis and discuss their potentials in predicting clinical outcomes.
高通量基因表达数据已广泛产生并用于生物机制研究、生物标记物检测、疾病诊断和预后。这些应用不仅包括大量转录组数据,还包括单细胞 RNA-seq 数据。然而,由于合成数据分析的限制,从转录组数据中提取可靠的生物信息仍然具有挑战性。目前的数据预处理方法,包括数据集归一化和批量效应校正,都不足以解决这些问题并提高下游分析的数据质量。另外,与依赖基因表达丰度的定量方法相比,侧重于基因表达相对顺序(ROGER)的定性方法信息量更大。基因表达成对分析方法是 ROGER 的增强版,旨在对样本空间或特征空间进行数据整合。在这篇综述中,我们总结了应用于转录组数据分析的方法,并讨论了这些方法在预测临床结果方面的潜力。
{"title":"Less is more: relative rank is more informative than absolute abundance for compositional NGS data.","authors":"Xubin Zheng, Nana Jin, Qiong Wu, Ning Zhang, Haonan Wu, Yuanhao Wang, Rui Luo, Tao Liu, Wanfu Ding, Qingshan Geng, Lixin Cheng","doi":"10.1093/bfgp/elae045","DOIUrl":"10.1093/bfgp/elae045","url":null,"abstract":"<p><p>High-throughput gene expression data have been extensively generated and utilized in biological mechanism investigations, biomarker detection, disease diagnosis and prognosis. These applications encompass not only bulk transcriptome, but also single cell RNA-seq data. However, extracting reliable biological information from transcriptome data remains challenging due to the constrains of Compositional Data Analysis. Current data preprocessing methods, including dataset normalization and batch effect correction, are insufficient to address these issues and improve data quality for downstream analysis. Alternatively, qualification methods focusing on the relative order of gene expression (ROGER) are more informative than the quantification methods that rely on gene expression abundance. The Pairwise Analysis of Gene expression method is an enhancement of ROGER, designed for data integration in either sample space or feature space. In this review, we summarize the methods applied to transcriptome data analysis and discuss their potentials in predicting clinical outcomes.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735744/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Adam Wojtulewski, Aleksandra Sikora, Sean Dineen, Mustafa Raoof, Aleksandra Karolak
Objective: The primary objective of this study is to investigate various applications of artificial intelligence (AI) and statistical methodologies for analyzing and managing peritoneal metastases (PM) caused by gastrointestinal cancers.
Methods: Relevant keywords and search criteria were comprehensively researched on PubMed and Google Scholar to identify articles and reviews related to the topic. The AI approaches considered were conventional machine learning (ML) and deep learning (DL) models, and the relevant statistical approaches included biostatistics and logistic models.
Results: The systematic literature review yielded nearly 30 articles meeting the predefined criteria. Analyses of these studies showed that AI methodologies consistently outperformed traditional statistical approaches. In the AI approaches, DL consistently produced the most precise results, while classical ML demonstrated varied performance but maintained high predictive accuracy. The sample size was the recurring factor that increased the accuracy of the predictions for models of the same type.
Conclusions: AI and statistical approaches can detect PM developing among patients with gastrointestinal cancers. Therefore, if clinicians integrated these approaches into diagnostics and prognostics, they could better analyze and manage PM, enhancing clinical decision-making and patients' outcomes. Collaboration across multiple institutions would also help in standardizing methods for data collection and allowing consistent results.
{"title":"Using artificial intelligence and statistics for managing peritoneal metastases from gastrointestinal cancers.","authors":"Adam Wojtulewski, Aleksandra Sikora, Sean Dineen, Mustafa Raoof, Aleksandra Karolak","doi":"10.1093/bfgp/elae049","DOIUrl":"10.1093/bfgp/elae049","url":null,"abstract":"<p><strong>Objective: </strong>The primary objective of this study is to investigate various applications of artificial intelligence (AI) and statistical methodologies for analyzing and managing peritoneal metastases (PM) caused by gastrointestinal cancers.</p><p><strong>Methods: </strong>Relevant keywords and search criteria were comprehensively researched on PubMed and Google Scholar to identify articles and reviews related to the topic. The AI approaches considered were conventional machine learning (ML) and deep learning (DL) models, and the relevant statistical approaches included biostatistics and logistic models.</p><p><strong>Results: </strong>The systematic literature review yielded nearly 30 articles meeting the predefined criteria. Analyses of these studies showed that AI methodologies consistently outperformed traditional statistical approaches. In the AI approaches, DL consistently produced the most precise results, while classical ML demonstrated varied performance but maintained high predictive accuracy. The sample size was the recurring factor that increased the accuracy of the predictions for models of the same type.</p><p><strong>Conclusions: </strong>AI and statistical approaches can detect PM developing among patients with gastrointestinal cancers. Therefore, if clinicians integrated these approaches into diagnostics and prognostics, they could better analyze and manage PM, enhancing clinical decision-making and patients' outcomes. Collaboration across multiple institutions would also help in standardizing methods for data collection and allowing consistent results.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735730/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142907876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The misuse and overprescription of antibiotics have accelerated the rise of antimicrobial resistance (AMR), rendering many antibiotics ineffective and leading to significant clinical challenges. The conventional treatment methods have become progressively challenging, posing a threat of evolving into an impending silent pandemic. The long track record of bacteriophages combating bacterial infections has renewed hope into the potential therapeutic benefits of bacteriophages. Bacteriophage therapy offers a promising alternative to antibiotics, particularly against multidrug-resistant (MDR) pathogens. This article explores the promise of phages as a potential means to combat superbugs from the perspective of the genomic and transcriptomic landscape of the phages and their bacterial host. Advances in bacteriophage genomics have expedited the detection of new phages and AMR genes, enhancing our understanding of phage-host interactions and enabling the identification of potential treatments for antibiotic-resistant bacteria. At the same time, holo-transcriptomic studies hold potential for discovering disease and context-specific transcriptionally active phages vis-à-vis disease severity. Holo-transcriptomic profiling can be applied to investigate the presence of AMR-bacteria, highlighting COVID-19 and Dengue diseases, in addition to the globally recognized ESKAPE pathogens. By simultaneously capturing phage, bacterial and host transcripts, this approach enables a better comprehension of the bacteriophage dynamics. Moreover, insight into these defence and counter-defence interactions is essential for augmenting the adoption of phage therapy at scale and advancing bacterial control in clinical settings.
{"title":"Genomic insights into bacteriophages: a new frontier in AMR detection and phage therapy.","authors":"Basudha Banerjee, Sayanti Halder, Shubham Kumar, Muskan Chaddha, Raiyan Ali, Ramakant Mohite, Muskan Bano, Rajesh Pandey","doi":"10.1093/bfgp/elaf011","DOIUrl":"10.1093/bfgp/elaf011","url":null,"abstract":"<p><p>The misuse and overprescription of antibiotics have accelerated the rise of antimicrobial resistance (AMR), rendering many antibiotics ineffective and leading to significant clinical challenges. The conventional treatment methods have become progressively challenging, posing a threat of evolving into an impending silent pandemic. The long track record of bacteriophages combating bacterial infections has renewed hope into the potential therapeutic benefits of bacteriophages. Bacteriophage therapy offers a promising alternative to antibiotics, particularly against multidrug-resistant (MDR) pathogens. This article explores the promise of phages as a potential means to combat superbugs from the perspective of the genomic and transcriptomic landscape of the phages and their bacterial host. Advances in bacteriophage genomics have expedited the detection of new phages and AMR genes, enhancing our understanding of phage-host interactions and enabling the identification of potential treatments for antibiotic-resistant bacteria. At the same time, holo-transcriptomic studies hold potential for discovering disease and context-specific transcriptionally active phages vis-à-vis disease severity. Holo-transcriptomic profiling can be applied to investigate the presence of AMR-bacteria, highlighting COVID-19 and Dengue diseases, in addition to the globally recognized ESKAPE pathogens. By simultaneously capturing phage, bacterial and host transcripts, this approach enables a better comprehension of the bacteriophage dynamics. Moreover, insight into these defence and counter-defence interactions is essential for augmenting the adoption of phage therapy at scale and advancing bacterial control in clinical settings.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12302716/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144735535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Aman Chandra Kaushik, Shubham Krushna Talware, Mohammad Imran Siddiqi
MDM2 (Mouse Double Minute 2), a fundamental governor of the p53 tumor suppressor pathway, has garnered significant attention as a favorable target for cancer therapy. Recent years have witnessed the development and synthesis of potent MDM2 inhibitors. Despite the fact that numerous MDM2 inhibitors and degraders have been assessed in clinical studies for various human cancers, no FDA-approved drug targeting MDM2 is presently available in the market. Researchers have investigated the effects of various drugs, which are involved in cancer therapies with known mechanisms, on well-characterized cancer cell lines. The prediction of drug inhibition responses becomes crucial to enhance the effectiveness and personalization of cancer treatments. Such findings can provide new perceptions aimed at designing new drugs for targeted cancer therapies. In our current insilico work, a robust response was observed for Idasanutlin in cancer cell lines, indicating the drug's significant impact on gene expression. We also identified transcriptional response signatures, which were informative about the drug's mechanism of action and potential clinical application. Further, we applied a similarity search approach for the identification of potential lead compounds from the ChEMBL database and validated them by molecular docking and dynamics studies. The study highlights the potential of incorporating machine learning with omics and single-cell RNA-seq data for predicting drug responses in cancer cells. Our findings could provide valuable insights for improving cancer treatment in the future, particularly in developing effective therapies.
{"title":"Integrative machine learning approach for identification of new molecular scaffold and prediction of inhibition responses in cancer cells using multi-omics data.","authors":"Aman Chandra Kaushik, Shubham Krushna Talware, Mohammad Imran Siddiqi","doi":"10.1093/bfgp/elaf006","DOIUrl":"https://doi.org/10.1093/bfgp/elaf006","url":null,"abstract":"<p><p>MDM2 (Mouse Double Minute 2), a fundamental governor of the p53 tumor suppressor pathway, has garnered significant attention as a favorable target for cancer therapy. Recent years have witnessed the development and synthesis of potent MDM2 inhibitors. Despite the fact that numerous MDM2 inhibitors and degraders have been assessed in clinical studies for various human cancers, no FDA-approved drug targeting MDM2 is presently available in the market. Researchers have investigated the effects of various drugs, which are involved in cancer therapies with known mechanisms, on well-characterized cancer cell lines. The prediction of drug inhibition responses becomes crucial to enhance the effectiveness and personalization of cancer treatments. Such findings can provide new perceptions aimed at designing new drugs for targeted cancer therapies. In our current insilico work, a robust response was observed for Idasanutlin in cancer cell lines, indicating the drug's significant impact on gene expression. We also identified transcriptional response signatures, which were informative about the drug's mechanism of action and potential clinical application. Further, we applied a similarity search approach for the identification of potential lead compounds from the ChEMBL database and validated them by molecular docking and dynamics studies. The study highlights the potential of incorporating machine learning with omics and single-cell RNA-seq data for predicting drug responses in cancer cells. Our findings could provide valuable insights for improving cancer treatment in the future, particularly in developing effective therapies.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12008120/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144045378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jung Hun Oh, Gabrielle Rizzuto, Rena Elkin, Corey Weistuch, Larry Norton, Gabriela Dveksler, Joseph O Deasy
Recently, the mRNA presence of pregnancy-specific glycoproteins (PSGs) in cancer biopsies has been shown to be associated with poor survival. Given the pregnancy-related function of PSGs, we hypothesized that PSGs might act in a sex-dependent behavior in cancer patients. A differential sex effect of PSG genes with respect to tumor immune landscape and cancer outcomes was investigated using statistical, bioinformatic, and machine learning analyses in The Cancer Genome Atlas (TCGA) data. The resulting findings were then validated in the Clinical Proteomic Tumor Analysis Consortium (CPTAC) data. In a pan-cancer TCGA data analysis, the strongest PSG-related sex difference for the prognostic association was found in lung adenocarcinoma (LUAD). Kaplan-Meier analysis revealed that expression of PSG genes is strongly associated with overall survival rate in the female group on the TCGA, but not in the male group. This sex-specific association was validated in an independent dataset from the CPTAC study. A combination of PSG3, PSG7, and PSG8 expression was most significantly linked to poor prognosis in females (P = 8.67E-06 in TCGA and P = .0382 in CPTAC). Pathway analysis revealed enrichment of the 'KRAS Signaling Down' pathway in the high-risk female group. A predictive model showed good predictive performance for the female group (validated C-index = 0.78 in CPTAC), but poor predictive performance for the male group. These findings suggest that PSGs may have a sex-specific negative impact on survival in female LUAD patients, and the mechanism may be related to KRAS signaling pathway modulation.
{"title":"Pregnancy-specific glycoproteins as potential drug targets for female lung adenocarcinoma patients.","authors":"Jung Hun Oh, Gabrielle Rizzuto, Rena Elkin, Corey Weistuch, Larry Norton, Gabriela Dveksler, Joseph O Deasy","doi":"10.1093/bfgp/elaf004","DOIUrl":"https://doi.org/10.1093/bfgp/elaf004","url":null,"abstract":"<p><p>Recently, the mRNA presence of pregnancy-specific glycoproteins (PSGs) in cancer biopsies has been shown to be associated with poor survival. Given the pregnancy-related function of PSGs, we hypothesized that PSGs might act in a sex-dependent behavior in cancer patients. A differential sex effect of PSG genes with respect to tumor immune landscape and cancer outcomes was investigated using statistical, bioinformatic, and machine learning analyses in The Cancer Genome Atlas (TCGA) data. The resulting findings were then validated in the Clinical Proteomic Tumor Analysis Consortium (CPTAC) data. In a pan-cancer TCGA data analysis, the strongest PSG-related sex difference for the prognostic association was found in lung adenocarcinoma (LUAD). Kaplan-Meier analysis revealed that expression of PSG genes is strongly associated with overall survival rate in the female group on the TCGA, but not in the male group. This sex-specific association was validated in an independent dataset from the CPTAC study. A combination of PSG3, PSG7, and PSG8 expression was most significantly linked to poor prognosis in females (P = 8.67E-06 in TCGA and P = .0382 in CPTAC). Pathway analysis revealed enrichment of the 'KRAS Signaling Down' pathway in the high-risk female group. A predictive model showed good predictive performance for the female group (validated C-index = 0.78 in CPTAC), but poor predictive performance for the male group. These findings suggest that PSGs may have a sex-specific negative impact on survival in female LUAD patients, and the mechanism may be related to KRAS signaling pathway modulation.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12010166/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144042547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ruijie Zhang, Zhengxin Chen, Tianyi Li, Dehua Feng, Xinying Liu, Xuefeng Wang, Huirui Han, Lei Yu, Xia Li, Bing Li, Limei Wang, Jin Li
Enhancer RNA (eRNA), a type of non-coding RNA transcribed from enhancer regions, serves as a class of critical regulatory elements in gene expression. In cancer biology, eRNAs exhibit profound roles in tumorigenesis, metastasis, and therapeutic response modulation. In this review, we outline eRNA identification methods utilizing enhancer region prediction, histone H3 lysine 4 monomethyl chromatin signatures, and nucleosome positioning analysis. We quantitate eRNA expression through RNA-seq, single-cell transcriptomics, and epigenomic integration approaches. Functionally, eRNAs regulate gene expression, protein function modulation, and chromatin modification. Key databases detailing eRNA annotations and interactions are highlighted. Furthermore, we analyze the connection of eRNA with immune cells and its potential in immunotherapy. Emerging evidence demonstrates eRNA's critical involvement in immune cell crosstalk and tumor microenvironment reprogramming. Notably, eRNA signatures show promise as predictive biomarkers for immunotherapy response and chemoresistance monitoring in multiple malignancies. This review underscores eRNA's transformative potential in precision oncology, advocating for integrated multiomics approaches to fully realize their clinical applicability.
{"title":"Enhancer RNA in cancer: identification, expression, resources, relationship with immunity, drugs, and prognosis.","authors":"Ruijie Zhang, Zhengxin Chen, Tianyi Li, Dehua Feng, Xinying Liu, Xuefeng Wang, Huirui Han, Lei Yu, Xia Li, Bing Li, Limei Wang, Jin Li","doi":"10.1093/bfgp/elaf007","DOIUrl":"https://doi.org/10.1093/bfgp/elaf007","url":null,"abstract":"<p><p>Enhancer RNA (eRNA), a type of non-coding RNA transcribed from enhancer regions, serves as a class of critical regulatory elements in gene expression. In cancer biology, eRNAs exhibit profound roles in tumorigenesis, metastasis, and therapeutic response modulation. In this review, we outline eRNA identification methods utilizing enhancer region prediction, histone H3 lysine 4 monomethyl chromatin signatures, and nucleosome positioning analysis. We quantitate eRNA expression through RNA-seq, single-cell transcriptomics, and epigenomic integration approaches. Functionally, eRNAs regulate gene expression, protein function modulation, and chromatin modification. Key databases detailing eRNA annotations and interactions are highlighted. Furthermore, we analyze the connection of eRNA with immune cells and its potential in immunotherapy. Emerging evidence demonstrates eRNA's critical involvement in immune cell crosstalk and tumor microenvironment reprogramming. Notably, eRNA signatures show promise as predictive biomarkers for immunotherapy response and chemoresistance monitoring in multiple malignancies. This review underscores eRNA's transformative potential in precision oncology, advocating for integrated multiomics approaches to fully realize their clinical applicability.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12031722/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144057511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Correction to: STAT3-dependent long non-coding RNA Lncenc1 contributes to mouse ES cells pluripotency via stabilizing Klf4 mRNA.","authors":"","doi":"10.1093/bfgp/elaf012","DOIUrl":"10.1093/bfgp/elaf012","url":null,"abstract":"","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":"24 ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12260493/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144638691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Single-cell technology opened up a new avenue to delineate cellular status at a single-cell resolution and has become an essential tool for studying human diseases. Multiplexing allows cost-effective experiments by combining multiple samples and effectively mitigates batch effects. It starts by giving each sample a unique tag and then pooling them together for library preparation and sequencing. After sequencing, sample demultiplexing is performed based on tag detection, where cells belonging to one sample are expected to have a higher amount of the corresponding tag than cells from other samples. However, in reality, demultiplexing is not straightforward due to the noise and contamination from various sources. Successful demultiplexing depends on the efficient removal of such contamination. Here, we perform a systematic benchmark combining different normalization methods and demultiplexing approaches using real-world data and simulated datasets. We show that accounting for sequencing depth variability increases the separability between tagged and untagged cells, and the clustering-based approach outperforms existing tools. The clustering-based workflow is available as an R package from https://github.com/hwlim/hashDemux.
单细胞技术为以单细胞分辨率描述细胞状态开辟了一条新途径,已成为研究人类疾病的重要工具。多路复用技术通过将多个样本组合在一起,实现了经济高效的实验,并有效地减轻了批次效应。首先,给每个样本一个独特的标签,然后将它们集中在一起进行文库制备和测序。测序结束后,根据标签检测结果对样本进行解复用,预计属于一个样本的细胞会比其他样本的细胞含有更多的相应标签。然而,在现实中,由于各种来源的噪音和污染,解复用并不简单。成功的解复用取决于能否有效去除这些污染。在这里,我们利用真实世界数据和模拟数据集,结合不同的归一化方法和去多路复用方法,进行了一次系统的基准测试。我们的研究表明,考虑测序深度的可变性能提高标记细胞与非标记细胞之间的可分离性,基于聚类的方法优于现有工具。基于聚类的工作流程可作为 R 软件包从 https://github.com/hwlim/hashDemux 获取。
{"title":"Systematic benchmark of single-cell hashtag demultiplexing approaches reveals robust performance of a clustering-based method.","authors":"Mohammed Sayed, Yue Julia Wang, Hee-Woong Lim","doi":"10.1093/bfgp/elae039","DOIUrl":"10.1093/bfgp/elae039","url":null,"abstract":"<p><p>Single-cell technology opened up a new avenue to delineate cellular status at a single-cell resolution and has become an essential tool for studying human diseases. Multiplexing allows cost-effective experiments by combining multiple samples and effectively mitigates batch effects. It starts by giving each sample a unique tag and then pooling them together for library preparation and sequencing. After sequencing, sample demultiplexing is performed based on tag detection, where cells belonging to one sample are expected to have a higher amount of the corresponding tag than cells from other samples. However, in reality, demultiplexing is not straightforward due to the noise and contamination from various sources. Successful demultiplexing depends on the efficient removal of such contamination. Here, we perform a systematic benchmark combining different normalization methods and demultiplexing approaches using real-world data and simulated datasets. We show that accounting for sequencing depth variability increases the separability between tagged and untagged cells, and the clustering-based approach outperforms existing tools. The clustering-based workflow is available as an R package from https://github.com/hwlim/hashDemux.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735735/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142481473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tomas Klingström, Emelie Zonabend König, Avhashoni Agnes Zwane
Phenotyping of animals is a routine task in agriculture which can provide large datasets for the functional annotation of genomes. Using the livestock farming sector to study complex traits enables genetics researchers to fully benefit from the digital transformation of society as economies of scale substantially reduces the cost of phenotyping animals on farms. In the agricultural sector genomics has transitioned towards a model of 'Genomics without the genes' as a large proportion of the genetic variation in animals can be modelled using the infinitesimal model for genomic breeding valuations. Combined with third generation sequencing creating pan-genomes for livestock the digital infrastructure for trait collection and precision farming provides a unique opportunity for high-throughput phenotyping and the study of complex traits in a controlled environment. The emphasis on cost efficient data collection mean that mobile phones and computers have become ubiquitous for cost-efficient large-scale data collection but that the majority of the recorded traits can still be recorded manually with limited training or tools. This is especially valuable in low- and middle income countries and in settings where indigenous breeds are kept at farms preserving more traditional farming methods. Digitalization is therefore an important enabler for high-throughput phenotyping for smaller livestock herds with limited technology investments as well as large-scale commercial operations. It is demanding and challenging for individual researchers to keep up with the opportunities created by the rapid advances in digitalization for livestock farming and how it can be used by researchers with or without a specialization in livestock. This review provides an overview of the current status of key enabling technologies for precision livestock farming applicable for the functional annotation of genomes.
{"title":"Beyond the hype: using AI, big data, wearable devices, and the internet of things for high-throughput livestock phenotyping.","authors":"Tomas Klingström, Emelie Zonabend König, Avhashoni Agnes Zwane","doi":"10.1093/bfgp/elae032","DOIUrl":"10.1093/bfgp/elae032","url":null,"abstract":"<p><p>Phenotyping of animals is a routine task in agriculture which can provide large datasets for the functional annotation of genomes. Using the livestock farming sector to study complex traits enables genetics researchers to fully benefit from the digital transformation of society as economies of scale substantially reduces the cost of phenotyping animals on farms. In the agricultural sector genomics has transitioned towards a model of 'Genomics without the genes' as a large proportion of the genetic variation in animals can be modelled using the infinitesimal model for genomic breeding valuations. Combined with third generation sequencing creating pan-genomes for livestock the digital infrastructure for trait collection and precision farming provides a unique opportunity for high-throughput phenotyping and the study of complex traits in a controlled environment. The emphasis on cost efficient data collection mean that mobile phones and computers have become ubiquitous for cost-efficient large-scale data collection but that the majority of the recorded traits can still be recorded manually with limited training or tools. This is especially valuable in low- and middle income countries and in settings where indigenous breeds are kept at farms preserving more traditional farming methods. Digitalization is therefore an important enabler for high-throughput phenotyping for smaller livestock herds with limited technology investments as well as large-scale commercial operations. It is demanding and challenging for individual researchers to keep up with the opportunities created by the rapid advances in digitalization for livestock farming and how it can be used by researchers with or without a specialization in livestock. This review provides an overview of the current status of key enabling technologies for precision livestock farming applicable for the functional annotation of genomes.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735752/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142001413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Brad Balderson, Mitchell Fane, Tracey J Harvey, Michael Piper, Aaron Smith, Mikael Bodén
Metastatic melanoma originates from melanocytes of the skin. Melanoma metastasis results in poor treatment prognosis for patients and is associated with epigenetic and transcriptional changes that reflect the developmental program of melanocyte differentiation from neural crest stem cells. Several studies have explored melanoma transcriptional heterogeneity using microarray, bulk and single-cell RNA-sequencing technologies to derive data-driven models of the transcriptional-state change which occurs during melanoma progression. No study has systematically examined how different models of melanoma progression derived from different data types, technologies and biological conditions compare. Here, we perform a cross-sectional study to identify averaging effects of bulk-based studies that mask and distort apparent melanoma transcriptional heterogeneity; we describe new transcriptionally distinct melanoma cell states, identify differential co-expression of genes between studies and examine the effects of predicted drug susceptibilities of different cell states between studies. Importantly, we observe considerable variability in drug-target gene expression between studies, indicating potential transcriptional plasticity of melanoma to down-regulate these drug targets and thereby circumvent treatment. Overall, observed differences in gene co-expression and predicted drug susceptibility between studies suggest bulk-based transcriptional measurements do not reliably gauge heterogeneity and that melanoma transcriptional plasticity is greater than described when studies are considered in isolation.
{"title":"Systematic analysis of the transcriptional landscape of melanoma reveals drug-target expression plasticity.","authors":"Brad Balderson, Mitchell Fane, Tracey J Harvey, Michael Piper, Aaron Smith, Mikael Bodén","doi":"10.1093/bfgp/elad055","DOIUrl":"10.1093/bfgp/elad055","url":null,"abstract":"<p><p>Metastatic melanoma originates from melanocytes of the skin. Melanoma metastasis results in poor treatment prognosis for patients and is associated with epigenetic and transcriptional changes that reflect the developmental program of melanocyte differentiation from neural crest stem cells. Several studies have explored melanoma transcriptional heterogeneity using microarray, bulk and single-cell RNA-sequencing technologies to derive data-driven models of the transcriptional-state change which occurs during melanoma progression. No study has systematically examined how different models of melanoma progression derived from different data types, technologies and biological conditions compare. Here, we perform a cross-sectional study to identify averaging effects of bulk-based studies that mask and distort apparent melanoma transcriptional heterogeneity; we describe new transcriptionally distinct melanoma cell states, identify differential co-expression of genes between studies and examine the effects of predicted drug susceptibilities of different cell states between studies. Importantly, we observe considerable variability in drug-target gene expression between studies, indicating potential transcriptional plasticity of melanoma to down-regulate these drug targets and thereby circumvent treatment. Overall, observed differences in gene co-expression and predicted drug susceptibility between studies suggest bulk-based transcriptional measurements do not reliably gauge heterogeneity and that melanoma transcriptional plasticity is greater than described when studies are considered in isolation.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11979751/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139106948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}