首页 > 最新文献

Genomics, proteomics & bioinformatics最新文献

英文 中文
PASSpedia: A Polyadenylation Site Database Across Different Species at Single Cell Resolution. PASSpedia:在单细胞分辨率下跨不同物种的聚腺苷化位点数据库。
IF 7.9 Pub Date : 2025-09-23 DOI: 10.1093/gpbjnl/qzaf089
Pei-Hong Zhang, Hua Feng, Xu-Kai Ma, Fang Nan, Li Yang

Polyadenylation site (PAS) selection plays important roles in gene expression regulation and function. RNA-seq data derived from 3' tag sequencing contain intrinsic information about PAS usage and have been analyzed for alternative polyadenylation (APA) isoform expression in both bulk and single cell samples. Here, we upgraded our previously developed deep learning-based PAS analysis pipeline SCAPTURE v2 to profile PASs from 1330 published 3' tag-based scRNA-seq datasets across seven species, resulting in a comprehensive PAS landscape across species. Validation with long-read sequencing data from matched human tissues showed high accuracy of single-cell PAS profiling by SCAPTURE, including previously unannotated ones. Further comparisons revealed distinct PAS usage preferences in different species, such as human versus mouse, independent of conservation of gene expression. Finally, we present PASSpedia, a comprehensive database for PAS analysis and comparison across seven species at single cell resolution, which is freely accessible online at https://bits.fudan.edu.cn/PASSpedia/.

聚腺苷酸化位点(Polyadenylation site, PAS)选择在基因表达调控和功能中起着重要作用。来自3'标签测序的RNA-seq数据包含PAS使用的内在信息,并在散装和单细胞样品中分析了替代聚腺苷化(APA)异构体表达。在这里,我们升级了之前开发的基于深度学习的PAS分析管道SCAPTURE v2,从1330个已发表的基于3'标签的scRNA-seq数据集中分析了7个物种的PAS,从而获得了跨物种的全面PAS景观。来自匹配人体组织的长读测序数据验证表明,SCAPTURE的单细胞PAS分析具有很高的准确性,包括以前未注释的细胞。进一步的比较揭示了不同物种(如人类和小鼠)对PAS使用偏好的差异,这与基因表达的保守性无关。最后,我们介绍了PASSpedia,这是一个综合数据库,用于在单细胞分辨率下分析和比较七种物种的PAS,该数据库可在https://bits.fudan.edu.cn/PASSpedia/上免费访问。
{"title":"PASSpedia: A Polyadenylation Site Database Across Different Species at Single Cell Resolution.","authors":"Pei-Hong Zhang, Hua Feng, Xu-Kai Ma, Fang Nan, Li Yang","doi":"10.1093/gpbjnl/qzaf089","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzaf089","url":null,"abstract":"<p><p>Polyadenylation site (PAS) selection plays important roles in gene expression regulation and function. RNA-seq data derived from 3' tag sequencing contain intrinsic information about PAS usage and have been analyzed for alternative polyadenylation (APA) isoform expression in both bulk and single cell samples. Here, we upgraded our previously developed deep learning-based PAS analysis pipeline SCAPTURE v2 to profile PASs from 1330 published 3' tag-based scRNA-seq datasets across seven species, resulting in a comprehensive PAS landscape across species. Validation with long-read sequencing data from matched human tissues showed high accuracy of single-cell PAS profiling by SCAPTURE, including previously unannotated ones. Further comparisons revealed distinct PAS usage preferences in different species, such as human versus mouse, independent of conservation of gene expression. Finally, we present PASSpedia, a comprehensive database for PAS analysis and comparison across seven species at single cell resolution, which is freely accessible online at https://bits.fudan.edu.cn/PASSpedia/.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145133198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Model for the Development of Alzheimer's Disease. 阿尔茨海默病发展的模型。
IF 7.9 Pub Date : 2025-09-23 DOI: 10.1093/gpbjnl/qzaf087
Zhenyu Huang, Xuechen Mu, Qiufen Chen, Lingli Zhong, Jun Xiao, Chunman Zuo, Ye Zhang, Bocheng Shi, Yingwei Qu, Renbo Tan, Long Xu, Renchu Guan, Ying Xu

Intracellular alkalosis and extracellular acidosis are well-established characteristics of Alzheimer's disease (AD). We present a computational analysis and modeling of transcriptomic data of AD tissues, aiming to understand their causes and consequences. Our analyses have revealed that (1) persistent mitochondrial alkalization is due to chronic inflammation coupled with elevated iron and copper metabolisms; (2) the affected cells activate multiple acid-producing metabolisms to keep the mitochondrial pH stable for survival; (3) the most significant one is the continuous import and hydrolysis of glutamine to glutamate, NH3 and H+, resulting in persistent release of glutamates, an excitatory neurotransmitter, into the extracellular space; (4) this leads to persistent hyperexcitability of the nearby neurons, resulting in their continuous firing and release of H+-rich synaptic vesicles; (5) these H+s are neutralized by bicarbonates released by the neighboring astrocytes in normal tissues, which could not keep up with the increased H+-release in their discharge rates of bicarbonates in AD tissues, leading to progressively increased extracellular acidosis and ultimately cell death; and (6) multiple extensively studied AD-associated phenotypes, including Aβ aggregates and Tau fibers, are induced to help to alleviate the pH imbalances and beneficial to cell survival in the early phase of AD, which gradually become contributors to the AD development. Each step in this model is largely supported by published studies. Overall, we have developed a fundamentally novel and systems-level view of how AD may have developed.

细胞内碱中毒和细胞外酸中毒是阿尔茨海默病(AD)公认的特征。我们对阿尔茨海默病组织的转录组数据进行了计算分析和建模,旨在了解其原因和后果。我们的分析表明:(1)持续的线粒体碱化是由于慢性炎症加上铁和铜代谢升高;(2)受累细胞激活多种产酸代谢,维持线粒体pH稳定以维持生存;(3)最显著的是谷氨酰胺不断输入并水解为谷氨酸、NH3和H+,导致谷氨酸这种兴奋性神经递质持续释放到细胞外空间;(4)这导致附近神经元持续的高兴奋性,导致它们连续放电并释放富H+的突触囊泡;(5)这些H+s被正常组织内邻近星形胶质细胞释放的碳酸氢盐中和,无法跟上阿尔茨海默病组织内邻近星形胶质细胞释放碳酸氢盐的速度,导致细胞外酸中毒逐渐增加,最终导致细胞死亡;(6)多种广泛研究的AD相关表型,包括Aβ聚集体和Tau纤维,被诱导有助于缓解AD早期的pH失衡并有利于细胞存活,这些表型逐渐成为AD发展的贡献者。该模型中的每一步都得到了已发表研究的大力支持。总的来说,我们已经对AD的发展形成了一个全新的系统层面的观点。
{"title":"A Model for the Development of Alzheimer's Disease.","authors":"Zhenyu Huang, Xuechen Mu, Qiufen Chen, Lingli Zhong, Jun Xiao, Chunman Zuo, Ye Zhang, Bocheng Shi, Yingwei Qu, Renbo Tan, Long Xu, Renchu Guan, Ying Xu","doi":"10.1093/gpbjnl/qzaf087","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzaf087","url":null,"abstract":"<p><p>Intracellular alkalosis and extracellular acidosis are well-established characteristics of Alzheimer's disease (AD). We present a computational analysis and modeling of transcriptomic data of AD tissues, aiming to understand their causes and consequences. Our analyses have revealed that (1) persistent mitochondrial alkalization is due to chronic inflammation coupled with elevated iron and copper metabolisms; (2) the affected cells activate multiple acid-producing metabolisms to keep the mitochondrial pH stable for survival; (3) the most significant one is the continuous import and hydrolysis of glutamine to glutamate, NH3 and H+, resulting in persistent release of glutamates, an excitatory neurotransmitter, into the extracellular space; (4) this leads to persistent hyperexcitability of the nearby neurons, resulting in their continuous firing and release of H+-rich synaptic vesicles; (5) these H+s are neutralized by bicarbonates released by the neighboring astrocytes in normal tissues, which could not keep up with the increased H+-release in their discharge rates of bicarbonates in AD tissues, leading to progressively increased extracellular acidosis and ultimately cell death; and (6) multiple extensively studied AD-associated phenotypes, including Aβ aggregates and Tau fibers, are induced to help to alleviate the pH imbalances and beneficial to cell survival in the early phase of AD, which gradually become contributors to the AD development. Each step in this model is largely supported by published studies. Overall, we have developed a fundamentally novel and systems-level view of how AD may have developed.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145126891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LigExtract: Large-scale Automated Identification of Ligands from Protein Structures in the Protein Data Bank. LigExtract:在蛋白质数据库中从蛋白质结构中大规模自动识别配体。
IF 7.9 Pub Date : 2025-09-22 DOI: 10.1093/gpbjnl/qzaf018
Natália Aniceto, Nuno Martinho, Ismael Rufino, Rita C Guedes

The Protein Data Bank (PDB) is an ever-growing database of three-dimensional macromolecular structures that has become a crucial resource for the drug discovery process. Exploring complexed proteins and accessing their associated ligands are essential for researchers to understand biological processes and design new compounds of pharmaceutical interest. However, currently available tools for large-scale ligand identification fail to address many of the more complex ways in which ligands are stored and represented in PDB structures. Therefore, a new tool called LigExtract was specifically developed for the large-scale processing of PDB structures and the identification of their ligands. This is a fully open-source tool available to the scientific community, designed to provide end-to-end processing. Users simply provide a list of UniProt IDs, and LigExtract returns a list of ligands, their individual PDB files, a PDB file of the protein chains interacting with the ligand, and a series of log files. These logs record the decisions made during the ligand extraction process and flag additional scenarios that might have to be considered during any follow-up use of the processed files (e.g., ligands covalently bound to the protein). LigExtract is freely available on GitHub (https://github.com/comp-medchem/LigExtract).

蛋白质数据库是一个不断增长的3D大分子结构数据库,已成为药物发现过程的重要资源。探索复杂的蛋白质和获取这些蛋白质中的配体对于帮助研究人员了解生物过程和设计新的药物感兴趣的化合物至关重要。然而,目前可用的工具来执行大规模的配体鉴定不解决许多更复杂的方式,其中配体存储和表示在PDB结构。因此,专门开发了一种名为LigExtract的新工具,用于PDB结构的大规模处理及其配体的鉴定。这是一个可供科学界使用的完全开源工具,旨在提供端到端处理,用户只需提供UniProt id列表,LigExtract返回配体列表及其单独的PDB文件。与配体结合的蛋白质链的PDB文件和一系列日志文件,这些文件通知用户在配体提取过程中做出的决定,以及在后续使用处理文件(例如,配体与蛋白质共价结合)期间可能必须考虑的潜在附加情况的标记。LigExtract是可用的,开源的,在GitHub (https://github.com/comp-medchem/LigExtract)。
{"title":"LigExtract: Large-scale Automated Identification of Ligands from Protein Structures in the Protein Data Bank.","authors":"Natália Aniceto, Nuno Martinho, Ismael Rufino, Rita C Guedes","doi":"10.1093/gpbjnl/qzaf018","DOIUrl":"10.1093/gpbjnl/qzaf018","url":null,"abstract":"<p><p>The Protein Data Bank (PDB) is an ever-growing database of three-dimensional macromolecular structures that has become a crucial resource for the drug discovery process. Exploring complexed proteins and accessing their associated ligands are essential for researchers to understand biological processes and design new compounds of pharmaceutical interest. However, currently available tools for large-scale ligand identification fail to address many of the more complex ways in which ligands are stored and represented in PDB structures. Therefore, a new tool called LigExtract was specifically developed for the large-scale processing of PDB structures and the identification of their ligands. This is a fully open-source tool available to the scientific community, designed to provide end-to-end processing. Users simply provide a list of UniProt IDs, and LigExtract returns a list of ligands, their individual PDB files, a PDB file of the protein chains interacting with the ligand, and a series of log files. These logs record the decisions made during the ligand extraction process and flag additional scenarios that might have to be considered during any follow-up use of the processed files (e.g., ligands covalently bound to the protein). LigExtract is freely available on GitHub (https://github.com/comp-medchem/LigExtract).</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12619641/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143560407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deciphering Haplotype-level Chromosome Conformation Alteration in Down Syndrome by Haplotype-resolved Multi-omics Analysis. 通过多重单倍体组学分析解读唐氏综合征的单倍体染色体构象变化。
IF 7.9 Pub Date : 2025-09-22 DOI: 10.1093/gpbjnl/qzaf054
Chengchao Wu, Tianshu Zhou, Wenfu Ke, Wei Xiong, Zhihui Zhang, Siheng Zhang, Jinyue Wang, Lulu Deng, Keji Yan, Man Wang, Shenglong He, Qi Gong, Chao Ma, Xiaping Chen, Yan Li, He Long, Chong Guo, Gang Cao, Zhijun Zhang

For chromosome abnormalities (CAs), such as Down syndrome (DS), the influence of genomic variations on chromosome conformation and gene transcription remains elusive. Based on the complete genomic sequences from the parents of a DS trisomy patient, we systematically delineated an atlas of parental-specific, haplotype-resolved single nucleotide polymorphisms (SNPs), copy number variations (CNVs), three-dimensional (3D) genome architecture, and RNA expression profiles in the diencephalon of the DS patient. The integrated haplotype-resolved multi-omics analysis demonstrated that one-dimensional (1D) genomic variations including SNPs and CNVs in the DS patient are highly correlated with the alterations in the 3D genome organization and the subsequent changes in gene transcription. This correlation remains valid at the haplotype level. Moreover, we revealed the 3D genome alteration-associated dysregulation of DS-related genes, which facilitates understanding the pathogenesis of CAs. Together, our study contributes to deciphering the coding from 1D genomic variations to 3D genome architecture and the subsequent gene transcription outcomes in both health and disease.

对于染色体异常(CA),如唐氏综合症(DS),基因组变异对染色体构象和基因转录的影响仍然难以捉摸。基于来自DS三体患者父母的完整基因组序列,我们系统地描绘了DS患者间脑父母特异性单倍体单核苷酸多态性(SNP)、拷贝数变异(CNV)、三维(3D)基因组图谱和RNA表达谱。综合单倍体多组学分析表明,DS患者的一维基因组变异包括SNPs和CNVs与三维基因组的改变以及随后的基因转录高度相关。相关关系在单倍体水平上仍然有效。此外,我们还揭示了ds相关基因的三维基因组改变错误调控,有助于理解CA的发病机制。我们的研究有助于破译健康和疾病中从一维基因组变异到三维基因组结构的编码和随后的基因转录。
{"title":"Deciphering Haplotype-level Chromosome Conformation Alteration in Down Syndrome by Haplotype-resolved Multi-omics Analysis.","authors":"Chengchao Wu, Tianshu Zhou, Wenfu Ke, Wei Xiong, Zhihui Zhang, Siheng Zhang, Jinyue Wang, Lulu Deng, Keji Yan, Man Wang, Shenglong He, Qi Gong, Chao Ma, Xiaping Chen, Yan Li, He Long, Chong Guo, Gang Cao, Zhijun Zhang","doi":"10.1093/gpbjnl/qzaf054","DOIUrl":"10.1093/gpbjnl/qzaf054","url":null,"abstract":"<p><p>For chromosome abnormalities (CAs), such as Down syndrome (DS), the influence of genomic variations on chromosome conformation and gene transcription remains elusive. Based on the complete genomic sequences from the parents of a DS trisomy patient, we systematically delineated an atlas of parental-specific, haplotype-resolved single nucleotide polymorphisms (SNPs), copy number variations (CNVs), three-dimensional (3D) genome architecture, and RNA expression profiles in the diencephalon of the DS patient. The integrated haplotype-resolved multi-omics analysis demonstrated that one-dimensional (1D) genomic variations including SNPs and CNVs in the DS patient are highly correlated with the alterations in the 3D genome organization and the subsequent changes in gene transcription. This correlation remains valid at the haplotype level. Moreover, we revealed the 3D genome alteration-associated dysregulation of DS-related genes, which facilitates understanding the pathogenesis of CAs. Together, our study contributes to deciphering the coding from 1D genomic variations to 3D genome architecture and the subsequent gene transcription outcomes in both health and disease.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12571509/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144277053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PreDigs: A Database of Context-specific Cell Type Markers and Precise Cell Subtypes for Digestive Cell Annotation. PreDigs:用于消化细胞注释的上下文特异性细胞类型标记和精确细胞亚型数据库。
IF 7.9 Pub Date : 2025-09-22 DOI: 10.1093/gpbjnl/qzaf066
Jiayue Meng, Mengyao Han, Yuwei Huang, Liang Li, Yuanhu Ju, Daqing Lv, Xiaoyi Chen, Liyun Yuan, Guoqing Zhang

Research on cell type markers helps investigators explore the diverse cellular composition of gastrointestinal tumors, thereby enhancing our understanding of tumor heterogeneity and its impact on disease progression and treatment response. However, the integration of large-scale datasets and the standardization of cell type identification remain challenging. Here, we developed PreDigs, a user-friendly database of predicted signatures for the digestive system, which offers 124 curated single-cell RNA sequencing datasets, covering over 3.4 million cells, all available for download. After unsupervised clustering, we unified the identification and nomenclature of cell subtype labels, constructing a cell ontology tree with 142 cell types across 8 hierarchical levels. Meanwhile, we calculated three different context-specific cell type markers, including "Cell Markers", "Subtype Markers", and "TPN Markers", based on various application requirements within or across tissues. Through the integrated analysis of PreDigs data, we identified distinct cell subpopulations exclusive to tumors, one of which corresponds to tumor-specific endothelial cells. Additionally, PreDigs offers online cell annotation tools, allowing users to classify single cells with greater flexibility. PreDigs is accessible at https://www.biosino.org/predigs/.

细胞类型标记的研究有助于研究人员探索胃肠道肿瘤的不同细胞组成。这增强了我们对肿瘤异质性及其对疾病进展和治疗反应的影响的理解。然而,整合大规模数据集和缺乏标准化的细胞类型识别仍然是挑战。在这里,我们开发了PreDigs,一个用户友好的消化系统预测特征数据库,它提供124个策划单细胞RNA测序数据集,覆盖超过340万个细胞,所有这些数据集都可以下载。在无监督聚类之后,我们统一了子类型标签的识别和命名,构建了包含8个层次142种细胞类型的细胞本体树。同时,我们根据组织内或组织间的不同应用需求,计算了三种不同的上下文特异性细胞类型标记,包括“细胞标记”、“亚型标记”和“TPN标记”。通过对PreDigs数据的综合分析,我们确定了肿瘤独有的不同细胞亚群,其中一个亚群对应于肿瘤特异性内皮细胞。此外,PreDigs还提供在线细胞注释工具,允许用户更灵活地对单个细胞进行分类。PreDigs可在https://www.biosino.org/predigs/访问。
{"title":"PreDigs: A Database of Context-specific Cell Type Markers and Precise Cell Subtypes for Digestive Cell Annotation.","authors":"Jiayue Meng, Mengyao Han, Yuwei Huang, Liang Li, Yuanhu Ju, Daqing Lv, Xiaoyi Chen, Liyun Yuan, Guoqing Zhang","doi":"10.1093/gpbjnl/qzaf066","DOIUrl":"10.1093/gpbjnl/qzaf066","url":null,"abstract":"<p><p>Research on cell type markers helps investigators explore the diverse cellular composition of gastrointestinal tumors, thereby enhancing our understanding of tumor heterogeneity and its impact on disease progression and treatment response. However, the integration of large-scale datasets and the standardization of cell type identification remain challenging. Here, we developed PreDigs, a user-friendly database of predicted signatures for the digestive system, which offers 124 curated single-cell RNA sequencing datasets, covering over 3.4 million cells, all available for download. After unsupervised clustering, we unified the identification and nomenclature of cell subtype labels, constructing a cell ontology tree with 142 cell types across 8 hierarchical levels. Meanwhile, we calculated three different context-specific cell type markers, including \"Cell Markers\", \"Subtype Markers\", and \"TPN Markers\", based on various application requirements within or across tissues. Through the integrated analysis of PreDigs data, we identified distinct cell subpopulations exclusive to tumors, one of which corresponds to tumor-specific endothelial cells. Additionally, PreDigs offers online cell annotation tools, allowing users to classify single cells with greater flexibility. PreDigs is accessible at https://www.biosino.org/predigs/.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12571502/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144839429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Precision and Accuracy in Quantitative Measurement of Gene Expression from Single-cell/nucleus RNA Sequencing Data. 从单细胞/细胞核RNA测序数据定量测量基因表达的精确性和准确性。
IF 7.9 Pub Date : 2025-09-22 DOI: 10.1093/gpbjnl/qzaf077
Rujia Dai, Ming Zhang, Tianyao Chu, Richard Kopp, Chunling Zhang, Kefu Liu, Yue Wang, Xusheng Wang, Chao Chen, Chunyu Liu

Single-cell RNA sequencing (scRNA-seq) and single-nucleus RNA sequencing (snRNA-seq) have become essential tools for profiling gene expression across different cell types in biomedical research. While factors like RNA integrity, cell count, and sequencing depth are known to influence data quality, quantitative benchmarks and actionable guidelines are lacking. This gap contributes to variability in study designs and inconsistencies in downstream analyses. In this study, we systematically evaluated quantitative precision and accuracy in expression measures across 23 sc/snRNA-seq datasets comprising 3,682,576 cells from 339 samples. Precision was assessed using technical replicates based on pseudo-bulks created from subsampling. Accuracy was evaluated using sample-matched scRNA-seq and pooled-cell RNA sequencing data of mononuclear phagocytes from four species. Our results show that precision and accuracy are generally low at the single-cell level, with reproducibility being strongly influenced by cell count and RNA quality. We established data-driven thresholds for optimizing study design, recommending at least 500 cells per cell type per individual to achieve reliable quantification. Furthermore, we showed that signal-to-noise ratio is a key metric for identifying reproducible differentially expressed genes. To support future research, we developed Variability In single-Cell gene Expression (VICE), a tool that evaluates sc/snRNA-seq data quality and estimates the true positive rate of differential expression results based on sample size, observed noise levels, and expected effect size. These findings provide practical, evidence-based guidelines to enhance the reliability and reproducibility of sc/snRNA-seq studies.

单细胞和单核RNA测序(sc/snRNA-seq)已成为生物医学研究中分析不同细胞类型基因表达的重要工具。虽然RNA完整性、细胞计数和测序深度等因素已知会影响数据质量,但缺乏定量基准和可操作的指导方针。这一差距导致了研究设计的可变性和下游分析的不一致性。在这项研究中,我们系统地评估了23个sc/snRNA-seq数据集表达测量的定量精度和准确性,这些数据集包括来自339个样本的3,682,576个细胞。使用基于从次抽样创建的伪批量的技术重复来评估精度。使用样本匹配的scRNA-seq和来自四个物种的单核吞噬细胞的池细胞RNA测序(RNA-seq)数据来评估准确性。我们的研究结果表明,在单细胞水平上,精度和准确性普遍较低,可重复性受到细胞计数和RNA质量的强烈影响。我们建立了数据驱动的阈值来优化研究设计,建议每个个体每种细胞类型至少500个细胞,以实现可靠的定量。此外,我们发现信噪比是鉴定可重复差异表达基因的关键指标。为了支持未来的研究,我们开发了单细胞基因表达变异性(VICE),这是一个评估sc/snRNA-seq数据质量的工具,并根据样本量、观察到的噪声水平和预期的效应大小估计差异表达结果的真阳性率。这些发现为提高sc/snRNA-seq研究的可靠性和可重复性提供了实用的、基于证据的指南。
{"title":"Precision and Accuracy in Quantitative Measurement of Gene Expression from Single-cell/nucleus RNA Sequencing Data.","authors":"Rujia Dai, Ming Zhang, Tianyao Chu, Richard Kopp, Chunling Zhang, Kefu Liu, Yue Wang, Xusheng Wang, Chao Chen, Chunyu Liu","doi":"10.1093/gpbjnl/qzaf077","DOIUrl":"10.1093/gpbjnl/qzaf077","url":null,"abstract":"<p><p>Single-cell RNA sequencing (scRNA-seq) and single-nucleus RNA sequencing (snRNA-seq) have become essential tools for profiling gene expression across different cell types in biomedical research. While factors like RNA integrity, cell count, and sequencing depth are known to influence data quality, quantitative benchmarks and actionable guidelines are lacking. This gap contributes to variability in study designs and inconsistencies in downstream analyses. In this study, we systematically evaluated quantitative precision and accuracy in expression measures across 23 sc/snRNA-seq datasets comprising 3,682,576 cells from 339 samples. Precision was assessed using technical replicates based on pseudo-bulks created from subsampling. Accuracy was evaluated using sample-matched scRNA-seq and pooled-cell RNA sequencing data of mononuclear phagocytes from four species. Our results show that precision and accuracy are generally low at the single-cell level, with reproducibility being strongly influenced by cell count and RNA quality. We established data-driven thresholds for optimizing study design, recommending at least 500 cells per cell type per individual to achieve reliable quantification. Furthermore, we showed that signal-to-noise ratio is a key metric for identifying reproducible differentially expressed genes. To support future research, we developed Variability In single-Cell gene Expression (VICE), a tool that evaluates sc/snRNA-seq data quality and estimates the true positive rate of differential expression results based on sample size, observed noise levels, and expected effect size. These findings provide practical, evidence-based guidelines to enhance the reliability and reproducibility of sc/snRNA-seq studies.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12603356/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144983994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comprehensive Multi-omics Analysis of Regulatory Variants for Body Weight in Cattle. 牛体重调节变异的综合多组学分析。
IF 7.9 Pub Date : 2025-09-22 DOI: 10.1093/gpbjnl/qzaf067
Qunhao Niu 牛群皓, Jiayuan Wu 武嘉远, Tianyi Wu 吴天弋, Tianliu Zhang 张天留, Tianzhen Wang 王添祯, Xu Zheng 郑旭, Zhida Zhao 赵志达, Ling Xu 徐玲, Zezhao Wang 王泽昭, Bo Zhu 朱波, Lupei Zhang 张路培, Huijiang Gao 高会江, George E Liu, Junya Li 李俊雅, Lingyang Xu 徐凌洋

Body weight is a polygenic trait with intricate inheritance patterns. Functional genomics enriched with multi-layer annotations offers essential resources for exploring the genetic architecture of complex traits. In this study, we conducted an extensive characterization of regulatory variants associated with body weight-related traits in cattle using multi-omics analysis. First, we identified seven candidate genes by integrating selective sweep analysis and multiple genome-wide association study (GWAS) strategies using imputed whole-genome sequencing data from a population of 1577 individuals. Subsequently, we uncovered 3340 eGenes (genes whose expression levels are associated with genetic variants) across 227 muscle samples. Transcriptome-wide association studies (TWASs) further revealed a total of 532 distinct candidate genes associated with body weight-related traits. Colocalization analyses unveiled 44 genes shared between expression quantitative trait loci (eQTLs) and GWAS signals. Moreover, a comprehensive analysis by integrating GWAS, selective sweep, eQTL, TWAS, epigenomic profiling, and molecular validation highlighted a positively selected genomic region on Bos taurus autosome 6 (BTA6). This locus harbors pleiotropic genes (LAP3, MED28, and NCAPG) and a prioritized functional variant involved in the complex regulation of body weight. Additionally, convergent evolution analysis and phenome-wide association studies underscored the conservation of this locus across species. Our study provides a comprehensive understanding of the genetic regulation of body weight through multi-omics analysis in cattle. Our findings contribute to unraveling the genetic mechanisms governing weight-related traits and shed valuable light on the genetic improvement of farm animals.

体重是一种具有复杂遗传模式的多基因性状。功能基因组学丰富的多层注释为探索复杂性状的遗传结构提供了必要的资源。在这项研究中,我们采用多组学方法对牛体重相关性状的调控变异进行了广泛的表征。首先,研究人员通过对1577个个体的选择性扫描和多个全基因组关联研究(GWAS)策略进行综合分析,确定了7个候选基因。随后,我们在227个肌肉样本中发现了3340个eGenes[在表达数量性状位点(eQTL)研究中,其表达水平与遗传变异显著相关的基因]。转录组全关联研究(TWAS)进一步揭示了总共532个与体重性状相关的不同候选基因。此外,共定位分析揭示了44个基因在eqtl和GWAS之间共享。此外,我们的综合分析突出了一个多效基因(LAP3、MED28和NCAPG)正选择的目标基因组区域,并通过整合GWAS、选择性扫描、eQTL、TWAS以及表观基因组分析和分子验证,在牛6常染色体(BTA6)上确定了一个优先功能变异,该变异对体重有复杂的调节作用。此外,趋同进化分析和全现象关联研究强调了基因座在物种间的保守性。本研究通过多组学分析,对牛体重相关性状的遗传调控有了全面的了解。我们的发现有助于揭示控制体重相关性状的遗传机制,并为农场动物的遗传改良提供有价值的启示。
{"title":"Comprehensive Multi-omics Analysis of Regulatory Variants for Body Weight in Cattle.","authors":"Qunhao Niu 牛群皓, Jiayuan Wu 武嘉远, Tianyi Wu 吴天弋, Tianliu Zhang 张天留, Tianzhen Wang 王添祯, Xu Zheng 郑旭, Zhida Zhao 赵志达, Ling Xu 徐玲, Zezhao Wang 王泽昭, Bo Zhu 朱波, Lupei Zhang 张路培, Huijiang Gao 高会江, George E Liu, Junya Li 李俊雅, Lingyang Xu 徐凌洋","doi":"10.1093/gpbjnl/qzaf067","DOIUrl":"10.1093/gpbjnl/qzaf067","url":null,"abstract":"<p><p>Body weight is a polygenic trait with intricate inheritance patterns. Functional genomics enriched with multi-layer annotations offers essential resources for exploring the genetic architecture of complex traits. In this study, we conducted an extensive characterization of regulatory variants associated with body weight-related traits in cattle using multi-omics analysis. First, we identified seven candidate genes by integrating selective sweep analysis and multiple genome-wide association study (GWAS) strategies using imputed whole-genome sequencing data from a population of 1577 individuals. Subsequently, we uncovered 3340 eGenes (genes whose expression levels are associated with genetic variants) across 227 muscle samples. Transcriptome-wide association studies (TWASs) further revealed a total of 532 distinct candidate genes associated with body weight-related traits. Colocalization analyses unveiled 44 genes shared between expression quantitative trait loci (eQTLs) and GWAS signals. Moreover, a comprehensive analysis by integrating GWAS, selective sweep, eQTL, TWAS, epigenomic profiling, and molecular validation highlighted a positively selected genomic region on Bos taurus autosome 6 (BTA6). This locus harbors pleiotropic genes (LAP3, MED28, and NCAPG) and a prioritized functional variant involved in the complex regulation of body weight. Additionally, convergent evolution analysis and phenome-wide association studies underscored the conservation of this locus across species. Our study provides a comprehensive understanding of the genetic regulation of body weight through multi-omics analysis in cattle. Our findings contribute to unraveling the genetic mechanisms governing weight-related traits and shed valuable light on the genetic improvement of farm animals.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12701805/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144877545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Duckweed Evolution: from Land back to Water. 浮萍的进化:从陆地回到水中。
IF 7.9 Pub Date : 2025-09-22 DOI: 10.1093/gpbjnl/qzaf074
Yang Fang 方扬, Xueping Tian 田雪平, Yanling Jin 靳艳玲, Anping Du 杜安平, Yanqiang Ding 丁彦强, Zhihua Liao 廖志华, Kaize He 何开泽, Yonggui Zhao 赵永贵, Ling Guo 郭铃, Yao Xiao 肖瑶, Yaliang Xu 许亚良, Shuang Chen 陈爽, Yuqing Che 车育青, Li Tan 谭力, Songhu Wang 汪松虎, Jiatang Li 李家堂, Zhuolin Yi 易卓林, Lanchai Chen 陈兰钗, Leyi Zhao 赵乐伊, Fangyuan Zhang 张芳源, Guoyou Li 李国友, Jinmeng Li 李瑾萌, Qinli Xiong 熊勤犁, Yongmei Zhang 张咏梅, Qing Zhang 张庆, Xuan Hieu Cao, Hai Zhao 赵海

Terrestrialization is an important evolutionary process that plants experienced. However, little is known about how land plants acquired aquatic growth behaviors. Here, we integrate multiproxy evidence to elucidate the evolution of the aquatic plant duckweed. Three genera of duckweeds show chronologically gradual degeneration in root structure and stomatal function and a decrease in lignocellulose content, accompanied by the contraction of relevant gene families and/or a decline in their transcription levels. The number of genes in main phytohormone pathways is also gradually decreased. The coordinated action of genes involved in auxin signaling and rhizoid development causes a gradual decrease in adventitious roots. Additionally, the significant expansion of the flavonoid pathway is related to the adaptation of duckweeds to floating growth. This study reconstructs the evolutionary history of duckweeds, tracing its journey from land back to water - a reverse trajectory of early land plants.

陆地化是植物经历的一个重要进化过程。然而,关于陆地植物如何获得水生生长行为,人们知之甚少。在这里,我们整合了多代理证据来阐明水生植物浮萍的进化。3属浮萍的根结构和气孔功能随时间逐渐退化,木质纤维素含量下降,相关基因数量逐渐减少或转录下降。主要植物激素通路的基因数量也在逐渐减少。参与生长素和根状体发育的基因的共同作用导致不定根逐渐减少。类黄酮途径的显著扩张也与浮萍对漂浮生长的适应有关。本研究重建了浮萍栖息地从陆地到水中的进化历史,逆转了早期陆地植物的进化历史。
{"title":"Duckweed Evolution: from Land back to Water.","authors":"Yang Fang 方扬, Xueping Tian 田雪平, Yanling Jin 靳艳玲, Anping Du 杜安平, Yanqiang Ding 丁彦强, Zhihua Liao 廖志华, Kaize He 何开泽, Yonggui Zhao 赵永贵, Ling Guo 郭铃, Yao Xiao 肖瑶, Yaliang Xu 许亚良, Shuang Chen 陈爽, Yuqing Che 车育青, Li Tan 谭力, Songhu Wang 汪松虎, Jiatang Li 李家堂, Zhuolin Yi 易卓林, Lanchai Chen 陈兰钗, Leyi Zhao 赵乐伊, Fangyuan Zhang 张芳源, Guoyou Li 李国友, Jinmeng Li 李瑾萌, Qinli Xiong 熊勤犁, Yongmei Zhang 张咏梅, Qing Zhang 张庆, Xuan Hieu Cao, Hai Zhao 赵海","doi":"10.1093/gpbjnl/qzaf074","DOIUrl":"10.1093/gpbjnl/qzaf074","url":null,"abstract":"<p><p>Terrestrialization is an important evolutionary process that plants experienced. However, little is known about how land plants acquired aquatic growth behaviors. Here, we integrate multiproxy evidence to elucidate the evolution of the aquatic plant duckweed. Three genera of duckweeds show chronologically gradual degeneration in root structure and stomatal function and a decrease in lignocellulose content, accompanied by the contraction of relevant gene families and/or a decline in their transcription levels. The number of genes in main phytohormone pathways is also gradually decreased. The coordinated action of genes involved in auxin signaling and rhizoid development causes a gradual decrease in adventitious roots. Additionally, the significant expansion of the flavonoid pathway is related to the adaptation of duckweeds to floating growth. This study reconstructs the evolutionary history of duckweeds, tracing its journey from land back to water - a reverse trajectory of early land plants.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12707978/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144983989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MedImg: An Integrated Database for Public Medical Images. MedImg:公共医学图像集成数据库。
IF 7.9 Pub Date : 2025-09-22 DOI: 10.1093/gpbjnl/qzaf068
Bitao Zhong, Rui Fan, Yue Ma, Xiangwen Ji, Qinghua Cui, Chunmei Cui

The advancements in deep learning algorithms for medical image analysis have garnered significant attention in recent years. While several studies have shown promising results, with models achieving or even surpassing human performance, translating these advancements into clinical practice is still accompanied by various challenges. A primary obstacle lies in the availability of large-scale, well-characterized datasets for validating the generalization of approaches. To address this challenge, we curated a diverse collection of medical image datasets from multiple public sources, containing 105 datasets and a total of 1,995,671 images. These images span 14 modalities, including X-ray, computed tomography, magnetic resonance imaging, optical coherence tomography, ultrasound, and endoscopy, and originate from 13 organs, such as the lung, brain, eye, and heart. Subsequently, we constructed an online database, MedImg, which incorporates and systematically organizes these medical images to facilitate data accessibility. MedImg serves as an intuitive and open-access platform for facilitating research in deep learning-based medical image analysis, accessible at https://www.cuilab.cn/medimg/.

近年来,医学图像分析中深度学习算法的进步引起了人们的极大关注。虽然一些研究显示出有希望的结果,模型达到甚至超过了人类的表现,但将这些进步转化为临床实践仍然伴随着各种挑战。主要障碍在于验证方法泛化的大规模、特征良好的数据集的可用性。为了应对这一挑战,我们从多个公共来源策划了一个不同的医学图像数据集,包含105个数据集和总共1,995,671张图像。这些图像跨越14种方式,包括x射线、计算机断层扫描、磁共振成像、光学相干断层扫描、超声波和内窥镜检查,来自13个器官,如肺、脑、眼和心。随后,我们构建了一个在线数据库MedImg,将这些医学图像整合并系统地组织起来,以方便数据的可访问性。MedImg是一个直观的开放访问平台,用于促进基于深度学习的医学图像分析研究,可访问https://www.cuilab.cn/medimg/。
{"title":"MedImg: An Integrated Database for Public Medical Images.","authors":"Bitao Zhong, Rui Fan, Yue Ma, Xiangwen Ji, Qinghua Cui, Chunmei Cui","doi":"10.1093/gpbjnl/qzaf068","DOIUrl":"10.1093/gpbjnl/qzaf068","url":null,"abstract":"<p><p>The advancements in deep learning algorithms for medical image analysis have garnered significant attention in recent years. While several studies have shown promising results, with models achieving or even surpassing human performance, translating these advancements into clinical practice is still accompanied by various challenges. A primary obstacle lies in the availability of large-scale, well-characterized datasets for validating the generalization of approaches. To address this challenge, we curated a diverse collection of medical image datasets from multiple public sources, containing 105 datasets and a total of 1,995,671 images. These images span 14 modalities, including X-ray, computed tomography, magnetic resonance imaging, optical coherence tomography, ultrasound, and endoscopy, and originate from 13 organs, such as the lung, brain, eye, and heart. Subsequently, we constructed an online database, MedImg, which incorporates and systematically organizes these medical images to facilitate data accessibility. MedImg serves as an intuitive and open-access platform for facilitating research in deep learning-based medical image analysis, accessible at https://www.cuilab.cn/medimg/.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12558383/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144984001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implication of the Vaginal Microbiome in Female Infertility and Assisted Conception Outcomes. 阴道微生物组在女性不孕症和辅助受孕结果中的意义。
IF 7.9 Pub Date : 2025-09-22 DOI: 10.1093/gpbjnl/qzaf042
Xiuju Chen 陈秀菊, Yanyu Sui 隋彦禹, Jiayi Gu 顾佳怡, Liang Wang 王亮, Ningxia Sun 孙宁霞

The rise in infertility rates has prompted research into the impact of vaginal microbiota on female fertility and the success of assisted reproductive technology (ART). Our study aimed to compare the vaginal microbiome between fertile and infertile women and explore its influence on ART outcomes. Vaginal secretion samples were collected from 194 infertile women and 100 healthy controls at Shanghai Changzheng Hospital. The V3-V4 region of the 16S rRNA gene was amplified using polymerase chain reaction (PCR). A machine learning model was applied to predict infertility based on genus-level abundance, and the PICRUSt algorithm was employed to predict metabolic pathways related to infertility and ART outcomes. The results showed that infertile women exhibited a significantly different vaginal microbial composition compared to healthy controls, along with increased microbial diversity. Notably, the abundance of Burkholderia, Pseudomonas, and Prevotella was significantly elevated in the vaginal microbiota of the infertility group, while that of Bifidobacterium and Lactobacillus was reduced. Among infertile women, those with recurrent implantation failure (RIF) showed even higher vaginal microbial diversity, with specific genera such as Mobiluncus, Peptoniphilus, Prevotella, and Varibaculum being more abundant. Eleven metabolic pathways were identified to be associated with both RIF and infertility, with Prevotella showing stronger correlations with these pathways. This study elucidates differences in vaginal microbiome between healthy and infertile women, providing novel insights into how vaginal microbiota may impact infertility and ART outcomes. Our findings underscore the importance of specific microbial taxa in women with RIF, suggesting potential avenues for targeted interventions to improve embryo transplantation success rates.

不孕不育率的上升促使人们开始研究阴道微生物群对女性生育能力和辅助生殖技术(ART)成功的影响。我们的研究比较了生育和不育妇女的阴道微生物组,并探讨了其对抗逆转录病毒治疗结果的影响。采用聚合酶链反应(PCR)扩增16S rRNA V3-V4区,分析了上海长征医院194名不孕妇女和100名健康对照者的阴道分泌物。机器学习模型基于属丰度预测不孕症,PICRUSt算法预测与不孕症和ART结果相关的代谢途径。结果显示,与健康女性相比,不孕症女性的阴道微生物组成明显不同,不孕症组的微生物多样性更高。不孕症组阴道菌群中的伯克霍尔德菌、假单胞菌和普雷沃氏菌水平显著升高,而双歧杆菌和乳杆菌丰度降低。在不育人群中,复发性植入失败(RIF)显示出更高的阴道微生物群多样性,Mobiluncus、Peptoniphilus、Prevotella和Varibaculum等特定属更为丰富。11种代谢途径与RIF和不孕症相关,其中普雷沃氏菌表现出更强的相关性。目前的研究提供了健康和不孕女性阴道微生物组差异的见解,提供了阴道微生物群如何影响不孕和ART结果的新理解。我们的研究结果强调了特定微生物类群在RIF妇女中的重要性,为有针对性的干预措施提供了途径,以提高胚胎移植成功率。
{"title":"Implication of the Vaginal Microbiome in Female Infertility and Assisted Conception Outcomes.","authors":"Xiuju Chen 陈秀菊, Yanyu Sui 隋彦禹, Jiayi Gu 顾佳怡, Liang Wang 王亮, Ningxia Sun 孙宁霞","doi":"10.1093/gpbjnl/qzaf042","DOIUrl":"10.1093/gpbjnl/qzaf042","url":null,"abstract":"<p><p>The rise in infertility rates has prompted research into the impact of vaginal microbiota on female fertility and the success of assisted reproductive technology (ART). Our study aimed to compare the vaginal microbiome between fertile and infertile women and explore its influence on ART outcomes. Vaginal secretion samples were collected from 194 infertile women and 100 healthy controls at Shanghai Changzheng Hospital. The V3-V4 region of the 16S rRNA gene was amplified using polymerase chain reaction (PCR). A machine learning model was applied to predict infertility based on genus-level abundance, and the PICRUSt algorithm was employed to predict metabolic pathways related to infertility and ART outcomes. The results showed that infertile women exhibited a significantly different vaginal microbial composition compared to healthy controls, along with increased microbial diversity. Notably, the abundance of Burkholderia, Pseudomonas, and Prevotella was significantly elevated in the vaginal microbiota of the infertility group, while that of Bifidobacterium and Lactobacillus was reduced. Among infertile women, those with recurrent implantation failure (RIF) showed even higher vaginal microbial diversity, with specific genera such as Mobiluncus, Peptoniphilus, Prevotella, and Varibaculum being more abundant. Eleven metabolic pathways were identified to be associated with both RIF and infertility, with Prevotella showing stronger correlations with these pathways. This study elucidates differences in vaginal microbiome between healthy and infertile women, providing novel insights into how vaginal microbiota may impact infertility and ART outcomes. Our findings underscore the importance of specific microbial taxa in women with RIF, suggesting potential avenues for targeted interventions to improve embryo transplantation success rates.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12603359/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144047396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genomics, proteomics & bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1