首页 > 最新文献

NAR Genomics and Bioinformatics最新文献

英文 中文
The divergent intron-containing actin in sponge morphogenetic processes. 海绵形态发生过程中发散的含内含子的肌动蛋白。
IF 4 Q1 GENETICS & HEREDITY Pub Date : 2025-06-04 eCollection Date: 2025-06-01 DOI: 10.1093/nargab/lqaf071
Yulia V Lyupina, Kim I Adameyko, Vasiliy M Zubarev, Alexander V Cherkasov, Alina V Ryabova, Kirill V Mikhailov, Sergey A Golyshev, Anton V Burakov, Alexander D Finoshin, Pavel A Erokhov, Marat S Sabirov, Anna I Zhurakovskaya, Rustam H Ziganshin, Nikolai G Gornostaev, Vasilina M Ignatyuk, Aleksei M Kulikov, Victor S Mikhailov, Guzel R Gazizova, Elena I Shagimardanova, Oleg A Gusev, Ekaterina E Khrameeva, Oksana I Kravchuk

The ability of eukaryotic cells to orchestrate mechanical interactions from the subcellular to the organismal levels is mediated by their cytoskeleton. One of the key components of the eukaryotic cytoskeleton is actin, a highly conserved building block of the actin filaments, which interact with many other proteins and underlie diverse cell structures, necessary for organizing intracellular transport, phagocytosis and cell movement. Many organisms have evolved multiple actin variants, which share similar amino acid sequences but differ more dramatically at the gene level, including the presence and number of introns. In the current study, we show that the intron-containing and intronless actin genes are present in the poriferan Halisarca dujardini and that the encoded actins can perform different functions. These actins differ in the gene expression profiles, post-translational modifications, cellular, and subcellular localizations. The intronless actin genes of H. dujardini, HdA1/2/3, are products of recent duplications, exhibit low divergence between paralogs, and serve as the primary cytoskeletal actins. The divergent intron-containing actin gene, HdA6, is differentially expressed in a specific cell lineage and its expression is dependent on the state of cell aggregation, which indicates its unique functions in the morphogenetic processes of the sponge.

真核细胞协调从亚细胞到有机体水平的机械相互作用的能力是由它们的细胞骨架介导的。肌动蛋白是真核细胞骨架的关键成分之一,是肌动蛋白丝的一个高度保守的组成部分,它与许多其他蛋白质相互作用,构成多种细胞结构的基础,是组织细胞内运输、吞噬和细胞运动所必需的。许多生物体已经进化出多种肌动蛋白变体,它们具有相似的氨基酸序列,但在基因水平上差异更大,包括内含子的存在和数量。在目前的研究中,我们发现含有内含子和不含内含子的肌动蛋白基因存在于多孔菌Halisarca dujardini中,并且编码的肌动蛋白可以执行不同的功能。这些作用蛋白在基因表达谱、翻译后修饰、细胞和亚细胞定位方面有所不同。H. dujardini的无内含子肌动蛋白基因HdA1/2/3是近期复制的产物,在相似物之间表现出较低的差异,是主要的细胞骨架肌动蛋白。含有内含子的分化型肌动蛋白基因HdA6在特定细胞系中有差异表达,其表达依赖于细胞聚集状态,这表明其在海绵形态发生过程中的独特功能。
{"title":"The divergent intron-containing actin in sponge morphogenetic processes.","authors":"Yulia V Lyupina, Kim I Adameyko, Vasiliy M Zubarev, Alexander V Cherkasov, Alina V Ryabova, Kirill V Mikhailov, Sergey A Golyshev, Anton V Burakov, Alexander D Finoshin, Pavel A Erokhov, Marat S Sabirov, Anna I Zhurakovskaya, Rustam H Ziganshin, Nikolai G Gornostaev, Vasilina M Ignatyuk, Aleksei M Kulikov, Victor S Mikhailov, Guzel R Gazizova, Elena I Shagimardanova, Oleg A Gusev, Ekaterina E Khrameeva, Oksana I Kravchuk","doi":"10.1093/nargab/lqaf071","DOIUrl":"10.1093/nargab/lqaf071","url":null,"abstract":"<p><p>The ability of eukaryotic cells to orchestrate mechanical interactions from the subcellular to the organismal levels is mediated by their cytoskeleton. One of the key components of the eukaryotic cytoskeleton is actin, a highly conserved building block of the actin filaments, which interact with many other proteins and underlie diverse cell structures, necessary for organizing intracellular transport, phagocytosis and cell movement. Many organisms have evolved multiple actin variants, which share similar amino acid sequences but differ more dramatically at the gene level, including the presence and number of introns. In the current study, we show that the intron-containing and intronless actin genes are present in the poriferan <i>Halisarca dujardini</i> and that the encoded actins can perform different functions. These actins differ in the gene expression profiles, post-translational modifications, cellular, and subcellular localizations. The intronless actin genes of <i>H. dujardini</i>, <i>HdA1/2/3</i>, are products of recent duplications, exhibit low divergence between paralogs, and serve as the primary cytoskeletal actins. The divergent intron-containing actin gene, <i>HdA6</i>, is differentially expressed in a specific cell lineage and its expression is dependent on the state of cell aggregation, which indicates its unique functions in the morphogenetic processes of the sponge.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 2","pages":"lqaf071"},"PeriodicalIF":4.0,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12146513/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144259053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KILDA: identifying KIV-2 repeats from kmers. KILDA:从kmers中鉴定KIV-2重复序列。
IF 4 Q1 GENETICS & HEREDITY Pub Date : 2025-05-30 eCollection Date: 2025-06-01 DOI: 10.1093/nargab/lqaf070
Corentin Molitor, Timothy Labidi, Antoine Rimbert, Bertrand Cariou, Mathilde Di Filippo, Claire Bardel

High concentration of lipoprotein(a) [Lp(a)], a lipoprotein with proatherogenic properties, is an important risk factor for cardiovascular disease. This concentration is mostly genetically determined by a complex interplay between the number of kringle IV type 2 repeats and Lp(a)-affecting variants. Besides Lp(a) plasma concentration, there is an unmet need to identify individuals most at risk based on their LPA genotype. We developed KILDA (KIv2 Length Determined from a kmer Analysis), a Nextflow pipeline, to identify the number of kringle IV type 2 repeats and Lp(a)-affecting variants directly from kmers generated from FASTQ files. The pipeline was tested on the 1000 Genomes Project (n = 2459) and results were equivalent to DRAGEN-LPA (R 2= 0.92). In silico datasets proved the robustness of KILDA's predictions under different scenarios of sequencing coverage and quality. In brief, KILDA is a robust, open-source, and free-to-use pipeline that can identify the number of kringle IV type 2 repeats and Lp(a)-associated variants even when inputting low-coverage libraries.

高浓度脂蛋白(a) [Lp(a)]是一种具有致动脉粥样硬化特性的脂蛋白,是心血管疾病的重要危险因素。这种浓度主要是由kringle IV 2型重复序列数量和Lp(a)影响变异之间的复杂相互作用决定的。除了Lp(a)血浆浓度外,根据LPA基因型确定高危个体的需求尚未得到满足。我们开发了KILDA (KIv2 Length Determined from a kmer Analysis),这是Nextflow的一个管道,用于直接从FASTQ文件生成的kmers中识别kringle IV型2重复序列和Lp(a)影响变异的数量。该管道在1000基因组计划(n = 2459)中进行了测试,结果与DRAGEN-LPA相当(r2 = 0.92)。在硅数据集证明了KILDA的预测在不同的测序覆盖率和质量情况下的稳健性。简而言之,KILDA是一个强大的、开源的、免费使用的管道,即使在输入低覆盖率的库时,也可以识别kringle IV型2重复和Lp(a)相关变体的数量。
{"title":"KILDA: identifying KIV-2 repeats from kmers.","authors":"Corentin Molitor, Timothy Labidi, Antoine Rimbert, Bertrand Cariou, Mathilde Di Filippo, Claire Bardel","doi":"10.1093/nargab/lqaf070","DOIUrl":"10.1093/nargab/lqaf070","url":null,"abstract":"<p><p>High concentration of lipoprotein(a) [Lp(a)], a lipoprotein with proatherogenic properties, is an important risk factor for cardiovascular disease. This concentration is mostly genetically determined by a complex interplay between the number of kringle IV type 2 repeats and Lp(a)-affecting variants. Besides Lp(a) plasma concentration, there is an unmet need to identify individuals most at risk based on their <i>LPA</i> genotype. We developed KILDA (KIv2 Length Determined from a kmer Analysis), a Nextflow pipeline, to identify the number of kringle IV type 2 repeats and Lp(a)-affecting variants directly from kmers generated from FASTQ files. The pipeline was tested on the 1000 Genomes Project (<i>n</i> = 2459) and results were equivalent to DRAGEN-LPA (<i>R</i> <sup>2</sup>= 0.92). <i>In silico</i> datasets proved the robustness of KILDA's predictions under different scenarios of sequencing coverage and quality. In brief, KILDA is a robust, open-source, and free-to-use pipeline that can identify the number of kringle IV type 2 repeats and Lp(a)-associated variants even when inputting low-coverage libraries.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 2","pages":"lqaf070"},"PeriodicalIF":4.0,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12123407/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144200253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to 'Current state and future prospects of Horizontal Gene Transfer detection'. 修正“水平基因转移检测的现状和未来展望”。
IF 2.8 Q1 GENETICS & HEREDITY Pub Date : 2025-05-30 eCollection Date: 2025-06-01 DOI: 10.1093/nargab/lqaf078

[This corrects the article DOI: 10.1093/nar/lqaf005.].

[这更正了文章DOI: 10.1093/nar/lqaf005.]。
{"title":"Correction to 'Current state and future prospects of Horizontal Gene Transfer detection'.","authors":"","doi":"10.1093/nargab/lqaf078","DOIUrl":"10.1093/nargab/lqaf078","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.1093/nar/lqaf005.].</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 2","pages":"lqaf078"},"PeriodicalIF":2.8,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12123403/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144200251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating gene expression, genomic, and phosphoproteomic data to infer transcription factor activity in lung cancer. 整合基因表达、基因组和磷蛋白组学数据推断肺癌中转录因子的活性。
IF 4 Q1 GENETICS & HEREDITY Pub Date : 2025-05-30 eCollection Date: 2025-06-01 DOI: 10.1093/nargab/lqaf068
Chiara Carrino, Gerardo Pepe, Luca Parca, Manuela Helmer-Citterich, Pier Federico Gherardini

Transcription factors (TFs) are key regulators of cellular gene expression programs in health and disease. Here we set out to integrate genomic, transcriptomic, and phosphoproteomic data to characterize TF activity in lung adenocarcinoma patients. Using expression data from patient samples and genomic information on TF binding to super-enhancers, starting from a list of 1667 human TFs we calculated a patient-specific activity score and identified 34 with perturbed activity in the cancer samples, as evidenced by the expression of their direct targets. We then leveraged phosphoproteomic data on the same samples to identify phosphorylation events that modulate TF activity. This novel data integration approach to TF characterization led to the identification of ERG as a key regulator in lung adenocarcinoma whose activity strongly correlates with patient survival.

转录因子是健康和疾病中细胞基因表达程序的关键调控因子。在这里,我们着手整合基因组学、转录组学和磷酸化蛋白质组学数据,以表征肺腺癌患者的TF活性。利用来自患者样本的表达数据和TF与超级增强子结合的基因组信息,从1667个人类TF列表中开始,我们计算了患者特异性活性评分,并确定了34个在癌症样本中活性紊乱的TF,这可以通过其直接靶点的表达来证明。然后,我们利用相同样品的磷酸化蛋白质组学数据来确定调节TF活性的磷酸化事件。这种针对TF特征的新颖数据整合方法确定了ERG是肺腺癌的关键调节因子,其活性与患者生存密切相关。
{"title":"Integrating gene expression, genomic, and phosphoproteomic data to infer transcription factor activity in lung cancer.","authors":"Chiara Carrino, Gerardo Pepe, Luca Parca, Manuela Helmer-Citterich, Pier Federico Gherardini","doi":"10.1093/nargab/lqaf068","DOIUrl":"10.1093/nargab/lqaf068","url":null,"abstract":"<p><p>Transcription factors (TFs) are key regulators of cellular gene expression programs in health and disease. Here we set out to integrate genomic, transcriptomic, and phosphoproteomic data to characterize TF activity in lung adenocarcinoma patients. Using expression data from patient samples and genomic information on TF binding to super-enhancers, starting from a list of 1667 human TFs we calculated a patient-specific activity score and identified 34 with perturbed activity in the cancer samples, as evidenced by the expression of their direct targets. We then leveraged phosphoproteomic data on the same samples to identify phosphorylation events that modulate TF activity. This novel data integration approach to TF characterization led to the identification of ERG as a key regulator in lung adenocarcinoma whose activity strongly correlates with patient survival.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 2","pages":"lqaf068"},"PeriodicalIF":4.0,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12123410/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144200252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VariantFoldRNA: a flexible, containerized, and scalable pipeline for genome-wide riboSNitch prediction. VariantFoldRNA:一个灵活的,容器化的,可扩展的全基因组riboSNitch预测管道。
IF 4 Q1 GENETICS & HEREDITY Pub Date : 2025-05-29 eCollection Date: 2025-06-01 DOI: 10.1093/nargab/lqaf066
Kobie J Kirven, Philip C Bevilacqua, Sarah M Assmann

Single nucleotide polymorphisms (SNPs) can alter RNA structure by changing the proportions of existing conformations or leading to new conformations in the structural ensemble. Such structure-changing SNPs, or riboSNitches, have been associated with diseases in humans and climate adaptation in plants. While several computational tools are available for predicting whether an SNP is a riboSNitch, these tools were generally developed to analyze individual RNAs and are not optimized for genome-wide analyses. To fill this gap, we developed VariantFoldRNA, a flexible, containerized, and automated pipeline for genome-wide prediction of riboSNitches. Our pipeline automatically installs all dependencies, can be run locally or on high-performance clusters, and is modular, enabling the user to customize the analysis for the research question of interest. VariantFoldRNA can predict riboSNitches genome-wide at user-specified temperatures and splicing conditions, opening the door to novel analyses. The pipeline is an open-source command-line tool that is freely available at https://github.com/The-Bevilacqua-Lab/variantfoldrna.

单核苷酸多态性(SNPs)可以通过改变现有构象的比例或在结构集合中导致新的构象来改变RNA结构。这种改变结构的snp或ribosnitch与人类疾病和植物的气候适应有关。虽然有几种计算工具可用于预测SNP是否为riboSNitch,但这些工具通常用于分析单个rna,并没有针对全基因组分析进行优化。为了填补这一空白,我们开发了VariantFoldRNA,这是一种灵活的、容器化的、自动化的riboSNitches全基因组预测管道。我们的管道自动安装所有依赖项,可以在本地或高性能集群上运行,并且是模块化的,使用户能够针对感兴趣的研究问题定制分析。VariantFoldRNA可以在用户指定的温度和剪接条件下预测全基因组的ribohnich,为新的分析打开了大门。该管道是一个开源命令行工具,可以在https://github.com/The-Bevilacqua-Lab/variantfoldrna上免费获得。
{"title":"VariantFoldRNA: a flexible, containerized, and scalable pipeline for genome-wide riboSNitch prediction.","authors":"Kobie J Kirven, Philip C Bevilacqua, Sarah M Assmann","doi":"10.1093/nargab/lqaf066","DOIUrl":"10.1093/nargab/lqaf066","url":null,"abstract":"<p><p>Single nucleotide polymorphisms (SNPs) can alter RNA structure by changing the proportions of existing conformations or leading to new conformations in the structural ensemble. Such structure-changing SNPs, or riboSNitches, have been associated with diseases in humans and climate adaptation in plants. While several computational tools are available for predicting whether an SNP is a riboSNitch, these tools were generally developed to analyze individual RNAs and are not optimized for genome-wide analyses. To fill this gap, we developed VariantFoldRNA, a flexible, containerized, and automated pipeline for genome-wide prediction of riboSNitches. Our pipeline automatically installs all dependencies, can be run locally or on high-performance clusters, and is modular, enabling the user to customize the analysis for the research question of interest. VariantFoldRNA can predict riboSNitches genome-wide at user-specified temperatures and splicing conditions, opening the door to novel analyses. The pipeline is an open-source command-line tool that is freely available at https://github.com/The-Bevilacqua-Lab/variantfoldrna.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 2","pages":"lqaf066"},"PeriodicalIF":4.0,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12121482/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144182273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEIVariantFilter: a bioinformatics tool to speed up genetic diagnosis of inborn errors of immunity patients. IEIVariantFilter:一种生物信息学工具,可加快免疫患者先天性错误的遗传诊断。
IF 2.8 Q1 GENETICS & HEREDITY Pub Date : 2025-05-28 eCollection Date: 2025-06-01 DOI: 10.1093/nargab/lqaf069
Juan Pereda, Rafael Espinosa, Blanca García-Solís, Teresa Guerra-Galán, Ana Van-Den-Rym, Meltem Ece Kars, Rocío Mena, Victor Galán, Ana de Andrés-Martín, Carlos Rodríguez-Gallego, Alberto López-Lera, Fernando Corvillo, Antonio Pérez-Martínez, Eduardo López-Collazo, Silvia Sánchez-Ramón, Rubén Martínez-Barricarte, Lluis Quintana-Murci, José Miguel Lorenzo-Salazar, Yuval Itan, Carlos Flores, Rebeca Pérez-de-Diego

Severe infectious diseases remain the leading cause of death in children and young adults worldwide. Monogenic inborn errors of immunity (IEIs) are traditionally defined as a heterogeneous group of rare inborn genetic diseases affecting the functioning of the immune system. Greater awareness has led to the clinical definition of 485 monogenic IEIs and whole exome sequencing (WES) is becoming increasingly relevant for IEI genetic diagnosis. The current protocol for IEI genetic studies includes manual filtering of the list of genes obtained as a WES read-out providing a short list of candidate genes. This procedure is time-consuming and can produce mistakes due to human error in manual filtering. IEIVariantFilter is a new web-based bioinformatics tool to speed up and refine the genetic diagnosis of IEI patients oriented for users in the biomedical field without needing bioinformatics expertise. IEIVariantFilter prioritizes genetic variants based on ranges of zygosity, the quality of reads, the predicted variant effect, and genes related to immunity, considering a consanguineous hypothesis whenever necessary. IEIVariantFilter facilitates gene and variant list prioritization, speeding up the identification of candidate disease-causing variants for validation by experimental studies. The software improves the genetic diagnosis of patients, thereby facilitating precision medicine and fast and proper treatment.

严重传染病仍然是全世界儿童和青年死亡的主要原因。单基因先天性免疫缺陷(IEIs)传统上被定义为影响免疫系统功能的一组罕见的先天性遗传疾病。越来越多的人认识到,485个单基因IEI的临床定义和全外显子组测序(WES)对IEI的遗传诊断越来越重要。目前IEI遗传研究的方案包括人工过滤作为WES读出的基因列表,提供候选基因的短列表。此过程非常耗时,并且可能由于人工过滤中的人为错误而产生错误。IEIVariantFilter是一种新的基于网络的生物信息学工具,用于加速和改进IEI患者的遗传诊断,面向生物医学领域的用户,无需生物信息学专业知识。IEIVariantFilter根据合子范围、reads质量、预测变异效应和与免疫相关的基因对遗传变异进行优先排序,必要时考虑近亲假设。IEIVariantFilter有助于基因和变异列表的优先排序,加快候选致病变异的识别,以便通过实验研究进行验证。该软件提高了患者的基因诊断,从而促进了精准医疗和快速适当的治疗。
{"title":"IEIVariantFilter: a bioinformatics tool to speed up genetic diagnosis of inborn errors of immunity patients.","authors":"Juan Pereda, Rafael Espinosa, Blanca García-Solís, Teresa Guerra-Galán, Ana Van-Den-Rym, Meltem Ece Kars, Rocío Mena, Victor Galán, Ana de Andrés-Martín, Carlos Rodríguez-Gallego, Alberto López-Lera, Fernando Corvillo, Antonio Pérez-Martínez, Eduardo López-Collazo, Silvia Sánchez-Ramón, Rubén Martínez-Barricarte, Lluis Quintana-Murci, José Miguel Lorenzo-Salazar, Yuval Itan, Carlos Flores, Rebeca Pérez-de-Diego","doi":"10.1093/nargab/lqaf069","DOIUrl":"10.1093/nargab/lqaf069","url":null,"abstract":"<p><p>Severe infectious diseases remain the leading cause of death in children and young adults worldwide. Monogenic inborn errors of immunity (IEIs) are traditionally defined as a heterogeneous group of rare inborn genetic diseases affecting the functioning of the immune system. Greater awareness has led to the clinical definition of 485 monogenic IEIs and whole exome sequencing (WES) is becoming increasingly relevant for IEI genetic diagnosis. The current protocol for IEI genetic studies includes manual filtering of the list of genes obtained as a WES read-out providing a short list of candidate genes. This procedure is time-consuming and can produce mistakes due to human error in manual filtering. IEIVariantFilter is a new web-based bioinformatics tool to speed up and refine the genetic diagnosis of IEI patients oriented for users in the biomedical field without needing bioinformatics expertise. IEIVariantFilter prioritizes genetic variants based on ranges of zygosity, the quality of reads, the predicted variant effect, and genes related to immunity, considering a consanguineous hypothesis whenever necessary. IEIVariantFilter facilitates gene and variant list prioritization, speeding up the identification of candidate disease-causing variants for validation by experimental studies. The software improves the genetic diagnosis of patients, thereby facilitating precision medicine and fast and proper treatment.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 2","pages":"lqaf069"},"PeriodicalIF":2.8,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12117399/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144175129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inferring gene-pathway associations from consolidated transcriptome datasets: an interactive gene network explorer for Tetrahymena thermophila. 从整合转录组数据集推断基因通路关联:嗜热四膜虫的交互式基因网络探索者。
IF 4 Q1 GENETICS & HEREDITY Pub Date : 2025-05-27 eCollection Date: 2025-06-01 DOI: 10.1093/nargab/lqaf067
Michael A Bertagna, Lydia J Bright, Fei Ye, Yu-Yang Jiang, Debolina Sarkar, Ajay Pradhan, Santosh Kumar, Shan Gao, Aaron P Turkewitz, Lev M Z Tsypin

Although an established model organism, Tetrahymena thermophila remains comparatively inaccessible to high throughput screens, and alternative bioinformatic approaches still rely on unconnected datasets and outdated algorithms. Here, we report a new approach to consolidating RNA-seq and microarray data based on a systematic exploration of parameters and computational controls, enabling us to infer functional gene associations from their co-expression patterns. To illustrate the power of this approach, we took advantage of new data regarding a previously studied pathway, the biogenesis of a secretory organelle called the mucocyst. Our untargeted clustering approach recovered over 80% of the genes that were previously verified to play a role in mucocyst biogenesis. Furthermore, we tested four new genes that we predicted to be mucocyst-associated based on their co-expression and found that knocking out each of them results in mucocyst secretion defects. We also found that our approach succeeds in clustering genes associated with several other cellular pathways that we evaluated based on prior literature. We present the Tetrahymena Gene Network Explorer (TGNE) as an interactive tool for genetic hypothesis generation and functional annotation in this organism and as a framework for building similar tools for other systems.

尽管嗜热四膜虫是一种已建立的模式生物,但高通量筛选仍然相对难以获得,而替代的生物信息学方法仍然依赖于未连接的数据集和过时的算法。在这里,我们报告了一种基于系统探索参数和计算控制来整合RNA-seq和微阵列数据的新方法,使我们能够从它们的共表达模式推断功能基因关联。为了说明这种方法的力量,我们利用了关于先前研究途径的新数据,即称为粘液囊的分泌细胞器的生物发生。我们的非靶向聚类方法恢复了80%以上的基因,这些基因以前被证实在粘液囊肿生物发生中起作用。此外,我们测试了四个新的基因,根据它们的共表达,我们预测它们与粘液囊肿相关,并发现敲除它们中的每一个都会导致粘液囊肿分泌缺陷。我们还发现,我们的方法成功地聚类了与其他几种细胞通路相关的基因,我们基于先前的文献对这些基因进行了评估。我们提出了四膜虫基因网络浏览器(TGNE)作为遗传假设生成和功能注释的交互式工具,并作为为其他系统构建类似工具的框架。
{"title":"Inferring gene-pathway associations from consolidated transcriptome datasets: an interactive gene network explorer for <i>Tetrahymena thermophila</i>.","authors":"Michael A Bertagna, Lydia J Bright, Fei Ye, Yu-Yang Jiang, Debolina Sarkar, Ajay Pradhan, Santosh Kumar, Shan Gao, Aaron P Turkewitz, Lev M Z Tsypin","doi":"10.1093/nargab/lqaf067","DOIUrl":"10.1093/nargab/lqaf067","url":null,"abstract":"<p><p>Although an established model organism, <i>Tetrahymena thermophila</i> remains comparatively inaccessible to high throughput screens, and alternative bioinformatic approaches still rely on unconnected datasets and outdated algorithms. Here, we report a new approach to consolidating RNA-seq and microarray data based on a systematic exploration of parameters and computational controls, enabling us to infer functional gene associations from their co-expression patterns. To illustrate the power of this approach, we took advantage of new data regarding a previously studied pathway, the biogenesis of a secretory organelle called the mucocyst. Our untargeted clustering approach recovered over 80% of the genes that were previously verified to play a role in mucocyst biogenesis. Furthermore, we tested four new genes that we predicted to be mucocyst-associated based on their co-expression and found that knocking out each of them results in mucocyst secretion defects. We also found that our approach succeeds in clustering genes associated with several other cellular pathways that we evaluated based on prior literature. We present the <i>Tetrahymena</i> Gene Network Explorer (TGNE) as an interactive tool for genetic hypothesis generation and functional annotation in this organism and as a framework for building similar tools for other systems.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 2","pages":"lqaf067"},"PeriodicalIF":4.0,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12107436/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144162528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Methyl-Micro-C: simultaneous characterization of chromatin accessibility, interactions, and DNA methylation. 甲基微c:同时表征染色质可及性,相互作用,和DNA甲基化。
IF 4 Q1 GENETICS & HEREDITY Pub Date : 2025-05-27 eCollection Date: 2025-06-01 DOI: 10.1093/nargab/lqaf060
Leonardo Gonzalez-Smith, Claire Stevens, Huan Cao, Zexun Wu, Suhn K Rhie

Epigenomes, characterized by patterns of different signatures such as chromatin accessibility, chromatin interactions, and DNA methylation, vary across cell types and play a pivotal role in regulating gene expression. By mapping these signatures, the underlying mechanisms of development and diseases can be uncovered. However, many canonical epigenetic methods focus on mapping only one signature. Simultaneous measurement of epigenetic signatures from the same cell or tissue provides significant benefits for research, especially when resources are limited, and precise analysis is essential. Here, we report a technique called Methyl-Micro-C (MMC), which simultaneously profiles chromatin accessibility, chromatin interactions, and DNA methylation in the same sample. MMC enhances the resolution of chromatin interactions and the coverage of CpGs by combining MNase-mediated fragmentation with enzymatic conversion. This technique allows for the profiling of three-dimensional epigenomes, capturing consistent chromatin accessibility, chromatin interactions, and DNA methylation signals in an efficient manner. It is also relatively straightforward, allowing researchers to implement and apply it easily.

表观基因组以不同的特征模式为特征,如染色质可及性、染色质相互作用和DNA甲基化,在不同的细胞类型中各不相同,在调节基因表达中起着关键作用。通过绘制这些特征,可以揭示发育和疾病的潜在机制。然而,许多典型的表观遗传方法只关注一个特征的定位。同时测量来自同一细胞或组织的表观遗传特征为研究提供了显著的好处,特别是在资源有限的情况下,精确的分析是必不可少的。在这里,我们报告了一种称为甲基微c (MMC)的技术,该技术同时分析了同一样品中的染色质可及性、染色质相互作用和DNA甲基化。MMC通过结合mnase介导的断裂和酶转化,提高了染色质相互作用的分辨率和CpGs的覆盖范围。该技术允许三维表观基因组的分析,以有效的方式捕获一致的染色质可及性,染色质相互作用和DNA甲基化信号。它也相对简单,允许研究人员轻松实现和应用它。
{"title":"Methyl-Micro-C: simultaneous characterization of chromatin accessibility, interactions, and DNA methylation.","authors":"Leonardo Gonzalez-Smith, Claire Stevens, Huan Cao, Zexun Wu, Suhn K Rhie","doi":"10.1093/nargab/lqaf060","DOIUrl":"10.1093/nargab/lqaf060","url":null,"abstract":"<p><p>Epigenomes, characterized by patterns of different signatures such as chromatin accessibility, chromatin interactions, and DNA methylation, vary across cell types and play a pivotal role in regulating gene expression. By mapping these signatures, the underlying mechanisms of development and diseases can be uncovered. However, many canonical epigenetic methods focus on mapping only one signature. Simultaneous measurement of epigenetic signatures from the same cell or tissue provides significant benefits for research, especially when resources are limited, and precise analysis is essential. Here, we report a technique called Methyl-Micro-C (MMC), which simultaneously profiles chromatin accessibility, chromatin interactions, and DNA methylation in the same sample. MMC enhances the resolution of chromatin interactions and the coverage of CpGs by combining MNase-mediated fragmentation with enzymatic conversion. This technique allows for the profiling of three-dimensional epigenomes, capturing consistent chromatin accessibility, chromatin interactions, and DNA methylation signals in an efficient manner. It is also relatively straightforward, allowing researchers to implement and apply it easily.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 2","pages":"lqaf060"},"PeriodicalIF":4.0,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12107429/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144162530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TCRCluster: a novel approach to T-cell receptor latent featurization and clustering using contrastive learning-guided two-stage variational autoencoders. TCRCluster:一种使用对比学习引导的两阶段变分自编码器的t细胞受体潜在特征和聚类的新方法。
IF 4 Q1 GENETICS & HEREDITY Pub Date : 2025-05-27 eCollection Date: 2025-06-01 DOI: 10.1093/nargab/lqaf065
Yat-Tsai Richie Wan, Morten Nielsen

T cells play a vital role in adaptive immunity by targeting pathogen-infected or cancerous cells, but predicting their specificity remains challenging. Encoding T-cell receptor (TCR) sequences into informative feature spaces is therefore crucial for advancing specificity prediction and downstream applications. For this, we developed a variational autoencoder (VAE)-based model trained on paired TCR α-β chain data, incorporating all six complementarity-determining regions. A semi-supervised 'two-stage VAE' framework, integrating cosine triplet loss and a classifier, was found to further refine peptide-specific latent representations, outperforming sequence-based methods in specificity prediction. Clustering analyses leveraging our VAE latent space were evaluated using K-means, agglomerative clustering, and a novel graph-based method. Agglomerative clustering achieved the most biologically relevant results, balancing cluster purity and retention despite noise in TCR specificity annotations. We extended these insights to evaluate TCR repertoire data. Across datasets, VAE-based models outperformed sequence-based methods, particularly in retention metrics, with notable improvements in the SARS-CoV-2 repertoire dataset. Moreover, the cancer repertoire analysis highlighted the generalizability of our approach, where the model displayed high performance despite minimal similarity between the training and test data. Collectively, these results demonstrate the potential of VAE-based latent representations to offer a robust framework for prediction, clustering, and repertoire analysis.

T细胞通过靶向病原体感染或癌细胞在适应性免疫中发挥重要作用,但预测其特异性仍然具有挑战性。因此,将t细胞受体(TCR)序列编码为信息特征空间对于推进特异性预测和下游应用至关重要。为此,我们开发了一个基于变分自编码器(VAE)的模型,该模型训练成对的TCR α-β链数据,包含所有六个互补决定区域。一个半监督的“两阶段VAE”框架,整合了余弦三重态损失和分类器,进一步完善了肽特异性潜在表征,在特异性预测方面优于基于序列的方法。利用我们的VAE潜在空间的聚类分析使用K-means、聚集聚类和一种新的基于图的方法进行评估。聚集聚类获得了最具生物学相关性的结果,平衡了聚类纯度和保留性,尽管在TCR特异性注释中存在噪声。我们将这些见解扩展到评估TCR曲目数据。在所有数据集中,基于vae的模型优于基于序列的方法,特别是在保留指标方面,SARS-CoV-2曲目数据集有显着改善。此外,癌症库分析强调了我们方法的通用性,尽管训练数据和测试数据之间的相似性很小,但模型仍显示出高性能。总的来说,这些结果证明了基于vae的潜在表示的潜力,为预测、聚类和库分析提供了一个强大的框架。
{"title":"TCRCluster: a novel approach to T-cell receptor latent featurization and clustering using contrastive learning-guided two-stage variational autoencoders.","authors":"Yat-Tsai Richie Wan, Morten Nielsen","doi":"10.1093/nargab/lqaf065","DOIUrl":"10.1093/nargab/lqaf065","url":null,"abstract":"<p><p>T cells play a vital role in adaptive immunity by targeting pathogen-infected or cancerous cells, but predicting their specificity remains challenging. Encoding T-cell receptor (TCR) sequences into informative feature spaces is therefore crucial for advancing specificity prediction and downstream applications. For this, we developed a variational autoencoder (VAE)-based model trained on paired TCR α-β chain data, incorporating all six complementarity-determining regions. A semi-supervised 'two-stage VAE' framework, integrating cosine triplet loss and a classifier, was found to further refine peptide-specific latent representations, outperforming sequence-based methods in specificity prediction. Clustering analyses leveraging our VAE latent space were evaluated using <i>K</i>-means, agglomerative clustering, and a novel graph-based method. Agglomerative clustering achieved the most biologically relevant results, balancing cluster purity and retention despite noise in TCR specificity annotations. We extended these insights to evaluate TCR repertoire data. Across datasets, VAE-based models outperformed sequence-based methods, particularly in retention metrics, with notable improvements in the SARS-CoV-2 repertoire dataset. Moreover, the cancer repertoire analysis highlighted the generalizability of our approach, where the model displayed high performance despite minimal similarity between the training and test data. Collectively, these results demonstrate the potential of VAE-based latent representations to offer a robust framework for prediction, clustering, and repertoire analysis.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 2","pages":"lqaf065"},"PeriodicalIF":4.0,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12107435/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144162533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Immunopipe: a comprehensive and flexible scRNA-seq and scTCR-seq data analysis pipeline. 免疫管道:一个全面和灵活的scRNA-seq和scTCR-seq数据分析管道。
IF 2.8 Q1 GENETICS & HEREDITY Pub Date : 2025-05-19 eCollection Date: 2025-06-01 DOI: 10.1093/nargab/lqaf063
Panwen Wang, Yue Yu, Haidong Dong, Shuwen Zhang, Zhifu Sun, Hu Zeng, Patrizia Mondello, Jean-Pierre A Kocher, Junwen Wang, Yan W Asmann, Yi Lin, Ying Li

Single-cell sequencing technologies provide us with information at the level of individual cells. Combining single-cell RNA-seq and single-cell TCR-seq profiling enables the exploration of cell heterogeneity and T-cell receptor repertoires simultaneously. Integrating both types of data can play a crucial role in enhancing our understanding of T-cell-mediated immunity and, in turn, facilitate the advancement of immunotherapy. Here, we present immunopipe, a comprehensive and flexible pipeline to perform integrated analysis of scRNA-seq and scTCR-seq data. In addition to the command line tool, we provide a user-friendly web interface for pipeline configuration and execution monitoring, benefiting researchers without extensive programming experience. With its comprehensive functionality and ease of use, immunopipe empowers researchers to uncover valuable insights from scRNA-seq and scTCR-seq data, ultimately advancing the understanding of immune responses and immunotherapy development.

单细胞测序技术为我们提供了单个细胞水平的信息。结合单细胞RNA-seq和单细胞TCR-seq分析可以同时探索细胞异质性和t细胞受体谱。整合这两种类型的数据可以在增强我们对t细胞介导的免疫的理解方面发挥关键作用,进而促进免疫治疗的发展。在这里,我们提出了免疫管道,这是一个全面而灵活的管道,用于对scRNA-seq和scTCR-seq数据进行综合分析。除了命令行工具,我们还提供了一个用户友好的web界面,用于管道配置和执行监控,使没有丰富编程经验的研究人员受益。凭借其全面的功能和易用性,免疫管使研究人员能够从scRNA-seq和scTCR-seq数据中发现有价值的见解,最终促进对免疫反应和免疫治疗发展的理解。
{"title":"Immunopipe: a comprehensive and flexible scRNA-seq and scTCR-seq data analysis pipeline.","authors":"Panwen Wang, Yue Yu, Haidong Dong, Shuwen Zhang, Zhifu Sun, Hu Zeng, Patrizia Mondello, Jean-Pierre A Kocher, Junwen Wang, Yan W Asmann, Yi Lin, Ying Li","doi":"10.1093/nargab/lqaf063","DOIUrl":"10.1093/nargab/lqaf063","url":null,"abstract":"<p><p>Single-cell sequencing technologies provide us with information at the level of individual cells. Combining single-cell RNA-seq and single-cell TCR-seq profiling enables the exploration of cell heterogeneity and T-cell receptor repertoires simultaneously. Integrating both types of data can play a crucial role in enhancing our understanding of T-cell-mediated immunity and, in turn, facilitate the advancement of immunotherapy. Here, we present immunopipe, a comprehensive and flexible pipeline to perform integrated analysis of scRNA-seq and scTCR-seq data. In addition to the command line tool, we provide a user-friendly web interface for pipeline configuration and execution monitoring, benefiting researchers without extensive programming experience. With its comprehensive functionality and ease of use, immunopipe empowers researchers to uncover valuable insights from scRNA-seq and scTCR-seq data, ultimately advancing the understanding of immune responses and immunotherapy development.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 2","pages":"lqaf063"},"PeriodicalIF":2.8,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086537/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144102445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
NAR Genomics and Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1