首页 > 最新文献

Genomics & informatics最新文献

英文 中文
In silico design of a multi-epitope pan vaccine targeting Schistosoma species. 以血吸虫为靶点的多表位泛疫苗的计算机设计。
Pub Date : 2025-10-27 DOI: 10.1186/s44342-025-00053-4
Daksh Gandvi, Charmi Jyotishi, Mansi Patel, Reeshu Gupta

Schistosomiasis remains a significant public health burden, necessitating the development of effective vaccines against it. In this study, a multi-epitope subunit vaccine was designed against adenylate kinase 2 protein and evaluated for its potential to elicit protective immunity against three Schistosoma species. CTL, HTL, and B-cell epitopes were identified using immunoinformatics tools and linked using AAY and KK linkers, respectively. The 50S ribosomal protein L7/L12, a known TLR4 agonist, was incorporated as an adjuvant to enhance immune activation in the vaccine. Molecular docking and molecular dynamics (MD) simulations demonstrated a strong binding affinity between the vaccine and human TLR4, supported by low RMSD and Rg values, indicating structural stability. The negative binding energy further validated the vaccine's potential for engaging TLR4. The immunogenic profile predicted robust activation of CD4+ and CD8+ T cells, as well as neutralizing antibody responses. These findings highlight the potential of the vaccine to stimulate both cell-mediated and humoral immunity, making it a promising candidate for further experimental validation against schistosomiasis.

血吸虫病仍然是一个重大的公共卫生负担,需要开发有效的疫苗。本研究设计了一种针对腺苷酸激酶2蛋白的多表位亚单位疫苗,并评估了其对三种血吸虫产生保护性免疫的潜力。使用免疫信息学工具鉴定CTL、HTL和b细胞表位,并分别使用AAY和KK连接体进行连接。50S核糖体蛋白L7/L12是一种已知的TLR4激动剂,作为佐剂加入疫苗以增强免疫激活。分子对接和分子动力学(MD)模拟表明,疫苗与人TLR4具有较强的结合亲和力,RMSD和Rg值较低,表明其结构稳定。负结合能进一步验证了疫苗与TLR4结合的潜力。免疫原性谱预测CD4+和CD8+ T细胞的强大激活,以及中和抗体反应。这些发现强调了该疫苗刺激细胞介导和体液免疫的潜力,使其成为进一步实验验证血吸虫病的有希望的候选疫苗。
{"title":"In silico design of a multi-epitope pan vaccine targeting Schistosoma species.","authors":"Daksh Gandvi, Charmi Jyotishi, Mansi Patel, Reeshu Gupta","doi":"10.1186/s44342-025-00053-4","DOIUrl":"10.1186/s44342-025-00053-4","url":null,"abstract":"<p><p>Schistosomiasis remains a significant public health burden, necessitating the development of effective vaccines against it. In this study, a multi-epitope subunit vaccine was designed against adenylate kinase 2 protein and evaluated for its potential to elicit protective immunity against three Schistosoma species. CTL, HTL, and B-cell epitopes were identified using immunoinformatics tools and linked using AAY and KK linkers, respectively. The 50S ribosomal protein L7/L12, a known TLR4 agonist, was incorporated as an adjuvant to enhance immune activation in the vaccine. Molecular docking and molecular dynamics (MD) simulations demonstrated a strong binding affinity between the vaccine and human TLR4, supported by low RMSD and Rg values, indicating structural stability. The negative binding energy further validated the vaccine's potential for engaging TLR4. The immunogenic profile predicted robust activation of CD4+ and CD8+ T cells, as well as neutralizing antibody responses. These findings highlight the potential of the vaccine to stimulate both cell-mediated and humoral immunity, making it a promising candidate for further experimental validation against schistosomiasis.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"21"},"PeriodicalIF":0.0,"publicationDate":"2025-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12560484/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145380516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deciphering the linguistic blueprint of DNA: context-sensitive structures, statistical patterns, and regulatory implications. 解读DNA的语言蓝图:上下文敏感结构、统计模式和监管含义。
Pub Date : 2025-10-23 DOI: 10.1186/s44342-025-00052-5
Iskander Akhmetov, Timur Saparov, Volkan Duran, Alexander Pak

DNA is often described as the "language of life" because it encodes biological information using nucleotide sequences. Unlike the traditional view focused on codon-to-amino acid mapping in coding regions, the vast non-coding genome reveals complex organizational patterns resembling natural language. This paper outlines essential approaches in DNA linguistics, including formal language theory, RNA secondary structure modeling, statistical methods, and phylogenetic analysis. Additionally, recent research on Indo-European populations shows correlations between lexical and phonemic traits and asymmetrical patterns of genetic inheritance. Together, these perspectives deepen our understanding of genome regulation, evolution, and the striking parallels between genetic and linguistic systems.

DNA通常被描述为“生命的语言”,因为它使用核苷酸序列编码生物信息。与传统的关注密码子到氨基酸在编码区的定位不同,庞大的非编码基因组揭示了类似于自然语言的复杂组织模式。本文概述了DNA语言学的基本方法,包括形式语言理论,RNA二级结构建模,统计方法和系统发育分析。此外,最近对印欧语系人群的研究表明,词汇和音素特征与基因遗传的不对称模式之间存在相关性。总之,这些观点加深了我们对基因组调控、进化以及基因和语言系统之间惊人的相似之处的理解。
{"title":"Deciphering the linguistic blueprint of DNA: context-sensitive structures, statistical patterns, and regulatory implications.","authors":"Iskander Akhmetov, Timur Saparov, Volkan Duran, Alexander Pak","doi":"10.1186/s44342-025-00052-5","DOIUrl":"10.1186/s44342-025-00052-5","url":null,"abstract":"<p><p>DNA is often described as the \"language of life\" because it encodes biological information using nucleotide sequences. Unlike the traditional view focused on codon-to-amino acid mapping in coding regions, the vast non-coding genome reveals complex organizational patterns resembling natural language. This paper outlines essential approaches in DNA linguistics, including formal language theory, RNA secondary structure modeling, statistical methods, and phylogenetic analysis. Additionally, recent research on Indo-European populations shows correlations between lexical and phonemic traits and asymmetrical patterns of genetic inheritance. Together, these perspectives deepen our understanding of genome regulation, evolution, and the striking parallels between genetic and linguistic systems.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"20"},"PeriodicalIF":0.0,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12551216/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145357472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical learning methods for improving predictive performance in time-dependent survival models. 提高时间依赖生存模型预测性能的统计学习方法。
Pub Date : 2025-09-01 DOI: 10.1186/s44342-025-00050-7
Hyungwoo Seo, Wonil Chung

Background: The COVID-19 pandemic has highlighted the need for survival models to assess risk factors and time-dependent effects in infectious diseases. However, the Cox proportional hazards (PH) model, which assumes constant covariate effects, struggles to capture disease dynamics. This underscores the need for advanced models that incorporate time-dependent coefficients and covariates for improved accuracy.

Methods: To address the need for modeling time-dependent effects and covariates, we applied a stratified Cox PH model with multiple time intervals to better satisfy the PH assumption. We conducted simulations to evaluate the performance of machine learning and deep learning survival models, including random survival forest (RSF), DeepSurv, and DeepHit. To improve time-dependent effect estimation, we introduced a refined time-interval division and a weighted sum approach for integrated hazard ratios of COVID-19 variants. The event of interest was death, and the specific risk compared was the risk of death from the start of the study to either death or the last follow-up among infected versus uninfected individuals.

Results: Our results showed that increasing the number of time intervals improved predictive accuracy. When the PH assumption held, the Cox PH model outperformed machine learning and deep learning models. Applying our approach to UK Biobank data, expanding time intervals from five to fifteen enhanced performance. The previously reported hazard ratio of 7.333 for the pre-Delta period was refined to 29.359 for the Early variant, 20.734 for EU1, and 4.079 for Alpha, revealing a decline in risk across variants.

Conclusions: These findings suggest that refining time intervals improves the understanding of time-dependent effects in infectious diseases. Incorporating stratified intervals and advanced models enhances risk assessment and predictive accuracy for COVID-19 and other evolving diseases.

背景:2019冠状病毒病大流行凸显了对生存模型的需求,以评估传染病的危险因素和时间依赖性影响。然而,假设恒定协变量效应的Cox比例风险(PH)模型难以捕捉疾病动态。这强调了需要先进的模型,包括时间相关系数和协变量,以提高精度。方法:为了解决建模时间依赖效应和协变量的需要,我们采用了具有多个时间间隔的分层Cox PH模型,以更好地满足PH假设。我们进行了模拟来评估机器学习和深度学习生存模型的性能,包括随机生存森林(RSF)、DeepSurv和DeepHit。为了改进时间依赖效应估计,我们引入了一种改进的时间间隔划分和加权和方法来计算COVID-19变异的综合风险比。感兴趣的事件是死亡,比较的具体风险是从研究开始到死亡或最后一次随访中感染和未感染个体的死亡风险。结果:我们的研究结果表明,增加时间间隔的数量可以提高预测的准确性。当PH假设成立时,Cox PH模型优于机器学习和深度学习模型。将我们的方法应用于英国生物银行数据,将时间间隔从5个扩展到15个,提高了性能。之前报告的delta前时期的风险比为7.333,而早期变异的风险比为29.359,EU1的风险比为20.734,Alpha的风险比为4.079,这表明变异的风险有所下降。结论:这些发现表明,细化时间间隔可以提高对传染病时间依赖性效应的理解。结合分层间隔和先进模型可提高COVID-19和其他不断演变的疾病的风险评估和预测准确性。
{"title":"Statistical learning methods for improving predictive performance in time-dependent survival models.","authors":"Hyungwoo Seo, Wonil Chung","doi":"10.1186/s44342-025-00050-7","DOIUrl":"10.1186/s44342-025-00050-7","url":null,"abstract":"<p><strong>Background: </strong>The COVID-19 pandemic has highlighted the need for survival models to assess risk factors and time-dependent effects in infectious diseases. However, the Cox proportional hazards (PH) model, which assumes constant covariate effects, struggles to capture disease dynamics. This underscores the need for advanced models that incorporate time-dependent coefficients and covariates for improved accuracy.</p><p><strong>Methods: </strong>To address the need for modeling time-dependent effects and covariates, we applied a stratified Cox PH model with multiple time intervals to better satisfy the PH assumption. We conducted simulations to evaluate the performance of machine learning and deep learning survival models, including random survival forest (RSF), DeepSurv, and DeepHit. To improve time-dependent effect estimation, we introduced a refined time-interval division and a weighted sum approach for integrated hazard ratios of COVID-19 variants. The event of interest was death, and the specific risk compared was the risk of death from the start of the study to either death or the last follow-up among infected versus uninfected individuals.</p><p><strong>Results: </strong>Our results showed that increasing the number of time intervals improved predictive accuracy. When the PH assumption held, the Cox PH model outperformed machine learning and deep learning models. Applying our approach to UK Biobank data, expanding time intervals from five to fifteen enhanced performance. The previously reported hazard ratio of 7.333 for the pre-Delta period was refined to 29.359 for the Early variant, 20.734 for EU1, and 4.079 for Alpha, revealing a decline in risk across variants.</p><p><strong>Conclusions: </strong>These findings suggest that refining time intervals improves the understanding of time-dependent effects in infectious diseases. Incorporating stratified intervals and advanced models enhances risk assessment and predictive accuracy for COVID-19 and other evolving diseases.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"19"},"PeriodicalIF":0.0,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12400734/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144984754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unravelling the molecular landscape of polycystic ovary syndrome (PCOS) and role of inflammation through transcriptomics analysis of human ovarian granulosa cells. 通过对人卵巢颗粒细胞的转录组学分析,揭示多囊卵巢综合征(PCOS)的分子格局和炎症的作用。
Pub Date : 2025-08-11 DOI: 10.1186/s44342-025-00051-6
Kanika Mahra, Vineet Singh, Jae-Ho Shin

Background: Polycystic ovary syndrome (PCOS) is a common metabolic problem in women of reproductive age that can lead to infertility and other metabolic disorders. Recent evidence indicates that inflammation might be one of the contributing factors in PCOS progression. However, there is a lack of information on the regulation of inflammatory genes in PCOS. Therefore, the aim of the study is to investigate the role of inflammation-associated genes and pathways in relation to PCOS.

Method: The bulk RNA-seq data of granulosa cells of human ovaries of PCOS-affected and healthy women were analyzed to evaluate the inflammatory regulation in PCOS. After quality trimming, the raw RNA-seq data were aligned to the human genome, and gene expression was quantified using featureCounts with Ensembl annotation. Further, downstream analyses of the resulting count matrix were performed in R Studio, where differentially expressed genes (DEG) were identified and CO-DEG analysis was performed.

Results: The study identifies the various differentially expressed inflammatory genes in the case of PCOS such as SPI1, HSPB1, MNDA, and ITGA. These DEG are closely associated with the activation of inflammatory responses, i.e., activation of lymphocytes and leukocytes, leukocyte migration and mononuclear cell proliferation, stimulating binding of various cytokines, immunoglobulins, and chemokines. PCOS group also exhibited an increased expression of androgen-mediated genes (SPI1 and ETS transcription factors) and genes associated with hyperlipidemia and insulin resistance (TNFRSF1B). Further, KEGG pathway enrichment analysis revealed significant upregulation of various pathways (autophagy, endocytosis) in the PCOS group. In addition, network analysis (cnetplot) of the top 10 KEGG GSEA pathways also highlights the key pathways in the PCOS group such as SNARE complex assembly pathway, SNAP-25, nucleophagy, and regulation of mast cell activation.

Conclusion: Therefore, the study highlights that inflammation is a major effector in PCOS, which also fuels obesity, an independent effector that further worsens the PCOS condition. In addition, the genes related to hyperandrogenism, hyperlipidemia, and insulin resistance were also overexpressed in PCOS, exacerbating the condition.

背景:多囊卵巢综合征(PCOS)是育龄妇女常见的代谢问题,可导致不孕和其他代谢紊乱。最近的证据表明,炎症可能是PCOS进展的因素之一。然而,炎症基因在多囊卵巢综合征中的调控作用尚缺乏相关信息。因此,本研究的目的是探讨炎症相关基因和途径在多囊卵巢综合征中的作用。方法:分析PCOS患者和健康女性卵巢颗粒细胞的大量RNA-seq数据,以评价PCOS患者的炎症调节作用。经过质量修剪后,将原始RNA-seq数据与人类基因组进行比对,并使用带有Ensembl注释的featurecots对基因表达进行量化。此外,在R Studio中对所得计数矩阵进行下游分析,其中鉴定差异表达基因(DEG)并进行CO-DEG分析。结果:本研究确定了PCOS病例中多种差异表达的炎症基因,如SPI1、HSPB1、MNDA和ITGA。这些DEG与炎症反应的激活密切相关,即淋巴细胞和白细胞的激活、白细胞迁移和单核细胞增殖,刺激各种细胞因子、免疫球蛋白和趋化因子的结合。PCOS组雄激素介导基因(SPI1和ETS转录因子)以及与高脂血症和胰岛素抵抗相关的基因(TNFRSF1B)的表达也有所增加。此外,KEGG通路富集分析显示,PCOS组多种通路(自噬、内吞)均显著上调。此外,通过对KEGG GSEA前10大通路的网络分析(cnetplot),也突出了PCOS组的SNARE复合物组装通路、SNAP-25、核自噬、肥大细胞活化调控等关键通路。结论:因此,本研究强调炎症是PCOS的一个主要影响因素,它也会导致肥胖,这是一个进一步恶化PCOS病情的独立影响因素。此外,与高雄激素、高脂血症、胰岛素抵抗相关的基因也在PCOS中过度表达,加重了病情。
{"title":"Unravelling the molecular landscape of polycystic ovary syndrome (PCOS) and role of inflammation through transcriptomics analysis of human ovarian granulosa cells.","authors":"Kanika Mahra, Vineet Singh, Jae-Ho Shin","doi":"10.1186/s44342-025-00051-6","DOIUrl":"10.1186/s44342-025-00051-6","url":null,"abstract":"<p><strong>Background: </strong>Polycystic ovary syndrome (PCOS) is a common metabolic problem in women of reproductive age that can lead to infertility and other metabolic disorders. Recent evidence indicates that inflammation might be one of the contributing factors in PCOS progression. However, there is a lack of information on the regulation of inflammatory genes in PCOS. Therefore, the aim of the study is to investigate the role of inflammation-associated genes and pathways in relation to PCOS.</p><p><strong>Method: </strong>The bulk RNA-seq data of granulosa cells of human ovaries of PCOS-affected and healthy women were analyzed to evaluate the inflammatory regulation in PCOS. After quality trimming, the raw RNA-seq data were aligned to the human genome, and gene expression was quantified using featureCounts with Ensembl annotation. Further, downstream analyses of the resulting count matrix were performed in R Studio, where differentially expressed genes (DEG) were identified and CO-DEG analysis was performed.</p><p><strong>Results: </strong>The study identifies the various differentially expressed inflammatory genes in the case of PCOS such as SPI1, HSPB1, MNDA, and ITGA. These DEG are closely associated with the activation of inflammatory responses, i.e., activation of lymphocytes and leukocytes, leukocyte migration and mononuclear cell proliferation, stimulating binding of various cytokines, immunoglobulins, and chemokines. PCOS group also exhibited an increased expression of androgen-mediated genes (SPI1 and ETS transcription factors) and genes associated with hyperlipidemia and insulin resistance (TNFRSF1B). Further, KEGG pathway enrichment analysis revealed significant upregulation of various pathways (autophagy, endocytosis) in the PCOS group. In addition, network analysis (cnetplot) of the top 10 KEGG GSEA pathways also highlights the key pathways in the PCOS group such as SNARE complex assembly pathway, SNAP-25, nucleophagy, and regulation of mast cell activation.</p><p><strong>Conclusion: </strong>Therefore, the study highlights that inflammation is a major effector in PCOS, which also fuels obesity, an independent effector that further worsens the PCOS condition. In addition, the genes related to hyperandrogenism, hyperlipidemia, and insulin resistance were also overexpressed in PCOS, exacerbating the condition.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"18"},"PeriodicalIF":0.0,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12337442/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144823646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of cell-type-specific, transcriptionally active transposable elements using long-read RNA-sequencing data-based comprehensive annotation. 使用基于长读rna测序数据的综合注释鉴定细胞类型特异性,转录活性转座元件。
Pub Date : 2025-08-06 DOI: 10.1186/s44342-025-00048-1
Chaemin Lim, Hyunsu An, Jihwan Park

Background: The biological functions of transposable element (TE)-derived transcripts during physiological development, disease development, and progression have been previously reported. However, research on locus-specific TE-derived transcript expression in various human cell types remains limited.

Methods: We processed 2596 publicly available human long-read RNA-sequencing (LR RNA-seq) datasets covering 21 organs and 71 cell lines in both healthy individuals and diseased patients with various conditions to compile this TE-derived transcript annotation. We established a pipeline for assembling transcripts containing TE sequences to measure transcriptionally active TE-derived transcripts in diverse tissues and cell types. Next, we applied our TE annotation to the Genotype-Tissue Expression (GTEx) single-cell RNA-sequencing (scRNA-seq) data from eight tissues.

Results: We constructed the first transcriptom6e-based TE annotation using massive amounts of human LR RNA-seq data for use as a comprehensive reference to detect locus-specific TE-derived transcripts. Our annotation showed better detection accuracy for TE-derived transcripts than the RepeatMasker and GENCODE nonTE gene annotations. This annotation enabled the identification of novel TE-derived transcripts and their isoforms. We also identified alternative transcription end sites for long noncoding genes and confirmed previously annotated TE-nonTE gene fusion transcripts. Next, we applied our TE-derived transcript annotation to public scRNA-seq data from various human tissues and identified several cell-type-specific TE-derived transcripts in a locus-specific manner.

Conclusions: We generated a comprehensive, TE-derived transcript annotation using large-scale, LR RNA-seq data. Researchers can use our TE reference annotation to analyze active TE transcripts and their splicing isoforms in specific transcriptome datasets and to detect de novo TE transcripts. The discovery of cell-type-specific TE-derived transcripts may help explain mechanisms underlying the maintenance of cellular identity and provide new insights into the pathological mechanisms of various diseases.

背景:转座因子(TE)衍生转录物在生理发育、疾病发生和进展过程中的生物学功能此前已有报道。然而,对不同人类细胞类型中基因座特异性te衍生转录物表达的研究仍然有限。方法:我们处理了2596个公开的人类长读rna测序(LR RNA-seq)数据集,涵盖了健康个体和各种疾病患者的21个器官和71个细胞系,以编制这个te衍生的转录本注释。我们建立了一个管道来组装包含TE序列的转录本,以测量不同组织和细胞类型中转录活性TE衍生转录本。接下来,我们将TE注释应用于来自8个组织的基因型-组织表达(GTEx)单细胞rna测序(scRNA-seq)数据。结果:我们利用大量的人类LR RNA-seq数据构建了第一个基于转录组6的TE注释,作为检测位点特异性TE衍生转录物的综合参考。与RepeatMasker和GENCODE非te基因注释相比,我们的注释对te衍生转录本的检测精度更高。该注释能够鉴定新的te衍生转录本及其同工型。我们还鉴定了长非编码基因的替代转录端位点,并证实了先前注释的TE-nonTE基因融合转录物。接下来,我们将te衍生转录本注释应用于来自各种人体组织的公开scRNA-seq数据,并以位点特异性的方式鉴定了几种细胞类型特异性te衍生转录本。结论:我们使用大规模的LR RNA-seq数据生成了一个全面的te衍生转录本注释。研究人员可以使用我们的TE参考注释来分析特定转录组数据集中活跃的TE转录本及其剪接异构体,并检测新的TE转录本。细胞类型特异性te衍生转录本的发现可能有助于解释细胞身份维持的机制,并为各种疾病的病理机制提供新的见解。
{"title":"Identification of cell-type-specific, transcriptionally active transposable elements using long-read RNA-sequencing data-based comprehensive annotation.","authors":"Chaemin Lim, Hyunsu An, Jihwan Park","doi":"10.1186/s44342-025-00048-1","DOIUrl":"10.1186/s44342-025-00048-1","url":null,"abstract":"<p><strong>Background: </strong>The biological functions of transposable element (TE)-derived transcripts during physiological development, disease development, and progression have been previously reported. However, research on locus-specific TE-derived transcript expression in various human cell types remains limited.</p><p><strong>Methods: </strong>We processed 2596 publicly available human long-read RNA-sequencing (LR RNA-seq) datasets covering 21 organs and 71 cell lines in both healthy individuals and diseased patients with various conditions to compile this TE-derived transcript annotation. We established a pipeline for assembling transcripts containing TE sequences to measure transcriptionally active TE-derived transcripts in diverse tissues and cell types. Next, we applied our TE annotation to the Genotype-Tissue Expression (GTEx) single-cell RNA-sequencing (scRNA-seq) data from eight tissues.</p><p><strong>Results: </strong>We constructed the first transcriptom6e-based TE annotation using massive amounts of human LR RNA-seq data for use as a comprehensive reference to detect locus-specific TE-derived transcripts. Our annotation showed better detection accuracy for TE-derived transcripts than the RepeatMasker and GENCODE nonTE gene annotations. This annotation enabled the identification of novel TE-derived transcripts and their isoforms. We also identified alternative transcription end sites for long noncoding genes and confirmed previously annotated TE-nonTE gene fusion transcripts. Next, we applied our TE-derived transcript annotation to public scRNA-seq data from various human tissues and identified several cell-type-specific TE-derived transcripts in a locus-specific manner.</p><p><strong>Conclusions: </strong>We generated a comprehensive, TE-derived transcript annotation using large-scale, LR RNA-seq data. Researchers can use our TE reference annotation to analyze active TE transcripts and their splicing isoforms in specific transcriptome datasets and to detect de novo TE transcripts. The discovery of cell-type-specific TE-derived transcripts may help explain mechanisms underlying the maintenance of cellular identity and provide new insights into the pathological mechanisms of various diseases.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"17"},"PeriodicalIF":0.0,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12326599/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144796603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
In silico design and evaluation of a multiepitope vaccine against Bordetella pertussis: structural, immunological, and biological properties. 针对百日咳博德泰氏杆菌的多表位疫苗的计算机设计和评价:结构、免疫学和生物学特性。
Pub Date : 2025-07-01 DOI: 10.1186/s44342-025-00049-0
Negar Souod, Hamid Madanchi, Fariborz Bahrami, Saeed Reza Pakzad, Fereshteh Shahcheraghi, Soheila Ajdary

Introduction/objectives: Despite widespread vaccination, the increasing incidence of pertussis underscores the urgent need for innovative vaccine strategies. This study aims to design and analyze, using in silico methods, a multiepitope protein that incorporates epitopes from the S1 subunit of pertussis toxin and the type 1 immunodominant domain of filamentous hemagglutinin (F1). The goal is to enhance both systemic and mucosal immunity through the incorporation of the C-terminal fragment of Clostridium perfringens enterotoxin (C-CPE).

Methods: Using reverse vaccinology, we predicted immunogenic epitopes for lymphocytes derived from the S1 and F1 proteins. The epitopes were assembled into a multiepitope construct named mF1S1-C-CPE, which was then evaluated for its physicochemical, immunological, and biological properties. Immunoinformatics tools were employed to analyze antigenicity, allergenicity, and population coverage. Additionally, molecular docking simulations of peptide‒MHC and mF1S1-C-CPE_TLR2/TLR4 binding were conducted.

Results: Structural analysis indicated that the final multiepitope construct maintained stability and solubility in aqueous environments. Immunoinformatic analysis revealed strong immunogenic properties, effectively eliciting both systemic and mucosal immune responses. Molecular docking demonstrated high-affinity binding patterns between the peptides (both individual or within the mF1S1-C-CPE) and corresponding HLA molecules. Additionally, molecular docking simulations of mF1S1-C-CPE and TLR2/TLR4 indicated strong binding affinity to receptors of innate immunity. The construct was predicted to be stable, soluble, and suitable for expression in Escherichia coli (CAI 0.93; GC content 54.9%).

Conclusion: This innovative approach holds promise for enhancing pertussis vaccination strategies by improving mucosal immune responses. Further in vivo studies are essential to validate the efficacy of this multiepitope vaccine candidate.

前言/目标:尽管广泛接种疫苗,百日咳发病率的增加强调了迫切需要创新的疫苗战略。本研究旨在利用计算机方法设计和分析一种多表位蛋白,该蛋白结合了百日咳毒素S1亚基和丝状血凝素(F1) 1型免疫优势结构域的表位。目的是通过结合产气荚膜梭菌肠毒素(C-CPE)的c端片段来增强全身和粘膜免疫。方法:采用反向疫苗学方法,预测来自S1和F1蛋白的淋巴细胞的免疫原性表位。这些表位被组装成一个名为mF1S1-C-CPE的多表位结构,然后对其物理化学、免疫学和生物学特性进行评估。免疫信息学工具用于分析抗原性、过敏原性和人群覆盖率。此外,还进行了肽- mhc与mF1S1-C-CPE_TLR2/TLR4结合的分子对接模拟。结果:结构分析表明,最终的多表位结构在水环境中保持稳定性和溶解度。免疫信息学分析显示强免疫原性,有效地引发全身和粘膜免疫反应。分子对接显示肽(无论是单个的还是mF1S1-C-CPE内的)和相应的HLA分子之间的高亲和力结合模式。此外,mF1S1-C-CPE与TLR2/TLR4的分子对接模拟显示,mF1S1-C-CPE与先天免疫受体有很强的结合亲和力。该构建体稳定、可溶,适合在大肠杆菌中表达(CAI 0.93;GC含量54.9%)。结论:这种创新的方法有望通过改善粘膜免疫反应来增强百日咳疫苗接种策略。进一步的体内研究对于验证这种多表位候选疫苗的有效性至关重要。
{"title":"In silico design and evaluation of a multiepitope vaccine against Bordetella pertussis: structural, immunological, and biological properties.","authors":"Negar Souod, Hamid Madanchi, Fariborz Bahrami, Saeed Reza Pakzad, Fereshteh Shahcheraghi, Soheila Ajdary","doi":"10.1186/s44342-025-00049-0","DOIUrl":"10.1186/s44342-025-00049-0","url":null,"abstract":"<p><strong>Introduction/objectives: </strong>Despite widespread vaccination, the increasing incidence of pertussis underscores the urgent need for innovative vaccine strategies. This study aims to design and analyze, using in silico methods, a multiepitope protein that incorporates epitopes from the S1 subunit of pertussis toxin and the type 1 immunodominant domain of filamentous hemagglutinin (F1). The goal is to enhance both systemic and mucosal immunity through the incorporation of the C-terminal fragment of Clostridium perfringens enterotoxin (C-CPE).</p><p><strong>Methods: </strong>Using reverse vaccinology, we predicted immunogenic epitopes for lymphocytes derived from the S1 and F1 proteins. The epitopes were assembled into a multiepitope construct named mF1S1-C-CPE, which was then evaluated for its physicochemical, immunological, and biological properties. Immunoinformatics tools were employed to analyze antigenicity, allergenicity, and population coverage. Additionally, molecular docking simulations of peptide‒MHC and mF1S1-C-CPE_TLR2/TLR4 binding were conducted.</p><p><strong>Results: </strong>Structural analysis indicated that the final multiepitope construct maintained stability and solubility in aqueous environments. Immunoinformatic analysis revealed strong immunogenic properties, effectively eliciting both systemic and mucosal immune responses. Molecular docking demonstrated high-affinity binding patterns between the peptides (both individual or within the mF1S1-C-CPE) and corresponding HLA molecules. Additionally, molecular docking simulations of mF1S1-C-CPE and TLR2/TLR4 indicated strong binding affinity to receptors of innate immunity. The construct was predicted to be stable, soluble, and suitable for expression in Escherichia coli (CAI 0.93; GC content 54.9%).</p><p><strong>Conclusion: </strong>This innovative approach holds promise for enhancing pertussis vaccination strategies by improving mucosal immune responses. Further in vivo studies are essential to validate the efficacy of this multiepitope vaccine candidate.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"16"},"PeriodicalIF":0.0,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220642/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144546754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gene expression network analysis identified CDK1 and KIF11 as possible key molecules in the development of colorectal cancer from normal tissues. 基因表达网络分析发现CDK1和KIF11可能是正常组织中结直肠癌发展的关键分子。
Pub Date : 2025-06-02 DOI: 10.1186/s44342-025-00046-3
Soo Bin Lee, Young Seon Noh, Ji-Wook Moon, Soohyun Sim, Sung Won Han, Eun Sun Kim, Ji-Yun Lee

Background: Colorectal cancer (CRC) is one of the most common malignancies and the second most common cause of cancer-related mortality worldwide. Despite extensive research, the mechanism underlying CRC development remains unclear. This study aimed to understand the development and progression of CRC.

Methods: Gene network analysis of tumors with their paired normal tissues was performed using the differentially expressed genes dataset for CRC from the Cancer Genome Atlas. Further investigation of the regulatory relationship between hub genes and tumor development was conducted by protein-protein interaction network, Gene Ontology enrichment, and Kyoto Encyclopedia of Genes and Genomes pathway analyses using the selected hub genes.

Results: The network was more centered, and a common hub as well as a hub of hub genes were more connected to each other in the tumor than in the normal tissue, indicating changes in the network from normal to tumor. Eight downregulated and two upregulated hub genes (CDK1 and KIF11) in the tumor were identified. Further, the regulatory pathway was altered, especially in cell cycle and cell division. All R implementation codes are available on the journal website as supplementary materials.

Conclusions: Our findings may help understand the biological processes underlying tumor development and progression and suggest CDK1 and KIF11 as possible key molecules in the development of CRC.

背景:结直肠癌(CRC)是世界范围内最常见的恶性肿瘤之一,也是癌症相关死亡的第二大常见原因。尽管进行了广泛的研究,但CRC发展的机制仍不清楚。本研究旨在了解结直肠癌的发生发展。方法:使用来自癌症基因组图谱的CRC差异表达基因数据集对肿瘤及其配对的正常组织进行基因网络分析。通过蛋白-蛋白互作网络、基因本体富集、京都基因与基因组百科通路分析等方法进一步研究中心基因与肿瘤发生的调控关系。结果:网络更集中,肿瘤组织中一个共同枢纽和一个枢纽基因的枢纽相互连接比正常组织中更多,表明网络从正常到肿瘤的变化。在肿瘤中发现8个中心基因(CDK1和KIF11)下调,2个中心基因(CDK1和KIF11)上调。此外,调控通路发生改变,特别是在细胞周期和细胞分裂中。所有的R实现代码都可以在期刊网站上作为补充材料获得。结论:我们的研究结果可能有助于理解肿瘤发生和进展的生物学过程,并提示CDK1和KIF11可能是CRC发生的关键分子。
{"title":"Gene expression network analysis identified CDK1 and KIF11 as possible key molecules in the development of colorectal cancer from normal tissues.","authors":"Soo Bin Lee, Young Seon Noh, Ji-Wook Moon, Soohyun Sim, Sung Won Han, Eun Sun Kim, Ji-Yun Lee","doi":"10.1186/s44342-025-00046-3","DOIUrl":"10.1186/s44342-025-00046-3","url":null,"abstract":"<p><strong>Background: </strong>Colorectal cancer (CRC) is one of the most common malignancies and the second most common cause of cancer-related mortality worldwide. Despite extensive research, the mechanism underlying CRC development remains unclear. This study aimed to understand the development and progression of CRC.</p><p><strong>Methods: </strong>Gene network analysis of tumors with their paired normal tissues was performed using the differentially expressed genes dataset for CRC from the Cancer Genome Atlas. Further investigation of the regulatory relationship between hub genes and tumor development was conducted by protein-protein interaction network, Gene Ontology enrichment, and Kyoto Encyclopedia of Genes and Genomes pathway analyses using the selected hub genes.</p><p><strong>Results: </strong>The network was more centered, and a common hub as well as a hub of hub genes were more connected to each other in the tumor than in the normal tissue, indicating changes in the network from normal to tumor. Eight downregulated and two upregulated hub genes (CDK1 and KIF11) in the tumor were identified. Further, the regulatory pathway was altered, especially in cell cycle and cell division. All R implementation codes are available on the journal website as supplementary materials.</p><p><strong>Conclusions: </strong>Our findings may help understand the biological processes underlying tumor development and progression and suggest CDK1 and KIF11 as possible key molecules in the development of CRC.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"15"},"PeriodicalIF":0.0,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12128336/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144210653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Non-variable RNA deletion using the CRISPR-Cas9 technique demonstrated improved outcomes in human intestine single-cell RNA sequencing data, even at half sequencing depths. 使用CRISPR-Cas9技术的非可变RNA删除在人类肠道单细胞RNA测序数据中显示出改善的结果,即使在一半的测序深度。
Pub Date : 2025-05-20 DOI: 10.1186/s44342-025-00043-6
Dong Jun Kim, Christine Suh Yun Joh, So Young Jeong, Yong Jun Kim, Seong Joon Koh, Hyun Je Kim

In single-cell RNA sequencing (scRNA-seq) data, issues related to the high expression of non-variable RNAs often arise due to organ traits or sample quality. Computational methods, such as SoupX (Young (Gigascience 9:giaa151, 2020)), have been used to solve this problem but it may remove biologically relevant data. This study presents a clustered regularly interspaced short palindromic repeats (CRISPR)-Cas9-based method that selectively removes non-variable RNAs. We applied this approach to scRNA-seq data from human intestinal tissues of 17 patients. By targeting non-variable genes, including ribosomal and mitochondrial RNAs, CRISPR-Cas9 treatment effectively reduced their expression, outperforming computational methods in both the number and extent of gene removal. The CRISPR-Cas9 treated samples, sequenced at half the depth compared to untreated samples, maintained comparable sequencing quality, and saturation, demonstrating that this approach can reduce sequencing costs while preserving data quality. Cell type composition and gene expression patterns remained consistent between treated and original datasets, with no unintended gene deletions. Overall, our findings suggest that the CRISPR-Cas9-based method offers a cost-effective solution for improving scRNA-seq data quality, particularly for tissues with high levels of non-variable RNAs, without compromising biological integrity.

在单细胞RNA测序(scRNA-seq)数据中,由于器官特征或样品质量,经常出现与非可变RNA高表达相关的问题。SoupX (Young (Gigascience 9: giaa151,2020))等计算方法已被用于解决这一问题,但它可能会删除生物学相关数据。本研究提出了一种基于CRISPR - cas9的聚类规则间隔短回文重复序列(clustered regularly interspaced short palindromic repeats,简称CRISPR)的方法,可以选择性地去除非可变rna。我们将这种方法应用于17例患者人肠组织的scRNA-seq数据。通过靶向非可变基因,包括核糖体和线粒体rna, CRISPR-Cas9治疗有效地降低了它们的表达,在基因去除的数量和程度上都优于计算方法。经过CRISPR-Cas9处理的样品,与未经处理的样品相比,在一半的深度进行测序,保持了相当的测序质量和饱和度,表明该方法可以在保持数据质量的同时降低测序成本。细胞类型组成和基因表达模式在处理后的数据集和原始数据集之间保持一致,没有意外的基因缺失。总的来说,我们的研究结果表明,基于crispr - cas9的方法为提高scRNA-seq数据质量提供了一种经济有效的解决方案,特别是对于具有高水平非可变rna的组织,而不会影响生物完整性。
{"title":"Non-variable RNA deletion using the CRISPR-Cas9 technique demonstrated improved outcomes in human intestine single-cell RNA sequencing data, even at half sequencing depths.","authors":"Dong Jun Kim, Christine Suh Yun Joh, So Young Jeong, Yong Jun Kim, Seong Joon Koh, Hyun Je Kim","doi":"10.1186/s44342-025-00043-6","DOIUrl":"10.1186/s44342-025-00043-6","url":null,"abstract":"<p><p>In single-cell RNA sequencing (scRNA-seq) data, issues related to the high expression of non-variable RNAs often arise due to organ traits or sample quality. Computational methods, such as SoupX (Young (Gigascience 9:giaa151, 2020)), have been used to solve this problem but it may remove biologically relevant data. This study presents a clustered regularly interspaced short palindromic repeats (CRISPR)-Cas9-based method that selectively removes non-variable RNAs. We applied this approach to scRNA-seq data from human intestinal tissues of 17 patients. By targeting non-variable genes, including ribosomal and mitochondrial RNAs, CRISPR-Cas9 treatment effectively reduced their expression, outperforming computational methods in both the number and extent of gene removal. The CRISPR-Cas9 treated samples, sequenced at half the depth compared to untreated samples, maintained comparable sequencing quality, and saturation, demonstrating that this approach can reduce sequencing costs while preserving data quality. Cell type composition and gene expression patterns remained consistent between treated and original datasets, with no unintended gene deletions. Overall, our findings suggest that the CRISPR-Cas9-based method offers a cost-effective solution for improving scRNA-seq data quality, particularly for tissues with high levels of non-variable RNAs, without compromising biological integrity.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"14"},"PeriodicalIF":0.0,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12093678/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144113308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Navigating single-cell RNA-sequencing: protocols, tools, databases, and applications. 导航单细胞rna测序:协议,工具,数据库和应用。
Pub Date : 2025-05-17 DOI: 10.1186/s44342-025-00044-5
Ankish Arya, Prabhat Tripathi, Nidhi Dubey, Imlimaong Aier, Pritish Kumar Varadwaj

Single-cell RNA-sequencing (scRNA-seq) technology brought about a revolutionary change in the transcriptomic world, paving the way for comprehensive analysis of cellular heterogeneity in complex biological systems. It enabled researchers to see how different cells behaved at single-cell levels, providing new insights into the process. However, despite all these advancements, scRNA-seq also experiences challenges related to the complexity of data analysis, interpretation, and multi-omics data integration. In this review, these complications were discussed in detail, directly pointing at the optimization of scRNA-seq approaches and understanding the world of single-cell and its dynamics. Different protocols and currently functional single-cell databases were also covered. This review highlights different tools for the analysis of scRNA-seq and their methodologies, emphasizing innovative techniques that enhance resolution and accuracy at a single-cell level. Various applications were explored across domains including drug discovery, tumor microenvironment (TME), biomarker discovery, and microbial profiling, and case studies were discussed to explain the importance of scRNA-seq by uncovering novel and rare cell types and their identification. This review underlines a crucial aspect of scRNA-seq in the advancement of personalized medicine and highlights its potential to understand the complexity of biological systems.

单细胞rna测序(scRNA-seq)技术在转录组学领域带来了革命性的变化,为复杂生物系统中细胞异质性的综合分析铺平了道路。它使研究人员能够看到不同细胞在单细胞水平上的表现,为这一过程提供了新的见解。然而,尽管取得了这些进步,scRNA-seq也面临着与数据分析、解释和多组学数据集成的复杂性相关的挑战。在这篇综述中,详细讨论了这些并发症,直接指向scRNA-seq方法的优化和对单细胞世界及其动力学的理解。还涵盖了不同的协议和当前功能的单细胞数据库。本文重点介绍了scRNA-seq分析的不同工具及其方法,强调了在单细胞水平上提高分辨率和准确性的创新技术。本文探讨了scRNA-seq在药物发现、肿瘤微环境(TME)、生物标志物发现和微生物分析等领域的各种应用,并讨论了案例研究,通过发现新的和罕见的细胞类型及其鉴定来解释scRNA-seq的重要性。这篇综述强调了scRNA-seq在个性化医疗进步中的一个关键方面,并强调了它在理解生物系统复杂性方面的潜力。
{"title":"Navigating single-cell RNA-sequencing: protocols, tools, databases, and applications.","authors":"Ankish Arya, Prabhat Tripathi, Nidhi Dubey, Imlimaong Aier, Pritish Kumar Varadwaj","doi":"10.1186/s44342-025-00044-5","DOIUrl":"10.1186/s44342-025-00044-5","url":null,"abstract":"<p><p>Single-cell RNA-sequencing (scRNA-seq) technology brought about a revolutionary change in the transcriptomic world, paving the way for comprehensive analysis of cellular heterogeneity in complex biological systems. It enabled researchers to see how different cells behaved at single-cell levels, providing new insights into the process. However, despite all these advancements, scRNA-seq also experiences challenges related to the complexity of data analysis, interpretation, and multi-omics data integration. In this review, these complications were discussed in detail, directly pointing at the optimization of scRNA-seq approaches and understanding the world of single-cell and its dynamics. Different protocols and currently functional single-cell databases were also covered. This review highlights different tools for the analysis of scRNA-seq and their methodologies, emphasizing innovative techniques that enhance resolution and accuracy at a single-cell level. Various applications were explored across domains including drug discovery, tumor microenvironment (TME), biomarker discovery, and microbial profiling, and case studies were discussed to explain the importance of scRNA-seq by uncovering novel and rare cell types and their identification. This review underlines a crucial aspect of scRNA-seq in the advancement of personalized medicine and highlights its potential to understand the complexity of biological systems.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"13"},"PeriodicalIF":0.0,"publicationDate":"2025-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12085826/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144096718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genome-wide association study-driven identification of thrombomodulin and factor V as the best biomarker combination for deep vein thrombosis. 全基因组关联研究驱动的血栓调节蛋白和V因子作为深静脉血栓形成的最佳生物标志物组合的鉴定。
Pub Date : 2025-05-15 DOI: 10.1186/s44342-025-00047-2
Usi Sukorini, Gisca Ajeng Widya Ninggar, Mohammad Hendra Setia Lesmana, Lalu Irham, Wirawan Adikusuma, Hegaria Rahmawati, Nur Imma Fatimah Harahap, Chiou-Feng Lin, Rahmat Dani Satria

Deep vein thrombosis (DVT) is a clinically significant condition characterized by the formation of thrombi in deep venous structures, leading to high morbidity and potential mortality. Identifying reliable biomarkers for DVT risk prediction remains challenging due to the intricate genetic and molecular mechanisms underlying the disease. This study aims to investigate the best biomarker for DVT. Our study utilized genome-wide association studies (GWAS) findings coupled with a functional annotation scoring system to identify and prioritize genetic markers with strong associations to DVT. Furthermore, gene expression levels were analyzed to determine the most promising genetic markers. Several databases were utilized, including the GWAS Catalog, HaploReg 4.2, WebGestalt, Enrichr, and the GTEx Portal. Through the comprehensive analysis, we found 5 potential biomarkers and highlighted thrombomodulin (THBD) and Factor V (F5) as the best blood-based biomarkers. THBD and F5 genes were selected based on their elevated expression levels in blood and the presence of eQTLs. Functionally, THBD modulates coagulation via protein C activation, while F5 is pivotal in thrombin formation and clot stabilization, underscoring their mechanistic relevance to DVT pathogenesis, and rendering them suitable for non-invasive clinical assessment. Our findings emphasize the potential of genetic biomarkers to transform DVT risk assessment and support advancements in precision medicine for thrombotic disorders.

深静脉血栓形成(DVT)是一种临床意义重大的疾病,其特征是在深静脉结构中形成血栓,具有很高的发病率和潜在的死亡率。由于该疾病复杂的遗传和分子机制,确定可靠的DVT风险预测生物标志物仍然具有挑战性。本研究旨在探讨深静脉血栓形成的最佳生物标志物。我们的研究利用全基因组关联研究(GWAS)的结果,结合功能注释评分系统来识别和优先考虑与DVT强关联的遗传标记。此外,还分析了基因表达水平,以确定最有希望的遗传标记。使用了几个数据库,包括GWAS Catalog、HaploReg 4.2、WebGestalt、plentir和GTEx Portal。通过综合分析,我们发现了5个潜在的生物标志物,并强调血栓调节素(THBD)和因子V (F5)是最好的血基生物标志物。根据THBD和F5基因在血液中的高表达水平和eqtl的存在来选择它们。在功能上,THBD通过蛋白C激活调节凝血,而F5在凝血酶形成和凝块稳定中起关键作用,强调了它们与DVT发病机制的机制相关性,并使它们适合用于无创临床评估。我们的研究结果强调了遗传生物标志物在改变深静脉血栓形成风险评估方面的潜力,并支持血栓性疾病精准医学的进步。
{"title":"Genome-wide association study-driven identification of thrombomodulin and factor V as the best biomarker combination for deep vein thrombosis.","authors":"Usi Sukorini, Gisca Ajeng Widya Ninggar, Mohammad Hendra Setia Lesmana, Lalu Irham, Wirawan Adikusuma, Hegaria Rahmawati, Nur Imma Fatimah Harahap, Chiou-Feng Lin, Rahmat Dani Satria","doi":"10.1186/s44342-025-00047-2","DOIUrl":"10.1186/s44342-025-00047-2","url":null,"abstract":"<p><p>Deep vein thrombosis (DVT) is a clinically significant condition characterized by the formation of thrombi in deep venous structures, leading to high morbidity and potential mortality. Identifying reliable biomarkers for DVT risk prediction remains challenging due to the intricate genetic and molecular mechanisms underlying the disease. This study aims to investigate the best biomarker for DVT. Our study utilized genome-wide association studies (GWAS) findings coupled with a functional annotation scoring system to identify and prioritize genetic markers with strong associations to DVT. Furthermore, gene expression levels were analyzed to determine the most promising genetic markers. Several databases were utilized, including the GWAS Catalog, HaploReg 4.2, WebGestalt, Enrichr, and the GTEx Portal. Through the comprehensive analysis, we found 5 potential biomarkers and highlighted thrombomodulin (THBD) and Factor V (F5) as the best blood-based biomarkers. THBD and F5 genes were selected based on their elevated expression levels in blood and the presence of eQTLs. Functionally, THBD modulates coagulation via protein C activation, while F5 is pivotal in thrombin formation and clot stabilization, underscoring their mechanistic relevance to DVT pathogenesis, and rendering them suitable for non-invasive clinical assessment. Our findings emphasize the potential of genetic biomarkers to transform DVT risk assessment and support advancements in precision medicine for thrombotic disorders.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"11"},"PeriodicalIF":0.0,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083130/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144083126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genomics & informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1