首页 > 最新文献

Bioinformatics and Biology Insights最新文献

英文 中文
A User-Friendly Machine Learning Pipeline for Automated Leaf Segmentation in Atriplex lentiformis. 一种用户友好的机器学习管道用于矩阵透镜体叶片自动分割。
IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-06-08 eCollection Date: 2025-01-01 DOI: 10.1177/11779322251344033
Michelle Lynn Yung, Kamila Murawska-Wlodarczyk, Alicja Babst-Kostecka, Raina Margaret Maier, Nirav Merchant, Aikseng Ooi

Automated leaf segmentation pipelines must balance accuracy, scalability, and usability to be readily adopted in plant research. We present an end-to-end deep learning pipeline designed for practical use in plant phenotyping, which we developed and evaluated during a real-world plant growth experiment using Atriplex lentiformis. The pipeline integrates a fine-tuned Mask Region-based Convolutional Neural Network (Mask R-CNN) segmentation model trained on 176 plant images and achieves high performance despite the small training data set (Dice coefficient = 0.781). We quantitatively compare the fine-tuned Mask R-CNN model to Meta AI's Segment Anything Model (SAM) and evaluate natural language prompts using Grounded SAM and the Leaf-Only SAM post-processing pipeline for refining segmentation outputs. Our findings highlight that transfer learning on a specialized data set can still outperform a large foundation model in domain-specific tasks. In addition, we integrate QR codes for automated sample identification and benchmark multiple QR code decoding libraries, evaluating their robustness under real-world imaging conditions like distortion and lighting variation. To ensure accessibility, we deploy the pipeline as a user-friendly Streamlit web application, allowing researchers to analyze images without deep learning expertise. By focusing on practical deployment in addition to model performance, this study provides an open-source, scalable framework for plant science applications and addresses real-world challenges in automation and usability by the end-researcher.

自动化叶片分割管道必须平衡准确性、可扩展性和可用性,以便在植物研究中容易采用。我们提出了一个端到端的深度学习管道,设计用于植物表型的实际应用,我们在使用Atriplex lentiformis的真实植物生长实验中开发和评估了该管道。该管道集成了一个经过微调的基于Mask区域的卷积神经网络(Mask R-CNN)分割模型,该模型对176张植物图像进行了训练,尽管训练数据集很小(Dice系数= 0.781),但仍取得了很高的性能。我们定量地将微调后的Mask R-CNN模型与Meta AI的Segment Anything model (SAM)进行比较,并使用Grounded SAM和Leaf-Only SAM后处理管道来评估自然语言提示,以改进分割输出。我们的研究结果强调,在特定领域的任务中,在专门数据集上的迁移学习仍然可以优于大型基础模型。此外,我们集成了用于自动样本识别的QR码,并对多个QR码解码库进行了基准测试,评估了它们在失真和光照变化等现实成像条件下的鲁棒性。为了确保可访问性,我们将管道部署为用户友好的Streamlit web应用程序,允许研究人员在没有深度学习专业知识的情况下分析图像。通过关注模型性能之外的实际部署,本研究为植物科学应用提供了一个开源的、可扩展的框架,并解决了最终研究人员在自动化和可用性方面的现实挑战。
{"title":"A User-Friendly Machine Learning Pipeline for Automated Leaf Segmentation in <i>Atriplex lentiformis</i>.","authors":"Michelle Lynn Yung, Kamila Murawska-Wlodarczyk, Alicja Babst-Kostecka, Raina Margaret Maier, Nirav Merchant, Aikseng Ooi","doi":"10.1177/11779322251344033","DOIUrl":"10.1177/11779322251344033","url":null,"abstract":"<p><p>Automated leaf segmentation pipelines must balance accuracy, scalability, and usability to be readily adopted in plant research. We present an end-to-end deep learning pipeline designed for practical use in plant phenotyping, which we developed and evaluated during a real-world plant growth experiment using <i>Atriplex lentiformis</i>. The pipeline integrates a fine-tuned Mask Region-based Convolutional Neural Network (Mask R-CNN) segmentation model trained on 176 plant images and achieves high performance despite the small training data set (Dice coefficient = 0.781). We quantitatively compare the fine-tuned Mask R-CNN model to Meta AI's Segment Anything Model (SAM) and evaluate natural language prompts using Grounded SAM and the Leaf-Only SAM post-processing pipeline for refining segmentation outputs. Our findings highlight that transfer learning on a specialized data set can still outperform a large foundation model in domain-specific tasks. In addition, we integrate QR codes for automated sample identification and benchmark multiple QR code decoding libraries, evaluating their robustness under real-world imaging conditions like distortion and lighting variation. To ensure accessibility, we deploy the pipeline as a user-friendly Streamlit web application, allowing researchers to analyze images without deep learning expertise. By focusing on practical deployment in addition to model performance, this study provides an open-source, scalable framework for plant science applications and addresses real-world challenges in automation and usability by the end-researcher.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251344033"},"PeriodicalIF":2.3,"publicationDate":"2025-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12149614/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144265239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrative Machine Learning Approach to Explore Glycosylation Signatures and Immune Landscape in Moyamoya Disease. 综合机器学习方法探索烟雾病的糖基化特征和免疫景观。
IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-05-24 eCollection Date: 2025-01-01 DOI: 10.1177/11779322251342412
Cunxin Tan, Jing Wang, Yanru Wang, Shaoqi Xu, Zhenyu Zhou, Junze Zhang, Shihao He, Ran Duan

Moyamoya disease (MMD) is a rare, chronic cerebrovascular disorder of uncertain etiology. Although abnormal glucose metabolism has been implicated, the contribution of glycosylation-related genes in MMD remains elusive. In this study, we analyzed 2 transcriptome data sets (GSE189993 and GSE131293) from the Gene Expression Omnibus (GEO) database to identify 723 differentially expressed genes (DEGs) between MMD patients and controls. Intersection genes with known glycosylation-related genes underwent Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses. We utilized machine learning to select key hub genes, followed by immune cell infiltration and correlation analyses. In-depth immune cell analysis indicated that both CFP and MGAT5B were closely tied to various immune components, suggesting potential crosstalk between glycosylation pathways and immune regulation. Notably, CFP was positively associated with pDCs, HLA, and CCR, whereas MGAT5B correlated with B-cells, check-points, and T helper cells but showed a negative relationship with Tregs, hinting at an immunoregulatory mechanism influencing MMD progression. Motif-TF annotation highlighted csibp_M2095 as the motif with the highest normalized enrichment score (NES: 6.57). Reverse microRNA (miRNA)-gene prediction identified 75 miRNAs regulating these focus genes, along with 126 miRNA-miRNA interconnections. Connectivity Map (Cmap) analysis revealed that Chenodeoxycholic acid, MRS-1220, Phenytoin, and Piceid were strongly negatively correlated with MMD expression profiles, suggesting potential therapeutic candidates. Enzyme-linked immunosorbent assays confirmed elevated CFP and MGAT5B and reduced PTPN11 in MMD, aligning with our bioinformatic findings. Moreover, PTPN11 knockdown in human brain microvascular endothelial cells (HBMECs) significantly enhanced tube formation, indicating a role in vascular remodeling. Collectively, these results emphasize the importance of glycosylation-related genes and immune dysregulation in MMD pathogenesis. These findings broaden our understanding of MMD's underlying mechanisms and underscore the necessity of continued research into glycosylation-driven pathways for improved disease management.

烟雾病是一种罕见的慢性脑血管疾病,病因不明。尽管异常的糖代谢已经牵涉其中,糖基化相关基因在烟雾病中的作用仍然难以捉摸。在这项研究中,我们分析了基因表达Omnibus (GEO)数据库中的2个转录组数据集(GSE189993和GSE131293),以确定烟雾病患者与对照组之间的723个差异表达基因(DEGs)。与已知糖基化相关基因的交叉基因进行基因本体(GO)和京都基因与基因组百科全书(KEGG)富集分析。我们利用机器学习选择关键枢纽基因,然后进行免疫细胞浸润和相关性分析。深入的免疫细胞分析表明,CFP和MGAT5B都与多种免疫成分密切相关,表明糖基化途径与免疫调节之间存在潜在的串导。值得注意的是,CFP与pDCs、HLA和CCR呈正相关,而MGAT5B与b细胞、检查点和T辅助细胞相关,但与Tregs呈负相关,暗示了影响烟雾病进展的免疫调节机制。motif - tf注释显示csibp_M2095是归一化富集分数最高的motif (NES: 6.57)。反向microRNA (miRNA)-基因预测鉴定了75个调节这些焦点基因的miRNA,以及126个miRNA-miRNA互连。连接图(Cmap)分析显示,Chenodeoxycholic acid、MRS-1220、Phenytoin和Piceid与烟雾病表达谱呈强烈负相关,提示潜在的治疗候选药物。酶联免疫吸附试验证实MMD中CFP和MGAT5B升高,PTPN11降低,与我们的生物信息学发现一致。此外,PTPN11在人脑微血管内皮细胞(HBMECs)中的敲低显著增强了管的形成,表明其在血管重塑中起作用。总之,这些结果强调了糖基化相关基因和免疫失调在烟雾病发病机制中的重要性。这些发现拓宽了我们对烟雾病潜在机制的理解,并强调了继续研究糖基化驱动途径以改善疾病管理的必要性。
{"title":"Integrative Machine Learning Approach to Explore Glycosylation Signatures and Immune Landscape in Moyamoya Disease.","authors":"Cunxin Tan, Jing Wang, Yanru Wang, Shaoqi Xu, Zhenyu Zhou, Junze Zhang, Shihao He, Ran Duan","doi":"10.1177/11779322251342412","DOIUrl":"10.1177/11779322251342412","url":null,"abstract":"<p><p>Moyamoya disease (MMD) is a rare, chronic cerebrovascular disorder of uncertain etiology. Although abnormal glucose metabolism has been implicated, the contribution of glycosylation-related genes in MMD remains elusive. In this study, we analyzed 2 transcriptome data sets (GSE189993 and GSE131293) from the Gene Expression Omnibus (GEO) database to identify 723 differentially expressed genes (DEGs) between MMD patients and controls. Intersection genes with known glycosylation-related genes underwent Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses. We utilized machine learning to select key hub genes, followed by immune cell infiltration and correlation analyses. In-depth immune cell analysis indicated that both CFP and MGAT5B were closely tied to various immune components, suggesting potential crosstalk between glycosylation pathways and immune regulation. Notably, CFP was positively associated with pDCs, HLA, and CCR, whereas MGAT5B correlated with B-cells, check-points, and T helper cells but showed a negative relationship with Tregs, hinting at an immunoregulatory mechanism influencing MMD progression. Motif-TF annotation highlighted csibp_M2095 as the motif with the highest normalized enrichment score (NES: 6.57). Reverse microRNA (miRNA)-gene prediction identified 75 miRNAs regulating these focus genes, along with 126 miRNA-miRNA interconnections. Connectivity Map (Cmap) analysis revealed that Chenodeoxycholic acid, MRS-1220, Phenytoin, and Piceid were strongly negatively correlated with MMD expression profiles, suggesting potential therapeutic candidates. Enzyme-linked immunosorbent assays confirmed elevated CFP and MGAT5B and reduced PTPN11 in MMD, aligning with our bioinformatic findings. Moreover, PTPN11 knockdown in human brain microvascular endothelial cells (HBMECs) significantly enhanced tube formation, indicating a role in vascular remodeling. Collectively, these results emphasize the importance of glycosylation-related genes and immune dysregulation in MMD pathogenesis. These findings broaden our understanding of MMD's underlying mechanisms and underscore the necessity of continued research into glycosylation-driven pathways for improved disease management.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251342412"},"PeriodicalIF":2.3,"publicationDate":"2025-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12103670/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144141431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structural Insights Into centSIRT6: Bioinformatic Analysis of N308K and A313S Substitution Effects. centSIRT6的结构洞察:N308K和A313S替代效应的生物信息学分析。
IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-05-21 eCollection Date: 2025-01-01 DOI: 10.1177/11779322251339698
Francisco Alejandro Lagunas-Rangel

Sirtuin 6 (SIRT6), a member of the class III histone deacetylase (HDAC) family, is crucial for the maintenance of general health and is associated with increased life expectancy and resistance to age-related diseases such as cancer and metabolic disorders. A comparative analysis of the SIRT6 gene in Ashkenazi Jewish (AJ) centenarians and noncentenarian controls found a distinct allele, centSIRT6, enriched in the centenarian group. This allele features 2 linked substitutions, N308K and A313S, and exhibits enhanced functions, including more efficient suppression of LINE1 retrotransposons, improved repair of DNA double-strand breaks, and increased efficiency in cancer cell killing. Notably, centSIRT6 shows lower deacetylase activity but higher mono-adenosine diphosphate (ADP) ribosyl transferase activity compared with the wild-type enzyme. This study used several bioinformatics tools to explore the structural changes caused by the N308K and A313S substitutions in centSIRT6 and to elucidate how these alterations contribute to changes in the enzymatic activities of SIRT6. The results indicate that these mutations reduce the structural flexibility of centSIRT6, thus weakening its interactions with acetyl-lysine but strengthening its interactions with ADP-ribose. This research provides useful information for future experimental studies to further investigate the molecular mechanisms of centSIRT6.

Sirtuin 6 (SIRT6)是III类组蛋白去乙酰化酶(HDAC)家族的一员,对维持一般健康至关重要,并与预期寿命的延长和对年龄相关疾病(如癌症和代谢紊乱)的抵抗力有关。一项对德系犹太人百岁老人和非百岁对照者的SIRT6基因的比较分析发现,一种独特的等位基因centSIRT6在百岁组中富集。该等位基因具有2个连锁取代,N308K和A313S,并表现出增强的功能,包括更有效地抑制LINE1反转录转座子,改善DNA双链断裂的修复,提高癌细胞杀伤效率。值得注意的是,与野生型酶相比,centSIRT6表现出较低的去乙酰化酶活性,但较高的单腺苷二磷酸(ADP)核糖基转移酶活性。本研究使用了几种生物信息学工具来探索由N308K和A313S取代引起的centSIRT6结构变化,并阐明这些变化如何导致SIRT6酶活性的变化。结果表明,这些突变降低了centSIRT6的结构灵活性,从而减弱了它与乙酰赖氨酸的相互作用,但增强了它与adp核糖的相互作用。本研究为进一步研究centSIRT6的分子机制提供了有益的实验信息。
{"title":"Structural Insights Into centSIRT6: Bioinformatic Analysis of N308K and A313S Substitution Effects.","authors":"Francisco Alejandro Lagunas-Rangel","doi":"10.1177/11779322251339698","DOIUrl":"10.1177/11779322251339698","url":null,"abstract":"<p><p>Sirtuin 6 (SIRT6), a member of the class III histone deacetylase (HDAC) family, is crucial for the maintenance of general health and is associated with increased life expectancy and resistance to age-related diseases such as cancer and metabolic disorders. A comparative analysis of the SIRT6 gene in Ashkenazi Jewish (AJ) centenarians and noncentenarian controls found a distinct allele, centSIRT6, enriched in the centenarian group. This allele features 2 linked substitutions, N308K and A313S, and exhibits enhanced functions, including more efficient suppression of LINE1 retrotransposons, improved repair of DNA double-strand breaks, and increased efficiency in cancer cell killing. Notably, centSIRT6 shows lower deacetylase activity but higher mono-adenosine diphosphate (ADP) ribosyl transferase activity compared with the wild-type enzyme. This study used several bioinformatics tools to explore the structural changes caused by the N308K and A313S substitutions in centSIRT6 and to elucidate how these alterations contribute to changes in the enzymatic activities of SIRT6. The results indicate that these mutations reduce the structural flexibility of centSIRT6, thus weakening its interactions with acetyl-lysine but strengthening its interactions with ADP-ribose. This research provides useful information for future experimental studies to further investigate the molecular mechanisms of centSIRT6.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251339698"},"PeriodicalIF":2.3,"publicationDate":"2025-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099093/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144141432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Reverse Vaccinology and Immunoinformatic Approach for the Designing of a Novel mRNA Vaccine Against Stomach Cancer Targeting the Potent Pathogenic Proteins of Helicobacter pylori. 以幽门螺杆菌强致病性蛋白为靶点的新型胃癌mRNA疫苗的反向疫苗学和免疫信息学研究
IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-04-16 eCollection Date: 2025-01-01 DOI: 10.1177/11779322251331104
Abanti Barua, Md Habib Ullah Masum, Ahmad Abdullah Mahdeen

Helicobacter pylori infection of the stomach's epithelial cells is a significant risk factor for stomach cancer. Various H pylori proteins (CagA, GGT, NapA, PatA, urease, and VacA) were targeted to design 2 messenger RNA (mRNA) vaccines, V1 and V2, using bioinformatics tools. Physicochemical parameters, secondary and tertiary structure, molecular docking and dynamic simulation, codon optimization, and RNA structure prediction have also been estimated for these developed vaccines. Physicochemical analyses revealed that these developed vaccines are soluble (GRAVY < 0), basic (pI < 7), and stable (aliphatic index < 80). The secondary and tertiary structure of the vaccines demonstrated robustness. The docking with toll-like receptors (TLRs) revealed that the vaccines have a potential affinity for TLR-2 (V1: -1132.3 kJ/mol, V2: -1093.6 kJ/mol) and TLR-4 (V1: -1042.7 kJ/mol, V2: -1201.2 kJ/mol), and molecular dynamics simulations confirmed their dynamic stability. Structural analyses of V1 (-505.96 kcal/mol) and V2 (-634.92 kcal/mol) mRNA vaccines underscored their stability. In addition, the vaccine showed a considerable rise in the counts of B cells and extended activation of both T cells was also observed for the vaccines, suggesting the potential for long-lasting immunity, and offering enhanced protection against H pylori. These findings not only suggest potential long-lasting immunity against H pylori but also offer hope for the future of stomach cancer prevention. Notably, the study emphasizes the need for subsequent animal and human-based studies to confirm these promising results.

胃上皮细胞幽门螺杆菌感染是胃癌的重要危险因素。利用生物信息学工具,以各种幽门螺杆菌蛋白(CagA、GGT、NapA、PatA、脲酶和VacA)为靶点,设计2种信使RNA (mRNA)疫苗V1和V2。对这些疫苗的理化参数、二级和三级结构、分子对接和动态模拟、密码子优化和RNA结构预测也进行了估计。理化分析表明,这些研制的疫苗是可溶的(肉汤幽门螺杆菌)。这些发现不仅表明对幽门螺杆菌具有潜在的长期免疫力,而且为未来的胃癌预防提供了希望。值得注意的是,该研究强调需要后续的动物和人类研究来证实这些有希望的结果。
{"title":"A Reverse Vaccinology and Immunoinformatic Approach for the Designing of a Novel mRNA Vaccine Against Stomach Cancer Targeting the Potent Pathogenic Proteins of <i>Helicobacter pylori</i>.","authors":"Abanti Barua, Md Habib Ullah Masum, Ahmad Abdullah Mahdeen","doi":"10.1177/11779322251331104","DOIUrl":"https://doi.org/10.1177/11779322251331104","url":null,"abstract":"<p><p><i>Helicobacter pylori</i> infection of the stomach's epithelial cells is a significant risk factor for stomach cancer. Various <i>H pylori</i> proteins (CagA, GGT, NapA, PatA, urease, and VacA) were targeted to design 2 messenger RNA (mRNA) vaccines, V1 and V2, using bioinformatics tools. Physicochemical parameters, secondary and tertiary structure, molecular docking and dynamic simulation, codon optimization, and RNA structure prediction have also been estimated for these developed vaccines. Physicochemical analyses revealed that these developed vaccines are soluble (GRAVY < 0), basic (pI < 7), and stable (aliphatic index < 80). The secondary and tertiary structure of the vaccines demonstrated robustness. The docking with toll-like receptors (TLRs) revealed that the vaccines have a potential affinity for TLR-2 (V1: -1132.3 kJ/mol, V2: -1093.6 kJ/mol) and TLR-4 (V1: -1042.7 kJ/mol, V2: -1201.2 kJ/mol), and molecular dynamics simulations confirmed their dynamic stability. Structural analyses of V1 (-505.96 kcal/mol) and V2 (-634.92 kcal/mol) mRNA vaccines underscored their stability. In addition, the vaccine showed a considerable rise in the counts of B cells and extended activation of both T cells was also observed for the vaccines, suggesting the potential for long-lasting immunity, and offering enhanced protection against <i>H pylori</i>. These findings not only suggest potential long-lasting immunity against <i>H pylori</i> but also offer hope for the future of stomach cancer prevention. Notably, the study emphasizes the need for subsequent animal and human-based studies to confirm these promising results.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251331104"},"PeriodicalIF":2.3,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12033411/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143967723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Inference for Drug Discovery by High Negative Samples and Oversampling. 高负样本和过采样药物发现的贝叶斯推断。
IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-04-12 eCollection Date: 2025-01-01 DOI: 10.1177/11779322251328269
Manh Hung Le, Nam Anh Dao, Xuan Tho Dang

Drug repositioning holds great promise for reducing the time and cost associated with traditional drug discovery, but it faces significant challenges related to data imbalance and noise in negative samples. In this article, we introduce a novel method leveraging high negative oversampling (HNO) to address these challenges. Our approach integrates HNO with advanced techniques such as network-based graph mining, matrix factorization, and Bayesian inference, specifically designed for imbalanced data scenarios. Constructing high-quality negative samples is crucial to mitigate the detrimental effects of noisy negative data and enhance model performance. Experimental results demonstrate the efficacy of our approach in enhancing the performance of drug discovery models by effectively managing data imbalance and refining the selection of negative samples. This methodology provides a robust framework for improving drug repositioning, with potential applications in broader biomedical domains.

药物重新定位在减少与传统药物发现相关的时间和成本方面具有很大的前景,但它面临着与负样本数据不平衡和噪声相关的重大挑战。在本文中,我们介绍了一种利用高负过采样(HNO)的新方法来解决这些挑战。我们的方法将HNO与基于网络的图挖掘、矩阵分解和贝叶斯推理等先进技术集成在一起,这些技术专门为不平衡数据场景设计。构建高质量的负样本对于减轻噪声负数据的不利影响和提高模型性能至关重要。实验结果表明,我们的方法通过有效地管理数据不平衡和改进负样本的选择,提高了药物发现模型的性能。这种方法为改进药物重新定位提供了一个强有力的框架,在更广泛的生物医学领域具有潜在的应用。
{"title":"Bayesian Inference for Drug Discovery by High Negative Samples and Oversampling.","authors":"Manh Hung Le, Nam Anh Dao, Xuan Tho Dang","doi":"10.1177/11779322251328269","DOIUrl":"https://doi.org/10.1177/11779322251328269","url":null,"abstract":"<p><p>Drug repositioning holds great promise for reducing the time and cost associated with traditional drug discovery, but it faces significant challenges related to data imbalance and noise in negative samples. In this article, we introduce a novel method leveraging high negative oversampling (HNO) to address these challenges. Our approach integrates HNO with advanced techniques such as network-based graph mining, matrix factorization, and Bayesian inference, specifically designed for imbalanced data scenarios. Constructing high-quality negative samples is crucial to mitigate the detrimental effects of noisy negative data and enhance model performance. Experimental results demonstrate the efficacy of our approach in enhancing the performance of drug discovery models by effectively managing data imbalance and refining the selection of negative samples. This methodology provides a robust framework for improving drug repositioning, with potential applications in broader biomedical domains.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251328269"},"PeriodicalIF":2.3,"publicationDate":"2025-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12033409/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144061907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Revolutionizing Chikungunya Vaccines: mRNA Breakthroughs With Molecular and Immune Simulations. 基孔肯雅热疫苗的革命:分子和免疫模拟的mRNA突破
IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-04-03 eCollection Date: 2025-01-01 DOI: 10.1177/11779322251324859
Md Habib Ullah Masum, Ahmad Abdullah Mahdeen, Abanti Barua

With the ability to cause massive epidemics that have consequences on millions of individuals globally, the Chikungunya virus (CHIKV) emerges as a severe menace. Developing an effective vaccine is urgent as no effective therapeutics are available for such viral infections. Therefore, we designed a novel mRNA vaccine against CHIKV with a combination of highly antigenic and potential MHC-I, MHC-II, and B-cell epitopes from the structural polyprotein. The vaccine demonstrated well-characterized physicochemical properties, indicating its solubility and potential functional stability within the body (GRAVY score of -0.639). Structural analyses of the vaccine revealed a well-stabilized secondary and tertiary structure (Ramachandran score of 82.8% and a Z-score of -4.17). Docking studies of the vaccine with TLR-2 (-1027.7 KJ/mol) and TLR-4 (-1212.4 KJ/mol) exhibited significant affinity with detailed hydrogen bond interactions. Molecular dynamics simulations highlighted distinct conformational dynamics among the vaccine, "vaccine-TLR-2" and "vaccine-TLR-4" complexes. The vaccine's ability to elicit both innate and adaptive immune responses, including the presence of memory B-cells and T-cells, persistent B-cell immunity for a year, and the activation of TH cells leading to the release of IFN-γ and IL-2, has significant implications for its potential effectiveness. The CHIKV vaccine developed in this study shows promise as a potential candidate for future vaccine production against CHIKV, suggesting its suitability for further clinical advancement, including in vitro and in vivo experiments.

基孔肯雅病毒(CHIKV)能够引起大规模流行病,对全球数百万人造成影响,成为一种严重威胁。开发有效的疫苗迫在眉睫,因为没有有效的治疗方法可用于这种病毒感染。因此,我们设计了一种新的抗CHIKV mRNA疫苗,该疫苗结合了高抗原性和潜在的MHC-I、MHC-II和来自结构多蛋白的b细胞表位。该疫苗表现出良好的物理化学特性,表明其在体内的溶解度和潜在的功能稳定性(肉汁评分为-0.639)。结构分析显示该疫苗具有稳定的二级和三级结构(Ramachandran评分为82.8%,z评分为-4.17)。与TLR-2 (-1027.7 KJ/mol)和TLR-4 (-1212.4 KJ/mol)的对接研究显示,该疫苗与详细的氢键相互作用具有显著的亲和力。分子动力学模拟强调了疫苗、“疫苗- tlr -2”和“疫苗- tlr -4”复合物之间不同的构象动力学。该疫苗能够引发先天性和适应性免疫反应,包括记忆性b细胞和t细胞的存在,持续一年的b细胞免疫,以及TH细胞的激活导致IFN-γ和IL-2的释放,这对其潜在的有效性具有重要意义。本研究开发的CHIKV疫苗有望成为未来生产抗CHIKV疫苗的潜在候选疫苗,表明其适合进一步的临床进展,包括体外和体内实验。
{"title":"Revolutionizing Chikungunya Vaccines: mRNA Breakthroughs With Molecular and Immune Simulations.","authors":"Md Habib Ullah Masum, Ahmad Abdullah Mahdeen, Abanti Barua","doi":"10.1177/11779322251324859","DOIUrl":"10.1177/11779322251324859","url":null,"abstract":"<p><p>With the ability to cause massive epidemics that have consequences on millions of individuals globally, the Chikungunya virus (CHIKV) emerges as a severe menace. Developing an effective vaccine is urgent as no effective therapeutics are available for such viral infections. Therefore, we designed a novel mRNA vaccine against CHIKV with a combination of highly antigenic and potential MHC-I, MHC-II, and B-cell epitopes from the structural polyprotein. The vaccine demonstrated well-characterized physicochemical properties, indicating its solubility and potential functional stability within the body (GRAVY score of -0.639). Structural analyses of the vaccine revealed a well-stabilized secondary and tertiary structure (Ramachandran score of 82.8% and a Z-score of -4.17). Docking studies of the vaccine with TLR-2 (-1027.7 KJ/mol) and TLR-4 (-1212.4 KJ/mol) exhibited significant affinity with detailed hydrogen bond interactions. Molecular dynamics simulations highlighted distinct conformational dynamics among the vaccine, \"vaccine-TLR-2\" and \"vaccine-TLR-4\" complexes. The vaccine's ability to elicit both innate and adaptive immune responses, including the presence of memory B-cells and T-cells, persistent B-cell immunity for a year, and the activation of TH cells leading to the release of IFN-γ and IL-2, has significant implications for its potential effectiveness. The CHIKV vaccine developed in this study shows promise as a potential candidate for future vaccine production against CHIKV, suggesting its suitability for further clinical advancement, including in vitro and in vivo experiments.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251324859"},"PeriodicalIF":2.3,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11967231/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143779130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Microchimerism Literature Atlas. 微嵌合文学图集。
IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-04-03 eCollection Date: 2025-01-01 DOI: 10.1177/11779322251324104
Michael Christian Gruber, Daniel Kummer, Katja Sallinger, Henderson James Cleaves, Arsev Umur Aydinoğlu, Thomas Kroneis

The Microchimerism Literature Atlas (MCLA) is a comprehensive online dataset to facilitate the investigation of microchimerism (MC), condition where individuals harbor cells from another individual of the same species. The MCLA provides access to more than 15 000 references from MC research, covering peer-reviewed articles and reviews from 1970 to the present. Key features include a multidimensional search function and logical operators for assembling search queries. The MCLA dataset offers a clearly structured data table view, combined with dynamic graphical data representation and visual citation analysis, aiding in the investigation and identification of research trends and patterns. The MCLA supports data export in various formats and receives regular updates. The MCLA is being developed as an essential resource for the MC research community while its framework is easily adaptable for custom literature datasets, enabling its use in other research fields.

微嵌合文献图谱(MCLA)是一个综合性的在线数据集,用于促进微嵌合(MC)的研究,即个体携带来自同一物种的另一个个体的细胞。MCLA提供了超过15,000篇MC研究参考文献,涵盖1970年至今的同行评审文章和评论。主要特性包括多维搜索功能和用于组合搜索查询的逻辑运算符。MCLA数据集提供结构清晰的数据表视图,结合动态图形数据表示和可视化引文分析,有助于调查和识别研究趋势和模式。MCLA支持多种格式的数据导出,并定期更新。MCLA正在发展成为MC研究界的一个重要资源,而它的框架很容易适应自定义文献数据集,使其能够在其他研究领域使用。
{"title":"The Microchimerism Literature Atlas.","authors":"Michael Christian Gruber, Daniel Kummer, Katja Sallinger, Henderson James Cleaves, Arsev Umur Aydinoğlu, Thomas Kroneis","doi":"10.1177/11779322251324104","DOIUrl":"10.1177/11779322251324104","url":null,"abstract":"<p><p>The Microchimerism Literature Atlas (MCLA) is a comprehensive online dataset to facilitate the investigation of microchimerism (MC), condition where individuals harbor cells from another individual of the same species. The MCLA provides access to more than 15 000 references from MC research, covering peer-reviewed articles and reviews from 1970 to the present. Key features include a multidimensional search function and logical operators for assembling search queries. The MCLA dataset offers a clearly structured data table view, combined with dynamic graphical data representation and visual citation analysis, aiding in the investigation and identification of research trends and patterns. The MCLA supports data export in various formats and receives regular updates. The MCLA is being developed as an essential resource for the MC research community while its framework is easily adaptable for custom literature datasets, enabling its use in other research fields.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251324104"},"PeriodicalIF":2.3,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11967202/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143779132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of a Novel mRNA Vaccine Against Shigella Pathotypes Causing Widespread Shigellosis Endemic: An In-Silico Immunoinformatic Approach. 一种新型mRNA疫苗的研制:一种硅免疫信息学方法对抗引起广泛志贺氏菌病的致病性。
IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-03-28 eCollection Date: 2025-01-01 DOI: 10.1177/11779322251328302
Abdur Razzak, Otun Saha, Khandokar Fahmida Sultana, Mohammad Ruhul Amin, Abdullah Bin Zahid, Afroza Sultana, Uditi Paul Bristi, Sultana Rajia, Nikkon Sarker, Md Mizanur Rahaman, Newaz Mohammed Bahadur, Foysal Hossen

Shigellosis remains a major global health concern, particularly in regions with poor sanitation and limited access to clean water. This study used immunoinformatics and reverse vaccinology to design a potential mRNA vaccine targeting Shigella pathotypes out of 4071 proteins from Shigella sonnei str. Ss046, 4 key antigenic candidates were identified: putative outer membrane protein (Q3YZL0), PapC-like porin protein (Q3YZM5), putative fimbrial-like protein (Q3Z3I2), and lipopolysaccharide (LPS)-assembly protein LptD (Q3Z5V5), ensuring broad pathotype coverage. A multitope vaccine was designed incorporating cytotoxic T lymphocyte, helper T lymphocyte, and B-cell epitopes, linked with suitable linkers and adjuvants to enhance immunogenicity. Computational analyses predicted vaccine's favorable antigenicity, solubility, and stability, while molecular docking and dynamic simulations demonstrated strong binding affinity and stability with Toll-like receptor 4 (TLR-4), indicating potential for robust immune activation. Immune simulations predicted strong humoral and cellular immune responses, characterized by significant cytokine production and long-term immune memory. Structural evaluations of the complex, including radius of gyration, root mean square deviation, root mean square fluctuation, and solvent accessibility, confirmed the vaccine's structural integrity, and stability under physiological conditions. This research contributes to the ongoing effort to alleviate the global burden of Shigella infections, providing a foundation for future wet laboratory investigations aimed at vaccine development.

志贺氏菌病仍然是一个主要的全球卫生问题,特别是在卫生条件差和获得清洁水有限的地区。本研究利用免疫信息学和反向疫苗学技术,从索尼氏志贺氏菌的4071种蛋白中设计了一种潜在的针对志贺氏菌病型的mRNA疫苗。Ss046鉴定出4种关键抗原候选物:推定的外膜蛋白(Q3YZL0)、papc样孔蛋白(Q3YZM5)、推定的纤维样蛋白(Q3Z3I2)和脂多糖(LPS)组装蛋白ltd (Q3Z5V5),确保了广泛的病型覆盖。设计了一种包含细胞毒性T淋巴细胞、辅助性T淋巴细胞和b细胞表位的多位点疫苗,并与合适的连接物和佐剂连接以增强免疫原性。计算分析预测疫苗具有良好的抗原性、溶解度和稳定性,而分子对接和动态模拟显示疫苗与toll样受体4 (TLR-4)具有很强的结合亲和力和稳定性,表明疫苗具有强大的免疫激活潜力。免疫模拟预测了强烈的体液和细胞免疫反应,其特征是显著的细胞因子产生和长期免疫记忆。复合物的结构评价,包括旋转半径、均方根偏差、均方根波动和溶剂可及性,证实了疫苗在生理条件下的结构完整性和稳定性。这项研究有助于正在进行的减轻志贺氏菌感染全球负担的努力,为未来旨在开发疫苗的湿实验室调查提供基础。
{"title":"Development of a Novel mRNA Vaccine Against <i>Shigella</i> Pathotypes Causing Widespread Shigellosis Endemic: An In-Silico Immunoinformatic Approach.","authors":"Abdur Razzak, Otun Saha, Khandokar Fahmida Sultana, Mohammad Ruhul Amin, Abdullah Bin Zahid, Afroza Sultana, Uditi Paul Bristi, Sultana Rajia, Nikkon Sarker, Md Mizanur Rahaman, Newaz Mohammed Bahadur, Foysal Hossen","doi":"10.1177/11779322251328302","DOIUrl":"10.1177/11779322251328302","url":null,"abstract":"<p><p>Shigellosis remains a major global health concern, particularly in regions with poor sanitation and limited access to clean water. This study used immunoinformatics and reverse vaccinology to design a potential mRNA vaccine targeting <i>Shigella</i> pathotypes out of 4071 proteins from <i>Shigella sonnei</i> str. Ss046, 4 key antigenic candidates were identified: putative outer membrane protein (Q3YZL0), PapC-like porin protein (Q3YZM5), putative fimbrial-like protein (Q3Z3I2), and lipopolysaccharide (LPS)-assembly protein LptD (Q3Z5V5), ensuring broad pathotype coverage. A multitope vaccine was designed incorporating cytotoxic T lymphocyte, helper T lymphocyte, and B-cell epitopes, linked with suitable linkers and adjuvants to enhance immunogenicity. Computational analyses predicted vaccine's favorable antigenicity, solubility, and stability, while molecular docking and dynamic simulations demonstrated strong binding affinity and stability with Toll-like receptor 4 (TLR-4), indicating potential for robust immune activation. Immune simulations predicted strong humoral and cellular immune responses, characterized by significant cytokine production and long-term immune memory. Structural evaluations of the complex, including radius of gyration, root mean square deviation, root mean square fluctuation, and solvent accessibility, confirmed the vaccine's structural integrity, and stability under physiological conditions. This research contributes to the ongoing effort to alleviate the global burden of <i>Shigella</i> infections, providing a foundation for future wet laboratory investigations aimed at vaccine development.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251328302"},"PeriodicalIF":2.3,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11951904/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143751080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Gene Attention Focus (DyGAF): Enhancing Biomarker Identification Through Dual-Model Attention Networks. 动态基因注意焦点(DyGAF):通过双模型注意网络增强生物标志物识别。
IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-03-27 eCollection Date: 2025-01-01 DOI: 10.1177/11779322251325390
Md Khairul Islam, Himanshu Wagh, Hairong Wei

The DyGAF model, which stands for Dynamic Gene Attention Focus, is specifically designed and tailored to address the challenges in biomarker detection, progression reporting of pathogen infection, and disease diagnostics. The DyGAF model introduced a novel dual-model attention-based mechanism within neural networks, combined with machine learning algorithms to enhance the process of biomarker identification. The model transcended traditional diagnostic approaches by meticulously analyzing gene expression data. DyGAF not only identified but also ranked genes based on their significance, revealing a comprehensive list of the top genes essential for disease detection and prognosis. In addition, KEGG pathways, Wiki Pathways, and Gene Ontology-based analyses provided a multileveled evaluation of the genes' roles. In our analyses, we tailored COVID-19 gene expression profile from nasopharyngeal swabs that offer a more nuanced view of the intricate interplay between the host and the virus. The genes ranked by the DyGAF model were compared against those selected by differential expression analysis and random forest feature selection methods for further validation of our model. DyGAF demonstrated its prowess in identifying important biomarkers that could enrich gene ontologies and pathways crucial for elucidating the pathogenesis of COVID-19. Furthermore, DyGAF was also employed for diagnosing COVID-19 patients by classifying gene-expression profiles with an accuracy of 94.23%. Benchmarking against other conventional models revealed DyGAF's superior performance, highlighting its effectiveness in identifying and categorizing COVID-19 cases. In summary, DyGAF model represents a significant advancement in genomic research, providing a more comprehensive and precise tool for identifying key genetic markers and unraveling the complex biological insights of a disease. The DyGAF model is available as a software package at the following link: https://github.com/hiddenntreasure/DyGAF.

DyGAF模型,代表动态基因关注焦点,是专门设计和定制的,用于解决生物标志物检测,病原体感染进展报告和疾病诊断方面的挑战。DyGAF模型在神经网络中引入了一种新的基于注意力的双模型机制,并结合机器学习算法来增强生物标志物的识别过程。该模型通过细致地分析基因表达数据,超越了传统的诊断方法。DyGAF不仅对基因进行识别,还根据其重要性对基因进行排序,从而揭示了对疾病检测和预后至关重要的顶级基因的综合列表。此外,KEGG通路、Wiki通路和基于基因本体论的分析提供了对基因作用的多层次评估。在我们的分析中,我们从鼻咽拭子中定制了COVID-19基因表达谱,为宿主与病毒之间复杂的相互作用提供了更细致的视角。将DyGAF模型排序的基因与差异表达分析和随机森林特征选择方法选择的基因进行比较,以进一步验证我们的模型。DyGAF展示了其在识别重要生物标志物方面的能力,这些生物标志物可以丰富对阐明COVID-19发病机制至关重要的基因本体和途径。此外,DyGAF还用于诊断COVID-19患者,对基因表达谱进行分类,准确率为94.23%。与其他传统模型的对比显示,DyGAF的性能优越,突出了其在COVID-19病例识别和分类方面的有效性。总之,DyGAF模型代表了基因组研究的重大进步,为识别关键遗传标记和揭示疾病的复杂生物学见解提供了更全面和精确的工具。DyGAF模型作为软件包可在以下链接获得:https://github.com/hiddenntreasure/DyGAF。
{"title":"Dynamic Gene Attention Focus (DyGAF): Enhancing Biomarker Identification Through Dual-Model Attention Networks.","authors":"Md Khairul Islam, Himanshu Wagh, Hairong Wei","doi":"10.1177/11779322251325390","DOIUrl":"10.1177/11779322251325390","url":null,"abstract":"<p><p>The DyGAF model, which stands for Dynamic Gene Attention Focus, is specifically designed and tailored to address the challenges in biomarker detection, progression reporting of pathogen infection, and disease diagnostics. The DyGAF model introduced a novel dual-model attention-based mechanism within neural networks, combined with machine learning algorithms to enhance the process of biomarker identification. The model transcended traditional diagnostic approaches by meticulously analyzing gene expression data. DyGAF not only identified but also ranked genes based on their significance, revealing a comprehensive list of the top genes essential for disease detection and prognosis. In addition, KEGG pathways, Wiki Pathways, and Gene Ontology-based analyses provided a multileveled evaluation of the genes' roles. In our analyses, we tailored COVID-19 gene expression profile from nasopharyngeal swabs that offer a more nuanced view of the intricate interplay between the host and the virus. The genes ranked by the DyGAF model were compared against those selected by differential expression analysis and random forest feature selection methods for further validation of our model. DyGAF demonstrated its prowess in identifying important biomarkers that could enrich gene ontologies and pathways crucial for elucidating the pathogenesis of COVID-19. Furthermore, DyGAF was also employed for diagnosing COVID-19 patients by classifying gene-expression profiles with an accuracy of 94.23%. Benchmarking against other conventional models revealed DyGAF's superior performance, highlighting its effectiveness in identifying and categorizing COVID-19 cases. In summary, DyGAF model represents a significant advancement in genomic research, providing a more comprehensive and precise tool for identifying key genetic markers and unraveling the complex biological insights of a disease. The DyGAF model is available as a software package at the following link: https://github.com/hiddenntreasure/DyGAF.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251325390"},"PeriodicalIF":2.3,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11951896/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143751083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evolutionary and epidemic dynamics of COVID-19 in Germany exemplified by three Bayesian phylodynamic case studies. 以三个贝叶斯系统动力学案例为例分析新冠肺炎在德国的进化和流行动力学。
IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-03-12 eCollection Date: 2025-01-01 DOI: 10.1177/11779322251321065
Sanni Översti, Ariane Weber, Viktor Baran, Bärbel Kieninger, Alexander Dilthey, Torsten Houwaart, Andreas Walker, Wulf Schneider-Brachert, Denise Kühnert

The importance of genomic surveillance strategies for pathogens has been particularly evident during the coronavirus disease 2019 (COVID-19) pandemic, as genomic data from the causative agent, severe acute respiratory syndrome coronavirus type 2 (SARS-CoV-2), have guided public health decisions worldwide. Bayesian phylodynamic inference, integrating epidemiology and evolutionary biology, has become an essential tool in genomic epidemiological surveillance. It enables the estimation of epidemiological parameters, such as the reproductive number, from pathogen sequence data alone. Despite the phylodynamic approach being widely adopted, the abundance of phylodynamic models often makes it challenging to select the appropriate model for specific research questions. This article illustrates the application of phylodynamic birth-death-sampling models in public health using genomic data, with a focus on SARS-CoV-2. Targeting researchers less familiar with phylodynamics, it introduces a comprehensive workflow, including the conceptualisation of a research study and detailed steps for data preprocessing and postprocessing. In addition, we demonstrate the versatility of birth-death-sampling models through three case studies from Germany, utilising the BEAST2 software and its model implementations. Each case study addresses a distinct research question relevant not only to SARS-CoV-2 but also to other pathogens: Case study 1 finds traces of a superspreading event at the start of an early outbreak, exemplifying how simple models for genomic data can provide information that would otherwise only be accessible through extensive contact tracing. Case study 2 compares transmission dynamics in a nosocomial outbreak to community transmission, highlighting distinct dynamics through integrative analysis. Case study 3 investigates whether local transmission patterns align with national trends, demonstrating how phylodynamic models can disentangle complex population substructure with little additional information. For each case study, we emphasise critical points where model assumptions and data properties may misalign and outline appropriate validation assessments. Overall, we aim to provide researchers with examples on using birth-death-sampling models in genomic epidemiology, balancing theoretical and practical aspects.

在2019冠状病毒病(COVID-19)大流行期间,病原体基因组监测战略的重要性尤为明显,因为病原体严重急性呼吸综合征冠状病毒2型(SARS-CoV-2)的基因组数据指导了全球公共卫生决策。贝叶斯系统动力学推断集流行病学和进化生物学于一体,已成为基因组流行病学监测的重要工具。它能够仅从病原体序列数据估计流行病学参数,如繁殖数。尽管系统动力学方法被广泛采用,但系统动力学模型的丰富往往使选择适合特定研究问题的模型具有挑战性。本文阐述了使用基因组数据的系统动力学出生-死亡抽样模型在公共卫生中的应用,重点是SARS-CoV-2。针对不太熟悉系统动力学的研究人员,它介绍了一个全面的工作流程,包括研究的概念化和数据预处理和后处理的详细步骤。此外,我们利用BEAST2软件及其模型实现,通过来自德国的三个案例研究,展示了出生-死亡抽样模型的多功能性。每个案例研究都解决了一个独特的研究问题,不仅与SARS-CoV-2有关,而且与其他病原体有关:案例研究1在早期疫情开始时发现了超级传播事件的痕迹,举例说明了基因组数据的简单模型如何能够提供信息,否则这些信息只能通过广泛的接触者追踪获得。案例研究2比较了医院暴发与社区传播的传播动态,通过综合分析突出了不同的传播动态。案例研究3调查了地方传播模式是否与国家趋势一致,展示了系统动力学模型如何在几乎没有额外信息的情况下解开复杂的种群亚结构。对于每个案例研究,我们强调模型假设和数据属性可能不一致的关键点,并概述适当的验证评估。总的来说,我们的目标是为研究人员提供在基因组流行病学中使用出生-死亡抽样模型的例子,平衡理论和实践方面。
{"title":"Evolutionary and epidemic dynamics of COVID-19 in Germany exemplified by three Bayesian phylodynamic case studies.","authors":"Sanni Översti, Ariane Weber, Viktor Baran, Bärbel Kieninger, Alexander Dilthey, Torsten Houwaart, Andreas Walker, Wulf Schneider-Brachert, Denise Kühnert","doi":"10.1177/11779322251321065","DOIUrl":"10.1177/11779322251321065","url":null,"abstract":"<p><p>The importance of genomic surveillance strategies for pathogens has been particularly evident during the coronavirus disease 2019 (COVID-19) pandemic, as genomic data from the causative agent, severe acute respiratory syndrome coronavirus type 2 (SARS-CoV-2), have guided public health decisions worldwide. Bayesian phylodynamic inference, integrating epidemiology and evolutionary biology, has become an essential tool in genomic epidemiological surveillance. It enables the estimation of epidemiological parameters, such as the reproductive number, from pathogen sequence data alone. Despite the phylodynamic approach being widely adopted, the abundance of phylodynamic models often makes it challenging to select the appropriate model for specific research questions. This article illustrates the application of phylodynamic birth-death-sampling models in public health using genomic data, with a focus on SARS-CoV-2. Targeting researchers less familiar with phylodynamics, it introduces a comprehensive workflow, including the conceptualisation of a research study and detailed steps for data preprocessing and postprocessing. In addition, we demonstrate the versatility of birth-death-sampling models through three case studies from Germany, utilising the BEAST2 software and its model implementations. Each case study addresses a distinct research question relevant not only to SARS-CoV-2 but also to other pathogens: Case study 1 finds traces of a superspreading event at the start of an early outbreak, exemplifying how simple models for genomic data can provide information that would otherwise only be accessible through extensive contact tracing. Case study 2 compares transmission dynamics in a nosocomial outbreak to community transmission, highlighting distinct dynamics through integrative analysis. Case study 3 investigates whether local transmission patterns align with national trends, demonstrating how phylodynamic models can disentangle complex population substructure with little additional information. For each case study, we emphasise critical points where model assumptions and data properties may misalign and outline appropriate validation assessments. Overall, we aim to provide researchers with examples on using birth-death-sampling models in genomic epidemiology, balancing theoretical and practical aspects.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251321065"},"PeriodicalIF":2.3,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11898094/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143613195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Bioinformatics and Biology Insights
全部 Contrib. Mineral. Petrol. Chem. Ecol. AAPG Bull. Gondwana Res. Environmental Claims Journal ECOSYSTEMS Paleontol. J. Asia-Pac. J. Atmos. Sci. ARCHAEOMETRY Hydrol. Processes Yan Ke Xue Bao (Hong Kong) Memai Heiko Igaku Contrib. Plasma Phys. Environ. Eng. Manage. J. RADIOCARBON Focus on Autism and Other Developmental Disabilities J. Atmos. Sol. Terr. Phys. ENVIRONMENT Adv. Meteorol. ACTA RADIOL Environmental dermatology : the official journal of the Japanese Society for Contact Dermatitis 2013 IEEE Conference on Computer Vision and Pattern Recognition Ecol. Indic. ARCH ACOUST J. Earth Sci. Acta Oceanolog. Sin. FRONT EARTH SCI-PRC ACTA ORTHOP Int. Geol. Rev. Exp. Eye Res. Eurasian Physical Technical Journal World J Clin Infect Dis Eurasian Journal of Emergency Medicine ACTA HAEMATOL-BASEL Communications Earth & Environment IEEE Trans. Appl. Supercond. Appl. Phys. Rev. 2008 International Conference on Electronic Packaging Technology & High Density Packaging Geosci. Model Dev. GEOLOGY Acta Trop. TERR ATMOS OCEAN SCI Environmental Progress Atmos. Chem. Phys. Leading Edge EUR PSYCHIAT Environ. Eng. Sci. AIDS Phys. Chem. Miner. [Rinsho ketsueki] The Japanese journal of clinical hematology ACTA GEOL SIN-ENGL Environ. Chem. ACTA PARASITOL AGE AGEING Ocean Dyn. Expert Rev. Mol. Diagn. Total Synthesis of Bioactive Natural Products "Laboratorio;" analisis clinicos, bacteriologia, inmunologia, parasitologia, hematologia, anatomia patologica, quimica clinica Family Practice Management ACTA GASTRO-ENT BELG Rev. Geophys. Espacio Tiempo y Forma. Serie VII, Historia del Arte Journal of Women and Minorities in Science and Engineering Mineral. Mag. EUR PHYS J-SPEC TOP ATL GEOL Classical Quantum Gravity J. Math. Phys. Environmental Sustainability J STAT MECH-THEORY E INDIAN J PURE AP PHY AIDS RES HUM RETROV Astrophys. Space Sci. Biomed Eng (NY) Aquat. Geochem. Expert Opin. Pharmacother. Acta Geophys. Transactions of The Royal Society of South Africa ACTA GEOL POL ARCT ANTARCT ALP RES ACTA PETROL SIN CRIT REV ENV SCI TEC ATMOSPHERE-BASEL Annu. Rev. Earth Planet. Sci. Environ. Prog. Sustainable Energy Am. J. Phys. Anthropol. Am. Mineral. Energy Ecol Environ Am. J. Sci. Geostand. Geoanal. Res. Atmos. Res. Geobiology Acta Geochimica Ecol. Eng. Org. Geochem. BIOGEOSCIENCES IZV-PHYS SOLID EART+ Appl. Geochem. Conserv. Genet. Resour. Geochim. Cosmochim. Acta
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1