首页 > 最新文献

Journal of Integrative Bioinformatics最新文献

英文 中文
Colon cancer survival prediction from gland shapes within histology slides using deep learning. 利用深度学习从组织学切片中的腺体形状预测结肠癌存活。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-07-14 eCollection Date: 2025-06-01 DOI: 10.1515/jib-2024-0052
Rawan Gedeon, Atulya Nagar

This study investigates the application of deep learning techniques for segmenting glands in histopathological images of colorectal cancer. We trained two convolutional neural network models, U-Net and DCAN, on a combination of the GlaS and CRAG datasets to enhance generalization across diverse histological appearances, selecting DCAN for its superior accuracy in delineating gland boundaries. The goal was to achieve robust gland segmentation applicable to whole slide images (WSIs) from The Cancer Genome Atlas (TCGA). Using the segmented glands, we extracted patient-level morphological features and used them to predict survival outcomes. A Cox proportional hazards model was trained on these features and achieved a high concordance index, indicating strong predictive performance. Patients were then stratified into high- and low-risk groups, with significant differences in survival distributions (log-rank p-value: 0.01317). In addition, we benchmarked our models against state-of-the-art gland segmentation methods on GlaS and CRAG, highlighting the trade-off between domain-specific accuracy and cross-dataset robustness.

本研究探讨了深度学习技术在结直肠癌组织病理图像中分割腺体的应用。我们在GlaS和CRAG数据集的组合上训练了两个卷积神经网络模型U-Net和DCAN,以增强对不同组织学外观的泛化,选择DCAN是因为它在描绘腺体边界方面具有卓越的准确性。目标是实现适用于来自癌症基因组图谱(TCGA)的整个幻灯片图像(WSIs)的稳健腺体分割。通过分割腺体,我们提取了患者水平的形态学特征,并用它们来预测生存结果。根据这些特征训练了Cox比例风险模型,并获得了较高的一致性指数,表明具有较强的预测性能。然后将患者分为高危组和低危组,生存分布有显著差异(log-rank p值:0.01317)。此外,我们将我们的模型与GlaS和CRAG上最先进的腺体分割方法进行了基准测试,强调了特定领域准确性和跨数据集鲁棒性之间的权衡。
{"title":"Colon cancer survival prediction from gland shapes within histology slides using deep learning.","authors":"Rawan Gedeon, Atulya Nagar","doi":"10.1515/jib-2024-0052","DOIUrl":"10.1515/jib-2024-0052","url":null,"abstract":"<p><p>This study investigates the application of deep learning techniques for segmenting glands in histopathological images of colorectal cancer. We trained two convolutional neural network models, U-Net and DCAN, on a combination of the GlaS and CRAG datasets to enhance generalization across diverse histological appearances, selecting DCAN for its superior accuracy in delineating gland boundaries. The goal was to achieve robust gland segmentation applicable to whole slide images (WSIs) from The Cancer Genome Atlas (TCGA). Using the segmented glands, we extracted patient-level morphological features and used them to predict survival outcomes. A Cox proportional hazards model was trained on these features and achieved a high concordance index, indicating strong predictive performance. Patients were then stratified into high- and low-risk groups, with significant differences in survival distributions (log-rank <i>p</i>-value: 0.01317). In addition, we benchmarked our models against state-of-the-art gland segmentation methods on GlaS and CRAG, highlighting the trade-off between domain-specific accuracy and cross-dataset robustness.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569585/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144621178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial - 20 years Journal of Integrative Bioinformatics. 编辑- 20 年整合生物信息学杂志。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-07-09 eCollection Date: 2025-03-01 DOI: 10.1515/jib-2025-0034
Ralf Hofestädt
{"title":"Editorial - 20 years Journal of Integrative Bioinformatics.","authors":"Ralf Hofestädt","doi":"10.1515/jib-2025-0034","DOIUrl":"10.1515/jib-2025-0034","url":null,"abstract":"","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12327197/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sustainable software development in science - insights from 20 years of Vanted. 科学中的可持续软件开发-来自20 年Vanted的见解。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-07-01 eCollection Date: 2025-03-01 DOI: 10.1515/jib-2025-0007
Falk Schreiber, Tobias Czauderna, Dimitar Garkov, Niklas Gröne, Karsten Klein, Matthias Lange, Uwe Scholz, Björn Sommer

Sustainable software development requires the software to remain accessible and maintainable over long time. This is particularly challenging in a scientific context. For example, fewer than one third of tools and platforms for biological network representation, analysis, and visualisation have been available and supported over a period of 15 years. One of those tools is Vanted, which has been developed and actively supported over the past 20 years. In this work, we discuss sustainable software development in science and investigate which software tools for biological network representation, analysis, and visualisation are maintained over a period of at least 15 years. With Vanted as a case study, we highlight five key insights that we consider crucial for sustainable, long-term software development and software maintenance in science.

可持续的软件开发要求软件在很长一段时间内保持可访问性和可维护性。这在科学背景下尤其具有挑战性。例如,在过去的15年里,只有不到三分之一的生物网络表示、分析和可视化工具和平台是可用的,并且得到了支持。其中一个工具是Vanted,它在过去20年里得到了开发和积极支持。在这项工作中,我们讨论了科学中的可持续软件开发,并调查了哪些用于生物网络表示、分析和可视化的软件工具在至少15年的时间内得到了维护。以Vanted为例,我们强调了我们认为对科学中可持续的、长期的软件开发和软件维护至关重要的五个关键见解。
{"title":"Sustainable software development in science - insights from 20 years of Vanted.","authors":"Falk Schreiber, Tobias Czauderna, Dimitar Garkov, Niklas Gröne, Karsten Klein, Matthias Lange, Uwe Scholz, Björn Sommer","doi":"10.1515/jib-2025-0007","DOIUrl":"10.1515/jib-2025-0007","url":null,"abstract":"<p><p>Sustainable software development requires the software to remain accessible and maintainable over long time. This is particularly challenging in a scientific context. For example, fewer than one third of tools and platforms for biological network representation, analysis, and visualisation have been available and supported over a period of 15 years. One of those tools is Vanted, which has been developed and actively supported over the past 20 years. In this work, we discuss sustainable software development in science and investigate which software tools for biological network representation, analysis, and visualisation are maintained over a period of at least 15 years. With Vanted as a case study, we highlight five key insights that we consider crucial for sustainable, long-term software development and software maintenance in science.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12327203/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144531041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metagenome and metabolome study on inhaled corticosteroids in asthma patients with side effects. 哮喘患者吸入皮质类固醇副作用的宏基因组和代谢组研究。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-24 DOI: 10.1515/jib-2024-0062
Igor Goryanin, Anatoly Sorokin, Meder Seitov, Berik Emilov, Muktarbek Iskakov, Irina Goryanin, Batyr Osmonov

This study investigates the gut microbiome and metabolome of asthma patients treated with inhaled corticosteroids (ICS), some of whom experience adverse side effects. We analyzed stool samples from 24 participants, divided into three cohorts: asthma patients with side effects, those without, and healthy controls. Using next-generation sequencing and LC-MS/MS metabolomics, we identified significant differences in bacterial species and metabolites. Multi-Omics Factor Analysis (MOFA) and Global Sensitivity Analysis-Partial Rank Correlation Coefficient (GSA-PRCC) provided insights into key contributors to side effects, such as tryptophan depletion and altered linolenate and glucose-1-phosphate levels. The study proposes dietary or probiotic interventions to mitigate side effects. Despite the limited sample size, these findings provide a basis for personalized asthma management approaches. Further studies are required to confirm initial fundings.

本研究调查了吸入皮质类固醇(ICS)治疗的哮喘患者的肠道微生物组和代谢组,其中一些患者出现了不良副作用。我们分析了24名参与者的粪便样本,将其分为三组:有副作用的哮喘患者、没有副作用的哮喘患者和健康对照组。通过下一代测序和LC-MS/MS代谢组学,我们发现了细菌种类和代谢物的显著差异。多组学因素分析(MOFA)和全局敏感性分析-部分秩相关系数(GSA-PRCC)提供了对副作用的关键影响因素的见解,例如色氨酸消耗和亚麻酸和葡萄糖-1-磷酸水平的改变。该研究建议通过饮食或益生菌干预来减轻副作用。尽管样本量有限,但这些发现为个性化哮喘管理方法提供了基础。需要进一步的研究来确认初始资金。
{"title":"Metagenome and metabolome study on inhaled corticosteroids in asthma patients with side effects.","authors":"Igor Goryanin, Anatoly Sorokin, Meder Seitov, Berik Emilov, Muktarbek Iskakov, Irina Goryanin, Batyr Osmonov","doi":"10.1515/jib-2024-0062","DOIUrl":"https://doi.org/10.1515/jib-2024-0062","url":null,"abstract":"<p><p>This study investigates the gut microbiome and metabolome of asthma patients treated with inhaled corticosteroids (ICS), some of whom experience adverse side effects. We analyzed stool samples from 24 participants, divided into three cohorts: asthma patients with side effects, those without, and healthy controls. Using next-generation sequencing and LC-MS/MS metabolomics, we identified significant differences in bacterial species and metabolites. Multi-Omics Factor Analysis (MOFA) and Global Sensitivity Analysis-Partial Rank Correlation Coefficient (GSA-PRCC) provided insights into key contributors to side effects, such as tryptophan depletion and altered linolenate and glucose-1-phosphate levels. The study proposes dietary or probiotic interventions to mitigate side effects. Despite the limited sample size, these findings provide a basis for personalized asthma management approaches. Further studies are required to confirm initial fundings.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144477807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging transformers for semi-supervised pathogenicity prediction with soft labels. 利用变压器进行软标签的半监督致病性预测。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-23 eCollection Date: 2025-06-01 DOI: 10.1515/jib-2024-0047
Pablo Enrique Guillem, Marco Zurdo-Tabernero, Noelia Egido Iglesias, Ángel Canal-Alonso, Liliana Durón Figueroa, Guillermo Hernández, Angélica González-Arrieta, Fernando de la Prieta

The rapid advancement of Next-Generation Sequencing (NGS) technologies has revolutionized the field of genomics, producing large volumes of data that necessitate sophisticated analytical techniques. This paper introduces a Deep Learning model designed to predict the pathogenicity of genetic variants, a vital component in advancing personalized medicine. The model is trained on a dataset derived from the analysis of NGS outputs, containing a combination of well-defined and ambiguous genetic variants. By employing a semi-supervised learning approach, the model efficiently utilizes both confidently labeled and less certain data. At the core of the methodology is the Feature Tokenizer Transformer architecture, which processes both numerical and categorical genomic information. The preprocessing pipeline includes key steps such as data imputation, scaling, and encoding to ensure high data quality. The results highlight the model's impressive accuracy, particularly in detecting confidently labeled variants, while also addressing the impact of its predictions on less certain (soft-labeled) data.

新一代测序(NGS)技术的快速发展彻底改变了基因组学领域,产生了大量数据,需要复杂的分析技术。本文介绍了一个深度学习模型,旨在预测遗传变异的致病性,这是推进个性化医疗的重要组成部分。该模型在NGS输出分析得出的数据集上进行训练,该数据集包含定义良好和模糊的遗传变异的组合。通过采用半监督学习方法,该模型有效地利用了自信标记和不太确定的数据。该方法的核心是Feature Tokenizer Transformer架构,它处理数值和分类基因组信息。预处理流程包括数据输入、缩放和编码等关键步骤,以确保高数据质量。结果突出了该模型令人印象深刻的准确性,特别是在检测自信标记的变体时,同时也解决了其预测对不太确定(软标记)数据的影响。
{"title":"Leveraging transformers for semi-supervised pathogenicity prediction with soft labels.","authors":"Pablo Enrique Guillem, Marco Zurdo-Tabernero, Noelia Egido Iglesias, Ángel Canal-Alonso, Liliana Durón Figueroa, Guillermo Hernández, Angélica González-Arrieta, Fernando de la Prieta","doi":"10.1515/jib-2024-0047","DOIUrl":"10.1515/jib-2024-0047","url":null,"abstract":"<p><p>The rapid advancement of Next-Generation Sequencing (NGS) technologies has revolutionized the field of genomics, producing large volumes of data that necessitate sophisticated analytical techniques. This paper introduces a Deep Learning model designed to predict the pathogenicity of genetic variants, a vital component in advancing personalized medicine. The model is trained on a dataset derived from the analysis of NGS outputs, containing a combination of well-defined and ambiguous genetic variants. By employing a semi-supervised learning approach, the model efficiently utilizes both confidently labeled and less certain data. At the core of the methodology is the Feature Tokenizer Transformer architecture, which processes both numerical and categorical genomic information. The preprocessing pipeline includes key steps such as data imputation, scaling, and encoding to ensure high data quality. The results highlight the model's impressive accuracy, particularly in detecting confidently labeled variants, while also addressing the impact of its predictions on less certain (soft-labeled) data.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569578/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144334418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Petri net modeling and simulation of post-transcriptional regulatory networks of human embryonic stem cell (hESC) differentiation to cardiomyocytes. 人胚胎干细胞(hESC)向心肌细胞分化的转录后调控网络的Petri网建模和模拟。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-23 eCollection Date: 2025-03-01 DOI: 10.1515/jib-2024-0037
Aruana F F Hansel-Fröse, Christoph Brinkrolf, Marcel Friedrichs, Bruno Dallagiovanna, Lucia Spangenberg

Stem cells are capable of self-renewal and differentiation into various cell types, showing significant potential for cellular therapies and regenerative medicine, particularly in cardiovascular diseases. The differentiation to cardiomyocytes replicates the embryonic heart development, potentially supporting cardiac regeneration. Cardiomyogenesis is controlled by complex post-transcriptional regulation that affects the construction of gene regulatory networks (GRNs), such as: alternative polyadenylation (APA), length changes in untranslated regulatory regions (3'UTRs), and microRNA (miRNA) regulation. To deepen our understanding of the cardiomyogenesis process, we have modeled a GRN for each day of cardiomyocyte differentiation. Then, each GRN was automatically transformed by four transformation rules to a Petri net and simulated using the software VANESA. The Petri nets highlighted the relationship between genes and alternative isoforms, emphasizing the inhibition of miRNA on APA isoforms with varying 3'UTR lengths. Moreover, in silico simulation of miRNA knockout enabled the visualization of the consequential effects on isoform expression. Our Petri net models provide a resourceful tool and holistic perspective to investigate the functional orchestra of transcript regulation that differentiate hESCs to cardiomyocytes. Additionally, the models can be adapted to investigate post-transcriptional GRN in other biological contexts.

干细胞能够自我更新并分化成各种细胞类型,在细胞治疗和再生医学方面显示出巨大的潜力,特别是在心血管疾病方面。向心肌细胞的分化复制了胚胎心脏的发育,可能支持心脏再生。心肌发生受复杂的转录后调控控制,影响基因调控网络(grn)的构建,如:选择性聚腺苷化(APA)、非翻译调控区域(3'UTRs)的长度变化和microRNA (miRNA)调控。为了加深我们对心肌形成过程的理解,我们为心肌细胞分化的每一天建立了一个GRN模型。然后,通过4条变换规则将每个GRN自动变换为Petri网,并利用VANESA软件进行仿真。Petri网强调了基因与备选亚型之间的关系,强调了miRNA对不同3'UTR长度的APA亚型的抑制作用。此外,miRNA敲除的计算机模拟能够可视化对异构体表达的相应影响。我们的Petri网模型提供了一个丰富的工具和整体的视角来研究将hESCs分化为心肌细胞的转录调控的功能组合。此外,该模型可用于研究其他生物学背景下的转录后GRN。
{"title":"Petri net modeling and simulation of post-transcriptional regulatory networks of human embryonic stem cell (hESC) differentiation to cardiomyocytes.","authors":"Aruana F F Hansel-Fröse, Christoph Brinkrolf, Marcel Friedrichs, Bruno Dallagiovanna, Lucia Spangenberg","doi":"10.1515/jib-2024-0037","DOIUrl":"10.1515/jib-2024-0037","url":null,"abstract":"<p><p>Stem cells are capable of self-renewal and differentiation into various cell types, showing significant potential for cellular therapies and regenerative medicine, particularly in cardiovascular diseases. The differentiation to cardiomyocytes replicates the embryonic heart development, potentially supporting cardiac regeneration. Cardiomyogenesis is controlled by complex post-transcriptional regulation that affects the construction of gene regulatory networks (GRNs), such as: alternative polyadenylation (APA), length changes in untranslated regulatory regions (3'UTRs), and microRNA (miRNA) regulation. To deepen our understanding of the cardiomyogenesis process, we have modeled a GRN for each day of cardiomyocyte differentiation. Then, each GRN was automatically transformed by four transformation rules to a Petri net and simulated using the software VANESA. The Petri nets highlighted the relationship between genes and alternative isoforms, emphasizing the inhibition of miRNA on APA isoforms with varying 3'UTR lengths. Moreover, <i>in silico</i> simulation of miRNA knockout enabled the visualization of the consequential effects on isoform expression. Our Petri net models provide a resourceful tool and holistic perspective to investigate the functional orchestra of transcript regulation that differentiate hESCs to cardiomyocytes. Additionally, the models can be adapted to investigate post-transcriptional GRN in other biological contexts.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12327202/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144334419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating AI and genomics: predictive CNN models for schizophrenia phenotypes. 整合人工智能和基因组学:预测精神分裂症表型的CNN模型。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-18 eCollection Date: 2025-06-01 DOI: 10.1515/jib-2024-0057
Guilherme Henriques, Maryam Abbasi, Daniel Martins, Joel P Arrais

This study explores the use of deep learning to analyze genetic data and predict phenotypic traits associated with schizophrenia, a complex psychiatric disorder with a strong hereditary component yet incomplete genetic characterization. We applied Convolutional Neural Networks models to a large-scale case-control exome sequencing dataset from the Swedish population to identify genetic patterns linked to schizophrenia. To enhance model performance and reduce overfitting, we employed advanced optimization techniques, including dropout layers, learning rate scheduling, batch normalization, and early stopping. Following systematic refinements in data preprocessing, model architecture, and hyperparameter tuning, the final model achieved an accuracy of 80 %. These results demonstrate the potential of deep learning approaches to uncover intricate genotype-phenotype relationships and support their future integration into precision medicine and genetic diagnostics for psychiatric disorders such as schizophrenia.

本研究探索了使用深度学习来分析遗传数据并预测与精神分裂症相关的表型特征,精神分裂症是一种复杂的精神疾病,具有强烈的遗传成分,但遗传特征不完整。我们将卷积神经网络模型应用于来自瑞典人群的大规模病例对照外显子组测序数据集,以确定与精神分裂症相关的遗传模式。为了提高模型性能并减少过拟合,我们采用了先进的优化技术,包括辍学层、学习率调度、批处理归一化和早期停止。经过数据预处理、模型架构和超参数调优的系统改进,最终模型的精度达到了80% %。这些结果证明了深度学习方法在揭示复杂的基因型-表型关系方面的潜力,并支持它们未来整合到精神分裂症等精神疾病的精准医学和基因诊断中。
{"title":"Integrating AI and genomics: predictive CNN models for schizophrenia phenotypes.","authors":"Guilherme Henriques, Maryam Abbasi, Daniel Martins, Joel P Arrais","doi":"10.1515/jib-2024-0057","DOIUrl":"10.1515/jib-2024-0057","url":null,"abstract":"<p><p>This study explores the use of deep learning to analyze genetic data and predict phenotypic traits associated with schizophrenia, a complex psychiatric disorder with a strong hereditary component yet incomplete genetic characterization. We applied Convolutional Neural Networks models to a large-scale case-control exome sequencing dataset from the Swedish population to identify genetic patterns linked to schizophrenia. To enhance model performance and reduce overfitting, we employed advanced optimization techniques, including dropout layers, learning rate scheduling, batch normalization, and early stopping. Following systematic refinements in data preprocessing, model architecture, and hyperparameter tuning, the final model achieved an accuracy of 80 %. These results demonstrate the potential of deep learning approaches to uncover intricate genotype-phenotype relationships and support their future integration into precision medicine and genetic diagnostics for psychiatric disorders such as schizophrenia.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569582/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144310814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated mitosis detection in stained histopathological images using Faster R-CNN and stain techniques. 使用Faster R-CNN和染色技术在染色的组织病理学图像中自动检测有丝分裂。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-11 eCollection Date: 2025-06-01 DOI: 10.1515/jib-2024-0049
Jesús García-Salmerón, José Manuel García, Gregorio Bernabé, Pilar González-Férez

Accurate mitosis detection is essential for cancer diagnosis and treatment. Traditional manual counting by pathologists is time-consuming and may cause errors. This research investigates automated mitosis detection in stained histopathological images using Deep Learning (DL) techniques, particularly object detection models. We propose a two-stage object detection model based on Faster R-CNN to effectively detect mitosis within histopathological images. The stain augmentation and normalization techniques are also applied to address the significant challenge of domain shift in histopathological image analysis. The experiments are conducted using the MIDOG++ dataset, the most recent dataset from the MIDOG challenge. This research builds on our previous work, in which two one-stage frameworks, in particular on RetinaNet using fastai and PyTorch, are proposed. Our results indicate favorable F1-scores across various scenarios and tumor types, demonstrating the effectiveness of the object detection models. In addition, Faster R-CNN with stain techniques provides the most accurate and reliable mitosis detection, while RetinaNet models exhibit faster performance. Our results highlight the importance of handling domain shifts and the number of mitotic figures for robust diagnostic tools.

准确的有丝分裂检测对癌症的诊断和治疗至关重要。病理学家传统的手工计数既耗时又可能导致错误。本研究利用深度学习(DL)技术,特别是对象检测模型,研究染色组织病理学图像中有丝分裂的自动检测。我们提出了一种基于Faster R-CNN的两阶段目标检测模型,以有效检测组织病理图像中的有丝分裂。染色增强和归一化技术也被应用于解决组织病理图像分析领域转移的重大挑战。实验使用MIDOG++数据集进行,这是MIDOG挑战的最新数据集。本研究建立在我们之前的工作基础上,其中提出了两个单阶段框架,特别是使用fastai和PyTorch的RetinaNet框架。我们的研究结果表明,在各种场景和肿瘤类型中都有良好的f1得分,证明了目标检测模型的有效性。此外,更快的R-CNN与染色技术提供了最准确和可靠的有丝分裂检测,而RetinaNet模型表现出更快的性能。我们的研究结果强调了处理区域转移和有丝分裂图的数量对于稳健诊断工具的重要性。
{"title":"Automated mitosis detection in stained histopathological images using Faster R-CNN and stain techniques.","authors":"Jesús García-Salmerón, José Manuel García, Gregorio Bernabé, Pilar González-Férez","doi":"10.1515/jib-2024-0049","DOIUrl":"10.1515/jib-2024-0049","url":null,"abstract":"<p><p>Accurate mitosis detection is essential for cancer diagnosis and treatment. Traditional manual counting by pathologists is time-consuming and may cause errors. This research investigates automated mitosis detection in stained histopathological images using Deep Learning (DL) techniques, particularly object detection models. We propose a two-stage object detection model based on Faster R-CNN to effectively detect mitosis within histopathological images. The stain augmentation and normalization techniques are also applied to address the significant challenge of domain shift in histopathological image analysis. The experiments are conducted using the MIDOG++ dataset, the most recent dataset from the MIDOG challenge. This research builds on our previous work, in which two one-stage frameworks, in particular on RetinaNet using fastai and PyTorch, are proposed. Our results indicate favorable F1-scores across various scenarios and tumor types, demonstrating the effectiveness of the object detection models. In addition, Faster R-CNN with stain techniques provides the most accurate and reliable mitosis detection, while RetinaNet models exhibit faster performance. Our results highlight the importance of handling domain shifts and the number of mitotic figures for robust diagnostic tools.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569583/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144259406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting DDI-induced pregnancy and neonatal ADRs using sparse PCA and stacking ensemble approach. 稀疏PCA和叠加集合法预测ddi诱导妊娠和新生儿不良反应。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-10 eCollection Date: 2025-06-01 DOI: 10.1515/jib-2024-0056
Anushka Chaurasia, Deepak Kumar, Yogita

Predicting Drug-Drug interaction (DDI)-induced adverse drug reactions (ADRs) using computational methods is challenging due to the availability of limited data samples, data sparsity, and high dimensionality. The issue of class imbalance further increases the intricacy of prediction. Different computational techniques have been presented for predicting DDI-induced ADRs in the general population; however, the area of DDI-induced pregnancy and neonatal ADRs has been underexplored. In the present work, a sparse ensemble-based computational approach is proposed that leverages SMILES strings as features, addresses high-dimensional and sparse data using Sparse Principal Component Analysis (SPCA), mitigates class imbalance with the Multilabel Synthetic Minority Oversampling Technique (MLSMOTE), and predicts pregnancy and neonatal ADRs through a stacking ensemble model. The SPCA has been evaluated for handling sparse data and shown 2.67 %-5.45 % improvement compared to PCA. The proposed stacking ensemble model has outperformed six state-of-the-art predictors regarding micro and macro scores for True Positive Rate (TPR), F1 Score, False Positive Rate (FPR), Precision, Hamming Loss, and ROC-AUC Score with 1.16 %-14.94 %.

由于数据样本有限,数据稀疏性和高维性,使用计算方法预测药物-药物相互作用(DDI)诱导的药物不良反应(adr)具有挑战性。类不平衡的问题进一步增加了预测的复杂性。已经提出了不同的计算技术来预测ddi在一般人群中引起的不良反应;然而,ddi诱导妊娠和新生儿不良反应的研究尚未充分。在本工作中,提出了一种基于稀疏集成的计算方法,该方法利用SMILES字符串作为特征,使用稀疏主成分分析(SPCA)处理高维和稀疏数据,使用多标签合成少数过采样技术(MLSMOTE)减轻类失衡,并通过堆叠集成模型预测妊娠和新生儿adr。SPCA在处理稀疏数据方面进行了评估,与PCA相比显示出2.67 %-5.45 %的改进。所提出的叠加集成模型在真阳性率(TPR)、F1分数、假阳性率(FPR)、精度、汉明损失和ROC-AUC分数的微观和宏观得分方面优于六个最先进的预测指标,得分为1.16 %-14.94 %。
{"title":"Predicting DDI-induced pregnancy and neonatal ADRs using sparse PCA and stacking ensemble approach.","authors":"Anushka Chaurasia, Deepak Kumar, Yogita","doi":"10.1515/jib-2024-0056","DOIUrl":"10.1515/jib-2024-0056","url":null,"abstract":"<p><p>Predicting Drug-Drug interaction (DDI)-induced adverse drug reactions (ADRs) using computational methods is challenging due to the availability of limited data samples, data sparsity, and high dimensionality. The issue of class imbalance further increases the intricacy of prediction. Different computational techniques have been presented for predicting DDI-induced ADRs in the general population; however, the area of DDI-induced pregnancy and neonatal ADRs has been underexplored. In the present work, a sparse ensemble-based computational approach is proposed that leverages SMILES strings as features, addresses high-dimensional and sparse data using Sparse Principal Component Analysis (SPCA), mitigates class imbalance with the Multilabel Synthetic Minority Oversampling Technique (MLSMOTE), and predicts pregnancy and neonatal ADRs through a stacking ensemble model. The SPCA has been evaluated for handling sparse data and shown 2.67 %-5.45 % improvement compared to PCA. The proposed stacking ensemble model has outperformed six state-of-the-art predictors regarding micro and macro scores for True Positive Rate (<i>TPR</i>), F1 Score, False Positive Rate (<i>FPR</i>), Precision, Hamming Loss, and ROC-AUC Score with 1.16 %-14.94 %.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569586/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144250785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A ViTUNeT-based model using YOLOv8 for efficient LVNC diagnosis and automatic cleaning of dataset. 一个基于vitunet的模型,使用YOLOv8进行高效的LVNC诊断和数据集的自动清理。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-04 eCollection Date: 2025-06-01 DOI: 10.1515/jib-2024-0048
Salvador de Haro, Gregorio Bernabé, José Manuel García, Pilar González-Férez

Left ventricular non-compaction is a cardiac condition marked by excessive trabeculae in the left ventricle's inner wall. Although various methods exist to measure these structures, the medical community still lacks consensus on the best approach. Previously, we developed DL-LVTQ, a tool based on a UNet neural network, to quantify trabeculae in this region. In this study, we expand the dataset to include new patients with Titin cardiomyopathy and healthy individuals with fewer trabeculae, requiring retraining of our models to enhance predictions. We also propose ViTUNeT, a neural network architecture combining U-Net and Vision Transformers to segment the left ventricle more accurately. Additionally, we train a YOLOv8 model to detect the ventricle and integrate it with ViTUNeT model to focus on the region of interest. Results from ViTUNet and YOLOv8 are similar to DL-LVTQ, suggesting dataset quality limits further accuracy improvements. To test this, we analyze MRI images and develop a method using two YOLOv8 models to identify and remove problematic images, leading to better results. Combining YOLOv8 with deep learning networks offers a promising approach for improving cardiac image analysis and segmentation.

左心室不压实是一种以左心室内壁小梁过多为特征的心脏疾病。尽管存在各种测量这些结构的方法,但医学界对最佳方法仍缺乏共识。此前,我们开发了基于UNet神经网络的DL-LVTQ工具,用于量化该区域的小梁。在这项研究中,我们扩展了数据集,包括新的Titin心肌病患者和小梁较少的健康个体,需要对我们的模型进行再训练以增强预测。我们还提出了ViTUNeT,一种结合U-Net和Vision transformer的神经网络架构,以更准确地分割左心室。此外,我们训练了一个YOLOv8模型来检测心室,并将其与ViTUNeT模型相结合,聚焦于感兴趣的区域。来自ViTUNet和YOLOv8的结果与DL-LVTQ相似,表明数据集质量限制了进一步的准确性提高。为了验证这一点,我们分析了MRI图像,并开发了一种方法,使用两个YOLOv8模型来识别和去除有问题的图像,从而获得更好的结果。将YOLOv8与深度学习网络相结合,为改善心脏图像分析和分割提供了有前途的方法。
{"title":"A ViTUNeT-based model using YOLOv8 for efficient LVNC diagnosis and automatic cleaning of dataset.","authors":"Salvador de Haro, Gregorio Bernabé, José Manuel García, Pilar González-Férez","doi":"10.1515/jib-2024-0048","DOIUrl":"10.1515/jib-2024-0048","url":null,"abstract":"<p><p>Left ventricular non-compaction is a cardiac condition marked by excessive trabeculae in the left ventricle's inner wall. Although various methods exist to measure these structures, the medical community still lacks consensus on the best approach. Previously, we developed DL-LVTQ, a tool based on a UNet neural network, to quantify trabeculae in this region. In this study, we expand the dataset to include new patients with Titin cardiomyopathy and healthy individuals with fewer trabeculae, requiring retraining of our models to enhance predictions. We also propose ViTUNeT, a neural network architecture combining U-Net and Vision Transformers to segment the left ventricle more accurately. Additionally, we train a YOLOv8 model to detect the ventricle and integrate it with ViTUNeT model to focus on the region of interest. Results from ViTUNet and YOLOv8 are similar to DL-LVTQ, suggesting dataset quality limits further accuracy improvements. To test this, we analyze MRI images and develop a method using two YOLOv8 models to identify and remove problematic images, leading to better results. Combining YOLOv8 with deep learning networks offers a promising approach for improving cardiac image analysis and segmentation.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569573/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144217516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Integrative Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1