首页 > 最新文献

Computer methods and programs in biomedicine最新文献

英文 中文
Harnessing angular geometry in deep learning for protein–ligand binding affinity prediction 利用深度学习中的角度几何来预测蛋白质与配体的结合亲和力
IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-05-01 Epub Date: 2026-02-12 DOI: 10.1016/j.cmpb.2026.109282
Julia Rahman , M.A. Hakim Newton , Jiffriya Mohamed Abdul Cader , Md Khaled Ben Islam , Mohammed Eunus Ali , Abdul Sattar

Background:

Protein–ligand binding affinity prediction is essential in structure-based drug design, where binding scores guide the selection of promising candidate ligands. Existing deep learning models often use 3D grids, voxelized complexes, or molecular graphs. These representations are resource-intensive and may not capture specific directional interactions.

Objective:

This paper introduces angular geometric features as key descriptors of binding interactions.

Methods:

Seven types of dihedral angles between protein and ligand atoms are extracted to encode orientation and geometry. A fully connected ensemble network, called the Angle-Aware Predictor (AAP), integrates these features.

Results:

On CASF-2016, AAP achieves state-of-the-art results with correlation coefficient (R) of 0.872, root mean squared error (RMSE) of 1.072, mean absolute error (MAE) 0.817, standard deviation (SD) of 1.077, and concordance index (CI) of 0.845. On four additional benchmarks, AAP shows consistent improvements ranging from 0.3% to 36%.

Conclusion:

The angular features are effective, lightweight, and robust descriptors for binding affinity prediction. These results highlight angular geometry as a valuable direction for future structure-based drug discovery. The program and data of AAP are publicly available at https://github.com/juliacse06/AAP.
背景:蛋白质-配体结合亲和力预测在基于结构的药物设计中是必不可少的,其中结合评分指导有希望的候选配体的选择。现有的深度学习模型通常使用3D网格、体素化复合体或分子图。这些表示是资源密集型的,可能无法捕获特定的定向交互。目的:引入角几何特征作为结合相互作用的关键描述符。方法:提取蛋白质与配体原子之间的7种二面角,编码蛋白质的取向和几何结构。一个完全连接的集成网络,称为角度感知预测器(AAP),集成了这些功能。结果:在CASF-2016上,AAP获得了最优结果,相关系数(R)为0.872,均方根误差(RMSE)为1.072,平均绝对误差(MAE)为0.817,标准差(SD)为1.077,一致性指数(CI)为0.845。在另外四个基准测试中,AAP表现出了0.3%至36%的持续改善。结论:角度特征是有效的、轻量级的、鲁棒的绑定亲和预测描述符。这些结果突出了角几何作为未来基于结构的药物发现的一个有价值的方向。AAP的程序和数据可在https://github.com/juliacse06/AAP上公开获取。
{"title":"Harnessing angular geometry in deep learning for protein–ligand binding affinity prediction","authors":"Julia Rahman ,&nbsp;M.A. Hakim Newton ,&nbsp;Jiffriya Mohamed Abdul Cader ,&nbsp;Md Khaled Ben Islam ,&nbsp;Mohammed Eunus Ali ,&nbsp;Abdul Sattar","doi":"10.1016/j.cmpb.2026.109282","DOIUrl":"10.1016/j.cmpb.2026.109282","url":null,"abstract":"<div><h3>Background:</h3><div>Protein–ligand binding affinity prediction is essential in structure-based drug design, where binding scores guide the selection of promising candidate ligands. Existing deep learning models often use 3D grids, voxelized complexes, or molecular graphs. These representations are resource-intensive and may not capture specific directional interactions.</div></div><div><h3>Objective:</h3><div>This paper introduces angular geometric features as key descriptors of binding interactions.</div></div><div><h3>Methods:</h3><div>Seven types of dihedral angles between protein and ligand atoms are extracted to encode orientation and geometry. A fully connected ensemble network, called the Angle-Aware Predictor (AAP), integrates these features.</div></div><div><h3>Results:</h3><div>On CASF-2016, AAP achieves state-of-the-art results with correlation coefficient (R) of 0.872, root mean squared error (RMSE) of 1.072, mean absolute error (MAE) 0.817, standard deviation (SD) of 1.077, and concordance index (CI) of 0.845. On four additional benchmarks, AAP shows consistent improvements ranging from 0.3% to 36%.</div></div><div><h3>Conclusion:</h3><div>The angular features are effective, lightweight, and robust descriptors for binding affinity prediction. These results highlight angular geometry as a valuable direction for future structure-based drug discovery. The program and data of AAP are publicly available at <span><span>https://github.com/juliacse06/AAP</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"278 ","pages":"Article 109282"},"PeriodicalIF":4.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146186281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correlative analysis between ocular surface features and carotid plaque : A multimodal machine learning framework 眼表特征与颈动脉斑块的相关性分析:一个多模态机器学习框架
IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-05-01 Epub Date: 2026-01-29 DOI: 10.1016/j.cmpb.2026.109265
Shichen Zhang , Dinghan Hu , Le Luo , Jiuwen Cao

Background and Objective:

The diagnosis of carotid plaques plays an important role in revealing cardiovascular and cerebrovascular diseases, thus attracting widespread research attention. However, most medical examinations rely heavily on specialists and carotid ultrasound images, which are time-consuming, radiative, expensive and limited in tracking disease progression. To alleviate these deficiency, inspired by the human blood supply sequence, a detailed study on the association between carotid plaque and ocular surface image features is proposed in the paper.

Methods:

This paper systematically verifies the correlation between carotid plaque and ocular surface image through a multi-dimensional feature analysis approach incorporating texture, frequency domain features, and color characteristics. The analysis combines feature selection, confidence evaluation, and distribution property studies to establish robust associations. Besides, multiple machine learning classifiers are used to evaluate the robustness of the extracted features, with subgroup validation conducted across different subsets, systematically assessing the influence of age and gender factors.

Results:

The proposed method achieves high prediction accuracy on 8875 individuals from Hangzhou Wuyunshan Hospital (Hangzhou Institute for Health Promotion), with electronic health record (EHR) features showing the strongest association (Odds Ratios [ORs]: 4.35 [3.90-4.86] in males; 2.92 [2.60-3.27] in females). Experimental results demonstrate that age, male gender, and ocular surface image features – including EHR, local binary patterns (LBP), gray-level gradient co-occurrence matrix (GLGCM), and gray-level co-occurrence matrix (GLCM) – show strong associations with carotid plaque, where LBP and EHR features are selected most frequently.

Conclusions:

Ocular surface image analysis offers a practical and non-invasive method for carotid plaque screening. The observed feature associations and strong predictive performance highlight its potential for clinical applications, especially in large-scale population screening.
背景与目的:颈动脉斑块的诊断在揭示心脑血管疾病中起着重要的作用,引起了广泛的研究关注。然而,大多数医学检查严重依赖于专家和颈动脉超声图像,这是耗时的,辐射的,昂贵的,并且在追踪疾病进展方面受到限制。为了缓解这些不足,受人体血液供应顺序的启发,本文提出对颈动脉斑块与眼表图像特征之间的关系进行详细的研究。方法:通过结合纹理特征、频域特征和颜色特征的多维特征分析方法,系统验证颈动脉斑块与眼表图像的相关性。分析结合了特征选择、置信度评估和分布属性研究,以建立稳健的关联。此外,使用多个机器学习分类器来评估提取的特征的鲁棒性,并在不同的子集上进行子组验证,系统地评估年龄和性别因素的影响。结果:本文提出的方法对杭州市武云山医院(杭州市健康促进研究所)8875例个体的预测准确率较高,其中电子病历(electronic Health record, EHR)特征的相关性最强(比值比男性为4.35[3.90-4.86],女性为2.92[2.60-3.27])。实验结果表明,年龄、男性和眼表图像特征(包括EHR、局部二值模式(LBP)、灰度梯度共现矩阵(GLGCM)和灰度共现矩阵(GLCM))与颈动脉斑块有很强的相关性,其中LBP和EHR特征被选择的频率最高。结论:眼表图像分析为颈动脉斑块筛查提供了一种实用、无创的方法。观察到的特征关联和强大的预测性能突出了其临床应用潜力,特别是在大规模人群筛查中。
{"title":"Correlative analysis between ocular surface features and carotid plaque : A multimodal machine learning framework","authors":"Shichen Zhang ,&nbsp;Dinghan Hu ,&nbsp;Le Luo ,&nbsp;Jiuwen Cao","doi":"10.1016/j.cmpb.2026.109265","DOIUrl":"10.1016/j.cmpb.2026.109265","url":null,"abstract":"<div><h3>Background and Objective:</h3><div>The diagnosis of carotid plaques plays an important role in revealing cardiovascular and cerebrovascular diseases, thus attracting widespread research attention. However, most medical examinations rely heavily on specialists and carotid ultrasound images, which are time-consuming, radiative, expensive and limited in tracking disease progression. To alleviate these deficiency, inspired by the human blood supply sequence, a detailed study on the association between carotid plaque and ocular surface image features is proposed in the paper.</div></div><div><h3>Methods:</h3><div>This paper systematically verifies the correlation between carotid plaque and ocular surface image through a multi-dimensional feature analysis approach incorporating texture, frequency domain features, and color characteristics. The analysis combines feature selection, confidence evaluation, and distribution property studies to establish robust associations. Besides, multiple machine learning classifiers are used to evaluate the robustness of the extracted features, with subgroup validation conducted across different subsets, systematically assessing the influence of age and gender factors.</div></div><div><h3>Results:</h3><div>The proposed method achieves high prediction accuracy on 8875 individuals from Hangzhou Wuyunshan Hospital (Hangzhou Institute for Health Promotion), with electronic health record (EHR) features showing the strongest association (Odds Ratios [ORs]: 4.35 [3.90-4.86] in males; 2.92 [2.60-3.27] in females). Experimental results demonstrate that age, male gender, and ocular surface image features – including EHR, local binary patterns (LBP), gray-level gradient co-occurrence matrix (GLGCM), and gray-level co-occurrence matrix (GLCM) – show strong associations with carotid plaque, where LBP and EHR features are selected most frequently.</div></div><div><h3>Conclusions:</h3><div>Ocular surface image analysis offers a practical and non-invasive method for carotid plaque screening. The observed feature associations and strong predictive performance highlight its potential for clinical applications, especially in large-scale population screening.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"278 ","pages":"Article 109265"},"PeriodicalIF":4.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146077211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying neurotrophic factor related genes at the crosstalk between glioblastoma and ischemic stroke 神经营养因子相关基因在胶质母细胞瘤和缺血性脑卒中之间的相互作用
IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-05-01 Epub Date: 2026-02-09 DOI: 10.1016/j.cmpb.2026.109283
Ce Shi , Lina Ding , Dandan Wang , Jing Zhu , Xiaoli Zheng , Lansheng Zhang , Yun Zhou

Background

Glioblastoma multiforme (GBM) and ischemic stroke (IS) are two major neurological disorders contributing substantially to global mortality and disability. GBM elevates IS risk via prothrombotic mechanisms, while IS may accelerate glioma progression through ischemia-driven neuroinflammation. Identifying shared molecular mediators is essential for understanding their bidirectional pathophysiology.

Methods

A systems biology approach was implemented to investigate shared neurotrophic factor-related genes (NFRGs) between GBM and IS. A total of 2871 NFRGs were screened from Genecards, with Caspase-3 (CASP3) and Protein Arginine N-Methyltransferase 6 (PRMT6) identified as core regulators. Multi-omics validation included: 1) Differential expression profiling across The Cancer Genome Atlas (TCGA)-GBM and Gene Expression Omnibus (GEO) stroke datasets; 2) Prognostic stratification using Kaplan-Meier (KM) survival curves with log-rank test and Cox proportional hazards regression; 3) Immune microenvironment analysis via CIBERSORT; 4) Experimental validation in middle cerebral artery occlusion (MCAO) mice and GBM cell lines (U87MG, T98G, A172) using Real-Time Quantitative Reverse Transcription PCR (qRT-PCR), Western blot (WB), and immunofluorescence (IF).

Results

CASP3 and PRMT6 were significantly upregulated in both GBM and IS (P < 0.05). KM survival analysis with log-rank test showed that high expression of CASP3 and PRMT6 was strongly associated with poorer overall survival (OS) in GBM patients (P < 0.001; Hazard Ratio (HR) = 4.375, 95% Confidence Interval (CI) = 3.336–5.738 for CASP3; HR = 4.547, 95% CI = 3.429–6.029 for PRMT6). Receiver operating characteristic (ROC) analysis confirmed robust diagnostic (Area Under the Curve (AUC) > 0.7) and prognostic efficacy for both markers. IF validated their elevated expression in ischemic brain tissues of Middle Cerebral Artery Occlusion (MCAO) mice, while qRT-PCR and WB confirmed higher expression in GBM cells versus normal glial cells. Immune infiltration analysis indicated that CASP3 and PRMT6 are associated with immunosuppressive remodeling in GBM, suggesting their role as a molecular bridge between the two diseases.

Conclusions

Our findings identify CASP3 and PRMT6 as dual molecular mediators coordinating GBM progression and post-IS pathological processes. Targeting these genes may provide novel therapeutic avenues for preventing GBM-associated IS and improving neuro-oncological outcomes.
多形性胶质母细胞瘤(GBM)和缺血性脑卒中(IS)是导致全球死亡和残疾的两种主要神经系统疾病。GBM通过血栓形成机制提高IS的风险,而IS可能通过缺血驱动的神经炎症加速胶质瘤的进展。识别共享的分子介质对于理解其双向病理生理至关重要。方法应用系统生物学方法研究GBM与IS之间共享的神经营养因子相关基因(NFRGs)。从Genecards中共筛选出2871个NFRGs,其中CASP3和PRMT6被鉴定为核心调控因子。多组学验证包括:1)癌症基因组图谱(TCGA)-GBM和基因表达综合(GEO)脑卒中数据集的差异表达谱;2)使用Kaplan-Meier (KM)生存曲线进行预后分层,并结合log-rank检验和Cox比例风险回归;3)基于CIBERSORT的免疫微环境分析;4)采用实时定量反转录PCR (qRT-PCR)、Western blot (WB)和免疫荧光(IF)技术对大脑中动脉闭塞(MCAO)小鼠和GBM细胞系(U87MG、T98G、A172)进行实验验证。结果scasp3和PRMT6在GBM和IS中均显著上调(P < 0.05)。采用log-rank检验的KM生存分析显示,高表达的CASP3和PRMT6与GBM患者较差的总生存(OS)密切相关(P < 0.001; CASP3的风险比(HR) = 4.375, 95%可信区间(CI) = 3.336-5.738;PRMT6的HR = 4.547, 95% CI = 3.429-6.029)。受试者工作特征(ROC)分析证实了两种标志物的可靠诊断(曲线下面积(AUC) > 0.7)和预后疗效。IF证实了它们在大脑中动脉闭塞(MCAO)小鼠缺血脑组织中的表达升高,而qRT-PCR和WB证实了它们在GBM细胞中的表达高于正常胶质细胞。免疫浸润分析表明,CASP3和PRMT6与GBM的免疫抑制性重塑相关,提示它们是两种疾病之间的分子桥梁。结论CASP3和PRMT6是协调GBM进展和is后病理过程的双重分子介质。靶向这些基因可能为预防gbm相关IS和改善神经肿瘤预后提供新的治疗途径。
{"title":"Identifying neurotrophic factor related genes at the crosstalk between glioblastoma and ischemic stroke","authors":"Ce Shi ,&nbsp;Lina Ding ,&nbsp;Dandan Wang ,&nbsp;Jing Zhu ,&nbsp;Xiaoli Zheng ,&nbsp;Lansheng Zhang ,&nbsp;Yun Zhou","doi":"10.1016/j.cmpb.2026.109283","DOIUrl":"10.1016/j.cmpb.2026.109283","url":null,"abstract":"<div><h3>Background</h3><div>Glioblastoma multiforme (GBM) and ischemic stroke (IS) are two major neurological disorders contributing substantially to global mortality and disability. GBM elevates IS risk via prothrombotic mechanisms, while IS may accelerate glioma progression through ischemia-driven neuroinflammation. Identifying shared molecular mediators is essential for understanding their bidirectional pathophysiology.</div></div><div><h3>Methods</h3><div>A systems biology approach was implemented to investigate shared neurotrophic factor-related genes (NFRGs) between GBM and IS. A total of 2871 NFRGs were screened from Genecards, with Caspase-3 (CASP3) and Protein Arginine N-Methyltransferase 6 (PRMT6) identified as core regulators. Multi-omics validation included: 1) Differential expression profiling across The Cancer Genome Atlas (TCGA)-GBM and Gene Expression Omnibus (GEO) stroke datasets; 2) Prognostic stratification using Kaplan-Meier (KM) survival curves with log-rank test and Cox proportional hazards regression; 3) Immune microenvironment analysis via CIBERSORT; 4) Experimental validation in middle cerebral artery occlusion (MCAO) mice and GBM cell lines (U87MG, T98G, A172) using Real-Time Quantitative Reverse Transcription PCR (qRT-PCR), Western blot (WB), and immunofluorescence (IF).</div></div><div><h3>Results</h3><div>CASP3 and PRMT6 were significantly upregulated in both GBM and IS (<em>P</em> &lt; 0.05). KM survival analysis with log-rank test showed that high expression of CASP3 and PRMT6 was strongly associated with poorer overall survival (OS) in GBM patients (<em>P</em> &lt; 0.001; Hazard Ratio (HR) = 4.375, 95% Confidence Interval (CI) = 3.336–5.738 for CASP3; HR = 4.547, 95% CI = 3.429–6.029 for PRMT6). Receiver operating characteristic (ROC) analysis confirmed robust diagnostic (Area Under the Curve (AUC) &gt; 0.7) and prognostic efficacy for both markers. IF validated their elevated expression in ischemic brain tissues of Middle Cerebral Artery Occlusion (MCAO) mice, while qRT-PCR and WB confirmed higher expression in GBM cells versus normal glial cells. Immune infiltration analysis indicated that CASP3 and PRMT6 are associated with immunosuppressive remodeling in GBM, suggesting their role as a molecular bridge between the two diseases.</div></div><div><h3>Conclusions</h3><div>Our findings identify CASP3 and PRMT6 as dual molecular mediators coordinating GBM progression and post-IS pathological processes. Targeting these genes may provide novel therapeutic avenues for preventing GBM-associated IS and improving neuro-oncological outcomes.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"278 ","pages":"Article 109283"},"PeriodicalIF":4.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146186279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting optimal biomarkers in ovarian cancer cells from high-dimensional mRNA expression data using machine learning 利用机器学习从高维mRNA表达数据中检测卵巢癌细胞中的最佳生物标志物。
IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-05-01 Epub Date: 2026-01-28 DOI: 10.1016/j.cmpb.2026.109263
Rama Krishna Thelagathoti, Chao Jiang, Dinesh S. Chandel, Wesley A. Tom, Cleo Sarmiento, Gary Krzyzanowski, Appolinaire Olou, M. Rohan Fernando

Background and Objective

Reliable detection of robust biomarkers from high-dimensional transcriptomic data remains a major challenge in computational oncology. Traditional approaches often suffer from overfitting and poor generalization due to the high dimensionality of genomic data and limited sample sizes. This study aims to identify an optimal, biologically meaningful subset of mRNA biomarkers capable of distinguishing ovarian cancer samples from healthy controls using an integrated machine learning–based feature selection framework.

Methods

We analyzed mRNA expression data encompassing approximately 63,000 transcripts from ovarian cancer and control samples derived from cell lines. A hybrid feature selection pipeline combining statistical filtering, recursive elimination, and regularization was implemented under stratified cross-validation to derive stable biomarkers. Model validation was performed using Logistic Regression, Random Forest, XGBoost, and Support Vector Machine classifiers, while experimental validation was conducted through droplet digital PCR (ddPCR). Statistical analyses included ANOVA, t-tests, and pathway enrichment.

Results

The pipeline identified 80 discriminative mRNA biomarkers with exceptionally high classification performance (accuracy = 1.00, sensitivity = 1.00, specificity = 1.00 for top models). ddPCR confirmed consistent expression patterns, with significant downregulation of ADAMTS12, FN1, and ABI3BP and overexpression of EPCAM, COX6C, and TMT1B in ovarian cancer. Pathway enrichment revealed involvement in DNA repair, RNA processing, protein translation, immune regulation, and metabolic reprogramming.

Conclusions

This hybrid feature selection framework applied to patient derived cell lines, effectively reduces dimensionality, enhances biomarker reliability, and uncovers biologically interpretable mRNA signatures associated with ovarian cancer, demonstrating potential for diagnostic and therapeutic applications.
背景和目的:从高维转录组学数据中可靠地检测健壮的生物标志物仍然是计算肿瘤学的主要挑战。由于基因组数据的高维数和有限的样本量,传统方法往往存在过拟合和泛化差的问题。本研究旨在利用基于机器学习的综合特征选择框架,确定一个最佳的、具有生物学意义的mRNA生物标志物子集,该子集能够将卵巢癌样本与健康对照区分开。方法:我们分析了来自卵巢癌和来源于细胞系的对照样本的约63,000个转录本的mRNA表达数据。在分层交叉验证下,实现了统计滤波、递归消除和正则化相结合的混合特征选择管道,以获得稳定的生物标志物。模型验证采用Logistic回归、随机森林、XGBoost和支持向量机分类器进行,实验验证采用液滴数字PCR (ddPCR)进行。统计分析包括方差分析、t检验和途径富集。结果:该管道鉴定出80个鉴别性mRNA生物标志物,具有非常高的分类性能(准确率= 1.00,灵敏度= 1.00,特异性= 1.00)。ddPCR证实了一致的表达模式,卵巢癌中ADAMTS12、FN1、ABI3BP显著下调,EPCAM、COX6C、TMT1B过表达。途径富集表明参与DNA修复、RNA加工、蛋白质翻译、免疫调节和代谢重编程。结论:该杂交特征选择框架适用于患者来源的细胞系,有效地降低了维数,提高了生物标志物的可靠性,并揭示了与卵巢癌相关的生物学可解释的mRNA特征,显示了诊断和治疗应用的潜力。
{"title":"Detecting optimal biomarkers in ovarian cancer cells from high-dimensional mRNA expression data using machine learning","authors":"Rama Krishna Thelagathoti,&nbsp;Chao Jiang,&nbsp;Dinesh S. Chandel,&nbsp;Wesley A. Tom,&nbsp;Cleo Sarmiento,&nbsp;Gary Krzyzanowski,&nbsp;Appolinaire Olou,&nbsp;M. Rohan Fernando","doi":"10.1016/j.cmpb.2026.109263","DOIUrl":"10.1016/j.cmpb.2026.109263","url":null,"abstract":"<div><h3>Background and Objective</h3><div>Reliable detection of robust biomarkers from high-dimensional transcriptomic data remains a major challenge in computational oncology. Traditional approaches often suffer from overfitting and poor generalization due to the high dimensionality of genomic data and limited sample sizes. This study aims to identify an optimal, biologically meaningful subset of mRNA biomarkers capable of distinguishing ovarian cancer samples from healthy controls using an integrated machine learning–based feature selection framework.</div></div><div><h3>Methods</h3><div>We analyzed mRNA expression data encompassing approximately 63,000 transcripts from ovarian cancer and control samples derived from cell lines. A hybrid feature selection pipeline combining statistical filtering, recursive elimination, and regularization was implemented under stratified cross-validation to derive stable biomarkers. Model validation was performed using Logistic Regression, Random Forest, XGBoost, and Support Vector Machine classifiers, while experimental validation was conducted through droplet digital PCR (ddPCR). Statistical analyses included ANOVA, <em>t</em>-tests, and pathway enrichment.</div></div><div><h3>Results</h3><div>The pipeline identified 80 discriminative mRNA biomarkers with exceptionally high classification performance (accuracy = 1.00, sensitivity = 1.00, specificity = 1.00 for top models). ddPCR confirmed consistent expression patterns, with significant downregulation of ADAMTS12, FN1, and ABI3BP and overexpression of EPCAM, COX6C, and TMT1B in ovarian cancer. Pathway enrichment revealed involvement in DNA repair, RNA processing, protein translation, immune regulation, and metabolic reprogramming.</div></div><div><h3>Conclusions</h3><div>This hybrid feature selection framework applied to patient derived cell lines, effectively reduces dimensionality, enhances biomarker reliability, and uncovers biologically interpretable mRNA signatures associated with ovarian cancer, demonstrating potential for diagnostic and therapeutic applications.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"278 ","pages":"Article 109263"},"PeriodicalIF":4.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146124093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancing the vision of “reliability metadata”: From conceptual refinement to clinical validation 推进“可靠性元数据”的愿景:从概念细化到临床验证。
IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-05-01 Epub Date: 2026-02-02 DOI: 10.1016/j.cmpb.2026.109267
Zekai Yu , Weihao Cheng
{"title":"Advancing the vision of “reliability metadata”: From conceptual refinement to clinical validation","authors":"Zekai Yu ,&nbsp;Weihao Cheng","doi":"10.1016/j.cmpb.2026.109267","DOIUrl":"10.1016/j.cmpb.2026.109267","url":null,"abstract":"","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"278 ","pages":"Article 109267"},"PeriodicalIF":4.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146156496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable reinforcement learning for glucose monitoring based on shapley value analysis 基于shapley值分析的可解释强化学习血糖监测
IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-05-01 Epub Date: 2026-01-29 DOI: 10.1016/j.cmpb.2026.109266
Arsene Adjevi , Abdiwahab Mohamed Abdirashid , Faruk Aktaş , Mustafa Hikmet Bilgehan Ucar , Serdar Solak

Background and Objective:

Effective diabetes management requires continuous regulation of blood glucose in response to complex factors such as diet, activity, stress, and medication. Advances in continuous glucose monitoring and machine learning have improved short-term glucose prediction. However, preprocessing of signals like insulin, carbohydrate intake, heart rate, and activity to better capture metabolic dynamics remains underexplored. Similarly, the integration of predictive models with preventive strategies for guiding interventions is still limited.

Methods:

We propose a research-only decision-support framework combining signal preprocessing, CNN-based glucose prediction, Shapley Additive Explanations (SHAP) values attribution, and an Actor–Critic Reinforcement Learning (RL) agent. Exponential decay models preprocess inputs, a compact CNN forecasts short-term glucose levels, and SHAP values highlights the most influential input features; however, these attributions reflect associative patterns in the data and do not establish or map to causal clinical mechanisms. These SHAP-derived attributions guide the RL agent, which issues bounded one-step behavioral adjustments. Because SHAP-guided RL remains stochastic and uncertain, the proposed system is exploratory and not clinically safe, serving solely as a simulation framework.

Results:

Using the OhioT1DM dataset, the model achieved state-of-the-art RMSE across prediction horizons with a compact size of 7̃4 KB per patient and training under one minute for 1000 epochs. Over 98% of predictions fell within Clarke Error Grid Zones A and B, confirming safe 5–20 min forecasts. The preventive component corrected hyper- and hypoglycemia in 2̃5% of cases within 10 min when predictions were near 80–120 mg/dL (±10 mg/dL). When deviations exceed ±10 mg/dL, the RL agent is unable to fully restore blood glucose to the target range within 10 min but can bring it as close as possible to the defined interval.

Conclusions:

This study presents a significant innovation by bridging predictive accuracy, adaptability, and transparency in diabetes management. The integration of a predictive model with Reinforcement Learning (RL) guided by SHAP values, which are typically used for interpretability but here are employed in the learning process, delivers a powerful decision support framework. This approach advances the field toward next-generation, personalized digital health tools.
背景与目的:有效的糖尿病管理需要持续调节血糖,以应对复杂的因素,如饮食、活动、压力和药物。连续血糖监测和机器学习的进步改善了短期血糖预测。然而,对胰岛素、碳水化合物摄入、心率和活动等信号的预处理,以更好地捕捉代谢动力学,仍未得到充分探索。同样,将预测模型与指导干预措施的预防战略相结合仍然有限。方法:我们提出了一个仅用于研究的决策支持框架,该框架结合了信号预处理、基于cnn的葡萄糖预测、Shapley加性解释(SHAP)值归因和Actor-Critic强化学习(RL)代理。指数衰减模型预处理输入,紧凑的CNN预测短期血糖水平,SHAP值突出了最具影响力的输入特征;然而,这些归因反映了数据中的关联模式,并没有建立或映射到因果临床机制。这些源自shap的归因引导RL代理,RL代理发出有限的一步行为调整。由于shap引导的RL仍然是随机和不确定的,因此所提出的系统是探索性的,临床上并不安全,仅作为模拟框架。结果:使用OhioT1DM数据集,该模型在预测范围内实现了最先进的RMSE,每位患者的紧凑大小为7.4 KB,训练时间在1分钟内,训练时间为1000次。超过98%的预测落在克拉克误差网格区域A和B内,确认了安全的5-20分钟预测。当预测值接近80-120 mg/dL(±10 mg/dL)时,预防成分在10分钟内纠正了2.5%的高血糖和低血糖。当偏差超过±10mg /dL时,RL剂不能在10min内将血糖完全恢复到目标范围,但可以使其尽可能接近规定的间隔。结论:本研究通过连接糖尿病管理的预测准确性、适应性和透明度,提出了一项重大创新。由SHAP值指导的预测模型与强化学习(RL)的集成提供了一个强大的决策支持框架,SHAP值通常用于可解释性,但这里用于学习过程。这种方法将该领域推向了下一代个性化数字健康工具。
{"title":"Explainable reinforcement learning for glucose monitoring based on shapley value analysis","authors":"Arsene Adjevi ,&nbsp;Abdiwahab Mohamed Abdirashid ,&nbsp;Faruk Aktaş ,&nbsp;Mustafa Hikmet Bilgehan Ucar ,&nbsp;Serdar Solak","doi":"10.1016/j.cmpb.2026.109266","DOIUrl":"10.1016/j.cmpb.2026.109266","url":null,"abstract":"<div><h3>Background and Objective:</h3><div>Effective diabetes management requires continuous regulation of blood glucose in response to complex factors such as diet, activity, stress, and medication. Advances in continuous glucose monitoring and machine learning have improved short-term glucose prediction. However, preprocessing of signals like insulin, carbohydrate intake, heart rate, and activity to better capture metabolic dynamics remains underexplored. Similarly, the integration of predictive models with preventive strategies for guiding interventions is still limited.</div></div><div><h3>Methods:</h3><div>We propose a research-only decision-support framework combining signal preprocessing, CNN-based glucose prediction, Shapley Additive Explanations (SHAP) values attribution, and an Actor–Critic Reinforcement Learning (RL) agent. Exponential decay models preprocess inputs, a compact CNN forecasts short-term glucose levels, and SHAP values highlights the most influential input features; however, these attributions reflect associative patterns in the data and do not establish or map to causal clinical mechanisms. These SHAP-derived attributions guide the RL agent, which issues bounded one-step behavioral adjustments. Because SHAP-guided RL remains stochastic and uncertain, the proposed system is exploratory and not clinically safe, serving solely as a simulation framework.</div></div><div><h3>Results:</h3><div>Using the OhioT1DM dataset, the model achieved state-of-the-art RMSE across prediction horizons with a compact size of 7̃4 KB per patient and training under one minute for 1000 epochs. Over 98% of predictions fell within Clarke Error Grid Zones A and B, confirming safe 5–20 min forecasts. The preventive component corrected hyper- and hypoglycemia in 2̃5% of cases within 10 min when predictions were near 80–120 mg/dL (<span><math><mo>±</mo></math></span>10 mg/dL). When deviations exceed <span><math><mo>±</mo></math></span>10 mg/dL, the RL agent is unable to fully restore blood glucose to the target range within 10 min but can bring it as close as possible to the defined interval.</div></div><div><h3>Conclusions:</h3><div>This study presents a significant innovation by bridging predictive accuracy, adaptability, and transparency in diabetes management. The integration of a predictive model with Reinforcement Learning (RL) guided by SHAP values, which are typically used for interpretability but here are employed in the learning process, delivers a powerful decision support framework. This approach advances the field toward next-generation, personalized digital health tools.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"278 ","pages":"Article 109266"},"PeriodicalIF":4.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146077212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A computational approach for classification of HIV drug resistance based on the self-consistent extreme classifier 基于自洽极值分类器的HIV耐药性分类计算方法。
IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-05-01 Epub Date: 2026-02-02 DOI: 10.1016/j.cmpb.2026.109268
L.A. Stolbov , A.V. Rudik , E.A. Stolbova , A.V. Pokrovskaya , A.B. Shemshura , D.E. Kireev , A.A. Lagunin , D.A. Filimonov , V.V. Poroikov , O.A. Tarasova

Background and Objectives

The development of viral resistance can significantly reduce the effectiveness of therapy. Human immunodeficiency virus type 1 is the cause of chronic immune dysfunction, leading to the development of co-infections and serious complications. Despite worldwide progress and consolidated efforts to overcome HIV drug resistance, the development of novel approaches for rational drug therapy of HIV infection is still needed for building models with high accuracy of prediction and that can be applied for evaluation of resistance against wide variety of inhibitors. Our study is dedicated to the development of a novel computational ML-driven approach for the ternary classification of HIV protease, reverse transcriptase, and integrase sequences. Binary classification approaches naturally are not applicable to capture clinically important intermediate resistance levels, motivating the use of a ternary classification model.

Methods

For the model development we used the Self-Consistent Extreme Classifier. One-versus-rest and one-versus-one ternary approaches were applied to sequences related resistance data from Stanford University HIV Drug Resistance Database (StDB).

Results

For the final classifiers we selected the most appropriate models with 0.913 sensitivity, 0.894 specificity, 0.741 precision and 0.953 area under ROC, all values provided in average. We tested our approach in a clinical task and performed prospective validation for eight sequences of HIV protease and reverse transcriptase obtained from treatment-naive HIV-positive male patients. We performed a prediction and compared the results with the therapeutic outcome, in particular, with the viral load decline at 24 weeks.

Conclusions

The results of the prospective validation are generally consistent with the results of the therapeutic outcome and confirm the possibility of using the developed approach for the selection of the most appropriate therapeutic regimens.
背景和目的:病毒耐药性的发展会显著降低治疗的有效性。人类免疫缺陷病毒1型是慢性免疫功能障碍的原因,导致合并感染和严重并发症的发展。尽管世界范围内在克服艾滋病毒耐药性方面取得了进展和共同努力,但仍然需要开发新的方法来合理治疗艾滋病毒感染,以建立具有高预测精度的模型,并可用于评估对各种抑制剂的耐药性。我们的研究致力于开发一种新的计算机器学习驱动的方法,用于HIV蛋白酶、逆转录酶和整合酶序列的三元分类。二元分类方法自然不适用于捕获临床重要的中间抗性水平,这促使使用三元分类模型。方法:采用自洽极值分类器进行模型开发。对来自Stanford University HIV Drug resistance Database (StDB)的序列相关耐药数据应用One-versus-rest和one-versus-one三元方法。结果:对于最终的分类器,我们选择了最合适的模型,灵敏度为0.913,特异性为0.894,精度为0.741,ROC下面积为0.953,所有值均为平均值。我们在一项临床任务中测试了我们的方法,并对从初次治疗的HIV阳性男性患者中获得的8个HIV蛋白酶和逆转录酶序列进行了前瞻性验证。我们进行了预测,并将结果与治疗结果进行了比较,特别是在24周时病毒载量下降。结论:前瞻性验证的结果与治疗结果基本一致,并证实了使用所开发的方法选择最合适的治疗方案的可能性。
{"title":"A computational approach for classification of HIV drug resistance based on the self-consistent extreme classifier","authors":"L.A. Stolbov ,&nbsp;A.V. Rudik ,&nbsp;E.A. Stolbova ,&nbsp;A.V. Pokrovskaya ,&nbsp;A.B. Shemshura ,&nbsp;D.E. Kireev ,&nbsp;A.A. Lagunin ,&nbsp;D.A. Filimonov ,&nbsp;V.V. Poroikov ,&nbsp;O.A. Tarasova","doi":"10.1016/j.cmpb.2026.109268","DOIUrl":"10.1016/j.cmpb.2026.109268","url":null,"abstract":"<div><h3>Background and Objectives</h3><div>The development of viral resistance can significantly reduce the effectiveness of therapy. Human immunodeficiency virus type 1 is the cause of chronic immune dysfunction, leading to the development of co-infections and serious complications. Despite worldwide progress and consolidated efforts to overcome HIV drug resistance, the development of novel approaches for rational drug therapy of HIV infection is still needed for building models with high accuracy of prediction and that can be applied for evaluation of resistance against wide variety of inhibitors. Our study is dedicated to the development of a novel computational ML-driven approach for the ternary classification of HIV protease, reverse transcriptase, and integrase sequences. Binary classification approaches naturally are not applicable to capture clinically important intermediate resistance levels, motivating the use of a ternary classification model.</div></div><div><h3>Methods</h3><div>For the model development we used the Self-Consistent Extreme Classifier. One-versus-rest and one-versus-one ternary approaches were applied to sequences related resistance data from Stanford University HIV Drug Resistance Database (StDB).</div></div><div><h3>Results</h3><div>For the final classifiers we selected the most appropriate models with 0.913 sensitivity, 0.894 specificity, 0.741 precision and 0.953 area under ROC, all values provided in average. We tested our approach in a clinical task and performed prospective validation for eight sequences of HIV protease and reverse transcriptase obtained from treatment-naive HIV-positive male patients. We performed a prediction and compared the results with the therapeutic outcome, in particular, with the viral load decline at 24 weeks.</div></div><div><h3>Conclusions</h3><div>The results of the prospective validation are generally consistent with the results of the therapeutic outcome and confirm the possibility of using the developed approach for the selection of the most appropriate therapeutic regimens.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"278 ","pages":"Article 109268"},"PeriodicalIF":4.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146164048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Patient-specific fluid-structure interaction modeling of cerebral aneurysm: influence of wall compliance, tissue prestress, and blood rheology 脑动脉瘤患者特异性流体-结构相互作用模型:壁顺应性、组织预应力和血液流变学的影响
IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-05-01 Epub Date: 2026-02-10 DOI: 10.1016/j.cmpb.2026.109284
Amit Raj , Raghvendra Gupta , Anugrah Singh

Background and Objective

Cerebral aneurysms are pathological dilations of intracranial arteries that commonly develop at arterial bifurcations. At these locations, hemodynamic forces significantly affect structural properties of the vascular walls leading to focal weakening and vessel remodeling. This study aims to evaluate the influence of wall compliance and tissue prestress on aneurysmal hemodynamics and wall mechanics using a fluid–structure interaction (FSI) framework. The effect of shear thinning of blood is also studied.

Methods

The flow of blood and its effect on the vessel walls is modelled in a patient-specific cerebral aneurysm. Physiologically realistic inflow conditions derived from PC-MRI is used as the inlet boundary condition and three-element Windkessel model is used to specify the outlet boundary condition to account for the effect of downstream vasculature. Prestress is applied to the arterial wall to mimic the in-vivo stressed state of the vessel wall. Simulations are performed using the Arbitrary–Lagrangian–Eulerian (ALE) FSI approach under different considerations of wall compliance, blood rheology, and prestress, both individually and in-combination. The computational framework is validated against analytical and numerical solutions available in the literature.

Results

Accounting for wall compliance leads to increased inflow into the aneurysm sac and a reduced pressure drop between the inlet and outlet over a cardiac cycle. In the flexible wall model, a single, stable vortex core is observed in the dome instead of the multiple vortices which are observed in case of rigid wall. Further, consideration of flexible walls results in the reduction of peak time-averaged wall shear stress (TAWSS) by ∼20%, reduces the dome area exposed to low TAWSS and regions having high oscillatory shear index (OSI). Including the prestress in model proves critical, as it reduces wall displacement up to 72% and peak tensile stress up to 83% at peak systole. Consideration of shear thinning behaviour of blood further decreases peak TAWSS by up to 25% and reduces area having low TAWSS, but has minimal effect on wall displacement and tensile stress.

Conclusions

Wall compliance, blood rheology, and prestress substantially influence aneurysmal hemodynamics and wall mechanics, with prestress having the most dominant effect in reducing wall deformation and stress.
背景与目的脑动脉瘤是颅内动脉的病理性扩张,通常发生在动脉分叉处。在这些位置,血流动力学力显著影响血管壁的结构特性,导致局灶性减弱和血管重塑。本研究旨在利用流固相互作用(FSI)框架评估管壁顺应性和组织预应力对动脉瘤血流动力学和管壁力学的影响。还研究了血液剪切变薄的效果。方法采用脑动脉瘤模型模拟血流及其对血管壁的影响。采用PC-MRI得出的生理上真实的入流条件作为入口边界条件,采用三元Windkessel模型指定出口边界条件,以考虑下游脉管系统的影响。对动脉壁施加预应力以模拟血管壁在体内的受力状态。模拟使用任意拉格朗日-欧拉(ALE) FSI方法,在不同的壁面顺应性、血液流变学和预应力的考虑下进行,无论是单独的还是组合的。计算框架是根据文献中可用的解析和数值解进行验证的。结果在一个心动周期内,考虑壁面顺应性导致动脉瘤囊内的流入增加,入口和出口之间的压降减小。在柔性壁面模型中,圆顶内只观察到一个稳定的涡核,而刚性壁面中则观察到多个涡核。此外,考虑柔性墙体可使峰值时间平均墙体剪切应力(TAWSS)降低约20%,减少暴露于低TAWSS和高振荡剪切指数(OSI)区域的圆顶面积。在模型中加入预应力被证明是至关重要的,因为它可以减少高达72%的壁面位移和高达83%的峰值收缩拉应力。考虑到血液的剪切变薄行为,进一步降低峰值TAWSS高达25%,并减少具有低TAWSS的面积,但对壁位移和拉应力的影响最小。结论小血管顺应性、血液流变学和预应力对动脉瘤血流动力学和壁面力学有显著影响,其中预应力对减小壁面变形和应力的作用最为显著。
{"title":"Patient-specific fluid-structure interaction modeling of cerebral aneurysm: influence of wall compliance, tissue prestress, and blood rheology","authors":"Amit Raj ,&nbsp;Raghvendra Gupta ,&nbsp;Anugrah Singh","doi":"10.1016/j.cmpb.2026.109284","DOIUrl":"10.1016/j.cmpb.2026.109284","url":null,"abstract":"<div><h3>Background and Objective</h3><div>Cerebral aneurysms are pathological dilations of intracranial arteries that commonly develop at arterial bifurcations. At these locations, hemodynamic forces significantly affect structural properties of the vascular walls leading to focal weakening and vessel remodeling. This study aims to evaluate the influence of wall compliance and tissue prestress on aneurysmal hemodynamics and wall mechanics using a fluid–structure interaction (FSI) framework. The effect of shear thinning of blood is also studied.</div></div><div><h3>Methods</h3><div>The flow of blood and its effect on the vessel walls is modelled in a patient-specific cerebral aneurysm. Physiologically realistic inflow conditions derived from PC-MRI is used as the inlet boundary condition and three-element Windkessel model is used to specify the outlet boundary condition to account for the effect of downstream vasculature. Prestress is applied to the arterial wall to mimic the in-vivo stressed state of the vessel wall. Simulations are performed using the Arbitrary–Lagrangian–Eulerian (ALE) FSI approach under different considerations of wall compliance, blood rheology, and prestress, both individually and in-combination. The computational framework is validated against analytical and numerical solutions available in the literature.</div></div><div><h3>Results</h3><div>Accounting for wall compliance leads to increased inflow into the aneurysm sac and a reduced pressure drop between the inlet and outlet over a cardiac cycle. In the flexible wall model, a single, stable vortex core is observed in the dome instead of the multiple vortices which are observed in case of rigid wall. Further, consideration of flexible walls results in the reduction of peak time-averaged wall shear stress (TAWSS) by ∼20%, reduces the dome area exposed to low TAWSS and regions having high oscillatory shear index (OSI). Including the prestress in model proves critical, as it reduces wall displacement up to 72% and peak tensile stress up to 83% at peak systole. Consideration of shear thinning behaviour of blood further decreases peak TAWSS by up to 25% and reduces area having low TAWSS, but has minimal effect on wall displacement and tensile stress.</div></div><div><h3>Conclusions</h3><div>Wall compliance, blood rheology, and prestress substantially influence aneurysmal hemodynamics and wall mechanics, with prestress having the most dominant effect in reducing wall deformation and stress.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"278 ","pages":"Article 109284"},"PeriodicalIF":4.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146186316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HCAR1 antagonist screening based on boundary-selected negative sampling strategy and multi-level graph neural network 基于边界选择负抽样策略和多层次图神经网络的HCAR1拮抗剂筛选。
IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-05-01 Epub Date: 2026-01-24 DOI: 10.1016/j.cmpb.2026.109262
Mengmeng Fan , Dakuo He , Qian Liu , Qing Liu , Feng Wang , He Li , Hao Wang , Siqi Shan , Jinghao Zhang , Yue Hou

Background and objective:

Hydroxycarboxylic acid receptor 1 (HCAR1), also known as the lactate receptor, is closely associated with tumorigenesis and cancer progression due to its aberrant activation, making it an attractive therapeutic target for cancer treatment. Accurate prediction of HCAR1 antagonists is therefore crucial for tumor immunotherapy. However, traditional drug screening suffers from high costs and suboptimal performance caused by imbalanced datasets and incomplete molecular representations, contributing to the scarcity of clinically available HCAR1 antagonists.

Methods:

A balanced HCAR1 target activity dataset was constructed using a boundary-selected negative sampling strategy. Subsequently, a multi-level graph neural network (Multi-GNN) was proposed for HCAR1 target activity prediction, integrating multiple molecular representations, including fingerprints, molecular graphs, and fragment-level features.

Results:

Experimental results demonstrate that the proposed model outperforms eight state-of-the-art methods in comparative evaluations. Furthermore, approximately ten million compounds were screened using the trained Multi-GNN model in combination with physicochemical filtering and molecular docking, yielding five candidate compounds. Finally, in vitro cAMP antagonistic activity assays identified a promising HCAR1 inhibitor with an IC50 of 22.39 μM.

Conclusions:

This study introduces a novel artificial intelligence-based framework for HCAR1-targeted drug discovery and highlights potential lead compounds for further development.
背景与目的:羟羧酸受体1 (Hydroxycarboxylic acid receptor 1, HCAR1)又称乳酸受体,由于其异常激活,与肿瘤发生和癌症进展密切相关,成为癌症治疗的一个有吸引力的治疗靶点。因此,准确预测HCAR1拮抗剂对肿瘤免疫治疗至关重要。然而,由于数据集不平衡和分子表征不完整,传统的药物筛选成本高,性能不理想,导致临床可用的HCAR1拮抗剂稀缺。方法:采用边界选择负采样策略构建平衡的HCAR1靶活性数据集。随后,提出了一种多层次图神经网络(Multi-GNN)用于HCAR1靶点活性预测,该网络集成了多种分子表征,包括指纹、分子图和片段级特征。结果:实验结果表明,该模型在比较评价中优于八种最先进的方法。此外,利用训练好的Multi-GNN模型,结合物理化学过滤和分子对接,筛选了大约1000万种化合物,产生了5种候选化合物。最后,体外cAMP拮抗活性测定鉴定出一种有前景的HCAR1抑制剂,IC50为22.39 μM。结论:本研究为hcar1靶向药物的发现引入了一种新的基于人工智能的框架,并突出了潜在的先导化合物。
{"title":"HCAR1 antagonist screening based on boundary-selected negative sampling strategy and multi-level graph neural network","authors":"Mengmeng Fan ,&nbsp;Dakuo He ,&nbsp;Qian Liu ,&nbsp;Qing Liu ,&nbsp;Feng Wang ,&nbsp;He Li ,&nbsp;Hao Wang ,&nbsp;Siqi Shan ,&nbsp;Jinghao Zhang ,&nbsp;Yue Hou","doi":"10.1016/j.cmpb.2026.109262","DOIUrl":"10.1016/j.cmpb.2026.109262","url":null,"abstract":"<div><h3>Background and objective:</h3><div>Hydroxycarboxylic acid receptor 1 (HCAR1), also known as the lactate receptor, is closely associated with tumorigenesis and cancer progression due to its aberrant activation, making it an attractive therapeutic target for cancer treatment. Accurate prediction of HCAR1 antagonists is therefore crucial for tumor immunotherapy. However, traditional drug screening suffers from high costs and suboptimal performance caused by imbalanced datasets and incomplete molecular representations, contributing to the scarcity of clinically available HCAR1 antagonists.</div></div><div><h3>Methods:</h3><div>A balanced HCAR1 target activity dataset was constructed using a boundary-selected negative sampling strategy. Subsequently, a multi-level graph neural network (Multi-GNN) was proposed for HCAR1 target activity prediction, integrating multiple molecular representations, including fingerprints, molecular graphs, and fragment-level features.</div></div><div><h3>Results:</h3><div>Experimental results demonstrate that the proposed model outperforms eight state-of-the-art methods in comparative evaluations. Furthermore, approximately ten million compounds were screened using the trained Multi-GNN model in combination with physicochemical filtering and molecular docking, yielding five candidate compounds. Finally, in vitro cAMP antagonistic activity assays identified a promising HCAR1 inhibitor with an <span><math><msub><mrow><mtext>IC</mtext></mrow><mrow><mn>50</mn></mrow></msub></math></span> of 22.39 <span><math><mrow><mi>μ</mi><mi>M</mi></mrow></math></span>.</div></div><div><h3>Conclusions:</h3><div>This study introduces a novel artificial intelligence-based framework for HCAR1-targeted drug discovery and highlights potential lead compounds for further development.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"278 ","pages":"Article 109262"},"PeriodicalIF":4.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146112531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DNA-Driven EEG monitoring for rapid seizure prediction in healthcare dna驱动脑电图监测快速癫痫发作预测在医疗保健。
IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-05-01 Epub Date: 2026-02-03 DOI: 10.1016/j.cmpb.2026.109277
Khalid Ansari, Unnati Chaurasia, Himanshu Kumar Pathak, Koushlendra Kumar Singh, Jitesh Pradhan

Background and Objective:

Worldwide, over 50 million people suffer from epilepsy, a neurological disorder characterised by recurrent seizures due to abnormal electrical activity in the brain. These occur as a result of sudden electric surges and the symptoms vary based on the region of the brain being affected, including brief staring spells and confusion to convulsions and loss of consciousness. Physicians typically classify seizures into four main phases: Interictal, Preictal, Ictal, and Postictal. Accurate analysis of EEG signals around seizure onset is extremely critical for timely clinical intervention. However, the current methodologies majorly utilise complex Convolutional Neural Networks (CNNs) with millions of parameters. They require high computational power, and, hence, it is difficult to deploy them in wearable devices. The core idea of this work is to develop a computationally compact architecture for seizure onset discrimination that offers potential for future integration with wearable devices.

Methods:

To achieve this, this work proposes employing a DNA-based encoding framework for Electroencephalogram (EEG) signals. Existing DNA-based compression techniques have demonstrated significant potential in reducing data complexity. Multichannel EEG signals using 23 scalp electrodes are obtained from the CHB-MIT dataset and normalised using min–max scaling. The signals are then windowed to capture temporal dependencies and transformed into integer safe magnitudes before being converted to binary. This approach then involves genetic coding-based preprocessing: genetic transcription and translation (DNA RNA Codons Amino Acids) occur. By converting EEG signal data to amino acid sequences, the proposed encoding scheme aims to capture underlying patterns in the data and provide a compact representation of temporal patterns. The encoded sequences are subsequently processed using a lightweight one-dimensional multi-level parallel CNN architecture.

Results and Conclusion:

These DNA-encoded EEG sequences are then used as input to the proposed 1D multi-level parallel CNN model, with drastically fewer parameters. After extensive testing, the proposed model achieves an accuracy of 96.22%. Additionally, the applicability of the proposed encoding framework on early seizure prediction tasks under a subject-wise protocol has been evaluated. An accuracy of 93.87% has been achieved. Overall, these findings indicate that the proposed approach provides a compact and effective representation for EEG-based seizure analysis across related onset and early prediction tasks.
背景和目的:全世界有5000多万人患有癫痫,这是一种神经系统疾病,其特征是由于大脑电活动异常而导致反复发作。这是由于突然的电涌造成的,症状因受影响的大脑区域而异,包括短暂的凝视、混乱到抽搐和失去意识。医生通常将癫痫发作分为四个主要阶段:发作间期、发作前期、发作期和发作后。准确分析癫痫发作前后的脑电图信号对临床及时干预至关重要。然而,目前的方法主要利用具有数百万个参数的复杂卷积神经网络(cnn)。它们需要很高的计算能力,因此很难将它们部署在可穿戴设备中。这项工作的核心思想是开发一种计算紧凑的癫痫发作识别架构,为未来与可穿戴设备的集成提供潜力。方法:为了实现这一目标,本工作提出采用基于dna的脑电图(EEG)信号编码框架。现有的基于dna的压缩技术在降低数据复杂性方面已经显示出巨大的潜力。使用23个头皮电极从CHB-MIT数据集中获得多通道EEG信号,并使用最小-最大缩放进行归一化。然后将信号加窗以捕获时间依赖性,并在转换为二进制之前将其转换为整数安全幅度。这种方法随后涉及到基于遗传编码的预处理:基因转录和翻译(DNA→RNA→密码子→氨基酸)发生。通过将脑电信号数据转换为氨基酸序列,提出的编码方案旨在捕获数据中的潜在模式,并提供时间模式的紧凑表示。编码序列随后使用轻量级的一维多层并行CNN架构进行处理。结果和结论:这些dna编码的脑电图序列随后被用作所提出的一维多级并行CNN模型的输入,参数大大减少。经过大量的测试,该模型的准确率达到96.22%。此外,所提出的编码框架在主题协议下对早期癫痫发作预测任务的适用性进行了评估。准确率达到93.87%。总的来说,这些发现表明,所提出的方法为基于脑电图的癫痫发作分析提供了一种紧凑而有效的表示,涵盖了相关的发病和早期预测任务。
{"title":"DNA-Driven EEG monitoring for rapid seizure prediction in healthcare","authors":"Khalid Ansari,&nbsp;Unnati Chaurasia,&nbsp;Himanshu Kumar Pathak,&nbsp;Koushlendra Kumar Singh,&nbsp;Jitesh Pradhan","doi":"10.1016/j.cmpb.2026.109277","DOIUrl":"10.1016/j.cmpb.2026.109277","url":null,"abstract":"<div><h3>Background and Objective:</h3><div>Worldwide, over 50 million people suffer from epilepsy, a neurological disorder characterised by recurrent seizures due to abnormal electrical activity in the brain. These occur as a result of sudden electric surges and the symptoms vary based on the region of the brain being affected, including brief staring spells and confusion to convulsions and loss of consciousness. Physicians typically classify seizures into four main phases: Interictal, Preictal, Ictal, and Postictal. Accurate analysis of EEG signals around seizure onset is extremely critical for timely clinical intervention. However, the current methodologies majorly utilise complex Convolutional Neural Networks (CNNs) with millions of parameters. They require high computational power, and, hence, it is difficult to deploy them in wearable devices. The core idea of this work is to develop a computationally compact architecture for seizure onset discrimination that offers potential for future integration with wearable devices.</div></div><div><h3>Methods:</h3><div>To achieve this, this work proposes employing a DNA-based encoding framework for Electroencephalogram (EEG) signals. Existing DNA-based compression techniques have demonstrated significant potential in reducing data complexity. Multichannel EEG signals using 23 scalp electrodes are obtained from the CHB-MIT dataset and normalised using min–max scaling. The signals are then windowed to capture temporal dependencies and transformed into integer safe magnitudes before being converted to binary. This approach then involves genetic coding-based preprocessing: genetic transcription and translation (DNA <span><math><mo>→</mo></math></span> RNA <span><math><mo>→</mo></math></span> Codons <span><math><mo>→</mo></math></span> Amino Acids) occur. By converting EEG signal data to amino acid sequences, the proposed encoding scheme aims to capture underlying patterns in the data and provide a compact representation of temporal patterns. The encoded sequences are subsequently processed using a lightweight one-dimensional multi-level parallel CNN architecture.</div></div><div><h3>Results and Conclusion:</h3><div>These DNA-encoded EEG sequences are then used as input to the proposed 1D multi-level parallel CNN model, with drastically fewer parameters. After extensive testing, the proposed model achieves an accuracy of 96.22%. Additionally, the applicability of the proposed encoding framework on early seizure prediction tasks under a subject-wise protocol has been evaluated. An accuracy of 93.87% has been achieved. Overall, these findings indicate that the proposed approach provides a compact and effective representation for EEG-based seizure analysis across related onset and early prediction tasks.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"278 ","pages":"Article 109277"},"PeriodicalIF":4.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146177268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computer methods and programs in biomedicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1