首页 > 最新文献

Intelligence-based medicine最新文献

英文 中文
Explainable stacked-ensemble prediction of ABL1 tyrosine-kinase inhibitor resistance: A metaheuristic-optimized pipeline ABL1酪氨酸激酶抑制剂耐药的可解释的堆叠系综预测:一个元启发式优化管道
Pub Date : 2025-12-30 DOI: 10.1016/j.ibmed.2025.100341
Faris Hassan , Mohanad A. Deif , Alaa Zaghloul , Rania Elgohary , Mohammad Khishe
Accurate identification of ABL1 tyrosine-kinase mutations associated with therapeutic resistance can support timely adjustment of anticancer regimens; however, tyrosine-kinase inhibitor (TKI) mutation datasets are typically small and strongly imbalanced, which can bias model training and inflate performance if data processing is not strictly separated between training and testing. This study proposes an end-to-end, leakage-controlled machine-learning framework for ABL1 TKI-resistance prediction, in which all data-driven operations including feature selection and Synthetic Minority Oversampling Technique (SMOTE) are performed within cross-validation training folds only, preventing information from validation folds or the test set from influencing model development. Multiple base learners were independently tuned using metaheuristic hyperparameter optimization and then integrated using a stacked-ensemble architecture to reduce overfitting and improve generalization. On a held-out test set, the final ensemble achieved 91.9% accuracy, 75.0% precision, 96.9% specificity, 60.0% sensitivity, 66.7% F1-score, 0.626 MCC, 0.938 AUROC, and 0.729 PR-AUC, showing only a modest decline relative to cross-validation estimates. Post-hoc interpretability with Shapley additive explanations (SHAP) highlighted binding-score terms, mutation physicochemical descriptors, ligand flexibility, and local mutation-environment features as the main contributors, consistent with established principles of protein–ligand recognition. Overall, the results support a methodologically disciplined and interpretable approach for mutation-level resistance prediction, while motivating external validation and downstream evaluation of clinical utility.
准确识别与治疗耐药相关的ABL1酪氨酸激酶突变可以支持及时调整抗癌方案;然而,酪氨酸激酶抑制剂(TKI)突变数据集通常很小且极不平衡,如果数据处理没有严格区分训练和测试,这可能会影响模型训练和提高性能。本研究提出了一个端到端、泄漏控制的ABL1 tki抗性预测机器学习框架,其中所有数据驱动的操作,包括特征选择和合成少数派过采样技术(SMOTE),仅在交叉验证训练折叠中执行,防止验证折叠的信息或测试集影响模型开发。采用元启发式超参数优化对多个基学习器进行独立调优,然后采用堆叠集成架构进行集成,以减少过拟合,提高泛化能力。在一个固定的测试集上,最终的集合达到了91.9%的准确度、75.0%的精密度、96.9%的特异性、60.0%的灵敏度、66.7%的f1评分、0.626的MCC、0.938的AUROC和0.729的PR-AUC,相对于交叉验证的估计只有轻微的下降。Shapley加性解释(SHAP)的事后解释性突出了结合评分术语、突变物理化学描述符、配体灵活性和局部突变环境特征作为主要贡献者,与蛋白质配体识别的既定原则一致。总的来说,这些结果支持了一种方法学上有纪律和可解释的突变水平耐药预测方法,同时激发了外部验证和临床效用的下游评估。
{"title":"Explainable stacked-ensemble prediction of ABL1 tyrosine-kinase inhibitor resistance: A metaheuristic-optimized pipeline","authors":"Faris Hassan ,&nbsp;Mohanad A. Deif ,&nbsp;Alaa Zaghloul ,&nbsp;Rania Elgohary ,&nbsp;Mohammad Khishe","doi":"10.1016/j.ibmed.2025.100341","DOIUrl":"10.1016/j.ibmed.2025.100341","url":null,"abstract":"<div><div>Accurate identification of ABL1 tyrosine-kinase mutations associated with therapeutic resistance can support timely adjustment of anticancer regimens; however, tyrosine-kinase inhibitor (TKI) mutation datasets are typically small and strongly imbalanced, which can bias model training and inflate performance if data processing is not strictly separated between training and testing. This study proposes an end-to-end, leakage-controlled machine-learning framework for ABL1 TKI-resistance prediction, in which all data-driven operations including feature selection and Synthetic Minority Oversampling Technique (SMOTE) are performed within cross-validation training folds only, preventing information from validation folds or the test set from influencing model development. Multiple base learners were independently tuned using metaheuristic hyperparameter optimization and then integrated using a stacked-ensemble architecture to reduce overfitting and improve generalization. On a held-out test set, the final ensemble achieved 91.9% accuracy, 75.0% precision, 96.9% specificity, 60.0% sensitivity, 66.7% F1-score, 0.626 MCC, 0.938 AUROC, and 0.729 PR-AUC, showing only a modest decline relative to cross-validation estimates. Post-hoc interpretability with Shapley additive explanations (SHAP) highlighted binding-score terms, mutation physicochemical descriptors, ligand flexibility, and local mutation-environment features as the main contributors, consistent with established principles of protein–ligand recognition. Overall, the results support a methodologically disciplined and interpretable approach for mutation-level resistance prediction, while motivating external validation and downstream evaluation of clinical utility.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"13 ","pages":"Article 100341"},"PeriodicalIF":0.0,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the role of synthetic data in the future of AI in healthcare: A scoping review of frameworks, challenges, and implications 探索人工智能在医疗保健领域的未来中合成数据的作用:框架、挑战和影响的范围审查
Pub Date : 2025-12-29 DOI: 10.1016/j.ibmed.2025.100342
Mohammad Ishtiaque Rahman , Md Razuan Hossain , S.M. Sayem , Forhan Bin Emdad
Synthetic data has emerged as a transformative tool in healthcare, particularly in areas such as medical imaging, electronic health records (EHRs), and clinical trial simulation, where data privacy, diversity, and accessibility are critical. This scoping review examines current approaches to synthetic data generation in healthcare, with a focus on AI model training, privacy preservation, and bias mitigation. A comprehensive search of PubMed, IEEE Xplore, and ACM Digital Library yielded 2906 studies, of which 42 met the inclusion criteria. Key data generation techniques included generative adversarial networks (GANs), variational autoencoders (VAEs), diffusion models, Bayesian networks, federated learning, recurrent neural networks (RNNs), large language models (LLMs), agent-based models, graph-based generators, and SMOTE-based oversampling. Applications ranged from diagnostic model development to privacy-preserving data sharing and educational simulation. However, the field faces persistent challenges, including inconsistent validation practices, the absence of standard benchmarks, high computational demands, and ethical concerns related to consent and bias. This review underscores the need for standardized evaluation protocols, clearer regulatory guidance, and multidisciplinary collaboration to ensure the safe, equitable, and effective use of synthetic data in healthcare AI. In addition to technical advances, the review highlights the socio-technical implications of synthetic data adoption, including its impact on health equity, patient trust, and clinical decision-making.
合成数据已成为医疗保健领域的变革性工具,特别是在医疗成像、电子健康记录(EHRs)和临床试验模拟等领域,这些领域的数据隐私、多样性和可访问性至关重要。本范围审查审查了医疗保健中合成数据生成的当前方法,重点是人工智能模型训练、隐私保护和偏见缓解。综合检索PubMed、IEEE explore和ACM数字图书馆得到2906篇研究,其中42篇符合纳入标准。关键的数据生成技术包括生成对抗网络(GANs)、变分自编码器(VAEs)、扩散模型、贝叶斯网络、联邦学习、循环神经网络(rnn)、大型语言模型(LLMs)、基于代理的模型、基于图的生成器和基于smote的过采样。应用范围从诊断模型开发到保护隐私的数据共享和教育模拟。然而,该领域面临着持续的挑战,包括不一致的验证实践、缺乏标准基准、高计算需求以及与同意和偏见相关的伦理问题。本综述强调需要标准化的评估方案、更清晰的监管指导和多学科合作,以确保在医疗保健人工智能中安全、公平和有效地使用合成数据。除了技术进步外,该审查还强调了采用综合数据的社会技术影响,包括其对卫生公平、患者信任和临床决策的影响。
{"title":"Exploring the role of synthetic data in the future of AI in healthcare: A scoping review of frameworks, challenges, and implications","authors":"Mohammad Ishtiaque Rahman ,&nbsp;Md Razuan Hossain ,&nbsp;S.M. Sayem ,&nbsp;Forhan Bin Emdad","doi":"10.1016/j.ibmed.2025.100342","DOIUrl":"10.1016/j.ibmed.2025.100342","url":null,"abstract":"<div><div>Synthetic data has emerged as a transformative tool in healthcare, particularly in areas such as medical imaging, electronic health records (EHRs), and clinical trial simulation, where data privacy, diversity, and accessibility are critical. This scoping review examines current approaches to synthetic data generation in healthcare, with a focus on AI model training, privacy preservation, and bias mitigation. A comprehensive search of PubMed, IEEE Xplore, and ACM Digital Library yielded 2906 studies, of which 42 met the inclusion criteria. Key data generation techniques included generative adversarial networks (GANs), variational autoencoders (VAEs), diffusion models, Bayesian networks, federated learning, recurrent neural networks (RNNs), large language models (LLMs), agent-based models, graph-based generators, and SMOTE-based oversampling. Applications ranged from diagnostic model development to privacy-preserving data sharing and educational simulation. However, the field faces persistent challenges, including inconsistent validation practices, the absence of standard benchmarks, high computational demands, and ethical concerns related to consent and bias. This review underscores the need for standardized evaluation protocols, clearer regulatory guidance, and multidisciplinary collaboration to ensure the safe, equitable, and effective use of synthetic data in healthcare AI. In addition to technical advances, the review highlights the socio-technical implications of synthetic data adoption, including its impact on health equity, patient trust, and clinical decision-making.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"13 ","pages":"Article 100342"},"PeriodicalIF":0.0,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The potential of artificial intelligence in advancing neuroscience: A systematic review of current applications and models 人工智能在推进神经科学方面的潜力:对当前应用和模型的系统回顾
Pub Date : 2025-12-27 DOI: 10.1016/j.ibmed.2025.100338
Fatemeh Afroughi , SeyedAhmad SeyedAlinaghi , Pegah Mirzapour , Shabnam Shirdel , Zohal Parmoon , Mohammad Musa Khorshidi , Somaye Mansouri , Mahdi Sheykhi , Yusuf Popoola , Esmaeil Mehraeen

Introduction

Artificial intelligence (AI) is the simulation of human intelligence, in which machines perform problem-solving like the human brain. AI and neuroscience are interrelated. In this study, a systematic review of current AI models and applications was conducted to consider the potential of AI in advancing neuroscience.

Methods

Relevant articles were selected based on a search in three reputable databases, including Web of Science, PubMed, and Scopus. Two independent researchers conducted the selection process in two stages.

Results

A total of 99 studies (2019–2024) met PRISMA criteria. Of these, 83 studies focused on specific brain disorders—most notably Alzheimer's disease (n = 26), stroke (n = 14), epilepsy (n = 7), and Parkinson's disease (n = 7)—while 22 addressed broader neuroscience applications. A range of AI methods were applied, including traditional machine learning techniques (e.g., Support Vector Machines (SVM), Random Forest) and deep learning approaches (e.g., Convolutional Neural Networks (CNNs), Generative Adversarial Networks (GANs)), with several studies employing hybrid models. A comparative analysis of study designs revealed a heavy reliance on public datasets (e.g., Alzheimers Disease Neuroimaging Initiative (ADNI)) for Alzheimer's research, while studies on other disorders predominantly utilized private cohorts. Regarding validation, the majority of studies employed internal cross-validation strategies, with fewer utilizing independent external datasets to test generalizability.

Conclusion

The transformative potential of AI in advancing neuroscience lies in its ability to increase diagnostic accuracy, predict disease progression, and enhance imaging techniques. Future research should focus on refining AI methods to enhance generalizability and foster collaborations between AI practitioners and neuroscientists.
人工智能(AI)是对人类智能的模拟,机器可以像人脑一样解决问题。人工智能和神经科学是相互关联的。在本研究中,对当前人工智能模型和应用进行了系统回顾,以考虑人工智能在推进神经科学方面的潜力。方法在Web of Science、PubMed、Scopus三个知名数据库中检索相关文章。两位独立的研究人员分两个阶段进行了选择。结果2019-2024年共有99项研究符合PRISMA标准。其中,83项研究集中于特定的脑部疾病——最著名的是阿尔茨海默病(n = 26)、中风(n = 14)、癫痫(n = 7)和帕金森病(n = 7)——而22项研究涉及更广泛的神经科学应用。应用了一系列人工智能方法,包括传统的机器学习技术(例如,支持向量机(SVM),随机森林)和深度学习方法(例如,卷积神经网络(cnn),生成对抗网络(gan)),以及一些采用混合模型的研究。对研究设计的比较分析显示,阿尔茨海默病研究严重依赖公共数据集(例如,阿尔茨海默病神经影像学倡议(ADNI)),而对其他疾病的研究主要使用私人队列。在验证方面,大多数研究采用内部交叉验证策略,较少使用独立的外部数据集来测试概括性。结论人工智能在推进神经科学方面的变革潜力在于其提高诊断准确性、预测疾病进展和增强成像技术的能力。未来的研究应侧重于改进人工智能方法,以提高通用性,并促进人工智能从业者和神经科学家之间的合作。
{"title":"The potential of artificial intelligence in advancing neuroscience: A systematic review of current applications and models","authors":"Fatemeh Afroughi ,&nbsp;SeyedAhmad SeyedAlinaghi ,&nbsp;Pegah Mirzapour ,&nbsp;Shabnam Shirdel ,&nbsp;Zohal Parmoon ,&nbsp;Mohammad Musa Khorshidi ,&nbsp;Somaye Mansouri ,&nbsp;Mahdi Sheykhi ,&nbsp;Yusuf Popoola ,&nbsp;Esmaeil Mehraeen","doi":"10.1016/j.ibmed.2025.100338","DOIUrl":"10.1016/j.ibmed.2025.100338","url":null,"abstract":"<div><h3>Introduction</h3><div>Artificial intelligence (AI) is the simulation of human intelligence, in which machines perform problem-solving like the human brain. AI and neuroscience are interrelated. In this study, a systematic review of current AI models and applications was conducted to consider the potential of AI in advancing neuroscience.</div></div><div><h3>Methods</h3><div>Relevant articles were selected based on a search in three reputable databases, including Web of Science, PubMed, and Scopus. Two independent researchers conducted the selection process in two stages.</div></div><div><h3>Results</h3><div>A total of 99 studies (2019–2024) met PRISMA criteria. Of these, 83 studies focused on specific brain disorders—most notably Alzheimer's disease (n = 26), stroke (n = 14), epilepsy (n = 7), and Parkinson's disease (n = 7)—while 22 addressed broader neuroscience applications. A range of AI methods were applied, including traditional machine learning techniques (e.g., Support Vector Machines (SVM), Random Forest) and deep learning approaches (e.g., Convolutional Neural Networks (CNNs), Generative Adversarial Networks (GANs)), with several studies employing hybrid models. A comparative analysis of study designs revealed a heavy reliance on public datasets (e.g., Alzheimers Disease Neuroimaging Initiative (ADNI)) for Alzheimer's research, while studies on other disorders predominantly utilized private cohorts. Regarding validation, the majority of studies employed internal cross-validation strategies, with fewer utilizing independent external datasets to test generalizability.</div></div><div><h3>Conclusion</h3><div>The transformative potential of AI in advancing neuroscience lies in its ability to increase diagnostic accuracy, predict disease progression, and enhance imaging techniques. Future research should focus on refining AI methods to enhance generalizability and foster collaborations between AI practitioners and neuroscientists.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"13 ","pages":"Article 100338"},"PeriodicalIF":0.0,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A scoping review of machine learning models for predicting preterm birth and miscarriage: Mapping the landscape and the performance-validation paradox 预测早产和流产的机器学习模型的范围审查:绘制景观和性能验证悖论
Pub Date : 2025-12-22 DOI: 10.1016/j.ibmed.2025.100337
Golnoush Shahraki , Elias Mazrooei Rad , Mohammad Heidari

Objective

This scoping review aims to summarize how machine learning (ML) has been applied to predict adverse pregnancy outcomes—particularly preterm birth and miscarriage—by comparing data sources, model designs, and reported performance.

Methods

Evidence from 40 eligible studies was systematically reviewed. For each, we extracted details on the type of data used (e.g., clinical records, imaging, biomarkers), sample size, ML algorithms, and major performance metrics such as AUC and accuracy. Recurring strengths and weaknesses were identified through thematic analysis.

Results

The reviewed studies drew on a wide range of data sources—from large electronic health records (EHRs) to imaging and time-series physiological signals. Tree-based algorithms and support vector machines generally showed strong predictive performance, with several studies reporting AUC values above 0.90. However, most investigations were limited by small, single-center datasets and lacked external validation, raising concerns about generalizability and clinical interpretability.

Conclusion

Machine learning approaches could meaningfully improve how clinicians anticipate adverse pregnancy outcomes, but their clinical use remains premature. Progress will depend on studies that include larger and more diverse populations, apply rigorous external validation, and focus on developing models that clinicians can interpret and act upon.
本综述旨在通过比较数据来源、模型设计和报告的性能,总结机器学习(ML)如何应用于预测不良妊娠结局(特别是早产和流产)。方法系统回顾40项符合条件的研究的证据。对于每种方法,我们提取了所使用数据类型(例如,临床记录、成像、生物标志物)、样本量、ML算法和主要性能指标(如AUC和准确性)的详细信息。通过专题分析确定了反复出现的优点和缺点。结果回顾的研究利用了广泛的数据来源——从大型电子健康记录(EHRs)到成像和时间序列生理信号。基于树的算法和支持向量机普遍表现出较强的预测性能,有几项研究报告AUC值在0.90以上。然而,大多数研究受限于小型单中心数据集,缺乏外部验证,引起了对普遍性和临床可解释性的担忧。结论机器学习方法可以显著提高临床医生对不良妊娠结局的预测,但其临床应用尚不成熟。进展将取决于包括更大和更多样化人群的研究,应用严格的外部验证,并专注于开发临床医生可以解释和采取行动的模型。
{"title":"A scoping review of machine learning models for predicting preterm birth and miscarriage: Mapping the landscape and the performance-validation paradox","authors":"Golnoush Shahraki ,&nbsp;Elias Mazrooei Rad ,&nbsp;Mohammad Heidari","doi":"10.1016/j.ibmed.2025.100337","DOIUrl":"10.1016/j.ibmed.2025.100337","url":null,"abstract":"<div><h3>Objective</h3><div>This scoping review aims to summarize how machine learning (ML) has been applied to predict adverse pregnancy outcomes—particularly preterm birth and miscarriage—by comparing data sources, model designs, and reported performance.</div></div><div><h3>Methods</h3><div>Evidence from 40 eligible studies was systematically reviewed. For each, we extracted details on the type of data used (e.g., clinical records, imaging, biomarkers), sample size, ML algorithms, and major performance metrics such as AUC and accuracy. Recurring strengths and weaknesses were identified through thematic analysis.</div></div><div><h3>Results</h3><div>The reviewed studies drew on a wide range of data sources—from large electronic health records (EHRs) to imaging and time-series physiological signals. Tree-based algorithms and support vector machines generally showed strong predictive performance, with several studies reporting AUC values above 0.90. However, most investigations were limited by small, single-center datasets and lacked external validation, raising concerns about generalizability and clinical interpretability.</div></div><div><h3>Conclusion</h3><div>Machine learning approaches could meaningfully improve how clinicians anticipate adverse pregnancy outcomes, but their clinical use remains premature. Progress will depend on studies that include larger and more diverse populations, apply rigorous external validation, and focus on developing models that clinicians can interpret and act upon.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"13 ","pages":"Article 100337"},"PeriodicalIF":0.0,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diffusion models vs. DCGANs for class-imbalanced lung cancer CT classification: A comparative study 扩散模型与dcgan在分级不平衡肺癌CT分型中的比较研究
Pub Date : 2025-12-22 DOI: 10.1016/j.ibmed.2025.100336
Masoud Tabibian, Tahereh Razmpour, Rajib Saha
Effective lung cancer detection from CT scans remains critically challenged by class imbalance where benign and normal cases are underrepresented, leading to biased machine learning models with reduced sensitivity for minority classes and potentially missed diagnoses in cancer screening applications. We present a comprehensive comparative analysis of Diffusion Models and Deep Convolutional Generative Adversarial Networks (DCGANs), both incorporating modern architectural enhancements including spectral normalization, self-attention mechanisms, and conditional generation, for addressing class imbalance in lung cancer CT classification. Using the IQ-OTH/NCCD dataset comprising 1097 CT images across normal, benign, and malignant categories with statistical validation across 10 independent runs, we evaluated both approaches through quantitative image quality metrics (Fréchet Inception Distance, Kullback-Leibler divergence, Kernel Inception Distance, and Inception Score) and downstream classification performance. While Diffusion models consistently outperformed DCGANs across most image quality measures, the clinical significance was confirmed through task-based validation. Both generative approaches successfully addressed class imbalance: DCGAN-augmented datasets achieved overall accuracy of 0.9760 ± 0.0116 with benign recall improvement from 0.833 to 0.933, while Diffusion-augmented datasets reached superior performance of 0.9959 ± 0.0068 with perfect benign recall (1.000 ± 0.000). Critically for cancer screening where false negatives carry severe consequences, Diffusion maintained the highest malignant detection sensitivity (0.997 ± 0.008) with substantially lower performance variance, demonstrating more consistent synthetic data quality. These findings establish that while both modern architectures can mitigate class imbalance, Diffusion models' superior recall performance and lower variability position them as the preferred approach for high-stakes clinical applications, demonstrating that ultimate validation must prioritize downstream clinical task performance over image quality metrics alone.
CT扫描中肺癌的有效检测仍然受到类别不平衡的严重挑战,其中良性和正常病例的代表性不足,导致机器学习模型对少数类别的敏感性降低,并可能在癌症筛查应用中遗漏诊断。我们对扩散模型和深度卷积生成对抗网络(dcgan)进行了全面的比较分析,两者都采用了现代架构增强,包括谱归一化、自注意机制和条件生成,以解决肺癌CT分类中的类别不平衡问题。使用IQ-OTH/NCCD数据集,包括1097张CT图像,包括正常、良性和恶性类别,并在10次独立运行中进行统计验证,我们通过定量图像质量指标(fr起始距离、Kullback-Leibler散度、核起始距离和起始分数)和下游分类性能来评估这两种方法。虽然扩散模型在大多数图像质量测量中始终优于dcgan,但通过基于任务的验证证实了临床意义。两种生成方法都成功地解决了类不平衡问题:dcgan增强数据集的总体准确率为0.9760±0.0116,良性召回率从0.833提高到0.933,而扩散增强数据集的总体准确率为0.9959±0.0068,良性召回率为1.000±0.000。对于假阴性会带来严重后果的癌症筛查来说,Diffusion保持了最高的恶性检测灵敏度(0.997±0.008),性能差异显著降低,显示出更一致的合成数据质量。这些发现表明,虽然这两种现代架构都可以缓解类失衡,但扩散模型优越的召回性能和较低的可变性使其成为高风险临床应用的首选方法,这表明最终验证必须优先考虑下游临床任务性能,而不是单独的图像质量指标。
{"title":"Diffusion models vs. DCGANs for class-imbalanced lung cancer CT classification: A comparative study","authors":"Masoud Tabibian,&nbsp;Tahereh Razmpour,&nbsp;Rajib Saha","doi":"10.1016/j.ibmed.2025.100336","DOIUrl":"10.1016/j.ibmed.2025.100336","url":null,"abstract":"<div><div>Effective lung cancer detection from CT scans remains critically challenged by class imbalance where benign and normal cases are underrepresented, leading to biased machine learning models with reduced sensitivity for minority classes and potentially missed diagnoses in cancer screening applications. We present a comprehensive comparative analysis of Diffusion Models and Deep Convolutional Generative Adversarial Networks (DCGANs), both incorporating modern architectural enhancements including spectral normalization, self-attention mechanisms, and conditional generation, for addressing class imbalance in lung cancer CT classification. Using the IQ-OTH/NCCD dataset comprising 1097 CT images across normal, benign, and malignant categories with statistical validation across 10 independent runs, we evaluated both approaches through quantitative image quality metrics (Fréchet Inception Distance, Kullback-Leibler divergence, Kernel Inception Distance, and Inception Score) and downstream classification performance. While Diffusion models consistently outperformed DCGANs across most image quality measures, the clinical significance was confirmed through task-based validation. Both generative approaches successfully addressed class imbalance: DCGAN-augmented datasets achieved overall accuracy of 0.9760 ± 0.0116 with benign recall improvement from 0.833 to 0.933, while Diffusion-augmented datasets reached superior performance of 0.9959 ± 0.0068 with perfect benign recall (1.000 ± 0.000). Critically for cancer screening where false negatives carry severe consequences, Diffusion maintained the highest malignant detection sensitivity (0.997 ± 0.008) with substantially lower performance variance, demonstrating more consistent synthetic data quality. These findings establish that while both modern architectures can mitigate class imbalance, Diffusion models' superior recall performance and lower variability position them as the preferred approach for high-stakes clinical applications, demonstrating that ultimate validation must prioritize downstream clinical task performance over image quality metrics alone.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"13 ","pages":"Article 100336"},"PeriodicalIF":0.0,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cardiac Magnetic Resonance-to-Computed Tomography Angiography image conversion using diffusion models for Transcatheter Aortic Valve Implantation planning 利用扩散模型对经导管主动脉瓣植入计划进行心脏磁共振-计算机断层血管造影图像转换
Pub Date : 2025-12-19 DOI: 10.1016/j.ibmed.2025.100335
Carmen Guadalupe Colin-Tenorio , Agnes Mayr , Christian Kremser , Markus Haltmeier , Enrique Almar-Munoz

Introduction:

Transcatheter Aortic Valve Implantation (TAVI) has become the preferred method for treating severe aortic stenosis, especially in patients who are unsuitable for traditional surgery. Typically, preoperative imaging for TAVI involves contrast-enhanced Computed Tomography Angiography (CTA). However, for patients with contraindications to contrast agents, Cardiac Magnetic Resonance imaging (CMR) is a viable alternative, albeit with its limitations in visualizing calcifications.

Methods:

This study explores the application of diffusion models to enhance CMR-to-CTA contrast-free image conversion, to avoid the use of contrast agents and ionizing radiation. We developed a pipeline incorporating Denoising Diffusion Probabilistic Models (DDPMs) and Stochastic Differential Equations (SDE) models to synthesize CTA-equivalent images from CMR scans. We evaluated this approach using an in-house dataset consisting of 39 paired CTA and CMR scans. For the training process, coregistration of both modalities was required, which we achieved by performing rigid registration using the segmented aorta masks.

Results:

Our results show that the synthesized CTA images maintain high fidelity to the actual scans. This is quantitatively supported by a mean Structural Similarity Index Measure (SSIM) of 0.8 and a Peak Signal-to-Noise Ratio (PSNR) of 22 dB using conditional Stochastic Differential Equations (SDE) and Prediction-Correction (PC), indicating strong structural preservation and low reconstruction error. However, the model occasionally fails to accurately detect valve calcifications, likely due to limitations in capturing subtle pathological details that are not visually discernible in CMR images.

Conclusion:

Diffusion models used to synthesize CTA images from CMR datasets achieve high accuracy, providing a contrast-free alternative for TAVI planning and potential insights into valvular calcification patterns. However, accurate visualization of valve calcification occasionally fails, and larger datasets are desirable for validation.
导论:经导管主动脉瓣植入术(Transcatheter Aortic Valve Implantation, TAVI)已成为治疗严重主动脉瓣狭窄的首选方法,特别是对于不适合传统手术治疗的患者。通常情况下,TAVI的术前成像包括对比增强计算机断层血管造影(CTA)。然而,对于有造影剂禁忌症的患者,心脏磁共振成像(CMR)是一种可行的替代方案,尽管在可视化钙化方面存在局限性。方法:本研究探索应用扩散模型增强cmr - cta无对比图像转换,避免使用造影剂和电离辐射。我们开发了一个结合去噪扩散概率模型(ddpm)和随机微分方程(SDE)模型的管道,以合成CMR扫描的cta等效图像。我们使用由39对CTA和CMR扫描组成的内部数据集来评估这种方法。对于训练过程,需要两种模式的共同注册,我们通过使用分段主动脉面罩进行刚性注册来实现。结果:合成的CTA图像与实际扫描保持了较高的保真度。基于条件随机微分方程(SDE)和预测校正(PC)的平均结构相似指数(SSIM)为0.8,峰值信噪比(PSNR)为22 dB,这在定量上支持了这一结论,表明结构保存性强,重建误差低。然而,该模型偶尔不能准确地检测到瓣膜钙化,这可能是由于在捕捉CMR图像中无法视觉识别的细微病理细节方面的限制。结论:用于从CMR数据集合成CTA图像的扩散模型具有较高的准确性,为TAVI规划提供了无对比度的替代方案,并可能深入了解瓣膜钙化模式。然而,瓣膜钙化的精确可视化有时会失败,需要更大的数据集进行验证。
{"title":"Cardiac Magnetic Resonance-to-Computed Tomography Angiography image conversion using diffusion models for Transcatheter Aortic Valve Implantation planning","authors":"Carmen Guadalupe Colin-Tenorio ,&nbsp;Agnes Mayr ,&nbsp;Christian Kremser ,&nbsp;Markus Haltmeier ,&nbsp;Enrique Almar-Munoz","doi":"10.1016/j.ibmed.2025.100335","DOIUrl":"10.1016/j.ibmed.2025.100335","url":null,"abstract":"<div><h3>Introduction:</h3><div>Transcatheter Aortic Valve Implantation (TAVI) has become the preferred method for treating severe aortic stenosis, especially in patients who are unsuitable for traditional surgery. Typically, preoperative imaging for TAVI involves contrast-enhanced Computed Tomography Angiography (CTA). However, for patients with contraindications to contrast agents, Cardiac Magnetic Resonance imaging (CMR) is a viable alternative, albeit with its limitations in visualizing calcifications.</div></div><div><h3>Methods:</h3><div>This study explores the application of diffusion models to enhance CMR-to-CTA contrast-free image conversion, to avoid the use of contrast agents and ionizing radiation. We developed a pipeline incorporating Denoising Diffusion Probabilistic Models (DDPMs) and Stochastic Differential Equations (SDE) models to synthesize CTA-equivalent images from CMR scans. We evaluated this approach using an in-house dataset consisting of 39 paired CTA and CMR scans. For the training process, coregistration of both modalities was required, which we achieved by performing rigid registration using the segmented aorta masks.</div></div><div><h3>Results:</h3><div>Our results show that the synthesized CTA images maintain high fidelity to the actual scans. This is quantitatively supported by a mean Structural Similarity Index Measure (SSIM) of 0.8 and a Peak Signal-to-Noise Ratio (PSNR) of 22<!--> <!-->dB using conditional Stochastic Differential Equations (SDE) and Prediction-Correction (PC), indicating strong structural preservation and low reconstruction error. However, the model occasionally fails to accurately detect valve calcifications, likely due to limitations in capturing subtle pathological details that are not visually discernible in CMR images.</div></div><div><h3>Conclusion:</h3><div>Diffusion models used to synthesize CTA images from CMR datasets achieve high accuracy, providing a contrast-free alternative for TAVI planning and potential insights into valvular calcification patterns. However, accurate visualization of valve calcification occasionally fails, and larger datasets are desirable for validation.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"13 ","pages":"Article 100335"},"PeriodicalIF":0.0,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated detection of HER2 gene copy number in breast cancer using deep learning techniques 利用深度学习技术自动检测乳腺癌中HER2基因拷贝数
Pub Date : 2025-12-18 DOI: 10.1016/j.ibmed.2025.100333
Terisara Micaraseth , Shanop Shuangshoti , Sakdina Prommaouan , Somruetai Shuangshoti , Rizwan Ullah , Gridsada Phanomchoeng
Accurate evaluation of HER2 gene amplification is critical for guiding breast cancer treatment decisions. This study proposes a deep learning-based diagnostic system for analyzing Dual In Situ Hybridization (DISH) images to support HER2 status assessment. The system integrates two models— YOLOv11-seg for cell detection and YOLOv11 object detection models for HER2 and CEP17 signal quantification—into a unified pipeline. High-resolution whole-slide images were preprocessed and annotated to train the models, which were then embedded into a standalone application designed for clinical environments. Upon uploading TIFF format images, the application performs automated cell detection, red/black signal analysis, and HER2/CEP17 ratio computation. Experimental results demonstrated an accuracy 95.24 % for the best identification and mean deviations of 6.08 % (CEP17) and 12.78 % (HER2) compared with manual counting. Statistical analyses confirm high consistency, particularly in red signal detection. Clinical feedback under scores the system's ease of use, accuracy, and potential to reduce diagnostic burden. The proposed approach demonstrates strong feasibility for routine adoption in pathology workflows.
准确评估HER2基因扩增对指导乳腺癌治疗决策至关重要。本研究提出了一种基于深度学习的诊断系统,用于分析双原位杂交(DISH)图像,以支持HER2状态评估。该系统将用于细胞检测的YOLOv11-seg和用于HER2和CEP17信号量化的YOLOv11目标检测模型集成到一个统一的管道中。对高分辨率的整张幻灯片图像进行预处理和注释以训练模型,然后将其嵌入为临床环境设计的独立应用程序中。在上传TIFF格式的图像后,应用程序执行自动细胞检测,红/黑信号分析和HER2/CEP17比率计算。实验结果表明,与人工计数相比,该方法的最佳识别准确率为95.24%,平均偏差为6.08% (CEP17)和12.78% (HER2)。统计分析证实了高一致性,特别是在红色信号检测方面。临床反馈评分系统的易用性、准确性和减少诊断负担的潜力。所提出的方法证明了在病理工作流程中常规采用的强大可行性。
{"title":"Automated detection of HER2 gene copy number in breast cancer using deep learning techniques","authors":"Terisara Micaraseth ,&nbsp;Shanop Shuangshoti ,&nbsp;Sakdina Prommaouan ,&nbsp;Somruetai Shuangshoti ,&nbsp;Rizwan Ullah ,&nbsp;Gridsada Phanomchoeng","doi":"10.1016/j.ibmed.2025.100333","DOIUrl":"10.1016/j.ibmed.2025.100333","url":null,"abstract":"<div><div>Accurate evaluation of HER2 gene amplification is critical for guiding breast cancer treatment decisions. This study proposes a deep learning-based diagnostic system for analyzing Dual In Situ Hybridization (DISH) images to support HER2 status assessment. The system integrates two models— YOLOv11-seg for cell detection and YOLOv11 object detection models for HER2 and CEP17 signal quantification—into a unified pipeline. High-resolution whole-slide images were preprocessed and annotated to train the models, which were then embedded into a standalone application designed for clinical environments. Upon uploading TIFF format images, the application performs automated cell detection, red/black signal analysis, and HER2/CEP17 ratio computation. Experimental results demonstrated an accuracy 95.24 % for the best identification and mean deviations of 6.08 % (CEP17) and 12.78 % (HER2) compared with manual counting. Statistical analyses confirm high consistency, particularly in red signal detection. Clinical feedback under scores the system's ease of use, accuracy, and potential to reduce diagnostic burden. The proposed approach demonstrates strong feasibility for routine adoption in pathology workflows.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"13 ","pages":"Article 100333"},"PeriodicalIF":0.0,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimization framework for overcoming tyrosine kinase inhibitor resistance: Multi-objective selection, scheduling, and adaptive therapy 克服酪氨酸激酶抑制剂耐药性的优化框架:多目标选择、调度和适应性治疗
Pub Date : 2025-12-18 DOI: 10.1016/j.ibmed.2025.100332
Mohanad A. Deif , Mohamed A. Hafez , Mohammad Khishe
Tyrosine kinase inhibitors are key drugs in targeted cancer therapy but often fail when resistance emerges. Many predictive methods focus on accuracy alone, while calibration and kinase selectivity, which matter for clinical use, receive less attention. We present a single framework that treats resistance prediction and dosing decisions as a three-objective problem: minimize misclassification, reduce calibration error, and increase selectivity. Using calibrated probabilities and tuned thresholds, baseline models improved in ROC–AUC and expected calibration error across stratified, scaffold, and mutation-cold splits. Pareto analysis with hypervolume and coverage showed that including selectivity changes the relative ranking of inhibitors and exposes trade-offs that accuracy alone cannot capture. On the treatment side, we compared continuous dosing, hysteresis switching, and adaptive model predictive control in a two-compartment tumor model. Adaptive control lowered total dose by about 18% and extended simulated survival by more than 25 weeks. These results provide a clear proof of concept that combining machine learning, multi-objective optimization, and adaptive therapy can improve prediction quality and guide personalized dosing to better manage resistance.
酪氨酸激酶抑制剂是靶向癌症治疗的关键药物,但往往在出现耐药性时失效。许多预测方法只关注准确性,而校准和激酶选择性对临床使用很重要,却很少受到关注。我们提出了一个单一的框架,将耐药性预测和给药决策视为一个三目标问题:最大限度地减少错误分类,减少校准误差,增加选择性。使用校准概率和调整阈值,基线模型改进了ROC-AUC和分层、支架和突变冷分裂的预期校准误差。使用超容量和覆盖的帕累托分析表明,包括选择性改变了抑制剂的相对排名,并暴露了仅靠准确性无法捕获的权衡。在治疗方面,我们比较了两室肿瘤模型中的连续给药、迟滞开关和自适应模型预测控制。适应性控制将总剂量降低了18%,并将模拟生存期延长了25周以上。这些结果提供了一个清晰的概念证明,结合机器学习、多目标优化和自适应治疗可以提高预测质量,并指导个性化给药,以更好地管理耐药性。
{"title":"Optimization framework for overcoming tyrosine kinase inhibitor resistance: Multi-objective selection, scheduling, and adaptive therapy","authors":"Mohanad A. Deif ,&nbsp;Mohamed A. Hafez ,&nbsp;Mohammad Khishe","doi":"10.1016/j.ibmed.2025.100332","DOIUrl":"10.1016/j.ibmed.2025.100332","url":null,"abstract":"<div><div>Tyrosine kinase inhibitors are key drugs in targeted cancer therapy but often fail when resistance emerges. Many predictive methods focus on accuracy alone, while calibration and kinase selectivity, which matter for clinical use, receive less attention. We present a single framework that treats resistance prediction and dosing decisions as a three-objective problem: minimize misclassification, reduce calibration error, and increase selectivity. Using calibrated probabilities and tuned thresholds, baseline models improved in ROC–AUC and expected calibration error across stratified, scaffold, and mutation-cold splits. Pareto analysis with hypervolume and coverage showed that including selectivity changes the relative ranking of inhibitors and exposes trade-offs that accuracy alone cannot capture. On the treatment side, we compared continuous dosing, hysteresis switching, and adaptive model predictive control in a two-compartment tumor model. Adaptive control lowered total dose by about 18% and extended simulated survival by more than 25 weeks. These results provide a clear proof of concept that combining machine learning, multi-objective optimization, and adaptive therapy can improve prediction quality and guide personalized dosing to better manage resistance.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"13 ","pages":"Article 100332"},"PeriodicalIF":0.0,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrasting deep learning audio models for direct respiratory insufficiency detection versus blood oxygen saturation estimation 对比深度学习音频模型用于直接呼吸功能不全检测与血氧饱和度估计
Pub Date : 2025-12-12 DOI: 10.1016/j.ibmed.2025.100331
Marcelo Matheus Gauy , Natália Hitomi Koza , Ricardo Mikio Morita , Gabriel Rocha Stanzione , Arnaldo Cândido Júnior , Larissa Cristina Berti , Anna Sara Shafferman Levin , Ester Cerdeira Sabino , Flaviane Romani Fernandes Svartman , Marcelo Finger
This work aims to investigate the strengths and limitations of non-invasive audio-based deep learning methods for the detection of respiratory conditions. We contrast the performance obtained in tasks such as the expert-centered respiratory insufficiency (RI) detection with easily measured blood oxygen saturation (SpO2) estimation. Several deep learning audio models have been recently proposed for RI detection via voice and speech analysis; these models have obtained an accuracy of 95% in general patients and 97.4% in COVID-19 patients. Here, we extend those results, refining several pretrained audio neural networks (CNN6, CNN10 and CNN14) and Masked Autoencoders (Audio-MAE) for RI detection, showing that some of these models achieve near perfect accuracy (99.9% on COVID RI and 98.6% on general RI). The models were pretrained on AudioSet resulting in improved performance, with transfer learning playing a key role in the prevention of overfitting. The near-perfect RI detection performance suggests that low-cost and automated methods could be developed for assisting patient triage. In parallel, this paper seeks to verify SpO2 estimation feasibility, so we perform a 92% SpO2-threshold binary classification using the same architectures. In contrast to our findings for RI, this model yielded an accuracy below 70% and MCC-correlation below 0.3, indicating both that SpO2 estimation solely from audio is unfeasible and the presence of multiple features in the audios which are useful for RI detection, but not for SpO2 estimation. We propose that this discrepancy demonstrates the limits of voice and speech biomarkers across different diagnostic tasks under current technologies.
这项工作旨在研究非侵入性基于音频的深度学习方法用于检测呼吸系统疾病的优势和局限性。我们将以专家为中心的呼吸功能不全(RI)检测与易于测量的血氧饱和度(SpO2)估计等任务中的性能进行了对比。最近提出了几个深度学习音频模型,用于通过语音和语音分析进行RI检测;这些模型在普通患者中的准确率为95%,在COVID-19患者中的准确率为97.4%。在这里,我们扩展了这些结果,改进了几个预训练的音频神经网络(CNN6, CNN10和CNN14)和掩码自动编码器(audio - mae)用于RI检测,表明其中一些模型达到了近乎完美的精度(COVID RI为99.9%,普通RI为98.6%)。在AudioSet上对模型进行预训练,从而提高了性能,迁移学习在防止过拟合方面发挥了关键作用。近乎完美的RI检测性能表明,可以开发低成本和自动化的方法来协助患者分诊。同时,本文试图验证SpO2估计的可行性,因此我们使用相同的架构执行92%的SpO2阈值二值分类。与我们的研究结果相比,该模型的RI精度低于70%,mcc相关性低于0.3,这表明仅从音频中估计SpO2是不可实现的,并且音频中存在多个特征,这些特征对RI检测有用,但对SpO2估计无效。我们认为,这种差异表明在当前技术下,语音和语音生物标志物在不同诊断任务中的局限性。
{"title":"Contrasting deep learning audio models for direct respiratory insufficiency detection versus blood oxygen saturation estimation","authors":"Marcelo Matheus Gauy ,&nbsp;Natália Hitomi Koza ,&nbsp;Ricardo Mikio Morita ,&nbsp;Gabriel Rocha Stanzione ,&nbsp;Arnaldo Cândido Júnior ,&nbsp;Larissa Cristina Berti ,&nbsp;Anna Sara Shafferman Levin ,&nbsp;Ester Cerdeira Sabino ,&nbsp;Flaviane Romani Fernandes Svartman ,&nbsp;Marcelo Finger","doi":"10.1016/j.ibmed.2025.100331","DOIUrl":"10.1016/j.ibmed.2025.100331","url":null,"abstract":"<div><div>This work aims to investigate the strengths and limitations of non-invasive audio-based deep learning methods for the detection of respiratory conditions. We contrast the performance obtained in tasks such as the expert-centered respiratory insufficiency (RI) detection with easily measured blood oxygen saturation (SpO2) estimation. Several deep learning audio models have been recently proposed for RI detection via voice and speech analysis; these models have obtained an accuracy of 95% in general patients and 97.4% in COVID-19 patients. Here, we extend those results, refining several pretrained audio neural networks (CNN6, CNN10 and CNN14) and Masked Autoencoders (Audio-MAE) for RI detection, showing that some of these models achieve near perfect accuracy (99.9% on COVID RI and 98.6% on general RI). The models were pretrained on AudioSet resulting in improved performance, with transfer learning playing a key role in the prevention of overfitting. The near-perfect RI detection performance suggests that low-cost and automated methods could be developed for assisting patient triage. In parallel, this paper seeks to verify SpO2 estimation feasibility, so we perform a 92% SpO2-threshold binary classification using the same architectures. In contrast to our findings for RI, this model yielded an accuracy below 70% and MCC-correlation below 0.3, indicating both that SpO2 estimation solely from audio is unfeasible and the presence of multiple features in the audios which are useful for RI detection, but not for SpO2 estimation. We propose that this discrepancy demonstrates the limits of voice and speech biomarkers across different diagnostic tasks under current technologies.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"13 ","pages":"Article 100331"},"PeriodicalIF":0.0,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145738925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid spatiotemporal feature fusion for robust lesion detection and tracking in breast ultrasound video data 基于混合时空特征融合的乳腺超声视频数据鲁棒病灶检测与跟踪
Pub Date : 2025-12-10 DOI: 10.1016/j.ibmed.2025.100330
Radwan Qasrawi , Suliman Thwib , Ghada Issa , Razan AbuGhoush , Hussein AlMasri , Marah Qawasmi , Nael Abu Halaweh

Background

Speckle noise, tissue deformation, low contrast, and frame inconsistencies limit the reliability of traditional breast lesion tracking approaches in ultrasound videos.

Objective

This study aims to develop a robust hybrid framework that integrates advanced image enhancement, deep learning-based detection, and spatiotemporal feature fusion for improved lesion detection and tracking in breast ultrasound video sequences.

Methods

We propose a two-phase computational framework. The first phase employs Contrast-Limited Adaptive Histogram Equalization (CLAHE) for local contrast enhancement, followed by a hybrid denoising strategy combining anisotropic diffusion and unsharp masking to suppress noise and preserve edge sharpness. In the second phase, lesion detection is performed using a YOLOv11-L model, fine-tuned on a curated dataset of annotated breast ultrasound images. For tracking, we utilize Kernelized Correlation Filtering (KCF) enhanced with a Hybrid Spatiotemporal Context (STC) representation. The system is evaluated on a dataset comprising 11,382 ultrasound images and 40 video sequences, with performance assessed using Intersection over Union (IoU), success rate, failure rate, and processing speed.

Results

The proposed framework achieved an IoU of 0.878 for benign lesions and 0.881 for malignant lesions. The integration of STC features and YOLO detection reduced tracking failure rates by over 25 % and improved success rates to 99.0 % for benign and 99.4 % for malignant lesions. The system processed 41–45 frames per second in real time.

Conclusions

Our framework provides an effective solution for real-time lesion detection and tracking in breast ultrasound videos. By enhancing both accuracy and reliability, it supports improved clinical decision-making in breast cancer diagnostics.
斑点噪声、组织变形、低对比度和帧不一致限制了超声视频中传统乳腺病变跟踪方法的可靠性。本研究旨在开发一个强大的混合框架,将先进的图像增强、基于深度学习的检测和时空特征融合相结合,以改进乳腺超声视频序列的病变检测和跟踪。方法提出了一种两阶段计算框架。第一阶段采用对比度限制自适应直方图均衡化(CLAHE)进行局部对比度增强,然后采用各向异性扩散和非锐利掩蔽相结合的混合降噪策略来抑制噪声并保持边缘清晰度。在第二阶段,使用YOLOv11-L模型进行病变检测,并在精心设计的带注释的乳腺超声图像数据集上进行微调。为了跟踪,我们使用了混合时空上下文(STC)表示增强的核化相关滤波(KCF)。该系统在包含11,382张超声图像和40个视频序列的数据集上进行了评估,并使用交汇交汇(IoU)、成功率、故障率和处理速度对性能进行了评估。结果该框架良性病变IoU为0.878,恶性病变IoU为0.881。STC特征与YOLO检测的结合使跟踪失败率降低了25%以上,良性病变的成功率提高到99.0%,恶性病变的成功率提高到99.4%。系统实时处理41-45帧/秒。结论sour框架为乳腺超声视频中病灶的实时检测和跟踪提供了有效的解决方案。通过提高准确性和可靠性,它支持改善乳腺癌诊断的临床决策。
{"title":"Hybrid spatiotemporal feature fusion for robust lesion detection and tracking in breast ultrasound video data","authors":"Radwan Qasrawi ,&nbsp;Suliman Thwib ,&nbsp;Ghada Issa ,&nbsp;Razan AbuGhoush ,&nbsp;Hussein AlMasri ,&nbsp;Marah Qawasmi ,&nbsp;Nael Abu Halaweh","doi":"10.1016/j.ibmed.2025.100330","DOIUrl":"10.1016/j.ibmed.2025.100330","url":null,"abstract":"<div><h3>Background</h3><div>Speckle noise, tissue deformation, low contrast, and frame inconsistencies limit the reliability of traditional breast lesion tracking approaches in ultrasound videos.</div></div><div><h3>Objective</h3><div>This study aims to develop a robust hybrid framework that integrates advanced image enhancement, deep learning-based detection, and spatiotemporal feature fusion for improved lesion detection and tracking in breast ultrasound video sequences.</div></div><div><h3>Methods</h3><div>We propose a two-phase computational framework. The first phase employs Contrast-Limited Adaptive Histogram Equalization (CLAHE) for local contrast enhancement, followed by a hybrid denoising strategy combining anisotropic diffusion and unsharp masking to suppress noise and preserve edge sharpness. In the second phase, lesion detection is performed using a YOLOv11-L model, fine-tuned on a curated dataset of annotated breast ultrasound images. For tracking, we utilize Kernelized Correlation Filtering (KCF) enhanced with a Hybrid Spatiotemporal Context (STC) representation. The system is evaluated on a dataset comprising 11,382 ultrasound images and 40 video sequences, with performance assessed using Intersection over Union (IoU), success rate, failure rate, and processing speed.</div></div><div><h3>Results</h3><div>The proposed framework achieved an IoU of 0.878 for benign lesions and 0.881 for malignant lesions. The integration of STC features and YOLO detection reduced tracking failure rates by over 25 % and improved success rates to 99.0 % for benign and 99.4 % for malignant lesions. The system processed 41–45 frames per second in real time.</div></div><div><h3>Conclusions</h3><div>Our framework provides an effective solution for real-time lesion detection and tracking in breast ultrasound videos. By enhancing both accuracy and reliability, it supports improved clinical decision-making in breast cancer diagnostics.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"13 ","pages":"Article 100330"},"PeriodicalIF":0.0,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145791919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Intelligence-based medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1