首页 > 最新文献

Biomedical Physics & Engineering Express最新文献

英文 中文
Exploration and performance analysis of deep learning applications in spermatic vein ultrasound segmentation. 深度学习在精静脉超声分割中的应用探讨及性能分析。
IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-03-18 DOI: 10.1088/2057-1976/ae4eed
Yunhao Hu, Penglin Zou, Rongguo Yan, Xiyun Zeng, Qi Wang

Background.Varicocele is a common cause of male infertility, with ultrasound (US) serving as the primary diagnostic tool. Current practice relies on manual, subjective measurements of the spermatic vein, which are time-consuming and lack reproducibility. Developing automated tools is hindered by scarce annotated data and intrinsic US challenges like low contrast and high noise.Obejectives.This study aimed to: (1) develop and validate an efficient semi-automated annotation workflow; (2) establish the first performance benchmark for automated spermatic vein segmentation using deep learning; (3) critically evaluate the efficacy of state-of-the-art and customised segmentation models for this specific task.Methods.We proposed a semi-automated pipeline using the Segment Anything Model (SAM) with clinician refinement. Using the resulting dataset, we conducted a comprehensive benchmark, evaluating a baseline U-Net, advanced models (U-Net++, Attention U-Net, and RPA-UNet), and a proposed U-Net with deep supervision (UNet-DS). All models were assessed via leave-one-patient-out cross-validation and statistical tests.Results.The 'SAM+clinician' workflow showed excellent agreement with expert annotation (Dice Similarity Coefficient(DSC) = 92.66%; Kappa = 91.92%). In segmentation, the baseline U-Net achieved a mean DSC of 61.33%. Only Attention U-Net showed a statistically significant improvement (p= 0.0391). UNet-DS attained the mean DSC (64.65%) but this was not statistically significant (p= 0.0781). All models plateaued in a narrow range (DSC: 61%-65%), far below performance in mature US segmentation domains.Conclusion.This work validates an efficient semi-automated annotation solution and establishes the first performance benchmark for this task. Results reveal a distinct performance ceiling, indicating the primary barrier is the inherent data limitations, not model architecture. Future breakthroughs require a shift towards bespoke, physics-informed algorithms rather than applying generic deep learning models.

背景:精索静脉曲张是男性不育的常见原因,超声(US)是主要的诊断工具。目前的做法依赖于人工的、主观的精索静脉测量,这既耗时又缺乏可重复性。开发自动化工具受到缺乏注释数据和美国固有挑战(如低对比度和高噪声)的阻碍。目的:本研究旨在:(1)开发并验证一种高效的半自动化标注工作流程;(2)建立首个基于深度学习的精索静脉自动分割性能基准;(3)批判性地评估最先进的和定制的细分模型对这一特定任务的功效。方法:我们提出了一种半自动化的管道,使用分段任何模型(SAM),并经过临床医生的改进。利用得到的数据集,我们进行了全面的基准测试,评估了基线U-Net、高级模型(U-Net++、注意力U-Net和RPA-UNet)和具有深度监督的拟议U-Net (UNet-DS)。所有模型均通过留一患者的交叉验证和统计检验进行评估。结果:“SAM+临床医生”工作流程与专家注释具有很好的一致性(Dice Similarity Coefficient(DSC) = 92.66%;Kappa = 91.92%)。在分割中,基线U-Net的平均DSC为61.33%。只有注意U-Net有统计学上显著的改善(p = 0.0391)。UNet-DS达到平均DSC(64.65%),但无统计学意义(p = 0.0781)。所有模型都在一个狭窄的范围内趋于稳定(DSC: 61%-65%),远低于美国成熟细分领域的表现。结论:这项工作验证了一个高效的半自动注释解决方案,并为该任务建立了第一个性能基准。结果显示了明显的性能上限,表明主要障碍是固有的数据限制,而不是模型体系结构。未来的突破需要向定制的、基于物理的算法转变,而不是应用通用的深度学习模型。
{"title":"Exploration and performance analysis of deep learning applications in spermatic vein ultrasound segmentation.","authors":"Yunhao Hu, Penglin Zou, Rongguo Yan, Xiyun Zeng, Qi Wang","doi":"10.1088/2057-1976/ae4eed","DOIUrl":"10.1088/2057-1976/ae4eed","url":null,"abstract":"<p><p><i>Background.</i>Varicocele is a common cause of male infertility, with ultrasound (US) serving as the primary diagnostic tool. Current practice relies on manual, subjective measurements of the spermatic vein, which are time-consuming and lack reproducibility. Developing automated tools is hindered by scarce annotated data and intrinsic US challenges like low contrast and high noise.<i>Obejectives.</i>This study aimed to: (1) develop and validate an efficient semi-automated annotation workflow; (2) establish the first performance benchmark for automated spermatic vein segmentation using deep learning; (3) critically evaluate the efficacy of state-of-the-art and customised segmentation models for this specific task.<i>Methods.</i>We proposed a semi-automated pipeline using the Segment Anything Model (SAM) with clinician refinement. Using the resulting dataset, we conducted a comprehensive benchmark, evaluating a baseline U-Net, advanced models (U-Net++, Attention U-Net, and RPA-UNet), and a proposed U-Net with deep supervision (UNet-DS). All models were assessed via leave-one-patient-out cross-validation and statistical tests.<i>Results.</i>The 'SAM+clinician' workflow showed excellent agreement with expert annotation (Dice Similarity Coefficient(DSC) = 92.66%; Kappa = 91.92%). In segmentation, the baseline U-Net achieved a mean DSC of 61.33%. Only Attention U-Net showed a statistically significant improvement (<i>p</i>= 0.0391). UNet-DS attained the mean DSC (64.65%) but this was not statistically significant (<i>p</i>= 0.0781). All models plateaued in a narrow range (DSC: 61%-65%), far below performance in mature US segmentation domains.<i>Conclusion.</i>This work validates an efficient semi-automated annotation solution and establishes the first performance benchmark for this task. Results reveal a distinct performance ceiling, indicating the primary barrier is the inherent data limitations, not model architecture. Future breakthroughs require a shift towards bespoke, physics-informed algorithms rather than applying generic deep learning models.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2026-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147389073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NeuroCardioSense (NCS): a time-aware fuzzy decision framework for multi-lead ECG classification and arrhythmia detection. NeuroCardioSense (NCS):多导联心电图分类和心律失常检测的时间感知模糊决策框架。
IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-03-18 DOI: 10.1088/2057-1976/ae4df5
Cheng-Hai He, Xiao-Li Wang, Ying Feng, Qi Feng

Accurate classification of electrocardiogram (ECG) signals is essential for automated arrhythmia detection and clinical decision support. Existing deep learning methods still struggle to jointly characterize morphological patterns, multi-lead interactions, and temporal dependencies, leading to limited representation of waveform details, rhythm dynamics, and class boundary separability. To address these challenges, we propose a NeuroCardioSense (NCS) framework, built upon a convolutional neural network (CNN) backbone, comprising a base model, NeuroCardioSenseNet (NCSN), and an enhanced variant,NeuroCardioSenseNet-Fusion (NCSNF). NCSN constructs a novel Time-Aware Gated Convolution (TAG-Conv) layer together with a Time-Aware Gating Mechanism (TAGM), which adaptively modulate convolutional filters and cross-lead feature contributions based on local temporal context and channel energy distributions. This design enables joint modeling of morphological features and short-range temporal dynamics while reinforcing inter-lead coherence. Building upon NCSN, NCSNF incorporates a Time-Fuzzy Integration Module (TFIM) that constructs a learnable fuzzy subspace by jointly encoding features and membership degrees, effectively mitigating class boundary ambiguity and improving discriminability in limited-sample conditions. Extensive experiments on the MIT-BIH Arrhythmia Database demonstrate the superiority of the NCS framework. NCSN achieves 98.77% intra-patient and 87.82% inter-patient accuracy, while NCSNF further improves performance to 99.16% and 90.85%, respectively, outperforming existing baseline methods.

心电图信号的准确分类对于心律失常的自动检测和临床决策支持至关重要。现有的深度学习方法仍然难以共同表征形态模式,多导联相互作用和时间依赖性,导致波形细节,节奏动力学和类边界可分离性的有限表示。为了解决这些挑战,我们提出了一个基于卷积神经网络(CNN)主干的NeuroCardioSense (NCS)框架,包括一个基本模型,NeuroCardioSenseNet (NCSN)和一个增强的变体。NeuroCardioSenseNet-Fusion (NCSNF)。NCSN构建了一种新颖的时间感知门控卷积(TAG-Conv)层和时间感知门控机制(TAGM),该层基于局部时间背景和通道能量分布自适应调制卷积滤波器和交叉导联特征贡献。这种设计使形态特征和短期时间动态的联合建模成为可能,同时加强了导联间的一致性。在NCSN的基础上,NCSNF引入了Time-Fuzzy Integration Module (TFIM),通过对特征和隶属度进行联合编码,构建了一个可学习的模糊子空间,有效地缓解了类的边界模糊性,提高了有限样本条件下的可判别性。在MIT-BIH心律失常数据库上的大量实验证明了NCS框架的优越性。NCSN达到了98.77%的患者内准确率和87.82%的患者间准确率,而NCSNF进一步提高了性能,分别达到99.16%和90.85%,优于现有的基线方法。
{"title":"NeuroCardioSense (NCS): a time-aware fuzzy decision framework for multi-lead ECG classification and arrhythmia detection.","authors":"Cheng-Hai He, Xiao-Li Wang, Ying Feng, Qi Feng","doi":"10.1088/2057-1976/ae4df5","DOIUrl":"10.1088/2057-1976/ae4df5","url":null,"abstract":"<p><p>Accurate classification of electrocardiogram (ECG) signals is essential for automated arrhythmia detection and clinical decision support. Existing deep learning methods still struggle to jointly characterize morphological patterns, multi-lead interactions, and temporal dependencies, leading to limited representation of waveform details, rhythm dynamics, and class boundary separability. To address these challenges, we propose a NeuroCardioSense (NCS) framework, built upon a convolutional neural network (CNN) backbone, comprising a base model, NeuroCardioSenseNet (NCSN), and an enhanced variant,NeuroCardioSenseNet-Fusion (NCSNF). NCSN constructs a novel Time-Aware Gated Convolution (TAG-Conv) layer together with a Time-Aware Gating Mechanism (TAGM), which adaptively modulate convolutional filters and cross-lead feature contributions based on local temporal context and channel energy distributions. This design enables joint modeling of morphological features and short-range temporal dynamics while reinforcing inter-lead coherence. Building upon NCSN, NCSNF incorporates a Time-Fuzzy Integration Module (TFIM) that constructs a learnable fuzzy subspace by jointly encoding features and membership degrees, effectively mitigating class boundary ambiguity and improving discriminability in limited-sample conditions. Extensive experiments on the MIT-BIH Arrhythmia Database demonstrate the superiority of the NCS framework. NCSN achieves 98.77% intra-patient and 87.82% inter-patient accuracy, while NCSNF further improves performance to 99.16% and 90.85%, respectively, outperforming existing baseline methods.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2026-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147363651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An accurate glucose detection platform using colorimetry and supervised learning algorithms. 使用比色法和监督学习算法的精确葡萄糖检测平台。
IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-03-18 DOI: 10.1088/2057-1976/ae4eee
Mithun Kanchan, Pragna Harish, Omkar S Powar, Harsh More, Emani Ruthvesh Reddy

Maintaining optimal health and preventing diabetes-related complications requires accurate and timely monitoring of blood glucose levels. In line with this, the present study focuses on developing an affordable, reliable, and precise Point-of-Care (POC) diagnostic platform for glucose detection by integrating microfluidic and colorimetric principles. The system employs a custom-fabricated microfluidic chip designed to facilitate efficient enzymatic color reactions using only ~20 μl of sample per microwell, achieving complete color development within 3-4 min. This chip is housed inside a compact, USB-powered 3D-printed imaging module equipped with a high-resolution fixed-focus camera, enabling consistent control over imaging parameters such as focal distance, camera alignment, and illumination conditions. The overall workflow is optimized for seamless compatibility with embedded systems or laptops, eliminating the dependency on smartphones or external calibration tools and making the setup well-suited for real-time diagnostic use in POC environments. A total of 1280 images, representing 16 glucose concentration levels ranging from 50 to 200 mg dl-1, were captured under standardized conditions, labelled according to known concentrations, and processed through uniform preprocessing steps. Engineered image features extracted from the preprocesses images were then analysed using supervised machine learning models, including Random Forest, Support Vector Machine (SVM), K-Nearest Neighbours (KNN), and a Feedforward Neural Network, to establish a robust predictive framework capable of delivering fast, consistent, and accurate glucose estimation for practical healthcare applications. Among the evaluated models, the Random Forest (RF) classifier achieved the highest cross-validation precision of 98% and an exceptional specificity approaching 100%. This clearly describes its ability to distinguish between different glucose concentration levels. Further, the confusion matrix and the ROC curve analysis have validated the model's reliability, with very minimal chances of misclassifications and a high mean AUC value of around 1. These results ensure the potential of the image-based glucose concentration estimation as a cost effective and a reliable, scalable solution for real time monitoring in various medical related industries.

保持最佳健康和预防糖尿病相关并发症需要准确及时地监测血糖水平。在此背景下,本研究通过整合微流体和比色原理,开发了一种经济、可靠、精确的即时诊断平台,用于葡萄糖检测。该系统使用定制的微流控芯片,每个微孔中只有~20 μ L的样品,可以实现高效的酶促显色反应,在3-4分钟内实现完全显色。该芯片被封装在一个紧凑的、usb供电的3d打印成像模块中,该模块配备了一个高分辨率的定焦摄像头,以确保对焦距、对准和照明的稳定控制。该工作流程旨在与嵌入式系统或笔记本电脑无缝兼容,消除对智能手机或外部校准工具的依赖,并支持实时POC部署。在标准化条件下捕获了代表50至200 mg/dL的16种葡萄糖浓度的1280张图像的数据集,并进行了标记和统一的预处理。使用监督机器学习模型(包括随机森林、支持向量机、k近邻和前馈神经网络)对从处理过的图像中提取的工程图像特征进行评估,以创建快速一致的葡萄糖估计的鲁棒预测框架。其中Random Forest模型交叉验证精度最高,达到98%,特异性接近100%,有效区分血糖水平。混淆矩阵和ROC分析进一步证实了可靠性,显示最小的误分类和平均AUC接近1。总的来说,所提出的基于图像的葡萄糖估计方法为各种医疗保健环境中的实时监测提供了一种具有成本效益、可扩展且准确的解决方案。该系统还具有试剂消耗低、分析时间短、操作简单等特点,适用于分散筛查、常规监测以及资源有限的临床环境,这些环境可能无法获得或延迟常规实验室检测。未来的工作将集中在扩大浓度范围,用临床样品验证性能,以及集成自动校准以支持大规模和长期可用性。
{"title":"An accurate glucose detection platform using colorimetry and supervised learning algorithms.","authors":"Mithun Kanchan, Pragna Harish, Omkar S Powar, Harsh More, Emani Ruthvesh Reddy","doi":"10.1088/2057-1976/ae4eee","DOIUrl":"10.1088/2057-1976/ae4eee","url":null,"abstract":"<p><p>Maintaining optimal health and preventing diabetes-related complications requires accurate and timely monitoring of blood glucose levels. In line with this, the present study focuses on developing an affordable, reliable, and precise Point-of-Care (POC) diagnostic platform for glucose detection by integrating microfluidic and colorimetric principles. The system employs a custom-fabricated microfluidic chip designed to facilitate efficient enzymatic color reactions using only ~20 μl of sample per microwell, achieving complete color development within 3-4 min. This chip is housed inside a compact, USB-powered 3D-printed imaging module equipped with a high-resolution fixed-focus camera, enabling consistent control over imaging parameters such as focal distance, camera alignment, and illumination conditions. The overall workflow is optimized for seamless compatibility with embedded systems or laptops, eliminating the dependency on smartphones or external calibration tools and making the setup well-suited for real-time diagnostic use in POC environments. A total of 1280 images, representing 16 glucose concentration levels ranging from 50 to 200 mg dl<sup>-1</sup>, were captured under standardized conditions, labelled according to known concentrations, and processed through uniform preprocessing steps. Engineered image features extracted from the preprocesses images were then analysed using supervised machine learning models, including Random Forest, Support Vector Machine (SVM), K-Nearest Neighbours (KNN), and a Feedforward Neural Network, to establish a robust predictive framework capable of delivering fast, consistent, and accurate glucose estimation for practical healthcare applications. Among the evaluated models, the Random Forest (RF) classifier achieved the highest cross-validation precision of 98% and an exceptional specificity approaching 100%. This clearly describes its ability to distinguish between different glucose concentration levels. Further, the confusion matrix and the ROC curve analysis have validated the model's reliability, with very minimal chances of misclassifications and a high mean AUC value of around 1. These results ensure the potential of the image-based glucose concentration estimation as a cost effective and a reliable, scalable solution for real time monitoring in various medical related industries.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2026-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147389095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unet-like transformer with variable shifted windows for low dose CT denoising. 具有可变移位窗口的unet样变压器,用于低剂量CT去噪。
IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-03-18 DOI: 10.1088/2057-1976/ae41c5
Jianfang Li, Fazhi Qi, Yakang Li, Juan Chen, Yijie Pu, Shengxiang Wang

Low-dose computed tomography (LDCT) is crucial for reducing radiation exposure in medical imaging, but it often yields noisy images with artifacts that compromise diagnostic accuracy. Recently, Transformer-based models have shown great potential for LDCT denoising by modeling long-range dependencies and global context. However, standard Transformers incur prohibitive computational costs when applied to high-resolution medical images. To address this challenge, we propose a novel pure Transformer architecture for LDCT image restoration, designed within a hierarchical U-Net framework. The core of our innovation is the integration of an agent attention mechanism into a variable shifted-window design. This agent attention module efficiently approximates global self-attention by using a small set of agent tokens to aggregate and broadcast global contextual information, thereby achieving a global receptive field with only linear computational complexity. By embedding this mechanism within a multi-scale U-Net structure, our model effectively captures both fine-grained local details and long-range structural dependencies without sacrificing computational efficiency. Comprehensive experiments on a public LDCT dataset demonstrate that our method achieves state-of-the-art performance, outperforming existing approaches in both quantitative metrics and qualitative visual comparisons.

低剂量计算机断层扫描(LDCT)对于减少医学成像中的辐射暴露至关重要,但它经常产生带有伪影的噪声图像,从而影响诊断的准确性。最近,基于变压器的模型通过建模长期依赖关系和全局上下文显示出LDCT去噪的巨大潜力。然而,当应用于高分辨率医学图像时,标准变形金刚会产生令人望而却步的计算成本。为了解决这一挑战,我们提出了一种新的纯变压器结构用于LDCT图像恢复,该结构在分层U-Net框架内设计。我们创新的核心是将代理注意力机制集成到可变移动窗口设计中。该智能体关注模块通过使用一小组智能体令牌来聚合和传播全局上下文信息,从而实现仅具有线性计算复杂度的全局接受场,从而有效地近似全局自关注。通过在多尺度U-Net结构中嵌入这种机制,我们的模型在不牺牲计算效率的情况下有效地捕获了细粒度的局部细节和远程结构依赖关系。在公共LDCT数据集上的综合实验表明,我们的方法达到了最先进的性能,在定量指标和定性视觉比较方面都优于现有方法。
{"title":"Unet-like transformer with variable shifted windows for low dose CT denoising.","authors":"Jianfang Li, Fazhi Qi, Yakang Li, Juan Chen, Yijie Pu, Shengxiang Wang","doi":"10.1088/2057-1976/ae41c5","DOIUrl":"10.1088/2057-1976/ae41c5","url":null,"abstract":"<p><p>Low-dose computed tomography (LDCT) is crucial for reducing radiation exposure in medical imaging, but it often yields noisy images with artifacts that compromise diagnostic accuracy. Recently, Transformer-based models have shown great potential for LDCT denoising by modeling long-range dependencies and global context. However, standard Transformers incur prohibitive computational costs when applied to high-resolution medical images. To address this challenge, we propose a novel pure Transformer architecture for LDCT image restoration, designed within a hierarchical U-Net framework. The core of our innovation is the integration of an agent attention mechanism into a variable shifted-window design. This agent attention module efficiently approximates global self-attention by using a small set of agent tokens to aggregate and broadcast global contextual information, thereby achieving a global receptive field with only linear computational complexity. By embedding this mechanism within a multi-scale U-Net structure, our model effectively captures both fine-grained local details and long-range structural dependencies without sacrificing computational efficiency. Comprehensive experiments on a public LDCT dataset demonstrate that our method achieves state-of-the-art performance, outperforming existing approaches in both quantitative metrics and qualitative visual comparisons.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2026-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146117669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal skin disease classification using vision transformers, medical captioning, and metadata fusion: an analysis on the ISIC 2024 dataset. 使用视觉转换器、医疗字幕和元数据融合的多模态皮肤病分类:对ISIC 2024数据集的分析。
IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-03-18 DOI: 10.1088/2057-1976/ae4eeb
Aadit Shrestha, Aditi Palit

Skin cancer and dermatological diseases are among the most prevalent global health conditions, where early and accurate diagnosis is critical for improving patient outcomes. Although deep learning models have achieved strong performance in dermoscopic image classification, many existing approaches primarily rely on visual features and make limited use of complementary clinical metadata and language-based context routinely considered by dermatologists. Recent vision-language models (VLMs), including medical-domain adaptations such as MedCLIP, have begun to show promise in dermatology; however, their integration with structured clinical metadata and the impact of different multimodal fusion strategies have not been systematically analyzed. In this work, we address the binary skin lesion classification problem by conducting a structured evaluation of MedCLIP-based multimodal embeddings combined with classical machine learning and neural classifiers. Image-text representations are extracted using MedCLIP and fused with patient metadata through early and attention-based fusion mechanisms, followed by multilayer perceptron (MLP) and ensemble classifiers. Experiments are performed on a curated subset of the ISIC 2024 dataset comprising 1,600 training and 400 test dermoscopic images with associated metadata. The proposed multimodal approach achieves an accuracy of 96% (95.7% exact) with AUROC = 0.987, outperforming unimodal baselines and demonstrating the complementary value of language and metadata for skin lesion diagnosis. This study provides a comprehensive analysis of MedCLIP-based multimodal learning in dermatology and highlights the importance of fusion design in vision-language-metadata systems for computer-aided diagnosis.

皮肤癌和皮肤病是全球最普遍的健康状况之一,早期和准确的诊断对于改善患者的预后至关重要。尽管深度学习模型在皮肤镜图像分类方面取得了很强的表现,但许多现有的方法主要依赖于视觉特征,并且有限地使用了皮肤科医生通常考虑的互补临床元数据和基于语言的上下文。最近的视觉语言模型(VLMs),包括医学领域的适应,如MedCLIP,已经开始在皮肤病学中显示出希望;然而,它们与结构化临床元数据的整合以及不同多模式融合策略的影响尚未得到系统分析。在这项工作中,我们通过结合经典机器学习和神经分类器对基于medclip的多模态嵌入进行结构化评估,解决了皮肤损伤的二元分类问题。使用MedCLIP提取图像-文本表示,并通过早期和基于注意力的融合机制与患者元数据融合,然后使用多层感知器(MLP)和集成分类器。实验是在ISIC 2024数据集的一个精心策划的子集上进行的,该数据集包括1600张训练图像和400张测试皮肤镜图像,以及相关的元数据。所提出的多模态方法准确率达到96%(95.7%准确),AUROC = 0.987,优于单模态基线,证明了语言和元数据在皮肤病变诊断中的互补价值。本研究提供了基于medclip的皮肤病学多模式学习的全面分析,并强调了融合设计在计算机辅助诊断的视觉语言元数据系统中的重要性。
{"title":"Multimodal skin disease classification using vision transformers, medical captioning, and metadata fusion: an analysis on the ISIC 2024 dataset.","authors":"Aadit Shrestha, Aditi Palit","doi":"10.1088/2057-1976/ae4eeb","DOIUrl":"10.1088/2057-1976/ae4eeb","url":null,"abstract":"<p><p>Skin cancer and dermatological diseases are among the most prevalent global health conditions, where early and accurate diagnosis is critical for improving patient outcomes. Although deep learning models have achieved strong performance in dermoscopic image classification, many existing approaches primarily rely on visual features and make limited use of complementary clinical metadata and language-based context routinely considered by dermatologists. Recent vision-language models (VLMs), including medical-domain adaptations such as MedCLIP, have begun to show promise in dermatology; however, their integration with structured clinical metadata and the impact of different multimodal fusion strategies have not been systematically analyzed. In this work, we address the binary skin lesion classification problem by conducting a structured evaluation of MedCLIP-based multimodal embeddings combined with classical machine learning and neural classifiers. Image-text representations are extracted using MedCLIP and fused with patient metadata through early and attention-based fusion mechanisms, followed by multilayer perceptron (MLP) and ensemble classifiers. Experiments are performed on a curated subset of the ISIC 2024 dataset comprising 1,600 training and 400 test dermoscopic images with associated metadata. The proposed multimodal approach achieves an accuracy of 96% (95.7% exact) with AUROC = 0.987, outperforming unimodal baselines and demonstrating the complementary value of language and metadata for skin lesion diagnosis. This study provides a comprehensive analysis of MedCLIP-based multimodal learning in dermatology and highlights the importance of fusion design in vision-language-metadata systems for computer-aided diagnosis.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2026-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147389190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
In vitrogold nanoparticle radiation sensitization effects in conjunction with a 2.5 megavoltage photon beam in MDA-MB-231, HeLa and PC-3 cell lines. 2.5兆电压光子束对MDA-MB-231、HeLa和PC-3细胞系的体外金纳米粒子辐射增敏效应
IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-03-18 DOI: 10.1088/2057-1976/ae4eec
Xiao Qing Yao, Sarah A Sabatinos, Eric Da Silva, Rao Khan, James Gräfe, Raffi Karshafian

Gold nanoparticle (GNP) enhanced radiosensitization was studied across three tumour cells lines that vary in radiosensitivity for a 2.5 MV photon beam. The intrinsic sensitivity of the cell lines is studied for the effects of GNPs. Three cell lines which exhibit differences in radiation response: prostate adenocarcinoma (PC-3), breast adenocarcinoma (MDA-MB-231), and cervical adenocarcinoma (HeLa) were used to determine thein vitrodose enhancement effects of GNPs combined with a 2.5 MV photon beam. Cells were incubated with 20 μg ml-1GNPs for 24 h, and any extracellular GNPs were washed out prior to each assay. The cellular uptake of GNPs was assessed with inductively coupled plasma optical emission spectroscopy (ICP-OES). Clonogenic assays were conducted to assess cell viability after irradiation. The biological damage was assessed through DNA damage using terminal deoxynucleotidyl transferase dUTP nick end labeling assay (TUNEL). The production of reactive oxygen species (ROS) was assessed using CellRox assays. The enhancement factor when cells were irradiated in the presence of GNPs with 2.5 MV was 1.35 ± 0.46 for HeLa, 1.35 ± 0.11 for MDA-MB-231, and 0.99 ± 0.08 for PC-3 cells. On average, the level of DNA damage increased in MDA-MB-231 and HeLa cells when irradiated with 2.5 MV in the presence of GNPs. Increase in the ROS levels were detected in all cell lines when irradiated in the presence of GNPs. The enhancement effects with GNPs combined with a 2.5 MV photon beam were dependent on the cell line. The enhancement factor for HeLa and MDA-MB-231 supports further investigation of intermediate photon-energy beams in combination with GNPs. The combination of using a conventionally lower energy megavoltage beam with gold nanoparticles may become applicable in the clinical setting due to reduced skin dose and enhanced secondary electron production.

目的:研究金纳米颗粒(GNP)介导的三种肿瘤细胞系在2.5 MV x射线束下的放射敏感性。材料和方法:采用前列腺腺癌(PC-3)、乳腺腺癌(MDA-MB-231)和宫颈腺癌(HeLa)细胞,测定GNPs联合2.5 MV光子束照射细胞的体外剂量增强效应,以确定2.5 MV在具有不同辐射响应的细胞系中照射细胞的可行性。细胞用20 μg/mL GNPs孵育24小时,每次检测前冲洗细胞外GNPs。采用电感耦合等离子体发射光谱(ICP-OES)评价GNPs的细胞摄取。通过克隆实验评估辐照后的细胞活力。采用末端脱氧核苷酸转移酶dUTP缺口末端标记法(TUNEL)对DNA损伤进行评价。结果:细胞在GNPs的2.5 MV照射下,HeLa细胞的增强因子为1.35±0.46,MDA-MB-231细胞的增强因子为1.35±0.11,PC-3细胞的增强因子为0.99±0.08。平均而言,在GNPs存在的2.5 MV照射下,MDA-MB-231和HeLa细胞的DNA损伤水平增加。当GNPs存在时,在所有细胞系中检测到ROS水平增加。结论:GNPs联合2.5 MV光子束的增强效应与细胞系有关。HeLa和MDA-MB-231的增强因子支持了中间光子能量束与GNPs结合的进一步研究。由于减少皮肤剂量和增强二次电子产生,使用传统的低能量巨电压束与金纳米粒子的组合可能适用于临床环境。& # xD。
{"title":"<i>In vitro</i>gold nanoparticle radiation sensitization effects in conjunction with a 2.5 megavoltage photon beam in MDA-MB-231, HeLa and PC-3 cell lines.","authors":"Xiao Qing Yao, Sarah A Sabatinos, Eric Da Silva, Rao Khan, James Gräfe, Raffi Karshafian","doi":"10.1088/2057-1976/ae4eec","DOIUrl":"10.1088/2057-1976/ae4eec","url":null,"abstract":"<p><p>Gold nanoparticle (GNP) enhanced radiosensitization was studied across three tumour cells lines that vary in radiosensitivity for a 2.5 MV photon beam. The intrinsic sensitivity of the cell lines is studied for the effects of GNPs. Three cell lines which exhibit differences in radiation response: prostate adenocarcinoma (PC-3), breast adenocarcinoma (MDA-MB-231), and cervical adenocarcinoma (HeLa) were used to determine the<i>in vitro</i>dose enhancement effects of GNPs combined with a 2.5 MV photon beam. Cells were incubated with 20 μg ml<sup>-1</sup>GNPs for 24 h, and any extracellular GNPs were washed out prior to each assay. The cellular uptake of GNPs was assessed with inductively coupled plasma optical emission spectroscopy (ICP-OES). Clonogenic assays were conducted to assess cell viability after irradiation. The biological damage was assessed through DNA damage using terminal deoxynucleotidyl transferase dUTP nick end labeling assay (TUNEL). The production of reactive oxygen species (ROS) was assessed using CellRox assays. The enhancement factor when cells were irradiated in the presence of GNPs with 2.5 MV was 1.35 ± 0.46 for HeLa, 1.35 ± 0.11 for MDA-MB-231, and 0.99 ± 0.08 for PC-3 cells. On average, the level of DNA damage increased in MDA-MB-231 and HeLa cells when irradiated with 2.5 MV in the presence of GNPs. Increase in the ROS levels were detected in all cell lines when irradiated in the presence of GNPs. The enhancement effects with GNPs combined with a 2.5 MV photon beam were dependent on the cell line. The enhancement factor for HeLa and MDA-MB-231 supports further investigation of intermediate photon-energy beams in combination with GNPs. The combination of using a conventionally lower energy megavoltage beam with gold nanoparticles may become applicable in the clinical setting due to reduced skin dose and enhanced secondary electron production.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2026-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147389055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HARMONIC organ dose calculation framework for cardiac fluoroscopy. 心脏透视的HARMONIC器官剂量计算框架。
IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-03-17 DOI: 10.1088/2057-1976/ae4c96
Jérémie Dabin, Mahmoud Abdelrahman, David Borrego, Haegin Han, Choonsik Lee, Richard Harbron, Vadim Chumak, Angeliki Karambatsakidou, Serge Dreuil, Isabelle Thierry-Chef

Introduction. The HARMONIC study (health effects of cardiac fluoroscopy and modern radiotherapy in paediatrics) investigates, among other objectives, the relationship between ionising radiation and cancer incidence in children treated by cardiac fluoroscopy (HARMONIC-Cardio). This requires estimation of organ doses for a large patient cohort. This article describes the development and validation of a framework for calculations of cardiac fluoroscopy doses, and its application to build a software tool for rapid cohort dosimetry.Materials and methods. Organ doses were calculated with MCNP6.2, a Monte Carlo particle transport code. Realistic, anthropomorphic phantoms representing patients of both sexes at specific ages (newborn, 1, 5, 10, 15 year old and adult) were used. A large range of technical parameters was covered in the simulations, including 11 primary beam angles and seven secondary angles, four levels of beam filtration, three tube voltages and three field sizes. The absorbed dose was computed for 32 organs and tissues. The calculated organ doses were normalised to the air kerma-area product (PKA), a common modality-specific dose index, resulting in PKA-to-organ-dose conversion coefficients.Results. Organ dose conversion coefficients were calculated for 22,176 exposure configurations and extended to a total of 2,667,600. A coefficient database for 12 organs of interest for radiation protection and effective dose, was embedded in HARMONIC-CardioDose, a software tool that enables the estimation of organ doses for any exposure scenario within the simulation range. The program is in the form of a Python script or an executable file (.exe), and uses an Excel document for inputting the calculation parameters.Conclusion. A framework for the calculation of cardiac fluoroscopy doses was developed and validated. It was used as the basis for HARMONIC-CardioDose, a rapid software tool for organ dose estimates for epidemiology studies. The tool is also freely available to the medical and research community for supporting patient dosimetry.

HARMONIC- cardio研究(儿科心脏透视和现代放射治疗对健康的影响)的目的之一是调查电离辐射与接受心脏透视治疗的儿童癌症发病率之间的关系。这需要对大量患者进行器官剂量估计。开发了精确剂量计算框架,并用于创建快速剂量估计的软件工具。使用MCNP6.2蒙特卡罗粒子传输代码计算器官剂量。采用8个逼真的拟人模型,分别代表新生儿、1岁、5岁、10岁、15岁和成年患者。只有15岁以下的患者使用男性幽灵;老年患者为男性和女性)。仿真涉及的技术参数范围很大,包括11个主波束角和7个次波束角、4个波束滤波水平、3种管电压和3种场尺寸。计算了32个器官和组织的吸收剂量。将计算出的器官剂量归一化为空气角面积积(PKA),这是一种常见的模态特异性剂量指数,从而得到PKA-器官剂量转换系数。计算了22176种暴露形式的器官剂量转换系数,并将其内插到总共2,667,600种。15个感兴趣的辐射防护器官的系数和有效剂量已嵌入HARMONIC-CardioDose:这是一种软件工具,使用户能够估计模拟范围内几乎任何暴露情景的器官剂量。该程序采用Python脚本或可执行文件的形式,并使用简单的Excel文档输入计算所需的技术参数。HARMONIC-CardioDose是一种经过验证的软件工具,用于快速估计接受介入性心脏病治疗的儿科和成人患者的器官剂量。该工具免费提供给医学和研究界,用于支持患者剂量测定。
{"title":"HARMONIC organ dose calculation framework for cardiac fluoroscopy.","authors":"Jérémie Dabin, Mahmoud Abdelrahman, David Borrego, Haegin Han, Choonsik Lee, Richard Harbron, Vadim Chumak, Angeliki Karambatsakidou, Serge Dreuil, Isabelle Thierry-Chef","doi":"10.1088/2057-1976/ae4c96","DOIUrl":"10.1088/2057-1976/ae4c96","url":null,"abstract":"<p><p><i>Introduction</i>. The HARMONIC study (health effects of cardiac fluoroscopy and modern radiotherapy in paediatrics) investigates, among other objectives, the relationship between ionising radiation and cancer incidence in children treated by cardiac fluoroscopy (HARMONIC-Cardio). This requires estimation of organ doses for a large patient cohort. This article describes the development and validation of a framework for calculations of cardiac fluoroscopy doses, and its application to build a software tool for rapid cohort dosimetry.<i>Materials and methods</i>. Organ doses were calculated with MCNP6.2, a Monte Carlo particle transport code. Realistic, anthropomorphic phantoms representing patients of both sexes at specific ages (newborn, 1, 5, 10, 15 year old and adult) were used. A large range of technical parameters was covered in the simulations, including 11 primary beam angles and seven secondary angles, four levels of beam filtration, three tube voltages and three field sizes. The absorbed dose was computed for 32 organs and tissues. The calculated organ doses were normalised to the air kerma-area product (P<sub>KA</sub>), a common modality-specific dose index, resulting in P<sub>KA</sub>-to-organ-dose conversion coefficients.<i>Results</i>. Organ dose conversion coefficients were calculated for 22,176 exposure configurations and extended to a total of 2,667,600. A coefficient database for 12 organs of interest for radiation protection and effective dose, was embedded in HARMONIC-CardioDose, a software tool that enables the estimation of organ doses for any exposure scenario within the simulation range. The program is in the form of a Python script or an executable file (.exe), and uses an Excel document for inputting the calculation parameters.<i>Conclusion</i>. A framework for the calculation of cardiac fluoroscopy doses was developed and validated. It was used as the basis for HARMONIC-CardioDose, a rapid software tool for organ dose estimates for epidemiology studies. The tool is also freely available to the medical and research community for supporting patient dosimetry.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2026-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147347275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Gaussian mixture model for combining single threshold and adaptive threshold segmentation of bone microstructure. 一种结合单阈值和自适应阈值分割的骨微结构高斯混合模型。
IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-03-16 DOI: 10.1088/2057-1976/ae4c92
Jilmen Quintiens, G Harry van Lenthe

Segmentation of high-resolution CT enables the assessment of bone microstructure and provides relevant information in determining bone quality and fracture risk. Segmentation is typically performed with a global threshold value, expressed in bone mineral density (BMD). While consistent, a global threshold can lead to a poor representation of bone microstructure; amongst others, caused by altered bone physiology, image noise, image inhomogeneities, and partial volume effects. Adaptive threshold segmentations can preserve fine features, yet, it does not retain any quantitative information on BMD. We propose a new method for bone microstructure segmentation based on a Gaussian mixture model (GMM). This technique models the normalized image histogram as a bi-modal Gaussian distribution, reflecting bone and non-bone voxels, and gives an analytical description of the intensity threshold where probability of belonging to either class is equal. Next, the technique calculates a range of intensities around this threshold, where the tissue class is uncertain; only intensities within this range are adaptively segmented; others with a single threshold. Verification n was performed using simulated images witha prioriknown distributions. The GMM subsequentially reconstructed the input models. Next, high-resolution peripheral quantitative CT (HR-pQCT) and photon-counting CT (PCCT) images of cadaveric wrists were segmented, and segmentations were scored against reference segmentations from micro-CT. GMM segmentation accuracy was compared to adaptive and global thresholding. The optimal threshold from simulated images could accurately be determined, provided the bi-modal components did not accumulate into a single Gaussian. For HR-pQCT, full adaptive segmentation achieved the highest segmentation accuracy in this specific dataset (86.2 ± 3.7%), although the GMM led to comparable results (83.0 ± 2.3%). For PCCT, the GMM led to a slightly higher segmentation accuracy (77.9 ± 2.3%) than adaptive segmentation did (76.4 ± 2.4%). We conclude that this new method can segment bone microstructure with comparable accuracy as conventional techniques. Through the definition of uncertain intensities, the GMM method has the benefit that it provides the opportunity to tune the segmentation towards higher sensitivity or specificity, depending on the objective.

高分辨率CT的分割可以评估骨微观结构,并为确定骨质量和骨折风险提供相关信息。分割通常使用全局阈值进行,以骨矿物质密度(BMD)表示。虽然一致,但全局阈值可能导致骨微观结构的较差表示;其中包括骨生理改变、图像噪声、图像不均匀性和部分体积效应。自适应阈值分割可以保留较好的特征,但不能保留BMD的定量信息。提出了一种基于高斯混合模型(GMM)的骨微结构分割方法。该技术将归一化图像直方图建模为双峰高斯分布,反映骨骼和非骨骼体素,并给出了属于任何一类的概率相等的强度阈值的分析描述。接下来,该技术计算该阈值附近的强度范围,其中组织类别是不确定的;只有在这个范围内的强度才会被自适应分割;其他只有一个阈值的。使用具有先验已知分布的模拟图像进行验证。GMM随后重建输入模型。接下来,对尸体手腕的HR-pQCT和光子计数CT (PCCT)图像进行分割,并对来自micro-CT的参考分割进行评分。比较了自适应阈值法和全局阈值法的分割精度。从模拟图像中可以准确地确定最佳阈值,只要双模态分量不累积成单个高斯分布。对于HR-pQCT,完全自适应分割的准确率最高(86.2±3.7%),而GMM的结果与之相当(83.0±2.3%)。对于PCCT, GMM分割的准确率(77.9±2.3%)高于自适应分割(76.4±2.4%)。我们得出的结论是,这种方法可以分割骨微结构与精度相当的传统技术。通过对不确定强度的定义,GMM方法提供了向更高灵敏度或特异性调整分割的机会。
{"title":"A Gaussian mixture model for combining single threshold and adaptive threshold segmentation of bone microstructure.","authors":"Jilmen Quintiens, G Harry van Lenthe","doi":"10.1088/2057-1976/ae4c92","DOIUrl":"10.1088/2057-1976/ae4c92","url":null,"abstract":"<p><p>Segmentation of high-resolution CT enables the assessment of bone microstructure and provides relevant information in determining bone quality and fracture risk. Segmentation is typically performed with a global threshold value, expressed in bone mineral density (BMD). While consistent, a global threshold can lead to a poor representation of bone microstructure; amongst others, caused by altered bone physiology, image noise, image inhomogeneities, and partial volume effects. Adaptive threshold segmentations can preserve fine features, yet, it does not retain any quantitative information on BMD. We propose a new method for bone microstructure segmentation based on a Gaussian mixture model (GMM). This technique models the normalized image histogram as a bi-modal Gaussian distribution, reflecting bone and non-bone voxels, and gives an analytical description of the intensity threshold where probability of belonging to either class is equal. Next, the technique calculates a range of intensities around this threshold, where the tissue class is uncertain; only intensities within this range are adaptively segmented; others with a single threshold. Verification n was performed using simulated images with<i>a priori</i>known distributions. The GMM subsequentially reconstructed the input models. Next, high-resolution peripheral quantitative CT (HR-pQCT) and photon-counting CT (PCCT) images of cadaveric wrists were segmented, and segmentations were scored against reference segmentations from micro-CT. GMM segmentation accuracy was compared to adaptive and global thresholding. The optimal threshold from simulated images could accurately be determined, provided the bi-modal components did not accumulate into a single Gaussian. For HR-pQCT, full adaptive segmentation achieved the highest segmentation accuracy in this specific dataset (86.2 ± 3.7%), although the GMM led to comparable results (83.0 ± 2.3%). For PCCT, the GMM led to a slightly higher segmentation accuracy (77.9 ± 2.3%) than adaptive segmentation did (76.4 ± 2.4%). We conclude that this new method can segment bone microstructure with comparable accuracy as conventional techniques. Through the definition of uncertain intensities, the GMM method has the benefit that it provides the opportunity to tune the segmentation towards higher sensitivity or specificity, depending on the objective.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147347191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Breast Pathology Image Segmentation Based on DESB-Net: A Fusion Strategy of Detail Enhancement, Edge Focus, and Cross-Layer Connections. 基于DESB-Net的乳腺病理图像分割:一种细节增强、边缘聚焦和跨层连接的融合策略。
IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-03-16 DOI: 10.1088/2057-1976/ae524e
Min Liu, Jianda Wang, Benhui Wu, Hengbo Hu

Breast cancer poses a significant threat to women's health, and early diagnosis is crucial for reducing mortality rates. Automatic breast tumor segmentation is important in medical image processing, but existing methods face challenges with breast pathology images due to sample scarcity, image degradation after data augmentation, and limitations in feature extraction. Traditional networks like U-Net often lose small lesions and edge details during downsampling and struggle with complex images and class imbalance. To address these issues, this study proposes DetailEdgeSkipBalance-Net (DESB-Net), an improved segmentation model based on U-Net. DESB-Net includes several innovations: Enhanced Detail-aware Multi-scale Re-parameterized Convolution (EDConv) for enhanced feature extraction, HistoEdge Focus Module (HEFM) for edge enhancement, Multi-Path Fusion Module (MPFM) for multi-scale feature fusion, and Binary Cross-Entropy Dice Loss (BD Loss) to balance class imbalance and boundary accuracy. These improvements significantly enhance the model's ability to capture small lesions and edge details, improve segmentation accuracy and robustness, and maintain high computational efficiency. On the UCSB dataset, DESB-Net achieved an mIoU of 79.53% and accuracy of 97.02%, outperforming U-Net by 6.5% and 1.89%, respectively, without increasing parameters or computational load. On the BCSS dataset, it achieved an mIoU of 63.4% and accuracy of 85.8%, surpassing U-Net by 4.2% and 2.6%. DESB-Net also outperformed mainstream models like DeepLabv3+, SegFormer, ResUNet, and Connected-UNets, demonstrating its effectiveness in breast pathology image segmentation. These results highlight the potential of DESB-Net to improve diagnostic accuracy and efficiency in clinical settings, making it a promising tool for early detection and treatment of breast cancer.

乳腺癌对妇女健康构成重大威胁,早期诊断对降低死亡率至关重要。乳腺肿瘤自动分割在医学图像处理中具有重要意义,但现有方法在处理乳腺病理图像时存在样本稀缺、数据增强后图像退化以及特征提取的局限性等问题。像U-Net这样的传统网络经常在降采样过程中丢失小的病灶和边缘细节,并与复杂的图像和类别不平衡作斗争。为了解决这些问题,本研究提出了一种基于U-Net的改进分割模型DetailEdgeSkipBalance-Net (DESB-Net)。DESB-Net包括几个创新:用于增强特征提取的增强细节感知多尺度重新参数化卷积(EDConv),用于边缘增强的HistoEdge Focus模块(HEFM),用于多尺度特征融合的多路径融合模块(MPFM),以及用于平衡类不平衡和边界精度的二元交叉熵Dice Loss (BD Loss)。这些改进显著增强了模型捕获小病灶和边缘细节的能力,提高了分割精度和鲁棒性,保持了较高的计算效率。在UCSB数据集上,在不增加参数和计算负荷的情况下,DESB-Net的mIoU为79.53%,准确率为97.02%,分别比U-Net高6.5%和1.89%。在BCSS数据集上,mIoU为63.4%,准确率为85.8%,分别比U-Net高4.2%和2.6%。DESB-Net也优于DeepLabv3+、SegFormer、ResUNet、Connected-UNets等主流模型,证明了其在乳腺病理图像分割中的有效性。这些结果突出了DESB-Net在提高临床诊断准确性和效率方面的潜力,使其成为早期发现和治疗乳腺癌的有前途的工具。
{"title":"Breast Pathology Image Segmentation Based on DESB-Net: A Fusion Strategy of Detail Enhancement, Edge Focus, and Cross-Layer Connections.","authors":"Min Liu, Jianda Wang, Benhui Wu, Hengbo Hu","doi":"10.1088/2057-1976/ae524e","DOIUrl":"https://doi.org/10.1088/2057-1976/ae524e","url":null,"abstract":"<p><p>Breast cancer poses a significant threat to women's health, and early diagnosis is crucial for reducing mortality rates. Automatic breast tumor segmentation is important in medical image processing, but existing methods face challenges with breast pathology images due to sample scarcity, image degradation after data augmentation, and limitations in feature extraction. Traditional networks like U-Net often lose small lesions and edge details during downsampling and struggle with complex images and class imbalance. To address these issues, this study proposes DetailEdgeSkipBalance-Net (DESB-Net), an improved segmentation model based on U-Net. DESB-Net includes several innovations: Enhanced Detail-aware Multi-scale Re-parameterized Convolution (EDConv) for enhanced feature extraction, HistoEdge Focus Module (HEFM) for edge enhancement, Multi-Path Fusion Module (MPFM) for multi-scale feature fusion, and Binary Cross-Entropy Dice Loss (BD Loss) to balance class imbalance and boundary accuracy. These improvements significantly enhance the model's ability to capture small lesions and edge details, improve segmentation accuracy and robustness, and maintain high computational efficiency. On the UCSB dataset, DESB-Net achieved an mIoU of 79.53% and accuracy of 97.02%, outperforming U-Net by 6.5% and 1.89%, respectively, without increasing parameters or computational load. On the BCSS dataset, it achieved an mIoU of 63.4% and accuracy of 85.8%, surpassing U-Net by 4.2% and 2.6%. DESB-Net also outperformed mainstream models like DeepLabv3+, SegFormer, ResUNet, and Connected-UNets, demonstrating its effectiveness in breast pathology image segmentation. These results highlight the potential of DESB-Net to improve diagnostic accuracy and efficiency in clinical settings, making it a promising tool for early detection and treatment of breast cancer.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147466678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variances in 3D radiomic shape features between meningioma, acoustic neuroma, and pituitary adenoma and the impact on dosimetric plan quality in Gamma Knife stereotactic radiosurgery. 脑膜瘤、听神经瘤和垂体腺瘤三维放射学形态特征的差异及其对伽玛刀立体定向放射手术剂量计划质量的影响
IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-03-16 DOI: 10.1088/2057-1976/ae4def
Seungjong Oh, Minsol Kim, Kevin Renick, Ana Heermann, Samuel Oh, Hugh Lee, Timothy Mitchell, Sven Ferguson, Sreekrishna Goddu, Nels Knutson, Taeho Kim

The purpose of this study is to address two questions regarding Gamma Knife Stereotactic Radiosurgery (GKSRS) planning: 1) Are there shape differences among disease types? and 2) Does plan quality vary by disease type? We considered meningioma, acoustic neuroma, and pituitary adenoma. For analysis, we exported a retrospective dataset from the treatment planning system (TPS) in DICOM format, including single fraction treatments with prescriptions at the 50% isodose line from 232 patients, completed between February 2018 and June 2023. The analysis included TPS-reported planning parameters, 3D radiomic shape features, and Gaussian Weighted Conformity Index (GWCI). The results were analyzed using two-sample t-tests and Pearson correlation coefficients. Although there were no statistically significant differences in volume, the cross-section of meningioma was the most circular, as defined by its major and minor axes. Pituitary adenoma exhibited the most flattened shape along its least axis. These results indicate that pituitary adenomas have distinct 3D shape characteristics. Pituitary adenoma required more shots, indicating they are more complex to plan. Acoustic neuromas had a similar number of shots to meningioma but showed better selectivity, implying it was easier to achieve planning guidelines, particularly for coverage. The normalized beam-on time for meningiomas was the shortest and the GWCI of acoustic neuromas was higher than that of the other two diseases, both statistically significant. A weak correlation between normalized BOT and the number of shots was found, suggesting that other factors beyond target shape influence plan complexity. Based on this study, shape differences exist among the considered diseases. Plan quality also varies by disease type. Pituitary adenomas are complex to plan, acoustic neuromas have better selectivity, and meningiomas have the shortest beam-on time. Factors beyond shape can also influence plan complexity.

本研究旨在探讨伽玛刀立体定向放射手术(GKSRS)计划的两个问题:1)疾病类型之间是否存在形状差异?2)计划质量是否因疾病类型而异?我们考虑了脑膜瘤、听神经瘤和垂体腺瘤。为了进行分析,我们以DICOM格式导出了治疗计划系统(TPS)的回顾性数据集,包括2018年2月至2023年6月期间完成的232例患者的单组分治疗,处方在50%等剂量线上。分析包括tps报告的规划参数、三维放射形状特征和高斯加权符合性指数(GWCI)。采用双样本t检验和Pearson相关系数对结果进行分析。虽然在体积上没有统计学上的差异,但脑膜瘤的横切面是最圆的,由其长轴和短轴定义。垂体腺瘤沿最小轴呈扁平状。提示垂体腺瘤具有明显的三维形态特征。垂体腺瘤需要更多的注射,这表明它们的计划更复杂。听神经瘤与脑膜瘤有相似的注射次数,但表现出更好的选择性,这意味着更容易实现计划指南,特别是覆盖范围。脑膜瘤的归一化照射时间最短,听神经瘤的GWCI高于其他两种疾病,均有统计学意义。归一化BOT与射击次数之间存在较弱的相关性,表明除目标形状外还有其他因素影响计划复杂性。根据这项研究,在被考虑的疾病中存在形状差异。计划质量也因疾病类型而异。垂体腺瘤是复杂的计划,听神经瘤有更好的选择性,脑膜瘤有最短的照射时间。形状以外的因素也会影响计划的复杂性。
{"title":"Variances in 3D radiomic shape features between meningioma, acoustic neuroma, and pituitary adenoma and the impact on dosimetric plan quality in Gamma Knife stereotactic radiosurgery.","authors":"Seungjong Oh, Minsol Kim, Kevin Renick, Ana Heermann, Samuel Oh, Hugh Lee, Timothy Mitchell, Sven Ferguson, Sreekrishna Goddu, Nels Knutson, Taeho Kim","doi":"10.1088/2057-1976/ae4def","DOIUrl":"10.1088/2057-1976/ae4def","url":null,"abstract":"<p><p>The purpose of this study is to address two questions regarding Gamma Knife Stereotactic Radiosurgery (GKSRS) planning: 1) Are there shape differences among disease types? and 2) Does plan quality vary by disease type? We considered meningioma, acoustic neuroma, and pituitary adenoma. For analysis, we exported a retrospective dataset from the treatment planning system (TPS) in DICOM format, including single fraction treatments with prescriptions at the 50% isodose line from 232 patients, completed between February 2018 and June 2023. The analysis included TPS-reported planning parameters, 3D radiomic shape features, and Gaussian Weighted Conformity Index (GWCI). The results were analyzed using two-sample t-tests and Pearson correlation coefficients. Although there were no statistically significant differences in volume, the cross-section of meningioma was the most circular, as defined by its major and minor axes. Pituitary adenoma exhibited the most flattened shape along its least axis. These results indicate that pituitary adenomas have distinct 3D shape characteristics. Pituitary adenoma required more shots, indicating they are more complex to plan. Acoustic neuromas had a similar number of shots to meningioma but showed better selectivity, implying it was easier to achieve planning guidelines, particularly for coverage. The normalized beam-on time for meningiomas was the shortest and the GWCI of acoustic neuromas was higher than that of the other two diseases, both statistically significant. A weak correlation between normalized BOT and the number of shots was found, suggesting that other factors beyond target shape influence plan complexity. Based on this study, shape differences exist among the considered diseases. Plan quality also varies by disease type. Pituitary adenomas are complex to plan, acoustic neuromas have better selectivity, and meningiomas have the shortest beam-on time. Factors beyond shape can also influence plan complexity.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147363620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biomedical Physics & Engineering Express
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1