SpermDet: Structure-Aware Network With Local Context Enhancement and Dual-Path Fusion for Object Detection in Sperm Images

IF 5.9 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Instrumentation and Measurement Pub Date : 2025-03-05 DOI:10.1109/TIM.2025.3544697
Hongyu Zhang;Zhujun Hu;Huaying Huang;Shuang Liu;Yunbo Rao;Qifei Wang;Naveed Ahmad
{"title":"SpermDet: Structure-Aware Network With Local Context Enhancement and Dual-Path Fusion for Object Detection in Sperm Images","authors":"Hongyu Zhang;Zhujun Hu;Huaying Huang;Shuang Liu;Yunbo Rao;Qifei Wang;Naveed Ahmad","doi":"10.1109/TIM.2025.3544697","DOIUrl":null,"url":null,"abstract":"Recently, deep-learning-based object detection models have been used to improve the detection performance in sperm images. However, these models encounter three primary challenges: 1) visual similarity between sperm and background noise; 2) neglecting critical local contextual features; and 3) diminishment of tiny sperm features when fusing deep features with an inaccurate foreground. In this article, we propose SpermDet to alleviate these issues. Specifically, we first develop a structure-aware alignment fusion (SAF) module to align and integrate structural information with RGB images, thus improving feature discriminability. Then, we introduce a local context enhancement block (LCEB) to effectively capture the crucial local contextual features of sperm. Furthermore, we design a semantic-aware dual-path fusion (SDF) module that uses both foreground and background information of deep layers to enhance semantic information and preserve detailed sperm features in shallow layers. Finally, we construct a dataset for sperm detection in the testicular biopsy scene, termed as SDTB. It contains 1341 images with 5548 instances, characterized by challenges of tiny sperm size and complex backgrounds. Extensive experiments on the SVIA, VISEM, and the proposed SDTB datasets demonstrate that SpermDet outperforms 24 recent models, achieving <inline-formula> <tex-math>$\\rm mAP_{50}$ </tex-math></inline-formula> scores of 77.7%, 26.3%, and 53.2%, respectively. Compared with the baseline model, SpermDet achieves <inline-formula> <tex-math>$\\rm mAP_{50}$ </tex-math></inline-formula> score improvements of 5.0%, 6.9%, and 9.2% on these datasets while reducing the parameters by 16.3%, which has the potential for application in clinical instruments. The code and dataset are publicly available at: <uri>https://github.com/Hong-yu-Zhang/SpermDet</uri>.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-14"},"PeriodicalIF":5.9000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10910089/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Recently, deep-learning-based object detection models have been used to improve the detection performance in sperm images. However, these models encounter three primary challenges: 1) visual similarity between sperm and background noise; 2) neglecting critical local contextual features; and 3) diminishment of tiny sperm features when fusing deep features with an inaccurate foreground. In this article, we propose SpermDet to alleviate these issues. Specifically, we first develop a structure-aware alignment fusion (SAF) module to align and integrate structural information with RGB images, thus improving feature discriminability. Then, we introduce a local context enhancement block (LCEB) to effectively capture the crucial local contextual features of sperm. Furthermore, we design a semantic-aware dual-path fusion (SDF) module that uses both foreground and background information of deep layers to enhance semantic information and preserve detailed sperm features in shallow layers. Finally, we construct a dataset for sperm detection in the testicular biopsy scene, termed as SDTB. It contains 1341 images with 5548 instances, characterized by challenges of tiny sperm size and complex backgrounds. Extensive experiments on the SVIA, VISEM, and the proposed SDTB datasets demonstrate that SpermDet outperforms 24 recent models, achieving $\rm mAP_{50}$ scores of 77.7%, 26.3%, and 53.2%, respectively. Compared with the baseline model, SpermDet achieves $\rm mAP_{50}$ score improvements of 5.0%, 6.9%, and 9.2% on these datasets while reducing the parameters by 16.3%, which has the potential for application in clinical instruments. The code and dataset are publicly available at: https://github.com/Hong-yu-Zhang/SpermDet.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于局部上下文增强和双路径融合的结构感知网络在精子图像中的目标检测
近年来,基于深度学习的目标检测模型被用于提高精子图像的检测性能。然而,这些模型遇到了三个主要挑战:1)精子和背景噪声之间的视觉相似性;2)忽略关键的局部语境特征;3)在融合深度特征和不准确的前景时,会减少微小的精子特征。在本文中,我们建议使用SpermDet来缓解这些问题。具体而言,我们首先开发了结构感知对齐融合(SAF)模块,将结构信息与RGB图像对齐和集成,从而提高特征的可分辨性。然后,我们引入了一个局部上下文增强块(LCEB)来有效地捕获精子的关键局部上下文特征。此外,我们设计了一个语义感知的双路径融合(SDF)模块,该模块利用深层的前景和背景信息来增强语义信息并保留浅层的详细精子特征。最后,我们构建了一个用于睾丸活检场景中精子检测的数据集,称为SDTB。它包含1341张图片,5548个实例,其特点是精子尺寸小,背景复杂。在SVIA、VISEM和SDTB数据集上的大量实验表明,SpermDet优于24个最近的模型,分别达到了77.7%、26.3%和53.2%的$\rm map_bb_0 $分数。与基线模型相比,SpermDet在这些数据集上实现了5.0%、6.9%和9.2%的$\rm mAP_{50}$评分提高,同时减少了16.3%的参数,具有在临床仪器中应用的潜力。代码和数据集可在:https://github.com/Hong-yu-Zhang/SpermDet上公开获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Instrumentation and Measurement
IEEE Transactions on Instrumentation and Measurement 工程技术-工程:电子与电气
CiteScore
9.00
自引率
23.20%
发文量
1294
审稿时长
3.9 months
期刊介绍: Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.
期刊最新文献
2026 Index IEEE Transactions on Instrumentation and Measurement Vol. 74 A Novel End-to-End Framework for Low-SNR FID Signal Denoising via Rank-Sequential Truncated Tensor Decomposition Corrections to “TAG: A Temporal Attentive Gait Network for Cross-View Gait Recognition” An Adaptive Joint Alignment Method of Angle Misalignment and Seafloor Transponder for Ultrashort Baseline Underwater Positioning Focus Improvement of Multireceiver SAS Based on Range-Doppler Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1