Multiscale convolutional transformer for robust detection of aquaculture defects

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Expert Systems with Applications Pub Date : 2025-05-10 Epub Date: 2025-02-17 DOI:10.1016/j.eswa.2025.126820
Wilayat Khan , Taimur Hassan , Mobeen Ur Rehman , Mohammad Alsaffar , Irfan Hussain
{"title":"Multiscale convolutional transformer for robust detection of aquaculture defects","authors":"Wilayat Khan ,&nbsp;Taimur Hassan ,&nbsp;Mobeen Ur Rehman ,&nbsp;Mohammad Alsaffar ,&nbsp;Irfan Hussain","doi":"10.1016/j.eswa.2025.126820","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate identification of aquatic defects is paramount for ensuring the safety of marine life within aquaculture environment. However, due to large disparities between photographic and underwater imagery, conventional deep learning models, employed to monitor aquatic defects, produces inadequate recognition performance. Furthermore, they require extensive amount of ground truth supervision on large-scale datasets which limits their scalability in the real-world. To overcome these issues, this paper proposes a novel convolutional transformer architecture that combines multi-scale convolutional feature representations with the attentional projections to robustly recognize aquatic defects from the underwater imagery irrespective of the background clutter, color distortion and scanner specifications. Moreover, unlike the conventional fully supervised methods, the proposed model leverages self-supervision through its prior-learned experiences to perform the aquatic defects extraction tasks across different datasets without incurring additional ground truth labeling and re-training costs. The proposed model consistently outperforms state-of-the-art methods by achieving superior mean average precision scores of 0.72, 0.74, 0.80, and 0.82 across NDv1, NDv2, LABUST, and KU datasets, respectively. These results reflect the effectiveness of the proposed approach in accurately identifying and delineating aquaculture defects across diverse underwater environments.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"273 ","pages":"Article 126820"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425004427","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/17 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate identification of aquatic defects is paramount for ensuring the safety of marine life within aquaculture environment. However, due to large disparities between photographic and underwater imagery, conventional deep learning models, employed to monitor aquatic defects, produces inadequate recognition performance. Furthermore, they require extensive amount of ground truth supervision on large-scale datasets which limits their scalability in the real-world. To overcome these issues, this paper proposes a novel convolutional transformer architecture that combines multi-scale convolutional feature representations with the attentional projections to robustly recognize aquatic defects from the underwater imagery irrespective of the background clutter, color distortion and scanner specifications. Moreover, unlike the conventional fully supervised methods, the proposed model leverages self-supervision through its prior-learned experiences to perform the aquatic defects extraction tasks across different datasets without incurring additional ground truth labeling and re-training costs. The proposed model consistently outperforms state-of-the-art methods by achieving superior mean average precision scores of 0.72, 0.74, 0.80, and 0.82 across NDv1, NDv2, LABUST, and KU datasets, respectively. These results reflect the effectiveness of the proposed approach in accurately identifying and delineating aquaculture defects across diverse underwater environments.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
水产养殖缺陷鲁棒检测的多尺度卷积变压器
准确识别水体缺陷对于确保水产养殖环境中海洋生物的安全至关重要。然而,由于照片和水下图像之间的巨大差异,传统的深度学习模型用于监测水生缺陷,产生不足的识别性能。此外,它们需要对大规模数据集进行大量的地面真相监督,这限制了它们在现实世界中的可扩展性。为了克服这些问题,本文提出了一种新颖的卷积变压器结构,该结构将多尺度卷积特征表示与注意投影相结合,可以在不考虑背景杂波、颜色失真和扫描仪规格的情况下,从水下图像中鲁棒地识别水中缺陷。此外,与传统的完全监督方法不同,该模型通过其先前学习的经验利用自我监督来执行跨不同数据集的水生缺陷提取任务,而无需额外的地面真值标记和重新训练成本。该模型在NDv1、NDv2、LABUST和KU数据集上的平均精度得分分别为0.72、0.74、0.80和0.82,始终优于最先进的方法。这些结果反映了所提出的方法在不同水下环境中准确识别和描述水产养殖缺陷的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
期刊最新文献
The cold chain vehicle routing problem in omni-channel retailing A hybrid CNN–Transformer model for pedestrian attribute recognition with multiscale features and Class-Specific residual attention Multi-sensory emotion modulation for alleviation of driver anger: a physiological and behavioural study DPSSNet: a dual-path state space architecture for resolving the context-detail dilemma in high-resolution mangrove mapping Frequency-aware and cross-mamba-enhanced medical image fusion for a real-time surgical navigation framework
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1