Wilayat Khan , Taimur Hassan , Mobeen Ur Rehman , Mohammad Alsaffar , Irfan Hussain
{"title":"Multiscale convolutional transformer for robust detection of aquaculture defects","authors":"Wilayat Khan , Taimur Hassan , Mobeen Ur Rehman , Mohammad Alsaffar , Irfan Hussain","doi":"10.1016/j.eswa.2025.126820","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate identification of aquatic defects is paramount for ensuring the safety of marine life within aquaculture environment. However, due to large disparities between photographic and underwater imagery, conventional deep learning models, employed to monitor aquatic defects, produces inadequate recognition performance. Furthermore, they require extensive amount of ground truth supervision on large-scale datasets which limits their scalability in the real-world. To overcome these issues, this paper proposes a novel convolutional transformer architecture that combines multi-scale convolutional feature representations with the attentional projections to robustly recognize aquatic defects from the underwater imagery irrespective of the background clutter, color distortion and scanner specifications. Moreover, unlike the conventional fully supervised methods, the proposed model leverages self-supervision through its prior-learned experiences to perform the aquatic defects extraction tasks across different datasets without incurring additional ground truth labeling and re-training costs. The proposed model consistently outperforms state-of-the-art methods by achieving superior mean average precision scores of 0.72, 0.74, 0.80, and 0.82 across NDv1, NDv2, LABUST, and KU datasets, respectively. These results reflect the effectiveness of the proposed approach in accurately identifying and delineating aquaculture defects across diverse underwater environments.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"273 ","pages":"Article 126820"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425004427","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate identification of aquatic defects is paramount for ensuring the safety of marine life within aquaculture environment. However, due to large disparities between photographic and underwater imagery, conventional deep learning models, employed to monitor aquatic defects, produces inadequate recognition performance. Furthermore, they require extensive amount of ground truth supervision on large-scale datasets which limits their scalability in the real-world. To overcome these issues, this paper proposes a novel convolutional transformer architecture that combines multi-scale convolutional feature representations with the attentional projections to robustly recognize aquatic defects from the underwater imagery irrespective of the background clutter, color distortion and scanner specifications. Moreover, unlike the conventional fully supervised methods, the proposed model leverages self-supervision through its prior-learned experiences to perform the aquatic defects extraction tasks across different datasets without incurring additional ground truth labeling and re-training costs. The proposed model consistently outperforms state-of-the-art methods by achieving superior mean average precision scores of 0.72, 0.74, 0.80, and 0.82 across NDv1, NDv2, LABUST, and KU datasets, respectively. These results reflect the effectiveness of the proposed approach in accurately identifying and delineating aquaculture defects across diverse underwater environments.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.