Research on Improved VGG-16 Model Based on Transfer Learning for Acoustic Image Recognition of Underwater Search and Rescue Targets

IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Pub Date : 2024-09-12 DOI:10.1109/JSTARS.2024.3459928
Xu Liu;Hanhao Zhu;Weihua Song;Jiahui Wang;Lengleng Yan;Kelin Wang
{"title":"Research on Improved VGG-16 Model Based on Transfer Learning for Acoustic Image Recognition of Underwater Search and Rescue Targets","authors":"Xu Liu;Hanhao Zhu;Weihua Song;Jiahui Wang;Lengleng Yan;Kelin Wang","doi":"10.1109/JSTARS.2024.3459928","DOIUrl":null,"url":null,"abstract":"The advancement of underwater search and rescue technology has made target image recognition crucial for improving efficiency and accuracy in side-scan sonar operations. However, the complexity of the underwater environment presents challenges, such as high computational resource requirements, scarcity of samples, and uneven data distribution. To address these challenges, this article proposes an improved visual geometry group 16 (VGG-16) model based on transfer learning for target image recognition in underwater search and rescue. First, the side-scan sonar target image data are processed through manual supervised segmentation, noise addition, and normalization to enhance data diversity and quantity. Second, the VGG-16 model structure is lightweighted and batch normalization is incorporated to enhance training efficiency. Finally, the improved VGG-16 model is trained and tested using a frozen transfer learning strategy on a small-sample side-scan sonar target image dataset. Results show that compared with the traditional machine learning and VGG-16 models, our proposed transfer learning improved VGG-16 model exhibits higher performance in terms of efficiency and accuracy in underwater search and rescue target image recognition. Its recognition accuracy reaches 97.70%, with a faster convergence speed. Additionally, its average precision value is 97.14%, representing the improvements of 9.50% and 6.40% over VGG-16 and improved VGG-16 models, respectively. This indicates the effectiveness and feasibility of our approach in enhancing model recognition capability and training efficiency, validating its practical application potential.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"17 ","pages":"18112-18128"},"PeriodicalIF":5.3000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10679604","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10679604/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

The advancement of underwater search and rescue technology has made target image recognition crucial for improving efficiency and accuracy in side-scan sonar operations. However, the complexity of the underwater environment presents challenges, such as high computational resource requirements, scarcity of samples, and uneven data distribution. To address these challenges, this article proposes an improved visual geometry group 16 (VGG-16) model based on transfer learning for target image recognition in underwater search and rescue. First, the side-scan sonar target image data are processed through manual supervised segmentation, noise addition, and normalization to enhance data diversity and quantity. Second, the VGG-16 model structure is lightweighted and batch normalization is incorporated to enhance training efficiency. Finally, the improved VGG-16 model is trained and tested using a frozen transfer learning strategy on a small-sample side-scan sonar target image dataset. Results show that compared with the traditional machine learning and VGG-16 models, our proposed transfer learning improved VGG-16 model exhibits higher performance in terms of efficiency and accuracy in underwater search and rescue target image recognition. Its recognition accuracy reaches 97.70%, with a faster convergence speed. Additionally, its average precision value is 97.14%, representing the improvements of 9.50% and 6.40% over VGG-16 and improved VGG-16 models, respectively. This indicates the effectiveness and feasibility of our approach in enhancing model recognition capability and training efficiency, validating its practical application potential.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于迁移学习的改进型 VGG-16 模型在水下搜救目标声学图像识别中的应用研究
水下搜救技术的发展使目标图像识别成为提高侧扫声纳操作效率和精度的关键。然而,水下环境的复杂性带来了各种挑战,如计算资源要求高、样本稀缺、数据分布不均等。针对这些挑战,本文提出了一种基于迁移学习的改进型视觉几何群 16(VGG-16)模型,用于水下搜救中的目标图像识别。首先,对侧扫声纳目标图像数据进行人工监督分割、噪声添加和归一化处理,以提高数据的多样性和数量。其次,对 VGG-16 模型结构进行轻量化处理,并加入批量归一化处理,以提高训练效率。最后,在小样本侧扫声纳目标图像数据集上使用冻结迁移学习策略对改进后的 VGG-16 模型进行训练和测试。结果表明,与传统的机器学习和 VGG-16 模型相比,我们提出的迁移学习改进 VGG-16 模型在水下搜救目标图像识别的效率和准确率方面表现出更高的性能。其识别准确率达到 97.70%,收敛速度更快。此外,其平均精度值为 97.14%,比 VGG-16 模型和改进 VGG-16 模型分别提高了 9.50% 和 6.40%。这表明我们的方法在提高模型识别能力和训练效率方面具有有效性和可行性,验证了其实际应用潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
9.30
自引率
10.90%
发文量
563
审稿时长
4.7 months
期刊介绍: The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.
期刊最新文献
MD2F-Mamba: Multidirectional Depthwise Convolution and Dual-Branch Mamba Feature Fusion Networks for Hyperspectral Image Classification Collaborative Spatiotemporal Anchors and Key Feature Enhancement for UAV Tracking An Advanced Algorithm for Constructing Elevation Change Time Series With Satellite Altimetry Observations A Model for Land Cover Ecological State Assessment of Southern Ukraine Based on Remote Data for Analyzing the Consequences of the Kakhovka Reservoir Shallowing Assessing Geo-Foundational Models for Flood Inundation Mapping: Benchmarking Models for Sentinel-1, Sentinel-2, and Planetscope
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1