Taming the data deluge: A novel end-to-end deep learning system for classifying marine biological and environmental images

IF 2.1 3区地球科学 Q2 LIMNOLOGY Limnology and Oceanography: Methods Pub Date : 2023-11-10 DOI:10.1002/lom3.10591

Hongsheng Bi, Yunhao Cheng, Xuemin Cheng, Mark C. Benfield, David G. Kimmel, Haiyong Zheng, Sabrina Groves, Kezhen Ying

{"title":"Taming the data deluge: A novel end-to-end deep learning system for classifying marine biological and environmental images","authors":"Hongsheng Bi, Yunhao Cheng, Xuemin Cheng, Mark C. Benfield, David G. Kimmel, Haiyong Zheng, Sabrina Groves, Kezhen Ying","doi":"10.1002/lom3.10591","DOIUrl":null,"url":null,"abstract":"<p>Underwater imaging enables nondestructive plankton sampling at frequencies, durations, and resolutions unattainable by traditional methods. These systems necessitate automated processes to identify organisms efficiently. Early underwater image processing used a standard approach: binarizing images to segment targets, then integrating deep learning models for classification. While intuitive, this infrastructure has limitations in handling high concentrations of biotic and abiotic particles, rapid changes in dominant taxa, and highly variable target sizes. To address these challenges, we introduce a new framework that starts with a scene classifier to capture large within-image variation, such as disparities in the layout of particles and dominant taxa. After scene classification, scene-specific Mask regional convolutional neural network (Mask R-CNN) models are trained to separate target objects into different groups. The procedure allows information to be extracted from different image types, while minimizing potential bias for commonly occurring features. Using in situ coastal plankton images, we compared the scene-specific models to the Mask R-CNN model encompassing all scene categories as a single full model. Results showed that the scene-specific approach outperformed the full model by achieving a 20% accuracy improvement in complex noisy images. The full model yielded counts that were up to 78% lower than those enumerated by the scene-specific model for some small-sized plankton groups. We further tested the framework on images from a benthic video camera and an imaging sonar system with good results. The integration of scene classification, which groups similar images together, can improve the accuracy of detection and classification for complex marine biological images.</p>","PeriodicalId":18145,"journal":{"name":"Limnology and Oceanography: Methods","volume":"22 1","pages":"47-64"},"PeriodicalIF":2.1000,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Limnology and Oceanography: Methods","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/lom3.10591","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"LIMNOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Underwater imaging enables nondestructive plankton sampling at frequencies, durations, and resolutions unattainable by traditional methods. These systems necessitate automated processes to identify organisms efficiently. Early underwater image processing used a standard approach: binarizing images to segment targets, then integrating deep learning models for classification. While intuitive, this infrastructure has limitations in handling high concentrations of biotic and abiotic particles, rapid changes in dominant taxa, and highly variable target sizes. To address these challenges, we introduce a new framework that starts with a scene classifier to capture large within-image variation, such as disparities in the layout of particles and dominant taxa. After scene classification, scene-specific Mask regional convolutional neural network (Mask R-CNN) models are trained to separate target objects into different groups. The procedure allows information to be extracted from different image types, while minimizing potential bias for commonly occurring features. Using in situ coastal plankton images, we compared the scene-specific models to the Mask R-CNN model encompassing all scene categories as a single full model. Results showed that the scene-specific approach outperformed the full model by achieving a 20% accuracy improvement in complex noisy images. The full model yielded counts that were up to 78% lower than those enumerated by the scene-specific model for some small-sized plankton groups. We further tested the framework on images from a benthic video camera and an imaging sonar system with good results. The integration of scene classification, which groups similar images together, can improve the accuracy of detection and classification for complex marine biological images.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

驯服数据洪流：用于海洋生物和环境图像分类的新型端到端深度学习系统

水下成像能够以传统方法无法达到的频率、持续时间和分辨率对浮游生物进行无损采样。这些系统需要自动化流程来有效识别生物体。早期的水下图像处理采用标准方法：对图像进行二值化处理以分割目标，然后整合深度学习模型进行分类。这种基础架构虽然直观，但在处理高浓度的生物和非生物颗粒、优势类群的快速变化以及高度多变的目标大小方面存在局限性。为了应对这些挑战，我们引入了一个新框架，该框架以场景分类器为起点，捕捉图像内的巨大差异，如颗粒布局和优势类群的差异。场景分类后，训练特定场景的掩膜区域卷积神经网络（Mask R-CNN）模型，将目标物体分成不同的组。该程序允许从不同的图像类型中提取信息，同时最大限度地减少常见特征的潜在偏差。我们使用原位沿海浮游生物图像，比较了场景特定模型和作为单一完整模型包含所有场景类别的掩码 R-CNN 模型。结果表明，在复杂的噪声图像中，特定场景方法的准确率比完整模型高出 20%。对于一些小型浮游生物群，完整模型得出的计数比特定场景模型得出的计数低 78%。我们还在底栖摄像机和声纳成像系统的图像上对该框架进行了测试，结果良好。场景分类将相似的图像组合在一起，可以提高复杂海洋生物图像的检测和分类精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Limnology and Oceanography: Methods 地学-海洋学

CiteScore

4.80

自引率

3.70%

发文量

审稿时长

3 months

期刊介绍： Limnology and Oceanography: Methods (ISSN 1541-5856) is a companion to ASLO''s top-rated journal Limnology and Oceanography, and articles are held to the same high standards. In order to provide the most rapid publication consistent with high standards, Limnology and Oceanography: Methods appears in electronic format only, and the entire submission and review system is online. Articles are posted as soon as they are accepted and formatted for publication. Limnology and Oceanography: Methods will consider manuscripts whose primary focus is methodological, and that deal with problems in the aquatic sciences. Manuscripts may present new measurement equipment, techniques for analyzing observations or samples, methods for understanding and interpreting information, analyses of metadata to examine the effectiveness of approaches, invited and contributed reviews and syntheses, and techniques for communicating and teaching in the aquatic sciences.