{"title":"Creating synthetic data sets for training of neural networks for automatic catch analysis in fisheries","authors":"Jonatan Sjølund Dyrstad, Elling Ruud Øye","doi":"10.1016/j.compag.2025.110160","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate estimation of catch is essential for sustainable fisheries. It ensures precise catch reporting, provides a better basis for stock assessment, and helps prevent overfishing. With recent advances in deep learning, this could be solved using computer vision, however, collecting and annotating data for different fisheries, all with diverse catch distributions and different imaging equipment, is expensive and time-consuming and is currently limiting the adoption of the technology. To address this issue, we propose the use of synthetic data sets, created in simulation, for training of neural networks for the task of automatic catch analysis. Although the domain is subject to large amounts of variation in the image data, we hypothesize that much of this variation is due to clutter and variations in the appearance of the fish as captured by the camera, rather than inherent variations in the raw material itself. As such, the variation can be covered effectively in data sets generated in simulation, without the need for large data sets of 3D-models for each species, which are also costly to produce. This is demonstrated by training a neural network for instance segmentation, instance classification and key point detection, solely on synthetic data created with only five 3D-models of fish. The neural network is evaluated on real data, gathered with a variety of sensors onboard different fishing vessels, demonstrating that it generalizes across different domains. This evaluation concludes that synthetic data can be a valuable addition to real data for computer vision applications for catch analysis.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"233 ","pages":"Article 110160"},"PeriodicalIF":7.7000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925002662","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate estimation of catch is essential for sustainable fisheries. It ensures precise catch reporting, provides a better basis for stock assessment, and helps prevent overfishing. With recent advances in deep learning, this could be solved using computer vision, however, collecting and annotating data for different fisheries, all with diverse catch distributions and different imaging equipment, is expensive and time-consuming and is currently limiting the adoption of the technology. To address this issue, we propose the use of synthetic data sets, created in simulation, for training of neural networks for the task of automatic catch analysis. Although the domain is subject to large amounts of variation in the image data, we hypothesize that much of this variation is due to clutter and variations in the appearance of the fish as captured by the camera, rather than inherent variations in the raw material itself. As such, the variation can be covered effectively in data sets generated in simulation, without the need for large data sets of 3D-models for each species, which are also costly to produce. This is demonstrated by training a neural network for instance segmentation, instance classification and key point detection, solely on synthetic data created with only five 3D-models of fish. The neural network is evaluated on real data, gathered with a variety of sensors onboard different fishing vessels, demonstrating that it generalizes across different domains. This evaluation concludes that synthetic data can be a valuable addition to real data for computer vision applications for catch analysis.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.