Prediction on Yellowfin Tuna (Thunnus albacares) Fishing Ground in Waters Near the Marshall Islands Based on SMOTETomek-RF
Authors: Meng Zhang, Liming Song, Chen Pan, Linhui Wang
Journal: Fisheries Oceanography, vol. 34, no. 2 (Impact Factor 1.9; JCR Q2, Fisheries; subject category: Agricultural and Forestry Sciences)
Publication date: 2024-10-21
Publication type: Journal Article
DOI: 10.1111/fog.12704
URL: https://onlinelibrary.wiley.com/doi/10.1111/fog.12704
Citation count: 0
Abstract
This study monitored 37 longliners fishing in waters near the Marshall Islands from 2020 to 2022 through the operation management system of Liancheng Overseas Fishery (Shenzhen) Co., Ltd. It developed predictive models of the relationship between catch per unit effort (CPUE) of yellowfin tuna (Thunnus albacares) and environmental data. The environmental data integrate 48 variables, including eddy kinetic energy, chlorophyll a concentration, sea surface height, and additional measures of vertical oceanic conditions, alongside spatiotemporal parameters (year, month, day, longitude, and latitude). Nine predictive models were built at each of four spatial resolutions (0.25° × 0.25°, 0.5° × 0.5°, 1° × 1°, and 2° × 2°): KNN, RF, GBDT, CART, LightGBM, XGBoost, CatBoost, AdaBoost, and Stacking (RF, KNN, GBDT, and LR). These models, with a daily time resolution, were trained on 75% of the data and tested on the remaining 25%. The optimal spatial resolution and model were determined by comparing model evaluation metrics across the four spatial resolutions. The SMOTETomek algorithm was then applied to resample the 75% training portion at the optimal spatial resolution, forming a new training dataset. This dataset was used to refit the model, which was then tested on the remaining 25% of the data. Results indicated that (1) the optimal spatial resolution is 0.25° × 0.25° and the optimal model is RF; (2) the SMOTETomek algorithm enhances the model's predictive performance; and (3) the resulting SMK-RF model, with accuracy (Acc) and AUC values of 76.73% and 82.47%, respectively, accurately predicts the central fishing grounds for yellowfin tuna, closely matching actual fishing activity.
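The resampling step described in the abstract can be illustrated with a minimal, standard-library sketch. It is not the authors' implementation: the two-feature toy data, the binary label (1 standing in for a minority "high CPUE" class), the per-class 75/25 split, and the helper names `smote`, `remove_tomek_links`, and `split` are all assumptions made for illustration. The point it shows is that SMOTE oversampling followed by Tomek-link removal is applied to the training portion only, while the held-out 25% stays untouched. In practice a library such as imbalanced-learn, which provides a `SMOTETomek` class, would likely be used instead, followed by a random forest classifier.

```python
import math
import random


def smote(X, y, minority=1, k=3, seed=0):
    """Add synthetic minority points until both classes have equal size."""
    rng = random.Random(seed)
    idx = [i for i, label in enumerate(y) if label == minority]
    n_new = max(len(y) - 2 * len(idx), 0)
    X_out, y_out = list(X), list(y)
    for _ in range(n_new):
        i = rng.choice(idx)
        # k nearest minority-class neighbours of sample i
        neighbours = sorted((j for j in idx if j != i),
                            key=lambda j: math.dist(X[i], X[j]))[:k]
        j = rng.choice(neighbours)
        t = rng.random()  # position along the segment from X[i] to X[j]
        X_out.append(tuple(a + t * (b - a) for a, b in zip(X[i], X[j])))
        y_out.append(minority)
    return X_out, y_out


def remove_tomek_links(X, y):
    """Drop both members of every Tomek link, i.e. every pair of mutual
    nearest neighbours that carry different labels."""
    n = len(X)
    nn = [min((j for j in range(n) if j != i),
              key=lambda j: math.dist(X[i], X[j])) for i in range(n)]
    linked = {i for i in range(n) if nn[nn[i]] == i and y[i] != y[nn[i]]}
    keep = [i for i in range(n) if i not in linked]
    return [X[i] for i in keep], [y[i] for i in keep]


def split(points, frac=0.75):
    """Return the first `frac` of the points and the remainder."""
    cut = int(frac * len(points))
    return points[:cut], points[cut:]


# Hypothetical toy data: 60 majority (label 0) and 20 minority (label 1) points.
rng = random.Random(42)
majority = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(60)]
minority = [(rng.gauss(1.5, 1), rng.gauss(1.5, 1)) for _ in range(20)]

# 75/25 split per class (an assumption; the abstract only states the ratio).
maj_train, maj_test = split(majority)
min_train, min_test = split(minority)
X_train = maj_train + min_train
y_train = [0] * len(maj_train) + [1] * len(min_train)
X_test = maj_test + min_test
y_test = [0] * len(maj_test) + [1] * len(min_test)

# Resample the training portion only; the test portion is left as-is.
X_bal, y_bal = smote(X_train, y_train)
X_res, y_res = remove_tomek_links(X_bal, y_bal)

print("train:", len(X_train), "after SMOTE:", len(X_bal),
      "after Tomek removal:", len(X_res), "test:", len(X_test))
```

Because each Tomek link pairs one sample from each class, removing both members keeps the classes balanced after cleaning. A classifier would then be fitted on `X_res`, `y_res` and scored on `X_test`, `y_test`.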
About the journal:
The international journal of the Japanese Society for Fisheries Oceanography, Fisheries Oceanography is designed to present a forum for the exchange of information amongst fisheries scientists worldwide.
Fisheries Oceanography:
presents original research articles relating the production and dynamics of fish populations to the marine environment
examines entire food chains - not just single species
identifies mechanisms controlling abundance
explores factors affecting the recruitment and abundance of fish species and all higher marine trophic levels