YOLO-SAG: An improved wildlife object detection algorithm based on YOLOv8n

IF 5.8 2区 环境科学与生态学 Q1 ECOLOGY Ecological Informatics Pub Date : 2024-08-22 DOI:10.1016/j.ecoinf.2024.102791
Lingli Chen , Gang Li , Shunkai Zhang , Wenjie Mao , Mei Zhang
{"title":"YOLO-SAG: An improved wildlife object detection algorithm based on YOLOv8n","authors":"Lingli Chen ,&nbsp;Gang Li ,&nbsp;Shunkai Zhang ,&nbsp;Wenjie Mao ,&nbsp;Mei Zhang","doi":"10.1016/j.ecoinf.2024.102791","DOIUrl":null,"url":null,"abstract":"<div><p>Wildlife conservation is crucial for maintaining biodiversity, ensuring ecosystem balance and stability, and fostering sustainable development. Currently, the use of infrared camera traps to monitor and capture photos of wildlife is a vital methodology in protecting and researching wildlife, and automatic detection and identification of animals within captured photographs are paramount. However, factors such as the complexity of the field environment and the varying sizes of animal targets lead to low detection accuracy, while high-precision detection models are hindered by high computational complexity and sluggish training speeds. This paper proposes a wildlife target detection algorithm based on improved YOLOv8n - YOLO-SAG, which aims to balance accuracy and speed. Training stability is enhanced by introducing the Softplus activation function, which increases detection accuracy; incorporating the AIFI enhances intra-scale feature interaction, reducing missed and false detections. Integrating the GSConv and VoV-GSCSP module lightens neck convolutions, reducing computational redundancy and balancing the computational and parametric quantities brought by the AIFI. Experimental results on a self-made wildlife dataset indicate that the YOLO-SAG achieves 94.9%, 90.9%, 96.8%, and 79.9% in Precision, Recall, [email protected], and [email protected]–0.95, respectively, which are 3.4%, 3.3%, 3.2%, and 4.9% higher than the original YOLOv8n. Inference and post-processing times reach 1.2 ms and 0.5 ms, a speedup of 25% and 54.5%, respectively, and the computation volume is only 7.2 GFLOPs, an 11.1% decrease.</p></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":null,"pages":null},"PeriodicalIF":5.8000,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1574954124003339/pdfft?md5=fabe82c2fff9fc5f7d0c7fd3b9cca85a&pid=1-s2.0-S1574954124003339-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954124003339","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Wildlife conservation is crucial for maintaining biodiversity, ensuring ecosystem balance and stability, and fostering sustainable development. Currently, the use of infrared camera traps to monitor and capture photos of wildlife is a vital methodology in protecting and researching wildlife, and automatic detection and identification of animals within captured photographs are paramount. However, factors such as the complexity of the field environment and the varying sizes of animal targets lead to low detection accuracy, while high-precision detection models are hindered by high computational complexity and sluggish training speeds. This paper proposes a wildlife target detection algorithm based on improved YOLOv8n - YOLO-SAG, which aims to balance accuracy and speed. Training stability is enhanced by introducing the Softplus activation function, which increases detection accuracy; incorporating the AIFI enhances intra-scale feature interaction, reducing missed and false detections. Integrating the GSConv and VoV-GSCSP module lightens neck convolutions, reducing computational redundancy and balancing the computational and parametric quantities brought by the AIFI. Experimental results on a self-made wildlife dataset indicate that the YOLO-SAG achieves 94.9%, 90.9%, 96.8%, and 79.9% in Precision, Recall, [email protected], and [email protected]–0.95, respectively, which are 3.4%, 3.3%, 3.2%, and 4.9% higher than the original YOLOv8n. Inference and post-processing times reach 1.2 ms and 0.5 ms, a speedup of 25% and 54.5%, respectively, and the computation volume is only 7.2 GFLOPs, an 11.1% decrease.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
YOLO-SAG:基于 YOLOv8n 的改进型野生动物物体检测算法
保护野生动物对于维护生物多样性、确保生态系统平衡和稳定以及促进可持续发展至关重要。目前,使用红外相机陷阱监测和捕捉野生动物照片是保护和研究野生动物的重要方法,而在捕捉到的照片中自动检测和识别动物则是重中之重。然而,野外环境的复杂性和动物目标大小不一等因素导致检测精度较低,而高精度检测模型则因计算复杂度高和训练速度慢而受到阻碍。本文提出了一种基于改进型 YOLOv8n 的野生动物目标检测算法--YOLO-SAG,旨在兼顾精度和速度。通过引入 Softplus 激活函数,增强了训练的稳定性,提高了检测的准确性;加入 AIFI 增强了尺度内的特征交互,减少了漏检和误检。GSConv 和 VoV-GSCSP 模块的集成减轻了颈部卷积,减少了计算冗余,平衡了 AIFI 带来的计算量和参数量。在自制野生动物数据集上的实验结果表明,YOLO-SAG 在精确度、召回率、[email protected]和[email protected]-0.95 方面分别达到了 94.9%、90.9%、96.8% 和 79.9%,比原始 YOLOv8n 分别高出 3.4%、3.3%、3.2% 和 4.9%。推理和后处理时间分别达到 1.2 ms 和 0.5 ms,分别加快了 25% 和 54.5%,计算量仅为 7.2 GFLOPs,减少了 11.1%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Ecological Informatics
Ecological Informatics 环境科学-生态学
CiteScore
8.30
自引率
11.80%
发文量
346
审稿时长
46 days
期刊介绍: The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change. The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.
期刊最新文献
The retrospective double-entry of a long-term ecological dataset Integrating infiltration processes in hybrid downscaling methods to estimate sub-surface soil moisture A deep learning pipeline for time-lapse camera monitoring of insects and their floral environments A complete framework for hyperbolic acoustic localization with application to northern bobwhite covey calls Impacts of LULC changes on runoff from rivers through a coupled SWAT and BiLSTM model: A case study in Zhanghe River Basin, China
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1