Lingli Chen , Gang Li , Shunkai Zhang , Wenjie Mao , Mei Zhang
{"title":"YOLO-SAG: An improved wildlife object detection algorithm based on YOLOv8n","authors":"Lingli Chen , Gang Li , Shunkai Zhang , Wenjie Mao , Mei Zhang","doi":"10.1016/j.ecoinf.2024.102791","DOIUrl":null,"url":null,"abstract":"<div><p>Wildlife conservation is crucial for maintaining biodiversity, ensuring ecosystem balance and stability, and fostering sustainable development. Currently, the use of infrared camera traps to monitor and capture photos of wildlife is a vital methodology in protecting and researching wildlife, and automatic detection and identification of animals within captured photographs are paramount. However, factors such as the complexity of the field environment and the varying sizes of animal targets lead to low detection accuracy, while high-precision detection models are hindered by high computational complexity and sluggish training speeds. This paper proposes a wildlife target detection algorithm based on improved YOLOv8n - YOLO-SAG, which aims to balance accuracy and speed. Training stability is enhanced by introducing the Softplus activation function, which increases detection accuracy; incorporating the AIFI enhances intra-scale feature interaction, reducing missed and false detections. Integrating the GSConv and VoV-GSCSP module lightens neck convolutions, reducing computational redundancy and balancing the computational and parametric quantities brought by the AIFI. Experimental results on a self-made wildlife dataset indicate that the YOLO-SAG achieves 94.9%, 90.9%, 96.8%, and 79.9% in Precision, Recall, [email protected], and [email protected]–0.95, respectively, which are 3.4%, 3.3%, 3.2%, and 4.9% higher than the original YOLOv8n. Inference and post-processing times reach 1.2 ms and 0.5 ms, a speedup of 25% and 54.5%, respectively, and the computation volume is only 7.2 GFLOPs, an 11.1% decrease.</p></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":null,"pages":null},"PeriodicalIF":5.8000,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1574954124003339/pdfft?md5=fabe82c2fff9fc5f7d0c7fd3b9cca85a&pid=1-s2.0-S1574954124003339-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954124003339","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Wildlife conservation is crucial for maintaining biodiversity, ensuring ecosystem balance and stability, and fostering sustainable development. Currently, the use of infrared camera traps to monitor and capture photos of wildlife is a vital methodology in protecting and researching wildlife, and automatic detection and identification of animals within captured photographs are paramount. However, factors such as the complexity of the field environment and the varying sizes of animal targets lead to low detection accuracy, while high-precision detection models are hindered by high computational complexity and sluggish training speeds. This paper proposes a wildlife target detection algorithm based on improved YOLOv8n - YOLO-SAG, which aims to balance accuracy and speed. Training stability is enhanced by introducing the Softplus activation function, which increases detection accuracy; incorporating the AIFI enhances intra-scale feature interaction, reducing missed and false detections. Integrating the GSConv and VoV-GSCSP module lightens neck convolutions, reducing computational redundancy and balancing the computational and parametric quantities brought by the AIFI. Experimental results on a self-made wildlife dataset indicate that the YOLO-SAG achieves 94.9%, 90.9%, 96.8%, and 79.9% in Precision, Recall, [email protected], and [email protected]–0.95, respectively, which are 3.4%, 3.3%, 3.2%, and 4.9% higher than the original YOLOv8n. Inference and post-processing times reach 1.2 ms and 0.5 ms, a speedup of 25% and 54.5%, respectively, and the computation volume is only 7.2 GFLOPs, an 11.1% decrease.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.