{"title":"Rethinking lightweight sheep face recognition via network latency-accuracy tradeoff","authors":"Xiaopeng Li, Yichi Zhang, Shuqin Li","doi":"10.1016/j.compag.2024.109662","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning has greatly improved the performance of sheep face recognition, but existing recognition methods usually adopt deeper and wider networks to obtain better performance, resulting in heavy computational burden and slow inference speed. This paper proposes a very lightweight sheep face recognition network, referred to as VLFaceNet, which achieves state-of-the-art (SOTA) latency-accuracy tradeoff. The basic module of VLFaceNet is VL, which uses inexpensive linear operations to complement redundant features and reduces the model size and computational complexity through structural re-parameterization during inference, improving inference speed. VLDBlock is formed by concatenating VL and ECA channel attention to enhance the effectiveness of channel-level feature extraction. VLFaceNet is formed by stacking VL and VLDBlock. By fusing features of different scales of VLFaceNet, sheep faces of different scales can be recognized, improving the recognition performance of the model. To address the problem of high similarity and difficulty in distinguishing white sheep faces, this paper proposes a scaling feature enhancement method SFE, which changes the color distribution and texture of sheep face images, improving the distinguishability between sheep face images and thus the recognition performance of VLFaceNet. The recognition performance gains of multiple recognition models demonstrate the effectiveness of SFE. On a self-built dataset, VLFaceNet achieves the best latency-accuracy tradeoff with an inference latency of 2.58 ms and a recognition accuracy of 97.75 %. This research is expected to promote the application of deep learning-based recognition methods in livestock breeding.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"227 ","pages":"Article 109662"},"PeriodicalIF":7.7000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169924010536","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Deep learning has greatly improved the performance of sheep face recognition, but existing recognition methods usually adopt deeper and wider networks to obtain better performance, resulting in heavy computational burden and slow inference speed. This paper proposes a very lightweight sheep face recognition network, referred to as VLFaceNet, which achieves state-of-the-art (SOTA) latency-accuracy tradeoff. The basic module of VLFaceNet is VL, which uses inexpensive linear operations to complement redundant features and reduces the model size and computational complexity through structural re-parameterization during inference, improving inference speed. VLDBlock is formed by concatenating VL and ECA channel attention to enhance the effectiveness of channel-level feature extraction. VLFaceNet is formed by stacking VL and VLDBlock. By fusing features of different scales of VLFaceNet, sheep faces of different scales can be recognized, improving the recognition performance of the model. To address the problem of high similarity and difficulty in distinguishing white sheep faces, this paper proposes a scaling feature enhancement method SFE, which changes the color distribution and texture of sheep face images, improving the distinguishability between sheep face images and thus the recognition performance of VLFaceNet. The recognition performance gains of multiple recognition models demonstrate the effectiveness of SFE. On a self-built dataset, VLFaceNet achieves the best latency-accuracy tradeoff with an inference latency of 2.58 ms and a recognition accuracy of 97.75 %. This research is expected to promote the application of deep learning-based recognition methods in livestock breeding.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.