Lingling Yang, Xingshi Xu, Jizheng Zhao, Huaibo Song
{"title":"Fusion of RetinaFace and improved FaceNet for individual cow identification in natural scenes","authors":"Lingling Yang, Xingshi Xu, Jizheng Zhao, Huaibo Song","doi":"10.1016/j.inpa.2023.09.001","DOIUrl":null,"url":null,"abstract":"<div><div>Cows’ posture change is the fatal influencing factor for accurate identification of individual cows. To achieve non-contact, high-precision detection and identification of individual cows in farm environment, a cow individual identification method by the fusion of RetinaFace and improved FaceNet was proposed. MobileNet-enhanced RetinaFace was applied to ameliorate the impact of output channel quantity and convolution kernel dynamics using depthwise convolution combined with pointwise convolution. Regression predictions of bovine facial features and keypoints were generated under varying distances, scales and sizes. FaceNet's core feature network was enhanced through MobileNet integration, and the loss function was jointly optimized with Cross Entropy Loss and Triplet Loss to achieve a quicker and more stable convergence curve. The distances between the generated embedding vectors of cow facial features were corresponding to the similarity between cow faces, enabling accurate matching. RetinaFace exhibited detection false negative rates of 2.67%, 0.66%, 2.67%, and 3.33% under conditions of occlusion, no occlusion, low light, and bright light for cow facial detection. For cow facial pattern detection, the false negative rates for black and white patterns, pure black and pure white were 1.33%, 6.00% and 8.00%, respectively. Regarding cow facial posture changes, the false negative rates for face upward, bowing down, profile, and normal posture were 1.33%, 1.33%, 4.00% and 0.66%, respectively. Improved FaceNet model achieved an accuray of 99.50% on training set and 83.60% on test set. In comparison to YOLOX, the recognition model presented in this research demonstrated increased accuracy in cow facial detection under occlusion, no occlusion and strong lighting conditions by 2.67%, 0.40%, and 0.40%, respectively. Moreover, the accuracy for patterns with pure black and pure white tones surpassed that of YOLOX by 1.06% and 5.71%, correspondingly. Additionally, the accuracy rates for face upward, bowing down, profile and normal posture were higher than YOLOX by 2.00%, 3.34%, 2.66% and 0.40%, respectively. The proposed model demonstrates the proficiency in accurately identifying individual cows in natural scenes.</div></div>","PeriodicalId":53443,"journal":{"name":"Information Processing in Agriculture","volume":"11 4","pages":"Pages 512-523"},"PeriodicalIF":7.7000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing in Agriculture","FirstCategoryId":"1091","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214317323000653","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Cows’ posture change is the fatal influencing factor for accurate identification of individual cows. To achieve non-contact, high-precision detection and identification of individual cows in farm environment, a cow individual identification method by the fusion of RetinaFace and improved FaceNet was proposed. MobileNet-enhanced RetinaFace was applied to ameliorate the impact of output channel quantity and convolution kernel dynamics using depthwise convolution combined with pointwise convolution. Regression predictions of bovine facial features and keypoints were generated under varying distances, scales and sizes. FaceNet's core feature network was enhanced through MobileNet integration, and the loss function was jointly optimized with Cross Entropy Loss and Triplet Loss to achieve a quicker and more stable convergence curve. The distances between the generated embedding vectors of cow facial features were corresponding to the similarity between cow faces, enabling accurate matching. RetinaFace exhibited detection false negative rates of 2.67%, 0.66%, 2.67%, and 3.33% under conditions of occlusion, no occlusion, low light, and bright light for cow facial detection. For cow facial pattern detection, the false negative rates for black and white patterns, pure black and pure white were 1.33%, 6.00% and 8.00%, respectively. Regarding cow facial posture changes, the false negative rates for face upward, bowing down, profile, and normal posture were 1.33%, 1.33%, 4.00% and 0.66%, respectively. Improved FaceNet model achieved an accuray of 99.50% on training set and 83.60% on test set. In comparison to YOLOX, the recognition model presented in this research demonstrated increased accuracy in cow facial detection under occlusion, no occlusion and strong lighting conditions by 2.67%, 0.40%, and 0.40%, respectively. Moreover, the accuracy for patterns with pure black and pure white tones surpassed that of YOLOX by 1.06% and 5.71%, correspondingly. Additionally, the accuracy rates for face upward, bowing down, profile and normal posture were higher than YOLOX by 2.00%, 3.34%, 2.66% and 0.40%, respectively. The proposed model demonstrates the proficiency in accurately identifying individual cows in natural scenes.
期刊介绍:
Information Processing in Agriculture (IPA) was established in 2013 and it encourages the development towards a science and technology of information processing in agriculture, through the following aims: • Promote the use of knowledge and methods from the information processing technologies in the agriculture; • Illustrate the experiences and publications of the institutes, universities and government, and also the profitable technologies on agriculture; • Provide opportunities and platform for exchanging knowledge, strategies and experiences among the researchers in information processing worldwide; • Promote and encourage interactions among agriculture Scientists, Meteorologists, Biologists (Pathologists/Entomologists) with IT Professionals and other stakeholders to develop and implement methods, techniques, tools, and issues related to information processing technology in agriculture; • Create and promote expert groups for development of agro-meteorological databases, crop and livestock modelling and applications for development of crop performance based decision support system. Topics of interest include, but are not limited to: • Smart Sensor and Wireless Sensor Network • Remote Sensing • Simulation, Optimization, Modeling and Automatic Control • Decision Support Systems, Intelligent Systems and Artificial Intelligence • Computer Vision and Image Processing • Inspection and Traceability for Food Quality • Precision Agriculture and Intelligent Instrument • The Internet of Things and Cloud Computing • Big Data and Data Mining