卷积神经网络对甘蔗田间收割过程中出现的人为压力进行分类:案例研究

IF 4.2 2区 计算机科学 Q2 ROBOTICS Journal of Field Robotics Pub Date : 2024-06-04 DOI:10.1002/rob.22373
Rajesh U. Modi, Sukhbir Singh, Akhilesh K. Singh, Vallokkunnel A. Blessy
{"title":"卷积神经网络对甘蔗田间收割过程中出现的人为压力进行分类:案例研究","authors":"Rajesh U. Modi,&nbsp;Sukhbir Singh,&nbsp;Akhilesh K. Singh,&nbsp;Vallokkunnel A. Blessy","doi":"10.1002/rob.22373","DOIUrl":null,"url":null,"abstract":"<p>Assessing human stress in agriculture proves to be a complex and time-intensive endeavor within the field of ergonomics, particularly for the development of agricultural systems. This methodology involves the utilization of instrumentation and the establishment of a dedicated laboratory setup. The complexity arises from the need to capture and analyze various physiological and psychological indicators, such as heart rate (HR), muscle activity, and subjective feedback to comprehensively assess the impact of farm operations on subjects. The instrumentation typically includes wearable devices, sensors, and monitoring equipment to gather real-time data of subject during the performance of farm operations. Deep learning (DL) models currently achieve human performance levels on real-world face recognition tasks. In this study, we went beyond face recognition and experimented with the recognition of human stress based on facial features during the drudgery-prone agricultural operation of sugarcane harvesting. This is the first research study for deploying artificial intelligence-driven DL techniques to identify human stress in agriculture instead of monitoring several ergonomic characteristics. A total of 20 (10 each for male and female) subjects comprising 4300 augmented RGB images (215 per subject) were acquired during sugarcane harvesting seasons and then these images were deployed for training (80%) and validation (20%). Human stress and nonstress states were determined based on four ergonomic physiological parameters: heart rate (ΔHR), oxygen consumption rate (OCR), energy expenditure rate (EER), and acceptable workload (AWL). Stress was defined when ΔHR, OCR, EER, and AWL reached or exceeded certain standard threshold values. Four convolutional neural network-based DL models (1) DarkNet53, (2) InceptionV3, (3) MobileNetV2 and (4) ResNet50 were selected due to their remarkable feature extraction abilities, simple and effective implementation to edge computation devices. In all four DL models, training performance results delivered training accuracy ranging from 73.8% to 99.1% at combinations of two mini-batch sizes and four levels of epochs. The maximum training accuracies were 99.1%, 99.0%, 97.7%, and 95.4% at the combination of mini-batch size 16 and 25 epochs for DarkNet53, InceptionV3, ResNet50, and MobileNetV2, respectively. Due to the best performance, DarkNet53 was tested further on an independent data set of 100 images and found 89.8%–93.3% confident to classify stressed images for female subjects while 92.2%–94.5% for male subjects, though it was trained on the integrated data set. The comparative classification of the developed model and ergonomic measurements for stress classification was carried out with a net accuracy of 88% where there were few instances of wrong classifications.</p>","PeriodicalId":192,"journal":{"name":"Journal of Field Robotics","volume":"41 8","pages":"2530-2542"},"PeriodicalIF":4.2000,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Convolutional neural networks to classify human stress that occurs during in-field sugarcane harvesting: A case study\",\"authors\":\"Rajesh U. Modi,&nbsp;Sukhbir Singh,&nbsp;Akhilesh K. Singh,&nbsp;Vallokkunnel A. Blessy\",\"doi\":\"10.1002/rob.22373\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Assessing human stress in agriculture proves to be a complex and time-intensive endeavor within the field of ergonomics, particularly for the development of agricultural systems. This methodology involves the utilization of instrumentation and the establishment of a dedicated laboratory setup. The complexity arises from the need to capture and analyze various physiological and psychological indicators, such as heart rate (HR), muscle activity, and subjective feedback to comprehensively assess the impact of farm operations on subjects. The instrumentation typically includes wearable devices, sensors, and monitoring equipment to gather real-time data of subject during the performance of farm operations. Deep learning (DL) models currently achieve human performance levels on real-world face recognition tasks. In this study, we went beyond face recognition and experimented with the recognition of human stress based on facial features during the drudgery-prone agricultural operation of sugarcane harvesting. This is the first research study for deploying artificial intelligence-driven DL techniques to identify human stress in agriculture instead of monitoring several ergonomic characteristics. A total of 20 (10 each for male and female) subjects comprising 4300 augmented RGB images (215 per subject) were acquired during sugarcane harvesting seasons and then these images were deployed for training (80%) and validation (20%). Human stress and nonstress states were determined based on four ergonomic physiological parameters: heart rate (ΔHR), oxygen consumption rate (OCR), energy expenditure rate (EER), and acceptable workload (AWL). Stress was defined when ΔHR, OCR, EER, and AWL reached or exceeded certain standard threshold values. Four convolutional neural network-based DL models (1) DarkNet53, (2) InceptionV3, (3) MobileNetV2 and (4) ResNet50 were selected due to their remarkable feature extraction abilities, simple and effective implementation to edge computation devices. In all four DL models, training performance results delivered training accuracy ranging from 73.8% to 99.1% at combinations of two mini-batch sizes and four levels of epochs. The maximum training accuracies were 99.1%, 99.0%, 97.7%, and 95.4% at the combination of mini-batch size 16 and 25 epochs for DarkNet53, InceptionV3, ResNet50, and MobileNetV2, respectively. Due to the best performance, DarkNet53 was tested further on an independent data set of 100 images and found 89.8%–93.3% confident to classify stressed images for female subjects while 92.2%–94.5% for male subjects, though it was trained on the integrated data set. The comparative classification of the developed model and ergonomic measurements for stress classification was carried out with a net accuracy of 88% where there were few instances of wrong classifications.</p>\",\"PeriodicalId\":192,\"journal\":{\"name\":\"Journal of Field Robotics\",\"volume\":\"41 8\",\"pages\":\"2530-2542\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2024-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Field Robotics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/rob.22373\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ROBOTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Field Robotics","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rob.22373","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0

摘要

在工效学领域,尤其是在农业系统开发方面,对农业中人的压力进行评估是一项复杂且耗时的工作。这种方法涉及仪器的使用和专用实验室的建立。之所以复杂,是因为需要捕捉和分析各种生理和心理指标,如心率(HR)、肌肉活动和主观反馈,以全面评估农场作业对受试者的影响。仪器设备通常包括可穿戴设备、传感器和监控设备,用于收集受试者在执行农场操作过程中的实时数据。目前,深度学习(DL)模型在现实世界的人脸识别任务中达到了人类的性能水平。在本研究中,我们超越了人脸识别的范围,尝试基于人脸特征识别甘蔗收割这一容易产生疲劳的农业操作过程中人的压力。这是首次采用人工智能驱动的 DL 技术来识别农业中的人类压力,而不是监测几个人体工程学特征。研究人员在甘蔗收割季节共采集了 20 名(男女各 10 名)受试者的 4300 张增强 RGB 图像(每个受试者 215 张),然后将这些图像用于训练(80%)和验证(20%)。人体压力和非压力状态是根据四个人体工程学生理参数确定的:心率(ΔHR)、耗氧量(OCR)、能量消耗率(EER)和可接受工作量(AWL)。当 ΔHR、OCR、EER 和 AWL 达到或超过一定的标准阈值时,即定义为压力。由于四种基于卷积神经网络的 DL 模型(1)DarkNet53、(2)InceptionV3、(3)MobileNetV2 和(4)ResNet50 具有显著的特征提取能力,且可在边缘计算设备上简单有效地实施,因此被选中。在所有四种 DL 模型中,在两种迷你批量大小和四级历时的组合下,训练结果的准确率从 73.8% 到 99.1%。DarkNet53、InceptionV3、ResNet50和MobileNetV2在16和25个epochs的迷你批量组合下的最高训练精度分别为99.1%、99.0%、97.7%和95.4%。由于表现最佳,DarkNet53 在一个包含 100 张图像的独立数据集上进行了进一步测试,发现它对女性受试者压力图像的分类可信度为 89.8%-93.3%,而对男性受试者的分类可信度为 92.2%-94.5%,尽管它是在综合数据集上训练的。在压力分类方面,对所开发的模型和人体工程学测量结果进行了比较分类,净准确率为 88%,其中错误分类的情况很少。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Convolutional neural networks to classify human stress that occurs during in-field sugarcane harvesting: A case study

Assessing human stress in agriculture proves to be a complex and time-intensive endeavor within the field of ergonomics, particularly for the development of agricultural systems. This methodology involves the utilization of instrumentation and the establishment of a dedicated laboratory setup. The complexity arises from the need to capture and analyze various physiological and psychological indicators, such as heart rate (HR), muscle activity, and subjective feedback to comprehensively assess the impact of farm operations on subjects. The instrumentation typically includes wearable devices, sensors, and monitoring equipment to gather real-time data of subject during the performance of farm operations. Deep learning (DL) models currently achieve human performance levels on real-world face recognition tasks. In this study, we went beyond face recognition and experimented with the recognition of human stress based on facial features during the drudgery-prone agricultural operation of sugarcane harvesting. This is the first research study for deploying artificial intelligence-driven DL techniques to identify human stress in agriculture instead of monitoring several ergonomic characteristics. A total of 20 (10 each for male and female) subjects comprising 4300 augmented RGB images (215 per subject) were acquired during sugarcane harvesting seasons and then these images were deployed for training (80%) and validation (20%). Human stress and nonstress states were determined based on four ergonomic physiological parameters: heart rate (ΔHR), oxygen consumption rate (OCR), energy expenditure rate (EER), and acceptable workload (AWL). Stress was defined when ΔHR, OCR, EER, and AWL reached or exceeded certain standard threshold values. Four convolutional neural network-based DL models (1) DarkNet53, (2) InceptionV3, (3) MobileNetV2 and (4) ResNet50 were selected due to their remarkable feature extraction abilities, simple and effective implementation to edge computation devices. In all four DL models, training performance results delivered training accuracy ranging from 73.8% to 99.1% at combinations of two mini-batch sizes and four levels of epochs. The maximum training accuracies were 99.1%, 99.0%, 97.7%, and 95.4% at the combination of mini-batch size 16 and 25 epochs for DarkNet53, InceptionV3, ResNet50, and MobileNetV2, respectively. Due to the best performance, DarkNet53 was tested further on an independent data set of 100 images and found 89.8%–93.3% confident to classify stressed images for female subjects while 92.2%–94.5% for male subjects, though it was trained on the integrated data set. The comparative classification of the developed model and ergonomic measurements for stress classification was carried out with a net accuracy of 88% where there were few instances of wrong classifications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Field Robotics
Journal of Field Robotics 工程技术-机器人学
CiteScore
15.00
自引率
3.60%
发文量
80
审稿时长
6 months
期刊介绍: The Journal of Field Robotics seeks to promote scholarly publications dealing with the fundamentals of robotics in unstructured and dynamic environments. The Journal focuses on experimental robotics and encourages publication of work that has both theoretical and practical significance.
期刊最新文献
Issue Information Cover Image, Volume 41, Number 8, December 2024 Issue Information ForzaETH Race Stack—Scaled Autonomous Head‐to‐Head Racing on Fully Commercial Off‐the‐Shelf Hardware Research on Satellite Navigation Control of Six‐Crawler Machinery Based on Fuzzy PID Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1