林区数码照片中重叠树木图像的分割

Igor V. Petukhov, Konstantin O. Ivanov, Dmitry M. Vorozhtsov, Alexey A. Rozhentsov, Nataliya I. Rozhentsova, Ludmila A. Steshina
{"title":"林区数码照片中重叠树木图像的分割","authors":"Igor V. Petukhov, Konstantin O. Ivanov, Dmitry M. Vorozhtsov, Alexey A. Rozhentsov, Nataliya I. Rozhentsova, Ludmila A. Steshina","doi":"10.37482/0536-1036-2024-1-126-140","DOIUrl":null,"url":null,"abstract":"The use of decision support systems based on computer vision and artificial intelligence significantly improves the working conditions for the operators of technological machines in the timber sector, whose work implies high intensity and psycho-emotional overload. By means of computer vision and artificial intelligence the operator can quickly and easily obtain the data on the state of the cutting area and adopt the optimal solution for holding the working operation. This facilitates his work and reduces the time spent searching and analyzing the data on the cutting area. Meanwhile, one of the key elements of such a system is a subsystem for automatic segmentation of objects in the photograph. We have explored the possibility of segmenting overlapping objects in the photographs of forest areas using a convolutional neural network based on the Mask R-CNN architecture. Unlike in most works on similar topics, the objects of this study are color photographs taken by an RGB camera rather than a lidar. This creates the prospect for reducing the cost of hardware and software systems used to support decision-making by the operators of logging machines. The images of the stems and crowns of coniferous and deciduous trees overlapping each other are the segmented objects under consideration. Using the GIMP graphic editor, we have manually marked the color photographs depicting a total of 134 trees of 4 different species: spruce, aspen, birch and pine. Utilizing the developed database, we have carried out an experiment to further train the Mask R-CNN convolutional neural network for segmentation of overlapping parts of the trees in the digital photographs of forest areas. The neural network has been pre-trained using the Microsoft COCO dataset containing more than 200,000 images of 80 different classes of objects such as people, cars, animals and various items. While training the neural network, the images supplied to its input were subjected to a series of various linear and nonlinear geometric transformations, which made it possible to increase the volume of training data by 11 times. As a result, the accuracy of segmentation of the images of the stems and crowns of coniferous and deciduous trees overlapping each other has reached 79 %, which allows the use of neural networks based on a similar architecture in decision support systems for logging machine operators.","PeriodicalId":508281,"journal":{"name":"Lesnoy Zhurnal (Forestry Journal)","volume":"109 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Segmentation of Overlapping Tree Images in the Digital Photographs of Forest Areas\",\"authors\":\"Igor V. Petukhov, Konstantin O. Ivanov, Dmitry M. Vorozhtsov, Alexey A. Rozhentsov, Nataliya I. Rozhentsova, Ludmila A. Steshina\",\"doi\":\"10.37482/0536-1036-2024-1-126-140\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of decision support systems based on computer vision and artificial intelligence significantly improves the working conditions for the operators of technological machines in the timber sector, whose work implies high intensity and psycho-emotional overload. By means of computer vision and artificial intelligence the operator can quickly and easily obtain the data on the state of the cutting area and adopt the optimal solution for holding the working operation. This facilitates his work and reduces the time spent searching and analyzing the data on the cutting area. Meanwhile, one of the key elements of such a system is a subsystem for automatic segmentation of objects in the photograph. We have explored the possibility of segmenting overlapping objects in the photographs of forest areas using a convolutional neural network based on the Mask R-CNN architecture. Unlike in most works on similar topics, the objects of this study are color photographs taken by an RGB camera rather than a lidar. This creates the prospect for reducing the cost of hardware and software systems used to support decision-making by the operators of logging machines. The images of the stems and crowns of coniferous and deciduous trees overlapping each other are the segmented objects under consideration. Using the GIMP graphic editor, we have manually marked the color photographs depicting a total of 134 trees of 4 different species: spruce, aspen, birch and pine. Utilizing the developed database, we have carried out an experiment to further train the Mask R-CNN convolutional neural network for segmentation of overlapping parts of the trees in the digital photographs of forest areas. The neural network has been pre-trained using the Microsoft COCO dataset containing more than 200,000 images of 80 different classes of objects such as people, cars, animals and various items. While training the neural network, the images supplied to its input were subjected to a series of various linear and nonlinear geometric transformations, which made it possible to increase the volume of training data by 11 times. As a result, the accuracy of segmentation of the images of the stems and crowns of coniferous and deciduous trees overlapping each other has reached 79 %, which allows the use of neural networks based on a similar architecture in decision support systems for logging machine operators.\",\"PeriodicalId\":508281,\"journal\":{\"name\":\"Lesnoy Zhurnal (Forestry Journal)\",\"volume\":\"109 \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Lesnoy Zhurnal (Forestry Journal)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.37482/0536-1036-2024-1-126-140\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lesnoy Zhurnal (Forestry Journal)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37482/0536-1036-2024-1-126-140","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

基于计算机视觉和人工智能的决策支持系统的使用极大地改善了木材行业技术机械操作员的工作条件,因为他们的工作意味着高强度和超负荷的心理情绪。通过计算机视觉和人工智能,操作员可以快速、轻松地获取有关切割区域状态的数据,并采用最佳方案进行作业。这既方便了操作员的工作,又减少了搜索和分析切割区域数据的时间。同时,这种系统的关键要素之一是自动分割照片中物体的子系统。我们探索了使用基于掩码 R-CNN 架构的卷积神经网络分割林区照片中重叠物体的可能性。与大多数类似主题的研究不同,本研究的对象是由 RGB 摄像机而非激光雷达拍摄的彩色照片。这为降低用于支持测井机操作员决策的硬件和软件系统的成本创造了前景。针叶树和落叶树的茎干和树冠相互重叠的图像就是我们要考虑的分割对象。我们使用 GIMP 图形编辑器,对云杉、杨树、桦树和松树 4 个不同树种共 134 棵树木的彩色照片进行了人工标注。利用开发的数据库,我们进行了一项实验,进一步训练用于分割林区数码照片中树木重叠部分的 Mask R-CNN 卷积神经网络。我们使用微软 COCO 数据集对神经网络进行了预训练,该数据集包含 80 种不同类别物体的 20 多万张图像,如人物、汽车、动物和各种物品。在训练神经网络时,对输入的图像进行了一系列不同的线性和非线性几何变换,从而使训练数据量增加了 11 倍。因此,对针叶树和落叶树的茎干和树冠相互重叠的图像进行分割的准确率达到了 79%,这使得基于类似结构的神经网络可以用于测井机操作员的决策支持系统。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Segmentation of Overlapping Tree Images in the Digital Photographs of Forest Areas
The use of decision support systems based on computer vision and artificial intelligence significantly improves the working conditions for the operators of technological machines in the timber sector, whose work implies high intensity and psycho-emotional overload. By means of computer vision and artificial intelligence the operator can quickly and easily obtain the data on the state of the cutting area and adopt the optimal solution for holding the working operation. This facilitates his work and reduces the time spent searching and analyzing the data on the cutting area. Meanwhile, one of the key elements of such a system is a subsystem for automatic segmentation of objects in the photograph. We have explored the possibility of segmenting overlapping objects in the photographs of forest areas using a convolutional neural network based on the Mask R-CNN architecture. Unlike in most works on similar topics, the objects of this study are color photographs taken by an RGB camera rather than a lidar. This creates the prospect for reducing the cost of hardware and software systems used to support decision-making by the operators of logging machines. The images of the stems and crowns of coniferous and deciduous trees overlapping each other are the segmented objects under consideration. Using the GIMP graphic editor, we have manually marked the color photographs depicting a total of 134 trees of 4 different species: spruce, aspen, birch and pine. Utilizing the developed database, we have carried out an experiment to further train the Mask R-CNN convolutional neural network for segmentation of overlapping parts of the trees in the digital photographs of forest areas. The neural network has been pre-trained using the Microsoft COCO dataset containing more than 200,000 images of 80 different classes of objects such as people, cars, animals and various items. While training the neural network, the images supplied to its input were subjected to a series of various linear and nonlinear geometric transformations, which made it possible to increase the volume of training data by 11 times. As a result, the accuracy of segmentation of the images of the stems and crowns of coniferous and deciduous trees overlapping each other has reached 79 %, which allows the use of neural networks based on a similar architecture in decision support systems for logging machine operators.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Archives of Clones of Scots Pine Plus Trees in the Republic of Karelia The Genetic Structure Features of the Pinus sylvestris L. Population in the Steppe Zone of European Russia Optimization of the Design Parameters of the Regenerative Rod of a Logging Road Train Formation of Tree Morphology in Cultivated Pine Stands Identification of Damage to Coniferous Stands Based on Comprehensive Analysis of the Results of Remote Sensing of the Earth and Ground Surveys
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1