Automatic detection of instream large wood in videos using deep learning

IF 2.8 2区地球科学 Q2 GEOGRAPHY, PHYSICAL Earth Surface Dynamics Pub Date : 2024-04-16 DOI:10.5194/egusphere-2024-792

Janbert Aarnink, Tom Beucler, Marceline Vuaridel, Virginia Ruiz-Villanueva

{"title":"Automatic detection of instream large wood in videos using deep learning","authors":"Janbert Aarnink, Tom Beucler, Marceline Vuaridel, Virginia Ruiz-Villanueva","doi":"10.5194/egusphere-2024-792","DOIUrl":null,"url":null,"abstract":"<strong>Abstract.</strong> Instream large wood (i.e., downed trees, branches and roots larger than 1 m in length and 10 cm diameter) has essential geopmorphological and ecological functions supporting the health of river ecosystems. Still, even though its transport during floods may pose a risk, it is rarely observed and, therefore, poorly understood. This paper presents a novel approach to detect pieces of instream wood from video. The approach uses a Convolutional Neural Network to detect wood automatically. We sampled data to represent different wood transport conditions, combining 20 datasets to yield thousands of instream wood images. We designed multiple scenarios using different data subsets with and without data augmentation and analyzed the contribution of each one to the effectiveness of the model using k-fold cross-validation. The mean average precision of the model varies between 35 and 93 percent, and is highly influenced by the quality of the data which it detects. When the image resolution is low, the identified components in the labeled pieces, rather than exhibiting distinct characteristics such as bark or branches, appear more akin to amorphous masses or 'blobs'. We found that the model detects wood with a mean average precision of 67 percent when using a 418 pixels input image resolution. Also, improvements of up to 23 percent could be achieved in some instances and increasing the input resolution raised the weighted mean average precision to 74 percent. We show that the detection performance on a specific dataset is not solely determined by the complexity of the network or the training data. Therefore, the findings of this paper can be used when designing a custom wood detection network. With the growing availability of flood-related videos featuring wood uploaded to the internet, this methodology facilitates the quantification of wood transport across a wide variety of data sources.","PeriodicalId":48749,"journal":{"name":"Earth Surface Dynamics","volume":"33 1","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth Surface Dynamics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.5194/egusphere-2024-792","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract. Instream large wood (i.e., downed trees, branches and roots larger than 1 m in length and 10 cm diameter) has essential geopmorphological and ecological functions supporting the health of river ecosystems. Still, even though its transport during floods may pose a risk, it is rarely observed and, therefore, poorly understood. This paper presents a novel approach to detect pieces of instream wood from video. The approach uses a Convolutional Neural Network to detect wood automatically. We sampled data to represent different wood transport conditions, combining 20 datasets to yield thousands of instream wood images. We designed multiple scenarios using different data subsets with and without data augmentation and analyzed the contribution of each one to the effectiveness of the model using k-fold cross-validation. The mean average precision of the model varies between 35 and 93 percent, and is highly influenced by the quality of the data which it detects. When the image resolution is low, the identified components in the labeled pieces, rather than exhibiting distinct characteristics such as bark or branches, appear more akin to amorphous masses or 'blobs'. We found that the model detects wood with a mean average precision of 67 percent when using a 418 pixels input image resolution. Also, improvements of up to 23 percent could be achieved in some instances and increasing the input resolution raised the weighted mean average precision to 74 percent. We show that the detection performance on a specific dataset is not solely determined by the complexity of the network or the training data. Therefore, the findings of this paper can be used when designing a custom wood detection network. With the growing availability of flood-related videos featuring wood uploaded to the internet, this methodology facilitates the quantification of wood transport across a wide variety of data sources.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用深度学习自动检测视频中的大木头

摘要河流中的大木头（即长度超过 1 米、直径超过 10 厘米的倒伏树木、树枝和树根）具有重要的地貌和生态功能，可支持河流生态系统的健康发展。然而，尽管洪水期间木头的迁移可能会带来风险，但却很少被观测到，因此人们对其了解甚少。本文介绍了一种从视频中检测内流木片的新方法。该方法使用卷积神经网络自动检测木材。我们对数据进行了采样，以代表不同的木材运输条件，结合 20 个数据集，生成了数千张溪流木材图像。我们设计了多个使用不同数据子集的场景，包括使用和不使用数据增强，并使用 k 倍交叉验证分析了每个场景对模型有效性的贡献。模型的平均精度在 35% 到 93% 之间，受检测数据质量的影响很大。当图像分辨率较低时，标注碎片中被识别的成分不会表现出树皮或树枝等明显特征，而更像是无定形的团块或 "圆球"。我们发现，当使用 418 像素的输入图像分辨率时，模型检测木材的平均精度为 67%。此外，在某些情况下还能提高 23%，而提高输入分辨率则能将加权平均精度提高到 74%。我们的研究表明，特定数据集的检测性能并不完全取决于网络或训练数据的复杂性。因此，本文的研究结果可用于设计定制的木材检测网络。随着上传到互联网上的与洪水有关的木材视频越来越多，这种方法有助于量化各种数据源中的木材运输。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Earth Surface Dynamics GEOGRAPHY, PHYSICALGEOSCIENCES, MULTIDISCI-GEOSCIENCES, MULTIDISCIPLINARY

CiteScore

5.40

自引率

5.90%

发文量

审稿时长

20 weeks

期刊介绍： Earth Surface Dynamics (ESurf) is an international scientific journal dedicated to the publication and discussion of high-quality research on the physical, chemical, and biological processes shaping Earth''s surface and their interactions on all scales.