Automatic detection of instream large wood in videos using deep learning

IF 2.8 2区 地球科学 Q2 GEOGRAPHY, PHYSICAL Earth Surface Dynamics Pub Date : 2024-04-16 DOI:10.5194/egusphere-2024-792
Janbert Aarnink, Tom Beucler, Marceline Vuaridel, Virginia Ruiz-Villanueva
{"title":"Automatic detection of instream large wood in videos using deep learning","authors":"Janbert Aarnink, Tom Beucler, Marceline Vuaridel, Virginia Ruiz-Villanueva","doi":"10.5194/egusphere-2024-792","DOIUrl":null,"url":null,"abstract":"<strong>Abstract.</strong> Instream large wood (i.e., downed trees, branches and roots larger than 1 m in length and 10 cm diameter) has essential geopmorphological and ecological functions supporting the health of river ecosystems. Still, even though its transport during floods may pose a risk, it is rarely observed and, therefore, poorly understood. This paper presents a novel approach to detect pieces of instream wood from video. The approach uses a Convolutional Neural Network to detect wood automatically. We sampled data to represent different wood transport conditions, combining 20 datasets to yield thousands of instream wood images. We designed multiple scenarios using different data subsets with and without data augmentation and analyzed the contribution of each one to the effectiveness of the model using k-fold cross-validation. The mean average precision of the model varies between 35 and 93 percent, and is highly influenced by the quality of the data which it detects. When the image resolution is low, the identified components in the labeled pieces, rather than exhibiting distinct characteristics such as bark or branches, appear more akin to amorphous masses or 'blobs'. We found that the model detects wood with a mean average precision of 67 percent when using a 418 pixels input image resolution. Also, improvements of up to 23 percent could be achieved in some instances and increasing the input resolution raised the weighted mean average precision to 74 percent. We show that the detection performance on a specific dataset is not solely determined by the complexity of the network or the training data. Therefore, the findings of this paper can be used when designing a custom wood detection network. With the growing availability of flood-related videos featuring wood uploaded to the internet, this methodology facilitates the quantification of wood transport across a wide variety of data sources.","PeriodicalId":48749,"journal":{"name":"Earth Surface Dynamics","volume":"33 1","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth Surface Dynamics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.5194/egusphere-2024-792","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract. Instream large wood (i.e., downed trees, branches and roots larger than 1 m in length and 10 cm diameter) has essential geopmorphological and ecological functions supporting the health of river ecosystems. Still, even though its transport during floods may pose a risk, it is rarely observed and, therefore, poorly understood. This paper presents a novel approach to detect pieces of instream wood from video. The approach uses a Convolutional Neural Network to detect wood automatically. We sampled data to represent different wood transport conditions, combining 20 datasets to yield thousands of instream wood images. We designed multiple scenarios using different data subsets with and without data augmentation and analyzed the contribution of each one to the effectiveness of the model using k-fold cross-validation. The mean average precision of the model varies between 35 and 93 percent, and is highly influenced by the quality of the data which it detects. When the image resolution is low, the identified components in the labeled pieces, rather than exhibiting distinct characteristics such as bark or branches, appear more akin to amorphous masses or 'blobs'. We found that the model detects wood with a mean average precision of 67 percent when using a 418 pixels input image resolution. Also, improvements of up to 23 percent could be achieved in some instances and increasing the input resolution raised the weighted mean average precision to 74 percent. We show that the detection performance on a specific dataset is not solely determined by the complexity of the network or the training data. Therefore, the findings of this paper can be used when designing a custom wood detection network. With the growing availability of flood-related videos featuring wood uploaded to the internet, this methodology facilitates the quantification of wood transport across a wide variety of data sources.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用深度学习自动检测视频中的大木头
摘要河流中的大木头(即长度超过 1 米、直径超过 10 厘米的倒伏树木、树枝和树根)具有重要的地貌和生态功能,可支持河流生态系统的健康发展。然而,尽管洪水期间木头的迁移可能会带来风险,但却很少被观测到,因此人们对其了解甚少。本文介绍了一种从视频中检测内流木片的新方法。该方法使用卷积神经网络自动检测木材。我们对数据进行了采样,以代表不同的木材运输条件,结合 20 个数据集,生成了数千张溪流木材图像。我们设计了多个使用不同数据子集的场景,包括使用和不使用数据增强,并使用 k 倍交叉验证分析了每个场景对模型有效性的贡献。模型的平均精度在 35% 到 93% 之间,受检测数据质量的影响很大。当图像分辨率较低时,标注碎片中被识别的成分不会表现出树皮或树枝等明显特征,而更像是无定形的团块或 "圆球"。我们发现,当使用 418 像素的输入图像分辨率时,模型检测木材的平均精度为 67%。此外,在某些情况下还能提高 23%,而提高输入分辨率则能将加权平均精度提高到 74%。我们的研究表明,特定数据集的检测性能并不完全取决于网络或训练数据的复杂性。因此,本文的研究结果可用于设计定制的木材检测网络。随着上传到互联网上的与洪水有关的木材视频越来越多,这种方法有助于量化各种数据源中的木材运输。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Earth Surface Dynamics
Earth Surface Dynamics GEOGRAPHY, PHYSICALGEOSCIENCES, MULTIDISCI-GEOSCIENCES, MULTIDISCIPLINARY
CiteScore
5.40
自引率
5.90%
发文量
56
审稿时长
20 weeks
期刊介绍: Earth Surface Dynamics (ESurf) is an international scientific journal dedicated to the publication and discussion of high-quality research on the physical, chemical, and biological processes shaping Earth''s surface and their interactions on all scales.
期刊最新文献
Exotic tree plantations in the Chilean Coastal Range: balancing the effects of discrete disturbances, connectivity, and a persistent drought on catchment erosion Role of the forcing sources in morphodynamic modelling of an embayed beach Equilibrium distance from long-range dune interactions An empirically-derived hydraulic head model controlling water storage and outflow over a decade in degraded permafrost rock slopes (Zugspitze, D/A) Geomorphic imprint of high mountain floods: Insight from the 2022 hydrological extreme across the Upper Indus terrain in NW Himalayas
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1