Chelonia mydas detection and image extraction from noisy field recordings

Khalif Amir Zakry, Mohamad Syahiran Soria, Irwandi Hipni Mohamad Hipiny, Hamimah Ujir, Ruhana Hassan
Journal: IAES International Journal of Artificial Intelligence (IJ-AI)
Publication type: Journal Article
Publication date: 2024-06-01
DOI: 10.11591/ijai.v13.i2.pp2354-2363 (https://doi.org/10.11591/ijai.v13.i2.pp2354-2363)
Citations: 0

Abstract

Wildlife videography is an essential data collection method for research on animals. Recording an animal such as the Chelonia mydas turtle in its natural habitat requires either setting up special camera traps or performing complex camera movements to keep the animal in frame while the camera operator maneuvers over uneven terrain. The result is hours of footage in which the intended subject is present for only seconds while the rest is background, or noisy and blurry footage with only a few usable frames among thousands of unusable ones. Deep learning models can help with this problem, especially by detecting a wildlife subject and extracting usable data from hours of noise and background footage. This paper proposes using machine learning models to detect and extract images of Chelonia mydas turtles, helping to prune the hundreds or thousands of frames in several video recordings. The paper shows that a custom model used with various confidence scores can label and crop images from noisy field video recordings of Chelonia mydas turtles, with up to 99.89% of output images correctly cropped and labeled.
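The abstract does not describe the model or the pipeline in detail, so the following is only a minimal sketch of the general idea of keeping detections above a confidence score and cropping them out of their frames. Everything in it is an assumption made for illustration: the `Detection` structure, the `crop` and `extract_crops` helpers, the box convention, and the representation of a frame as nested lists of pixel values. The detector itself is assumed to exist elsewhere and to have already produced the detections.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

# Hypothetical frame type: a grayscale image stored as rows of pixel values.
Frame = List[List[int]]
# Hypothetical box convention: (x_min, y_min, x_max, y_max), max edges exclusive.
Box = Tuple[int, int, int, int]


@dataclass(frozen=True)
class Detection:
    """One detector output for one video frame (illustrative structure only)."""
    frame_index: int
    label: str
    confidence: float
    box: Box


def crop(frame: Frame, box: Box) -> Frame:
    """Return the sub-image inside `box`, clamped to the frame bounds."""
    height = len(frame)
    width = len(frame[0]) if height else 0
    x0, y0, x1, y1 = box
    x0, x1 = max(0, x0), min(width, x1)
    y0, y1 = max(0, y0), min(height, y1)
    return [row[x0:x1] for row in frame[y0:y1]]


def extract_crops(
    frames: Sequence[Frame],
    detections: Sequence[Detection],
    min_confidence: float,
) -> List[Tuple[Detection, Frame]]:
    """Keep detections scoring at least `min_confidence` and crop each one."""
    results: List[Tuple[Detection, Frame]] = []
    for det in detections:
        if det.confidence < min_confidence:
            continue  # discard low-confidence detections
        image = crop(frames[det.frame_index], det.box)
        if image and image[0]:  # skip boxes that fall entirely outside the frame
            results.append((det, image))
    return results
```

Calling `extract_crops(frames, detections, min_confidence)` with several values of `min_confidence` would give one set of labeled crops per setting, which is one plausible way to compare output quality across confidence scores; the values used in the paper are not stated in the abstract.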