Classification of dinosaur footprints using machine learning

Michael Jones, Jens N Lallensack, Ian Jarman, Peter Falkingham, Ivo Siekmann
bioRxiv - Paleontology, published 2024-07-18
DOI: 10.1101/2024.07.15.603597 (https://doi.org/10.1101/2024.07.15.603597)

Abstract

Fossilised dinosaur footprints enable us to study the behaviour of individual dinosaurs as well as interactions between dinosaurs of the same or different species. There are two principal groups of three-toed dinosaurs: ornithopods and theropods. Determining whether a footprint was made by an ornithopod or a theropod is a challenging problem. Using a data set of over 300 dinosaur footprints, we train several machine learning models to classify footprints as either ornithopod or theropod. Each footprint is represented by 20 landmarks derived from images. Variable selection using logistic forward regression shows that the selected landmarks lie at locations that are intuitively expected to be especially informative, such as the top or the bottom of a footprint. Most models show good accuracy, but the recall for ornithopods, which are represented by fewer samples in the data set, was generally lower than the recall for theropods. The Multi-Layer Perceptron (MLP) stands out as the model that dealt best with the class imbalance. Finally, we investigate which footprints were misclassified by the majority of models. We find that some misclassified samples exhibit features that are characteristic of the other class or have a compromised shape, for example a middle toe that points to the left or the right rather than straight ahead.
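
The two evaluation ideas mentioned in the abstract, per-class recall under class imbalance and footprints misclassified by the majority of models, can be illustrated with a minimal sketch. This is not the authors' code: the function names, the model names and the toy labels are all hypothetical, invented only to show how the quantities are computed.

```python
from collections import Counter


def per_class_recall(y_true, y_pred):
    """Recall per class: correctly predicted samples / all samples of that class."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: hits[label] / totals[label] for label in totals}


def majority_misclassified(y_true, predictions_by_model):
    """Indices of samples that more than half of the models got wrong."""
    n_models = len(predictions_by_model)
    flagged = []
    for i, truth in enumerate(y_true):
        wrong = sum(1 for preds in predictions_by_model.values() if preds[i] != truth)
        if wrong > n_models / 2:
            flagged.append(i)
    return flagged


# Toy labels, invented for illustration only (NOT data from the paper):
# 6 theropods ("T") and 3 ornithopods ("O"), i.e. an imbalanced set.
y_true = ["T", "T", "T", "T", "T", "T", "O", "O", "O"]
predictions_by_model = {
    "model_a": ["T", "T", "T", "T", "T", "T", "O", "T", "T"],
    "model_b": ["T", "T", "T", "T", "T", "O", "O", "O", "T"],
    "model_c": ["T", "T", "T", "T", "T", "T", "O", "T", "T"],
}

recalls = {
    name: per_class_recall(y_true, preds)
    for name, preds in predictions_by_model.items()
}
hard_cases = majority_misclassified(y_true, predictions_by_model)

for name, recall in recalls.items():
    print(name, recall)
print("misclassified by most models:", hard_cases)
```

In this toy set a model that always leans towards the larger class can reach perfect theropod recall while recovering only a third of the ornithopods, which is why recall is reported per class rather than relying on overall accuracy alone.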