A dataset of fine-grained fossils of the conodont genus Hindeodus for classification using convolutional neural networks

X. Duan
{"title":"A dataset of fine-grained fossils of the conodont genus Hindeodus for classification using convolutional neural networks","authors":"X. Duan","doi":"10.11922/11-6035.csd.2022.0075.zh","DOIUrl":null,"url":null,"abstract":"With the rise of artificial intelligence, the booming application of convolutional neural networks to the classification and identification of fossils has attracted more and more attention. According to our survey, it is found that the species classified by previous authors basically belong to different genera, families or higher biological taxonomic units. However, in fact, the identification of fossils between species within a genus is often the focus and challenge for the identification task, which means that the previously trained classifiers may not be suitable for actual fossil identification. On this basis, in this paper, we built a dataset covering 12 species of the conodont genus Hindeodus by means of literature collection, while providing an augmented dataset of the original data. Since the dataset is fine-grained, users can train it by using convolutional neural network combined with fine-grained image feature extraction technology. In view of the deficiencies of the dataset such as small amount of data and unbalanced classes, it is suggested that users use stratified K-fold cross-validation, transfer learning and weighted loss function in the training task to solve the above problems. The dataset is aimed to add a fine-grained fossil dataset to the field of intelligent identification of biological fossils, which can be used as an experimental dataset for intelligent identification of fine-grained (species-level) fossils by convolutional neural networks. The fine-grained primitive followed by this dataset can also be used as a reference for the establishment of other fossil datasets.","PeriodicalId":57643,"journal":{"name":"China Scientific Data","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"China Scientific Data","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.11922/11-6035.csd.2022.0075.zh","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the rise of artificial intelligence, the booming application of convolutional neural networks to the classification and identification of fossils has attracted more and more attention. According to our survey, it is found that the species classified by previous authors basically belong to different genera, families or higher biological taxonomic units. However, in fact, the identification of fossils between species within a genus is often the focus and challenge for the identification task, which means that the previously trained classifiers may not be suitable for actual fossil identification. On this basis, in this paper, we built a dataset covering 12 species of the conodont genus Hindeodus by means of literature collection, while providing an augmented dataset of the original data. Since the dataset is fine-grained, users can train it by using convolutional neural network combined with fine-grained image feature extraction technology. In view of the deficiencies of the dataset such as small amount of data and unbalanced classes, it is suggested that users use stratified K-fold cross-validation, transfer learning and weighted loss function in the training task to solve the above problems. The dataset is aimed to add a fine-grained fossil dataset to the field of intelligent identification of biological fossils, which can be used as an experimental dataset for intelligent identification of fine-grained (species-level) fossils by convolutional neural networks. The fine-grained primitive followed by this dataset can also be used as a reference for the establishment of other fossil datasets.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用卷积神经网络进行分类的牙形刺属Hindeodus的细粒度化石数据集
随着人工智能的兴起,卷积神经网络在化石分类鉴定中的蓬勃应用越来越受到人们的关注。通过调查发现,前人所分类的物种基本属于不同的属、科或更高的生物分类单位。然而,事实上,一个属内物种之间的化石识别往往是鉴定任务的重点和挑战,这意味着以前训练的分类器可能不适合实际的化石鉴定。在此基础上,本文通过文献收集的方式构建了包含牙形刺属Hindeodus 12种的数据集,同时提供了原始数据的增强数据集。由于数据集是细粒度的,用户可以使用卷积神经网络结合细粒度图像特征提取技术对其进行训练。针对数据集数据量小、类不均衡等不足,建议用户在训练任务中使用分层K-fold交叉验证、迁移学习和加权损失函数来解决上述问题。该数据集旨在为生物化石智能识别领域增加一个细粒度化石数据集,可作为卷积神经网络对细粒度(物种级)化石智能识别的实验数据集。该数据集所遵循的细粒度原语也可作为其他化石数据集建立的参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
389
期刊最新文献
A dataset of monthly light pollution indexes of rivers in China A dataset of observational key parameters in carbon and water fluxes in a semi-arid steppe, Inner Mongolia (2012 – 2020): based on a long-term manipulative experiment of precipitation pattern A dataset of daily surface water mapping products with a resolution of 0.05° on the Qinghai–Tibet Plateau during A dataset of the observations of carbon, water and heat fluxes over an alpine shrubland in Haibei (2011–2020) A dataset of carbon and water fluxes of the typical grasslands in Duolun County, Inner Mongolia during 2006-2015
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1