使用人工智能机器学习算法确定物种:以Hebeloma为例研究。

IF 5.2 1区 生物学 Q1 MYCOLOGY Ima Fungus Pub Date : 2022-06-30 DOI:10.1186/s43008-022-00099-x
Peter Bartlett, Ursula Eberhardt, Nicole Schütz, Henry J Beker
{"title":"使用人工智能机器学习算法确定物种:以Hebeloma为例研究。","authors":"Peter Bartlett,&nbsp;Ursula Eberhardt,&nbsp;Nicole Schütz,&nbsp;Henry J Beker","doi":"10.1186/s43008-022-00099-x","DOIUrl":null,"url":null,"abstract":"<p><p>The genus Hebeloma is renowned as difficult when it comes to species determination. Historically, many dichotomous keys have been published and used with varying success rate. Over the last 20 years the authors have built a database of Hebeloma collections containing not only metadata but also parametrized morphological descriptions, where for about a third of the cases micromorphological characters have been analysed and are included, as well as DNA sequences for almost every collection. The database now has about 9000 collections including nearly every type collection worldwide and represents over 120 different taxa. Almost every collection has been analysed and identified to species using a combination of the available molecular and morphological data in addition to locality and habitat information. Based on these data an Artificial Intelligence (AI) machine-learning species identifier has been developed that takes as input locality data and a small number of the morphological parameters. Using a random test set of more than 600 collections from the database, not utilized within the set of collections used to train the identifier, the species identifier was able to identify 77% correctly with its highest probabilistic match, 96% within its three most likely determinations and over 99% of collections within its five most likely determinations.</p>","PeriodicalId":54345,"journal":{"name":"Ima Fungus","volume":null,"pages":null},"PeriodicalIF":5.2000,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245212/pdf/","citationCount":"5","resultStr":"{\"title\":\"Species determination using AI machine-learning algorithms: Hebeloma as a case study.\",\"authors\":\"Peter Bartlett,&nbsp;Ursula Eberhardt,&nbsp;Nicole Schütz,&nbsp;Henry J Beker\",\"doi\":\"10.1186/s43008-022-00099-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The genus Hebeloma is renowned as difficult when it comes to species determination. Historically, many dichotomous keys have been published and used with varying success rate. Over the last 20 years the authors have built a database of Hebeloma collections containing not only metadata but also parametrized morphological descriptions, where for about a third of the cases micromorphological characters have been analysed and are included, as well as DNA sequences for almost every collection. The database now has about 9000 collections including nearly every type collection worldwide and represents over 120 different taxa. Almost every collection has been analysed and identified to species using a combination of the available molecular and morphological data in addition to locality and habitat information. Based on these data an Artificial Intelligence (AI) machine-learning species identifier has been developed that takes as input locality data and a small number of the morphological parameters. Using a random test set of more than 600 collections from the database, not utilized within the set of collections used to train the identifier, the species identifier was able to identify 77% correctly with its highest probabilistic match, 96% within its three most likely determinations and over 99% of collections within its five most likely determinations.</p>\",\"PeriodicalId\":54345,\"journal\":{\"name\":\"Ima Fungus\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2022-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245212/pdf/\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ima Fungus\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s43008-022-00099-x\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MYCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ima Fungus","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s43008-022-00099-x","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MYCOLOGY","Score":null,"Total":0}
引用次数: 5

摘要

当涉及到物种确定时,Hebeloma属以困难而闻名。从历史上看,许多二分键已经发布并以不同的成功率使用。在过去的20年里,作者已经建立了一个Hebeloma收集的数据库,其中不仅包含元数据,还包含参数化形态学描述,其中大约三分之一的病例已经分析并包括微形态学特征,以及几乎每个收集的DNA序列。该数据库现在有大约9000个集合,包括世界上几乎所有类型的集合,代表了120多个不同的分类群。除了地点和生境信息外,几乎每一次收集都利用现有的分子和形态数据进行了分析和鉴定。基于这些数据,开发了一种人工智能(AI)机器学习物种识别器,该识别器将位置数据和少量形态参数作为输入。使用数据库中600多个集合的随机测试集(未在用于训练标识符的集合中使用),物种标识符能够在其最高概率匹配下正确识别77%,在其最可能的三个确定中正确识别96%,在其最可能的五个确定中正确识别99%以上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Species determination using AI machine-learning algorithms: Hebeloma as a case study.

The genus Hebeloma is renowned as difficult when it comes to species determination. Historically, many dichotomous keys have been published and used with varying success rate. Over the last 20 years the authors have built a database of Hebeloma collections containing not only metadata but also parametrized morphological descriptions, where for about a third of the cases micromorphological characters have been analysed and are included, as well as DNA sequences for almost every collection. The database now has about 9000 collections including nearly every type collection worldwide and represents over 120 different taxa. Almost every collection has been analysed and identified to species using a combination of the available molecular and morphological data in addition to locality and habitat information. Based on these data an Artificial Intelligence (AI) machine-learning species identifier has been developed that takes as input locality data and a small number of the morphological parameters. Using a random test set of more than 600 collections from the database, not utilized within the set of collections used to train the identifier, the species identifier was able to identify 77% correctly with its highest probabilistic match, 96% within its three most likely determinations and over 99% of collections within its five most likely determinations.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Ima Fungus
Ima Fungus Agricultural and Biological Sciences-Agricultural and Biological Sciences (miscellaneous)
CiteScore
11.00
自引率
3.70%
发文量
18
审稿时长
20 weeks
期刊介绍: The flagship journal of the International Mycological Association. IMA Fungus is an international, peer-reviewed, open-access, full colour, fast-track journal. Papers on any aspect of mycology are considered, and published on-line with final pagination after proofs have been corrected; they are then effectively published under the International Code of Nomenclature for algae, fungi, and plants. The journal strongly supports good practice policies, and requires voucher specimens or cultures to be deposited in a public collection with an online database, DNA sequences in GenBank, alignments in TreeBASE, and validating information on new scientific names, including typifications, to be lodged in MycoBank. News, meeting reports, personalia, research news, correspondence, book news, and information on forthcoming international meetings are included in each issue
期刊最新文献
A new leaf inhabiting ascomycete from the Jurassic (ca 170 Mya) of Yorkshire, UK, and insights into the appearance and diversification of filamentous Ascomycota. Enhanced detection of Pythium insidiosum via lipid profiling with matrix-assisted laser desorption ionization time of flight mass spectrometry. Comparative mitochondrial genomics of Thelebolaceae in Antarctica: insights into their extremophilic adaptations and evolutionary dynamics. Integration of fungal transcriptomics and metabolomics provides insights into the early interaction between the ORM fungus Tulasnella sp. and the orchid Serapias vomeracea seeds. Integrative taxonomy of Metarhizium anisopliae species complex, based on phylogenomics combined with morphometrics, metabolomics, and virulence data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1