Peter Bartlett, Ursula Eberhardt, Nicole Schütz, Henry J Beker
Published: 2022-06-30. DOI: 10.1186/s43008-022-00099-x (https://doi.org/10.1186/s43008-022-00099-x). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245212/pdf/
Species determination using AI machine-learning algorithms: Hebeloma as a case study.
The genus Hebeloma is renowned as difficult when it comes to species determination. Historically, many dichotomous keys have been published and used with varying success rate. Over the last 20 years the authors have built a database of Hebeloma collections containing not only metadata but also parametrized morphological descriptions, where for about a third of the cases micromorphological characters have been analysed and are included, as well as DNA sequences for almost every collection. The database now has about 9000 collections including nearly every type collection worldwide and represents over 120 different taxa. Almost every collection has been analysed and identified to species using a combination of the available molecular and morphological data in addition to locality and habitat information. Based on these data an Artificial Intelligence (AI) machine-learning species identifier has been developed that takes as input locality data and a small number of the morphological parameters. Using a random test set of more than 600 collections from the database, not utilized within the set of collections used to train the identifier, the species identifier was able to identify 77% correctly with its highest probabilistic match, 96% within its three most likely determinations and over 99% of collections within its five most likely determinations.
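The accuracy figures in the abstract (correct with the highest probabilistic match, within the three most likely determinations, within the five most likely) are what is commonly called top-k accuracy. The following is a minimal sketch of how such a metric can be computed, not the authors' implementation: the function name `top_k_accuracy`, the placeholder labels `species_a` to `species_c`, and all probabilities are hypothetical toy values chosen for illustration.

```python
from typing import Dict, Sequence


def top_k_accuracy(
    predictions: Sequence[Dict[str, float]],
    true_labels: Sequence[str],
    k: int,
) -> float:
    """Return the fraction of cases whose true label is among the k
    labels with the highest predicted probability."""
    if len(predictions) != len(true_labels):
        raise ValueError("predictions and true_labels must have equal length")
    if not predictions:
        raise ValueError("at least one prediction is required")
    if k < 1:
        raise ValueError("k must be at least 1")

    hits = 0
    for probs, truth in zip(predictions, true_labels):
        # Rank candidate labels from most to least probable, keep the top k.
        ranked = sorted(probs, key=probs.get, reverse=True)[:k]
        if truth in ranked:
            hits += 1
    return hits / len(predictions)


# Hypothetical toy data: one probability table per test collection.
toy_predictions = [
    {"species_a": 0.6, "species_b": 0.3, "species_c": 0.1},
    {"species_a": 0.5, "species_b": 0.4, "species_c": 0.1},
    {"species_a": 0.2, "species_b": 0.5, "species_c": 0.3},
    {"species_a": 0.1, "species_b": 0.2, "species_c": 0.7},
]
toy_truth = ["species_a", "species_b", "species_a", "species_c"]

for k in (1, 2, 3):
    print(f"top-{k} accuracy: {top_k_accuracy(toy_predictions, toy_truth, k):.2f}")
```

On this toy data the true label is ranked first in two of four cases, within the top two in three, and within the top three in all four, so the metric rises with k in the same way as the reported 77%, 96%, and over 99%.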
Journal: IMA Fungus
Subject area: Agricultural and Biological Sciences: Agricultural and Biological Sciences (miscellaneous)
CiteScore: 11.00
Self-citation rate: 3.70%
Articles published: 18
Review time: 20 weeks
Journal description:
The flagship journal of the International Mycological Association. IMA Fungus is an international, peer-reviewed, open-access, full colour, fast-track journal. Papers on any aspect of mycology are considered, and published on-line with final pagination after proofs have been corrected; they are then effectively published under the International Code of Nomenclature for algae, fungi, and plants. The journal strongly supports good practice policies, and requires voucher specimens or cultures to be deposited in a public collection with an online database, DNA sequences in GenBank, alignments in TreeBASE, and validating information on new scientific names, including typifications, to be lodged in MycoBank. News, meeting reports, personalia, research news, correspondence, book news, and information on forthcoming international meetings are included in each issue.