{"title":"Machine Learning for identification and classification of Foraminifera: Testing on monothalamids","authors":"Anna Sabbatini , Francesca Caridi , Domenico Potena , Alessandra Negri","doi":"10.1016/j.marmicro.2025.102442","DOIUrl":null,"url":null,"abstract":"<div><div>Here we propose an AI-based approach using machine learning (ML) to assist species identification and reduce morphotype redundancy in the study of monothalamous foraminifera. In fact, this group of protists, is often overlooked in taxonomic studies due to their morphological simplicity and diversity. These single-celled organisms with “soft” tests are poorly studied, with only a few species identified, while many morphotypes remain undescribed. Taxonomic research on monothalamids is limited by challenges in identification, lack of fossilization, and the time-intensive nature of the work. This gap may lead to underestimating biodiversity and hinder detecting ecosystem degradation. Despite these challenges, monothalamids play key roles in marine ecosystems, making their diversity crucial for conservation and resource management. With this in mind, we analyzed images from the scientific literature, extracting key morphological traits, such as chamber shape, shell type, composition, and aperture type, through objective human annotation to build a dataset processed by ML algorithms. Clustering techniques, such as K-Means, revealed that basic shape, followed by shell type and composition, were the primary features distinguishing clusters. This approach enabled more objective morphotype classification, improving consistency and reducing human bias.</div><div>These findings align with recent taxonomic revisions and demonstrate that applying unsupervised ML methods enhances species identification accuracy and streamlines the analysis of high-dimensional datasets.</div></div>","PeriodicalId":49881,"journal":{"name":"Marine Micropaleontology","volume":"195 ","pages":"Article 102442"},"PeriodicalIF":1.5000,"publicationDate":"2025-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Marine Micropaleontology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0377839825000076","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PALEONTOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Here we propose an AI-based approach using machine learning (ML) to assist species identification and reduce morphotype redundancy in the study of monothalamous foraminifera. In fact, this group of protists, is often overlooked in taxonomic studies due to their morphological simplicity and diversity. These single-celled organisms with “soft” tests are poorly studied, with only a few species identified, while many morphotypes remain undescribed. Taxonomic research on monothalamids is limited by challenges in identification, lack of fossilization, and the time-intensive nature of the work. This gap may lead to underestimating biodiversity and hinder detecting ecosystem degradation. Despite these challenges, monothalamids play key roles in marine ecosystems, making their diversity crucial for conservation and resource management. With this in mind, we analyzed images from the scientific literature, extracting key morphological traits, such as chamber shape, shell type, composition, and aperture type, through objective human annotation to build a dataset processed by ML algorithms. Clustering techniques, such as K-Means, revealed that basic shape, followed by shell type and composition, were the primary features distinguishing clusters. This approach enabled more objective morphotype classification, improving consistency and reducing human bias.
These findings align with recent taxonomic revisions and demonstrate that applying unsupervised ML methods enhances species identification accuracy and streamlines the analysis of high-dimensional datasets.
期刊介绍:
Marine Micropaleontology is an international journal publishing original, innovative and significant scientific papers in all fields related to marine microfossils, including ecology and paleoecology, biology and paleobiology, paleoceanography and paleoclimatology, environmental monitoring, taphonomy, evolution and molecular phylogeny. The journal strongly encourages the publication of articles in which marine microfossils and/or their chemical composition are used to solve fundamental geological, environmental and biological problems. However, it does not publish purely stratigraphic or taxonomic papers. In Marine Micropaleontology, a special section is dedicated to short papers on new methods and protocols using marine microfossils. We solicit special issues on hot topics in marine micropaleontology and review articles on timely subjects.