Roberto Mancebo-Martin, Lin Lin, Elena Dacal, Miguel Luengo-Oroz, David Bermejo-Pelaez
{"title":"How many labels do I need? Self-supervised learning strategies for multiple blood parasites classification in microscopy images","authors":"Roberto Mancebo-Martin, Lin Lin, Elena Dacal, Miguel Luengo-Oroz, David Bermejo-Pelaez","doi":"10.1101/2024.02.29.24303535","DOIUrl":null,"url":null,"abstract":"Bloodborne parasitic diseases such as malaria, filariasis or chagas pose significant challenges in clinical diagnosis, with microscopy as the primary tool for diagnosis. However, limitations such as time-consuming processes and the dependence on trained microscopists is critical, particularly in resource-constrained settings. Deep learning techniques have shown value to interpret microscopy images using large annotated databases for training. In this work, we propose a methodology leveraging self-supervised learning as a foundational model for blood parasite classification. Using a large unannotated database of blood microscopy images, the model is able to learn important image representations that are subsequently transferred to perform parasite classification of 11 different species of parasites requiring a smaller amount of labeled data. Our results show enhanced performance over fully supervised approaches, with ~100 labels per class sufficient to attain an F1 score of ~0.8. This approach is promising for advancing in-vitro diagnostic systems in primary healthcare settings.","PeriodicalId":501203,"journal":{"name":"medRxiv - Hematology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Hematology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.02.29.24303535","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Bloodborne parasitic diseases such as malaria, filariasis or chagas pose significant challenges in clinical diagnosis, with microscopy as the primary tool for diagnosis. However, limitations such as time-consuming processes and the dependence on trained microscopists is critical, particularly in resource-constrained settings. Deep learning techniques have shown value to interpret microscopy images using large annotated databases for training. In this work, we propose a methodology leveraging self-supervised learning as a foundational model for blood parasite classification. Using a large unannotated database of blood microscopy images, the model is able to learn important image representations that are subsequently transferred to perform parasite classification of 11 different species of parasites requiring a smaller amount of labeled data. Our results show enhanced performance over fully supervised approaches, with ~100 labels per class sufficient to attain an F1 score of ~0.8. This approach is promising for advancing in-vitro diagnostic systems in primary healthcare settings.