{"title":"Self-Supervised Learning for Feature Extraction from Glomerular Images and Disease Classification with Minimal Annotations.","authors":"Masatoshi Abe, Hirohiko Niioka, Ayumi Matsumoto, Yusuke Katsuma, Atsuhiro Imai, Hiroki Okushima, Shingo Ozaki, Naohiko Fujii, Kazumasa Oka, Yusuke Sakaguchi, Kazunori Inoue, Yoshitaka Isaka, Isao Matsui","doi":"10.1681/ASN.0000000514","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Deep learning has great potential in digital kidney pathology. However, its effectiveness depends heavily on the availability of extensively labeled datasets, which are often limited due to the specialized knowledge and time required for their creation. This limitation hinders the widespread application of deep learning for the analysis of kidney biopsy images.</p><p><strong>Methods: </strong>We applied self-distillation with no labels (DINO), a self-supervised learning method, to a dataset of 10,423 glomerular images obtained from 384 PAS-stained kidney biopsy slides. Glomerular features extracted from the DINO-pretrained backbone were visualized using principal component analysis (PCA). We then performed classification tasks by adding either k-nearest neighbor (kNN) classifiers or linear head layers to the DINO-pretrained or ImageNet-pretrained backbones. These models were trained on our labeled classification dataset. Performance was evaluated using metrics such as the area under the receiver operating characteristic curve (ROC-AUC). The classification tasks encompassed four disease categories (minimal change disease, mesangial proliferative glomerulonephritis, membranous nephropathy, and diabetic nephropathy) as well as clinical parameters such as hypertension, proteinuria, and hematuria.</p><p><strong>Results: </strong>PCA visualization revealed distinct principal components corresponding to different glomerular structures, demonstrating the capability of the DINO-pretrained backbone to capture morphological features. In disease classification, the DINO-pretrained transferred model (ROC-AUC = 0.93) outperformed the ImageNet-pretrained fine-tuned model (ROC-AUC = 0.89). When the labeled data were limited, the ImageNet-pretrained fine-tuned model's ROC-AUC dropped to 0.76 (95% confidence interval [CI], 0.72-0.80), whereas the DINO-pretrained transferred model maintained superior performance (ROC-AUC 0.88, 95% CI 0.86-0.90). The DINO-pretrained transferred model also exhibited higher AUCs for the classification of several clinical parameters. External validation using two independent datasets confirmed DINO pre-training's superiority, particularly when labeled data were limited.</p><p><strong>Conclusions: </strong>The application of DINO to unlabeled PAS-stained glomerular images facilitated the extraction of histological features that can be effectively utilized for disease classification.</p>","PeriodicalId":17217,"journal":{"name":"Journal of The American Society of Nephrology","volume":" ","pages":""},"PeriodicalIF":10.3000,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The American Society of Nephrology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1681/ASN.0000000514","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Deep learning has great potential in digital kidney pathology. However, its effectiveness depends heavily on the availability of extensively labeled datasets, which are often limited due to the specialized knowledge and time required for their creation. This limitation hinders the widespread application of deep learning for the analysis of kidney biopsy images.
Methods: We applied self-distillation with no labels (DINO), a self-supervised learning method, to a dataset of 10,423 glomerular images obtained from 384 PAS-stained kidney biopsy slides. Glomerular features extracted from the DINO-pretrained backbone were visualized using principal component analysis (PCA). We then performed classification tasks by adding either k-nearest neighbor (kNN) classifiers or linear head layers to the DINO-pretrained or ImageNet-pretrained backbones. These models were trained on our labeled classification dataset. Performance was evaluated using metrics such as the area under the receiver operating characteristic curve (ROC-AUC). The classification tasks encompassed four disease categories (minimal change disease, mesangial proliferative glomerulonephritis, membranous nephropathy, and diabetic nephropathy) as well as clinical parameters such as hypertension, proteinuria, and hematuria.
Results: PCA visualization revealed distinct principal components corresponding to different glomerular structures, demonstrating the capability of the DINO-pretrained backbone to capture morphological features. In disease classification, the DINO-pretrained transferred model (ROC-AUC = 0.93) outperformed the ImageNet-pretrained fine-tuned model (ROC-AUC = 0.89). When the labeled data were limited, the ImageNet-pretrained fine-tuned model's ROC-AUC dropped to 0.76 (95% confidence interval [CI], 0.72-0.80), whereas the DINO-pretrained transferred model maintained superior performance (ROC-AUC 0.88, 95% CI 0.86-0.90). The DINO-pretrained transferred model also exhibited higher AUCs for the classification of several clinical parameters. External validation using two independent datasets confirmed DINO pre-training's superiority, particularly when labeled data were limited.
Conclusions: The application of DINO to unlabeled PAS-stained glomerular images facilitated the extraction of histological features that can be effectively utilized for disease classification.
期刊介绍:
The Journal of the American Society of Nephrology (JASN) stands as the preeminent kidney journal globally, offering an exceptional synthesis of cutting-edge basic research, clinical epidemiology, meta-analysis, and relevant editorial content. Representing a comprehensive resource, JASN encompasses clinical research, editorials distilling key findings, perspectives, and timely reviews.
Editorials are skillfully crafted to elucidate the essential insights of the parent article, while JASN actively encourages the submission of Letters to the Editor discussing recently published articles. The reviews featured in JASN are consistently erudite and comprehensive, providing thorough coverage of respective fields. Since its inception in July 1990, JASN has been a monthly publication.
JASN publishes original research reports and editorial content across a spectrum of basic and clinical science relevant to the broad discipline of nephrology. Topics covered include renal cell biology, developmental biology of the kidney, genetics of kidney disease, cell and transport physiology, hemodynamics and vascular regulation, mechanisms of blood pressure regulation, renal immunology, kidney pathology, pathophysiology of kidney diseases, nephrolithiasis, clinical nephrology (including dialysis and transplantation), and hypertension. Furthermore, articles addressing healthcare policy and care delivery issues relevant to nephrology are warmly welcomed.