Liang-I Kang, Kathryn Sarullo, Jon N Marsh, Liang Lu, Pooja Khonde, Changqing Ma, Talin Haritunians, Angela Mujukian, Emebet Mengesha, Dermot P B McGovern, Thaddeus S Stappenbeck, S Joshua Swamidass, Ta-Chiang Liu
{"title":"Development of a deep learning algorithm for Paneth cell density quantification for inflammatory bowel disease.","authors":"Liang-I Kang, Kathryn Sarullo, Jon N Marsh, Liang Lu, Pooja Khonde, Changqing Ma, Talin Haritunians, Angela Mujukian, Emebet Mengesha, Dermot P B McGovern, Thaddeus S Stappenbeck, S Joshua Swamidass, Ta-Chiang Liu","doi":"10.1016/j.ebiom.2024.105440","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Alterations in ileal Paneth cell (PC) density have been described in gut inflammatory diseases such as Crohn's disease (CD) and could be used as a biomarker for disease prognosis. However, quantifying PCs is time-intensive, a barrier for clinical workflow. Deep learning (DL) has transformed the development of robust and accurate tools for complex image evaluation. Our aim was to use DL to quantify PCs for use as a quantitative biomarker.</p><p><strong>Methods: </strong>A retrospective cohort of whole slide images (WSI) of ileal tissue samples from patients with/without inflammatory bowel disease (IBD) was used for the study. A pathologist-annotated training set of WSI were used to train a U-net two-stage DL model to quantify PC number, crypt number, and PC density. For validation, a cohort of 48 WSIs were manually quantified by study pathologists and compared to the DL algorithm, using root mean square error (RMSE) and the coefficient of determination (r<sup>2</sup>) as metrics. To test the value of PC quantification as a biomarker, resection specimens from patients with CD (n = 142) and without IBD (n = 48) patients were analysed with the DL model. Finally, we compared time to disease recurrence in patients with CD with low versus high DL-quantified PC density using Log-rank test.</p><p><strong>Findings: </strong>Initial one-stage DL model showed moderate accuracy in predicting PC density in cross-validation tests (RMSE = 1.880, r<sup>2</sup> = 0.641), but adding a second stage significantly improved accuracy (RMSE = 0.802, r<sup>2</sup> = 0.748). In the validation of the two-stage model compared to expert pathologists, the algorithm showed good performance up to RMSE = 1.148, r<sup>2</sup> = 0.708. The retrospective cross-sectional cohort had mean ages of 62.1 years in the patients without IBD and 38.6 years for the patients with CD. In the non-IBD cohort, 43.75% of the patients were male, compared to 49.3% of the patients with CD. Analysis by the DL model showed significantly higher PC density in non-IBD controls compared to the patients with CD (4.04 versus 2.99 PC/crypt). Finally, the algorithm quantification of PCs density in patients with CD showed patients with the lowest 25% PC density (Quartile 1) have significantly shorter recurrence-free interval (p = 0.0399).</p><p><strong>Interpretation: </strong>The current model performance demonstrates the feasibility of developing a DL-based tool to measure PC density as a predictive biomarker for future clinical practice.</p><p><strong>Funding: </strong>This study was funded by the National Institutes of Health (NIH).</p>","PeriodicalId":11494,"journal":{"name":"EBioMedicine","volume":"110 ","pages":"105440"},"PeriodicalIF":9.7000,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EBioMedicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ebiom.2024.105440","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Alterations in ileal Paneth cell (PC) density have been described in gut inflammatory diseases such as Crohn's disease (CD) and could be used as a biomarker for disease prognosis. However, quantifying PCs is time-intensive, a barrier for clinical workflow. Deep learning (DL) has transformed the development of robust and accurate tools for complex image evaluation. Our aim was to use DL to quantify PCs for use as a quantitative biomarker.
Methods: A retrospective cohort of whole slide images (WSI) of ileal tissue samples from patients with/without inflammatory bowel disease (IBD) was used for the study. A pathologist-annotated training set of WSI were used to train a U-net two-stage DL model to quantify PC number, crypt number, and PC density. For validation, a cohort of 48 WSIs were manually quantified by study pathologists and compared to the DL algorithm, using root mean square error (RMSE) and the coefficient of determination (r2) as metrics. To test the value of PC quantification as a biomarker, resection specimens from patients with CD (n = 142) and without IBD (n = 48) patients were analysed with the DL model. Finally, we compared time to disease recurrence in patients with CD with low versus high DL-quantified PC density using Log-rank test.
Findings: Initial one-stage DL model showed moderate accuracy in predicting PC density in cross-validation tests (RMSE = 1.880, r2 = 0.641), but adding a second stage significantly improved accuracy (RMSE = 0.802, r2 = 0.748). In the validation of the two-stage model compared to expert pathologists, the algorithm showed good performance up to RMSE = 1.148, r2 = 0.708. The retrospective cross-sectional cohort had mean ages of 62.1 years in the patients without IBD and 38.6 years for the patients with CD. In the non-IBD cohort, 43.75% of the patients were male, compared to 49.3% of the patients with CD. Analysis by the DL model showed significantly higher PC density in non-IBD controls compared to the patients with CD (4.04 versus 2.99 PC/crypt). Finally, the algorithm quantification of PCs density in patients with CD showed patients with the lowest 25% PC density (Quartile 1) have significantly shorter recurrence-free interval (p = 0.0399).
Interpretation: The current model performance demonstrates the feasibility of developing a DL-based tool to measure PC density as a predictive biomarker for future clinical practice.
Funding: This study was funded by the National Institutes of Health (NIH).
EBioMedicineBiochemistry, Genetics and Molecular Biology-General Biochemistry,Genetics and Molecular Biology
CiteScore
17.70
自引率
0.90%
发文量
579
审稿时长
5 weeks
期刊介绍:
eBioMedicine is a comprehensive biomedical research journal that covers a wide range of studies that are relevant to human health. Our focus is on original research that explores the fundamental factors influencing human health and disease, including the discovery of new therapeutic targets and treatments, the identification of biomarkers and diagnostic tools, and the investigation and modification of disease pathways and mechanisms. We welcome studies from any biomedical discipline that contribute to our understanding of disease and aim to improve human health.