Liang-I Kang, Kathryn Sarullo, Jon N Marsh, Liang Lu, Pooja Khonde, Changqing Ma, Talin Haritunians, Angela Mujukian, Emebet Mengesha, Dermot P B McGovern, Thaddeus S Stappenbeck, S Joshua Swamidass, Ta-Chiang Liu
{"title":"为炎症性肠病的Paneth细胞密度定量开发深度学习算法。","authors":"Liang-I Kang, Kathryn Sarullo, Jon N Marsh, Liang Lu, Pooja Khonde, Changqing Ma, Talin Haritunians, Angela Mujukian, Emebet Mengesha, Dermot P B McGovern, Thaddeus S Stappenbeck, S Joshua Swamidass, Ta-Chiang Liu","doi":"10.1016/j.ebiom.2024.105440","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Alterations in ileal Paneth cell (PC) density have been described in gut inflammatory diseases such as Crohn's disease (CD) and could be used as a biomarker for disease prognosis. However, quantifying PCs is time-intensive, a barrier for clinical workflow. Deep learning (DL) has transformed the development of robust and accurate tools for complex image evaluation. Our aim was to use DL to quantify PCs for use as a quantitative biomarker.</p><p><strong>Methods: </strong>A retrospective cohort of whole slide images (WSI) of ileal tissue samples from patients with/without inflammatory bowel disease (IBD) was used for the study. A pathologist-annotated training set of WSI were used to train a U-net two-stage DL model to quantify PC number, crypt number, and PC density. For validation, a cohort of 48 WSIs were manually quantified by study pathologists and compared to the DL algorithm, using root mean square error (RMSE) and the coefficient of determination (r<sup>2</sup>) as metrics. To test the value of PC quantification as a biomarker, resection specimens from patients with CD (n = 142) and without IBD (n = 48) patients were analysed with the DL model. Finally, we compared time to disease recurrence in patients with CD with low versus high DL-quantified PC density using Log-rank test.</p><p><strong>Findings: </strong>Initial one-stage DL model showed moderate accuracy in predicting PC density in cross-validation tests (RMSE = 1.880, r<sup>2</sup> = 0.641), but adding a second stage significantly improved accuracy (RMSE = 0.802, r<sup>2</sup> = 0.748). In the validation of the two-stage model compared to expert pathologists, the algorithm showed good performance up to RMSE = 1.148, r<sup>2</sup> = 0.708. The retrospective cross-sectional cohort had mean ages of 62.1 years in the patients without IBD and 38.6 years for the patients with CD. In the non-IBD cohort, 43.75% of the patients were male, compared to 49.3% of the patients with CD. Analysis by the DL model showed significantly higher PC density in non-IBD controls compared to the patients with CD (4.04 versus 2.99 PC/crypt). Finally, the algorithm quantification of PCs density in patients with CD showed patients with the lowest 25% PC density (Quartile 1) have significantly shorter recurrence-free interval (p = 0.0399).</p><p><strong>Interpretation: </strong>The current model performance demonstrates the feasibility of developing a DL-based tool to measure PC density as a predictive biomarker for future clinical practice.</p><p><strong>Funding: </strong>This study was funded by the National Institutes of Health (NIH).</p>","PeriodicalId":11494,"journal":{"name":"EBioMedicine","volume":"110 ","pages":"105440"},"PeriodicalIF":9.7000,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development of a deep learning algorithm for Paneth cell density quantification for inflammatory bowel disease.\",\"authors\":\"Liang-I Kang, Kathryn Sarullo, Jon N Marsh, Liang Lu, Pooja Khonde, Changqing Ma, Talin Haritunians, Angela Mujukian, Emebet Mengesha, Dermot P B McGovern, Thaddeus S Stappenbeck, S Joshua Swamidass, Ta-Chiang Liu\",\"doi\":\"10.1016/j.ebiom.2024.105440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Alterations in ileal Paneth cell (PC) density have been described in gut inflammatory diseases such as Crohn's disease (CD) and could be used as a biomarker for disease prognosis. However, quantifying PCs is time-intensive, a barrier for clinical workflow. Deep learning (DL) has transformed the development of robust and accurate tools for complex image evaluation. Our aim was to use DL to quantify PCs for use as a quantitative biomarker.</p><p><strong>Methods: </strong>A retrospective cohort of whole slide images (WSI) of ileal tissue samples from patients with/without inflammatory bowel disease (IBD) was used for the study. A pathologist-annotated training set of WSI were used to train a U-net two-stage DL model to quantify PC number, crypt number, and PC density. For validation, a cohort of 48 WSIs were manually quantified by study pathologists and compared to the DL algorithm, using root mean square error (RMSE) and the coefficient of determination (r<sup>2</sup>) as metrics. To test the value of PC quantification as a biomarker, resection specimens from patients with CD (n = 142) and without IBD (n = 48) patients were analysed with the DL model. Finally, we compared time to disease recurrence in patients with CD with low versus high DL-quantified PC density using Log-rank test.</p><p><strong>Findings: </strong>Initial one-stage DL model showed moderate accuracy in predicting PC density in cross-validation tests (RMSE = 1.880, r<sup>2</sup> = 0.641), but adding a second stage significantly improved accuracy (RMSE = 0.802, r<sup>2</sup> = 0.748). In the validation of the two-stage model compared to expert pathologists, the algorithm showed good performance up to RMSE = 1.148, r<sup>2</sup> = 0.708. The retrospective cross-sectional cohort had mean ages of 62.1 years in the patients without IBD and 38.6 years for the patients with CD. In the non-IBD cohort, 43.75% of the patients were male, compared to 49.3% of the patients with CD. Analysis by the DL model showed significantly higher PC density in non-IBD controls compared to the patients with CD (4.04 versus 2.99 PC/crypt). Finally, the algorithm quantification of PCs density in patients with CD showed patients with the lowest 25% PC density (Quartile 1) have significantly shorter recurrence-free interval (p = 0.0399).</p><p><strong>Interpretation: </strong>The current model performance demonstrates the feasibility of developing a DL-based tool to measure PC density as a predictive biomarker for future clinical practice.</p><p><strong>Funding: </strong>This study was funded by the National Institutes of Health (NIH).</p>\",\"PeriodicalId\":11494,\"journal\":{\"name\":\"EBioMedicine\",\"volume\":\"110 \",\"pages\":\"105440\"},\"PeriodicalIF\":9.7000,\"publicationDate\":\"2024-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EBioMedicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.ebiom.2024.105440\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EBioMedicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ebiom.2024.105440","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
摘要
背景:在克罗恩病(CD)等肠道炎症性疾病中已描述了回肠Paneth细胞(PC)密度的变化,可作为疾病预后的生物标志物。然而,量化 PC 需要耗费大量时间,这对临床工作流程来说是一个障碍。深度学习(DL)改变了用于复杂图像评估的强大而准确的工具的开发。我们的目标是利用深度学习来量化 PC,将其用作定量生物标记物:研究使用了一组回顾性的回肠组织样本全切片图像(WSI),这些样本来自炎症性肠病(IBD)患者/非炎症性肠病患者。病理学家标注的 WSI 训练集用于训练 U-net 两级 DL 模型,以量化 PC 数量、隐窝数量和 PC 密度。为了进行验证,研究病理学家对一组 48 个 WSI 进行了人工量化,并使用均方根误差 (RMSE) 和判定系数 (r2) 作为衡量指标,与 DL 算法进行比较。为了检验 PC 定量作为生物标志物的价值,我们使用 DL 模型分析了 CD 患者(142 人)和非 IBD 患者(48 人)的切除标本。最后,我们使用对数秩检验比较了经 DL 量化的 PC 密度低与高的 CD 患者的疾病复发时间:在交叉验证测试中,最初的单级 DL 模型在预测 PC 密度方面显示出中等准确度(RMSE = 1.880,r2 = 0.641),但增加第二级后,准确度显著提高(RMSE = 0.802,r2 = 0.748)。在两阶段模型与病理专家的验证中,该算法表现良好,RMSE = 1.148,r2 = 0.708。回顾性横断面队列中,无 IBD 患者的平均年龄为 62.1 岁,CD 患者的平均年龄为 38.6 岁。在非 IBD 患者队列中,43.75% 的患者为男性,而在 CD 患者中,男性占 49.3%。通过 DL 模型进行的分析表明,与 CD 患者相比,非 IBD 对照组的 PC 密度明显更高(4.04 PC/crypt 对 2.99 PC/crypt)。最后,对 CD 患者 PC 密度的算法量化显示,PC 密度最低 25% 的患者(四分位 1)的无复发间隔时间明显较短(p = 0.0399):目前的模型性能证明了开发基于DL的工具测量PC密度作为未来临床实践的预测性生物标志物的可行性:本研究由美国国立卫生研究院(NIH)资助。
Development of a deep learning algorithm for Paneth cell density quantification for inflammatory bowel disease.
Background: Alterations in ileal Paneth cell (PC) density have been described in gut inflammatory diseases such as Crohn's disease (CD) and could be used as a biomarker for disease prognosis. However, quantifying PCs is time-intensive, a barrier for clinical workflow. Deep learning (DL) has transformed the development of robust and accurate tools for complex image evaluation. Our aim was to use DL to quantify PCs for use as a quantitative biomarker.
Methods: A retrospective cohort of whole slide images (WSI) of ileal tissue samples from patients with/without inflammatory bowel disease (IBD) was used for the study. A pathologist-annotated training set of WSI were used to train a U-net two-stage DL model to quantify PC number, crypt number, and PC density. For validation, a cohort of 48 WSIs were manually quantified by study pathologists and compared to the DL algorithm, using root mean square error (RMSE) and the coefficient of determination (r2) as metrics. To test the value of PC quantification as a biomarker, resection specimens from patients with CD (n = 142) and without IBD (n = 48) patients were analysed with the DL model. Finally, we compared time to disease recurrence in patients with CD with low versus high DL-quantified PC density using Log-rank test.
Findings: Initial one-stage DL model showed moderate accuracy in predicting PC density in cross-validation tests (RMSE = 1.880, r2 = 0.641), but adding a second stage significantly improved accuracy (RMSE = 0.802, r2 = 0.748). In the validation of the two-stage model compared to expert pathologists, the algorithm showed good performance up to RMSE = 1.148, r2 = 0.708. The retrospective cross-sectional cohort had mean ages of 62.1 years in the patients without IBD and 38.6 years for the patients with CD. In the non-IBD cohort, 43.75% of the patients were male, compared to 49.3% of the patients with CD. Analysis by the DL model showed significantly higher PC density in non-IBD controls compared to the patients with CD (4.04 versus 2.99 PC/crypt). Finally, the algorithm quantification of PCs density in patients with CD showed patients with the lowest 25% PC density (Quartile 1) have significantly shorter recurrence-free interval (p = 0.0399).
Interpretation: The current model performance demonstrates the feasibility of developing a DL-based tool to measure PC density as a predictive biomarker for future clinical practice.
Funding: This study was funded by the National Institutes of Health (NIH).
EBioMedicineBiochemistry, Genetics and Molecular Biology-General Biochemistry,Genetics and Molecular Biology
CiteScore
17.70
自引率
0.90%
发文量
579
审稿时长
5 weeks
期刊介绍:
eBioMedicine is a comprehensive biomedical research journal that covers a wide range of studies that are relevant to human health. Our focus is on original research that explores the fundamental factors influencing human health and disease, including the discovery of new therapeutic targets and treatments, the identification of biomarkers and diagnostic tools, and the investigation and modification of disease pathways and mechanisms. We welcome studies from any biomedical discipline that contribute to our understanding of disease and aim to improve human health.