Accurate, Robust, and Scalable Machine Abstraction of Mayo Endoscopic Subscores From Colonoscopy Reports.

IF 4.5 3区 医学 Q1 GASTROENTEROLOGY & HEPATOLOGY Inflammatory Bowel Diseases Pub Date : 2025-03-03 DOI:10.1093/ibd/izae068
Anna L Silverman, Balu Bhasuran, Arman Mosenia, Fatema Yasini, Gokul Ramasamy, Imon Banerjee, Saransh Gupta, Taline Mardirossian, Rohan Narain, Justin Sewell, Atul J Butte, Vivek A Rudrapatna
{"title":"Accurate, Robust, and Scalable Machine Abstraction of Mayo Endoscopic Subscores From Colonoscopy Reports.","authors":"Anna L Silverman, Balu Bhasuran, Arman Mosenia, Fatema Yasini, Gokul Ramasamy, Imon Banerjee, Saransh Gupta, Taline Mardirossian, Rohan Narain, Justin Sewell, Atul J Butte, Vivek A Rudrapatna","doi":"10.1093/ibd/izae068","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The Mayo endoscopic subscore (MES) is an important quantitative measure of disease activity in ulcerative colitis. Colonoscopy reports in routine clinical care usually characterize ulcerative colitis disease activity using free text description, limiting their utility for clinical research and quality improvement. We sought to develop algorithms to classify colonoscopy reports according to their MES.</p><p><strong>Methods: </strong>We annotated 500 colonoscopy reports from 2 health systems. We trained and evaluated 4 classes of algorithms. Our primary outcome was accuracy in identifying scorable reports (binary) and assigning an MES (ordinal). Secondary outcomes included learning efficiency, generalizability, and fairness.</p><p><strong>Results: </strong>Automated machine learning models achieved 98% and 97% accuracy on the binary and ordinal prediction tasks, outperforming other models. Binary models trained on the University of California, San Francisco data alone maintained accuracy (96%) on validation data from Zuckerberg San Francisco General. When using 80% of the training data, models remained accurate for the binary task (97% [n = 320]) but lost accuracy on the ordinal task (67% [n = 194]). We found no evidence of bias by gender (P = .65) or area deprivation index (P = .80).</p><p><strong>Conclusions: </strong>We derived a highly accurate pair of models capable of classifying reports by their MES and recognizing when to abstain from prediction. Our models were generalizable on outside institution validation. There was no evidence of algorithmic bias. Our methods have the potential to enable retrospective studies of treatment effectiveness, prospective identification of patients meeting study criteria, and quality improvement efforts in inflammatory bowel diseases.</p>","PeriodicalId":13623,"journal":{"name":"Inflammatory Bowel Diseases","volume":" ","pages":"665-670"},"PeriodicalIF":4.5000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11879245/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Inflammatory Bowel Diseases","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/ibd/izae068","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The Mayo endoscopic subscore (MES) is an important quantitative measure of disease activity in ulcerative colitis. Colonoscopy reports in routine clinical care usually characterize ulcerative colitis disease activity using free text description, limiting their utility for clinical research and quality improvement. We sought to develop algorithms to classify colonoscopy reports according to their MES.

Methods: We annotated 500 colonoscopy reports from 2 health systems. We trained and evaluated 4 classes of algorithms. Our primary outcome was accuracy in identifying scorable reports (binary) and assigning an MES (ordinal). Secondary outcomes included learning efficiency, generalizability, and fairness.

Results: Automated machine learning models achieved 98% and 97% accuracy on the binary and ordinal prediction tasks, outperforming other models. Binary models trained on the University of California, San Francisco data alone maintained accuracy (96%) on validation data from Zuckerberg San Francisco General. When using 80% of the training data, models remained accurate for the binary task (97% [n = 320]) but lost accuracy on the ordinal task (67% [n = 194]). We found no evidence of bias by gender (P = .65) or area deprivation index (P = .80).

Conclusions: We derived a highly accurate pair of models capable of classifying reports by their MES and recognizing when to abstain from prediction. Our models were generalizable on outside institution validation. There was no evidence of algorithmic bias. Our methods have the potential to enable retrospective studies of treatment effectiveness, prospective identification of patients meeting study criteria, and quality improvement efforts in inflammatory bowel diseases.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从结肠镜检查报告中提取准确、稳健、可扩展的梅奥内镜评分。
背景:梅奥内镜评分(MES)是衡量溃疡性结肠炎疾病活动性的重要量化指标。常规临床护理中的结肠镜检查报告通常使用自由文本描述来描述溃疡性结肠炎的疾病活动性,这限制了其在临床研究和质量改进方面的实用性。我们试图开发一种算法,根据结肠镜检查报告的 MES 对其进行分类:我们对来自 2 个医疗系统的 500 份结肠镜检查报告进行了注释。我们训练并评估了 4 类算法。我们的主要结果是识别可扫描报告(二进制)和分配 MES(序数)的准确性。次要结果包括学习效率、通用性和公平性:自动机器学习模型在二进制和序数预测任务中的准确率分别达到 98% 和 97%,优于其他模型。仅在加州大学旧金山分校数据上训练的二进制模型在扎克伯格旧金山综合医院的验证数据上保持了 96% 的准确率。当使用 80% 的训练数据时,模型在二进制任务中的准确率保持不变(97% [n = 320]),但在顺序任务中的准确率有所下降(67% [n = 194])。我们没有发现性别偏差(P = .65)或地区贫困指数偏差(P = .80):我们推导出了一对高度准确的模型,能够根据 MES 对报告进行分类,并识别何时应放弃预测。我们的模型在外部机构验证时具有通用性。没有证据表明算法存在偏差。我们的方法可用于治疗效果的回顾性研究、符合研究标准的患者的前瞻性识别以及炎症性肠病的质量改进工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Inflammatory Bowel Diseases
Inflammatory Bowel Diseases 医学-胃肠肝病学
CiteScore
9.70
自引率
6.10%
发文量
462
审稿时长
1 months
期刊介绍: Inflammatory Bowel Diseases® supports the mission of the Crohn''s & Colitis Foundation by bringing the most impactful and cutting edge clinical topics and research findings related to inflammatory bowel diseases to clinicians and researchers working in IBD and related fields. The Journal is committed to publishing on innovative topics that influence the future of clinical care, treatment, and research.
期刊最新文献
Dietary Nut and Legume Intake and Risk of Crohn's Disease and Ulcerative Colitis. Increasing Exposure to Inflammatory Bowel Diseases Education in Gastroenterology Fellowship: The Pilot IBD 101 Experience. Continuous Monitoring of CRP, IL-6, and Calprotectin in Inflammatory Bowel Disease Using a Perspiration-Based Wearable Device. Pan-enteric Capsule Endoscopy to Characterize Crohn's Disease Phenotypes and Predict Clinical Outcomes in Children and Adults: The Bomiro Study. Accurate, Robust, and Scalable Machine Abstraction of Mayo Endoscopic Subscores From Colonoscopy Reports.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1