Application of natural language processing to post-structuring of rectal cancer MRI reports

IF 2.1 3区 医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Clinical radiology Pub Date : 2023-11-17 DOI:10.1016/j.crad.2023.10.032
W. Liu , L. Cai , Y. Li
{"title":"Application of natural language processing to post-structuring of rectal cancer MRI reports","authors":"W. Liu ,&nbsp;L. Cai ,&nbsp;Y. Li","doi":"10.1016/j.crad.2023.10.032","DOIUrl":null,"url":null,"abstract":"<div><h3>AIM</h3><p>To evaluate a natural language processing (NLP) system for extracting structured information from the free-form text of rectal cancer magnetic resonance imaging (MRI) reports written in Chinese.</p></div><div><h3>MATERIALS AND METHODS</h3><p>A rule-based NLP model that could extract 11 key image features of rectal cancer was constructed using 358 MRI reports of rectal cancer written between 2015 and 2021. Fifty reports written before 2015 and 50 written after 2021 were used as test datasets, and the reference standard was determined by manual extraction of information by two radiologists. The length and reporting rate of image features in pre-2015 and post-2021 datasets, as well as the accuracy, precision, recall, and F1 score of feature extraction by the NLP system, were compared. The time required for the NLP to extract data was compared with that required by the radiologists.</p></div><div><h3>RESULTS</h3><p>Reports written after 2021 had longer diagnostic impression sections than reports written before 2015. The reporting rate of key imaging features of rectal cancer was 36.55% before 2015 and 79.82% after 2021. The accuracy, precision, recall, and F1 score of NLP for correct extraction of values from reports were 93.82%, 95.63%, 87.06%, and 91.15%, respectively, for pre-2015 reports, and 92.55%, 98.53%, 94.15%, and 96.29%, respectively, for post-2021 reports. NLP generated all the structured information in &lt;1 second.</p></div><div><h3>CONCLUSIONS</h3><p>The NLP system with rule-based pattern matching achieved rapid and accurate structured processing of rectal cancer MRI reports. MRI reports with structured templates are more suitable for NLP-based extraction of information.</p></div>","PeriodicalId":10695,"journal":{"name":"Clinical radiology","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical radiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0009926023005172","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

AIM

To evaluate a natural language processing (NLP) system for extracting structured information from the free-form text of rectal cancer magnetic resonance imaging (MRI) reports written in Chinese.

MATERIALS AND METHODS

A rule-based NLP model that could extract 11 key image features of rectal cancer was constructed using 358 MRI reports of rectal cancer written between 2015 and 2021. Fifty reports written before 2015 and 50 written after 2021 were used as test datasets, and the reference standard was determined by manual extraction of information by two radiologists. The length and reporting rate of image features in pre-2015 and post-2021 datasets, as well as the accuracy, precision, recall, and F1 score of feature extraction by the NLP system, were compared. The time required for the NLP to extract data was compared with that required by the radiologists.

RESULTS

Reports written after 2021 had longer diagnostic impression sections than reports written before 2015. The reporting rate of key imaging features of rectal cancer was 36.55% before 2015 and 79.82% after 2021. The accuracy, precision, recall, and F1 score of NLP for correct extraction of values from reports were 93.82%, 95.63%, 87.06%, and 91.15%, respectively, for pre-2015 reports, and 92.55%, 98.53%, 94.15%, and 96.29%, respectively, for post-2021 reports. NLP generated all the structured information in <1 second.

CONCLUSIONS

The NLP system with rule-based pattern matching achieved rapid and accurate structured processing of rectal cancer MRI reports. MRI reports with structured templates are more suitable for NLP-based extraction of information.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
自然语言处理在直肠癌MRI报告后期结构化中的应用。
目的:评价一种自然语言处理(NLP)系统从中文直肠癌磁共振成像(MRI)报告的自由格式文本中提取结构化信息。材料与方法:利用2015 - 2021年358份直肠癌MRI报告,构建了一个基于规则的NLP模型,该模型可以提取出直肠癌的11个关键图像特征。以2015年之前的50份报告和2021年之后的50份报告作为测试数据集,参考标准由两名放射科医生人工提取信息确定。比较了2015年前和2021年后数据集图像特征的长度和报告率,以及NLP系统特征提取的准确率、精密度、召回率和F1分。将NLP提取数据所需的时间与放射科医生所需的时间进行比较。结果:与2015年之前的报告相比,2021年之后撰写的报告的诊断印象切片更长。2015年前直肠癌关键影像学特征报告率为36.55%,2021年后报告率为79.82%。NLP对报告中值正确提取的正确率、精密度、召回率和F1分数在2015年前分别为93.82%、95.63%、87.06%和91.15%,在2021年后分别为92.55%、98.53%、94.15%和96.29%。结论:基于规则模式匹配的NLP系统实现了对直肠癌MRI报告的快速、准确的结构化处理。具有结构化模板的MRI报告更适合基于nlp的信息提取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Clinical radiology
Clinical radiology 医学-核医学
CiteScore
4.70
自引率
3.80%
发文量
528
审稿时长
76 days
期刊介绍: Clinical Radiology is published by Elsevier on behalf of The Royal College of Radiologists. Clinical Radiology is an International Journal bringing you original research, editorials and review articles on all aspects of diagnostic imaging, including: • Computed tomography • Magnetic resonance imaging • Ultrasonography • Digital radiology • Interventional radiology • Radiography • Nuclear medicine Papers on radiological protection, quality assurance, audit in radiology and matters relating to radiological training and education are also included. In addition, each issue contains correspondence, book reviews and notices of forthcoming events.
期刊最新文献
Contents RCR meetings Classifying, recognizing, and troubleshooting errors in magnetic resonance imaging (MRI)-guided breast biopsies. Radiation dose to health care workers measured by thermoluminescent dosimetry. Dual-energy subtraction radiography (DESR): a systematic review and meta-analysis of pulmonary nodule detection.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1