Classification of periodontitis stage and grade using natural language processing techniques.

PLOS digital health. Pub Date: 2024-12-13. eCollection Date: 2024-12-01. DOI: 10.1371/journal.pdig.0000692
Nazila Ameli, Tahereh Firoozi, Monica Gibson, Hollis Lai
Volume 3, issue 12, article e0000692. Full-text PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11642968/pdf/

Abstract

Periodontitis is a complex, microbiome-related inflammatory condition that affects the supporting tissues of the teeth. Building on the potential of Clinical Decision Support Systems (CDSS), this study aims to facilitate early diagnosis of periodontitis by extracting patient information recorded in dental charts and clinical notes. We developed a CDSS that predicts the stage and grade of periodontitis using natural language processing (NLP) techniques, including Bidirectional Encoder Representations from Transformers (BERT), and compared the performance of BERT with that of a baseline feature-engineered model. A secondary data analysis was conducted using 309 anonymized patient periodontal charts and the corresponding clinicians' notes obtained from the university periodontal clinic. After data preprocessing, we added a classification layer on top of the pre-trained BERT model to classify the clinical notes into their corresponding stage and grade, and then fine-tuned the model on 70% of our data. The performance of the model was evaluated on the clinical notes of 32 new, unseen patients. The results were compared with the output of a baseline feature-engineered algorithm coupled with MLP techniques to classify the stage and grade of periodontitis. Our proposed BERT model predicted the patients' stage and grade with 77% and 75% accuracy, respectively. On the set of 32 new, unseen notes, the MLP model classified the stage and grade of periodontitis with 59.4% and 62.5% accuracy, respectively, whereas the BERT model achieved higher accuracy on the same set (66% and 72%, respectively). The use of BERT in this context represents a groundbreaking application in dentistry, particularly in CDSS. Our BERT model outperformed the baseline models, even with reduced information, which promises efficient review of patient notes. This integration of advanced NLP techniques with CDSS frameworks has the potential to enable timely interventions, prevent complications, and reduce healthcare costs.
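
The accuracy figures above are easier to compare once they are tied back to counts of correctly classified notes. The following is a minimal sketch, not the authors' evaluation code: the function names `accuracy` and `consistent_counts` and the example labels are hypothetical, and the only values taken from the abstract are the set size (32) and the reported percentages.

```python
def accuracy(y_true, y_pred):
    """Return the fraction of predictions that match the reference labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and equal in length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def consistent_counts(reported_pct, n, decimals):
    """List every count k (out of n) whose accuracy rounds to reported_pct.

    `decimals` is the precision at which the percentage was reported,
    e.g. 1 for 59.4 and 0 for 66.
    """
    return [k for k in range(n + 1)
            if round(100 * k / n, decimals) == reported_pct]


if __name__ == "__main__":
    n_unseen = 32  # size of the unseen evaluation set stated in the abstract

    # Illustrative labels only (hypothetical, not from the study).
    print(accuracy(["III", "II", "IV", "III"], ["III", "I", "IV", "III"]))

    # Reported accuracies on the 32 unseen notes, mapped back to counts.
    for name, pct, decimals in [
        ("MLP stage", 59.4, 1),
        ("MLP grade", 62.5, 1),
        ("BERT stage", 66, 0),
        ("BERT grade", 72, 0),
    ]:
        print(name, pct, consistent_counts(pct, n_unseen, decimals))
```

Under these assumptions, the MLP figures correspond to 19 and 20 correct notes out of 32, and the BERT figures to 21 and 23, so BERT's margin on the unseen set is two notes for stage and three for grade. The 77% stage accuracy cannot arise as a rounded fraction of 32, which suggests that the 77% and 75% figures refer to a different evaluation split; the abstract does not say which.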
