Classification of periodontitis stage and grade using natural language processing techniques.

PLOS digital health Pub Date : 2024-12-13 eCollection Date: 2024-12-01 DOI:10.1371/journal.pdig.0000692

Nazila Ameli, Tahereh Firoozi, Monica Gibson, Hollis Lai

{"title":"Classification of periodontitis stage and grade using natural language processing techniques.","authors":"Nazila Ameli, Tahereh Firoozi, Monica Gibson, Hollis Lai","doi":"10.1371/journal.pdig.0000692","DOIUrl":null,"url":null,"abstract":"<p><p>Periodontitis is a complex and microbiome-related inflammatory condition impacting dental supporting tissues. Emphasizing the potential of Clinical Decision Support Systems (CDSS), this study aims to facilitate early diagnosis of periodontitis by extracting patients' information collected as dental charts and notes. We developed a CDSS to predict the stage and grade of periodontitis using natural language processing (NLP) techniques including bidirectional encoder representation for transformers (BERT). We compared the performance of BERT with that of a baseline feature-engineered model. A secondary data analysis was conducted using 309 anonymized patient periodontal charts and corresponding clinician's notes obtained from the university periodontal clinic. After data preprocessing, we added a classification layer on top of the pre-trained BERT model to classify the clinical notes into their corresponding stage and grades. Then, we fine-tuned the pre-trained BERT model on 70% of our data. The performance of the model was evaluated on 32 unseen new patients' clinical notes. The results were compared with the output of a baseline feature-engineered algorithm coupled with MLP techniques to classify the stage and grade of periodontitis. Our proposed BERT model predicted the patients' stage and grade with 77% and 75% accuracy, respectively. MLP model showed that the accuracy of correct classification of stage and grade of the periodontitis on a set of 32 new unseen data was 59.4% and 62.5%, respectively. The BERT model could predict the periodontitis stage and grade on the same new dataset with higher accuracy (66% and 72%, respectively). The utilization of BERT in this context represents a groundbreaking application in dentistry, particularly in CDSS. Our BERT model outperformed baseline models, even with reduced information, promising efficient review of patient notes. This integration of advanced NLP techniques with CDSS frameworks holds potential for timely interventions, preventing complications and reducing healthcare costs.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000692"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11642968/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLOS digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1371/journal.pdig.0000692","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Periodontitis is a complex and microbiome-related inflammatory condition impacting dental supporting tissues. Emphasizing the potential of Clinical Decision Support Systems (CDSS), this study aims to facilitate early diagnosis of periodontitis by extracting patients' information collected as dental charts and notes. We developed a CDSS to predict the stage and grade of periodontitis using natural language processing (NLP) techniques including bidirectional encoder representation for transformers (BERT). We compared the performance of BERT with that of a baseline feature-engineered model. A secondary data analysis was conducted using 309 anonymized patient periodontal charts and corresponding clinician's notes obtained from the university periodontal clinic. After data preprocessing, we added a classification layer on top of the pre-trained BERT model to classify the clinical notes into their corresponding stage and grades. Then, we fine-tuned the pre-trained BERT model on 70% of our data. The performance of the model was evaluated on 32 unseen new patients' clinical notes. The results were compared with the output of a baseline feature-engineered algorithm coupled with MLP techniques to classify the stage and grade of periodontitis. Our proposed BERT model predicted the patients' stage and grade with 77% and 75% accuracy, respectively. MLP model showed that the accuracy of correct classification of stage and grade of the periodontitis on a set of 32 new unseen data was 59.4% and 62.5%, respectively. The BERT model could predict the periodontitis stage and grade on the same new dataset with higher accuracy (66% and 72%, respectively). The utilization of BERT in this context represents a groundbreaking application in dentistry, particularly in CDSS. Our BERT model outperformed baseline models, even with reduced information, promising efficient review of patient notes. This integration of advanced NLP techniques with CDSS frameworks holds potential for timely interventions, preventing complications and reducing healthcare costs.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用自然语言处理技术对牙周炎阶段和等级进行分类。

牙周炎是一种影响牙支撑组织的复杂的微生物相关炎症。强调临床决策支持系统（CDSS）的潜力，本研究旨在通过提取牙科图表和笔记收集的患者信息来促进牙周炎的早期诊断。我们开发了一个CDSS来预测牙周炎的阶段和等级，使用自然语言处理（NLP）技术，包括双向编码器表示转换器（BERT）。我们比较了BERT与基线特征工程模型的性能。对309名匿名患者牙周病表和相应的临床医生记录进行二次数据分析。数据预处理后，我们在预训练的BERT模型上增加一个分类层，将临床笔记分类到相应的阶段和等级。然后，我们在70%的数据上微调预训练的BERT模型。该模型的性能评估了32个看不见的新患者的临床记录。将结果与基线特征工程算法结合MLP技术的输出进行比较，以分类牙周炎的阶段和等级。我们提出的BERT模型预测患者的分期和分级的准确率分别为77%和75%。MLP模型显示，对32组新的未见数据进行牙周炎分期和分级的正确分类准确率分别为59.4%和62.5%。BERT模型可以在相同的新数据集上预测牙周炎的分期和分级，准确率更高（分别为66%和72%）。在这种情况下，BERT的使用代表了牙科，特别是CDSS的开创性应用。我们的BERT模型优于基线模型，即使在减少信息的情况下，也有望有效地审查患者的记录。这种先进的NLP技术与CDSS框架的整合具有及时干预、预防并发症和降低医疗成本的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

PLOS digital health

自引率

0.00%

发文量