Automating sedation state assessments using natural language processing

IF 2.9 3区医学 Q1 NURSING Journal of Nursing Scholarship Pub Date : 2024-03-26 DOI:10.1111/jnu.12968

Aaron Conway, Jack Li, Mohammad Goudarzi Rad, Blair Warren, Sebastian Mafeld, Babak Taati

{"title":"Automating sedation state assessments using natural language processing","authors":"Aaron Conway, Jack Li, Mohammad Goudarzi Rad, Blair Warren, Sebastian Mafeld, Babak Taati","doi":"10.1111/jnu.12968","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Introduction</h3>\n \n <p>Common goals for procedural sedation are to control pain and ensure the patient is not moving to an extent that is impeding safe progress or completion of the procedure. Clinicians perform regular assessments of the adequacy of procedural sedation in accordance with these goals to inform their decision-making around sedation titration and also for documentation of the care provided. Natural language processing could be applied to real-time transcriptions of audio recordings made during procedures in order to classify sedation states that involve movement and pain, which could then be integrated into clinical documentation systems. The aim of this study was to determine whether natural language processing algorithms will work with sufficient accuracy to detect sedation states during procedural sedation.</p>\n </section>\n \n <section>\n \n <h3> Design</h3>\n \n <p>A prospective observational study was conducted.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Audio recordings from consenting participants undergoing elective procedures performed in the interventional radiology suite at a large academic hospital were transcribed using an automated speech recognition model. Sentences of transcribed text were used to train and evaluate several different NLP pipelines for a text classification task. The NLP pipelines we evaluated included a simple Bag-of-Words (BOW) model, an ensemble architecture combining a linear BOW model and a “token-to-vector” (Tok2Vec) component, and a transformer-based architecture using the RoBERTa pre-trained model.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>A total of 15,936 sentences from transcriptions of 82 procedures was included in the analysis. The RoBERTa model achieved the highest performance among the three models with an area under the ROC curve (AUC-ROC) of 0.97, an F1 score of 0.87, a precision of 0.86, and a recall of 0.89. The Ensemble model showed a similarly high AUC-ROC of 0.96, but lower F1 score of 0.79, precision of 0.83, and recall of 0.77. The BOW approach achieved an AUC-ROC of 0.97 and the F1 score was 0.7, precision was 0.83 and recall was 0.66.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>The transformer-based architecture using the RoBERTa pre-trained model achieved the best classification performance. Further research is required to confirm the that this natural language processing pipeline can accurately perform text classifications with real-time audio data to allow for automated sedation state assessments.</p>\n </section>\n \n <section>\n \n <h3> Clinical Relevance</h3>\n \n <p>Automating sedation state assessments using natural language processing pipelines would allow for more timely documentation of the care received by sedated patients, and, at the same time, decrease documentation burden for clinicians. Downstream applications can also be generated from the classifications, including for example real-time visualizations of sedation state, which may facilitate improved communication of the adequacy of the sedation between clinicians, who may be performing supervision remotely. Also, accumulation of sedation state assessments from multiple procedures may reveal insights into the efficacy of particular sedative medications or identify procedures where the current approach for sedation and analgesia is not optimal (i.e. a significant amount of time spent in “pain” or “movement” sedation states).</p>\n </section>\n </div>","PeriodicalId":51091,"journal":{"name":"Journal of Nursing Scholarship","volume":"57 1","pages":"17-27"},"PeriodicalIF":2.9000,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11771536/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Nursing Scholarship","FirstCategoryId":"3","ListUrlMain":"https://sigmapubs.onlinelibrary.wiley.com/doi/10.1111/jnu.12968","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NURSING","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction

Common goals for procedural sedation are to control pain and ensure the patient is not moving to an extent that is impeding safe progress or completion of the procedure. Clinicians perform regular assessments of the adequacy of procedural sedation in accordance with these goals to inform their decision-making around sedation titration and also for documentation of the care provided. Natural language processing could be applied to real-time transcriptions of audio recordings made during procedures in order to classify sedation states that involve movement and pain, which could then be integrated into clinical documentation systems. The aim of this study was to determine whether natural language processing algorithms will work with sufficient accuracy to detect sedation states during procedural sedation.

Design

A prospective observational study was conducted.

Methods

Audio recordings from consenting participants undergoing elective procedures performed in the interventional radiology suite at a large academic hospital were transcribed using an automated speech recognition model. Sentences of transcribed text were used to train and evaluate several different NLP pipelines for a text classification task. The NLP pipelines we evaluated included a simple Bag-of-Words (BOW) model, an ensemble architecture combining a linear BOW model and a “token-to-vector” (Tok2Vec) component, and a transformer-based architecture using the RoBERTa pre-trained model.

Results

A total of 15,936 sentences from transcriptions of 82 procedures was included in the analysis. The RoBERTa model achieved the highest performance among the three models with an area under the ROC curve (AUC-ROC) of 0.97, an F1 score of 0.87, a precision of 0.86, and a recall of 0.89. The Ensemble model showed a similarly high AUC-ROC of 0.96, but lower F1 score of 0.79, precision of 0.83, and recall of 0.77. The BOW approach achieved an AUC-ROC of 0.97 and the F1 score was 0.7, precision was 0.83 and recall was 0.66.

Conclusion

The transformer-based architecture using the RoBERTa pre-trained model achieved the best classification performance. Further research is required to confirm the that this natural language processing pipeline can accurately perform text classifications with real-time audio data to allow for automated sedation state assessments.

Clinical Relevance

Automating sedation state assessments using natural language processing pipelines would allow for more timely documentation of the care received by sedated patients, and, at the same time, decrease documentation burden for clinicians. Downstream applications can also be generated from the classifications, including for example real-time visualizations of sedation state, which may facilitate improved communication of the adequacy of the sedation between clinicians, who may be performing supervision remotely. Also, accumulation of sedation state assessments from multiple procedures may reveal insights into the efficacy of particular sedative medications or identify procedures where the current approach for sedation and analgesia is not optimal (i.e. a significant amount of time spent in “pain” or “movement” sedation states).

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用自然语言处理实现镇静状态评估自动化。

导言：手术镇静的共同目标是控制疼痛并确保患者的活动不会妨碍手术的安全进行或完成。临床医生会根据这些目标定期评估手术镇静剂的充分性，以便为镇静剂滴定的决策提供依据，同时也为所提供的护理进行记录。自然语言处理可应用于手术过程中的实时录音转录，以便对涉及运动和疼痛的镇静状态进行分类，然后将其整合到临床文档系统中。本研究旨在确定自然语言处理算法是否能足够准确地检测手术镇静过程中的镇静状态：设计：进行了一项前瞻性观察研究：方法：使用自动语音识别模型转录了在一家大型学术医院介入放射科病房接受择期手术的同意参与者的录音。转录文本的句子被用于训练和评估文本分类任务中几种不同的 NLP 管道。我们评估的 NLP 管道包括一个简单的词袋 (BOW) 模型、一个结合线性 BOW 模型和 "标记到向量"（Tok2Vec）组件的集合架构，以及一个使用 RoBERTa 预训练模型的基于转换器的架构：分析共包括来自 82 个程序转录的 15,936 个句子。在三个模型中，RoBERTa 模型的性能最高，其 ROC 曲线下面积（AUC-ROC）为 0.97，F1 得分为 0.87，精确度为 0.86，召回率为 0.89。集合模型的 AUC-ROC 同样高达 0.96，但 F1 得分为 0.79，精确度为 0.83，召回率为 0.77。BOW 方法的 AUC-ROC 为 0.97，F1 得分为 0.7，精确度为 0.83，召回率为 0.66：结论：使用 RoBERTa 预训练模型的基于变压器的架构取得了最佳分类性能。还需要进一步研究，以确认该自然语言处理管道能够准确地对实时音频数据进行文本分类，从而实现自动镇静状态评估：临床相关性：使用自然语言处理管道自动进行镇静状态评估，可以更及时地记录镇静患者所接受的护理，同时减轻临床医生的记录负担。还可以从分类中生成下游应用，例如包括镇静状态的实时可视化，这有助于改善临床医生之间就镇静是否充分进行交流，而临床医生可能是远程执行监护的。此外，积累多个手术的镇静状态评估结果，还可揭示特定镇静药物的疗效，或确定当前镇静和镇痛方法不理想的手术（即大量时间处于 "疼痛 "或 "运动 "镇静状态）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Nursing Scholarship 医学-护理

CiteScore

6.30

自引率

5.90%

发文量

审稿时长

6-12 weeks

期刊介绍： This widely read and respected journal features peer-reviewed, thought-provoking articles representing research by some of the world’s leading nurse researchers. Reaching health professionals, faculty and students in 103 countries, the Journal of Nursing Scholarship is focused on health of people throughout the world. It is the official journal of Sigma Theta Tau International and it reflects the society’s dedication to providing the tools necessary to improve nursing care around the world.