Inter-rater disagreement in manual scoring of intensive care unit sleep data.

IF 1.7 Q2 MULTIDISCIPLINARY SCIENCES BMC Research Notes Pub Date : 2025-04-01 DOI:10.1186/s13104-025-07198-z
Laurens Reinke, Esther M van der Heide, Pedro Fonseca, Anthony R Absalom, Jaap E Tulleken
{"title":"Inter-rater disagreement in manual scoring of intensive care unit sleep data.","authors":"Laurens Reinke, Esther M van der Heide, Pedro Fonseca, Anthony R Absalom, Jaap E Tulleken","doi":"10.1186/s13104-025-07198-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Severe sleep disruption is common among intensive care unit (ICU) patients. However, the applicability of standard sleep scoring guidelines by the American Academy of Sleep Medicine (AASM) has been questioned, with most polysomnography (PSG) studies in critically ill patients reporting difficulties in setting up and processing and scoring the recordings. The present study explores human inter-rater agreement in sleep stage scoring following the AASM guidelines, within a heterogenous ICU patient cohort.</p><p><strong>Results: </strong>Two human experts independently scored a total of 51,454 epochs in 20 PSG recordings acquired at the ICU. Epoch-per-epoch comparison of scored stages revealed a Cohen's κ coefficient of agreement of 0.36 for standard 5-stage scoring. Highest agreement occurred in Wake (κ = 0.46), while REM showed the lowest (κ = 0.12). Significant correlations were found between inter-rater agreement, and Simplified Acute Physiology Score (SAPS II, r = - 0.506, p = 0.038), and 12-month mortality (r = - 0.524, p = 0.031). Comparison with similar studies underscore challenges in applying AASM criteria to ICU patients. Despite accounting for artifacts, disparities persisted, emphasizing the need for a nuanced exploration of factors influencing scoring inconsistencies in critically ill patients.</p><p><strong>Trial registration: </strong>Trial was registered as \"Sleep and biorhythm in the ICU\", in the Centrale Commissie Mensgebonden Onderzoek register, with number NL-OMON43659 ( https://onderzoekmetmensen.nl/nl/trial/43659 ), on registration date august 4th 2015.</p>","PeriodicalId":9234,"journal":{"name":"BMC Research Notes","volume":"18 1","pages":"138"},"PeriodicalIF":1.7000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963655/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Research Notes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s13104-025-07198-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: Severe sleep disruption is common among intensive care unit (ICU) patients. However, the applicability of standard sleep scoring guidelines by the American Academy of Sleep Medicine (AASM) has been questioned, with most polysomnography (PSG) studies in critically ill patients reporting difficulties in setting up and processing and scoring the recordings. The present study explores human inter-rater agreement in sleep stage scoring following the AASM guidelines, within a heterogenous ICU patient cohort.

Results: Two human experts independently scored a total of 51,454 epochs in 20 PSG recordings acquired at the ICU. Epoch-per-epoch comparison of scored stages revealed a Cohen's κ coefficient of agreement of 0.36 for standard 5-stage scoring. Highest agreement occurred in Wake (κ = 0.46), while REM showed the lowest (κ = 0.12). Significant correlations were found between inter-rater agreement, and Simplified Acute Physiology Score (SAPS II, r = - 0.506, p = 0.038), and 12-month mortality (r = - 0.524, p = 0.031). Comparison with similar studies underscore challenges in applying AASM criteria to ICU patients. Despite accounting for artifacts, disparities persisted, emphasizing the need for a nuanced exploration of factors influencing scoring inconsistencies in critically ill patients.

Trial registration: Trial was registered as "Sleep and biorhythm in the ICU", in the Centrale Commissie Mensgebonden Onderzoek register, with number NL-OMON43659 ( https://onderzoekmetmensen.nl/nl/trial/43659 ), on registration date august 4th 2015.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
重症监护室睡眠数据人工评分中的评分者间分歧。
目的:重症监护病房(ICU)患者普遍存在严重睡眠障碍。然而,美国睡眠医学学会(AASM)标准睡眠评分指南的适用性一直受到质疑,大多数针对危重患者的多导睡眠图(PSG)研究报告在设置、处理和评分记录方面存在困难。本研究在一个异质性ICU患者队列中,探讨了遵循AASM指南的人类睡眠阶段评分的内部一致性。结果:两位人类专家在ICU获得的20份PSG记录中独立评分共51,454个epoch。对评分阶段进行Epoch-per-epoch比较发现,标准5期评分的Cohen’s κ一致系数为0.36。Wake期一致性最高(κ = 0.46), REM期一致性最低(κ = 0.12)。评分间一致性、简化急性生理评分(SAPS II, r = - 0.506, p = 0.038)和12个月死亡率(r = - 0.524, p = 0.031)之间存在显著相关性。与类似研究的比较强调了将AASM标准应用于ICU患者的挑战。尽管考虑到人为因素,差异仍然存在,强调需要对影响危重患者评分不一致的因素进行细致入微的探索。试验注册:试验在Centrale commission Mensgebonden Onderzoek注册为“ICU中的睡眠和生物节律”,注册号为NL-OMON43659 (https://onderzoekmetmensen.nl/nl/trial/43659),注册日期为2015年8月4日。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Research Notes
BMC Research Notes Biochemistry, Genetics and Molecular Biology-Biochemistry, Genetics and Molecular Biology (all)
CiteScore
3.60
自引率
0.00%
发文量
363
审稿时长
15 weeks
期刊介绍: BMC Research Notes publishes scientifically valid research outputs that cannot be considered as full research or methodology articles. We support the research community across all scientific and clinical disciplines by providing an open access forum for sharing data and useful information; this includes, but is not limited to, updates to previous work, additions to established methods, short publications, null results, research proposals and data management plans.
期刊最新文献
Evaluation of the hospital service quality using the Importance-Performance Analysis (IPA) tool in Ardabil city. Association of the triglyceride-glucose index and its derived indices with carotid artery plaques in postmenopausal women: a cross-sectional study. Prevalence and multimodal factors associated with impaired kidney function among persons with and without HIV in a routine clinic setting: a cross-sectional study. Knowledge of tongue brushing among school children and their parents, and its effects on children's optimal sugar and salt preferences, dental caries, periodontal diseases, and body mass index. Perceived knowledge and intention to prepare advance directives: a cross-sectional study of Thai gynecologic cancer patients and families.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1