Evaluating Alarm Classifiers with High-confidence Data Programming
Sydney Pugh, I. Ruchkin, Christopher P. Bonafide, S. Demauro, O. Sokolsky, Insup Lee, James Weimer
ACM Transactions on Computing for Healthcare, 3(1), pp. 1-24
Published: 2022-07-22 (Journal Article)
DOI: https://doi.org/10.1145/3549942
Citations: 4
Abstract
Classification of clinical alarms is at the heart of prioritization, suppression, integration, postponement, and other methods of mitigating alarm fatigue. Since these methods directly affect clinical care, alarm classifiers, such as intelligent suppression systems, need to be evaluated in terms of their sensitivity and specificity, which are typically calculated on a labeled dataset of alarms. Unfortunately, the collection and particularly the labeling of such datasets require substantial effort and time, thus deterring hospitals from investigating mitigations of alarm fatigue. This article develops a lightweight method for evaluating alarm classifiers without perfect alarm labels. The method relies on probabilistic labels obtained from data programming, a labeling paradigm based on combining noisy and cheap-to-obtain labeling heuristics. Based on these labels, the method produces confidence bounds for the sensitivity/specificity values that a hypothetical evaluation with manual labeling would yield. Our experiments on five alarm datasets collected at Children's Hospital of Philadelphia show that the proposed method provides accurate bounds on the classifier's sensitivity/specificity, appropriately reflecting the uncertainty from noisy labeling and limited sample sizes.
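The full procedure is described in the article itself; as a rough, hypothetical illustration of the underlying idea, the Python sketch below scores a classifier's sensitivity and specificity against probabilistic labels (each label is a probability that an alarm is truly actionable) and uses a simple percentile bootstrap to express uncertainty from the finite sample as confidence bounds. The function names, bootstrap choice, and synthetic data are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def prob_sens_spec(prob_labels, predictions):
    """Estimate sensitivity/specificity from probabilistic labels.

    prob_labels[i] is the (heuristically derived) probability that alarm i
    is truly positive; predictions[i] is the classifier's 0/1 output.
    """
    p = np.asarray(prob_labels, dtype=float)
    yhat = np.asarray(predictions, dtype=float)
    # Expected true positives over expected positives under the soft labels.
    sens = np.sum(p * yhat) / np.sum(p)
    # Expected true negatives over expected negatives under the soft labels.
    spec = np.sum((1.0 - p) * (1.0 - yhat)) / np.sum(1.0 - p)
    return sens, spec

def bootstrap_bounds(prob_labels, predictions, n_boot=2000, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence bounds for the estimates above."""
    rng = np.random.default_rng(seed)
    p = np.asarray(prob_labels, dtype=float)
    yhat = np.asarray(predictions, dtype=float)
    n = len(p)
    stats = []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample alarms with replacement
        stats.append(prob_sens_spec(p[idx], yhat[idx]))
    stats = np.array(stats)
    lo, hi = 100 * alpha / 2, 100 * (1 - alpha / 2)
    return np.percentile(stats[:, 0], [lo, hi]), np.percentile(stats[:, 1], [lo, hi])

if __name__ == "__main__":
    # Synthetic example: noisy soft labels and a toy "alarm classifier".
    rng = np.random.default_rng(1)
    true_labels = rng.integers(0, 2, size=500)  # unknown in practice
    prob_labels = np.clip(true_labels + rng.normal(0, 0.2, 500), 0, 1)
    predictions = (prob_labels + rng.normal(0, 0.3, 500)) > 0.5
    print(prob_sens_spec(prob_labels, predictions))
    print(bootstrap_bounds(prob_labels, predictions))
```

Note that this toy bootstrap captures only sampling uncertainty; the article additionally accounts for the noise in the data-programming labels themselves when producing its bounds.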