Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation.

Q2 Medicine JMIR Cardio Pub Date : 2023-05-03 DOI:10.2196/40524
Jason Hearn, Jef Van den Eynde, Bhargava Chinni, Ari Cedars, Danielle Gottlieb Sen, Shelby Kutty, Cedric Manlhiot
{"title":"Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation.","authors":"Jason Hearn,&nbsp;Jef Van den Eynde,&nbsp;Bhargava Chinni,&nbsp;Ari Cedars,&nbsp;Danielle Gottlieb Sen,&nbsp;Shelby Kutty,&nbsp;Cedric Manlhiot","doi":"10.2196/40524","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Limited data accuracy is often cited as a reason for caution in the integration of physiological data obtained from consumer-oriented wearable devices in care management pathways. The effect of decreasing accuracy on predictive models generated from these data has not been previously investigated.</p><p><strong>Objective: </strong>The aim of this study is to simulate the effect of data degradation on the reliability of prediction models generated from those data and thus determine the extent to which lower device accuracy might or might not limit their use in clinical settings.</p><p><strong>Methods: </strong>Using the Multilevel Monitoring of Activity and Sleep in Healthy People data set, which includes continuous free-living step count and heart rate data from 21 healthy volunteers, we trained a random forest model to predict cardiac competence. Model performance in 75 perturbed data sets with increasing missingness, noisiness, bias, and a combination of all 3 perturbations was compared to model performance for the unperturbed data set.</p><p><strong>Results: </strong>The unperturbed data set achieved a mean root mean square error (RMSE) of 0.079 (SD 0.001) in predicting cardiac competence index. For all types of perturbations, RMSE remained stable up to 20%-30% perturbation. Above this level, RMSE started increasing and reached the point at which the model was no longer predictive at 80% for noise, 50% for missingness, and 35% for the combination of all perturbations. Introducing systematic bias in the underlying data had no effect on RMSE.</p><p><strong>Conclusions: </strong>In this proof-of-concept study, the performance of predictive models for cardiac competence generated from continuously acquired physiological data was relatively stable with declining quality of the source data. As such, lower accuracy of consumer-oriented wearable devices might not be an absolute contraindication for their use in clinical prediction models.</p>","PeriodicalId":14706,"journal":{"name":"JMIR Cardio","volume":"7 ","pages":"e40524"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193221/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Cardio","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/40524","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Limited data accuracy is often cited as a reason for caution in the integration of physiological data obtained from consumer-oriented wearable devices in care management pathways. The effect of decreasing accuracy on predictive models generated from these data has not been previously investigated.

Objective: The aim of this study is to simulate the effect of data degradation on the reliability of prediction models generated from those data and thus determine the extent to which lower device accuracy might or might not limit their use in clinical settings.

Methods: Using the Multilevel Monitoring of Activity and Sleep in Healthy People data set, which includes continuous free-living step count and heart rate data from 21 healthy volunteers, we trained a random forest model to predict cardiac competence. Model performance in 75 perturbed data sets with increasing missingness, noisiness, bias, and a combination of all 3 perturbations was compared to model performance for the unperturbed data set.

Results: The unperturbed data set achieved a mean root mean square error (RMSE) of 0.079 (SD 0.001) in predicting cardiac competence index. For all types of perturbations, RMSE remained stable up to 20%-30% perturbation. Above this level, RMSE started increasing and reached the point at which the model was no longer predictive at 80% for noise, 50% for missingness, and 35% for the combination of all perturbations. Introducing systematic bias in the underlying data had no effect on RMSE.

Conclusions: In this proof-of-concept study, the performance of predictive models for cardiac competence generated from continuously acquired physiological data was relatively stable with declining quality of the source data. As such, lower accuracy of consumer-oriented wearable devices might not be an absolute contraindication for their use in clinical prediction models.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从连续活动和心率监测中产生的预测模型的数据质量下降:使用模拟的探索性分析。
背景:有限的数据准确性经常被认为是在护理管理路径中整合从面向消费者的可穿戴设备获得的生理数据时需要谨慎的原因。从这些数据生成的预测模型的准确性降低的影响以前没有被研究过。目的:本研究的目的是模拟数据退化对由这些数据生成的预测模型可靠性的影响,从而确定较低的设备准确性可能或可能不限制其在临床环境中的使用的程度。方法:利用健康人多层次活动和睡眠监测数据集,包括21名健康志愿者的连续自由生活步数和心率数据,我们训练了一个随机森林模型来预测心脏能力。将缺失度、噪声、偏差和所有3种扰动的组合增加的75个扰动数据集中的模型性能与未扰动数据集的模型性能进行比较。结果:未扰动数据集预测心脏功能指数的均方根误差(RMSE)为0.079 (SD 0.001)。对于所有类型的扰动,RMSE在20%-30%的扰动下保持稳定。超过这个水平,RMSE开始增加,并达到模型不再预测的点,噪声为80%,缺失为50%,所有扰动的组合为35%。在基础数据中引入系统偏差对均方根误差没有影响。结论:在这项概念验证研究中,随着源数据质量的下降,由连续获取的生理数据生成的心脏功能预测模型的性能相对稳定。因此,面向消费者的可穿戴设备的准确性较低,可能并不是它们在临床预测模型中使用的绝对禁忌症。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
JMIR Cardio
JMIR Cardio Computer Science-Computer Science Applications
CiteScore
3.50
自引率
0.00%
发文量
25
审稿时长
12 weeks
期刊最新文献
Predicting Atrial Fibrillation Relapse Using Bayesian Networks: Explainable AI Approach. Wearable Electrocardiogram Technology: Help or Hindrance to the Modern Doctor? Technology Readiness Level and Self-Reported Health in Recipients of an Implantable Cardioverter Defibrillator: Cross-Sectional Study. A Medication Management App (Smart-Meds) for Patients After an Acute Coronary Syndrome: Pilot Pre-Post Mixed Methods Study. Causal Inference for Hypertension Prediction With Wearable E lectrocardiogram and P hotoplethysmogram Signals: Feasibility Study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1