通过数据增强模拟逼真的连续葡萄糖监测仪时间序列

IF 4.1 Q2 ENDOCRINOLOGY & METABOLISM Journal of Diabetes Science and Technology Pub Date : 2025-01-01 Epub Date: 2023-06-23 DOI:10.1177/19322968231181138
Louis A Gomez, Adedolapo Aishat Toye, R Stanley Hum, Samantha Kleinberg
{"title":"通过数据增强模拟逼真的连续葡萄糖监测仪时间序列","authors":"Louis A Gomez, Adedolapo Aishat Toye, R Stanley Hum, Samantha Kleinberg","doi":"10.1177/19322968231181138","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Simulated data are a powerful tool for research, enabling benchmarking of blood glucose (BG) forecasting and control algorithms. However, expert created models provide an unrealistic view of real-world performance, as they lack the features that make real data challenging, while black-box approaches such as generative adversarial networks do not enable systematic tests to diagnose model performance.</p><p><strong>Methods: </strong>To address this, we propose a method that learns missingness and error properties of continuous glucose monitor (CGM) data collected from people with type 1 diabetes (OpenAPS, OhioT1DM, RCT, and Racial-Disparity), and then augments simulated BG data with these properties. On the task of BG forecasting, we test how well our method brings performance closer to that of real CGM data compared with current simulation practices for missing data (random dropout) and error (Gaussian noise, CGM error model).</p><p><strong>Results: </strong>Our methods had the smallest performance difference versus real data compared with random dropout and Gaussian noise when individually testing the effects of missing data and error on simulated BG in most cases. When combined, our approach was significantly better than Gaussian noise and random dropout for all data sets except OhioT1DM. Our error model significantly improved results on diverse data sets.</p><p><strong>Conclusions: </strong>We find a significant gap between BG forecasting performance on simulated and real data, and our method can be used to close this gap. This will enable researchers to rigorously test algorithms and provide realistic estimates of real-world performance without overfitting to real data or at the expense of data collection.</p>","PeriodicalId":15475,"journal":{"name":"Journal of Diabetes Science and Technology","volume":" ","pages":"114-122"},"PeriodicalIF":4.1000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11688677/pdf/","citationCount":"0","resultStr":"{\"title\":\"Simulating Realistic Continuous Glucose Monitor Time Series By Data Augmentation.\",\"authors\":\"Louis A Gomez, Adedolapo Aishat Toye, R Stanley Hum, Samantha Kleinberg\",\"doi\":\"10.1177/19322968231181138\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Simulated data are a powerful tool for research, enabling benchmarking of blood glucose (BG) forecasting and control algorithms. However, expert created models provide an unrealistic view of real-world performance, as they lack the features that make real data challenging, while black-box approaches such as generative adversarial networks do not enable systematic tests to diagnose model performance.</p><p><strong>Methods: </strong>To address this, we propose a method that learns missingness and error properties of continuous glucose monitor (CGM) data collected from people with type 1 diabetes (OpenAPS, OhioT1DM, RCT, and Racial-Disparity), and then augments simulated BG data with these properties. On the task of BG forecasting, we test how well our method brings performance closer to that of real CGM data compared with current simulation practices for missing data (random dropout) and error (Gaussian noise, CGM error model).</p><p><strong>Results: </strong>Our methods had the smallest performance difference versus real data compared with random dropout and Gaussian noise when individually testing the effects of missing data and error on simulated BG in most cases. When combined, our approach was significantly better than Gaussian noise and random dropout for all data sets except OhioT1DM. Our error model significantly improved results on diverse data sets.</p><p><strong>Conclusions: </strong>We find a significant gap between BG forecasting performance on simulated and real data, and our method can be used to close this gap. This will enable researchers to rigorously test algorithms and provide realistic estimates of real-world performance without overfitting to real data or at the expense of data collection.</p>\",\"PeriodicalId\":15475,\"journal\":{\"name\":\"Journal of Diabetes Science and Technology\",\"volume\":\" \",\"pages\":\"114-122\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11688677/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Diabetes Science and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/19322968231181138\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/6/23 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"ENDOCRINOLOGY & METABOLISM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Diabetes Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/19322968231181138","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/6/23 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

摘要

背景:模拟数据是一种强大的研究工具,可对血糖 (BG) 预测和控制算法进行基准测试。然而,专家创建的模型无法反映真实世界的性能,因为它们缺乏使真实数据具有挑战性的特征,而生成式对抗网络等黑盒子方法无法进行系统测试以诊断模型性能:为了解决这个问题,我们提出了一种方法,它可以学习从 1 型糖尿病患者(OpenAPS、OhioT1DM、RCT 和种族差异)处收集的连续血糖监测仪(CGM)数据的遗漏和误差特性,然后利用这些特性增强模拟血糖数据。在血糖预测任务中,我们测试了我们的方法在缺失数据(随机辍学)和误差(高斯噪声、CGM 误差模型)方面与目前的模拟实践相比,如何使性能更接近真实 CGM 数据:在大多数情况下,单独测试缺失数据和误差对模拟 BG 的影响时,我们的方法与真实数据的性能差异最小,而与随机遗漏和高斯噪声的性能差异最大。在除 OhioT1DM 以外的所有数据集上,我们的方法综合起来明显优于高斯噪声和随机遗漏。我们的误差模型明显改善了不同数据集的结果:我们发现模拟数据和真实数据的 BG 预测性能之间存在明显差距,而我们的方法可用来缩小这一差距。这将使研究人员能够严格测试算法,并提供真实世界性能的现实估计,而不会过度拟合真实数据或牺牲数据收集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Simulating Realistic Continuous Glucose Monitor Time Series By Data Augmentation.

Background: Simulated data are a powerful tool for research, enabling benchmarking of blood glucose (BG) forecasting and control algorithms. However, expert created models provide an unrealistic view of real-world performance, as they lack the features that make real data challenging, while black-box approaches such as generative adversarial networks do not enable systematic tests to diagnose model performance.

Methods: To address this, we propose a method that learns missingness and error properties of continuous glucose monitor (CGM) data collected from people with type 1 diabetes (OpenAPS, OhioT1DM, RCT, and Racial-Disparity), and then augments simulated BG data with these properties. On the task of BG forecasting, we test how well our method brings performance closer to that of real CGM data compared with current simulation practices for missing data (random dropout) and error (Gaussian noise, CGM error model).

Results: Our methods had the smallest performance difference versus real data compared with random dropout and Gaussian noise when individually testing the effects of missing data and error on simulated BG in most cases. When combined, our approach was significantly better than Gaussian noise and random dropout for all data sets except OhioT1DM. Our error model significantly improved results on diverse data sets.

Conclusions: We find a significant gap between BG forecasting performance on simulated and real data, and our method can be used to close this gap. This will enable researchers to rigorously test algorithms and provide realistic estimates of real-world performance without overfitting to real data or at the expense of data collection.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Diabetes Science and Technology
Journal of Diabetes Science and Technology Medicine-Internal Medicine
CiteScore
7.50
自引率
12.00%
发文量
148
期刊介绍: The Journal of Diabetes Science and Technology (JDST) is a bi-monthly, peer-reviewed scientific journal published by the Diabetes Technology Society. JDST covers scientific and clinical aspects of diabetes technology including glucose monitoring, insulin and metabolic peptide delivery, the artificial pancreas, digital health, precision medicine, social media, cybersecurity, software for modeling, physiologic monitoring, technology for managing obesity, and diagnostic tests of glycation. The journal also covers the development and use of mobile applications and wireless communication, as well as bioengineered tools such as MEMS, new biomaterials, and nanotechnology to develop new sensors. Articles in JDST cover both basic research and clinical applications of technologies being developed to help people with diabetes.
期刊最新文献
How to Provide Additional Oversight to Ensure That Remote Patient Monitoring for People With Diabetes is Being Used and Billed Appropriately. Impact of Recording Interval in Continuous Glucose Monitoring on Calculating the Metrics of Glycemic Control. Effectiveness of Mobile Health Applications for Cardiometabolic Risk Reduction in Urban and Rural India: A Pilot, Randomized Controlled Study. Efficacy and Safety of an Electronic Glycemic Management System for Optimizing Insulin Therapy in Noncritical Patients With Diabetes: A Randomized Trial. Using One-Shot Prompting of Non-Fine-Tuned Commercial Artificial Intelligence to Assess Nutrients from Photographs of Japanese Meals.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1