首页 > 最新文献

Methods of Information in Medicine最新文献

英文 中文
One Digital Health for more FAIRness. 一个数字健康更公平。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 DOI: 10.1055/a-1938-0533
Oscar Tamburis, Arriel Benis

Background: One Digital Health (ODH) aims to propose a framework that merges One Health's and Digital Health's specific features into an innovative landscape. FAIR (Findable, Accessible, Interoperable, and Reusable) principles consider applications and computational agents (or, in other terms, data, metadata, and infrastructures) as stakeholders with the capacity to find, access, interoperate, and reuse data with none or minimal human intervention.

Objectives: This paper aims to elicit how the ODH framework is compliant with FAIR principles and metrics, providing some thinking guide to investigate and define whether adapted metrics need to be figured out for an effective ODH Intervention setup.

Methods: An integrative analysis of the literature was conducted to extract instances of the need-or of the eventual already existing deployment-of FAIR principles, for each of the three layers (keys, perspectives and dimensions) of the ODH framework. The scope was to assess the extent of scatteredness in pursuing the many facets of FAIRness, descending from the lack of a unifying and balanced framework.

Results: A first attempt to interpret the different technological components existing in the different layers of the ODH framework, in the light of the FAIR principles, was conducted. Although the mature and working examples of workflows for data FAIRification processes currently retrievable in the literature provided a robust ground to work on, a nonsuitable capacity to fully assess FAIR aspects for highly interconnected scenarios, which the ODH-based ones are, has emerged. Rooms for improvement are anyway possible to timely deal with all the underlying features of topics like the delivery of health care in a syndemic scenario, the digital transformation of human and animal health data, or the digital nature conservation through digital technology-based intervention.

Conclusions: ODH pillars account for the availability (findability, accessibility) of human, animal, and environmental data allowing a unified understanding of complex interactions (interoperability) over time (reusability). A vision of integration between these two worlds, under the vest of ODH Interventions featuring FAIRness characteristics, toward the development of a systemic lookup of health and ecology in a digitalized way, is therefore auspicable.

背景:One Digital Health (ODH)旨在提出一个框架,将One Health和Digital Health的具体功能合并到一个创新的景观中。FAIR(可查找、可访问、可互操作和可重用)原则将应用程序和计算代理(或者,换句话说,数据、元数据和基础设施)视为具有查找、访问、互操作和重用数据的能力的涉众,无需或最少的人为干预。目的:本文旨在引出ODH框架如何符合FAIR原则和指标,为调查和定义是否需要为有效的ODH干预设置制定适应指标提供一些思路指导。方法:对文献进行了综合分析,以提取对公平原则的需求或最终已经存在的部署的实例,用于ODH框架的三个层面(关键、视角和维度)。其范围是评估由于缺乏统一和平衡的框架,在追求公平的许多方面分散的程度。结果:根据公平原则,首次尝试解释存在于ODH框架不同层中的不同技术组件。虽然目前在文献中可检索到的数据公平流程的成熟和工作示例为工作提供了坚实的基础,但对于高度互联的场景(基于odh的场景),已经出现了不适合全面评估公平方面的能力。无论如何,改进的空间是可以及时处理各种主题的所有潜在特征的,例如在疾病情况下提供医疗保健,人类和动物健康数据的数字化转换,或通过基于数字技术的干预进行数字自然保护。结论:ODH支柱解释了人类、动物和环境数据的可用性(可查找性、可访问性),允许对复杂交互(互操作性)的统一理解(可重用性)。因此,在以公平为特征的ODH干预措施的支持下,将这两个世界整合起来,以数字化的方式对健康和生态进行系统的查找,这是值得期待的。
{"title":"One Digital Health for more FAIRness.","authors":"Oscar Tamburis,&nbsp;Arriel Benis","doi":"10.1055/a-1938-0533","DOIUrl":"https://doi.org/10.1055/a-1938-0533","url":null,"abstract":"<p><strong>Background: </strong>One Digital Health (ODH) aims to propose a framework that merges One Health's and Digital Health's specific features into an innovative landscape. FAIR (Findable, Accessible, Interoperable, and Reusable) principles consider applications and computational agents (or, in other terms, data, metadata, and infrastructures) as stakeholders with the capacity to find, access, interoperate, and reuse data with none or minimal human intervention.</p><p><strong>Objectives: </strong>This paper aims to elicit how the ODH framework is compliant with FAIR principles and metrics, providing some thinking guide to investigate and define whether adapted metrics need to be figured out for an effective ODH Intervention setup.</p><p><strong>Methods: </strong>An integrative analysis of the literature was conducted to extract instances of the need-or of the eventual already existing deployment-of FAIR principles, for each of the three layers (keys, perspectives and dimensions) of the ODH framework. The scope was to assess the extent of scatteredness in pursuing the many facets of FAIRness, descending from the lack of a unifying and balanced framework.</p><p><strong>Results: </strong>A first attempt to interpret the different technological components existing in the different layers of the ODH framework, in the light of the FAIR principles, was conducted. Although the mature and working examples of workflows for data FAIRification processes currently retrievable in the literature provided a robust ground to work on, a nonsuitable capacity to fully assess FAIR aspects for highly interconnected scenarios, which the ODH-based ones are, has emerged. Rooms for improvement are anyway possible to timely deal with all the underlying features of topics like the delivery of health care in a syndemic scenario, the digital transformation of human and animal health data, or the digital nature conservation through digital technology-based intervention.</p><p><strong>Conclusions: </strong>ODH pillars account for the availability (findability, accessibility) of human, animal, and environmental data allowing a unified understanding of complex interactions (interoperability) over time (reusability). A vision of integration between these two worlds, under the vest of ODH Interventions featuring FAIRness characteristics, toward the development of a systemic lookup of health and ecology in a digitalized way, is therefore auspicable.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 02","pages":"e116-e124"},"PeriodicalIF":1.7,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/85/69/10-1055-a-1938-0533.PMC9788917.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9254115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Multivariate Sequential Analytics for Cardiovascular Disease Event Prediction. 用于心血管疾病事件预测的多变量序列分析。
IF 1.3 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 Epub Date: 2022-12-23 DOI: 10.1055/s-0042-1758687
William Hsu, Jim Warren, Patricia Riddle

Background: Automated clinical decision support for risk assessment is a powerful tool in combating cardiovascular disease (CVD), enabling targeted early intervention that could avoid issues of overtreatment or undertreatment. However, current CVD risk prediction models use observations at baseline without explicitly representing patient history as a time series.

Objective: The aim of this study is to examine whether by explicitly modelling the temporal dimension of patient history event prediction may be improved.

Methods: This study investigates methods for multivariate sequential modelling with a particular emphasis on long short-term memory (LSTM) recurrent neural networks. Data from a CVD decision support tool is linked to routinely collected national datasets including pharmaceutical dispensing, hospitalization, laboratory test results, and deaths. The study uses a 2-year observation and a 5-year prediction window. Selected methods are applied to the linked dataset. The experiments performed focus on CVD event prediction. CVD death or hospitalization in a 5-year interval was predicted for patients with history of lipid-lowering therapy.

Results: The results of the experiments showed temporal models are valuable for CVD event prediction over a 5-year interval. This is especially the case for LSTM, which produced the best predictive performance among all models compared achieving AUROC of 0.801 and average precision of 0.425. The non-temporal model comparator ridge classifier (RC) trained using all quarterly data or by aggregating quarterly data (averaging time-varying features) was highly competitive achieving AUROC of 0.799 and average precision of 0.420 and AUROC of 0.800 and average precision of 0.421, respectively.

Conclusion: This study provides evidence that the use of deep temporal models particularly LSTM in clinical decision support for chronic disease would be advantageous with LSTM significantly improving on commonly used regression models such as logistic regression and Cox proportional hazards on the task of CVD event prediction.

背景:风险评估的自动化临床决策支持是防治心血管疾病(CVD)的有力工具,可实现有针对性的早期干预,避免过度治疗或治疗不当的问题。然而,目前的心血管疾病风险预测模型使用的是基线观测数据,没有将患者病史明确表示为时间序列:本研究旨在探讨是否可以通过明确模拟患者病史的时间维度来改进事件预测:本研究探讨了多变量序列建模方法,并特别强调了长短期记忆(LSTM)递归神经网络。来自心血管疾病决策支持工具的数据与常规收集的国家数据集(包括配药、住院、实验室检测结果和死亡)相连接。研究使用了 2 年观察期和 5 年预测期。选定的方法被应用于链接数据集。实验重点是心血管疾病事件预测。对有降脂治疗史的患者进行了 5 年间隔期内心血管疾病死亡或住院预测:实验结果表明,时间模型对于预测 5 年间的心血管疾病事件很有价值。在所有比较模型中,LSTM 的预测性能最佳,AUROC 为 0.801,平均精度为 0.425。使用所有季度数据或通过聚合季度数据(平均时变特征)训练的非时态模型比较模型脊分类器(RC)具有很强的竞争力,AUROC 为 0.799,平均精度为 0.420;AUROC 为 0.800,平均精度为 0.421:这项研究证明,在慢性病临床决策支持中使用深度时空模型,尤其是 LSTM,将具有优势,在心血管疾病事件预测任务中,LSTM 明显优于逻辑回归和 Cox 比例危险等常用回归模型。
{"title":"Multivariate Sequential Analytics for Cardiovascular Disease Event Prediction.","authors":"William Hsu, Jim Warren, Patricia Riddle","doi":"10.1055/s-0042-1758687","DOIUrl":"10.1055/s-0042-1758687","url":null,"abstract":"<p><strong>Background: </strong>Automated clinical decision support for risk assessment is a powerful tool in combating cardiovascular disease (CVD), enabling targeted early intervention that could avoid issues of overtreatment or undertreatment. However, current CVD risk prediction models use observations at baseline without explicitly representing patient history as a time series.</p><p><strong>Objective: </strong>The aim of this study is to examine whether by explicitly modelling the temporal dimension of patient history event prediction may be improved.</p><p><strong>Methods: </strong>This study investigates methods for multivariate sequential modelling with a particular emphasis on long short-term memory (LSTM) recurrent neural networks. Data from a CVD decision support tool is linked to routinely collected national datasets including pharmaceutical dispensing, hospitalization, laboratory test results, and deaths. The study uses a 2-year observation and a 5-year prediction window. Selected methods are applied to the linked dataset. The experiments performed focus on CVD event prediction. CVD death or hospitalization in a 5-year interval was predicted for patients with history of lipid-lowering therapy.</p><p><strong>Results: </strong>The results of the experiments showed temporal models are valuable for CVD event prediction over a 5-year interval. This is especially the case for LSTM, which produced the best predictive performance among all models compared achieving AUROC of 0.801 and average precision of 0.425. The non-temporal model comparator ridge classifier (RC) trained using all quarterly data or by aggregating quarterly data (averaging time-varying features) was highly competitive achieving AUROC of 0.799 and average precision of 0.420 and AUROC of 0.800 and average precision of 0.421, respectively.</p><p><strong>Conclusion: </strong>This study provides evidence that the use of deep temporal models particularly LSTM in clinical decision support for chronic disease would be advantageous with LSTM significantly improving on commonly used regression models such as logistic regression and Cox proportional hazards on the task of CVD event prediction.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 02","pages":"e149-e171"},"PeriodicalIF":1.3,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/af/00/10-1055-s-0042-1758687.PMC9788915.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9247803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FAIR Aspects of a Health Information Protection and Management System. 健康信息保护和管理系统的公平方面。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 DOI: 10.1055/s-0042-1758765
Jaime Delgado, Silvia Llorente

Background: Privacy management is a key issue when dealing with storage and distribution of health information. However, FAIR (Findability, Accessibility, Interoperability, and Reusability) principles when sharing information are in increasing demand in several organizations, especially for information generated in public-funded research projects.

Objectives: The two main objectives of the presented work are the definition of a secure and interoperable modular architecture to manage different kinds of medical content (xIPAMS [x, for Any kind of content, Information Protection And Management System] and HIPAMS [Health Information Protection And Management System]), and the application of FAIR principles to that architecture in such a way that privacy and security are compatible with FAIR.

Methods: We propose the concept of xIPAMS as a modular architecture, following standards for interoperability, which defines mechanisms for privacy, protection, storage, search, and access to health-related information.

Results: xIPAMS provides FAIR principles and preserves patient's privacy. For each module, we identify how FAIR principles apply.

Conclusions: We have analyzed how xIPAMS, and in particular HIPAMS (Health content), support the FAIR principles focusing on security and privacy. We have identified the FAIR principles supported by the different xIPAMS modules, concluding that the four principles are supported. Our analysis has also considered a possible implementation based on the concept of DACS (Document Access and Communication System), a system storing medical documents in a private and secure way. In addition, we have analyzed security aspects of the FAIRification process and how they are provided by xIPAMS modules.

背景:隐私管理是处理健康信息存储和分发时的一个关键问题。然而,在一些组织中,共享信息时对FAIR(可查找性、可访问性、互操作性和可重用性)原则的需求越来越大,特别是对于公共资助的研究项目中生成的信息。目标:提出的工作的两个主要目标是定义一个安全的、可互操作的模块化架构来管理不同类型的医疗内容(xIPAMS [x,用于任何类型的内容、信息保护和管理系统]和HIPAMS[健康信息保护和管理系统]),以及将FAIR原则应用于该架构,从而使隐私和安全与FAIR兼容。方法:我们提出了xIPAMS作为模块化架构的概念,遵循互操作性标准,定义了隐私、保护、存储、搜索和访问健康相关信息的机制。结果:xIPAMS遵循公平原则,保护患者隐私。对于每个模块,我们确定公平原则如何适用。结论:我们分析了xIPAMS,特别是HIPAMS(健康内容)如何支持以安全和隐私为重点的FAIR原则。我们已经确定了不同xIPAMS模块支持的FAIR原则,得出的结论是支持这四项原则。我们的分析还考虑了一种基于DACS(文档访问和通信系统)概念的可能实现,DACS是一种以私有和安全的方式存储医疗文档的系统。此外,我们还分析了标准化过程的安全方面以及xIPAMS模块如何提供这些方面。
{"title":"FAIR Aspects of a Health Information Protection and Management System.","authors":"Jaime Delgado,&nbsp;Silvia Llorente","doi":"10.1055/s-0042-1758765","DOIUrl":"https://doi.org/10.1055/s-0042-1758765","url":null,"abstract":"<p><strong>Background: </strong>Privacy management is a key issue when dealing with storage and distribution of health information. However, FAIR (Findability, Accessibility, Interoperability, and Reusability) principles when sharing information are in increasing demand in several organizations, especially for information generated in public-funded research projects.</p><p><strong>Objectives: </strong>The two main objectives of the presented work are the definition of a secure and interoperable modular architecture to manage different kinds of medical content (xIPAMS [x, for Any kind of content, Information Protection And Management System] and HIPAMS [Health Information Protection And Management System]), and the application of FAIR principles to that architecture in such a way that privacy and security are compatible with FAIR.</p><p><strong>Methods: </strong>We propose the concept of xIPAMS as a modular architecture, following standards for interoperability, which defines mechanisms for privacy, protection, storage, search, and access to health-related information.</p><p><strong>Results: </strong>xIPAMS provides FAIR principles and preserves patient's privacy. For each module, we identify how FAIR principles apply.</p><p><strong>Conclusions: </strong>We have analyzed how xIPAMS, and in particular HIPAMS (Health content), support the FAIR principles focusing on security and privacy. We have identified the FAIR principles supported by the different xIPAMS modules, concluding that the four principles are supported. Our analysis has also considered a possible implementation based on the concept of DACS (Document Access and Communication System), a system storing medical documents in a private and secure way. In addition, we have analyzed security aspects of the FAIRification process and how they are provided by xIPAMS modules.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 02","pages":"e172-e182"},"PeriodicalIF":1.7,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/99/13/10-1055-s-0042-1758765.PMC9788908.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9609042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
DxGenerator: An Improved Differential Diagnosis Generator for Primary Care Based on MetaMap and Semantic Reasoning. DxGenerator:基于元地图和语义推理的初级保健改进鉴别诊断生成器。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 DOI: 10.1055/a-1905-5639
Ali Sanaeifar, Saeid Eslami, Mitra Ahadi, Mohsen Kahani, Hassan Vakili Arki

Background: In recent years, researchers have used many computerized interventions to reduce medical errors, the third cause of death in developed countries. One of such interventions is using differential diagnosis generators in primary care, where physicians may encounter initial symptoms without any diagnostic presuppositions. These systems generate multiple diagnoses, ranked by their likelihood. As such, these reports' accuracy can be determined by the location of the correct diagnosis in the list.

Objective: This study aimed to design and evaluate a novel practical web-based differential diagnosis generator solution in primary care.

Methods: In this research, a new online clinical decision support system, called DxGenerator, was designed to improve diagnostic accuracy; to this end, an attempt was made to converge a semantic database with the unified medical language system (UMLS) knowledge base, using MetaMap tool and natural language processing. In this regard, 120 diseases of gastrointestinal organs causing abdominal pain were modeled into the database. After designing an inference engine and a pseudo-free-text interactive interface, 172 patient vignettes were inputted into DxGenerator and ISABEL, the most accurate similar system. The Wilcoxon signed ranked test was used to compare the position of correct diagnoses in DxGenerator and ISABEL. The α level was defined as 0.05.

Results: On a total of 172 vignettes, the mean and standard deviation of correct diagnosis positions improved from 4.2 ± 5.3 in ISABEL to 3.2 ± 3.9 in DxGenerator. This improvement was significant in the subgroup of uncommon diseases (p-value < 0.05).

Conclusion: Using UMLS knowledge base and MetaMap Tools can improve the accuracy of diagnostic systems in which terms are entered in a free text manner. Applying these new methods will help the medical community accept medical diagnostic systems better.

背景:近年来,研究人员使用了许多计算机化的干预措施来减少医疗事故,这是发达国家的第三大死亡原因。其中一种干预措施是在初级保健中使用鉴别诊断发生器,在初级保健中,医生可能在没有任何诊断前提的情况下遇到初始症状。这些系统产生多种诊断,并根据其可能性进行排序。因此,这些报告的准确性可以通过正确诊断在列表中的位置来确定。目的:本研究旨在设计和评估一种新颖实用的基于网络的初级保健鉴别诊断发生器解决方案。方法:本研究设计了一种新的在线临床决策支持系统DxGenerator,以提高诊断准确性;为此,利用MetaMap工具和自然语言处理技术,尝试将语义数据库与统一医学语言系统(UMLS)知识库进行融合。因此,120种引起腹痛的胃肠道器官疾病被建模到数据库中。在设计了推理引擎和伪自由文本交互界面后,将172个病人的小片段输入到DxGenerator和ISABEL中,这是最准确的类似系统。使用Wilcoxon符号排序检验比较DxGenerator和ISABEL中正确诊断的位置。α水平定义为0.05。结果:在172个样本中,正确诊断位置的平均值和标准差由ISABEL的4.2±5.3提高到DxGenerator的3.2±3.9。结论:使用UMLS知识库和MetaMap工具可以提高以自由文本方式输入术语的诊断系统的准确性。应用这些新方法将有助于医学界更好地接受医疗诊断系统。
{"title":"DxGenerator: An Improved Differential Diagnosis Generator for Primary Care Based on MetaMap and Semantic Reasoning.","authors":"Ali Sanaeifar,&nbsp;Saeid Eslami,&nbsp;Mitra Ahadi,&nbsp;Mohsen Kahani,&nbsp;Hassan Vakili Arki","doi":"10.1055/a-1905-5639","DOIUrl":"https://doi.org/10.1055/a-1905-5639","url":null,"abstract":"<p><strong>Background: </strong>In recent years, researchers have used many computerized interventions to reduce medical errors, the third cause of death in developed countries. One of such interventions is using differential diagnosis generators in primary care, where physicians may encounter initial symptoms without any diagnostic presuppositions. These systems generate multiple diagnoses, ranked by their likelihood. As such, these reports' accuracy can be determined by the location of the correct diagnosis in the list.</p><p><strong>Objective: </strong>This study aimed to design and evaluate a novel practical web-based differential diagnosis generator solution in primary care.</p><p><strong>Methods: </strong>In this research, a new online clinical decision support system, called DxGenerator, was designed to improve diagnostic accuracy; to this end, an attempt was made to converge a semantic database with the unified medical language system (UMLS) knowledge base, using MetaMap tool and natural language processing. In this regard, 120 diseases of gastrointestinal organs causing abdominal pain were modeled into the database. After designing an inference engine and a pseudo-free-text interactive interface, 172 patient vignettes were inputted into DxGenerator and ISABEL, the most accurate similar system. The Wilcoxon signed ranked test was used to compare the position of correct diagnoses in DxGenerator and ISABEL. The α level was defined as 0.05.</p><p><strong>Results: </strong>On a total of 172 vignettes, the mean and standard deviation of correct diagnosis positions improved from 4.2 ± 5.3 in ISABEL to 3.2 ± 3.9 in DxGenerator. This improvement was significant in the subgroup of uncommon diseases (<i>p</i>-value < 0.05).</p><p><strong>Conclusion: </strong>Using UMLS knowledge base and MetaMap Tools can improve the accuracy of diagnostic systems in which terms are entered in a free text manner. Applying these new methods will help the medical community accept medical diagnostic systems better.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 5-06","pages":"174-184"},"PeriodicalIF":1.7,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9253490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Cognitive Health Assessment Using Partially Complete Time Series Sensor Data. 利用部分完整的时间序列传感器数据自动进行认知健康评估。
IF 1.3 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-09-01 Epub Date: 2022-10-11 DOI: 10.1055/s-0042-1756649
Brian L Thomas, Lawrence B Holder, Diane J Cook

Background: Behavior and health are inextricably linked. As a result, continuous wearable sensor data offer the potential to predict clinical measures. However, interruptions in the data collection occur, which create a need for strategic data imputation.

Objective: The objective of this work is to adapt a data generation algorithm to impute multivariate time series data. This will allow us to create digital behavior markers that can predict clinical health measures.

Methods: We created a bidirectional time series generative adversarial network to impute missing sensor readings. Values are imputed based on relationships between multiple fields and multiple points in time, for single time points or larger time gaps. From the complete data, digital behavior markers are extracted and are mapped to predicted clinical measures.

Results: We validate our approach using continuous smartwatch data for n = 14 participants. When reconstructing omitted data, we observe an average normalized mean absolute error of 0.0197. We then create machine learning models to predict clinical measures from the reconstructed, complete data with correlations ranging from r = 0.1230 to r = 0.7623. This work indicates that wearable sensor data collected in the wild can be used to offer insights on a person's health in natural settings.

背景:行为与健康密不可分:行为与健康密不可分。因此,连续的可穿戴传感器数据具有预测临床指标的潜力。然而,数据收集过程中会出现中断,这就需要对数据进行战略性估算:这项工作的目的是调整数据生成算法,对多元时间序列数据进行估算。这将使我们能够创建可预测临床健康指标的数字行为标记:方法:我们创建了一个双向时间序列生成对抗网络,以弥补缺失的传感器读数。对于单个时间点或较大的时间间隙,我们会根据多个字段和多个时间点之间的关系来估算数值。从完整的数据中提取数字行为标记,并映射到预测的临床指标:我们使用 14 名参与者的连续智能手表数据验证了我们的方法。在重建遗漏数据时,我们发现平均归一化平均绝对误差为 0.0197。然后,我们创建了机器学习模型,从重建的完整数据中预测临床指标,相关性从 r = 0.1230 到 r = 0.7623 不等。这项工作表明,在野外收集的可穿戴传感器数据可用于洞察自然环境中人的健康状况。
{"title":"Automated Cognitive Health Assessment Using Partially Complete Time Series Sensor Data.","authors":"Brian L Thomas, Lawrence B Holder, Diane J Cook","doi":"10.1055/s-0042-1756649","DOIUrl":"10.1055/s-0042-1756649","url":null,"abstract":"<p><strong>Background: </strong>Behavior and health are inextricably linked. As a result, continuous wearable sensor data offer the potential to predict clinical measures. However, interruptions in the data collection occur, which create a need for strategic data imputation.</p><p><strong>Objective: </strong>The objective of this work is to adapt a data generation algorithm to impute multivariate time series data. This will allow us to create digital behavior markers that can predict clinical health measures.</p><p><strong>Methods: </strong>We created a bidirectional time series generative adversarial network to impute missing sensor readings. Values are imputed based on relationships between multiple fields and multiple points in time, for single time points or larger time gaps. From the complete data, digital behavior markers are extracted and are mapped to predicted clinical measures.</p><p><strong>Results: </strong>We validate our approach using continuous smartwatch data for <i>n</i> = 14 participants. When reconstructing omitted data, we observe an average normalized mean absolute error of 0.0197. We then create machine learning models to predict clinical measures from the reconstructed, complete data with correlations ranging from <i>r</i> = 0.1230 to <i>r</i> = 0.7623. This work indicates that wearable sensor data collected in the wild can be used to offer insights on a person's health in natural settings.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 3-04","pages":"99-110"},"PeriodicalIF":1.3,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9847015/pdf/nihms-1862055.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10616391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reliability of cusp angulation using three-dimensional (3D) digital models:A Preliminary In Vitro Study. 使用三维(3D)数字模型进行牙尖顶成角的可靠性:体外初步研究。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-06-03 DOI: 10.1055/a-1868-6555
Xinggang Liu, Xiao-xian Chen
Background At present, artificial intelligence (AI) is incrementally used in clinical data analysis and clinical decision-making. Dental cusp angulation provide valuable insight into chewing efficiency and prosthesis safety issues. AI-enable computing cusp angles have potential important value but there is no reliable digital measurement method at present. Objectives To establish a digital method for measuring cusp angles and investigate the inter-rater and intra-rater reliability. Methods Two cusp angles (angle α and angle β) of the first molar were measured on 21 plaster casts using a goniometer, and on their corresponding digital models using PicPick software after scanning with a CEREC Bluecam three-dimensional (3D) intraoral scanner. Means±standard deviations as well as intraclass correlation coefficients (ICCs) and Pearson's correlation coefficients (PCCs) were calculated and paired sample t-test was carried out. Results Angle α was 139.19°±13.86°, angle β was 19.25°±6.86°. A very strong positive correlation between the two methods was found when the examiner was experienced (r>0.914; p<0.05), and no significant difference between the two methods was found using the paired sample t-test (p>0.20). For inter-rater and intra-rater assessments, the PCC and ICC of the cusp angulation using the digital method showed that 15 of 16 values were higher than the corresponding values measured on traditional plaster casts. However, both measurement methods showed weak positive correlation (r<0.501) and significant differences (p=0.00) for repeated measurements of angle β at two different time points by an inexperienced examiner. Conclusionss Cusp angulation using 3D digital models was a clinical option and appeared to improve the reliability of cusp angulation compared with measuring plaster casts using a goniometer. Intra-rater variability was still evident in measuring small cusp angles using the digital model.
背景目前,人工智能正在逐步应用于临床数据分析和临床决策。牙尖顶角度为咀嚼效率和假体安全问题提供了有价值的见解。人工智能计算牙尖角具有潜在的重要价值,但目前还没有可靠的数字测量方法。目的建立一种测量牙尖顶角的数字化方法,并研究评分者之间和评分者内部的可靠性。方法用角度计在21个石膏模型上测量第一磨牙的两个牙尖顶角(α角和β角),并用CEREC Bluecam三维口腔扫描仪扫描后,用PicPick软件在相应的数字模型上测量。计算平均值±标准差以及组内相关系数(ICCs)和皮尔逊相关系数(PCCs),并进行配对样本t检验。结果角度α为139.19°±13.86°,角度β为19.25°±6.86°。当检查者有经验时,两种方法之间存在非常强的正相关性(r>0.914;p0.20),使用数字方法测量的牙尖角度的PCC和ICC显示,16个值中有15个高于在传统石膏模上测量的相应值。然而,两种测量方法在两个不同时间点由缺乏经验的检查员重复测量角度β时,显示出弱正相关(r<0.501)和显著差异(p=0.00)。结论使用3D数字模型进行ss尖瓣成角是一种临床选择,与使用角度计测量石膏模型相比,似乎可以提高尖瓣成角度的可靠性。在使用数字模型测量小尖角时,评分者内部的变异性仍然很明显。
{"title":"Reliability of cusp angulation using three-dimensional (3D) digital models:A Preliminary In Vitro Study.","authors":"Xinggang Liu, Xiao-xian Chen","doi":"10.1055/a-1868-6555","DOIUrl":"https://doi.org/10.1055/a-1868-6555","url":null,"abstract":"Background At present, artificial intelligence (AI) is incrementally used in clinical data analysis and clinical decision-making. Dental cusp angulation provide valuable insight into chewing efficiency and prosthesis safety issues. AI-enable computing cusp angles have potential important value but there is no reliable digital measurement method at present. Objectives To establish a digital method for measuring cusp angles and investigate the inter-rater and intra-rater reliability. Methods Two cusp angles (angle α and angle β) of the first molar were measured on 21 plaster casts using a goniometer, and on their corresponding digital models using PicPick software after scanning with a CEREC Bluecam three-dimensional (3D) intraoral scanner. Means±standard deviations as well as intraclass correlation coefficients (ICCs) and Pearson's correlation coefficients (PCCs) were calculated and paired sample t-test was carried out. Results Angle α was 139.19°±13.86°, angle β was 19.25°±6.86°. A very strong positive correlation between the two methods was found when the examiner was experienced (r>0.914; p<0.05), and no significant difference between the two methods was found using the paired sample t-test (p>0.20). For inter-rater and intra-rater assessments, the PCC and ICC of the cusp angulation using the digital method showed that 15 of 16 values were higher than the corresponding values measured on traditional plaster casts. However, both measurement methods showed weak positive correlation (r<0.501) and significant differences (p=0.00) for repeated measurements of angle β at two different time points by an inexperienced examiner. Conclusionss Cusp angulation using 3D digital models was a clinical option and appeared to improve the reliability of cusp angulation compared with measuring plaster casts using a goniometer. Intra-rater variability was still evident in measuring small cusp angles using the digital model.","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43806671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Privacy-Preserving Artificial Intelligence Techniques in Biomedicine. 生物医学中的隐私保护人工智能技术。
IF 1.3 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-06-01 Epub Date: 2022-01-21 DOI: 10.1055/s-0041-1740630
Reihaneh Torkzadehmahani, Reza Nasirigerdeh, David B Blumenthal, Tim Kacprowski, Markus List, Julian Matschinske, Julian Spaeth, Nina Kerstin Wenke, Jan Baumbach

Background: Artificial intelligence (AI) has been successfully applied in numerous scientific domains. In biomedicine, AI has already shown tremendous potential, e.g., in the interpretation of next-generation sequencing data and in the design of clinical decision support systems.

Objectives: However, training an AI model on sensitive data raises concerns about the privacy of individual participants. For example, summary statistics of a genome-wide association study can be used to determine the presence or absence of an individual in a given dataset. This considerable privacy risk has led to restrictions in accessing genomic and other biomedical data, which is detrimental for collaborative research and impedes scientific progress. Hence, there has been a substantial effort to develop AI methods that can learn from sensitive data while protecting individuals' privacy.

Method: This paper provides a structured overview of recent advances in privacy-preserving AI techniques in biomedicine. It places the most important state-of-the-art approaches within a unified taxonomy and discusses their strengths, limitations, and open problems.

Conclusion: As the most promising direction, we suggest combining federated machine learning as a more scalable approach with other additional privacy-preserving techniques. This would allow to merge the advantages to provide privacy guarantees in a distributed way for biomedical applications. Nonetheless, more research is necessary as hybrid approaches pose new challenges such as additional network or computation overhead.

背景:人工智能(AI)已成功应用于众多科学领域。在生物医学领域,人工智能已经显示出巨大的潜力,例如在解读下一代测序数据和设计临床决策支持系统方面:然而,在敏感数据上训练人工智能模型会引发对参与者个人隐私的担忧。例如,全基因组关联研究的汇总统计数据可用于确定特定数据集中是否存在某个个体。这种巨大的隐私风险导致了对基因组和其他生物医学数据访问的限制,不利于合作研究,阻碍了科学进步。因此,人们一直在努力开发既能从敏感数据中学习,又能保护个人隐私的人工智能方法:本文对生物医学中保护隐私的人工智能技术的最新进展进行了结构化概述。本文将最重要的最新方法归入一个统一的分类法,并讨论了这些方法的优势、局限性和有待解决的问题:作为最有前途的方向,我们建议将联合机器学习作为一种更具可扩展性的方法与其他额外的隐私保护技术相结合。这样就能将各种优势结合起来,以分布式方式为生物医学应用提供隐私保障。不过,由于混合方法会带来新的挑战,如额外的网络或计算开销,因此有必要开展更多研究。
{"title":"Privacy-Preserving Artificial Intelligence Techniques in Biomedicine.","authors":"Reihaneh Torkzadehmahani, Reza Nasirigerdeh, David B Blumenthal, Tim Kacprowski, Markus List, Julian Matschinske, Julian Spaeth, Nina Kerstin Wenke, Jan Baumbach","doi":"10.1055/s-0041-1740630","DOIUrl":"10.1055/s-0041-1740630","url":null,"abstract":"<p><strong>Background: </strong>Artificial intelligence (AI) has been successfully applied in numerous scientific domains. In biomedicine, AI has already shown tremendous potential, e.g., in the interpretation of next-generation sequencing data and in the design of clinical decision support systems.</p><p><strong>Objectives: </strong>However, training an AI model on sensitive data raises concerns about the privacy of individual participants. For example, summary statistics of a genome-wide association study can be used to determine the presence or absence of an individual in a given dataset. This considerable privacy risk has led to restrictions in accessing genomic and other biomedical data, which is detrimental for collaborative research and impedes scientific progress. Hence, there has been a substantial effort to develop AI methods that can learn from sensitive data while protecting individuals' privacy.</p><p><strong>Method: </strong>This paper provides a structured overview of recent advances in privacy-preserving AI techniques in biomedicine. It places the most important state-of-the-art approaches within a unified taxonomy and discusses their strengths, limitations, and open problems.</p><p><strong>Conclusion: </strong>As the most promising direction, we suggest combining federated machine learning as a more scalable approach with other additional privacy-preserving techniques. This would allow to merge the advantages to provide privacy guarantees in a distributed way for biomedical applications. Nonetheless, more research is necessary as hybrid approaches pose new challenges such as additional network or computation overhead.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 01","pages":"e12-e27"},"PeriodicalIF":1.3,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/dd/7f/10-1055-s-0041-1740630.PMC9246509.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9246732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Privacy-Preserving Distributed Analytics Platform for Health Care Data. 一种保护隐私的医疗数据分布式分析平台。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-06-01 DOI: 10.1055/s-0041-1740564
Sascha Welten, Yongli Mou, Laurenz Neumann, Mehrshad Jaberansary, Yeliz Yediel Ucer, Toralf Kirsten, Stefan Decker, Oya Beyan

Background: In recent years, data-driven medicine has gained increasing importance in terms of diagnosis, treatment, and research due to the exponential growth of health care data. However, data protection regulations prohibit data centralisation for analysis purposes because of potential privacy risks like the accidental disclosure of data to third parties. Therefore, alternative data usage policies, which comply with present privacy guidelines, are of particular interest.

Objective: We aim to enable analyses on sensitive patient data by simultaneously complying with local data protection regulations using an approach called the Personal Health Train (PHT), which is a paradigm that utilises distributed analytics (DA) methods. The main principle of the PHT is that the analytical task is brought to the data provider and the data instances remain in their original location.

Methods: In this work, we present our implementation of the PHT paradigm, which preserves the sovereignty and autonomy of the data providers and operates with a limited number of communication channels. We further conduct a DA use case on data stored in three different and distributed data providers.

Results: We show that our infrastructure enables the training of data models based on distributed data sources.

Conclusion: Our work presents the capabilities of DA infrastructures in the health care sector, which lower the regulatory obstacles of sharing patient data. We further demonstrate its ability to fuel medical science by making distributed data sets available for scientists or health care practitioners.

背景:近年来,由于医疗保健数据呈指数级增长,数据驱动医学在诊断、治疗和研究方面的重要性日益增加。然而,数据保护法规禁止出于分析目的而将数据集中,因为存在潜在的隐私风险,例如意外向第三方披露数据。因此,符合现行隐私准则的替代数据使用政策是特别值得关注的。目标:我们的目标是通过使用一种称为个人健康培训(PHT)的方法同时遵守当地数据保护法规,从而实现对敏感患者数据的分析,这是一种利用分布式分析(DA)方法的范例。PHT的主要原则是将分析任务交给数据提供程序,而数据实例保持在原始位置。方法:在这项工作中,我们提出了PHT范式的实现,它保留了数据提供者的主权和自主权,并在有限数量的通信渠道下运行。我们进一步对存储在三个不同的分布式数据提供程序中的数据执行数据处理用例。结果:我们展示了我们的基础设施能够训练基于分布式数据源的数据模型。结论:我们的工作展示了数据处理基础设施在医疗保健部门的能力,它降低了共享患者数据的监管障碍。通过为科学家或卫生保健从业者提供分布式数据集,我们进一步证明了它推动医学科学的能力。
{"title":"A Privacy-Preserving Distributed Analytics Platform for Health Care Data.","authors":"Sascha Welten,&nbsp;Yongli Mou,&nbsp;Laurenz Neumann,&nbsp;Mehrshad Jaberansary,&nbsp;Yeliz Yediel Ucer,&nbsp;Toralf Kirsten,&nbsp;Stefan Decker,&nbsp;Oya Beyan","doi":"10.1055/s-0041-1740564","DOIUrl":"https://doi.org/10.1055/s-0041-1740564","url":null,"abstract":"<p><strong>Background: </strong>In recent years, data-driven medicine has gained increasing importance in terms of diagnosis, treatment, and research due to the exponential growth of health care data. However, data protection regulations prohibit data centralisation for analysis purposes because of potential privacy risks like the accidental disclosure of data to third parties. Therefore, alternative data usage policies, which comply with present privacy guidelines, are of particular interest.</p><p><strong>Objective: </strong>We aim to enable analyses on sensitive patient data by simultaneously complying with local data protection regulations using an approach called the Personal Health Train (PHT), which is a paradigm that utilises distributed analytics (DA) methods. The main principle of the PHT is that the analytical task is brought to the data provider and the data instances remain in their original location.</p><p><strong>Methods: </strong>In this work, we present our implementation of the PHT paradigm, which preserves the sovereignty and autonomy of the data providers and operates with a limited number of communication channels. We further conduct a DA use case on data stored in three different and distributed data providers.</p><p><strong>Results: </strong>We show that our infrastructure enables the training of data models based on distributed data sources.</p><p><strong>Conclusion: </strong>Our work presents the capabilities of DA infrastructures in the health care sector, which lower the regulatory obstacles of sharing patient data. We further demonstrate its ability to fuel medical science by making distributed data sets available for scientists or health care practitioners.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 01","pages":"e1-e11"},"PeriodicalIF":1.7,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/66/85/10-1055-s-0041-1740564.PMC9246511.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9253597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Security and Privacy in Distributed Health Care Environments 分布式医疗保健环境中的安全和隐私
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-05-01 DOI: 10.1055/s-0042-1744484
Stephen Flowerday, C. Xenakis
There is an increasing demand for distributed health care systems. Nevertheless, distributed health care environments do not come without risks. At the same time that distributed health care systems are growing, so are the cybersecurity threats targeting them. Additionally, the demand for compliance to new regulations increases as these distributed health caresystemshold sensitivepatientdata. Theuseofdata-driven technologies presents a promising opportunity for significant advances in the field toward improved health care for patients and the general public.1,2 Several recent studies have highlighted the importance and the necessity of developing a data-driven approach where patient data are collected, analyzed, and leveraged for medical research purposes with the help of different types of artificial intelligence. To address the privacy-related challenges, novel methods, such as protection of personal health information, ensuring compliance, guaranteeing FAIR information processing, and building of trust, are required. In this issue, newparadigmsandprominent applications are presented for secure, trustworthy, and privacy-preserving data sharing and knowledge representation to address the emerging needs.
对分布式医疗保健系统的需求越来越大。然而,分布式医疗保健环境并非没有风险。在分布式医疗保健系统不断发展的同时,针对它们的网络安全威胁也在增长。此外,随着这些分布式医疗系统拥有敏感的患者数据,对遵守新法规的需求也在增加。数据驱动技术的使用为在改善患者和公众的医疗保健领域取得重大进展提供了一个有希望的机会。1,2最近的几项研究强调了开发数据驱动方法的重要性和必要性,并在不同类型的人工智能的帮助下用于医学研究目的。为了解决与隐私相关的挑战,需要新的方法,如保护个人健康信息、确保合规性、保证FAIR信息处理和建立信任。在本期中,提出了安全、可信和保护隐私的数据共享和知识表示的新范式和主要应用程序,以满足新出现的需求。
{"title":"Security and Privacy in Distributed Health Care Environments","authors":"Stephen Flowerday, C. Xenakis","doi":"10.1055/s-0042-1744484","DOIUrl":"https://doi.org/10.1055/s-0042-1744484","url":null,"abstract":"There is an increasing demand for distributed health care systems. Nevertheless, distributed health care environments do not come without risks. At the same time that distributed health care systems are growing, so are the cybersecurity threats targeting them. Additionally, the demand for compliance to new regulations increases as these distributed health caresystemshold sensitivepatientdata. Theuseofdata-driven technologies presents a promising opportunity for significant advances in the field toward improved health care for patients and the general public.1,2 Several recent studies have highlighted the importance and the necessity of developing a data-driven approach where patient data are collected, analyzed, and leveraged for medical research purposes with the help of different types of artificial intelligence. To address the privacy-related challenges, novel methods, such as protection of personal health information, ensuring compliance, guaranteeing FAIR information processing, and building of trust, are required. In this issue, newparadigmsandprominent applications are presented for secure, trustworthy, and privacy-preserving data sharing and knowledge representation to address the emerging needs.","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 1","pages":"1 - 2"},"PeriodicalIF":1.7,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42386958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparison of Methods to Detect Changes in Prediction Models. 预测模型变化检测方法的比较。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-05-01 DOI: 10.1055/s-0042-1742672
Erin M Schnellinger, Wei Yang, Michael O Harhay, Stephen E Kimmel

Background: Prediction models inform decisions in many areas of medicine. Most models are fitted once and then applied to new (future) patients, despite the fact that model coefficients can vary over time due to changes in patients' clinical characteristics and disease risk. However, the optimal method to detect changes in model parameters has not been rigorously assessed.

Methods: We simulated data, informed by post-lung transplant mortality data and tested the following two approaches for detecting model change: (1) the "Direct Approach," it compares coefficients of the model refit on recent data to those at baseline; and (2) "Calibration Regression," it fits a logistic regression model of the log-odds of the observed outcomes versus the linear predictor from the baseline model (i.e., the log-odds of the predicted probabilities obtained from the baseline model) and tests whether the intercept and slope differ from 0 and 1, respectively. Four scenarios were simulated using logistic regression for binary outcomes as follows: (1) we fixed all model parameters, (2) we varied the outcome prevalence between 0.1 and 0.2, (3) we varied the coefficient of one of the ten predictors between 0.2 and 0.4, and (4) we varied the outcome prevalence and coefficient of one predictor simultaneously.

Results: Calibration regression tended to detect changes sooner than the Direct Approach, with better performance (e.g., larger proportion of true claims). When the sample size was large, both methods performed well. When two parameters changed simultaneously, neither method performed well.

Conclusion: Neither change detection method examined here proved optimal under all circumstances. However, our results suggest that if one is interested in detecting a change in overall incidence of an outcome (e.g., intercept), the Calibration Regression method may be superior to the Direct Approach. Conversely, if one is interested in detecting a change in other model covariates (e.g., slope), the Direct Approach may be superior.

背景:预测模型为许多医学领域的决策提供信息。大多数模型只拟合一次,然后应用于新的(未来的)患者,尽管由于患者临床特征和疾病风险的变化,模型系数可能随着时间的推移而变化。然而,检测模型参数变化的最佳方法尚未得到严格的评估。方法:我们模拟数据,根据肺移植后死亡率数据,并测试了以下两种检测模型变化的方法:(1)“直接方法”,它将最近数据的模型改装系数与基线数据进行比较;和(2)“校准回归”,它拟合观察结果的对数赔率与基线模型的线性预测器的对数赔率的逻辑回归模型(即,从基线模型获得的预测概率的对数赔率),并测试截距和斜率是否分别不同于0和1。采用logistic回归方法对四种情况进行了模拟:(1)固定所有模型参数,(2)将结果患病率在0.1和0.2之间变化,(3)将十个预测因子中的一个的系数在0.2和0.4之间变化,(4)同时改变一个预测因子的结果患病率和系数。结果:校准回归倾向于比直接法更快地检测到变化,具有更好的性能(例如,真实声明的比例更大)。当样本量较大时,两种方法均表现良好。当两个参数同时变化时,两种方法的效果都不好。结论:本文研究的两种变化检测方法在所有情况下都是最优的。然而,我们的结果表明,如果有人对检测结果的总体发生率(例如,截距)的变化感兴趣,则校准回归方法可能优于直接方法。相反,如果对检测其他模型协变量(例如斜率)的变化感兴趣,则直接方法可能更优越。
{"title":"A Comparison of Methods to Detect Changes in Prediction Models.","authors":"Erin M Schnellinger,&nbsp;Wei Yang,&nbsp;Michael O Harhay,&nbsp;Stephen E Kimmel","doi":"10.1055/s-0042-1742672","DOIUrl":"https://doi.org/10.1055/s-0042-1742672","url":null,"abstract":"<p><strong>Background: </strong>Prediction models inform decisions in many areas of medicine. Most models are fitted once and then applied to new (future) patients, despite the fact that model coefficients can vary over time due to changes in patients' clinical characteristics and disease risk. However, the optimal method to detect changes in model parameters has not been rigorously assessed.</p><p><strong>Methods: </strong>We simulated data, informed by post-lung transplant mortality data and tested the following two approaches for detecting model change: (1) the \"Direct Approach,\" it compares coefficients of the model refit on recent data to those at baseline; and (2) \"Calibration Regression,\" it fits a logistic regression model of the log-odds of the observed outcomes versus the linear predictor from the baseline model (i.e., the log-odds of the predicted probabilities obtained from the baseline model) and tests whether the intercept and slope differ from 0 and 1, respectively. Four scenarios were simulated using logistic regression for binary outcomes as follows: (1) we fixed all model parameters, (2) we varied the outcome prevalence between 0.1 and 0.2, (3) we varied the coefficient of one of the ten predictors between 0.2 and 0.4, and (4) we varied the outcome prevalence and coefficient of one predictor simultaneously.</p><p><strong>Results: </strong>Calibration regression tended to detect changes sooner than the Direct Approach, with better performance (e.g., larger proportion of true claims). When the sample size was large, both methods performed well. When two parameters changed simultaneously, neither method performed well.</p><p><strong>Conclusion: </strong>Neither change detection method examined here proved optimal under all circumstances. However, our results suggest that if one is interested in detecting a change in overall incidence of an outcome (e.g., intercept), the Calibration Regression method may be superior to the Direct Approach. Conversely, if one is interested in detecting a change in other model covariates (e.g., slope), the Direct Approach may be superior.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 1-02","pages":"19-28"},"PeriodicalIF":1.7,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10413959/pdf/nihms-1887521.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9976306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Methods of Information in Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1