人类免疫缺陷病毒(HIV)队列和电子健康记录数据中多变量误差:统计挑战和机遇。

Statistical communications in infectious diseases Pub Date : 2020-10-07 eCollection Date: 2020-09-01 DOI:10.1515/scid-2019-0015
Bryan E Shepherd, Pamela A Shaw
{"title":"人类免疫缺陷病毒(HIV)队列和电子健康记录数据中多变量误差:统计挑战和机遇。","authors":"Bryan E Shepherd, Pamela A Shaw","doi":"10.1515/scid-2019-0015","DOIUrl":null,"url":null,"abstract":"<p><p><b>Objectives:</b> Observational data derived from patient electronic health records (EHR) data are increasingly used for human immunodeficiency virus/acquired immunodeficiency syndrome (HIV/AIDS) research. There are challenges to using these data, in particular with regards to data quality; some are recognized, some unrecognized, and some recognized but ignored. There are great opportunities for the statistical community to improve inference by incorporating validation subsampling into analyses of EHR data.<b>Methods:</b> Methods to address measurement error, misclassification, and missing data are relevant, as are sampling designs such as two-phase sampling. However, many of the existing statistical methods for measurement error, for example, only address relatively simple settings, whereas the errors seen in these datasets span multiple variables (both predictors and outcomes), are correlated, and even affect who is included in the study.<b>Results/Conclusion:</b> We will discuss some preliminary methods in this area with a particular focus on time-to-event outcomes and outline areas of future research.</p>","PeriodicalId":74867,"journal":{"name":"Statistical communications in infectious diseases","volume":"12 Suppl1","pages":"20190015"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9204761/pdf/scid-12-101-scid-2019-0015.pdf","citationCount":"0","resultStr":"{\"title\":\"Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities.\",\"authors\":\"Bryan E Shepherd, Pamela A Shaw\",\"doi\":\"10.1515/scid-2019-0015\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><b>Objectives:</b> Observational data derived from patient electronic health records (EHR) data are increasingly used for human immunodeficiency virus/acquired immunodeficiency syndrome (HIV/AIDS) research. There are challenges to using these data, in particular with regards to data quality; some are recognized, some unrecognized, and some recognized but ignored. There are great opportunities for the statistical community to improve inference by incorporating validation subsampling into analyses of EHR data.<b>Methods:</b> Methods to address measurement error, misclassification, and missing data are relevant, as are sampling designs such as two-phase sampling. However, many of the existing statistical methods for measurement error, for example, only address relatively simple settings, whereas the errors seen in these datasets span multiple variables (both predictors and outcomes), are correlated, and even affect who is included in the study.<b>Results/Conclusion:</b> We will discuss some preliminary methods in this area with a particular focus on time-to-event outcomes and outline areas of future research.</p>\",\"PeriodicalId\":74867,\"journal\":{\"name\":\"Statistical communications in infectious diseases\",\"volume\":\"12 Suppl1\",\"pages\":\"20190015\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9204761/pdf/scid-12-101-scid-2019-0015.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistical communications in infectious diseases\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1515/scid-2019-0015\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2020/9/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical communications in infectious diseases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/scid-2019-0015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/9/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目的:来自患者电子健康记录(EHR)数据的观察性数据越来越多地用于人类免疫缺陷病毒/获得性免疫缺陷综合症(艾滋病毒/艾滋病)的研究。使用这些数据存在挑战,特别是在数据质量方面;有些被认可,有些未被认可,还有一些被认可但被忽视。统计界有很大的机会通过将验证子抽样纳入EHR数据分析来改进推理。方法:解决测量误差、错误分类和缺失数据的方法是相关的,抽样设计如两阶段抽样也是相关的。然而,许多现有的测量误差的统计方法,例如,只处理相对简单的设置,而在这些数据集中看到的误差跨越多个变量(预测因子和结果),是相关的,甚至影响谁被纳入研究。结果/结论:我们将讨论这一领域的一些初步方法,特别关注事件时间结果,并概述未来研究的领域。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities.

Objectives: Observational data derived from patient electronic health records (EHR) data are increasingly used for human immunodeficiency virus/acquired immunodeficiency syndrome (HIV/AIDS) research. There are challenges to using these data, in particular with regards to data quality; some are recognized, some unrecognized, and some recognized but ignored. There are great opportunities for the statistical community to improve inference by incorporating validation subsampling into analyses of EHR data.Methods: Methods to address measurement error, misclassification, and missing data are relevant, as are sampling designs such as two-phase sampling. However, many of the existing statistical methods for measurement error, for example, only address relatively simple settings, whereas the errors seen in these datasets span multiple variables (both predictors and outcomes), are correlated, and even affect who is included in the study.Results/Conclusion: We will discuss some preliminary methods in this area with a particular focus on time-to-event outcomes and outline areas of future research.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Study design approaches for future active-controlled HIV prevention trials. The role of randomization inference in unraveling individual treatment effects in early phase vaccine trials. Nonlinear mixed-effects models for HIV viral load trajectories before and after antiretroviral therapy interruption, incorporating left censoring. Estimation and interpretation of vaccine efficacy in COVID-19 randomized clinical trials Sample size calculation for active-arm trial with counterfactual incidence based on recency assay.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1