Data Exploration in Secondary Use of Healthcare Data

Jian Wang
{"title":"Data Exploration in Secondary Use of Healthcare Data","authors":"Jian Wang","doi":"10.1109/BIBM.2011.129","DOIUrl":null,"url":null,"abstract":"Real world data sets (as opposed to data from randomized, controlled clinical trials) are becoming increasing available from the healthcare industry. Large databases from EMRs/EHRs, insurance claims, pharmacy records, disease registries etc present unique challenges when they are utilized to support pharmaceutical R&D activities. Such \"secondary use\" of healthcare data usually starts with an exploratory phase when the researcher takes a high-level view of the available data and starts to \"connect the dots\". Data exploration is a highly dynamic process: exploratory paths change frequently, sometimes converging, other times diverging, and often resulting in dead ends. Only a small subset of exploratory results end up being formally analyzed to derive quantitative insights. Because of this dynamic nature of data exploration, it is critical that researchers who generate hypotheses, the domain experts, can directly explore in the available data space. Data exploration on large healthcare data sets is often a bottleneck because these data sets tend to be poorly understood in terms of their quality, completeness, consistency, etc. We will discuss this emerging landscape, focusing on case studies to illustrate the powerful convergence of real-world data and technological advancements to help leverage this data.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"43 1","pages":"658-658"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2011.129","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Real world data sets (as opposed to data from randomized, controlled clinical trials) are becoming increasing available from the healthcare industry. Large databases from EMRs/EHRs, insurance claims, pharmacy records, disease registries etc present unique challenges when they are utilized to support pharmaceutical R&D activities. Such "secondary use" of healthcare data usually starts with an exploratory phase when the researcher takes a high-level view of the available data and starts to "connect the dots". Data exploration is a highly dynamic process: exploratory paths change frequently, sometimes converging, other times diverging, and often resulting in dead ends. Only a small subset of exploratory results end up being formally analyzed to derive quantitative insights. Because of this dynamic nature of data exploration, it is critical that researchers who generate hypotheses, the domain experts, can directly explore in the available data space. Data exploration on large healthcare data sets is often a bottleneck because these data sets tend to be poorly understood in terms of their quality, completeness, consistency, etc. We will discuss this emerging landscape, focusing on case studies to illustrate the powerful convergence of real-world data and technological advancements to help leverage this data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
医疗数据二次使用中的数据探索
现实世界的数据集(相对于随机对照临床试验的数据)越来越多地来自医疗保健行业。来自电子病历/电子病历、保险索赔、药房记录、疾病登记等的大型数据库在用于支持药物研发活动时面临着独特的挑战。医疗保健数据的这种“二次使用”通常始于探索阶段,即研究人员对可用数据进行高级视图并开始“连接点”。数据探索是一个高度动态的过程:探索路径经常变化,有时收敛,有时发散,并且经常导致死胡同。只有一小部分探索性结果最终被正式分析,以获得定量的见解。由于数据探索的这种动态性质,产生假设的研究人员,即领域专家,可以直接在可用的数据空间中进行探索,这一点至关重要。对大型医疗保健数据集的数据探索通常是一个瓶颈,因为这些数据集在质量、完整性、一致性等方面往往难以理解。我们将讨论这一新兴领域,重点关注案例研究,以说明现实世界数据和技术进步的强大融合,以帮助利用这些数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Evolution of protein architectures inferred from phylogenomic analysis of CATH Hierarchical modeling of alternative exon usage associations with survival 3D point cloud sensors for low-cost medical in-situ visualization Bayesian Classifiers for Chemical Toxicity Prediction Normal mode analysis of protein structure dynamics based on residue contact energy
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1