A Framework to Assess Healthcare Data Quality

W. Warwick, Sophie M. Johnson, Judy Bond, G. Fletcher, P. Kanellakis
{"title":"A Framework to Assess Healthcare Data Quality","authors":"W. Warwick, Sophie M. Johnson, Judy Bond, G. Fletcher, P. Kanellakis","doi":"10.15405/EJSBS.156","DOIUrl":null,"url":null,"abstract":"1. IntroductionData quality assessment is a fundamental task when undertaking research. A wealth of healthcare data provided from the National Health Service is available and can be easily accessed and utilised for research. Even though health related datasets are obtained from authoritative sources, issues within the quality of data may be apparent. Data quality issues can lead to an array of errors within research findings including incorrect demographical information and exaggeration of disorder prevalence. Moreover, the consequences of decisions made from inaccurate results can be damaging to organisations within the healthcare sector (Goodchild, 1993).It is therefore important to use a framework to assess the quality of data obtained from the data mining process. This will help determine whether it can be used to test hypotheses, and increase confidence of validity.The motivation for this current research was to investigate data quality issues encountered for a research report undertaken by the NHS Coventry and Warwickshire Partnership Trust entitled 'Up Skilling the Adult Mental Health Workforce in Psychological Practice Skills'. As researchers had access to a wealth of data from several sources, it was important to examine the data available to the research and what data quality criteria would be necessary to draw conclusions on its suitability. As many of the available datasets had not been collected with a specific research question, the selection quality and methods were not under control of the research and therefore, were difficult to validate (Sorensen, Sabroe & Olsen, 1996). From this, there was a need to construct a robust framework to assess the quality of data. This led to a review of existing frameworks and the formation of a new framework specific for this research.2. Review of Quality FrameworkThe existing literature instigated that the criteria for a quality framework must be general, applicable across application domains and data types and clearly defined, Price & Shanks, (2004). Eppler (2001), put forward that quality frameworks should show interdependencies between different quality criteria, to allow researchers to become familiar with how data quality issues impact other criteria.The Data Quality Assessment Methods and Tools (DatQAM) provides a systematic implementation of data quality assessment which includes a range of quality measures which considers the strengths of official statistics. It is concerned with user satisfaction concerning relevance, sampling and non-sampling errors, production dates concerning timeliness, availability of metadata and forms for dissemination, changes over time and geographical differences and coherence (Eurostat, 2007).The Quality Assurance Framework (QAF) developed by Statistics Canada (2010) includes a number of quality measures for assessing data quality including measures for timeliness, relevance, interpretability (completeness of metadata), accuracy (coefficient of variance, imputation rates), coherence and accessibility. These two data quality (DQ) frameworks are similar in the way that they consider measures for data quality and for the data quality criteria themselves. They are also widely used, an example of this is that the HSCIC uses the DatQAM for data quality assessments (HSCIC, 2013).In order to build a framework which considers measures for DQ we can consider these two frameworks and how the criteria are measured within them in order to gain a comprehensive framework that can be applied to data that we use within our research. These measures have been adapted from the DatQAM and QAF frameworks in order to quantify our data quality assessments.Furthermore, the World Health Organisation's (WHO) 'quality criteria' was utilised in order to categorise the quality measurements. The Data Quality Audit Tool (DQAT) is utilised by the WHO and Global Fund. After cross referencing it was decided that a 'confidentiality' criteria be added to the framework which was adapted from the DQAT (2008). …","PeriodicalId":164632,"journal":{"name":"European Journal of Social & Behavioural Sciences","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Social & Behavioural Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15405/EJSBS.156","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14

Abstract

1. IntroductionData quality assessment is a fundamental task when undertaking research. A wealth of healthcare data provided from the National Health Service is available and can be easily accessed and utilised for research. Even though health related datasets are obtained from authoritative sources, issues within the quality of data may be apparent. Data quality issues can lead to an array of errors within research findings including incorrect demographical information and exaggeration of disorder prevalence. Moreover, the consequences of decisions made from inaccurate results can be damaging to organisations within the healthcare sector (Goodchild, 1993).It is therefore important to use a framework to assess the quality of data obtained from the data mining process. This will help determine whether it can be used to test hypotheses, and increase confidence of validity.The motivation for this current research was to investigate data quality issues encountered for a research report undertaken by the NHS Coventry and Warwickshire Partnership Trust entitled 'Up Skilling the Adult Mental Health Workforce in Psychological Practice Skills'. As researchers had access to a wealth of data from several sources, it was important to examine the data available to the research and what data quality criteria would be necessary to draw conclusions on its suitability. As many of the available datasets had not been collected with a specific research question, the selection quality and methods were not under control of the research and therefore, were difficult to validate (Sorensen, Sabroe & Olsen, 1996). From this, there was a need to construct a robust framework to assess the quality of data. This led to a review of existing frameworks and the formation of a new framework specific for this research.2. Review of Quality FrameworkThe existing literature instigated that the criteria for a quality framework must be general, applicable across application domains and data types and clearly defined, Price & Shanks, (2004). Eppler (2001), put forward that quality frameworks should show interdependencies between different quality criteria, to allow researchers to become familiar with how data quality issues impact other criteria.The Data Quality Assessment Methods and Tools (DatQAM) provides a systematic implementation of data quality assessment which includes a range of quality measures which considers the strengths of official statistics. It is concerned with user satisfaction concerning relevance, sampling and non-sampling errors, production dates concerning timeliness, availability of metadata and forms for dissemination, changes over time and geographical differences and coherence (Eurostat, 2007).The Quality Assurance Framework (QAF) developed by Statistics Canada (2010) includes a number of quality measures for assessing data quality including measures for timeliness, relevance, interpretability (completeness of metadata), accuracy (coefficient of variance, imputation rates), coherence and accessibility. These two data quality (DQ) frameworks are similar in the way that they consider measures for data quality and for the data quality criteria themselves. They are also widely used, an example of this is that the HSCIC uses the DatQAM for data quality assessments (HSCIC, 2013).In order to build a framework which considers measures for DQ we can consider these two frameworks and how the criteria are measured within them in order to gain a comprehensive framework that can be applied to data that we use within our research. These measures have been adapted from the DatQAM and QAF frameworks in order to quantify our data quality assessments.Furthermore, the World Health Organisation's (WHO) 'quality criteria' was utilised in order to categorise the quality measurements. The Data Quality Audit Tool (DQAT) is utilised by the WHO and Global Fund. After cross referencing it was decided that a 'confidentiality' criteria be added to the framework which was adapted from the DQAT (2008). …
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
评估医疗保健数据质量的框架
1. 数据质量评估是开展研究的一项基本任务。国民保健服务提供了丰富的保健数据,可以很容易地访问和利用这些数据进行研究。尽管与健康有关的数据集是从权威来源获得的,但数据质量方面的问题可能很明显。数据质量问题可能导致研究结果中的一系列错误,包括不正确的人口统计信息和夸大疾病流行率。此外,根据不准确的结果作出决定的后果可能会损害医疗保健部门内的组织(Goodchild, 1993年)。因此,使用一个框架来评估从数据挖掘过程中获得的数据的质量是很重要的。这将有助于确定它是否可以用于检验假设,并增加有效性的信心。目前这项研究的动机是为了调查一项研究报告中遇到的数据质量问题,该报告由NHS考文垂和沃里克郡合作信托基金承担,题为“提高成人心理健康工作人员的心理实践技能”。由于研究人员可以从几个来源获得大量数据,因此必须审查研究可获得的数据,以及需要什么样的数据质量标准才能得出关于其适用性的结论。由于许多可用的数据集没有收集到一个特定的研究问题,选择的质量和方法不受研究的控制,因此,很难验证(Sorensen, Sabroe & Olsen, 1996)。由此,有必要构建一个强有力的框架来评估数据的质量。这导致了对现有框架的审查,并形成了针对本研究的新框架。质量框架的回顾现有文献表明,质量框架的标准必须是通用的,适用于应用领域和数据类型,并明确定义,Price & Shanks,(2004)。Eppler(2001)提出,质量框架应该显示不同质量标准之间的相互依赖关系,使研究人员能够熟悉数据质量问题如何影响其他标准。数据质量评估方法和工具(DatQAM)提供了数据质量评估的系统实施,其中包括一系列考虑官方统计优势的质量措施。它涉及用户对相关性的满意度、抽样和非抽样误差、及时性的生产日期、元数据和传播形式的可用性、随时间的变化以及地理差异和一致性(欧盟统计局,2007年)。加拿大统计局(2010年)制定的质量保证框架(QAF)包括一些用于评估数据质量的质量措施,包括及时性、相关性、可解释性(元数据的完整性)、准确性(方差系数、代入率)、一致性和可及性的措施。这两个数据质量(DQ)框架在考虑数据质量度量和数据质量标准本身的方式上是相似的。它们也被广泛使用,其中一个例子是HSCIC使用DatQAM进行数据质量评估(HSCIC, 2013)。为了建立一个考虑DQ测量的框架,我们可以考虑这两个框架以及如何在其中测量标准,以便获得一个可以应用于我们在研究中使用的数据的全面框架。这些措施是根据DatQAM和QAF框架改编的,目的是量化我们的数据质量评估。此外,世界卫生组织(WHO)的为了对质量测量进行分类,使用了“质量标准”。数据质量审计工具(DQAT)由世卫组织和全球基金使用。经过交叉参考,决定将“保密”标准添加到改编自DQAT(2008)的框架中。...
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Knowledge, Attitude and Perceptions towards Basic Life Support Training among Student Teachers in a Malaysian University The Development of Artique - Independent Artists and Online Art Criticism Analysis of Personal-Professional Status of Women Managers for Sustainable School Management in Turkey Enforced Isolation: How does it affect the Psychosocial and Physical Development of Children? Online Assessment: How Effectively Do They Measure Student Learning at the Tertiary Level?
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1