Data quality in hospital information systems: Lessons learned from analyzing 30 years of patient data in a regional German hospital

IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS International Journal of Medical Informatics Pub Date : 2024-09-24 DOI:10.1016/j.ijmedinf.2024.105636
Stefan Förstel , Markus Förstel , Markus Gallistl , Dario Zanca , Bjoern M. Eskofier , Eva M. Rothgang
{"title":"Data quality in hospital information systems: Lessons learned from analyzing 30 years of patient data in a regional German hospital","authors":"Stefan Förstel ,&nbsp;Markus Förstel ,&nbsp;Markus Gallistl ,&nbsp;Dario Zanca ,&nbsp;Bjoern M. Eskofier ,&nbsp;Eva M. Rothgang","doi":"10.1016/j.ijmedinf.2024.105636","DOIUrl":null,"url":null,"abstract":"<div><div><em>Background</em>: The integration of Hospital Information Systems (HIS) into healthcare delivery has significantly enhanced patient care and operational efficiency. Nonetheless, the rapid acceleration of digital transformation has led to a substantial increase in the volume of data managed by these systems. This emphasizes the need for robust mechanisms for data management and quality assurance.</div><div><em>Objective</em>: This study addresses data quality issues related to patient identifiers within the Hospital Information System (HIS) of a regional German hospital, focusing on improving the accuracy and consistency of these administrative data entries.</div><div><em>Methods</em>: Employing a combination of data analysis and expert interviews, this study reviews and programmatically cleanses a dataset with over 2,000,000 patient data entries extracted from the HIS. The areas of investigation are patient admissions, discharges, and geographical data.</div><div><em>Results</em>: The analysis revealed that roughly 25% of the dataset was rendered unusable by errors and inconsistencies. By implementing a thorough data cleansing process, we significantly enhanced the utility of the dataset. In doing so, we identified the primary issues affecting data quality, including ambiguities among similar variables and a gap between the intended and actual use of the system.</div><div><em>Conclusion</em>: The findings highlight the critical importance of enhancing data quality in healthcare information systems. This study shows the necessity of a careful review of data extracted from the HIS before it can be reliably utilized for machine learning tasks, thereby rendering the data more usable for both clinical and analytical purposes.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"192 ","pages":"Article 105636"},"PeriodicalIF":3.7000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505624002995","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The integration of Hospital Information Systems (HIS) into healthcare delivery has significantly enhanced patient care and operational efficiency. Nonetheless, the rapid acceleration of digital transformation has led to a substantial increase in the volume of data managed by these systems. This emphasizes the need for robust mechanisms for data management and quality assurance.
Objective: This study addresses data quality issues related to patient identifiers within the Hospital Information System (HIS) of a regional German hospital, focusing on improving the accuracy and consistency of these administrative data entries.
Methods: Employing a combination of data analysis and expert interviews, this study reviews and programmatically cleanses a dataset with over 2,000,000 patient data entries extracted from the HIS. The areas of investigation are patient admissions, discharges, and geographical data.
Results: The analysis revealed that roughly 25% of the dataset was rendered unusable by errors and inconsistencies. By implementing a thorough data cleansing process, we significantly enhanced the utility of the dataset. In doing so, we identified the primary issues affecting data quality, including ambiguities among similar variables and a gap between the intended and actual use of the system.
Conclusion: The findings highlight the critical importance of enhancing data quality in healthcare information systems. This study shows the necessity of a careful review of data extracted from the HIS before it can be reliably utilized for machine learning tasks, thereby rendering the data more usable for both clinical and analytical purposes.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
医院信息系统的数据质量:分析德国一家地区医院 30 年病人数据的经验教训。
背景:医院信息系统(HIS)与医疗保健服务的整合大大提高了患者护理和运营效率。然而,数字化转型的迅猛发展导致这些系统管理的数据量大幅增加。这就强调了建立健全的数据管理和质量保证机制的必要性:本研究探讨了德国一家地区医院的医院信息系统(HIS)中与患者标识符相关的数据质量问题,重点是提高这些管理数据条目的准确性和一致性:本研究采用数据分析和专家访谈相结合的方法,对从 HIS 中提取的超过 2,000,000 条患者数据进行了审查和程序化清理。调查领域包括病人入院、出院和地理数据:分析结果显示,大约 25% 的数据集因错误和不一致而无法使用。通过实施彻底的数据清理流程,我们大大提高了数据集的实用性。在此过程中,我们发现了影响数据质量的主要问题,包括类似变量之间的歧义以及系统预期用途与实际用途之间的差距:研究结果凸显了提高医疗信息系统数据质量的重要性。这项研究表明,有必要对从医疗信息系统中提取的数据进行仔细审查,然后才能将其可靠地用于机器学习任务,从而使数据更适用于临床和分析目的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Medical Informatics
International Journal of Medical Informatics 医学-计算机:信息系统
CiteScore
8.90
自引率
4.10%
发文量
217
审稿时长
42 days
期刊介绍: International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings. The scope of journal covers: Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.; Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc. Educational computer based programs pertaining to medical informatics or medicine in general; Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.
期刊最新文献
Analysis of missing data in electronic health records of people with diabetes in primary care in Spain: A population-based cohort study Systematic construction of composite radiation therapy dataset using automated data pipeline for prognosis prediction Perceptions of healthcare professionals and patients with cardiovascular diseases on mHealth lifestyle apps: A qualitative study Smart data-driven medical decisions through collective and individual anomaly detection in healthcare time series An interpretable machine learning scoring tool for estimating time to recurrence readmissions in stroke patients
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1