Stefan Förstel , Markus Förstel , Markus Gallistl , Dario Zanca , Bjoern M. Eskofier , Eva M. Rothgang
{"title":"医院信息系统的数据质量:分析德国一家地区医院 30 年病人数据的经验教训。","authors":"Stefan Förstel , Markus Förstel , Markus Gallistl , Dario Zanca , Bjoern M. Eskofier , Eva M. Rothgang","doi":"10.1016/j.ijmedinf.2024.105636","DOIUrl":null,"url":null,"abstract":"<div><div><em>Background</em>: The integration of Hospital Information Systems (HIS) into healthcare delivery has significantly enhanced patient care and operational efficiency. Nonetheless, the rapid acceleration of digital transformation has led to a substantial increase in the volume of data managed by these systems. This emphasizes the need for robust mechanisms for data management and quality assurance.</div><div><em>Objective</em>: This study addresses data quality issues related to patient identifiers within the Hospital Information System (HIS) of a regional German hospital, focusing on improving the accuracy and consistency of these administrative data entries.</div><div><em>Methods</em>: Employing a combination of data analysis and expert interviews, this study reviews and programmatically cleanses a dataset with over 2,000,000 patient data entries extracted from the HIS. The areas of investigation are patient admissions, discharges, and geographical data.</div><div><em>Results</em>: The analysis revealed that roughly 25% of the dataset was rendered unusable by errors and inconsistencies. By implementing a thorough data cleansing process, we significantly enhanced the utility of the dataset. In doing so, we identified the primary issues affecting data quality, including ambiguities among similar variables and a gap between the intended and actual use of the system.</div><div><em>Conclusion</em>: The findings highlight the critical importance of enhancing data quality in healthcare information systems. This study shows the necessity of a careful review of data extracted from the HIS before it can be reliably utilized for machine learning tasks, thereby rendering the data more usable for both clinical and analytical purposes.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"192 ","pages":"Article 105636"},"PeriodicalIF":3.7000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data quality in hospital information systems: Lessons learned from analyzing 30 years of patient data in a regional German hospital\",\"authors\":\"Stefan Förstel , Markus Förstel , Markus Gallistl , Dario Zanca , Bjoern M. Eskofier , Eva M. Rothgang\",\"doi\":\"10.1016/j.ijmedinf.2024.105636\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div><em>Background</em>: The integration of Hospital Information Systems (HIS) into healthcare delivery has significantly enhanced patient care and operational efficiency. Nonetheless, the rapid acceleration of digital transformation has led to a substantial increase in the volume of data managed by these systems. This emphasizes the need for robust mechanisms for data management and quality assurance.</div><div><em>Objective</em>: This study addresses data quality issues related to patient identifiers within the Hospital Information System (HIS) of a regional German hospital, focusing on improving the accuracy and consistency of these administrative data entries.</div><div><em>Methods</em>: Employing a combination of data analysis and expert interviews, this study reviews and programmatically cleanses a dataset with over 2,000,000 patient data entries extracted from the HIS. The areas of investigation are patient admissions, discharges, and geographical data.</div><div><em>Results</em>: The analysis revealed that roughly 25% of the dataset was rendered unusable by errors and inconsistencies. By implementing a thorough data cleansing process, we significantly enhanced the utility of the dataset. In doing so, we identified the primary issues affecting data quality, including ambiguities among similar variables and a gap between the intended and actual use of the system.</div><div><em>Conclusion</em>: The findings highlight the critical importance of enhancing data quality in healthcare information systems. This study shows the necessity of a careful review of data extracted from the HIS before it can be reliably utilized for machine learning tasks, thereby rendering the data more usable for both clinical and analytical purposes.</div></div>\",\"PeriodicalId\":54950,\"journal\":{\"name\":\"International Journal of Medical Informatics\",\"volume\":\"192 \",\"pages\":\"Article 105636\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Medical Informatics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1386505624002995\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505624002995","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
背景:医院信息系统(HIS)与医疗保健服务的整合大大提高了患者护理和运营效率。然而,数字化转型的迅猛发展导致这些系统管理的数据量大幅增加。这就强调了建立健全的数据管理和质量保证机制的必要性:本研究探讨了德国一家地区医院的医院信息系统(HIS)中与患者标识符相关的数据质量问题,重点是提高这些管理数据条目的准确性和一致性:本研究采用数据分析和专家访谈相结合的方法,对从 HIS 中提取的超过 2,000,000 条患者数据进行了审查和程序化清理。调查领域包括病人入院、出院和地理数据:分析结果显示,大约 25% 的数据集因错误和不一致而无法使用。通过实施彻底的数据清理流程,我们大大提高了数据集的实用性。在此过程中,我们发现了影响数据质量的主要问题,包括类似变量之间的歧义以及系统预期用途与实际用途之间的差距:研究结果凸显了提高医疗信息系统数据质量的重要性。这项研究表明,有必要对从医疗信息系统中提取的数据进行仔细审查,然后才能将其可靠地用于机器学习任务,从而使数据更适用于临床和分析目的。
Data quality in hospital information systems: Lessons learned from analyzing 30 years of patient data in a regional German hospital
Background: The integration of Hospital Information Systems (HIS) into healthcare delivery has significantly enhanced patient care and operational efficiency. Nonetheless, the rapid acceleration of digital transformation has led to a substantial increase in the volume of data managed by these systems. This emphasizes the need for robust mechanisms for data management and quality assurance.
Objective: This study addresses data quality issues related to patient identifiers within the Hospital Information System (HIS) of a regional German hospital, focusing on improving the accuracy and consistency of these administrative data entries.
Methods: Employing a combination of data analysis and expert interviews, this study reviews and programmatically cleanses a dataset with over 2,000,000 patient data entries extracted from the HIS. The areas of investigation are patient admissions, discharges, and geographical data.
Results: The analysis revealed that roughly 25% of the dataset was rendered unusable by errors and inconsistencies. By implementing a thorough data cleansing process, we significantly enhanced the utility of the dataset. In doing so, we identified the primary issues affecting data quality, including ambiguities among similar variables and a gap between the intended and actual use of the system.
Conclusion: The findings highlight the critical importance of enhancing data quality in healthcare information systems. This study shows the necessity of a careful review of data extracted from the HIS before it can be reliably utilized for machine learning tasks, thereby rendering the data more usable for both clinical and analytical purposes.
期刊介绍:
International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings.
The scope of journal covers:
Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.;
Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc.
Educational computer based programs pertaining to medical informatics or medicine in general;
Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.