[数据驱动的重症监护:缺乏全面的数据集]。

IF 1.3 4区 医学 Q2 MEDICINE, GENERAL & INTERNAL Medizinische Klinik-Intensivmedizin Und Notfallmedizin Pub Date : 2024-06-01 Epub Date: 2024-04-26 DOI:10.1007/s00063-024-01141-z
Jan-Hendrik B Hardenberg
{"title":"[数据驱动的重症监护:缺乏全面的数据集]。","authors":"Jan-Hendrik B Hardenberg","doi":"10.1007/s00063-024-01141-z","DOIUrl":null,"url":null,"abstract":"<p><p>Intensive care units provide a data-rich environment with the potential to generate datasets in the realm of big data, which could be utilized to train powerful machine learning (ML) models. However, the currently available datasets are too small and exhibit too little diversity due to their limitation to individual hospitals. This lack of extensive and varied datasets is a primary reason for the limited generalizability and resulting low clinical utility of current ML models. Often, these models are based on data from single centers and suffer from poor external validity. There is an urgent need for the development of large-scale, multicentric, and multinational datasets. Ensuring data protection and minimizing re-identification risks pose central challenges in this process. The \"Amsterdam University Medical Center database (AmsterdamUMCdb)\" and the \"Salzburg Intensive Care database (SICdb)\" demonstrate that open access datasets are possible in Europe while complying with the data protection regulations of the General Data Protection Regulation (GDPR). Another challenge in building intensive care datasets is the absence of semantic definitions in the source data and the heterogeneity of data formats. Establishing binding industry standards for the semantic definition is crucial to ensure seamless semantic interoperability between datasets.</p>","PeriodicalId":49019,"journal":{"name":"Medizinische Klinik-Intensivmedizin Und Notfallmedizin","volume":" ","pages":"352-357"},"PeriodicalIF":1.3000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"[Data-driven intensive care: a lack of comprehensive datasets].\",\"authors\":\"Jan-Hendrik B Hardenberg\",\"doi\":\"10.1007/s00063-024-01141-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Intensive care units provide a data-rich environment with the potential to generate datasets in the realm of big data, which could be utilized to train powerful machine learning (ML) models. However, the currently available datasets are too small and exhibit too little diversity due to their limitation to individual hospitals. This lack of extensive and varied datasets is a primary reason for the limited generalizability and resulting low clinical utility of current ML models. Often, these models are based on data from single centers and suffer from poor external validity. There is an urgent need for the development of large-scale, multicentric, and multinational datasets. Ensuring data protection and minimizing re-identification risks pose central challenges in this process. The \\\"Amsterdam University Medical Center database (AmsterdamUMCdb)\\\" and the \\\"Salzburg Intensive Care database (SICdb)\\\" demonstrate that open access datasets are possible in Europe while complying with the data protection regulations of the General Data Protection Regulation (GDPR). Another challenge in building intensive care datasets is the absence of semantic definitions in the source data and the heterogeneity of data formats. Establishing binding industry standards for the semantic definition is crucial to ensure seamless semantic interoperability between datasets.</p>\",\"PeriodicalId\":49019,\"journal\":{\"name\":\"Medizinische Klinik-Intensivmedizin Und Notfallmedizin\",\"volume\":\" \",\"pages\":\"352-357\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medizinische Klinik-Intensivmedizin Und Notfallmedizin\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s00063-024-01141-z\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/4/26 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICINE, GENERAL & INTERNAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medizinische Klinik-Intensivmedizin Und Notfallmedizin","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00063-024-01141-z","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/4/26 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

摘要

重症监护病房提供了一个数据丰富的环境,有可能产生大数据领域的数据集,可用于训练强大的机器学习(ML)模型。然而,由于局限于单个医院,目前可用的数据集规模太小,表现出的多样性太少。缺乏广泛而多样的数据集是导致当前 ML 模型通用性有限、临床实用性低的主要原因。这些模型通常基于单个中心的数据,外部有效性较差。目前迫切需要开发大规模、多中心和多国数据集。在这一过程中,确保数据保护和最大限度降低重新识别风险是核心挑战。阿姆斯特丹大学医学中心数据库(AmsterdamUMCdb)"和 "萨尔茨堡重症监护数据库(SICdb)"表明,在欧洲,开放访问数据集是可能的,同时也符合《通用数据保护条例》(GDPR)的数据保护规定。建立重症监护数据集的另一个挑战是源数据中语义定义的缺失和数据格式的不统一。为语义定义建立具有约束力的行业标准对于确保数据集之间无缝的语义互操作性至关重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
[Data-driven intensive care: a lack of comprehensive datasets].

Intensive care units provide a data-rich environment with the potential to generate datasets in the realm of big data, which could be utilized to train powerful machine learning (ML) models. However, the currently available datasets are too small and exhibit too little diversity due to their limitation to individual hospitals. This lack of extensive and varied datasets is a primary reason for the limited generalizability and resulting low clinical utility of current ML models. Often, these models are based on data from single centers and suffer from poor external validity. There is an urgent need for the development of large-scale, multicentric, and multinational datasets. Ensuring data protection and minimizing re-identification risks pose central challenges in this process. The "Amsterdam University Medical Center database (AmsterdamUMCdb)" and the "Salzburg Intensive Care database (SICdb)" demonstrate that open access datasets are possible in Europe while complying with the data protection regulations of the General Data Protection Regulation (GDPR). Another challenge in building intensive care datasets is the absence of semantic definitions in the source data and the heterogeneity of data formats. Establishing binding industry standards for the semantic definition is crucial to ensure seamless semantic interoperability between datasets.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.60
自引率
9.10%
发文量
93
审稿时长
6-12 weeks
期刊介绍: Medizinische Klinik – Intensivmedizin und Notfallmedizin is an internationally respected interdisciplinary journal. It is intended for physicians, nurses, respiratory and physical therapists active in intensive care and accident/emergency units, but also for internists, anesthesiologists, surgeons, neurologists, and pediatricians with special interest in intensive care medicine. Comprehensive reviews describe the most recent advances in the field of internal medicine with special focus on intensive care problems. Freely submitted original articles present important studies in this discipline and promote scientific exchange, while articles in the category Photo essay feature interesting cases and aim at optimizing diagnostic and therapeutic strategies. In the rubric journal club well-respected experts comment on outstanding international publications. Review articles under the rubric "Continuing Medical Education" present verified results of scientific research and their integration into daily practice. The rubrics "Nursing practice" and "Physical therapy" round out the information.
期刊最新文献
[In-hospital management of intracerebral hemorrhage]. [Drug-drug interactions in critically ill patients]. [Summary of the S1 guideline on sustainability in intensive care and emergency medicine]. [Mechanical support with impella decreases mortality in cardiogenic shock (DanGer Shock trial)]. [Developing kinaesthetic competence and team dynamics : A quality improvement project in a stroke unit].
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1