首页 > 最新文献

Studies in health technology and informatics最新文献

英文 中文
Improving Requirements Documentation in the Medical Informatics Initiative Core Data Set Using FHIR Obligations - Lessons Learned. 使用FHIR义务改进医学信息学倡议核心数据集中的需求文档-经验教训。
Pub Date : 2025-09-03 DOI: 10.3233/SHTI251401
Julian Saß, Sylvia Thun

Introduction: The Medical Informatics Initiative (MII) aims to enable cross-site secondary use of clinical data in Germany using a FHIR-based Core Data Set (CDS). However, current FHIR Implementation Guides (IG) often lack actor-specific guidance, leading to inconsistent interpretations and implementations.

Methods: This technical case report explores the use of FHIR Implementation Obligations to clarify responsibilities and expected system behavior within the MII infrastructure. Obligations were modeled using the FHIR obligation extension and ActorDefinition resources, applied to the Patient profile from the CDS Person module. A prototype IG was generated using the HL7 FHIR IG publisher tooling.

Results: Obligations were defined and rendered for multiple actors - such as Data Integration Centers (DIC) and the Health Research Data Portal (FDPG) - across selected Patient profile elements. Obligations were also linked to specific operations, enabling precise workflow targeting. The implementation improved the explicitness of responsibilities that were previously only implied.

Discussion: The study demonstrates that obligations enhance the clarity of FHIR IGs. However, limitations remain: the MII's current IG tooling does not yet support obligations, and conformance testing was not addressed. Further work is needed to standardize ActorDefinition resources, align obligations across modules, and develop validation tooling to realize the full potential of obligation-driven specifications.

简介:医学信息学倡议(MII)旨在通过基于fhir的核心数据集(CDS)实现德国临床数据的跨站点二次使用。然而,目前的FHIR实施指南(IG)往往缺乏针对行为者的指导,导致解释和实施不一致。方法:本技术案例报告探讨了使用FHIR实现义务来澄清MII基础结构中的责任和预期的系统行为。使用FHIR义务扩展和ActorDefinition资源对义务进行建模,并将其应用于来自CDS Person模块的患者配置文件。使用HL7 FHIR IG发布者工具生成了一个原型IG。结果:在选定的患者档案元素中,为多个参与者(如数据集成中心(DIC)和健康研究数据门户(FDPG))定义和呈现了义务。还将义务与具体行动联系起来,从而能够精确地确定工作流程。该实现改进了以前只是隐含的责任的显式性。讨论:研究表明,义务增强了FHIR ig的清晰度。然而,限制仍然存在:MII当前的IG工具还不支持义务,并且没有解决一致性测试问题。需要进一步的工作来标准化ActorDefinition资源,跨模块调整义务,并开发验证工具以实现义务驱动规范的全部潜力。
{"title":"Improving Requirements Documentation in the Medical Informatics Initiative Core Data Set Using FHIR Obligations - Lessons Learned.","authors":"Julian Saß, Sylvia Thun","doi":"10.3233/SHTI251401","DOIUrl":"10.3233/SHTI251401","url":null,"abstract":"<p><strong>Introduction: </strong>The Medical Informatics Initiative (MII) aims to enable cross-site secondary use of clinical data in Germany using a FHIR-based Core Data Set (CDS). However, current FHIR Implementation Guides (IG) often lack actor-specific guidance, leading to inconsistent interpretations and implementations.</p><p><strong>Methods: </strong>This technical case report explores the use of FHIR Implementation Obligations to clarify responsibilities and expected system behavior within the MII infrastructure. Obligations were modeled using the FHIR obligation extension and ActorDefinition resources, applied to the Patient profile from the CDS Person module. A prototype IG was generated using the HL7 FHIR IG publisher tooling.</p><p><strong>Results: </strong>Obligations were defined and rendered for multiple actors - such as Data Integration Centers (DIC) and the Health Research Data Portal (FDPG) - across selected Patient profile elements. Obligations were also linked to specific operations, enabling precise workflow targeting. The implementation improved the explicitness of responsibilities that were previously only implied.</p><p><strong>Discussion: </strong>The study demonstrates that obligations enhance the clarity of FHIR IGs. However, limitations remain: the MII's current IG tooling does not yet support obligations, and conformance testing was not addressed. Further work is needed to standardize ActorDefinition resources, align obligations across modules, and develop validation tooling to realize the full potential of obligation-driven specifications.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"235-244"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144985126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Medical Entity Linking in Low-Resource Settings with Fine-Tuning-Free LLMs. 使用无微调llm的低资源环境中的医疗实体链接。
Pub Date : 2025-09-03 DOI: 10.3233/SHTI251402
Suteera Seeha, Martin Boeker, Luise Modersohn

Introduction: Medical entity linking is an important task in biomedical natural language processing, aiming to align textual mentions of medical concepts with standardized concepts in ontologies. Most existing approaches rely on supervised models or domain-specific embeddings, which require large datasets and significant computational resources.

Objective: The objective of this work is (1) to investigate the effectiveness of large language models (LLMs) in improving both candidate generation and disambiguation for medical entity linking through synonym expansion and in-context learning, and (2) to evaluate this approach against traditional string-matching and supervised methods.

Methods: We propose a simple yet effective approach that combines string matching with an LLM through in-context learning. Our method avoids fine-tuning and minimizes annotation requirements, making it suitable for low-resource settings. Our system enhances fuzzy string matching by expanding mention spans with LLM-generated synonyms during candidate generation. UMLS entity names, aliases, and synonyms are indexed in Elasticsearch, and candidates are retrieved using both the original span and generated variants. Disambiguation is performed using an LLM with few-shot prompting to select the correct entity from the candidate list.

Results: Evaluated on the MedMentions dataset, our approach achieves 56% linking accuracy, outperforming baseline string matching but falling behind supervised learning methods. The candidate generation component reaches 70% recall@5, while the disambiguation step achieves 80% accuracy when the correct entity is among the top five. We also observe that LLM-generated descriptions do not always improve accuracy.

Conclusion: Our results demonstrate that LLMs have the potential to support medical entity linking in low-resource settings. Although our method is still outperformed by supervised models, it remains a lightweight alternative, requiring no fine-tuning or a large amount of annotated data. The approach is also adaptable to other domains and ontologies beyond biomedicine due to its flexible and domain-agnostic design.

医学实体链接是生物医学自然语言处理中的一项重要任务,旨在将医学概念的文本提及与本体中的标准化概念对齐。大多数现有方法依赖于监督模型或特定领域的嵌入,这需要大量的数据集和大量的计算资源。目的:本研究的目的是:(1)研究大型语言模型(llm)在通过同义词扩展和上下文学习改善医学实体链接的候选词生成和消歧方面的有效性,以及(2)将这种方法与传统的字符串匹配和监督方法进行比较。方法:我们提出了一种简单而有效的方法,通过上下文学习将字符串匹配与LLM相结合。我们的方法避免了微调并最小化了注释需求,使其适合低资源设置。我们的系统通过在候选词生成过程中扩展llm生成的同义词的提及范围来增强模糊字符串匹配。在Elasticsearch中索引UMLS实体名称、别名和同义词,并使用原始跨度和生成的变体检索候选对象。消除歧义是使用LLM执行的,并带有少量提示,以便从候选列表中选择正确的实体。结果:在med提及数据集上进行评估,我们的方法达到56%的链接准确率,优于基线字符串匹配,但落后于监督学习方法。候选生成组件达到70% recall@5,而当正确实体位于前五名时,消歧步骤达到80%的准确率。我们还观察到llm生成的描述并不总是提高准确性。结论:我们的研究结果表明,法学硕士有潜力支持医疗实体链接在低资源设置。尽管我们的方法仍然优于监督模型,但它仍然是一种轻量级的替代方法,不需要微调或大量注释数据。该方法由于其灵活和领域不可知的设计,也适用于生物医学以外的其他领域和本体。
{"title":"Medical Entity Linking in Low-Resource Settings with Fine-Tuning-Free LLMs.","authors":"Suteera Seeha, Martin Boeker, Luise Modersohn","doi":"10.3233/SHTI251402","DOIUrl":"10.3233/SHTI251402","url":null,"abstract":"<p><strong>Introduction: </strong>Medical entity linking is an important task in biomedical natural language processing, aiming to align textual mentions of medical concepts with standardized concepts in ontologies. Most existing approaches rely on supervised models or domain-specific embeddings, which require large datasets and significant computational resources.</p><p><strong>Objective: </strong>The objective of this work is (1) to investigate the effectiveness of large language models (LLMs) in improving both candidate generation and disambiguation for medical entity linking through synonym expansion and in-context learning, and (2) to evaluate this approach against traditional string-matching and supervised methods.</p><p><strong>Methods: </strong>We propose a simple yet effective approach that combines string matching with an LLM through in-context learning. Our method avoids fine-tuning and minimizes annotation requirements, making it suitable for low-resource settings. Our system enhances fuzzy string matching by expanding mention spans with LLM-generated synonyms during candidate generation. UMLS entity names, aliases, and synonyms are indexed in Elasticsearch, and candidates are retrieved using both the original span and generated variants. Disambiguation is performed using an LLM with few-shot prompting to select the correct entity from the candidate list.</p><p><strong>Results: </strong>Evaluated on the MedMentions dataset, our approach achieves 56% linking accuracy, outperforming baseline string matching but falling behind supervised learning methods. The candidate generation component reaches 70% recall@5, while the disambiguation step achieves 80% accuracy when the correct entity is among the top five. We also observe that LLM-generated descriptions do not always improve accuracy.</p><p><strong>Conclusion: </strong>Our results demonstrate that LLMs have the potential to support medical entity linking in low-resource settings. Although our method is still outperformed by supervised models, it remains a lightweight alternative, requiring no fine-tuning or a large amount of annotated data. The approach is also adaptable to other domains and ontologies beyond biomedicine due to its flexible and domain-agnostic design.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"245-254"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144985149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CAMIH - The Complementary and Alternative Medicine Insights Hub. CAMIH -补充和替代医学见解中心。
Pub Date : 2025-09-03 DOI: 10.3233/SHTI251377
Christian Otto, Jennifer Dörfler, Cord Spreckelsen, Jutta Hübner

Introduction: Assessing the ever-growing number of publications in evidence-based medicine by means of their risk of biases is as essential as it is challenging. This is especially true for the field of complementary and alternative medicine (CAM), a field that remains underrepresented in systematic review collections such as those by the Cochrane Review Groups.

Methods: In this work, we present CAMIH, a semantic wiki platform that offers clinicians a collaborative space to find, summarize, and discuss CAM evidence. CAMIH is built on semantic web technologies and structures information using semantic triplets. By structuring like this, CAMIH goes beyond simple data collection. Our goal is to enable a deeper understanding and organization of evidence, thereby acting as a CAM-specific supplement to existing evidence-synthesis frameworks inspired by the Cochrane methodology.

Results: We anticipate the implemented platform to make evidence synthesis and risk of bias assessment more efficient, but also reduce the time required to derive treatment strategies. Given its foundation in semantic web technologies, it serves both as a practical tool for clinicians and as a methodological blueprint for other research domains seeking to systematically organize gathered evidence.

Discussion: Given the advantages of the platform, it requires, in its current state, manual efforts to be kept up to date. However, our goal is too semi-automize this process to sustainably keep CAMIH relevant.

Conclusion: This work provides an addition to the evidence database-landscape for the CAM field. We hope it will enable clinicians to create, discuss, and synthesize evidence while also providing a blueprint for other research areas that want to organize evidence.

引言:通过评估其偏倚风险来评估越来越多的循证医学出版物是必要的,也是具有挑战性的。这对于补充和替代医学(CAM)领域尤其如此,这个领域在诸如Cochrane综述组的系统综述集合中仍然代表性不足。方法:在这项工作中,我们提出了CAMIH,这是一个语义wiki平台,为临床医生提供了一个协作空间来查找、总结和讨论CAM证据。CAMIH建立在语义web技术的基础上,并使用语义三元组来构建信息。通过这样的结构,CAMIH超越了简单的数据收集。我们的目标是对证据进行更深入的理解和组织,从而作为cam特有的补充,补充受Cochrane方法论启发的现有证据合成框架。结果:我们期望实施的平台能够更有效地进行证据综合和偏倚风险评估,同时减少制定治疗策略所需的时间。鉴于其基于语义网技术,它既是临床医生的实用工具,也是其他研究领域寻求系统组织收集到的证据的方法论蓝图。讨论:考虑到平台的优势,在目前的状态下,它需要手动努力保持最新。然而,我们的目标是将这一过程半自动化,以持续保持CAMIH的相关性。结论:这项工作为CAM领域的证据数据库景观提供了一个补充。我们希望它将使临床医生能够创造、讨论和综合证据,同时也为其他想要组织证据的研究领域提供蓝图。
{"title":"CAMIH - The Complementary and Alternative Medicine Insights Hub.","authors":"Christian Otto, Jennifer Dörfler, Cord Spreckelsen, Jutta Hübner","doi":"10.3233/SHTI251377","DOIUrl":"10.3233/SHTI251377","url":null,"abstract":"<p><strong>Introduction: </strong>Assessing the ever-growing number of publications in evidence-based medicine by means of their risk of biases is as essential as it is challenging. This is especially true for the field of complementary and alternative medicine (CAM), a field that remains underrepresented in systematic review collections such as those by the Cochrane Review Groups.</p><p><strong>Methods: </strong>In this work, we present CAMIH, a semantic wiki platform that offers clinicians a collaborative space to find, summarize, and discuss CAM evidence. CAMIH is built on semantic web technologies and structures information using semantic triplets. By structuring like this, CAMIH goes beyond simple data collection. Our goal is to enable a deeper understanding and organization of evidence, thereby acting as a CAM-specific supplement to existing evidence-synthesis frameworks inspired by the Cochrane methodology.</p><p><strong>Results: </strong>We anticipate the implemented platform to make evidence synthesis and risk of bias assessment more efficient, but also reduce the time required to derive treatment strategies. Given its foundation in semantic web technologies, it serves both as a practical tool for clinicians and as a methodological blueprint for other research domains seeking to systematically organize gathered evidence.</p><p><strong>Discussion: </strong>Given the advantages of the platform, it requires, in its current state, manual efforts to be kept up to date. However, our goal is too semi-automize this process to sustainably keep CAMIH relevant.</p><p><strong>Conclusion: </strong>This work provides an addition to the evidence database-landscape for the CAM field. We hope it will enable clinicians to create, discuss, and synthesize evidence while also providing a blueprint for other research areas that want to organize evidence.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"35-43"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144985024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Consent Management 2.0: Empowering Patient Will in Medical Research and Care. 同意管理2.0:在医学研究和护理中增强患者意愿。
Pub Date : 2025-09-03 DOI: 10.3233/SHTI251389
Sebastian Stäubert, Angela Merzweiler, Jörg Römhild, Stefan Lang, Martin Bialke

Introduction: The lawful processing of health data in medical research necessitates robust mechanisms for managing patient consent and objections, aligning with national and european regulations. While the initial version of the HL7 standard Consent Management" primarily focused on opt-in scenarios, evolving legal landscapes and practical implementation challenges highlight the need for comprehensive solutions encompassing both opt-in and opt-out approaches, including withdrawals and objections. This paper details the systematic revision of the latest HL7 FHIR-based "Consent Management 2.0" standard to address these limitations.

Methods: Our methodology involved a critical assessment of the 2021 standard against three years of practical experience and emerging regulatory requirements.

Results: Key improvements include enhanced support for diverse document types (consent, withdrawal, refusal, objection), refined technical specifications for automated conversion of questionnaire responses into machine-readable Consent Resources, and the introduction of a novel "ResultType" category. This new category enables use-case-specific aggregation of consent information, simplifying downstream processing and reducing interpretation ambiguities. Additionally, uniform FHIR search parameters were defined, and comprehensive examples were integrated into the implementation guide. The revised standard successfully underwent the HL7 ballot process in April 2025, with early practical implementations already demonstrating its utility.

Conclusion: This extended standard significantly enhances the interoperability and legal robustness of consent management in complex research infrastructures, fostering improved patient autonomy and trust in digital health data reuse.

导言:医学研究中健康数据的合法处理需要强有力的机制来管理患者的同意和反对,并与国家和欧洲法规保持一致。虽然HL7标准“同意管理”的初始版本主要侧重于选择加入的场景,但不断变化的法律环境和实际实施中的挑战凸显了对选择加入和选择退出方法(包括撤回和反对)的综合解决方案的需求。本文详细介绍了最新HL7基于fhr的“同意管理2.0”标准的系统修订,以解决这些限制。方法:我们的方法包括根据三年的实践经验和新兴的监管要求对2021年标准进行关键评估。结果:主要改进包括增强了对不同文档类型(同意、撤回、拒绝、反对)的支持,改进了将问卷回答自动转换为机器可读的同意资源的技术规范,并引入了新的“ResultType”类别。这一新类别允许特定用例的同意信息聚合,简化下游处理并减少解释歧义。此外,定义了统一的FHIR搜索参数,并将综合示例集成到实现指南中。修订后的标准在2025年4月成功地通过了HL7投票过程,早期的实际实现已经证明了它的实用性。结论:该扩展标准显著增强了复杂研究基础设施中同意管理的互操作性和法律稳健性,促进了患者在数字健康数据重用方面的自主权和信任。
{"title":"Consent Management 2.0: Empowering Patient Will in Medical Research and Care.","authors":"Sebastian Stäubert, Angela Merzweiler, Jörg Römhild, Stefan Lang, Martin Bialke","doi":"10.3233/SHTI251389","DOIUrl":"10.3233/SHTI251389","url":null,"abstract":"<p><strong>Introduction: </strong>The lawful processing of health data in medical research necessitates robust mechanisms for managing patient consent and objections, aligning with national and european regulations. While the initial version of the HL7 standard Consent Management\" primarily focused on opt-in scenarios, evolving legal landscapes and practical implementation challenges highlight the need for comprehensive solutions encompassing both opt-in and opt-out approaches, including withdrawals and objections. This paper details the systematic revision of the latest HL7 FHIR-based \"Consent Management 2.0\" standard to address these limitations.</p><p><strong>Methods: </strong>Our methodology involved a critical assessment of the 2021 standard against three years of practical experience and emerging regulatory requirements.</p><p><strong>Results: </strong>Key improvements include enhanced support for diverse document types (consent, withdrawal, refusal, objection), refined technical specifications for automated conversion of questionnaire responses into machine-readable Consent Resources, and the introduction of a novel \"ResultType\" category. This new category enables use-case-specific aggregation of consent information, simplifying downstream processing and reducing interpretation ambiguities. Additionally, uniform FHIR search parameters were defined, and comprehensive examples were integrated into the implementation guide. The revised standard successfully underwent the HL7 ballot process in April 2025, with early practical implementations already demonstrating its utility.</p><p><strong>Conclusion: </strong>This extended standard significantly enhances the interoperability and legal robustness of consent management in complex research infrastructures, fostering improved patient autonomy and trust in digital health data reuse.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"133-141"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144985043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Medium Scale, Open-Source Large Language Models: Towards Decision Support in a Precision Oncology Care Delivery Context. 评估中等规模,开源的大型语言模型:在精确肿瘤护理交付环境下的决策支持。
Pub Date : 2025-09-03 DOI: 10.3233/SHTI251382
Kevin Kaufmes, Georg Mathes, Dilyana Vladimirova, Stephanie Berger, Christian Fegeler, Stefan Sigle

Introduction: In the context of precision oncology, patients often have complex conditions that require treatment based on specific and up-to-date knowledge of guidelines and research. This entails considerable effort when preparing such cases for molecular tumor boards (MTBs). Large language models (LLMs) could help to lower this burden if they could provide such information quickly and precisely on demand. Since out-of-the-box LLMs are not specialized for clinical contexts, this work aims to investigate their usefulness for answering questions arising during MTB preparation. As such questions can contain sensitive data, we evaluated medium-scale models suitable for running on-premise using consumer grade hardware.

Methods: Three recent LLMs to be tested were selected based on established benchmarks and unique characteristics like reasoning capability. Exemplary questions related to MTBs were collected from domain experts. Six of those were selected for the LLMs to generate responses to. Response quality and correctness was evaluated by experts using a questionnaire.

Results: Out of 60 contacted domain experts, 5 fully completed the survey, with another 5 completing it partially. The evaluation revealed a modest overall performance. Our findings identified significant issues, where a large percentage of answers contained outdated or incomplete information, as well as factual errors. Additionally, a high discordance between evaluators regarding correctness and varying rater confidence has been observed.

Conclusion: Our results seem to be indicating that medium-scale LLMs are currently insufficiently reliable for use in precision oncology. Common issues include outdated information and confident presentation of misinformation, which indicates a gap between benchmark- and real-world performance. Future research should focus on mitigating limitations with advanced techniques such as Retrieval-Augmented-Generation (RAG), web search capability or advanced prompting, while prioritizing patient safety.

在精确肿瘤学的背景下,患者往往有复杂的情况,需要基于特定的和最新的指导方针和研究知识的治疗。在为分子肿瘤板(MTBs)准备此类病例时,这需要相当大的努力。大型语言模型(llm)如果能够根据需要快速准确地提供这些信息,就可以帮助减轻这种负担。由于开箱即用的法学硕士不是专门为临床环境,这项工作的目的是调查他们的有用性,以回答在MTB准备过程中出现的问题。由于这些问题可能包含敏感数据,我们评估了适合使用消费级硬件在本地运行的中等规模模型。方法:根据已建立的基准和推理能力等独特特征,选择三个最新的llm进行测试。从领域专家那里收集了与mtb相关的示例性问题。其中6个被法学硕士选中以产生响应。回答的质量和正确性由专家使用问卷进行评估。结果:在联系的60位领域专家中,5位完全完成了调查,另外5位部分完成了调查。评价显示总体表现一般。我们的发现发现了一些重大问题,其中很大一部分答案包含过时或不完整的信息,以及事实错误。此外,评估者之间关于正确性和不同的可信度的高度不一致已被观察到。结论:我们的结果似乎表明,中等规模的llm目前在精确肿瘤学中应用不够可靠。常见的问题包括过时的信息和错误信息的自信呈现,这表明基准测试和实际性能之间存在差距。未来的研究应侧重于利用检索增强生成(RAG)、网络搜索能力或高级提示等先进技术减轻局限性,同时优先考虑患者安全。
{"title":"Evaluating Medium Scale, Open-Source Large Language Models: Towards Decision Support in a Precision Oncology Care Delivery Context.","authors":"Kevin Kaufmes, Georg Mathes, Dilyana Vladimirova, Stephanie Berger, Christian Fegeler, Stefan Sigle","doi":"10.3233/SHTI251382","DOIUrl":"10.3233/SHTI251382","url":null,"abstract":"<p><strong>Introduction: </strong>In the context of precision oncology, patients often have complex conditions that require treatment based on specific and up-to-date knowledge of guidelines and research. This entails considerable effort when preparing such cases for molecular tumor boards (MTBs). Large language models (LLMs) could help to lower this burden if they could provide such information quickly and precisely on demand. Since out-of-the-box LLMs are not specialized for clinical contexts, this work aims to investigate their usefulness for answering questions arising during MTB preparation. As such questions can contain sensitive data, we evaluated medium-scale models suitable for running on-premise using consumer grade hardware.</p><p><strong>Methods: </strong>Three recent LLMs to be tested were selected based on established benchmarks and unique characteristics like reasoning capability. Exemplary questions related to MTBs were collected from domain experts. Six of those were selected for the LLMs to generate responses to. Response quality and correctness was evaluated by experts using a questionnaire.</p><p><strong>Results: </strong>Out of 60 contacted domain experts, 5 fully completed the survey, with another 5 completing it partially. The evaluation revealed a modest overall performance. Our findings identified significant issues, where a large percentage of answers contained outdated or incomplete information, as well as factual errors. Additionally, a high discordance between evaluators regarding correctness and varying rater confidence has been observed.</p><p><strong>Conclusion: </strong>Our results seem to be indicating that medium-scale LLMs are currently insufficiently reliable for use in precision oncology. Common issues include outdated information and confident presentation of misinformation, which indicates a gap between benchmark- and real-world performance. Future research should focus on mitigating limitations with advanced techniques such as Retrieval-Augmented-Generation (RAG), web search capability or advanced prompting, while prioritizing patient safety.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"81-90"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144985061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implementation of HL7 FHIR-Based Terminology Services for a National Federated Health Research Infrastructure. 国家联邦卫生研究基础设施HL7基于fhir的术语服务的实现。
Pub Date : 2025-09-03 DOI: 10.3233/SHTI251396
Joshua Wiedekopf, Tessa Ohlsen, Alan Koops, Ann-Kristin Kock-Schoppenhauer, Muhammad Adnan, Sarah Ballout, Nele Philipzik, Oya Beyan, Andreas Beyer, Michael Marschollek, Josef Ingenerf

Introduction: As part of the German Medical Informatics Initiative (MII) and Network University Medicine (NUM), a central research terminology service (TS) is provided by the Service Unit Terminology Services (SU-TermServ). This HL7 FHIR-based service depends on the timely and comprehensive availability of FHIR terminology resources to provide the necessary interactions for the distributed MII/NUM infrastructure. While German legislation has recently instituted a national terminology service for medical classifications and terminologies, the scope of the MII and NUM extends beyond routine patient care, encompassing the need for supplementary or specialized services and terminologies that are not commonly utilized elsewhere.

Methods: The SU-TermServ's processes are based on established FHIR principles and the recently-proposed Canonical Resources Management Infrastructure Implementation Guide, which are outlined in this paper.

Results: The strategy and processes implemented within the project can deliver the needed resources both to the central FHIR terminology service, but also to the local data integration centers, in a transparent and consistent fashion. The service currently provides approximately 7000 resources to users via the standardized FHIR API.

Conclusion: The professionalized distribution and maintenance of these terminological resources and the provision of a powerful TS implementation aids both the development of the Core Data Set and the data integration centers, and ultimately biomedical researchers requesting access to this rich data.

简介:作为德国医学信息学倡议(MII)和网络大学医学(NUM)的一部分,中央研究术语服务(TS)由服务单位术语服务(SU-TermServ)提供。这种基于HL7 FHIR的服务依赖于FHIR术语资源的及时和全面可用性,从而为分布式MII/NUM基础设施提供必要的交互。虽然德国立法机构最近设立了医疗分类和术语的国家术语服务机构,但MII和NUM的范围超出了常规的病人护理,包括补充或专门服务的需要以及在其他地方不常用的术语。方法:SU-TermServ的流程基于已建立的FHIR原则和最近提出的规范资源管理基础设施实施指南,本文对此进行了概述。结果:在项目中实施的策略和流程可以以透明和一致的方式向中央FHIR术语服务以及本地数据集成中心提供所需的资源。该服务目前通过标准化的FHIR API为用户提供了大约7000个资源。结论:这些术语资源的专业化分布和维护以及提供强大的TS实现有助于核心数据集和数据集成中心的发展,并最终帮助生物医学研究人员访问这些丰富的数据。
{"title":"Implementation of HL7 FHIR-Based Terminology Services for a National Federated Health Research Infrastructure.","authors":"Joshua Wiedekopf, Tessa Ohlsen, Alan Koops, Ann-Kristin Kock-Schoppenhauer, Muhammad Adnan, Sarah Ballout, Nele Philipzik, Oya Beyan, Andreas Beyer, Michael Marschollek, Josef Ingenerf","doi":"10.3233/SHTI251396","DOIUrl":"10.3233/SHTI251396","url":null,"abstract":"<p><strong>Introduction: </strong>As part of the German Medical Informatics Initiative (MII) and Network University Medicine (NUM), a central research terminology service (TS) is provided by the Service Unit Terminology Services (SU-TermServ). This HL7 FHIR-based service depends on the timely and comprehensive availability of FHIR terminology resources to provide the necessary interactions for the distributed MII/NUM infrastructure. While German legislation has recently instituted a national terminology service for medical classifications and terminologies, the scope of the MII and NUM extends beyond routine patient care, encompassing the need for supplementary or specialized services and terminologies that are not commonly utilized elsewhere.</p><p><strong>Methods: </strong>The SU-TermServ's processes are based on established FHIR principles and the recently-proposed Canonical Resources Management Infrastructure Implementation Guide, which are outlined in this paper.</p><p><strong>Results: </strong>The strategy and processes implemented within the project can deliver the needed resources both to the central FHIR terminology service, but also to the local data integration centers, in a transparent and consistent fashion. The service currently provides approximately 7000 resources to users via the standardized FHIR API.</p><p><strong>Conclusion: </strong>The professionalized distribution and maintenance of these terminological resources and the provision of a powerful TS implementation aids both the development of the Core Data Set and the data integration centers, and ultimately biomedical researchers requesting access to this rich data.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"195-203"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144985108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Anonymization of Health Insurance Claims Data for Medication Safety Assessments. 用于药物安全评估的健康保险索赔数据的匿名化。
Pub Date : 2025-09-03 DOI: 10.3233/SHTI251407
Mehmed Halilovic, Karen Otte, Thierry Meurers, Marco Alibone, Marion Ludwig, Nico Riedel, Steven Wolter, Lisa Kühnel, Steffen Hess, Fabian Prasser

Introduction: The re-use of health insurance claims data for research purposes can provide valuable insights to improve patient care. However, as health data is often highly sensitive and subject to strict regulatory frameworks, the privacy of individuals must be protected. Anonymization is a common approach to do so, but finding an effective strategy is challenging due to an inherent trade-off between privacy protection and data utility. A structured approach is needed to balance these objectives and guide the selection of appropriate anonymization strategies.

Methods: In this paper, we present a systematic evaluation of twelve anonymization strategies applied to German health insurance claims data that has previously been used in a drug safety study. The dataset consisted of 1727 records and 45 variables. Based on a structured threat modeling, we compare a conservative and a threat modeling-based approach, each with six different privacy models and risk thresholds using the ARX Data Anonymization Tool. We assess general data utility and empirically evaluate residual privacy risks using both the Anonymeter framework and a membership inference attack.

Results: Our results show that conservative anonymization ensures strong privacy protection but reduces data utility. In contrast, threat modeling retains more utility while still providing acceptable privacy under moderate thresholds.

Conclusion: The proposed process enables a systematic comparison of privacy-utility trade-offs and can be adapted to other medical datasets. Our findings highlight the importance of context-specific anonymization strategies and empirical risk evaluation to guide anonymized data sharing in healthcare.

简介:出于研究目的重用健康保险索赔数据可以为改善患者护理提供有价值的见解。然而,由于健康数据往往高度敏感,并受到严格监管框架的约束,因此必须保护个人隐私。匿名化是一种常见的方法,但由于隐私保护和数据效用之间的内在权衡,找到一种有效的策略是具有挑战性的。需要一种结构化的方法来平衡这些目标,并指导选择适当的匿名化策略。方法:在本文中,我们提出了12个匿名化策略应用于德国健康保险索赔数据的系统评估,这些数据以前曾用于药物安全研究。该数据集由1727条记录和45个变量组成。基于结构化的威胁建模,我们比较了保守和基于威胁建模的方法,每种方法使用ARX数据匿名化工具使用六种不同的隐私模型和风险阈值。我们评估一般数据效用和经验评估剩余隐私风险使用匿名框架和成员推理攻击。结果:我们的研究结果表明,保守匿名化确保了强大的隐私保护,但降低了数据效用。相比之下,威胁建模保留了更多的效用,同时仍然在中等阈值下提供可接受的隐私。结论:所提出的过程能够系统地比较隐私-效用权衡,并可适用于其他医疗数据集。我们的研究结果强调了上下文特定的匿名化策略和经验风险评估对指导医疗保健中的匿名数据共享的重要性。
{"title":"Anonymization of Health Insurance Claims Data for Medication Safety Assessments.","authors":"Mehmed Halilovic, Karen Otte, Thierry Meurers, Marco Alibone, Marion Ludwig, Nico Riedel, Steven Wolter, Lisa Kühnel, Steffen Hess, Fabian Prasser","doi":"10.3233/SHTI251407","DOIUrl":"10.3233/SHTI251407","url":null,"abstract":"<p><strong>Introduction: </strong>The re-use of health insurance claims data for research purposes can provide valuable insights to improve patient care. However, as health data is often highly sensitive and subject to strict regulatory frameworks, the privacy of individuals must be protected. Anonymization is a common approach to do so, but finding an effective strategy is challenging due to an inherent trade-off between privacy protection and data utility. A structured approach is needed to balance these objectives and guide the selection of appropriate anonymization strategies.</p><p><strong>Methods: </strong>In this paper, we present a systematic evaluation of twelve anonymization strategies applied to German health insurance claims data that has previously been used in a drug safety study. The dataset consisted of 1727 records and 45 variables. Based on a structured threat modeling, we compare a conservative and a threat modeling-based approach, each with six different privacy models and risk thresholds using the ARX Data Anonymization Tool. We assess general data utility and empirically evaluate residual privacy risks using both the Anonymeter framework and a membership inference attack.</p><p><strong>Results: </strong>Our results show that conservative anonymization ensures strong privacy protection but reduces data utility. In contrast, threat modeling retains more utility while still providing acceptable privacy under moderate thresholds.</p><p><strong>Conclusion: </strong>The proposed process enables a systematic comparison of privacy-utility trade-offs and can be adapted to other medical datasets. Our findings highlight the importance of context-specific anonymization strategies and empirical risk evaluation to guide anonymized data sharing in healthcare.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"283-291"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144985112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Broad Consent to Patient Engagement: A Framework for Consent Management and Study Oversight. 从广泛同意到患者参与:同意管理和研究监督的框架。
Pub Date : 2025-09-03 DOI: 10.3233/SHTI251405
Anne Pelz, Philipp Heinrich, Gabriele Mueller, Anne Seim, Peter Penndorf, Martin Bialke, Martin Sedlmayr, Ines Reinecke, Markus Wolfien, Katja Hoffmann

Introduction: The German Medical Informatics Initiative (MII) promotes the use of routine clinical data for research, supported by the broad consent framework to ensure patient engagement. This work proposes a data management process and reference infrastructure to improve transparency by enabling patients to track their consent history and data use in research.

Methods: We analyzed the data provision process at the University Hospital Dresden (UKD) to identify roles and data flows relevant to secondary data use under broad consent. Established MII tools in use at UKD were evaluated for their suitability in enabling secure data access.

Results: We developed a structured data access process and implemented a reference infrastructure that lays the groundwork for a potential patient-facing application providing secure access to consent and study details.

Conclusion: The reference infrastructure demonstrates how existing MII tools can be repurposed to offer patient-centric transparency in secondary data use. Future work will address scalability, access control, and ethical considerations, such as patient expectations and the clarity of information.

简介:德国医学信息学倡议(MII)促进常规临床数据用于研究,并得到广泛同意框架的支持,以确保患者参与。这项工作提出了一个数据管理过程和参考基础设施,使患者能够跟踪他们的同意历史和数据在研究中的使用,从而提高透明度。方法:我们分析了德累斯顿大学医院(UKD)的数据提供过程,以确定在广泛同意下与二次数据使用相关的角色和数据流。评估了UKD使用的已建立的MII工具在实现安全数据访问方面的适用性。结果:我们开发了一个结构化的数据访问流程,并实施了一个参考基础设施,为潜在的面向患者的应用程序奠定了基础,提供了对同意书和研究细节的安全访问。结论:参考基础设施展示了现有的MII工具如何重新利用,以提供以患者为中心的二级数据使用透明度。未来的工作将涉及可扩展性、访问控制和伦理考虑,如患者期望和信息的清晰度。
{"title":"From Broad Consent to Patient Engagement: A Framework for Consent Management and Study Oversight.","authors":"Anne Pelz, Philipp Heinrich, Gabriele Mueller, Anne Seim, Peter Penndorf, Martin Bialke, Martin Sedlmayr, Ines Reinecke, Markus Wolfien, Katja Hoffmann","doi":"10.3233/SHTI251405","DOIUrl":"10.3233/SHTI251405","url":null,"abstract":"<p><strong>Introduction: </strong>The German Medical Informatics Initiative (MII) promotes the use of routine clinical data for research, supported by the broad consent framework to ensure patient engagement. This work proposes a data management process and reference infrastructure to improve transparency by enabling patients to track their consent history and data use in research.</p><p><strong>Methods: </strong>We analyzed the data provision process at the University Hospital Dresden (UKD) to identify roles and data flows relevant to secondary data use under broad consent. Established MII tools in use at UKD were evaluated for their suitability in enabling secure data access.</p><p><strong>Results: </strong>We developed a structured data access process and implemented a reference infrastructure that lays the groundwork for a potential patient-facing application providing secure access to consent and study details.</p><p><strong>Conclusion: </strong>The reference infrastructure demonstrates how existing MII tools can be repurposed to offer patient-centric transparency in secondary data use. Future work will address scalability, access control, and ethical considerations, such as patient expectations and the clarity of information.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"265-273"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144985102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conception and Development of a Metadata Search Portal Based on the Data Dictionary Minimal Information Model (DDMIM) Specification. 基于数据字典最小信息模型(DDMIM)规范的元数据搜索门户的构想与开发。
Pub Date : 2025-09-03 DOI: 10.3233/SHTI251400
Leslie Diana Wamba Makem, Abishaa Vengadeswaran, Dupleix Achille Takoulegha, Dennis Kadioglu

Introduction: The heterogeneity of metadata continues to be a key challenge in the healthcare sector. The Data Dictionary Minimal Information Model (DDMIM) aims to meet the need for interoperability between different standards and data dictionaries to facilitate the exchange of metadata.

Objective: This paper presents the conception, and the development of a metadata search portal based on the DDMIM specification, designed to improve the discoverability and accessibility of health datasets and enhance interoperability.

Methods: We conducted a literature review of existing metadata repositories to select potentially relevant ones for further work. A mapping was created to transform metadata from different MDRs into the DDMIM format. In parallel, the requirements for a prototype search portal are being evaluated, which integrates metadata from various public repositories.

Results: The results show that a DDMIM-based search portal can effectively integrate heterogeneous metadata sources and improve the finding of health datasets.

Discussion: Such a portal supports the integration of heterogeneous metadata sources and ensures compliance with FAIR principles to optimize the use of health data for research and clinical applications. It is therefore of great importance to address the existing challenges in the field of medical data integration and utilization.

引言:元数据的异构性仍然是医疗保健领域的一个关键挑战。数据字典最小信息模型(DDMIM)旨在满足不同标准和数据字典之间的互操作性需求,以促进元数据的交换。目的:提出了基于DDMIM规范的元数据搜索门户的概念和开发,旨在提高卫生数据集的可发现性和可访问性,增强互操作性。方法:我们对现有元数据存储库进行了文献综述,以选择可能相关的元数据存储库进行进一步的工作。创建了一个映射,将来自不同mdr的元数据转换为DDMIM格式。与此同时,正在评估原型搜索门户的需求,该门户集成了来自各种公共存储库的元数据。结果:基于ddmim的搜索门户能够有效整合异构元数据,提高健康数据集的查找效率。讨论:这种门户支持异构元数据来源的集成,并确保遵守公平原则,以优化用于研究和临床应用的健康数据的使用。因此,解决医疗数据集成与利用领域存在的挑战具有重要意义。
{"title":"Conception and Development of a Metadata Search Portal Based on the Data Dictionary Minimal Information Model (DDMIM) Specification.","authors":"Leslie Diana Wamba Makem, Abishaa Vengadeswaran, Dupleix Achille Takoulegha, Dennis Kadioglu","doi":"10.3233/SHTI251400","DOIUrl":"10.3233/SHTI251400","url":null,"abstract":"<p><strong>Introduction: </strong>The heterogeneity of metadata continues to be a key challenge in the healthcare sector. The Data Dictionary Minimal Information Model (DDMIM) aims to meet the need for interoperability between different standards and data dictionaries to facilitate the exchange of metadata.</p><p><strong>Objective: </strong>This paper presents the conception, and the development of a metadata search portal based on the DDMIM specification, designed to improve the discoverability and accessibility of health datasets and enhance interoperability.</p><p><strong>Methods: </strong>We conducted a literature review of existing metadata repositories to select potentially relevant ones for further work. A mapping was created to transform metadata from different MDRs into the DDMIM format. In parallel, the requirements for a prototype search portal are being evaluated, which integrates metadata from various public repositories.</p><p><strong>Results: </strong>The results show that a DDMIM-based search portal can effectively integrate heterogeneous metadata sources and improve the finding of health datasets.</p><p><strong>Discussion: </strong>Such a portal supports the integration of heterogeneous metadata sources and ensures compliance with FAIR principles to optimize the use of health data for research and clinical applications. It is therefore of great importance to address the existing challenges in the field of medical data integration and utilization.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"228-234"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144985055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PED-DATA: A Privacy-Preserving Framework for Data-Driven, Pediatric Multi-Center Studies. PED-DATA:数据驱动的隐私保护框架,儿科多中心研究。
Pub Date : 2025-09-03 DOI: 10.3233/SHTI251409
Gorkem Yilmaz, Jonathan M Mang, Markus Metzler, Hans-Ulrich Prokosch, Manfred Rauh, Jakob Zierk

Introduction: Data-driven analysis of clinical databases is an efficient method for clinical knowledge generation, which is especially suitable when exceptional ethical and practical restrictions apply, such as in pediatrics. In the multi-center PEDREF 2.0 study, we are analyzing children's laboratory test results, diagnoses, and procedures from more than 20 German tertiary care centers to establish pediatric reference intervals. The PEDREF 2.0 study uses the framework of the German Medical Informatics Initiative, but the specific study needs require the development of a customized module for distributed pediatric analyses.

Methods: We developed the Pediatric Distributed Analysis, Anonymization, and Aggregation Module (PED-DATA), which is a containerized application that we deployed to all participating centers. PED-DATA transforms the input datasets to a harmonized internal representation and enables their decentralized analysis in compliance with data protection rules, resulting in an anonymous output dataset that is transferred for central analysis.

Results: In a preliminary analysis of data from 15 centers, we analyzed 52,807,236 laboratory test results from 753,774 different patients (323,943 to 4,338,317 test results per laboratory test), enabling us to establish pediatric reference intervals with previously unmatched precision.

Conclusion: PED-DATA facilitates the implementation of pediatric data-driven multicenter studies in a decentralized and privacy-respecting manner, and its use throughout German University Hospitals in the PEDREF 2.0 study demonstrates its usefulness in a real-world use case.

临床数据库的数据驱动分析是临床知识生成的一种有效方法,尤其适用于特殊的伦理和实践限制,如儿科。在多中心PEDREF 2.0研究中,我们分析了来自20多家德国三级医疗中心的儿童实验室检测结果、诊断和程序,以建立儿童参考区间。PEDREF 2.0研究使用了德国医学信息学倡议的框架,但具体的研究需要为分布式儿科分析开发一个定制的模块。方法:我们开发了儿科分布式分析、匿名化和聚合模块(PED-DATA),这是一个容器化的应用程序,我们部署到所有参与的中心。PED-DATA将输入数据集转换为统一的内部表示,并使其能够按照数据保护规则进行分散分析,从而产生用于集中分析的匿名输出数据集。结果:在15个中心的初步数据分析中,我们分析了来自753,774名不同患者的52,807,236个实验室检查结果(每个实验室检查结果为323,943至4,338,317),使我们能够以前所未有的精度建立儿科参考区间。结论:PED-DATA以分散和尊重隐私的方式促进了儿科数据驱动的多中心研究的实施,在PEDREF 2.0研究中,它在德国大学医院的使用证明了它在现实世界用例中的有用性。
{"title":"PED-DATA: A Privacy-Preserving Framework for Data-Driven, Pediatric Multi-Center Studies.","authors":"Gorkem Yilmaz, Jonathan M Mang, Markus Metzler, Hans-Ulrich Prokosch, Manfred Rauh, Jakob Zierk","doi":"10.3233/SHTI251409","DOIUrl":"10.3233/SHTI251409","url":null,"abstract":"<p><strong>Introduction: </strong>Data-driven analysis of clinical databases is an efficient method for clinical knowledge generation, which is especially suitable when exceptional ethical and practical restrictions apply, such as in pediatrics. In the multi-center PEDREF 2.0 study, we are analyzing children's laboratory test results, diagnoses, and procedures from more than 20 German tertiary care centers to establish pediatric reference intervals. The PEDREF 2.0 study uses the framework of the German Medical Informatics Initiative, but the specific study needs require the development of a customized module for distributed pediatric analyses.</p><p><strong>Methods: </strong>We developed the Pediatric Distributed Analysis, Anonymization, and Aggregation Module (PED-DATA), which is a containerized application that we deployed to all participating centers. PED-DATA transforms the input datasets to a harmonized internal representation and enables their decentralized analysis in compliance with data protection rules, resulting in an anonymous output dataset that is transferred for central analysis.</p><p><strong>Results: </strong>In a preliminary analysis of data from 15 centers, we analyzed 52,807,236 laboratory test results from 753,774 different patients (323,943 to 4,338,317 test results per laboratory test), enabling us to establish pediatric reference intervals with previously unmatched precision.</p><p><strong>Conclusion: </strong>PED-DATA facilitates the implementation of pediatric data-driven multicenter studies in a decentralized and privacy-respecting manner, and its use throughout German University Hospitals in the PEDREF 2.0 study demonstrates its usefulness in a real-world use case.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"307-317"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144985129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Studies in health technology and informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1