首页 > 最新文献

Methods of Information in Medicine最新文献

英文 中文
A Systematic Approach to Configuring MetaMap for Optimal Performance. 配置元地图以获得最佳性能的系统方法。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 DOI: 10.1055/a-1862-0421
Xia Jing, Akash Indani, Nina Hubig, Hua Min, Yang Gong, James J Cimino, Dean F Sittig, Lior Rennert, David Robinson, Paul Biondich, Adam Wright, Christian Nøhr, Timothy Law, Arild Faxvaag, Ronald Gimbel

Background: MetaMap is a valuable tool for processing biomedical texts to identify concepts. Although MetaMap is highly configurative, configuration decisions are not straightforward.

Objective: To develop a systematic, data-driven methodology for configuring MetaMap for optimal performance.

Methods: MetaMap, the word2vec model, and the phrase model were used to build a pipeline. For unsupervised training, the phrase and word2vec models used abstracts related to clinical decision support as input. During testing, MetaMap was configured with the default option, one behavior option, and two behavior options. For each configuration, cosine and soft cosine similarity scores between identified entities and gold-standard terms were computed for 40 annotated abstracts (422 sentences). The similarity scores were used to calculate and compare the overall percentages of exact matches, similar matches, and missing gold-standard terms among the abstracts for each configuration. The results were manually spot-checked. The precision, recall, and F-measure (β =1) were calculated.

Results: The percentages of exact matches and missing gold-standard terms were 0.6-0.79 and 0.09-0.3 for one behavior option, and 0.56-0.8 and 0.09-0.3 for two behavior options, respectively. The percentages of exact matches and missing terms for soft cosine similarity scores exceeded those for cosine similarity scores. The average precision, recall, and F-measure were 0.59, 0.82, and 0.68 for exact matches, and 1.00, 0.53, and 0.69 for missing terms, respectively.

Conclusion: We demonstrated a systematic approach that provides objective and accurate evidence guiding MetaMap configurations for optimizing performance. Combining objective evidence and the current practice of using principles, experience, and intuitions outperforms a single strategy in MetaMap configurations. Our methodology, reference codes, measurements, results, and workflow are valuable references for optimizing and configuring MetaMap.

背景:MetaMap是处理生物医学文本以识别概念的宝贵工具。尽管MetaMap是高度可配置的,但配置决策并不简单。目的:开发一种系统的、数据驱动的方法来配置元地图以获得最佳性能。方法:采用MetaMap、word2vec模型和短语模型构建管道。对于无监督训练,短语和word2vec模型使用与临床决策支持相关的摘要作为输入。在测试期间,MetaMap配置了默认选项、一个行为选项和两个行为选项。对于每种配置,计算了40个注释摘要(422个句子)的识别实体与金标准术语之间的余弦和软余弦相似度得分。相似性分数用于计算和比较每个配置的摘要中精确匹配、相似匹配和缺失金标准术语的总体百分比。结果是手工抽查的。计算精密度、召回率和f测量值(β =1)。结果:一个行为选项的精确匹配和缺失金标准项的百分比分别为0.6-0.79和0.09-0.3,两个行为选项的精确匹配和缺失金标准项的百分比分别为0.56-0.8和0.09-0.3。软余弦相似度分数的精确匹配和缺失项的百分比超过了余弦相似度分数。准确匹配的平均精密度、召回率和F-measure分别为0.59、0.82和0.68,缺失项分别为1.00、0.53和0.69。结论:我们展示了一种系统的方法,提供了客观和准确的证据来指导MetaMap配置以优化性能。将客观证据与使用原则、经验和直觉的当前实践相结合,在MetaMap配置中优于单一策略。我们的方法、参考代码、测量、结果和工作流程是优化和配置MetaMap的宝贵参考。
{"title":"A Systematic Approach to Configuring MetaMap for Optimal Performance.","authors":"Xia Jing,&nbsp;Akash Indani,&nbsp;Nina Hubig,&nbsp;Hua Min,&nbsp;Yang Gong,&nbsp;James J Cimino,&nbsp;Dean F Sittig,&nbsp;Lior Rennert,&nbsp;David Robinson,&nbsp;Paul Biondich,&nbsp;Adam Wright,&nbsp;Christian Nøhr,&nbsp;Timothy Law,&nbsp;Arild Faxvaag,&nbsp;Ronald Gimbel","doi":"10.1055/a-1862-0421","DOIUrl":"https://doi.org/10.1055/a-1862-0421","url":null,"abstract":"<p><strong>Background: </strong>MetaMap is a valuable tool for processing biomedical texts to identify concepts. Although MetaMap is highly configurative, configuration decisions are not straightforward.</p><p><strong>Objective: </strong>To develop a systematic, data-driven methodology for configuring MetaMap for optimal performance.</p><p><strong>Methods: </strong>MetaMap, the word2vec model, and the phrase model were used to build a pipeline. For unsupervised training, the phrase and word2vec models used abstracts related to clinical decision support as input. During testing, MetaMap was configured with the default option, one behavior option, and two behavior options. For each configuration, cosine and soft cosine similarity scores between identified entities and gold-standard terms were computed for 40 annotated abstracts (422 sentences). The similarity scores were used to calculate and compare the overall percentages of exact matches, similar matches, and missing gold-standard terms among the abstracts for each configuration. The results were manually spot-checked. The precision, recall, and F-measure (<i>β</i> =1) were calculated.</p><p><strong>Results: </strong>The percentages of exact matches and missing gold-standard terms were 0.6-0.79 and 0.09-0.3 for one behavior option, and 0.56-0.8 and 0.09-0.3 for two behavior options, respectively. The percentages of exact matches and missing terms for soft cosine similarity scores exceeded those for cosine similarity scores. The average precision, recall, and F-measure were 0.59, 0.82, and 0.68 for exact matches, and 1.00, 0.53, and 0.69 for missing terms, respectively.</p><p><strong>Conclusion: </strong>We demonstrated a systematic approach that provides objective and accurate evidence guiding MetaMap configurations for optimizing performance. Combining objective evidence and the current practice of using principles, experience, and intuitions outperforms a single strategy in MetaMap configurations. Our methodology, reference codes, measurements, results, and workflow are valuable references for optimizing and configuring MetaMap.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 02","pages":"e51-e63"},"PeriodicalIF":1.7,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/95/91/10-1055-a-1862-0421.PMC9788913.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9809549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Maturity Level of Digital Reproductive, Maternal, Newborn, and Child Health Initiatives in Jordan and Palestine. 约旦和巴勒斯坦数字生殖、孕产妇、新生儿和儿童健康倡议的成熟度。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 DOI: 10.1055/s-0042-1756651
Mohammad S Alyahya, Niveen M E Abu-Rmeileh, Yousef S Khader, Maysaa Nemer, Nihaya A Al-Sheyab, Alexandrine Pirlot de Corbion, Laura Lazaro Cabrera, Sundeep Sahay

Background: While there is a rapid increase in digital health initiatives focusing on the processing of personal data for strengthening the delivery of reproductive, maternal, newborn, and child health (RMNCH) services in fragile settings, these are often unaccompanied at both the policy and operational levels with adequate legal and regulatory frameworks.

Objective: The main aim was to understand the maturity level of digital personal data initiatives for RMNCH services within fragile contexts. This aim was performed by choosing digital health initiatives from each country (two in Jordan and three in Palestine) based on RMNCH.

Methods: A qualitative study design was adopted. We developed a digital maturity assessment tool assessing two maturity levels: the information and communications technology digital infrastructure, and data governance and interoperability in place for the five selected RMNCH initiatives in Jordan and Palestine.

Results: Overall, the digital infrastructure and technological readiness components are more advanced and show higher maturity levels compared with data governance and interoperability components in Jordan and Palestine. In Jordan, the overall Jordan stillbirths and neonatal deaths surveillance initiative maturity indicators are somehow less advanced than those of the Electronic Maternal and Child Health Handbook-Jordan (EMCH-J) application. In Palestine, the Electronic Maternal and Child Health-registry initiative maturity indicators are more advanced than both Avicenna and EMCH-Palestine initiatives.

Conclusion: The findings highlighted several challenges and opportunities around the application and implementation of selected digital health initiatives in the provision of RMNCH in Jordan and Palestine. Our findings shed lights on the maturity level of these initiatives within fragile contexts. The maturity level of the five RMNCH initiatives in both countries is inadequate and requires further advancement before they can be scaled up and scaled out. Taking the World Health Organization recommendations into account when developing, implementing, and scaling digital health initiatives in low- and middle-income countries can result in successful and sustainable initiatives, thus meeting health needs and improving the quality of health care received by individuals especially those living in fragile contexts.

背景:虽然侧重于处理个人数据以加强在脆弱环境中提供生殖、孕产妇、新生儿和儿童健康服务的数字卫生举措迅速增加,但这些举措在政策和业务层面往往缺乏适当的法律和监管框架。目的:主要目的是了解在脆弱环境中RMNCH服务的数字个人数据计划的成熟度。实现这一目标的方法是根据RMNCH从每个国家(约旦两个,巴勒斯坦三个)选择数字卫生倡议。方法:采用定性研究设计。我们开发了一个数字成熟度评估工具,评估两个成熟度级别:信息和通信技术数字基础设施,以及约旦和巴勒斯坦五个选定的RMNCH计划的数据治理和互操作性。结果:总体而言,与约旦和巴勒斯坦的数据治理和互操作性组件相比,约旦和巴勒斯坦的数字基础设施和技术就绪组件更先进,成熟度更高。在约旦,约旦死产和新生儿死亡监测倡议的总体成熟度指标在某种程度上不如《约旦妇幼保健电子手册》(EMCH-J)应用程序的指标先进。在巴勒斯坦,电子妇幼保健登记倡议的成熟度指标比阿维森纳倡议和巴勒斯坦emhs倡议都先进。结论:调查结果突出了在约旦和巴勒斯坦提供RMNCH时,围绕应用和实施选定的数字卫生倡议所面临的若干挑战和机遇。我们的研究结果揭示了这些举措在脆弱环境中的成熟度。两国的五项母婴健康合作倡议成熟度不足,需要进一步推进,才能扩大和扩大规模。在低收入和中等收入国家制定、实施和扩大数字卫生举措时考虑到世界卫生组织的建议,可促成成功和可持续的举措,从而满足个人,特别是生活在脆弱环境中的个人的卫生需求并提高其获得的卫生保健质量。
{"title":"Maturity Level of Digital Reproductive, Maternal, Newborn, and Child Health Initiatives in Jordan and Palestine.","authors":"Mohammad S Alyahya,&nbsp;Niveen M E Abu-Rmeileh,&nbsp;Yousef S Khader,&nbsp;Maysaa Nemer,&nbsp;Nihaya A Al-Sheyab,&nbsp;Alexandrine Pirlot de Corbion,&nbsp;Laura Lazaro Cabrera,&nbsp;Sundeep Sahay","doi":"10.1055/s-0042-1756651","DOIUrl":"https://doi.org/10.1055/s-0042-1756651","url":null,"abstract":"<p><strong>Background: </strong>While there is a rapid increase in digital health initiatives focusing on the processing of personal data for strengthening the delivery of reproductive, maternal, newborn, and child health (RMNCH) services in fragile settings, these are often unaccompanied at both the policy and operational levels with adequate legal and regulatory frameworks.</p><p><strong>Objective: </strong>The main aim was to understand the maturity level of digital personal data initiatives for RMNCH services within fragile contexts. This aim was performed by choosing digital health initiatives from each country (two in Jordan and three in Palestine) based on RMNCH.</p><p><strong>Methods: </strong>A qualitative study design was adopted. We developed a digital maturity assessment tool assessing two maturity levels: the information and communications technology digital infrastructure, and data governance and interoperability in place for the five selected RMNCH initiatives in Jordan and Palestine.</p><p><strong>Results: </strong>Overall, the digital infrastructure and technological readiness components are more advanced and show higher maturity levels compared with data governance and interoperability components in Jordan and Palestine. In Jordan, the overall Jordan stillbirths and neonatal deaths surveillance initiative maturity indicators are somehow less advanced than those of the Electronic Maternal and Child Health Handbook-Jordan (EMCH-J) application. In Palestine, the Electronic Maternal and Child Health-registry initiative maturity indicators are more advanced than both Avicenna and EMCH-Palestine initiatives.</p><p><strong>Conclusion: </strong>The findings highlighted several challenges and opportunities around the application and implementation of selected digital health initiatives in the provision of RMNCH in Jordan and Palestine. Our findings shed lights on the maturity level of these initiatives within fragile contexts. The maturity level of the five RMNCH initiatives in both countries is inadequate and requires further advancement before they can be scaled up and scaled out. Taking the World Health Organization recommendations into account when developing, implementing, and scaling digital health initiatives in low- and middle-income countries can result in successful and sustainable initiatives, thus meeting health needs and improving the quality of health care received by individuals especially those living in fragile contexts.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 5-06","pages":"139-154"},"PeriodicalIF":1.7,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9609024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TransformEHRs: a flexible methodology for building transparent ETL processes for EHR reuse. transformhhr:一种灵活的方法,用于为EHR重用构建透明的ETL过程。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 DOI: 10.1055/s-0042-1757763
Miguel Pedrera-Jiménez, Noelia García-Barrio, Paula Rubio-Mayo, Alberto Tato-Gómez, Juan Luis Cruz-Bermúdez, José Luis Bernal-Sobrino, Adolfo Muñoz-Carrero, Pablo Serrano-Balazote

Background: During the COVID-19 pandemic, several methodologies were designed for obtaining electronic health record (EHR)-derived datasets for research. These processes are often based on black boxes, on which clinical researchers are unaware of how the data were recorded, extracted, and transformed. In order to solve this, it is essential that extract, transform, and load (ETL) processes are based on transparent, homogeneous, and formal methodologies, making them understandable, reproducible, and auditable.

Objectives: This study aims to design and implement a methodology, according with FAIR Principles, for building ETL processes (focused on data extraction, selection, and transformation) for EHR reuse in a transparent and flexible manner, applicable to any clinical condition and health care organization.

Methods: The proposed methodology comprises four stages: (1) analysis of secondary use models and identification of data operations, based on internationally used clinical repositories, case report forms, and aggregated datasets; (2) modeling and formalization of data operations, through the paradigm of the Detailed Clinical Models; (3) agnostic development of data operations, selecting SQL and R as programming languages; and (4) automation of the ETL instantiation, building a formal configuration file with XML.

Results: First, four international projects were analyzed to identify 17 operations, necessary to obtain datasets according to the specifications of these projects from the EHR. With this, each of the data operations was formalized, using the ISO 13606 reference model, specifying the valid data types as arguments, inputs and outputs, and their cardinality. Then, an agnostic catalog of data was developed through data-oriented programming languages previously selected. Finally, an automated ETL instantiation process was built from an ETL configuration file formally defined.

Conclusions: This study has provided a transparent and flexible solution to the difficulty of making the processes for obtaining EHR-derived data for secondary use understandable, auditable, and reproducible. Moreover, the abstraction carried out in this study means that any previous EHR reuse methodology can incorporate these results into them.

背景:在2019冠状病毒病大流行期间,设计了几种方法来获取用于研究的电子健康记录(EHR)衍生数据集。这些过程通常基于黑箱,临床研究人员不知道数据是如何记录、提取和转换的。为了解决这个问题,提取、转换和加载(ETL)过程必须基于透明、同质和形式化的方法,使它们易于理解、可重复和可审计。目的:本研究旨在设计和实施一种方法,根据公平原则,以透明和灵活的方式构建电子病历重用的ETL流程(重点是数据提取、选择和转换),适用于任何临床条件和卫生保健组织。方法:提出的方法包括四个阶段:(1)基于国际通用的临床知识库、病例报告表格和汇总数据集,分析二次使用模型和识别数据操作;(2)通过《详细临床模型》范式对数据操作进行建模和形式化;(3)数据操作的不可知论开发,选择SQL和R作为编程语言;(4)自动化ETL实例化,用XML构建正式的配置文件。结果:首先,对4个国际项目进行了分析,确定了17项操作,需要根据这些项目的规范从EHR中获取数据集。这样,使用ISO 13606参考模型对每个数据操作进行了形式化,指定了有效的数据类型作为参数、输入和输出,以及它们的基数。然后,通过先前选择的面向数据的编程语言开发了一个不可知的数据目录。最后,从正式定义的ETL配置文件构建了一个自动化的ETL实例化过程。结论:本研究提供了一个透明和灵活的解决方案,使获取ehr衍生数据用于二次使用的过程易于理解,可审计和可重复。此外,本研究中进行的抽象意味着任何以前的EHR重用方法都可以将这些结果纳入其中。
{"title":"TransformEHRs: a flexible methodology for building transparent ETL processes for EHR reuse.","authors":"Miguel Pedrera-Jiménez,&nbsp;Noelia García-Barrio,&nbsp;Paula Rubio-Mayo,&nbsp;Alberto Tato-Gómez,&nbsp;Juan Luis Cruz-Bermúdez,&nbsp;José Luis Bernal-Sobrino,&nbsp;Adolfo Muñoz-Carrero,&nbsp;Pablo Serrano-Balazote","doi":"10.1055/s-0042-1757763","DOIUrl":"https://doi.org/10.1055/s-0042-1757763","url":null,"abstract":"<p><strong>Background: </strong>During the COVID-19 pandemic, several methodologies were designed for obtaining electronic health record (EHR)-derived datasets for research. These processes are often based on black boxes, on which clinical researchers are unaware of how the data were recorded, extracted, and transformed. In order to solve this, it is essential that extract, transform, and load (ETL) processes are based on transparent, homogeneous, and formal methodologies, making them understandable, reproducible, and auditable.</p><p><strong>Objectives: </strong>This study aims to design and implement a methodology, according with FAIR Principles, for building ETL processes (focused on data extraction, selection, and transformation) for EHR reuse in a transparent and flexible manner, applicable to any clinical condition and health care organization.</p><p><strong>Methods: </strong>The proposed methodology comprises four stages: (1) analysis of secondary use models and identification of data operations, based on internationally used clinical repositories, case report forms, and aggregated datasets; (2) modeling and formalization of data operations, through the paradigm of the Detailed Clinical Models; (3) agnostic development of data operations, selecting SQL and R as programming languages; and (4) automation of the ETL instantiation, building a formal configuration file with XML.</p><p><strong>Results: </strong>First, four international projects were analyzed to identify 17 operations, necessary to obtain datasets according to the specifications of these projects from the EHR. With this, each of the data operations was formalized, using the ISO 13606 reference model, specifying the valid data types as arguments, inputs and outputs, and their cardinality. Then, an agnostic catalog of data was developed through data-oriented programming languages previously selected. Finally, an automated ETL instantiation process was built from an ETL configuration file formally defined.</p><p><strong>Conclusions: </strong>This study has provided a transparent and flexible solution to the difficulty of making the processes for obtaining EHR-derived data for secondary use understandable, auditable, and reproducible. Moreover, the abstraction carried out in this study means that any previous EHR reuse methodology can incorporate these results into them.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 02","pages":"e89-e102"},"PeriodicalIF":1.7,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/54/b2/10-1055-s-0042-1757763.PMC9788916.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9253954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The Leipzig Health Atlas-An Open Platform to Present, Archive, and Share Biomedical Data, Analyses, and Models Online. 莱比锡健康地图集-一个开放的平台,以呈现,存档和共享生物医学数据,分析和模型在线。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 DOI: 10.1055/a-1914-1985
Toralf Kirsten, Frank Meineke, Henry Löffler-Wirth, Alexandr Uciteli, Christoph Beger, Sebastian Stäubert, Matthias Löbe, Rene Hänsel, Franziska G Rauscher, Judith Christina Schuster, Thomas Peschel, Heinrich Herre, Jonas Wagner, Silke Zachariae, Christoph Engel, Markus Scholz, Erhard Rahm, Hans Binder, Markus Löffler

Background: Clinical trials, epidemiological studies, clinical registries, and other prospective research projects, together with patient care services, are main sources of data in the medical research domain. They serve often as a basis for secondary research in evidence-based medicine, prediction models for disease, and its progression. This data are often neither sufficiently described nor accessible. Related models are often not accessible as a functional program tool for interested users from the health care and biomedical domains.

Objective: The interdisciplinary project Leipzig Health Atlas (LHA) was developed to close this gap. LHA is an online platform that serves as a sustainable archive providing medical data, metadata, models, and novel phenotypes from clinical trials, epidemiological studies, and other medical research projects.

Methods: Data, models, and phenotypes are described by semantically rich metadata. The platform prefers to share data and models presented in original publications but is also open for nonpublished data. LHA provides and associates unique permanent identifiers for each dataset and model. Hence, the platform can be used to share prepared, quality-assured datasets and models while they are referenced in publications. All managed data, models, and phenotypes in LHA follow the FAIR principles, with public availability or restricted access for specific user groups.

Results: The LHA platform is in productive mode (https://www.health-atlas.de/). It is already used by a variety of clinical trial and research groups and is becoming increasingly popular also in the biomedical community. LHA is an integral part of the forthcoming initiative building a national research data infrastructure for health in Germany.

背景:临床试验、流行病学研究、临床登记和其他前瞻性研究项目,以及患者护理服务,是医学研究领域的主要数据来源。它们通常作为循证医学、疾病预测模型及其进展的二次研究的基础。这些数据往往没有得到充分的描述,也无法获得。对于医疗保健和生物医学领域感兴趣的用户来说,相关模型通常不能作为功能程序工具访问。目的:跨学科项目莱比锡健康地图集(LHA)的开发是为了缩小这一差距。LHA是一个在线平台,作为一个可持续的档案,提供来自临床试验、流行病学研究和其他医学研究项目的医疗数据、元数据、模型和新表型。方法:数据、模型和表型由语义丰富的元数据描述。该平台更倾向于共享原始出版物中的数据和模型,但也对未发表的数据开放。LHA为每个数据集和模型提供并关联唯一的永久标识符。因此,该平台可用于在出版物中引用时共享准备好的、有质量保证的数据集和模型。LHA中的所有管理数据、模型和表型都遵循FAIR原则,对特定用户组具有公开可用性或限制访问。结果:LHA平台已进入生产模式(https://www.health-atlas.de/)。它已经被各种临床试验和研究小组使用,并且在生物医学界也越来越受欢迎。LHA是即将在德国建立国家卫生研究数据基础设施的倡议的一个组成部分。
{"title":"The Leipzig Health Atlas-An Open Platform to Present, Archive, and Share Biomedical Data, Analyses, and Models Online.","authors":"Toralf Kirsten,&nbsp;Frank Meineke,&nbsp;Henry Löffler-Wirth,&nbsp;Alexandr Uciteli,&nbsp;Christoph Beger,&nbsp;Sebastian Stäubert,&nbsp;Matthias Löbe,&nbsp;Rene Hänsel,&nbsp;Franziska G Rauscher,&nbsp;Judith Christina Schuster,&nbsp;Thomas Peschel,&nbsp;Heinrich Herre,&nbsp;Jonas Wagner,&nbsp;Silke Zachariae,&nbsp;Christoph Engel,&nbsp;Markus Scholz,&nbsp;Erhard Rahm,&nbsp;Hans Binder,&nbsp;Markus Löffler","doi":"10.1055/a-1914-1985","DOIUrl":"https://doi.org/10.1055/a-1914-1985","url":null,"abstract":"<p><strong>Background: </strong>Clinical trials, epidemiological studies, clinical registries, and other prospective research projects, together with patient care services, are main sources of data in the medical research domain. They serve often as a basis for secondary research in evidence-based medicine, prediction models for disease, and its progression. This data are often neither sufficiently described nor accessible. Related models are often not accessible as a functional program tool for interested users from the health care and biomedical domains.</p><p><strong>Objective: </strong>The interdisciplinary project Leipzig Health Atlas (LHA) was developed to close this gap. LHA is an online platform that serves as a sustainable archive providing medical data, metadata, models, and novel phenotypes from clinical trials, epidemiological studies, and other medical research projects.</p><p><strong>Methods: </strong>Data, models, and phenotypes are described by semantically rich metadata. The platform prefers to share data and models presented in original publications but is also open for nonpublished data. LHA provides and associates unique permanent identifiers for each dataset and model. Hence, the platform can be used to share prepared, quality-assured datasets and models while they are referenced in publications. All managed data, models, and phenotypes in LHA follow the FAIR principles, with public availability or restricted access for specific user groups.</p><p><strong>Results: </strong>The LHA platform is in productive mode (https://www.health-atlas.de/). It is already used by a variety of clinical trial and research groups and is becoming increasingly popular also in the biomedical community. LHA is an integral part of the forthcoming initiative building a national research data infrastructure for health in Germany.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 02","pages":"e103-e115"},"PeriodicalIF":1.7,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/bf/4e/10-1055-a-1914-1985.PMC9788914.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9308245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
One Digital Health for more FAIRness. 一个数字健康更公平。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 DOI: 10.1055/a-1938-0533
Oscar Tamburis, Arriel Benis

Background: One Digital Health (ODH) aims to propose a framework that merges One Health's and Digital Health's specific features into an innovative landscape. FAIR (Findable, Accessible, Interoperable, and Reusable) principles consider applications and computational agents (or, in other terms, data, metadata, and infrastructures) as stakeholders with the capacity to find, access, interoperate, and reuse data with none or minimal human intervention.

Objectives: This paper aims to elicit how the ODH framework is compliant with FAIR principles and metrics, providing some thinking guide to investigate and define whether adapted metrics need to be figured out for an effective ODH Intervention setup.

Methods: An integrative analysis of the literature was conducted to extract instances of the need-or of the eventual already existing deployment-of FAIR principles, for each of the three layers (keys, perspectives and dimensions) of the ODH framework. The scope was to assess the extent of scatteredness in pursuing the many facets of FAIRness, descending from the lack of a unifying and balanced framework.

Results: A first attempt to interpret the different technological components existing in the different layers of the ODH framework, in the light of the FAIR principles, was conducted. Although the mature and working examples of workflows for data FAIRification processes currently retrievable in the literature provided a robust ground to work on, a nonsuitable capacity to fully assess FAIR aspects for highly interconnected scenarios, which the ODH-based ones are, has emerged. Rooms for improvement are anyway possible to timely deal with all the underlying features of topics like the delivery of health care in a syndemic scenario, the digital transformation of human and animal health data, or the digital nature conservation through digital technology-based intervention.

Conclusions: ODH pillars account for the availability (findability, accessibility) of human, animal, and environmental data allowing a unified understanding of complex interactions (interoperability) over time (reusability). A vision of integration between these two worlds, under the vest of ODH Interventions featuring FAIRness characteristics, toward the development of a systemic lookup of health and ecology in a digitalized way, is therefore auspicable.

背景:One Digital Health (ODH)旨在提出一个框架,将One Health和Digital Health的具体功能合并到一个创新的景观中。FAIR(可查找、可访问、可互操作和可重用)原则将应用程序和计算代理(或者,换句话说,数据、元数据和基础设施)视为具有查找、访问、互操作和重用数据的能力的涉众,无需或最少的人为干预。目的:本文旨在引出ODH框架如何符合FAIR原则和指标,为调查和定义是否需要为有效的ODH干预设置制定适应指标提供一些思路指导。方法:对文献进行了综合分析,以提取对公平原则的需求或最终已经存在的部署的实例,用于ODH框架的三个层面(关键、视角和维度)。其范围是评估由于缺乏统一和平衡的框架,在追求公平的许多方面分散的程度。结果:根据公平原则,首次尝试解释存在于ODH框架不同层中的不同技术组件。虽然目前在文献中可检索到的数据公平流程的成熟和工作示例为工作提供了坚实的基础,但对于高度互联的场景(基于odh的场景),已经出现了不适合全面评估公平方面的能力。无论如何,改进的空间是可以及时处理各种主题的所有潜在特征的,例如在疾病情况下提供医疗保健,人类和动物健康数据的数字化转换,或通过基于数字技术的干预进行数字自然保护。结论:ODH支柱解释了人类、动物和环境数据的可用性(可查找性、可访问性),允许对复杂交互(互操作性)的统一理解(可重用性)。因此,在以公平为特征的ODH干预措施的支持下,将这两个世界整合起来,以数字化的方式对健康和生态进行系统的查找,这是值得期待的。
{"title":"One Digital Health for more FAIRness.","authors":"Oscar Tamburis,&nbsp;Arriel Benis","doi":"10.1055/a-1938-0533","DOIUrl":"https://doi.org/10.1055/a-1938-0533","url":null,"abstract":"<p><strong>Background: </strong>One Digital Health (ODH) aims to propose a framework that merges One Health's and Digital Health's specific features into an innovative landscape. FAIR (Findable, Accessible, Interoperable, and Reusable) principles consider applications and computational agents (or, in other terms, data, metadata, and infrastructures) as stakeholders with the capacity to find, access, interoperate, and reuse data with none or minimal human intervention.</p><p><strong>Objectives: </strong>This paper aims to elicit how the ODH framework is compliant with FAIR principles and metrics, providing some thinking guide to investigate and define whether adapted metrics need to be figured out for an effective ODH Intervention setup.</p><p><strong>Methods: </strong>An integrative analysis of the literature was conducted to extract instances of the need-or of the eventual already existing deployment-of FAIR principles, for each of the three layers (keys, perspectives and dimensions) of the ODH framework. The scope was to assess the extent of scatteredness in pursuing the many facets of FAIRness, descending from the lack of a unifying and balanced framework.</p><p><strong>Results: </strong>A first attempt to interpret the different technological components existing in the different layers of the ODH framework, in the light of the FAIR principles, was conducted. Although the mature and working examples of workflows for data FAIRification processes currently retrievable in the literature provided a robust ground to work on, a nonsuitable capacity to fully assess FAIR aspects for highly interconnected scenarios, which the ODH-based ones are, has emerged. Rooms for improvement are anyway possible to timely deal with all the underlying features of topics like the delivery of health care in a syndemic scenario, the digital transformation of human and animal health data, or the digital nature conservation through digital technology-based intervention.</p><p><strong>Conclusions: </strong>ODH pillars account for the availability (findability, accessibility) of human, animal, and environmental data allowing a unified understanding of complex interactions (interoperability) over time (reusability). A vision of integration between these two worlds, under the vest of ODH Interventions featuring FAIRness characteristics, toward the development of a systemic lookup of health and ecology in a digitalized way, is therefore auspicable.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 02","pages":"e116-e124"},"PeriodicalIF":1.7,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/85/69/10-1055-a-1938-0533.PMC9788917.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9254115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
FAIR Aspects of a Health Information Protection and Management System. 健康信息保护和管理系统的公平方面。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 DOI: 10.1055/s-0042-1758765
Jaime Delgado, Silvia Llorente

Background: Privacy management is a key issue when dealing with storage and distribution of health information. However, FAIR (Findability, Accessibility, Interoperability, and Reusability) principles when sharing information are in increasing demand in several organizations, especially for information generated in public-funded research projects.

Objectives: The two main objectives of the presented work are the definition of a secure and interoperable modular architecture to manage different kinds of medical content (xIPAMS [x, for Any kind of content, Information Protection And Management System] and HIPAMS [Health Information Protection And Management System]), and the application of FAIR principles to that architecture in such a way that privacy and security are compatible with FAIR.

Methods: We propose the concept of xIPAMS as a modular architecture, following standards for interoperability, which defines mechanisms for privacy, protection, storage, search, and access to health-related information.

Results: xIPAMS provides FAIR principles and preserves patient's privacy. For each module, we identify how FAIR principles apply.

Conclusions: We have analyzed how xIPAMS, and in particular HIPAMS (Health content), support the FAIR principles focusing on security and privacy. We have identified the FAIR principles supported by the different xIPAMS modules, concluding that the four principles are supported. Our analysis has also considered a possible implementation based on the concept of DACS (Document Access and Communication System), a system storing medical documents in a private and secure way. In addition, we have analyzed security aspects of the FAIRification process and how they are provided by xIPAMS modules.

背景:隐私管理是处理健康信息存储和分发时的一个关键问题。然而,在一些组织中,共享信息时对FAIR(可查找性、可访问性、互操作性和可重用性)原则的需求越来越大,特别是对于公共资助的研究项目中生成的信息。目标:提出的工作的两个主要目标是定义一个安全的、可互操作的模块化架构来管理不同类型的医疗内容(xIPAMS [x,用于任何类型的内容、信息保护和管理系统]和HIPAMS[健康信息保护和管理系统]),以及将FAIR原则应用于该架构,从而使隐私和安全与FAIR兼容。方法:我们提出了xIPAMS作为模块化架构的概念,遵循互操作性标准,定义了隐私、保护、存储、搜索和访问健康相关信息的机制。结果:xIPAMS遵循公平原则,保护患者隐私。对于每个模块,我们确定公平原则如何适用。结论:我们分析了xIPAMS,特别是HIPAMS(健康内容)如何支持以安全和隐私为重点的FAIR原则。我们已经确定了不同xIPAMS模块支持的FAIR原则,得出的结论是支持这四项原则。我们的分析还考虑了一种基于DACS(文档访问和通信系统)概念的可能实现,DACS是一种以私有和安全的方式存储医疗文档的系统。此外,我们还分析了标准化过程的安全方面以及xIPAMS模块如何提供这些方面。
{"title":"FAIR Aspects of a Health Information Protection and Management System.","authors":"Jaime Delgado,&nbsp;Silvia Llorente","doi":"10.1055/s-0042-1758765","DOIUrl":"https://doi.org/10.1055/s-0042-1758765","url":null,"abstract":"<p><strong>Background: </strong>Privacy management is a key issue when dealing with storage and distribution of health information. However, FAIR (Findability, Accessibility, Interoperability, and Reusability) principles when sharing information are in increasing demand in several organizations, especially for information generated in public-funded research projects.</p><p><strong>Objectives: </strong>The two main objectives of the presented work are the definition of a secure and interoperable modular architecture to manage different kinds of medical content (xIPAMS [x, for Any kind of content, Information Protection And Management System] and HIPAMS [Health Information Protection And Management System]), and the application of FAIR principles to that architecture in such a way that privacy and security are compatible with FAIR.</p><p><strong>Methods: </strong>We propose the concept of xIPAMS as a modular architecture, following standards for interoperability, which defines mechanisms for privacy, protection, storage, search, and access to health-related information.</p><p><strong>Results: </strong>xIPAMS provides FAIR principles and preserves patient's privacy. For each module, we identify how FAIR principles apply.</p><p><strong>Conclusions: </strong>We have analyzed how xIPAMS, and in particular HIPAMS (Health content), support the FAIR principles focusing on security and privacy. We have identified the FAIR principles supported by the different xIPAMS modules, concluding that the four principles are supported. Our analysis has also considered a possible implementation based on the concept of DACS (Document Access and Communication System), a system storing medical documents in a private and secure way. In addition, we have analyzed security aspects of the FAIRification process and how they are provided by xIPAMS modules.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 02","pages":"e172-e182"},"PeriodicalIF":1.7,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/99/13/10-1055-s-0042-1758765.PMC9788908.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9609042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multivariate Sequential Analytics for Cardiovascular Disease Event Prediction. 用于心血管疾病事件预测的多变量序列分析。
IF 1.3 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 Epub Date: 2022-12-23 DOI: 10.1055/s-0042-1758687
William Hsu, Jim Warren, Patricia Riddle

Background: Automated clinical decision support for risk assessment is a powerful tool in combating cardiovascular disease (CVD), enabling targeted early intervention that could avoid issues of overtreatment or undertreatment. However, current CVD risk prediction models use observations at baseline without explicitly representing patient history as a time series.

Objective: The aim of this study is to examine whether by explicitly modelling the temporal dimension of patient history event prediction may be improved.

Methods: This study investigates methods for multivariate sequential modelling with a particular emphasis on long short-term memory (LSTM) recurrent neural networks. Data from a CVD decision support tool is linked to routinely collected national datasets including pharmaceutical dispensing, hospitalization, laboratory test results, and deaths. The study uses a 2-year observation and a 5-year prediction window. Selected methods are applied to the linked dataset. The experiments performed focus on CVD event prediction. CVD death or hospitalization in a 5-year interval was predicted for patients with history of lipid-lowering therapy.

Results: The results of the experiments showed temporal models are valuable for CVD event prediction over a 5-year interval. This is especially the case for LSTM, which produced the best predictive performance among all models compared achieving AUROC of 0.801 and average precision of 0.425. The non-temporal model comparator ridge classifier (RC) trained using all quarterly data or by aggregating quarterly data (averaging time-varying features) was highly competitive achieving AUROC of 0.799 and average precision of 0.420 and AUROC of 0.800 and average precision of 0.421, respectively.

Conclusion: This study provides evidence that the use of deep temporal models particularly LSTM in clinical decision support for chronic disease would be advantageous with LSTM significantly improving on commonly used regression models such as logistic regression and Cox proportional hazards on the task of CVD event prediction.

背景:风险评估的自动化临床决策支持是防治心血管疾病(CVD)的有力工具,可实现有针对性的早期干预,避免过度治疗或治疗不当的问题。然而,目前的心血管疾病风险预测模型使用的是基线观测数据,没有将患者病史明确表示为时间序列:本研究旨在探讨是否可以通过明确模拟患者病史的时间维度来改进事件预测:本研究探讨了多变量序列建模方法,并特别强调了长短期记忆(LSTM)递归神经网络。来自心血管疾病决策支持工具的数据与常规收集的国家数据集(包括配药、住院、实验室检测结果和死亡)相连接。研究使用了 2 年观察期和 5 年预测期。选定的方法被应用于链接数据集。实验重点是心血管疾病事件预测。对有降脂治疗史的患者进行了 5 年间隔期内心血管疾病死亡或住院预测:实验结果表明,时间模型对于预测 5 年间的心血管疾病事件很有价值。在所有比较模型中,LSTM 的预测性能最佳,AUROC 为 0.801,平均精度为 0.425。使用所有季度数据或通过聚合季度数据(平均时变特征)训练的非时态模型比较模型脊分类器(RC)具有很强的竞争力,AUROC 为 0.799,平均精度为 0.420;AUROC 为 0.800,平均精度为 0.421:这项研究证明,在慢性病临床决策支持中使用深度时空模型,尤其是 LSTM,将具有优势,在心血管疾病事件预测任务中,LSTM 明显优于逻辑回归和 Cox 比例危险等常用回归模型。
{"title":"Multivariate Sequential Analytics for Cardiovascular Disease Event Prediction.","authors":"William Hsu, Jim Warren, Patricia Riddle","doi":"10.1055/s-0042-1758687","DOIUrl":"10.1055/s-0042-1758687","url":null,"abstract":"<p><strong>Background: </strong>Automated clinical decision support for risk assessment is a powerful tool in combating cardiovascular disease (CVD), enabling targeted early intervention that could avoid issues of overtreatment or undertreatment. However, current CVD risk prediction models use observations at baseline without explicitly representing patient history as a time series.</p><p><strong>Objective: </strong>The aim of this study is to examine whether by explicitly modelling the temporal dimension of patient history event prediction may be improved.</p><p><strong>Methods: </strong>This study investigates methods for multivariate sequential modelling with a particular emphasis on long short-term memory (LSTM) recurrent neural networks. Data from a CVD decision support tool is linked to routinely collected national datasets including pharmaceutical dispensing, hospitalization, laboratory test results, and deaths. The study uses a 2-year observation and a 5-year prediction window. Selected methods are applied to the linked dataset. The experiments performed focus on CVD event prediction. CVD death or hospitalization in a 5-year interval was predicted for patients with history of lipid-lowering therapy.</p><p><strong>Results: </strong>The results of the experiments showed temporal models are valuable for CVD event prediction over a 5-year interval. This is especially the case for LSTM, which produced the best predictive performance among all models compared achieving AUROC of 0.801 and average precision of 0.425. The non-temporal model comparator ridge classifier (RC) trained using all quarterly data or by aggregating quarterly data (averaging time-varying features) was highly competitive achieving AUROC of 0.799 and average precision of 0.420 and AUROC of 0.800 and average precision of 0.421, respectively.</p><p><strong>Conclusion: </strong>This study provides evidence that the use of deep temporal models particularly LSTM in clinical decision support for chronic disease would be advantageous with LSTM significantly improving on commonly used regression models such as logistic regression and Cox proportional hazards on the task of CVD event prediction.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 02","pages":"e149-e171"},"PeriodicalIF":1.3,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/af/00/10-1055-s-0042-1758687.PMC9788915.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9247803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DxGenerator: An Improved Differential Diagnosis Generator for Primary Care Based on MetaMap and Semantic Reasoning. DxGenerator:基于元地图和语义推理的初级保健改进鉴别诊断生成器。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-01 DOI: 10.1055/a-1905-5639
Ali Sanaeifar, Saeid Eslami, Mitra Ahadi, Mohsen Kahani, Hassan Vakili Arki

Background: In recent years, researchers have used many computerized interventions to reduce medical errors, the third cause of death in developed countries. One of such interventions is using differential diagnosis generators in primary care, where physicians may encounter initial symptoms without any diagnostic presuppositions. These systems generate multiple diagnoses, ranked by their likelihood. As such, these reports' accuracy can be determined by the location of the correct diagnosis in the list.

Objective: This study aimed to design and evaluate a novel practical web-based differential diagnosis generator solution in primary care.

Methods: In this research, a new online clinical decision support system, called DxGenerator, was designed to improve diagnostic accuracy; to this end, an attempt was made to converge a semantic database with the unified medical language system (UMLS) knowledge base, using MetaMap tool and natural language processing. In this regard, 120 diseases of gastrointestinal organs causing abdominal pain were modeled into the database. After designing an inference engine and a pseudo-free-text interactive interface, 172 patient vignettes were inputted into DxGenerator and ISABEL, the most accurate similar system. The Wilcoxon signed ranked test was used to compare the position of correct diagnoses in DxGenerator and ISABEL. The α level was defined as 0.05.

Results: On a total of 172 vignettes, the mean and standard deviation of correct diagnosis positions improved from 4.2 ± 5.3 in ISABEL to 3.2 ± 3.9 in DxGenerator. This improvement was significant in the subgroup of uncommon diseases (p-value < 0.05).

Conclusion: Using UMLS knowledge base and MetaMap Tools can improve the accuracy of diagnostic systems in which terms are entered in a free text manner. Applying these new methods will help the medical community accept medical diagnostic systems better.

背景:近年来,研究人员使用了许多计算机化的干预措施来减少医疗事故,这是发达国家的第三大死亡原因。其中一种干预措施是在初级保健中使用鉴别诊断发生器,在初级保健中,医生可能在没有任何诊断前提的情况下遇到初始症状。这些系统产生多种诊断,并根据其可能性进行排序。因此,这些报告的准确性可以通过正确诊断在列表中的位置来确定。目的:本研究旨在设计和评估一种新颖实用的基于网络的初级保健鉴别诊断发生器解决方案。方法:本研究设计了一种新的在线临床决策支持系统DxGenerator,以提高诊断准确性;为此,利用MetaMap工具和自然语言处理技术,尝试将语义数据库与统一医学语言系统(UMLS)知识库进行融合。因此,120种引起腹痛的胃肠道器官疾病被建模到数据库中。在设计了推理引擎和伪自由文本交互界面后,将172个病人的小片段输入到DxGenerator和ISABEL中,这是最准确的类似系统。使用Wilcoxon符号排序检验比较DxGenerator和ISABEL中正确诊断的位置。α水平定义为0.05。结果:在172个样本中,正确诊断位置的平均值和标准差由ISABEL的4.2±5.3提高到DxGenerator的3.2±3.9。结论:使用UMLS知识库和MetaMap工具可以提高以自由文本方式输入术语的诊断系统的准确性。应用这些新方法将有助于医学界更好地接受医疗诊断系统。
{"title":"DxGenerator: An Improved Differential Diagnosis Generator for Primary Care Based on MetaMap and Semantic Reasoning.","authors":"Ali Sanaeifar,&nbsp;Saeid Eslami,&nbsp;Mitra Ahadi,&nbsp;Mohsen Kahani,&nbsp;Hassan Vakili Arki","doi":"10.1055/a-1905-5639","DOIUrl":"https://doi.org/10.1055/a-1905-5639","url":null,"abstract":"<p><strong>Background: </strong>In recent years, researchers have used many computerized interventions to reduce medical errors, the third cause of death in developed countries. One of such interventions is using differential diagnosis generators in primary care, where physicians may encounter initial symptoms without any diagnostic presuppositions. These systems generate multiple diagnoses, ranked by their likelihood. As such, these reports' accuracy can be determined by the location of the correct diagnosis in the list.</p><p><strong>Objective: </strong>This study aimed to design and evaluate a novel practical web-based differential diagnosis generator solution in primary care.</p><p><strong>Methods: </strong>In this research, a new online clinical decision support system, called DxGenerator, was designed to improve diagnostic accuracy; to this end, an attempt was made to converge a semantic database with the unified medical language system (UMLS) knowledge base, using MetaMap tool and natural language processing. In this regard, 120 diseases of gastrointestinal organs causing abdominal pain were modeled into the database. After designing an inference engine and a pseudo-free-text interactive interface, 172 patient vignettes were inputted into DxGenerator and ISABEL, the most accurate similar system. The Wilcoxon signed ranked test was used to compare the position of correct diagnoses in DxGenerator and ISABEL. The α level was defined as 0.05.</p><p><strong>Results: </strong>On a total of 172 vignettes, the mean and standard deviation of correct diagnosis positions improved from 4.2 ± 5.3 in ISABEL to 3.2 ± 3.9 in DxGenerator. This improvement was significant in the subgroup of uncommon diseases (<i>p</i>-value < 0.05).</p><p><strong>Conclusion: </strong>Using UMLS knowledge base and MetaMap Tools can improve the accuracy of diagnostic systems in which terms are entered in a free text manner. Applying these new methods will help the medical community accept medical diagnostic systems better.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 5-06","pages":"174-184"},"PeriodicalIF":1.7,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9253490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Use of Machine Learning to Identify Clinical Variables in Pregnant and Non-Pregnant Women with SARS-CoV-2 Infection. 使用机器学习识别感染SARS-CoV-2的孕妇和非孕妇的临床变量
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-09-01 Epub Date: 2022-09-12 DOI: 10.1055/s-0042-1756282
Itamar D Futterman, Rodney McLaren, Hila Friedmann, Nael Musleh, Shoshana Haberman

Objective: The aim of the study is to identify the important clinical variables found in both pregnant and non-pregnant women who tested positive for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection, using an artificial intelligence (AI) platform.

Methods: This was a retrospective cohort study of all women between the ages of 18 to 45, who were admitted to Maimonides Medical Center between March 10, 2020 and December 20, 2021. Patients were included if they had nasopharyngeal PCR swab positive for SARS-CoV-2. Safe People Artificial Intelligence (SPAI) platform, developed by Gynisus, Inc., was used to identify key clinical variables predicting a positive test in pregnant and non-pregnant women. A list of mathematically important clinical variables was generated for both non-pregnant and pregnant women.

Results: Positive results were obtained in 1,935 non-pregnant women and 1,909 non-pregnant women tested negative for SARS-CoV-2 infection. Among pregnant women, 280 tested positive, and 1,000 tested negative. The most important clinical variable to predict a positive swab result in non-pregnant women was age, while elevated D-dimer levels and presence of an abnormal fetal heart rate pattern were the most important clinical variable in pregnant women to predict a positive test.

Conclusion: In an attempt to better understand the natural history of the SARS-CoV-2 infection we present a side-by-side analysis of clinical variables found in pregnant and non-pregnant women who tested positive for COVID-19. These clinical variables can help stratify and highlight those at risk for SARS-CoV-2 infection and shed light on the individual patient risk for testing positive.

目的:利用人工智能(AI)平台,识别严重急性呼吸综合征冠状病毒2 (SARS-CoV-2)感染阳性孕妇和非孕妇的重要临床变量。方法:这是一项回顾性队列研究,纳入了2020年3月10日至2021年12月20日期间在迈蒙尼德医疗中心住院的所有年龄在18至45岁之间的女性。如果患者鼻咽PCR拭子呈SARS-CoV-2阳性,则纳入患者。由gyynisus公司开发的安全人员人工智能(SPAI)平台用于识别预测孕妇和非孕妇检测阳性的关键临床变量。一个数学上重要的临床变量列表被生成,包括孕妇和非孕妇。结果:1935名非孕妇检测结果为阳性,1909名非孕妇检测结果为阴性。在孕妇中,280人检测呈阳性,1000人检测呈阴性。预测非孕妇拭子阳性结果最重要的临床变量是年龄,而d -二聚体水平升高和胎儿心率异常是孕妇预测阳性测试最重要的临床变量。结论:为了更好地了解SARS-CoV-2感染的自然历史,我们对COVID-19检测呈阳性的孕妇和非孕妇的临床变量进行了并排分析。这些临床变量可以帮助分层和突出那些有SARS-CoV-2感染风险的人,并揭示个体患者检测呈阳性的风险。
{"title":"Use of Machine Learning to Identify Clinical Variables in Pregnant and Non-Pregnant Women with SARS-CoV-2 Infection.","authors":"Itamar D Futterman,&nbsp;Rodney McLaren,&nbsp;Hila Friedmann,&nbsp;Nael Musleh,&nbsp;Shoshana Haberman","doi":"10.1055/s-0042-1756282","DOIUrl":"https://doi.org/10.1055/s-0042-1756282","url":null,"abstract":"<p><strong>Objective: </strong>The aim of the study is to identify the important clinical variables found in both pregnant and non-pregnant women who tested positive for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection, using an artificial intelligence (AI) platform.</p><p><strong>Methods: </strong>This was a retrospective cohort study of all women between the ages of 18 to 45, who were admitted to Maimonides Medical Center between March 10, 2020 and December 20, 2021. Patients were included if they had nasopharyngeal PCR swab positive for SARS-CoV-2. Safe People Artificial Intelligence (SPAI) platform, developed by Gynisus, Inc., was used to identify key clinical variables predicting a positive test in pregnant and non-pregnant women. A list of mathematically important clinical variables was generated for both non-pregnant and pregnant women.</p><p><strong>Results: </strong>Positive results were obtained in 1,935 non-pregnant women and 1,909 non-pregnant women tested negative for SARS-CoV-2 infection. Among pregnant women, 280 tested positive, and 1,000 tested negative. The most important clinical variable to predict a positive swab result in non-pregnant women was age, while elevated D-dimer levels and presence of an abnormal fetal heart rate pattern were the most important clinical variable in pregnant women to predict a positive test.</p><p><strong>Conclusion: </strong>In an attempt to better understand the natural history of the SARS-CoV-2 infection we present a side-by-side analysis of clinical variables found in pregnant and non-pregnant women who tested positive for COVID-19. These clinical variables can help stratify and highlight those at risk for SARS-CoV-2 infection and shed light on the individual patient risk for testing positive.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":" ","pages":"61-67"},"PeriodicalIF":1.7,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33462790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Explainable Knowledge-Based System Using Subjective Preferences and Objective Data for Ranking Decision Alternatives. 一个可解释的基于知识的系统,使用主观偏好和客观数据对决策方案进行排序。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-09-01 Epub Date: 2022-10-11 DOI: 10.1055/s-0042-1756650
Kavya Ramisetty, Jabez Christopher, Subhrakanta Panda, Baktha Singh Lazarus, Julie Dayalan

Background: Allergy is a hypersensitive reaction that occurs when the allergen reacts with the immune system. The prevalence and severity of the allergies are uprising in South Asian countries. Allergy often occurs in combinations which becomes difficult for physicians to diagnose.

Objectives: This work aims to develop a decision-making model which aids physicians in diagnosing allergy comorbidities. The model intends to not only provide rational decisions, but also explainable knowledge about all alternatives.

Methods: The allergy data gathered from real-time sources contain a smaller number of samples for comorbidities. Decision-making model applies three sampling strategies, namely, ideal, single, and complete, to balance the data. Bayes theorem-based probabilistic approaches are used to extract knowledge from the balanced data. Preference weights for attributes with respect to alternatives are gathered from a group of domain-experts affiliated to different allergy testing centers. The weights are combined with objective knowledge to assign confidence values to alternatives. The system provides these values along with explanations to aid decision-makers in choosing an optimal decision.

Results: Metrics of explainability and user satisfaction are used to evaluate the effectiveness of the system in real-time diagnosis. Fleiss' Kappa statistic is 0.48, and hence the diagnosis of experts is said to be in moderate agreement. The decision-making model provides a maximum of 10 suitable and relevant pieces of evidence to explain a decision alternative. Clinicians have improved their diagnostic performance by 3% after using CDSS (77.93%) with a decrease in 20% of time taken.

Conclusion: The performance of less-experienced clinicians has improved with the support of an explainable decision-making model. The code for the framework with all intermediate results is available at https://github.com/kavya6697/Allergy-PT.git.

背景:过敏是过敏原与免疫系统发生反应时发生的超敏反应。在南亚国家,过敏的患病率和严重程度正在上升。过敏通常发生在组合中,这对医生来说很难诊断。目的:本工作旨在建立一个决策模型,以帮助医生诊断过敏合并症。该模型不仅要提供理性的决策,而且要提供关于所有选择的可解释的知识。方法:从实时来源收集的过敏数据包含较少的合并症样本。决策模型采用理想、单一和完整三种抽样策略来平衡数据。利用基于贝叶斯定理的概率方法从平衡数据中提取知识。相对于替代品属性的偏好权重是从隶属于不同过敏测试中心的一组领域专家那里收集的。权重与客观知识相结合,为备选方案分配置信度。该系统提供这些值以及解释,以帮助决策者选择最佳决策。结果:可解释性和用户满意度指标用于评估系统在实时诊断中的有效性。Fleiss的Kappa统计值为0.48,因此专家们的诊断被认为是中等一致的。决策模型提供最多10个合适和相关的证据来解释决策选择。使用CDSS后,临床医生的诊断性能提高了3%(77.93%),所需时间减少了20%。结论:在可解释的决策模型的支持下,经验不足的临床医生的表现得到了改善。包含所有中间结果的框架代码可在https://github.com/kavya6697/Allergy-PT.git上获得。
{"title":"An Explainable Knowledge-Based System Using Subjective Preferences and Objective Data for Ranking Decision Alternatives.","authors":"Kavya Ramisetty,&nbsp;Jabez Christopher,&nbsp;Subhrakanta Panda,&nbsp;Baktha Singh Lazarus,&nbsp;Julie Dayalan","doi":"10.1055/s-0042-1756650","DOIUrl":"https://doi.org/10.1055/s-0042-1756650","url":null,"abstract":"<p><strong>Background: </strong>Allergy is a hypersensitive reaction that occurs when the allergen reacts with the immune system. The prevalence and severity of the allergies are uprising in South Asian countries. Allergy often occurs in combinations which becomes difficult for physicians to diagnose.</p><p><strong>Objectives: </strong>This work aims to develop a decision-making model which aids physicians in diagnosing allergy comorbidities. The model intends to not only provide rational decisions, but also explainable knowledge about all alternatives.</p><p><strong>Methods: </strong>The allergy data gathered from real-time sources contain a smaller number of samples for comorbidities. Decision-making model applies three sampling strategies, namely, ideal, single, and complete, to balance the data. Bayes theorem-based probabilistic approaches are used to extract knowledge from the balanced data. Preference weights for attributes with respect to alternatives are gathered from a group of domain-experts affiliated to different allergy testing centers. The weights are combined with objective knowledge to assign confidence values to alternatives. The system provides these values along with explanations to aid decision-makers in choosing an optimal decision.</p><p><strong>Results: </strong>Metrics of explainability and user satisfaction are used to evaluate the effectiveness of the system in real-time diagnosis. Fleiss' Kappa statistic is 0.48, and hence the diagnosis of experts is said to be in moderate agreement. The decision-making model provides a maximum of 10 suitable and relevant pieces of evidence to explain a decision alternative. Clinicians have improved their diagnostic performance by 3% after using CDSS (77.93%) with a decrease in 20% of time taken.</p><p><strong>Conclusion: </strong>The performance of less-experienced clinicians has improved with the support of an explainable decision-making model. The code for the framework with all intermediate results is available at https://github.com/kavya6697/Allergy-PT.git.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":" ","pages":"111-122"},"PeriodicalIF":1.7,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33522531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Methods of Information in Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1