首页 > 最新文献

Methods of Information in Medicine最新文献

英文 中文
A Proposal for a Robust Validated Weighted General Data Protection Regulation-Based Scale to Assess the Quality of Privacy Policies of Mobile Health Applications: An eDelphi Study. 基于《通用数据保护条例》的强效验证加权量表建议,用于评估移动健康应用的隐私政策质量:一项 eDelphi 研究。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-12-01 Epub Date: 2023-08-17 DOI: 10.1055/a-2155-2021
Jaime Benjumea, Jorge Ropero, Enrique Dorronzoro-Zubiete, Octavio Rivera-Romero, Alejandro Carrasco

Background: Health care services are undergoing a digital transformation in which the Participatory Health Informatics field has a key role. Within this field, studies aimed to assess the quality of digital tools, including mHealth apps, are conducted. Privacy is one dimension of the quality of an mHealth app. Privacy consists of several components, including organizational, technical, and legal safeguards. Within legal safeguards, giving transparent information to the users on how their data are handled is crucial. This information is usually disclosed to users through the privacy policy document. Assessing the quality of a privacy policy is a complex task and several scales supporting this process have been proposed in the literature. However, these scales are heterogeneous and even not very objective. In our previous study, we proposed a checklist of items guiding the assessment of the quality of an mHealth app privacy policy, based on the General Data Protection Regulation.

Objective: To refine the robustness of our General Data Protection Regulation-based privacy scale to assess the quality of an mHealth app privacy policy, to identify new items, and to assign weights for every item in the scale.

Methods: A two-round modified eDelphi study was conducted involving a privacy expert panel.

Results: After the Delphi process, all the items in the scale were considered "important" or "very important" (4 and 5 in a 5-point Likert scale, respectively) by most of the experts. One of the original items was suggested to be reworded, while eight tentative items were suggested. Only two of them were finally added after Round 2. Eleven of the 16 items in the scale were considered "very important" (weight of 1), while the other 5 were considered "important" (weight of 0.5).

Conclusion: The Benjumea privacy scale is a new robust tool to assess the quality of an mHealth app privacy policy, providing a deeper and complementary analysis to other scales. Also, this robust scale provides a guideline for the development of high-quality privacy policies of mHealth apps.

背景:医疗保健服务正在经历数字化转型,其中参与式健康信息学领域扮演着重要角色。在这一领域,开展了旨在评估数字工具(包括移动医疗应用程序)质量的研究。隐私是移动医疗应用程序质量的一个维度。隐私由几个部分组成,包括组织、技术和法律保障。在法律保障措施中,向用户提供有关如何处理其数据的透明信息至关重要。这些信息通常通过隐私政策文件向用户披露。评估隐私政策的质量是一项复杂的任务,文献中提出了若干支持这一过程的量表。然而,这些量表各不相同,甚至不是很客观。在之前的研究中,我们根据《通用数据保护条例》,提出了一份移动医疗应用程序隐私政策质量评估指导项目清单:目的:完善我们基于《一般数据保护条例》的隐私量表的稳健性,以评估移动医疗应用程序隐私政策的质量,确定新的项目,并为量表中的每个项目分配权重:方法:由隐私专家小组进行两轮修改后的德尔菲研究:结果:经过德尔菲程序后,大多数专家认为量表中的所有项目都 "重要 "或 "非常重要"(在 5 分制李克特量表中分别为 4 分和 5 分)。其中一个原始项目被建议重新措辞,同时提出了八个暂定项目。第二轮之后,最终只增加了其中两个项目。本量表的 16 个项目中有 11 个被认为 "非常重要"(权重为 1),另外 5 个被认为 "重要"(权重为 0.5):Benjumea隐私量表是评估移动医疗应用程序隐私政策质量的一种新的稳健工具,可对其他量表进行更深入的补充分析。此外,该量表还为制定高质量的移动医疗应用程序隐私政策提供了指导。
{"title":"A Proposal for a Robust Validated Weighted General Data Protection Regulation-Based Scale to Assess the Quality of Privacy Policies of Mobile Health Applications: An eDelphi Study.","authors":"Jaime Benjumea, Jorge Ropero, Enrique Dorronzoro-Zubiete, Octavio Rivera-Romero, Alejandro Carrasco","doi":"10.1055/a-2155-2021","DOIUrl":"10.1055/a-2155-2021","url":null,"abstract":"<p><strong>Background: </strong>Health care services are undergoing a digital transformation in which the Participatory Health Informatics field has a key role. Within this field, studies aimed to assess the quality of digital tools, including mHealth apps, are conducted. Privacy is one dimension of the quality of an mHealth app. Privacy consists of several components, including organizational, technical, and legal safeguards. Within legal safeguards, giving transparent information to the users on how their data are handled is crucial. This information is usually disclosed to users through the privacy policy document. Assessing the quality of a privacy policy is a complex task and several scales supporting this process have been proposed in the literature. However, these scales are heterogeneous and even not very objective. In our previous study, we proposed a checklist of items guiding the assessment of the quality of an mHealth app privacy policy, based on the General Data Protection Regulation.</p><p><strong>Objective: </strong>To refine the robustness of our General Data Protection Regulation-based privacy scale to assess the quality of an mHealth app privacy policy, to identify new items, and to assign weights for every item in the scale.</p><p><strong>Methods: </strong>A two-round modified eDelphi study was conducted involving a privacy expert panel.</p><p><strong>Results: </strong>After the Delphi process, all the items in the scale were considered \"important\" or \"very important\" (4 and 5 in a 5-point Likert scale, respectively) by most of the experts. One of the original items was suggested to be reworded, while eight tentative items were suggested. Only two of them were finally added after Round 2. Eleven of the 16 items in the scale were considered \"very important\" (weight of 1), while the other 5 were considered \"important\" (weight of 0.5).</p><p><strong>Conclusion: </strong>The Benjumea privacy scale is a new robust tool to assess the quality of an mHealth app privacy policy, providing a deeper and complementary analysis to other scales. Also, this robust scale provides a guideline for the development of high-quality privacy policies of mHealth apps.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":" ","pages":"154-164"},"PeriodicalIF":1.7,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10878744/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10077516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prehospital Cardiac Arrest Should be Considered When Evaluating Coronavirus Disease 2019 Mortality in the United States. 在评估2019年美国冠状病毒病死亡率时应考虑院前心脏骤停。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-09-01 DOI: 10.1055/a-2015-1244
Nick Williams

Background: Public health emergencies leave little time to develop novel surveillance efforts. Understanding which preexisting clinical datasets are fit for surveillance use is of high value. Coronavirus disease 2019 (COVID-19) offers a natural applied informatics experiment to understand the fitness of clinical datasets for use in disease surveillance.

Objectives: This study evaluates the agreement between legacy surveillance time series data and discovers their relative fitness for use in understanding the severity of the COVID-19 emergency. Here fitness for use means the statistical agreement between events across series.

Methods: Thirteen weekly clinical event series from before and during the COVID-19 era for the United States were collected and integrated into a (multi) time series event data model. The Centers for Disease Control and Prevention (CDC) COVID-19 attributable mortality, CDC's excess mortality model, national Emergency Medical Services (EMS) calls, and Medicare encounter level claims were the data sources considered in this study. Cases were indexed by week from January 2015 through June of 2021 and fit to Distributed Random Forest models. Models returned the variable importance when predicting the series of interest from the remaining time series.

Results: Model r2 statistics ranged from 0.78 to 0.99 for the share of the volumes predicted correctly. Prehospital data were of high value, and cardiac arrest (CA) prior to EMS arrival was on average the best predictor (tied with study week). COVID-19 Medicare claims volumes can predict COVID-19 death certificates (agreement), while viral respiratory Medicare claim volumes cannot predict Medicare COVID-19 claims (disagreement).

Conclusion: Prehospital EMS data should be considered when evaluating the severity of COVID-19 because prehospital CA known to EMS was the strongest predictor on average across indices.

背景:突发公共卫生事件几乎没有时间发展新的监测工作。了解哪些预先存在的临床数据集适合监测使用是很有价值的。2019冠状病毒病(COVID-19)为了解临床数据集在疾病监测中的适用性提供了一个自然的应用信息学实验。目的:本研究评估了遗留监测时间序列数据之间的一致性,并发现它们在理解COVID-19紧急情况严重程度方面的相对适用性。这里的适应度是指跨系列事件之间的统计一致性。方法:收集美国新冠肺炎疫情之前和期间的13个每周临床事件系列,并将其整合到一个(多)时间序列事件数据模型中。美国疾病控制与预防中心(CDC)的COVID-19归因死亡率、CDC的超额死亡率模型、国家紧急医疗服务(EMS)电话和医疗保险遭遇水平索赔是本研究中考虑的数据源。从2015年1月到2021年6月,病例按周索引,并符合分布式随机森林模型。当从剩余时间序列中预测感兴趣的序列时,模型返回变量重要性。结果:模型r2统计量在0.78 ~ 0.99之间,正确预测的体积份额。院前数据具有很高的价值,EMS到达前的心脏骤停(CA)平均是最好的预测因子(与研究周相关)。COVID-19医疗保险索赔量可以预测COVID-19死亡证明(一致),而病毒性呼吸道医疗保险索赔量无法预测COVID-19医疗保险索赔(不一致)。结论:在评估COVID-19严重程度时应考虑院前EMS数据,因为EMS已知的院前CA是各指标平均最强的预测因子。
{"title":"Prehospital Cardiac Arrest Should be Considered When Evaluating Coronavirus Disease 2019 Mortality in the United States.","authors":"Nick Williams","doi":"10.1055/a-2015-1244","DOIUrl":"https://doi.org/10.1055/a-2015-1244","url":null,"abstract":"<p><strong>Background: </strong>Public health emergencies leave little time to develop novel surveillance efforts. Understanding which preexisting clinical datasets are fit for surveillance use is of high value. Coronavirus disease 2019 (COVID-19) offers a natural applied informatics experiment to understand the fitness of clinical datasets for use in disease surveillance.</p><p><strong>Objectives: </strong>This study evaluates the agreement between legacy surveillance time series data and discovers their relative fitness for use in understanding the severity of the COVID-19 emergency. Here fitness for use means the statistical agreement between events across series.</p><p><strong>Methods: </strong>Thirteen weekly clinical event series from before and during the COVID-19 era for the United States were collected and integrated into a (multi) time series event data model. The Centers for Disease Control and Prevention (CDC) COVID-19 attributable mortality, CDC's excess mortality model, national Emergency Medical Services (EMS) calls, and Medicare encounter level claims were the data sources considered in this study. Cases were indexed by week from January 2015 through June of 2021 and fit to Distributed Random Forest models. Models returned the variable importance when predicting the series of interest from the remaining time series.</p><p><strong>Results: </strong>Model r2 statistics ranged from 0.78 to 0.99 for the share of the volumes predicted correctly. Prehospital data were of high value, and cardiac arrest (CA) prior to EMS arrival was on average the best predictor (tied with study week). COVID-19 Medicare claims volumes can predict COVID-19 death certificates (agreement), while viral respiratory Medicare claim volumes cannot predict Medicare COVID-19 claims (disagreement).</p><p><strong>Conclusion: </strong>Prehospital EMS data should be considered when evaluating the severity of COVID-19 because prehospital CA known to EMS was the strongest predictor on average across indices.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"62 3-04","pages":"100-109"},"PeriodicalIF":1.7,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/81/24/10-1055-a-2015-1244.PMC10462431.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10512033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trans-O-MIM-An International Research Project on Open Access Transformation: Outcomes and Lessons Learned. trans - o - mim -开放获取转型国际研究项目:成果和经验教训。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-09-01 DOI: 10.1055/s-0043-1761499
Reinhold Haux, Esther Greussing, Stefanie Kuballa, Corinna Mielke, Mareike Schulze, Monika Taddicken
<p><strong>Background: </strong>During the last decades, the Open Access paradigm has become an important approach for publishing new scientific knowledge. From 2015 to 2020, the Trans-O-MIM research project was undertaken with the intention to identify and to explore solutions in transforming subscription-based journals into Open Access journals. Trans-O-MIM stands for strategies, models, and evaluation metrics for the goal-oriented, stepwise, sustainable, and fair transformation of established subscription-based scientific journals into Open-Access-based journals with <i>Methods of Information in Medicine</i> as an example.</p><p><strong>Objectives: </strong>To present an overview of the outcomes of the Trans-O-MIM research project as a whole and to share our major lessons learned.</p><p><strong>Methods: </strong>As an approach for transforming journals, a Tandem Model has been proposed and implemented for <i>Methods of Information in Medicine</i>. For developing a metric to observe and assess journal transformations, scenario analysis has been used. A qualitative and a two-tier quantitative study on drivers and obstacles of Open Access publishing for medical informatics researchers was designed and conducted. A project setup with a research team, a steering committee, and an international advisory board was established. Major international medical informatics events have been used for reporting and for receiving feedback.</p><p><strong>Results: </strong>Based on the Tandem Model, the journal <i>Methods of Information in Medicine</i> has been transformed into a journal where, in addition to its subscription-based track, from 2017 onwards a Gold Open Access track has been successfully added. An evaluation metric, composed of 5 scenarios and 65 parameters, has been developed, which can assist respective decision makers in assessing such transformations. The studies on drivers and obstacles of Open Access publishing showed that, while most researchers support the idea of making scientific knowledge freely accessible to everyone, they are hesitant about actually living this practice by choosing Open Access journals to publish their own work. Article-processing charges and quality issues are perceived as the main obstacles in this respect, revealing a two-sided evaluation of Open Access models, reflecting the different viewpoints of researchers as authors or readers. Especially researchers from low-income countries benefit from a barrier-free communication mainly in their role as readers and much less in their role as authors of scientific information. This became also evident at the institutional level, as Open Access policies or financial support through funding bodies are most prevalent in Europe and North America.</p><p><strong>Conclusion: </strong>With Trans-O-MIM, an international research project was performed. An existing journal has been transformed. In addition, with the support of the International Medical Informatics Association, as well
背景:在过去的几十年里,开放获取范式已经成为发表新科学知识的重要途径。从2015年到2020年,Trans-O-MIM研究项目旨在确定和探索将订阅型期刊转变为开放获取期刊的解决方案。Trans-O-MIM是指将现有的基于订阅的科学期刊以目标为导向、逐步、可持续和公平地转变为基于开放获取的期刊的策略、模型和评估指标,以《医学信息方法》为例。目标:概述整个跨o - mim研究项目的成果,并分享我们的主要经验教训。方法:作为期刊转化的途径,提出并实现了医学信息方法的串联模型。为了开发一个度量来观察和评估日志转换,场景分析已经被使用。对医学信息学研究人员开放获取出版的驱动因素和障碍进行了定性和两层定量研究。建立了一个由研究小组、指导委员会和国际咨询委员会组成的项目机构。主要的国际医学信息学活动已被用于报告和接收反馈。结果:基于串联模型,《医学信息方法》已转型为一份期刊,除其基于订阅的轨道外,自2017年起成功增加了黄金开放获取轨道。已经制定了一个由5个情景和65个参数组成的评价量度,它可以协助各自的决策者评估这种转变。关于开放获取出版的驱动因素和障碍的研究表明,尽管大多数研究人员支持让每个人都能免费获取科学知识的想法,但他们对选择开放获取期刊发表自己的研究成果是否真正实现这一实践犹豫不决。文章处理费用和质量问题被认为是这方面的主要障碍,揭示了开放获取模式的双向评估,反映了研究人员作为作者或读者的不同观点。尤其是来自低收入国家的科学家,他们从无障碍交流中受益的主要是他们作为读者的角色,而不是他们作为科学信息作者的角色。这在机构层面也很明显,因为开放获取政策或通过资助机构提供的财政支持在欧洲和北美最为普遍。结论:Trans-O-MIM是一项国际研究项目。已经转换了一个现有的日志。此外,在国际医学信息学协会、欧洲医学信息学联合会以及作为欧洲和德国医学信息学组织的德国医学信息学、生物计量学和流行病学协会的支持下,我们确实开展了一项关于开放获取激励措施的国际实验。据作者所知,两者合在一起是独一无二的。因此,我们期望这项研究可以为开放获取转型增加新的知识。
{"title":"Trans-O-MIM-An International Research Project on Open Access Transformation: Outcomes and Lessons Learned.","authors":"Reinhold Haux,&nbsp;Esther Greussing,&nbsp;Stefanie Kuballa,&nbsp;Corinna Mielke,&nbsp;Mareike Schulze,&nbsp;Monika Taddicken","doi":"10.1055/s-0043-1761499","DOIUrl":"https://doi.org/10.1055/s-0043-1761499","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;During the last decades, the Open Access paradigm has become an important approach for publishing new scientific knowledge. From 2015 to 2020, the Trans-O-MIM research project was undertaken with the intention to identify and to explore solutions in transforming subscription-based journals into Open Access journals. Trans-O-MIM stands for strategies, models, and evaluation metrics for the goal-oriented, stepwise, sustainable, and fair transformation of established subscription-based scientific journals into Open-Access-based journals with &lt;i&gt;Methods of Information in Medicine&lt;/i&gt; as an example.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objectives: &lt;/strong&gt;To present an overview of the outcomes of the Trans-O-MIM research project as a whole and to share our major lessons learned.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;As an approach for transforming journals, a Tandem Model has been proposed and implemented for &lt;i&gt;Methods of Information in Medicine&lt;/i&gt;. For developing a metric to observe and assess journal transformations, scenario analysis has been used. A qualitative and a two-tier quantitative study on drivers and obstacles of Open Access publishing for medical informatics researchers was designed and conducted. A project setup with a research team, a steering committee, and an international advisory board was established. Major international medical informatics events have been used for reporting and for receiving feedback.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Based on the Tandem Model, the journal &lt;i&gt;Methods of Information in Medicine&lt;/i&gt; has been transformed into a journal where, in addition to its subscription-based track, from 2017 onwards a Gold Open Access track has been successfully added. An evaluation metric, composed of 5 scenarios and 65 parameters, has been developed, which can assist respective decision makers in assessing such transformations. The studies on drivers and obstacles of Open Access publishing showed that, while most researchers support the idea of making scientific knowledge freely accessible to everyone, they are hesitant about actually living this practice by choosing Open Access journals to publish their own work. Article-processing charges and quality issues are perceived as the main obstacles in this respect, revealing a two-sided evaluation of Open Access models, reflecting the different viewpoints of researchers as authors or readers. Especially researchers from low-income countries benefit from a barrier-free communication mainly in their role as readers and much less in their role as authors of scientific information. This became also evident at the institutional level, as Open Access policies or financial support through funding bodies are most prevalent in Europe and North America.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusion: &lt;/strong&gt;With Trans-O-MIM, an international research project was performed. An existing journal has been transformed. In addition, with the support of the International Medical Informatics Association, as well","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"62 3-04","pages":"140-150"},"PeriodicalIF":1.7,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/d3/5e/10-1055-s-0043-1761499.PMC10462433.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10139694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Defining and Scoping Participatory Health Informatics: An eDelphi Study. 参与式健康信息学的定义和范围:一项爱德菲研究。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-09-01 DOI: 10.1055/a-2035-3008
Kerstin Denecke, Octavio Rivera Romero, Carolyn Petersen, Marge Benham-Hutchins, Miguel Cabrer, Shauna Davies, Rebecca Grainger, Rada Hussein, Guillermo Lopez-Campos, Fernando Martin-Sanchez, Mollie McKillop, Mark Merolli, Talya Miron-Shatz, Jesús Daniel Trigo, Graham Wright, Rolf Wynn, Carol Hullin Lucay Cossio, Elia Gabarron

Background: Health care has evolved to support the involvement of individuals in decision making by, for example, using mobile apps and wearables that may help empower people to actively participate in their treatment and health monitoring. While the term "participatory health informatics" (PHI) has emerged in literature to describe these activities, along with the use of social media for health purposes, the scope of the research field of PHI is not yet well defined.

Objective: This article proposes a preliminary definition of PHI and defines the scope of the field.

Methods: We used an adapted Delphi study design to gain consensus from participants on a definition developed from a previous review of literature. From the literature we derived a set of attributes describing PHI as comprising 18 characteristics, 14 aims, and 4 relations. We invited researchers, health professionals, and health informaticians to score these characteristics and aims of PHI and their relations to other fields over three survey rounds. In the first round participants were able to offer additional attributes for voting.

Results: The first round had 44 participants, with 28 participants participating in all three rounds. These 28 participants were gender-balanced and comprised participants from industry, academia, and health sectors from all continents. Consensus was reached on 16 characteristics, 9 aims, and 6 related fields.

Discussion: The consensus reached on attributes of PHI describe PHI as a multidisciplinary field that uses information technology and delivers tools with a focus on individual-centered care. It studies various effects of the use of such tools and technology. Its aims address the individuals in the role of patients, but also the health of a society as a whole. There are relationships to the fields of health informatics, digital health, medical informatics, and consumer health informatics.

Conclusion: We have proposed a preliminary definition, aims, and relationships of PHI based on literature and expert consensus. These can begin to be used to support development of research priorities and outcomes measurements.

背景:卫生保健已经发展到支持个人参与决策,例如,通过使用移动应用程序和可穿戴设备,可能有助于人们积极参与他们的治疗和健康监测。虽然文献中出现了“参与式健康信息学”(PHI)一词来描述这些活动,以及为健康目的使用社交媒体,但PHI研究领域的范围尚未得到很好的定义。目的:本文提出了PHI的初步定义,并界定了该领域的范围。方法:我们采用了一种适应性德尔菲研究设计,以获得参与者对先前文献综述中制定的定义的共识。从文献中,我们得出了一组描述PHI的属性,包括18个特征,14个目标和4个关系。我们邀请研究人员、卫生专业人员和卫生信息学家在三轮调查中对PHI的这些特征和目标以及它们与其他领域的关系进行评分。在第一轮中,参与者可以为投票提供额外的属性。结果:第一轮有44名参与者,其中28名参与者参加了所有三轮。这28名与会者性别均衡,包括来自各大洲的工业界、学术界和卫生部门的与会者。会议就16个特点、9个目标和6个相关领域达成共识。讨论:关于PHI属性达成的共识将PHI描述为一个多学科领域,它使用信息技术并提供以个人为中心的护理工具。它研究了使用这些工具和技术的各种影响。它的目标是解决个人在病人角色中的问题,同时也解决整个社会的健康问题。与健康信息学、数字健康、医学信息学和消费者健康信息学领域有关系。结论:我们在文献和专家共识的基础上提出了PHI的初步定义、目的和关系。这些可以开始用于支持研究重点和成果测量的发展。
{"title":"Defining and Scoping Participatory Health Informatics: An eDelphi Study.","authors":"Kerstin Denecke,&nbsp;Octavio Rivera Romero,&nbsp;Carolyn Petersen,&nbsp;Marge Benham-Hutchins,&nbsp;Miguel Cabrer,&nbsp;Shauna Davies,&nbsp;Rebecca Grainger,&nbsp;Rada Hussein,&nbsp;Guillermo Lopez-Campos,&nbsp;Fernando Martin-Sanchez,&nbsp;Mollie McKillop,&nbsp;Mark Merolli,&nbsp;Talya Miron-Shatz,&nbsp;Jesús Daniel Trigo,&nbsp;Graham Wright,&nbsp;Rolf Wynn,&nbsp;Carol Hullin Lucay Cossio,&nbsp;Elia Gabarron","doi":"10.1055/a-2035-3008","DOIUrl":"https://doi.org/10.1055/a-2035-3008","url":null,"abstract":"<p><strong>Background: </strong>Health care has evolved to support the involvement of individuals in decision making by, for example, using mobile apps and wearables that may help empower people to actively participate in their treatment and health monitoring. While the term \"participatory health informatics\" (PHI) has emerged in literature to describe these activities, along with the use of social media for health purposes, the scope of the research field of PHI is not yet well defined.</p><p><strong>Objective: </strong>This article proposes a preliminary definition of PHI and defines the scope of the field.</p><p><strong>Methods: </strong>We used an adapted Delphi study design to gain consensus from participants on a definition developed from a previous review of literature. From the literature we derived a set of attributes describing PHI as comprising 18 characteristics, 14 aims, and 4 relations. We invited researchers, health professionals, and health informaticians to score these characteristics and aims of PHI and their relations to other fields over three survey rounds. In the first round participants were able to offer additional attributes for voting.</p><p><strong>Results: </strong>The first round had 44 participants, with 28 participants participating in all three rounds. These 28 participants were gender-balanced and comprised participants from industry, academia, and health sectors from all continents. Consensus was reached on 16 characteristics, 9 aims, and 6 related fields.</p><p><strong>Discussion: </strong>The consensus reached on attributes of PHI describe PHI as a multidisciplinary field that uses information technology and delivers tools with a focus on individual-centered care. It studies various effects of the use of such tools and technology. Its aims address the individuals in the role of patients, but also the health of a society as a whole. There are relationships to the fields of health informatics, digital health, medical informatics, and consumer health informatics.</p><p><strong>Conclusion: </strong>We have proposed a preliminary definition, aims, and relationships of PHI based on literature and expert consensus. These can begin to be used to support development of research priorities and outcomes measurements.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"62 3-04","pages":"90-99"},"PeriodicalIF":1.7,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/67/87/10-1055-a-2035-3008.PMC10462430.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10139697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An Alternative Application of Natural Language Processing to Express a Characteristic Feature of Diseases in Japanese Medical Records. 自然语言处理在日本病历中表达疾病特征的另一种应用。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-09-01 DOI: 10.1055/a-2039-3773
Yoshinori Yamanouchi, Taishi Nakamura, Tokunori Ikeda, Koichiro Usuku

Background: Owing to the linguistic situation, Japanese natural language processing (NLP) requires morphological analyses for word segmentation using dictionary techniques.

Objective: We aimed to clarify whether it can be substituted with an open-end discovery-based NLP (OD-NLP), which does not use any dictionary techniques.

Methods: Clinical texts at the first medical visit were collected for comparison of OD-NLP with word dictionary-based-NLP (WD-NLP). Topics were generated in each document using a topic model, which later corresponded to the respective diseases determined in International Statistical Classification of Diseases and Related Health Problems 10 revision. The prediction accuracy and expressivity of each disease were examined in equivalent number of entities/words after filtration with either term frequency and inverse document frequency (TF-IDF) or dominance value (DMV).

Results: In documents from 10,520 observed patients, 169,913 entities and 44,758 words were segmented using OD-NLP and WD-NLP, simultaneously. Without filtering, accuracy and recall levels were low, and there was no difference in the harmonic mean of the F-measure between NLPs. However, physicians reported OD-NLP contained more meaningful words than WD-NLP. When datasets were created in an equivalent number of entities/words with TF-IDF, F-measure in OD-NLP was higher than WD-NLP at lower thresholds. When the threshold increased, the number of datasets created decreased, resulting in increased values of F-measure, although the differences disappeared. Two datasets near the maximum threshold showing differences in F-measure were examined whether their topics were associated with diseases. The results showed that more diseases were found in OD-NLP at lower thresholds, indicating that the topics described characteristics of diseases. The superiority remained as much as that of TF-IDF when filtration was changed to DMV.

Conclusion: The current findings prefer the use of OD-NLP to express characteristics of diseases from Japanese clinical texts and may help in the construction of document summaries and retrieval in clinical settings.

背景:日语自然语言处理(NLP)中,由于语言环境的原因,需要使用词典技术进行词法分析来进行分词。目的:我们旨在澄清是否可以用不使用任何字典技术的开放式基于发现的NLP (OD-NLP)代替它。方法:收集首次就诊时的临床文献,将OD-NLP与基于单词词典的nlp (WD-NLP)进行比较。在每个文件中使用主题模型生成主题,这些主题后来对应于《疾病和相关健康问题国际统计分类10》修订版中确定的各自疾病。每一种疾病的预测准确性和表达性在用术语频率和逆文档频率(TF-IDF)或优势值(DMV)过滤后,以等量的实体/单词进行检测。结果:在10520例患者的文献中,同时使用OD-NLP和WD-NLP对169,913个实体和44,758个单词进行了分割。在未进行过滤的情况下,nlp的准确率和召回率都很低,f测量的谐波平均值在nlp之间没有差异。然而,医生报告OD-NLP比WD-NLP包含更多有意义的单词。当使用TF-IDF以相同数量的实体/词创建数据集时,在较低阈值下,OD-NLP的F-measure高于WD-NLP。当阈值增加时,创建的数据集数量减少,导致F-measure值增加,尽管差异消失。两个接近最大阈值的数据集显示f值差异,检查其主题是否与疾病相关。结果表明,在较低阈值下,OD-NLP中发现的疾病较多,说明主题描述了疾病的特征。当过滤改为DMV时,其优越性与TF-IDF相同。结论:目前的研究结果更倾向于使用OD-NLP来表达日本临床文献的疾病特征,可能有助于临床文献摘要和检索的构建。
{"title":"An Alternative Application of Natural Language Processing to Express a Characteristic Feature of Diseases in Japanese Medical Records.","authors":"Yoshinori Yamanouchi,&nbsp;Taishi Nakamura,&nbsp;Tokunori Ikeda,&nbsp;Koichiro Usuku","doi":"10.1055/a-2039-3773","DOIUrl":"https://doi.org/10.1055/a-2039-3773","url":null,"abstract":"<p><strong>Background: </strong>Owing to the linguistic situation, Japanese natural language processing (NLP) requires morphological analyses for word segmentation using dictionary techniques.</p><p><strong>Objective: </strong>We aimed to clarify whether it can be substituted with an open-end discovery-based NLP (OD-NLP), which does not use any dictionary techniques.</p><p><strong>Methods: </strong>Clinical texts at the first medical visit were collected for comparison of OD-NLP with word dictionary-based-NLP (WD-NLP). Topics were generated in each document using a topic model, which later corresponded to the respective diseases determined in International Statistical Classification of Diseases and Related Health Problems 10 revision. The prediction accuracy and expressivity of each disease were examined in equivalent number of entities/words after filtration with either term frequency and inverse document frequency (TF-IDF) or dominance value (DMV).</p><p><strong>Results: </strong>In documents from 10,520 observed patients, 169,913 entities and 44,758 words were segmented using OD-NLP and WD-NLP, simultaneously. Without filtering, accuracy and recall levels were low, and there was no difference in the harmonic mean of the F-measure between NLPs. However, physicians reported OD-NLP contained more meaningful words than WD-NLP. When datasets were created in an equivalent number of entities/words with TF-IDF, F-measure in OD-NLP was higher than WD-NLP at lower thresholds. When the threshold increased, the number of datasets created decreased, resulting in increased values of F-measure, although the differences disappeared. Two datasets near the maximum threshold showing differences in F-measure were examined whether their topics were associated with diseases. The results showed that more diseases were found in OD-NLP at lower thresholds, indicating that the topics described characteristics of diseases. The superiority remained as much as that of TF-IDF when filtration was changed to DMV.</p><p><strong>Conclusion: </strong>The current findings prefer the use of OD-NLP to express characteristics of diseases from Japanese clinical texts and may help in the construction of document summaries and retrieval in clinical settings.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"62 3-04","pages":"110-118"},"PeriodicalIF":1.7,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/2b/3b/10-1055-a-2039-3773.PMC10462427.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10141870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Paper Files to Web-Based Application for Data-Driven Monitoring of HIV Programs: Nigeria's Journey to a National Data Repository for Decision-Making and Patient Care. 从纸质文件到基于网络的数据驱动的艾滋病毒监测应用程序:尼日利亚建立国家决策和患者护理数据库之旅。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-09-01 DOI: 10.1055/s-0043-1768711
Ibrahim Dalhatu, Chinedu Aniekwe, Adebobola Bashorun, Alhassan Abdulkadir, Emilio Dirlikov, Stephen Ohakanu, Oluwasanmi Adedokun, Ademola Oladipo, Ibrahim Jahun, Lisa Murie, Steven Yoon, Mubarak G Abdu-Aguye, Ahmed Sylvanus, Samuel Indyer, Isah Abbas, Mustapha Bello, Nannim Nalda, Matthias Alagi, Solomon Odafe, Sylvia Adebajo, Otse Ogorry, Murphy Akpu, Ifeanyi Okoye, Kunle Kakanfo, Amobi Andrew Onovo, Gregory Ashefor, Charles Nzelu, Akudo Ikpeazu, Gambo Aliyu, Tedd Ellerbrock, Mary Boyd, Kristen A Stafford, Mahesh Swaminathan

Background: Timely and reliable data are crucial for clinical, epidemiologic, and program management decision making. Electronic health information systems provide platforms for managing large longitudinal patient records. Nigeria implemented the National Data Repository (NDR) to create a central data warehouse of all people living with human immunodeficiency virus (PLHIV) while providing useful functionalities to aid decision making at different levels of program implementation.

Objective: We describe the Nigeria NDR and its development process, including its use for surveillance, research, and national HIV program monitoring toward achieving HIV epidemic control.

Methods: Stakeholder engagement meetings were held in 2013 to gather information on data elements and vocabulary standards for reporting patient-level information, technical infrastructure, human capacity requirements, and information flow. Findings from these meetings guided the development of the NDR. An implementation guide provided common terminologies and data reporting structures for data exchange between the NDR and the electronic medical record (EMR) systems. Data from the EMR were encoded in extensible markup language and sent to the NDR over secure hypertext transfer protocol after going through a series of validation processes.

Results: By June 30, 2021, the NDR had up-to-date records of 1,477,064 (94.4%) patients receiving HIV treatment across 1,985 health facilities, of which 1,266,512 (85.7%) patient records had fingerprint template data to support unique patient identification and record linkage to prevent registration of the same patient under different identities. Data from the NDR was used to support HIV program monitoring, case-based surveillance and production of products like the monthly lists of patients who have treatment interruptions and dashboards for monitoring HIV test and start.

Conclusion: The NDR enabled the availability of reliable and timely data for surveillance, research, and HIV program monitoring to guide program improvements to accelerate progress toward epidemic control.

背景:及时可靠的数据对临床、流行病学和项目管理决策至关重要。电子健康信息系统为管理大型纵向患者记录提供了平台。尼日利亚实施了国家数据存储库(NDR),以建立一个关于所有人类免疫缺陷病毒(艾滋病毒)感染者的中央数据仓库,同时提供有用的功能,以协助不同级别方案执行的决策。目的:我们描述了尼日利亚的NDR及其发展过程,包括其用于监测、研究和国家艾滋病毒规划监测,以实现艾滋病毒流行控制。方法:于2013年召开利益相关者参与会议,收集报告患者级信息、技术基础设施、人员能力要求和信息流的数据元素和词汇标准信息。这些会议的结论指导了《国家发展规划》的制定。一份实施指南为NDR和电子病历系统之间的数据交换提供了通用术语和数据报告结构。EMR中的数据用可扩展标记语言编码,经过一系列验证过程后,通过安全超文本传输协议发送给NDR。结果:截至2021年6月30日,全国共有1985家卫生机构的147.7064万例(94.4%)艾滋病患者接受了NDR的最新记录,其中126.6512万例(85.7%)患者记录具有指纹模板数据,支持患者唯一识别和记录链接,防止同一患者以不同身份登记。来自《国家发展规划》的数据被用于支持艾滋病毒规划监测、基于病例的监测和产品的制作,如每月中断治疗的患者名单和监测艾滋病毒检测和启动的仪表板。结论:NDR为监测、研究和艾滋病毒规划监测提供了可靠和及时的数据,以指导规划改进,加快流行病控制的进展。
{"title":"From Paper Files to Web-Based Application for Data-Driven Monitoring of HIV Programs: Nigeria's Journey to a National Data Repository for Decision-Making and Patient Care.","authors":"Ibrahim Dalhatu,&nbsp;Chinedu Aniekwe,&nbsp;Adebobola Bashorun,&nbsp;Alhassan Abdulkadir,&nbsp;Emilio Dirlikov,&nbsp;Stephen Ohakanu,&nbsp;Oluwasanmi Adedokun,&nbsp;Ademola Oladipo,&nbsp;Ibrahim Jahun,&nbsp;Lisa Murie,&nbsp;Steven Yoon,&nbsp;Mubarak G Abdu-Aguye,&nbsp;Ahmed Sylvanus,&nbsp;Samuel Indyer,&nbsp;Isah Abbas,&nbsp;Mustapha Bello,&nbsp;Nannim Nalda,&nbsp;Matthias Alagi,&nbsp;Solomon Odafe,&nbsp;Sylvia Adebajo,&nbsp;Otse Ogorry,&nbsp;Murphy Akpu,&nbsp;Ifeanyi Okoye,&nbsp;Kunle Kakanfo,&nbsp;Amobi Andrew Onovo,&nbsp;Gregory Ashefor,&nbsp;Charles Nzelu,&nbsp;Akudo Ikpeazu,&nbsp;Gambo Aliyu,&nbsp;Tedd Ellerbrock,&nbsp;Mary Boyd,&nbsp;Kristen A Stafford,&nbsp;Mahesh Swaminathan","doi":"10.1055/s-0043-1768711","DOIUrl":"https://doi.org/10.1055/s-0043-1768711","url":null,"abstract":"<p><strong>Background: </strong>Timely and reliable data are crucial for clinical, epidemiologic, and program management decision making. Electronic health information systems provide platforms for managing large longitudinal patient records. Nigeria implemented the National Data Repository (NDR) to create a central data warehouse of all people living with human immunodeficiency virus (PLHIV) while providing useful functionalities to aid decision making at different levels of program implementation.</p><p><strong>Objective: </strong>We describe the Nigeria NDR and its development process, including its use for surveillance, research, and national HIV program monitoring toward achieving HIV epidemic control.</p><p><strong>Methods: </strong>Stakeholder engagement meetings were held in 2013 to gather information on data elements and vocabulary standards for reporting patient-level information, technical infrastructure, human capacity requirements, and information flow. Findings from these meetings guided the development of the NDR. An implementation guide provided common terminologies and data reporting structures for data exchange between the NDR and the electronic medical record (EMR) systems. Data from the EMR were encoded in extensible markup language and sent to the NDR over secure hypertext transfer protocol after going through a series of validation processes.</p><p><strong>Results: </strong>By June 30, 2021, the NDR had up-to-date records of 1,477,064 (94.4%) patients receiving HIV treatment across 1,985 health facilities, of which 1,266,512 (85.7%) patient records had fingerprint template data to support unique patient identification and record linkage to prevent registration of the same patient under different identities. Data from the NDR was used to support HIV program monitoring, case-based surveillance and production of products like the monthly lists of patients who have treatment interruptions and dashboards for monitoring HIV test and start.</p><p><strong>Conclusion: </strong>The NDR enabled the availability of reliable and timely data for surveillance, research, and HIV program monitoring to guide program improvements to accelerate progress toward epidemic control.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"62 3-04","pages":"130-139"},"PeriodicalIF":1.7,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/b5/f9/10-1055-s-0043-1768711.PMC10462428.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10136836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rare Diseases in Hospital Information Systems-An Interoperable Methodology for Distributed Data Quality Assessments. 医院信息系统中的罕见疾病——分布式数据质量评估的可互操作方法。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-09-01 DOI: 10.1055/a-2006-1018
Kais Tahar, Tamara Martin, Yongli Mou, Raphael Verbuecheln, Holm Graessner, Dagmar Krefting

Background: Multisite research networks such as the project "Collaboration on Rare Diseases" connect various hospitals to obtain sufficient data for clinical research. However, data quality (DQ) remains a challenge for the secondary use of data recorded in different health information systems. High levels of DQ as well as appropriate quality assessment methods are needed to support the reuse of such distributed data.

Objectives: The aim of this work is the development of an interoperable methodology for assessing the quality of data recorded in heterogeneous sources to improve the quality of rare disease (RD) documentation and support clinical research.

Methods: We first developed a conceptual framework for DQ assessment. Using this theoretical guidance, we implemented a software framework that provides appropriate tools for calculating DQ metrics and for generating local as well as cross-institutional reports. We further applied our methodology on synthetic data distributed across multiple hospitals using Personal Health Train. Finally, we used precision and recall as metrics to validate our implementation.

Results: Four DQ dimensions were defined and represented as disjunct ontological categories. Based on these top dimensions, 9 DQ concepts, 10 DQ indicators, and 25 DQ parameters were developed and applied to different data sets. Randomly introduced DQ issues were all identified and reported automatically. The generated reports show the resulting DQ indicators and detected DQ issues.

Conclusion: We have shown that our approach yields promising results, which can be used for local and cross-institutional DQ assessments. The developed frameworks provide useful methods for interoperable and privacy-preserving assessments of DQ that meet the specified requirements. This study has demonstrated that our methodology is capable of detecting DQ issues such as ambiguity or implausibility of coded diagnoses. It can be used for DQ benchmarking to improve the quality of RD documentation and to support clinical research on distributed data.

背景:“罕见病合作”项目等多站点研究网络将各医院连接起来,以获得临床研究所需的足够数据。然而,对于不同卫生信息系统中记录的数据的二次使用,数据质量仍然是一个挑战。需要高水平的DQ以及适当的质量评估方法来支持这种分布式数据的重用。目的:这项工作的目的是开发一种可互操作的方法来评估异质来源记录的数据质量,以提高罕见病(RD)文献的质量并支持临床研究。方法:我们首先开发了DQ评估的概念框架。使用这一理论指导,我们实现了一个软件框架,该框架为计算DQ度量和生成本地以及跨机构报告提供了适当的工具。我们进一步将我们的方法应用于使用Personal Health Train分布在多家医院的合成数据。最后,我们使用精确度和召回率作为度量来验证我们的实现。结果:定义了四个DQ维度,并将其表示为不相交的本体范畴。基于这些顶级维度,我们开发了9个DQ概念、10个DQ指标和25个DQ参数,并将其应用于不同的数据集。随机引入的DQ问题都被自动识别和报告。生成的报告显示结果DQ指示器和检测到的DQ问题。结论:我们已经表明,我们的方法产生了有希望的结果,可用于本地和跨机构DQ评估。所开发的框架为满足指定需求的DQ互操作和隐私保护评估提供了有用的方法。这项研究表明,我们的方法是能够检测DQ问题,如编码诊断的歧义或不可信。它可以用于DQ基准测试,以提高RD文档的质量,并支持分布式数据的临床研究。
{"title":"Rare Diseases in Hospital Information Systems-An Interoperable Methodology for Distributed Data Quality Assessments.","authors":"Kais Tahar,&nbsp;Tamara Martin,&nbsp;Yongli Mou,&nbsp;Raphael Verbuecheln,&nbsp;Holm Graessner,&nbsp;Dagmar Krefting","doi":"10.1055/a-2006-1018","DOIUrl":"https://doi.org/10.1055/a-2006-1018","url":null,"abstract":"<p><strong>Background: </strong>Multisite research networks such as the project \"Collaboration on Rare Diseases\" connect various hospitals to obtain sufficient data for clinical research. However, data quality (DQ) remains a challenge for the secondary use of data recorded in different health information systems. High levels of DQ as well as appropriate quality assessment methods are needed to support the reuse of such distributed data.</p><p><strong>Objectives: </strong>The aim of this work is the development of an interoperable methodology for assessing the quality of data recorded in heterogeneous sources to improve the quality of rare disease (RD) documentation and support clinical research.</p><p><strong>Methods: </strong>We first developed a conceptual framework for DQ assessment. Using this theoretical guidance, we implemented a software framework that provides appropriate tools for calculating DQ metrics and for generating local as well as cross-institutional reports. We further applied our methodology on synthetic data distributed across multiple hospitals using Personal Health Train. Finally, we used precision and recall as metrics to validate our implementation.</p><p><strong>Results: </strong>Four DQ dimensions were defined and represented as disjunct ontological categories. Based on these top dimensions, 9 DQ concepts, 10 DQ indicators, and 25 DQ parameters were developed and applied to different data sets. Randomly introduced DQ issues were all identified and reported automatically. The generated reports show the resulting DQ indicators and detected DQ issues.</p><p><strong>Conclusion: </strong>We have shown that our approach yields promising results, which can be used for local and cross-institutional DQ assessments. The developed frameworks provide useful methods for interoperable and privacy-preserving assessments of DQ that meet the specified requirements. This study has demonstrated that our methodology is capable of detecting DQ issues such as ambiguity or implausibility of coded diagnoses. It can be used for DQ benchmarking to improve the quality of RD documentation and to support clinical research on distributed data.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"62 3-04","pages":"71-89"},"PeriodicalIF":1.7,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462432/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10138370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Simple-to-Use R Package for Mimicking Study Data by Simulations. 一个简单易用的R包,用于模拟研究数据。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-09-01 DOI: 10.1055/a-2048-7692
Giorgos Koliopanos, Francisco Ojeda, Andreas Ziegler

Background: Data protection policies might prohibit the transfer of existing study data to interested research groups. To overcome legal restrictions, simulated data can be transferred that mimic the structure but are different from the existing study data.

Objectives: The aim of this work is to introduce the simple-to-use R package Mock Data Generation (modgo) that may be used for simulating data from existing study data for continuous, ordinal categorical, and dichotomous variables.

Methods: The core is to combine rank inverse normal transformation with the calculation of a correlation matrix for all variables. Data can then be simulated from a multivariate normal and transferred back to the original scale of the variables. Unique features of modgo are that it allows to change the correlation between variables, to perform perturbation analysis, to handle multicenter data, and to change inclusion/exclusion criteria by selecting specific values of one or a set of variables. Simulation studies on real data demonstrate the validity and flexibility of modgo.

Results: modgo mimicked the structure of the original study data. Results of modgo were similar with those from two other existing packages in standard simulation scenarios. modgo's flexibility was demonstrated on several expansions.

Conclusion: The R package modgo is useful when existing study data may not be shared. Its perturbation expansion permits to simulate truly anonymized subjects. The expansion to multicenter studies can be used for validating prediction models. Additional expansions can support the unraveling of associations even in large study data and can be useful in power calculations.

背景:数据保护政策可能会禁止将现有研究数据转移到感兴趣的研究小组。为了克服法律限制,可以传输模拟结构但与现有研究数据不同的模拟数据。目的:这项工作的目的是介绍简单易用的R包模拟数据生成(modgo),可用于模拟现有研究数据中的连续、有序分类和二分类变量的数据。方法:将秩反正态变换与各变量的相关矩阵的计算相结合。然后,可以从多元正态态模拟数据,并将其转移回变量的原始尺度。modgo的独特之处在于它允许改变变量之间的相关性,执行扰动分析,处理多中心数据,并通过选择一个或一组变量的特定值来改变纳入/排除标准。对实际数据的仿真研究表明了该模型的有效性和灵活性。结果:modgo模拟了原始研究数据的结构。modgo的结果与其他两个现有软件包在标准模拟场景中的结果相似。Modgo的灵活性在几个扩展中得到了证明。结论:当现有研究数据不能共享时,R包模式是有用的。它的扰动扩展允许模拟真正匿名的对象。扩展到多中心研究可用于验证预测模型。额外的扩展甚至可以在大型研究数据中支持关联的解开,并且在功率计算中很有用。
{"title":"A Simple-to-Use R Package for Mimicking Study Data by Simulations.","authors":"Giorgos Koliopanos,&nbsp;Francisco Ojeda,&nbsp;Andreas Ziegler","doi":"10.1055/a-2048-7692","DOIUrl":"https://doi.org/10.1055/a-2048-7692","url":null,"abstract":"<p><strong>Background: </strong>Data protection policies might prohibit the transfer of existing study data to interested research groups. To overcome legal restrictions, simulated data can be transferred that mimic the structure but are different from the existing study data.</p><p><strong>Objectives: </strong>The aim of this work is to introduce the simple-to-use R package Mock Data Generation (modgo) that may be used for simulating data from existing study data for continuous, ordinal categorical, and dichotomous variables.</p><p><strong>Methods: </strong>The core is to combine rank inverse normal transformation with the calculation of a correlation matrix for all variables. Data can then be simulated from a multivariate normal and transferred back to the original scale of the variables. Unique features of modgo are that it allows to change the correlation between variables, to perform perturbation analysis, to handle multicenter data, and to change inclusion/exclusion criteria by selecting specific values of one or a set of variables. Simulation studies on real data demonstrate the validity and flexibility of modgo.</p><p><strong>Results: </strong>modgo mimicked the structure of the original study data. Results of modgo were similar with those from two other existing packages in standard simulation scenarios. modgo's flexibility was demonstrated on several expansions.</p><p><strong>Conclusion: </strong>The R package modgo is useful when existing study data may not be shared. Its perturbation expansion permits to simulate truly anonymized subjects. The expansion to multicenter studies can be used for validating prediction models. Additional expansions can support the unraveling of associations even in large study data and can be useful in power calculations.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"62 3-04","pages":"119-129"},"PeriodicalIF":1.7,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/75/40/10-1055-a-2048-7692.PMC10462429.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10492948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synthetic Tabular Data Evaluation in the Health Domain Covering Resemblance, Utility, and Privacy Dimensions. 健康领域的综合表格数据评估,涵盖相似性、效用和隐私维度。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-06-01 DOI: 10.1055/s-0042-1760247
Mikel Hernadez, Gorka Epelde, Ane Alberdi, Rodrigo Cilla, Debbie Rankin

Background: Synthetic tabular data generation is a potentially valuable technology with great promise for data augmentation and privacy preservation. However, prior to adoption, an empirical assessment of generated synthetic tabular data is required across dimensions relevant to the target application to determine its efficacy. A lack of standardized and objective evaluation and benchmarking strategy for synthetic tabular data in the health domain has been found in the literature.

Objective: The aim of this paper is to identify key dimensions, per dimension metrics, and methods for evaluating synthetic tabular data generated with different techniques and configurations for health domain application development and to provide a strategy to orchestrate them.

Methods: Based on the literature, the resemblance, utility, and privacy dimensions have been prioritized, and a collection of metrics and methods for their evaluation are orchestrated into a complete evaluation pipeline. This way, a guided and comparative assessment of generated synthetic tabular data can be done, categorizing its quality into three categories ("Excellent," "Good," and "Poor"). Six health care-related datasets and four synthetic tabular data generation approaches have been chosen to conduct an analysis and evaluation to verify the utility of the proposed evaluation pipeline.

Results: The synthetic tabular data generated with the four selected approaches has maintained resemblance, utility, and privacy for most datasets and synthetic tabular data generation approach combination. In several datasets, some approaches have outperformed others, while in other datasets, more than one approach has yielded the same performance.

Conclusion: The results have shown that the proposed pipeline can effectively be used to evaluate and benchmark the synthetic tabular data generated by various synthetic tabular data generation approaches. Therefore, this pipeline can support the scientific community in selecting the most suitable synthetic tabular data generation approaches for their data and application of interest.

背景:合成表格数据生成是一种潜在的有价值的技术,在数据增强和隐私保护方面具有很大的前景。然而,在采用之前,需要跨与目标应用程序相关的维度对生成的合成表格数据进行经验评估,以确定其有效性。文献中发现,卫生领域合成表格数据缺乏标准化和客观的评估和基准策略。目的:本文的目的是确定关键维度、每维度度量和评估使用不同技术和配置生成的健康领域应用程序开发的综合表格数据的方法,并提供编排它们的策略。方法:基于文献,相似性、效用和隐私维度已被优先考虑,并将其评估的度量和方法集合编排成一个完整的评估管道。通过这种方式,可以对生成的合成表格数据进行指导和比较评估,将其质量分为三类(“优秀”、“良好”和“差”)。选择了六个卫生保健相关数据集和四种综合表格数据生成方法进行分析和评估,以验证拟议的评估管道的效用。结果:对于大多数数据集和合成表格数据生成方法组合,四种方法生成的合成表格数据保持了相似性、实用性和隐私性。在一些数据集中,一些方法优于其他方法,而在其他数据集中,不止一种方法产生了相同的性能。结论:实验结果表明,该管道可有效地对各种合成表格数据生成方法生成的合成表格数据进行评价和基准测试。因此,这个管道可以支持科学界为他们感兴趣的数据和应用选择最合适的合成表格数据生成方法。
{"title":"Synthetic Tabular Data Evaluation in the Health Domain Covering Resemblance, Utility, and Privacy Dimensions.","authors":"Mikel Hernadez,&nbsp;Gorka Epelde,&nbsp;Ane Alberdi,&nbsp;Rodrigo Cilla,&nbsp;Debbie Rankin","doi":"10.1055/s-0042-1760247","DOIUrl":"https://doi.org/10.1055/s-0042-1760247","url":null,"abstract":"<p><strong>Background: </strong>Synthetic tabular data generation is a potentially valuable technology with great promise for data augmentation and privacy preservation. However, prior to adoption, an empirical assessment of generated synthetic tabular data is required across dimensions relevant to the target application to determine its efficacy. A lack of standardized and objective evaluation and benchmarking strategy for synthetic tabular data in the health domain has been found in the literature.</p><p><strong>Objective: </strong>The aim of this paper is to identify key dimensions, per dimension metrics, and methods for evaluating synthetic tabular data generated with different techniques and configurations for health domain application development and to provide a strategy to orchestrate them.</p><p><strong>Methods: </strong>Based on the literature, the resemblance, utility, and privacy dimensions have been prioritized, and a collection of metrics and methods for their evaluation are orchestrated into a complete evaluation pipeline. This way, a guided and comparative assessment of generated synthetic tabular data can be done, categorizing its quality into three categories (\"<i>Excellent,</i>\" \"<i>Good,</i>\" and \"<i>Poor</i>\"). Six health care-related datasets and four synthetic tabular data generation approaches have been chosen to conduct an analysis and evaluation to verify the utility of the proposed evaluation pipeline.</p><p><strong>Results: </strong>The synthetic tabular data generated with the four selected approaches has maintained resemblance, utility, and privacy for most datasets and synthetic tabular data generation approach combination. In several datasets, some approaches have outperformed others, while in other datasets, more than one approach has yielded the same performance.</p><p><strong>Conclusion: </strong>The results have shown that the proposed pipeline can effectively be used to evaluate and benchmark the synthetic tabular data generated by various synthetic tabular data generation approaches. Therefore, this pipeline can support the scientific community in selecting the most suitable synthetic tabular data generation approaches for their data and application of interest.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"62 S 01","pages":"e19-e38"},"PeriodicalIF":1.7,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/31/67/10-1055-s-0042-1760247.PMC10306449.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9789348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Aligning Semantic Interoperability Frameworks with the FOXS Stack for FAIR Health Data. 将语义互操作性框架与FOXS堆栈对齐以实现公平健康数据。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-06-01 DOI: 10.1055/a-1993-8036
John Meredith, Nicola Whitehead, Michael Dacey

Background: FAIR Guiding Principles present a synergy with the use cases for digital health records, in that clinical data need to be found, accessible within a range of environments, and data must interoperate between systems and subsequently reused. The use of HL7 FHIR, openEHR, IHE XDS, and SNOMED CT (FOXS) together represents a specification to create an open digital health platform for modern health care applications.

Objectives: To describe where logical FOXS components align to the European Open Science Cloud Interoperability Framework (EOSC-IF) reference architecture for semantic interoperability. This should provide a means of defining if FOXS aligns to FAIR principles and to establish the data models and structures that support longitudinal care records as being fit to underpin scientific research.

Methods: The EOSC-IF Semantic View is a representation of semantic interoperability where meaning is preserved between systems and users. This was analyzed and cross-referenced with FOXS architectural components, mapping concepts, and objects that describe content such as catalogues and semantic artifacts.

Results: Majority of conceptual Semantic View components were featured within the FOXS architecture. Semantic Business Objects are composed of a range of elements such as openEHR archetypes and templates, FHIR resources and profiles, SNOMED CT concepts, and XDS document identifiers. Semantic Functional Content comprises catalogues of metadata that were also supported by openEHR and FHIR tools.

Conclusions: Despite some elements of EOSC-IF being vague (e.g., FAIR Digital Object), there was a broad conformance to the framework concepts and the components of a FOXS platform. This work supports a health-domain-specific view of semantic interoperability and how this may be achieved to support FAIR data for health research via a standardized framework.

背景:FAIR指导原则提出了与数字健康记录用例的协同作用,因为临床数据需要在一系列环境中被发现和访问,并且数据必须在系统之间互操作并随后被重用。HL7 FHIR、openEHR、IHE XDS和SNOMED CT (FOXS)的共同使用代表了为现代医疗保健应用程序创建开放数字医疗平台的规范。目标:描述逻辑FOXS组件与欧洲开放科学云互操作性框架(EOSC-IF)参考架构在语义互操作性方面的一致性。这应该提供一种方法来定义FOXS是否符合FAIR原则,并建立数据模型和结构,以支持纵向护理记录适合支撑科学研究。方法:EOSC-IF语义视图是语义互操作性的一种表示,其中在系统和用户之间保留了意义。对其进行了分析,并与FOXS体系结构组件、映射概念和描述目录和语义构件等内容的对象进行了交叉引用。结果:大多数概念语义视图组件在FOXS体系结构中具有特征。语义业务对象由一系列元素组成,例如openEHR原型和模板、FHIR资源和概要文件、SNOMED CT概念和XDS文档标识符。语义功能内容包括元数据目录,openEHR和FHIR工具也支持这些目录。结论:尽管eoc - if的一些元素是模糊的(例如,FAIR数字对象),但对框架概念和FOXS平台的组件有广泛的一致性。这项工作支持卫生领域特定的语义互操作性观点,以及如何通过标准化框架实现这一点,以支持卫生研究的FAIR数据。
{"title":"Aligning Semantic Interoperability Frameworks with the FOXS Stack for FAIR Health Data.","authors":"John Meredith,&nbsp;Nicola Whitehead,&nbsp;Michael Dacey","doi":"10.1055/a-1993-8036","DOIUrl":"https://doi.org/10.1055/a-1993-8036","url":null,"abstract":"<p><strong>Background: </strong>FAIR Guiding Principles present a synergy with the use cases for digital health records, in that clinical data need to be found, accessible within a range of environments, and data must interoperate between systems and subsequently reused. The use of HL7 FHIR, openEHR, IHE XDS, and SNOMED CT (FOXS) together represents a specification to create an open digital health platform for modern health care applications.</p><p><strong>Objectives: </strong>To describe where logical FOXS components align to the European Open Science Cloud Interoperability Framework (EOSC-IF) reference architecture for semantic interoperability. This should provide a means of defining if FOXS aligns to FAIR principles and to establish the data models and structures that support longitudinal care records as being fit to underpin scientific research.</p><p><strong>Methods: </strong>The EOSC-IF Semantic View is a representation of semantic interoperability where meaning is preserved between systems and users. This was analyzed and cross-referenced with FOXS architectural components, mapping concepts, and objects that describe content such as catalogues and semantic artifacts.</p><p><strong>Results: </strong>Majority of conceptual Semantic View components were featured within the FOXS architecture. Semantic Business Objects are composed of a range of elements such as openEHR archetypes and templates, FHIR resources and profiles, SNOMED CT concepts, and XDS document identifiers. Semantic Functional Content comprises catalogues of metadata that were also supported by openEHR and FHIR tools.</p><p><strong>Conclusions: </strong>Despite some elements of EOSC-IF being vague (e.g., FAIR Digital Object), there was a broad conformance to the framework concepts and the components of a FOXS platform. This work supports a health-domain-specific view of semantic interoperability and how this may be achieved to support FAIR data for health research via a standardized framework.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"62 S 01","pages":"e39-e46"},"PeriodicalIF":1.7,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/a3/76/10-1055-a-1993-8036.PMC10306448.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9786736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Methods of Information in Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1