首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
Telehealth Uptake Among Hispanic People During COVID-19: Retrospective Observational Study. COVID-19 期间西语裔人群对远程医疗的接受程度:回顾性观察研究
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-24 DOI: 10.2196/57717
Di Shang, Cynthia Williams, Hera Culiqi
<p><strong>Background: </strong>The Hispanic community represents a sizeable community that experiences inequities in the US health care system. As the system has moved toward digital health platforms, evaluating the potential impact on Hispanic communities is critical.</p><p><strong>Objective: </strong>The study aimed to investigate demographic, socioeconomic, and behavioral factors contributing to low telehealth use in Hispanic communities.</p><p><strong>Methods: </strong>We used a retrospective observation study design to examine the study objectives. The COVID-19 Research Database Consortium provided the Analytics IQ PeopleCore consumer data and Office Alley claims data. The study period was from March 2020 to April 2021. Multiple logistic regression was used to determine the odds of using telehealth services.</p><p><strong>Results: </strong>We examined 3,478,287 unique Hispanic patients, 16.6% (577,396) of whom used telehealth. Results suggested that patients aged between 18 and 44 years were more likely to use telehealth (odds ratio [OR] 1.07, 95% CI 1.05-1.1; P<.001) than patients aged older than 65 years. Across all age groups, patients with high incomes were at least 20% more likely to use telehealth than patients with lower incomes (P<.001); patients who had a primary care physician (P=.01), exhibited high medical usage (P<.001), or were interested in exercise (P=.03) were more likely to use telehealth; patients who had unhealthy behaviors such as smoking and alcohol consumption were less likely to use telehealth (P<.001). Male patients were less likely than female patients to use telehealth among patients aged 65 years and older (OR 0.94, 95% CI 0.93-0.95; P<.001), while male patients aged between 18 and 44 years were more likely to use telehealth (OR 1.05, 95% CI 1.03-1.07; P<.001). Among patients younger than 65 years, full-time employment was positively associated with telehealth use (P<.001). Patients aged between 18 and 44 years with high school or less education were 2% less likely to use telehealth (OR 0.98, 95% CI 0.97-0.99; P=.005). Results also revealed a positive association with using WebMD (WebMD LLC) among patients aged older than 44 years (P<.001), while there was a negative association with electronic prescriptions among those who were aged between 18 and 44 years (P=.009) and aged between 45 and 64 years (P=.004).</p><p><strong>Conclusions: </strong>This study demonstrates that telehealth use among Hispanic communities is dependent upon factors such as age, gender, education, socioeconomic status, current health care engagement, and health behaviors. To address these challenges, we advocate for interdisciplinary approaches that involve medical professionals, insurance providers, and community-based services actively engaging with Hispanic communities and promoting telehealth use. We propose the following recommendations: enhance access to health insurance, improve access to primary care providers, and allocate fiscal a
背景:拉美裔社区是美国医疗保健系统中存在不公平现象的一个相当大的群体。随着医疗系统向数字医疗平台发展,评估其对西班牙裔社区的潜在影响至关重要:本研究旨在调查导致西班牙裔社区远程医疗使用率低的人口、社会经济和行为因素:我们采用回顾性观察研究设计来探讨研究目标。COVID-19 研究数据库联盟提供了 Analytics IQ PeopleCore 消费者数据和 Office Alley 索赔数据。研究时间为 2020 年 3 月至 2021 年 4 月。采用多元逻辑回归法确定使用远程医疗服务的几率:我们研究了 3,478,287 名独特的西班牙裔患者,其中 16.6% (577,396 人)使用了远程医疗。结果表明,年龄在 18-44 岁之间的患者更有可能使用远程保健服务(几率比 [OR] 1.07,95% CI 1.05-1.1;PC 结论:这项研究表明,远程保健服务在西班牙裔患者中的使用率较高:本研究表明,西语裔社区使用远程医疗取决于年龄、性别、教育程度、社会经济地位、当前医疗保健参与度和健康行为等因素。为了应对这些挑战,我们主张采用跨学科的方法,让医疗专业人员、保险提供商和社区服务机构积极参与拉美裔社区的活动,促进远程保健的使用。我们提出以下建议:提高医疗保险的可及性,改善初级保健提供者的可及性,分配财政和教育资源以支持远程保健的使用。随着远程保健对医疗服务的影响越来越大,专业人员必须促进人们使用所有可用的渠道来获得医疗服务。
{"title":"Telehealth Uptake Among Hispanic People During COVID-19: Retrospective Observational Study.","authors":"Di Shang, Cynthia Williams, Hera Culiqi","doi":"10.2196/57717","DOIUrl":"10.2196/57717","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;The Hispanic community represents a sizeable community that experiences inequities in the US health care system. As the system has moved toward digital health platforms, evaluating the potential impact on Hispanic communities is critical.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;The study aimed to investigate demographic, socioeconomic, and behavioral factors contributing to low telehealth use in Hispanic communities.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We used a retrospective observation study design to examine the study objectives. The COVID-19 Research Database Consortium provided the Analytics IQ PeopleCore consumer data and Office Alley claims data. The study period was from March 2020 to April 2021. Multiple logistic regression was used to determine the odds of using telehealth services.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;We examined 3,478,287 unique Hispanic patients, 16.6% (577,396) of whom used telehealth. Results suggested that patients aged between 18 and 44 years were more likely to use telehealth (odds ratio [OR] 1.07, 95% CI 1.05-1.1; P&lt;.001) than patients aged older than 65 years. Across all age groups, patients with high incomes were at least 20% more likely to use telehealth than patients with lower incomes (P&lt;.001); patients who had a primary care physician (P=.01), exhibited high medical usage (P&lt;.001), or were interested in exercise (P=.03) were more likely to use telehealth; patients who had unhealthy behaviors such as smoking and alcohol consumption were less likely to use telehealth (P&lt;.001). Male patients were less likely than female patients to use telehealth among patients aged 65 years and older (OR 0.94, 95% CI 0.93-0.95; P&lt;.001), while male patients aged between 18 and 44 years were more likely to use telehealth (OR 1.05, 95% CI 1.03-1.07; P&lt;.001). Among patients younger than 65 years, full-time employment was positively associated with telehealth use (P&lt;.001). Patients aged between 18 and 44 years with high school or less education were 2% less likely to use telehealth (OR 0.98, 95% CI 0.97-0.99; P=.005). Results also revealed a positive association with using WebMD (WebMD LLC) among patients aged older than 44 years (P&lt;.001), while there was a negative association with electronic prescriptions among those who were aged between 18 and 44 years (P=.009) and aged between 45 and 64 years (P=.004).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;This study demonstrates that telehealth use among Hispanic communities is dependent upon factors such as age, gender, education, socioeconomic status, current health care engagement, and health behaviors. To address these challenges, we advocate for interdisciplinary approaches that involve medical professionals, insurance providers, and community-based services actively engaging with Hispanic communities and promoting telehealth use. We propose the following recommendations: enhance access to health insurance, improve access to primary care providers, and allocate fiscal a","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e57717"},"PeriodicalIF":3.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11289584/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141763013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncovering Harmonization Potential in Health Care Data Through Iterative Refinement of Fast Healthcare Interoperability Resources Profiles Based on Retrospective Discrepancy Analysis: Case Study. 基于回顾性差异分析,通过迭代完善快速医疗互操作性资源档案,发掘医疗数据的协调潜力:案例研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-23 DOI: 10.2196/57005
Lorenz Rosenau, Paul Behrend, Joshua Wiedekopf, Julian Gruendner, Josef Ingenerf
<p><strong>Background: </strong>Cross-institutional interoperability between health care providers remains a recurring challenge worldwide. The German Medical Informatics Initiative, a collaboration of 37 university hospitals in Germany, aims to enable interoperability between partner sites by defining Fast Healthcare Interoperability Resources (FHIR) profiles for the cross-institutional exchange of health care data, the Core Data Set (CDS). The current CDS and its extension modules define elements representing patients' health care records. All university hospitals in Germany have made significant progress in providing routine data in a standardized format based on the CDS. In addition, the central research platform for health, the German Portal for Medical Research Data feasibility tool, allows medical researchers to query the available CDS data items across many participating hospitals.</p><p><strong>Objective: </strong>In this study, we aimed to evaluate a novel approach of combining the current top-down generated FHIR profiles with the bottom-up generated knowledge gained by the analysis of respective instance data. This allowed us to derive options for iteratively refining FHIR profiles using the information obtained from a discrepancy analysis.</p><p><strong>Methods: </strong>We developed an FHIR validation pipeline and opted to derive more restrictive profiles from the original CDS profiles. This decision was driven by the need to align more closely with the specific assumptions and requirements of the central feasibility platform's search ontology. While the original CDS profiles offer a generic framework adaptable for a broad spectrum of medical informatics use cases, they lack the specificity to model the nuanced criteria essential for medical researchers. A key example of this is the necessity to represent specific laboratory codings and values interdependencies accurately. The validation results allow us to identify discrepancies between the instance data at the clinical sites and the profiles specified by the feasibility platform and addressed in the future.</p><p><strong>Results: </strong>A total of 20 university hospitals participated in this study. Historical factors, lack of harmonization, a wide range of source systems, and case sensitivity of coding are some of the causes for the discrepancies identified. While in our case study, Conditions, Procedures, and Medications have a high degree of uniformity in the coding of instance data due to legislative requirements for billing in Germany, we found that laboratory values pose a significant data harmonization challenge due to their interdependency between coding and value.</p><p><strong>Conclusions: </strong>While the CDS achieves interoperability, different challenges for federated data access arise, requiring more specificity in the profiles to make assumptions on the instance data. We further argue that further harmonization of the instance data can significantly lower required
背景:医疗服务提供者之间的跨机构互操作性仍然是全球范围内经常面临的挑战。德国医疗信息学倡议是德国 37 家大学医院的合作项目,旨在通过定义用于跨机构交换医疗保健数据的快速医疗保健互操作性资源(FHIR)配置文件,即核心数据集(CDS),来实现合作医院之间的互操作性。当前的 CDS 及其扩展模块定义了代表患者医疗记录的元素。德国所有大学医院在以 CDS 为基础的标准化格式提供常规数据方面都取得了重大进展。此外,健康中央研究平台--德国医学研究数据门户网站可行性工具允许医学研究人员查询许多参与医院的可用 CDS 数据项:在这项研究中,我们旨在评估一种将当前自上而下生成的 FHIR 配置文件与通过分析各自实例数据获得的自下而上生成的知识相结合的新方法。这样,我们就能利用差异分析获得的信息,得出迭代完善 FHIR 配置文件的方案:我们开发了一个 FHIR 验证管道,并选择从原始 CDS 配置文件中提取限制性更强的配置文件。之所以做出这一决定,是因为需要更紧密地与中央可行性平台搜索本体的具体假设和要求保持一致。虽然原始 CDS 配置文件提供了一个通用框架,可适用于广泛的医学信息学用例,但它们缺乏具体性,无法模拟医学研究人员所必需的细微标准。这方面的一个重要例子就是必须准确地表示特定的实验室编码和值之间的相互依存关系。通过验证结果,我们可以发现临床站点的实例数据与可行性平台指定的配置文件之间的差异,并在今后加以解决:共有 20 家大学医院参与了这项研究。历史因素、缺乏统一、源系统范围广以及编码的病例敏感性是造成差异的部分原因。在我们的案例研究中,由于德国计费的立法要求,条件、程序和药物在实例数据编码方面具有高度的统一性,但我们发现,由于编码和价值之间的相互依存关系,实验室价值对数据协调构成了重大挑战:结论:虽然 CDS 实现了互操作性,但联合数据访问也面临着不同的挑战,需要更具体的配置文件才能对实例数据做出假设。我们还认为,进一步协调实例数据可以大大减少所需的追溯协调工作。我们认识到,差异不可能仅在临床现场得到解决;因此,我们的研究结果具有广泛的影响,需要各利益相关方在多个层面采取行动。
{"title":"Uncovering Harmonization Potential in Health Care Data Through Iterative Refinement of Fast Healthcare Interoperability Resources Profiles Based on Retrospective Discrepancy Analysis: Case Study.","authors":"Lorenz Rosenau, Paul Behrend, Joshua Wiedekopf, Julian Gruendner, Josef Ingenerf","doi":"10.2196/57005","DOIUrl":"10.2196/57005","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Cross-institutional interoperability between health care providers remains a recurring challenge worldwide. The German Medical Informatics Initiative, a collaboration of 37 university hospitals in Germany, aims to enable interoperability between partner sites by defining Fast Healthcare Interoperability Resources (FHIR) profiles for the cross-institutional exchange of health care data, the Core Data Set (CDS). The current CDS and its extension modules define elements representing patients' health care records. All university hospitals in Germany have made significant progress in providing routine data in a standardized format based on the CDS. In addition, the central research platform for health, the German Portal for Medical Research Data feasibility tool, allows medical researchers to query the available CDS data items across many participating hospitals.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;In this study, we aimed to evaluate a novel approach of combining the current top-down generated FHIR profiles with the bottom-up generated knowledge gained by the analysis of respective instance data. This allowed us to derive options for iteratively refining FHIR profiles using the information obtained from a discrepancy analysis.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We developed an FHIR validation pipeline and opted to derive more restrictive profiles from the original CDS profiles. This decision was driven by the need to align more closely with the specific assumptions and requirements of the central feasibility platform's search ontology. While the original CDS profiles offer a generic framework adaptable for a broad spectrum of medical informatics use cases, they lack the specificity to model the nuanced criteria essential for medical researchers. A key example of this is the necessity to represent specific laboratory codings and values interdependencies accurately. The validation results allow us to identify discrepancies between the instance data at the clinical sites and the profiles specified by the feasibility platform and addressed in the future.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;A total of 20 university hospitals participated in this study. Historical factors, lack of harmonization, a wide range of source systems, and case sensitivity of coding are some of the causes for the discrepancies identified. While in our case study, Conditions, Procedures, and Medications have a high degree of uniformity in the coding of instance data due to legislative requirements for billing in Germany, we found that laboratory values pose a significant data harmonization challenge due to their interdependency between coding and value.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;While the CDS achieves interoperability, different challenges for federated data access arise, requiring more specificity in the profiles to make assumptions on the instance data. We further argue that further harmonization of the instance data can significantly lower required","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e57005"},"PeriodicalIF":3.1,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11303887/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141749840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of In-Hospital Cardiac Arrest in the Intensive Care Unit: Machine Learning-Based Multimodal Approach. 重症监护病房院内心脏骤停预测:基于机器学习的多模态方法。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-23 DOI: 10.2196/49142
Hsin-Ying Lee, Po-Chih Kuo, Frank Qian, Chien-Hung Li, Jiun-Ruey Hu, Wan-Ting Hsu, Hong-Jie Jhou, Po-Huang Chen, Cho-Hao Lee, Chin-Hua Su, Po-Chun Liao, I-Ju Wu, Chien-Chang Lee

Background: Early identification of impending in-hospital cardiac arrest (IHCA) improves clinical outcomes but remains elusive for practicing clinicians.

Objective: We aimed to develop a multimodal machine learning algorithm based on ensemble techniques to predict the occurrence of IHCA.

Methods: Our model was developed by the Multiparameter Intelligent Monitoring of Intensive Care (MIMIC)-IV database and validated in the Electronic Intensive Care Unit Collaborative Research Database (eICU-CRD). Baseline features consisting of patient demographics, presenting illness, and comorbidities were collected to train a random forest model. Next, vital signs were extracted to train a long short-term memory model. A support vector machine algorithm then stacked the results to form the final prediction model.

Results: Of 23,909 patients in the MIMIC-IV database and 10,049 patients in the eICU-CRD database, 452 and 85 patients, respectively, had IHCA. At 13 hours in advance of an IHCA event, our algorithm had already demonstrated an area under the receiver operating characteristic curve of 0.85 (95% CI 0.815-0.885) in the MIMIC-IV database. External validation with the eICU-CRD and National Taiwan University Hospital databases also presented satisfactory results, showing area under the receiver operating characteristic curve values of 0.81 (95% CI 0.763-0.851) and 0.945 (95% CI 0.934-0.956), respectively.

Conclusions: Using only vital signs and information available in the electronic medical record, our model demonstrates it is possible to detect a trajectory of clinical deterioration up to 13 hours in advance. This predictive tool, which has undergone external validation, could forewarn and help clinicians identify patients in need of assessment to improve their overall prognosis.

背景:早期识别即将发生的院内心脏骤停(IHCA)可改善临床预后,但对于临床医生来说仍难以捉摸:我们旨在开发一种基于集合技术的多模态机器学习算法,以预测 IHCA 的发生:我们的模型由重症监护多参数智能监测(MIMIC)-IV 数据库开发,并在重症监护室合作研究电子数据库(eICU-CRD)中进行了验证。收集的基线特征包括患者人口统计学特征、主诉疾病和合并症,用于训练随机森林模型。接着,提取生命体征来训练长短期记忆模型。然后,支持向量机算法将结果叠加,形成最终预测模型:在 MIMIC-IV 数据库的 23909 名患者和 eICU-CRD 数据库的 10049 名患者中,分别有 452 名和 85 名患者发生了 IHCA。在 IHCA 事件发生前 13 小时,我们的算法在 MIMIC-IV 数据库中的接收器操作特征曲线下面积已达到 0.85(95% CI 0.815-0.885)。eICU-CRD和台湾大学医院数据库的外部验证结果也令人满意,接收器操作特征曲线下面积值分别为0.81(95% CI 0.763-0.851)和0.945(95% CI 0.934-0.956):我们的模型仅使用生命体征和电子病历中的信息,就能提前 13 小时发现临床恶化的轨迹。这一预测工具已经过外部验证,可以提前预警并帮助临床医生识别需要评估的患者,从而改善他们的整体预后。
{"title":"Prediction of In-Hospital Cardiac Arrest in the Intensive Care Unit: Machine Learning-Based Multimodal Approach.","authors":"Hsin-Ying Lee, Po-Chih Kuo, Frank Qian, Chien-Hung Li, Jiun-Ruey Hu, Wan-Ting Hsu, Hong-Jie Jhou, Po-Huang Chen, Cho-Hao Lee, Chin-Hua Su, Po-Chun Liao, I-Ju Wu, Chien-Chang Lee","doi":"10.2196/49142","DOIUrl":"10.2196/49142","url":null,"abstract":"<p><strong>Background: </strong>Early identification of impending in-hospital cardiac arrest (IHCA) improves clinical outcomes but remains elusive for practicing clinicians.</p><p><strong>Objective: </strong>We aimed to develop a multimodal machine learning algorithm based on ensemble techniques to predict the occurrence of IHCA.</p><p><strong>Methods: </strong>Our model was developed by the Multiparameter Intelligent Monitoring of Intensive Care (MIMIC)-IV database and validated in the Electronic Intensive Care Unit Collaborative Research Database (eICU-CRD). Baseline features consisting of patient demographics, presenting illness, and comorbidities were collected to train a random forest model. Next, vital signs were extracted to train a long short-term memory model. A support vector machine algorithm then stacked the results to form the final prediction model.</p><p><strong>Results: </strong>Of 23,909 patients in the MIMIC-IV database and 10,049 patients in the eICU-CRD database, 452 and 85 patients, respectively, had IHCA. At 13 hours in advance of an IHCA event, our algorithm had already demonstrated an area under the receiver operating characteristic curve of 0.85 (95% CI 0.815-0.885) in the MIMIC-IV database. External validation with the eICU-CRD and National Taiwan University Hospital databases also presented satisfactory results, showing area under the receiver operating characteristic curve values of 0.81 (95% CI 0.763-0.851) and 0.945 (95% CI 0.934-0.956), respectively.</p><p><strong>Conclusions: </strong>Using only vital signs and information available in the electronic medical record, our model demonstrates it is possible to detect a trajectory of clinical deterioration up to 13 hours in advance. This predictive tool, which has undergone external validation, could forewarn and help clinicians identify patients in need of assessment to improve their overall prognosis.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e49142"},"PeriodicalIF":3.1,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11287234/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141763012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construction of a Multi-Label Classifier for Extracting Multiple Incident Factors From Medication Incident Reports in Residential Care Facilities: Natural Language Processing Approach. 从养老院用药事故报告中提取多种事故因素的多标签分类器的构建:自然语言处理方法
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-23 DOI: 10.2196/58141
Hayato Kizaki, Hiroki Satoh, Sayaka Ebara, Satoshi Watabe, Yasufumi Sawada, Shungo Imai, Satoko Hori
<p><strong>Background: </strong>Medication safety in residential care facilities is a critical concern, particularly when nonmedical staff provide medication assistance. The complex nature of medication-related incidents in these settings, coupled with the psychological impact on health care providers, underscores the need for effective incident analysis and preventive strategies. A thorough understanding of the root causes, typically through incident-report analysis, is essential for mitigating medication-related incidents.</p><p><strong>Objective: </strong>We aimed to develop and evaluate a multilabel classifier using natural language processing to identify factors contributing to medication-related incidents using incident report descriptions from residential care facilities, with a focus on incidents involving nonmedical staff.</p><p><strong>Methods: </strong>We analyzed 2143 incident reports, comprising 7121 sentences, from residential care facilities in Japan between April 1, 2015, and March 31, 2016. The incident factors were annotated using sentences based on an established organizational factor model and previous research findings. The following 9 factors were defined: procedure adherence, medicine, resident, resident family, nonmedical staff, medical staff, team, environment, and organizational management. To assess the label criteria, 2 researchers with relevant medical knowledge annotated a subset of 50 reports; the interannotator agreement was measured using Cohen κ. The entire data set was subsequently annotated by 1 researcher. Multiple labels were assigned to each sentence. A multilabel classifier was developed using deep learning models, including 2 Bidirectional Encoder Representations From Transformers (BERT)-type models (Tohoku-BERT and a University of Tokyo Hospital BERT pretrained with Japanese clinical text: UTH-BERT) and an Efficiently Learning Encoder That Classifies Token Replacements Accurately (ELECTRA), pretrained on Japanese text. Both sentence- and report-level training were performed; the performance was evaluated by the F<sub>1</sub>-score and exact match accuracy through 5-fold cross-validation.</p><p><strong>Results: </strong>Among all 7121 sentences, 1167, 694, 2455, 23, 1905, 46, 195, 1104, and 195 included "procedure adherence," "medicine," "resident," "resident family," "nonmedical staff," "medical staff," "team," "environment," and "organizational management," respectively. Owing to limited labels, "resident family" and "medical staff" were omitted from the model development process. The interannotator agreement values were higher than 0.6 for each label. A total of 10, 278, and 1855 reports contained no, 1, and multiple labels, respectively. The models trained using the report data outperformed those trained using sentences, with macro F<sub>1</sub>-scores of 0.744, 0.675, and 0.735 for Tohoku-BERT, UTH-BERT, and ELECTRA, respectively. The report-trained models also demonstrated better exact match accuracy
背景:住院护理设施中的用药安全是一个至关重要的问题,尤其是在非医务人员提供用药协助的情况下。在这些环境中,与用药相关的事故性质复杂,加上对医护人员的心理影响,凸显了有效的事故分析和预防策略的必要性。通常情况下,通过事故报告分析来透彻了解根本原因,对于减少用药相关事故至关重要:我们的目标是开发并评估一种使用自然语言处理的多标签分类器,该分类器可利用寄宿式护理机构的事故报告描述识别导致药物相关事故的因素,重点关注涉及非医务人员的事故:我们分析了 2015 年 4 月 1 日至 2016 年 3 月 31 日期间来自日本养老机构的 2143 份事故报告,其中包含 7121 个句子。根据已建立的组织因素模型和以往的研究成果,使用句子对事件因素进行了注释。定义了以下 9 个因素:程序遵守、医疗、居民、居民家庭、非医疗人员、医疗人员、团队、环境和组织管理。为了评估标注标准,两名具有相关医学知识的研究人员对 50 份报告的子集进行了标注;标注者之间的一致性采用 Cohen κ 进行测量。随后,由一名研究人员对整个数据集进行标注。每个句子都有多个标签。使用深度学习模型开发了多标签分类器,其中包括 2 个双向编码器表征转换器(BERT)型模型(Tohoku-BERT 和东京大学医院 BERT,使用日语临床文本进行预训练:UTH-BERT)和以日语文本为基础进行预训练的 "可准确分类标记替换的高效学习编码器"(ELECTRA)。对句子和报告进行了训练;通过 5 倍交叉验证,以 F1 分数和精确匹配准确率来评估性能:在所有 7121 个句子中,分别有 1167、694、2455、23、1905、46、195、1104 和 195 个句子包含 "遵守程序"、"医学"、"居民"、"居民家庭"、"非医务人员"、"医务人员"、"团队"、"环境 "和 "组织管理"。由于标签有限,"居民家庭 "和 "医务人员 "在模型开发过程中被省略。每个标签的注释者间一致值均高于 0.6。分别有 10 份、278 份和 1855 份报告没有、1 份和多个标签。使用报告数据训练的模型优于使用句子训练的模型,Tohoku-BERT、UTH-BERT 和 ELECTRA 的宏观 F1 分数分别为 0.744、0.675 和 0.735。经过报告训练的模型也表现出更高的精确匹配准确率,Tohoku-BERT、UTH-BERT 和 ELECTRA 的准确匹配准确率分别为 0.411、0.389 和 0.399。值得注意的是,即使只分析包含多个标签的报告,准确率也是一致的:我们在研究中开发的多标签分类器证明了它在利用养老院的事故报告识别与用药相关事故有关的各种因素方面的潜力。因此,该分类器可促进对事故因素的及时分析,从而有助于风险管理和预防策略的制定。
{"title":"Construction of a Multi-Label Classifier for Extracting Multiple Incident Factors From Medication Incident Reports in Residential Care Facilities: Natural Language Processing Approach.","authors":"Hayato Kizaki, Hiroki Satoh, Sayaka Ebara, Satoshi Watabe, Yasufumi Sawada, Shungo Imai, Satoko Hori","doi":"10.2196/58141","DOIUrl":"10.2196/58141","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Medication safety in residential care facilities is a critical concern, particularly when nonmedical staff provide medication assistance. The complex nature of medication-related incidents in these settings, coupled with the psychological impact on health care providers, underscores the need for effective incident analysis and preventive strategies. A thorough understanding of the root causes, typically through incident-report analysis, is essential for mitigating medication-related incidents.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;We aimed to develop and evaluate a multilabel classifier using natural language processing to identify factors contributing to medication-related incidents using incident report descriptions from residential care facilities, with a focus on incidents involving nonmedical staff.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We analyzed 2143 incident reports, comprising 7121 sentences, from residential care facilities in Japan between April 1, 2015, and March 31, 2016. The incident factors were annotated using sentences based on an established organizational factor model and previous research findings. The following 9 factors were defined: procedure adherence, medicine, resident, resident family, nonmedical staff, medical staff, team, environment, and organizational management. To assess the label criteria, 2 researchers with relevant medical knowledge annotated a subset of 50 reports; the interannotator agreement was measured using Cohen κ. The entire data set was subsequently annotated by 1 researcher. Multiple labels were assigned to each sentence. A multilabel classifier was developed using deep learning models, including 2 Bidirectional Encoder Representations From Transformers (BERT)-type models (Tohoku-BERT and a University of Tokyo Hospital BERT pretrained with Japanese clinical text: UTH-BERT) and an Efficiently Learning Encoder That Classifies Token Replacements Accurately (ELECTRA), pretrained on Japanese text. Both sentence- and report-level training were performed; the performance was evaluated by the F&lt;sub&gt;1&lt;/sub&gt;-score and exact match accuracy through 5-fold cross-validation.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Among all 7121 sentences, 1167, 694, 2455, 23, 1905, 46, 195, 1104, and 195 included \"procedure adherence,\" \"medicine,\" \"resident,\" \"resident family,\" \"nonmedical staff,\" \"medical staff,\" \"team,\" \"environment,\" and \"organizational management,\" respectively. Owing to limited labels, \"resident family\" and \"medical staff\" were omitted from the model development process. The interannotator agreement values were higher than 0.6 for each label. A total of 10, 278, and 1855 reports contained no, 1, and multiple labels, respectively. The models trained using the report data outperformed those trained using sentences, with macro F&lt;sub&gt;1&lt;/sub&gt;-scores of 0.744, 0.675, and 0.735 for Tohoku-BERT, UTH-BERT, and ELECTRA, respectively. The report-trained models also demonstrated better exact match accuracy","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e58141"},"PeriodicalIF":3.1,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11303886/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141749839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed Statistical Analyses: A Scoping Review and Examples of Operational Frameworks Adapted to Health Analytics. 分布式统计分析:范围审查和适用于健康分析的操作框架实例》。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-19 DOI: 10.2196/53622
Félix Camirand Lemyre, Simon Lévesque, Marie-Pier Domingue, Klaus Herrmann, Jean-François Ethier

Background: Data from multiple organizations are crucial for advancing learning health systems. However, ethical, legal, and social concerns may restrict the use of standard statistical methods that rely on pooling data. Although distributed algorithms offer alternatives, they may not always be suitable for health frameworks.

Objective: This paper aims to support researchers and data custodians in three ways: (1) providing a concise overview of the literature on statistical inference methods for horizontally partitioned data; (2) describing the methods applicable to generalized linear models (GLM) and assessing their underlying distributional assumptions; (3) adapting existing methods to make them fully usable in health settings.

Methods: A scoping review methodology was employed for the literature mapping, from which methods presenting a methodological framework for GLM analyses with horizontally partitioned data were identified and assessed from the perspective of applicability in health settings. Statistical theory was used to adapt methods and to derive the properties of the resulting estimators.

Results: From the review, 41 articles were selected, and six approaches were extracted for conducting standard GLM-based statistical analysis. However, these approaches assumed evenly and identically distributed data across nodes. Consequently, statistical procedures were derived to accommodate uneven node sample sizes and heterogeneous data distributions across nodes. Workflows and detailed algorithms were developed to highlight information-sharing requirements and operational complexity.

Conclusions: This paper contributes to the field of health analytics by providing an overview of the methods that can be used with horizontally partitioned data, by adapting these methods to the context of heterogeneous health data and by clarifying the workflows and quantities exchanged by the methods discussed. Further analysis of the confidentiality preserved by these methods is needed to fully understand the risk associated with the sharing of summary statistics.

背景:来自多个组织的数据对于推进学习型医疗系统至关重要。然而,伦理、法律和社会问题可能会限制使用依赖于数据汇集的标准统计方法。尽管分布式算法提供了替代方案,但它们并不总是适合健康框架:本文旨在从三个方面为研究人员和数据保管人员提供支持:(1)提供有关横向分割数据统计推断方法的文献概览;(2)描述适用于广义线性模型(GLM)的方法并评估其基本分布假设;(3)调整现有方法,使其完全适用于卫生环境:方法:采用范围综述的方法绘制文献图谱,从中发现并从卫生环境适用性的角度评估了为横向分割数据的广义线性模型分析提供方法框架的方法。统计理论被用来调整方法和推导所产生的估计器的特性:从综述中筛选出 41 篇文章,并提取出六种方法用于进行基于 GLM 的标准统计分析。然而,这些方法都假定各节点的数据分布均匀且相同。因此,为了适应节点样本大小不均和节点间数据分布不均的情况,我们推导出了统计程序。还开发了工作流程和详细算法,以突出信息共享要求和操作复杂性:本文概述了可用于水平分割数据的方法,将这些方法调整到异构健康数据的环境中,并阐明了所讨论方法的工作流程和交换的数量,从而为健康分析领域做出了贡献。需要进一步分析这些方法的保密性,以充分了解与共享汇总统计数据相关的风险。
{"title":"Distributed Statistical Analyses: A Scoping Review and Examples of Operational Frameworks Adapted to Health Analytics.","authors":"Félix Camirand Lemyre, Simon Lévesque, Marie-Pier Domingue, Klaus Herrmann, Jean-François Ethier","doi":"10.2196/53622","DOIUrl":"https://doi.org/10.2196/53622","url":null,"abstract":"<p><strong>Background: </strong>Data from multiple organizations are crucial for advancing learning health systems. However, ethical, legal, and social concerns may restrict the use of standard statistical methods that rely on pooling data. Although distributed algorithms offer alternatives, they may not always be suitable for health frameworks.</p><p><strong>Objective: </strong>This paper aims to support researchers and data custodians in three ways: (1) providing a concise overview of the literature on statistical inference methods for horizontally partitioned data; (2) describing the methods applicable to generalized linear models (GLM) and assessing their underlying distributional assumptions; (3) adapting existing methods to make them fully usable in health settings.</p><p><strong>Methods: </strong>A scoping review methodology was employed for the literature mapping, from which methods presenting a methodological framework for GLM analyses with horizontally partitioned data were identified and assessed from the perspective of applicability in health settings. Statistical theory was used to adapt methods and to derive the properties of the resulting estimators.</p><p><strong>Results: </strong>From the review, 41 articles were selected, and six approaches were extracted for conducting standard GLM-based statistical analysis. However, these approaches assumed evenly and identically distributed data across nodes. Consequently, statistical procedures were derived to accommodate uneven node sample sizes and heterogeneous data distributions across nodes. Workflows and detailed algorithms were developed to highlight information-sharing requirements and operational complexity.</p><p><strong>Conclusions: </strong>This paper contributes to the field of health analytics by providing an overview of the methods that can be used with horizontally partitioned data, by adapting these methods to the context of heterogeneous health data and by clarifying the workflows and quantities exchanged by the methods discussed. Further analysis of the confidentiality preserved by these methods is needed to fully understand the risk associated with the sharing of summary statistics.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":" ","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141728335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Information and Communication Technology Maturity Assessment at Primary Health Care Services Across 9 Provinces in Indonesia: Evaluation Study. 印度尼西亚 9 个省的初级卫生保健服务信息和通信技术成熟度评估:评价研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-18 DOI: 10.2196/55959
Dewi Nur Aisyah, Agus Heri Setiawan, Alfiano Fawwaz Lokopessy, Nadia Faradiba, Setiaji Setiaji, Logan Manikam, Zisis Kozlakidis
<p><strong>Background: </strong>Indonesia has rapidly embraced digital health, particularly during the COVID-19 pandemic, with over 15 million daily health application users. To advance its digital health vision, the government is prioritizing the development of health data and application systems into an integrated health care technology ecosystem. This initiative involves all levels of health care, from primary to tertiary, across all provinces. In particular, it aims to enhance primary health care services (as the main interface with the general population) and contribute to Indonesia's digital health transformation.</p><p><strong>Objective: </strong>This study assesses the information and communication technology (ICT) maturity in Indonesian health care services to advance digital health initiatives. ICT maturity assessment tools, specifically designed for middle-income countries, were used to evaluate digital health capabilities in 9 provinces across 5 Indonesian islands.</p><p><strong>Methods: </strong>A cross-sectional survey was conducted from February to March 2022, in 9 provinces across Indonesia, representing the country's diverse conditions on its major islands. Respondents included staff from public health centers (Puskesmas), primary care clinics (Klinik Pratama), and district health offices (Dinas Kesehatan Kabupaten/Kota). The survey used adapted ICT maturity assessment questionnaires, covering human resources, software and system, hardware, and infrastructure. It was administered electronically and involved 121 public health centers, 49 primary care clinics, and 67 IT staff from district health offices. Focus group discussions were held to delve deeper into the assessment results and gain more descriptive insights.</p><p><strong>Results: </strong>In this study, 237 participants represented 3 distinct categories: 121 public health centers, 67 district health offices, and 49 primary clinics. These instances were selected from a sample of 9 of the 34 provinces in Indonesia. Collected data from interviews and focus group discussions were transformed into scores on a scale of 1 to 5, with 1 indicating low ICT readiness and 5 indicating high ICT readiness. On average, the breakdown of ICT maturity scores was as follows: 2.71 for human resources' capability in ICT use and system management, 2.83 for software and information systems, 2.59 for hardware, and 2.84 for infrastructure, resulting in an overall average score of 2.74. According to the ICT maturity level pyramid, the ICT maturity of health care providers in Indonesia fell between the basic and good levels. The need to pursue best practices also emerged strongly. Further analysis of the ICT maturity scores, when examined by province, revealed regional variations.</p><p><strong>Conclusions: </strong>The maturity of ICT use is influenced by several critical components. Enhancing human resources, ensuring infrastructure, the availability of supportive hardware, and optimizing informa
背景:印度尼西亚已迅速采用数字医疗技术,尤其是在 COVID-19 大流行期间,每天有超过 1500 万健康应用用户。为推进其数字医疗愿景,政府正在优先发展医疗数据和应用系统,使其成为一个综合医疗保健技术生态系统。这一举措涉及各省从初级到三级的各级医疗保健。尤其是,它旨在加强初级医疗保健服务(作为与普通民众的主要联系渠道),并为印尼的数字医疗转型做出贡献:本研究对印尼医疗保健服务中的信息与通信技术(ICT)成熟度进行评估,以推进数字医疗计划。采用专为中等收入国家设计的 ICT 成熟度评估工具,对印度尼西亚 5 个岛屿 9 个省的数字医疗能力进行评估:2022 年 2 月至 3 月,我们在印度尼西亚的 9 个省份进行了横向调查,这些省份代表了该国主要岛屿的不同情况。受访者包括公共卫生中心(Puskesmas)、初级保健诊所(Klinik Pratama)和地区卫生办公室(Dinas Kesehatan Kabupaten/Kota)的工作人员。调查使用了经过改编的信息与通信技术成熟度评估问卷,涵盖人力资源、软件与系统、硬件和基础设施。调查以电子方式进行,涉及 121 家公共卫生中心、49 家初级保健诊所和 67 名来自地区卫生办公室的 IT 人员。为深入了解评估结果并获得更多描述性见解,还举行了焦点小组讨论:在这项研究中,237 名参与者代表了 3 个不同的类别:结果:在这项研究中,237 名参与者代表了 3 个不同的类别:121 个公共卫生中心、67 个地区卫生办公室和 49 个初级诊所。这些实例选自印度尼西亚 34 个省中的 9 个省。从访谈和焦点小组讨论中收集到的数据被转换成 1 到 5 的分数,1 表示信息与通讯技术准备程度低,5 表示信息与通讯技术准备程度高。平均而言,信息和通信技术成熟度得分细分如下:人力资源在信息与传播技术使用和系统管理方面的能力为 2.71 分,软件和信息系统为 2.83 分,硬件为 2.59 分,基础设施为 2.84 分,总平均分为 2.74 分。根据信息和通信技术成熟度金字塔,印度尼西亚医疗机构的信息和通信技术成熟度介于基本和良好之间。追求最佳做法的必要性也非常明显。按省份对信息和通信技术成熟度得分进行的进一步分析表明,各地区之间存在差异:信息和通信技术使用的成熟度受几个关键因素的影响。加强人力资源、确保基础设施、支持性硬件的可用性以及优化信息系统,是在医疗保健服务中实现信息与传播技术成熟度的当务之急。在信息和通信技术成熟度评估方面,9 个省的各级医疗保健机构的得分差异很大,这突出表明了信息和通信技术就绪程度的多样性,以及采取因地制宜的后续行动的必要性。
{"title":"The Information and Communication Technology Maturity Assessment at Primary Health Care Services Across 9 Provinces in Indonesia: Evaluation Study.","authors":"Dewi Nur Aisyah, Agus Heri Setiawan, Alfiano Fawwaz Lokopessy, Nadia Faradiba, Setiaji Setiaji, Logan Manikam, Zisis Kozlakidis","doi":"10.2196/55959","DOIUrl":"10.2196/55959","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Indonesia has rapidly embraced digital health, particularly during the COVID-19 pandemic, with over 15 million daily health application users. To advance its digital health vision, the government is prioritizing the development of health data and application systems into an integrated health care technology ecosystem. This initiative involves all levels of health care, from primary to tertiary, across all provinces. In particular, it aims to enhance primary health care services (as the main interface with the general population) and contribute to Indonesia's digital health transformation.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study assesses the information and communication technology (ICT) maturity in Indonesian health care services to advance digital health initiatives. ICT maturity assessment tools, specifically designed for middle-income countries, were used to evaluate digital health capabilities in 9 provinces across 5 Indonesian islands.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;A cross-sectional survey was conducted from February to March 2022, in 9 provinces across Indonesia, representing the country's diverse conditions on its major islands. Respondents included staff from public health centers (Puskesmas), primary care clinics (Klinik Pratama), and district health offices (Dinas Kesehatan Kabupaten/Kota). The survey used adapted ICT maturity assessment questionnaires, covering human resources, software and system, hardware, and infrastructure. It was administered electronically and involved 121 public health centers, 49 primary care clinics, and 67 IT staff from district health offices. Focus group discussions were held to delve deeper into the assessment results and gain more descriptive insights.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;In this study, 237 participants represented 3 distinct categories: 121 public health centers, 67 district health offices, and 49 primary clinics. These instances were selected from a sample of 9 of the 34 provinces in Indonesia. Collected data from interviews and focus group discussions were transformed into scores on a scale of 1 to 5, with 1 indicating low ICT readiness and 5 indicating high ICT readiness. On average, the breakdown of ICT maturity scores was as follows: 2.71 for human resources' capability in ICT use and system management, 2.83 for software and information systems, 2.59 for hardware, and 2.84 for infrastructure, resulting in an overall average score of 2.74. According to the ICT maturity level pyramid, the ICT maturity of health care providers in Indonesia fell between the basic and good levels. The need to pursue best practices also emerged strongly. Further analysis of the ICT maturity scores, when examined by province, revealed regional variations.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;The maturity of ICT use is influenced by several critical components. Enhancing human resources, ensuring infrastructure, the availability of supportive hardware, and optimizing informa","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e55959"},"PeriodicalIF":3.1,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11269960/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141735796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Lake, Data Warehouse, Datamart, and Feature Store: Their Contributions to the Complete Data Reuse Pipeline 数据湖、数据仓库、数据图表和特征库:它们对完整数据重用管道的贡献
IF 3.2 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-17 DOI: 10.2196/54590
Antoine Lamer, Chloé Saint-Dizier, Nicolas Paris, Emmanuel Chazard
The growing adoption and utilization of health information technology has generated a wealth of clinical data in electronic format, offering opportunities for data reuse beyond direct patient care. However, as data are distributed across multiple software, it becomes challenging to cross-reference information between sources due to differences in formats, vocabularies, technologies, and the absence of common identifiers among software. To address these challenges, hospitals have adopted data warehouses to consolidate and standardize these data for research. Additionally, as a complement or alternative, data lakes store both source data and metadata in a detailed and unprocessed format, empowering exploration, manipulation, and adaptation of the data to meet specific analytical needs. Subsequently, datamarts are utilized to further refine data into usable information tailored to specific research questions. However, for efficient analysis, a feature store is essential to pivot and denormalize the data, simplifying queries. In conclusion, while data warehouses are crucial, data lakes, datamarts and feature stores play essential and complementary roles in facilitating data reuse for research and analysis in healthcare.
随着医疗信息技术被越来越多地采用和使用,产生了大量电子格式的临床数据,为病人直接护理之外的数据再利用提供了机会。然而,由于数据分布在多个软件中,格式、词汇、技术存在差异,而且软件之间缺乏通用标识符,因此在不同来源之间交叉引用信息变得十分困难。为了应对这些挑战,医院采用了数据仓库来整合这些数据并使其标准化,以便进行研究。此外,作为一种补充或替代方法,数据湖以详细和未经处理的格式存储源数据和元数据,允许对数据进行探索、操作和调整,以满足特定的分析需求。随后,数据图表可用于将数据进一步细化为针对特定研究问题的可用信息。不过,为了进行高效分析,必须使用特征存储来透视和去规范化数据,从而简化查询。总之,虽然数据仓库至关重要,但数据湖、数据图表和特征库在促进医疗保健研究和分析中的数据再利用方面也发挥着重要的互补作用。
{"title":"Data Lake, Data Warehouse, Datamart, and Feature Store: Their Contributions to the Complete Data Reuse Pipeline","authors":"Antoine Lamer, Chloé Saint-Dizier, Nicolas Paris, Emmanuel Chazard","doi":"10.2196/54590","DOIUrl":"https://doi.org/10.2196/54590","url":null,"abstract":"The growing adoption and utilization of health information technology has generated a wealth of clinical data in electronic format, offering opportunities for data reuse beyond direct patient care. However, as data are distributed across multiple software, it becomes challenging to cross-reference information between sources due to differences in formats, vocabularies, technologies, and the absence of common identifiers among software. To address these challenges, hospitals have adopted data warehouses to consolidate and standardize these data for research. Additionally, as a complement or alternative, data lakes store both source data and metadata in a detailed and unprocessed format, empowering exploration, manipulation, and adaptation of the data to meet specific analytical needs. Subsequently, datamarts are utilized to further refine data into usable information tailored to specific research questions. However, for efficient analysis, a feature store is essential to pivot and denormalize the data, simplifying queries. In conclusion, while data warehouses are crucial, data lakes, datamarts and feature stores play essential and complementary roles in facilitating data reuse for research and analysis in healthcare.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"66 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141719754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Large Language Models for Automated Reporting and Data Systems Categorization: Cross-Sectional Study. 评估用于自动报告和数据系统分类的大型语言模型:横断面研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-17 DOI: 10.2196/55799
Qingxia Wu, Qingxia Wu, Huali Li, Yan Wang, Yan Bai, Yaping Wu, Xuan Yu, Xiaodong Li, Pei Dong, Jon Xue, Dinggang Shen, Meiyun Wang

Background: Large language models show promise for improving radiology workflows, but their performance on structured radiological tasks such as Reporting and Data Systems (RADS) categorization remains unexplored.

Objective: This study aims to evaluate 3 large language model chatbots-Claude-2, GPT-3.5, and GPT-4-on assigning RADS categories to radiology reports and assess the impact of different prompting strategies.

Methods: This cross-sectional study compared 3 chatbots using 30 radiology reports (10 per RADS criteria), using a 3-level prompting strategy: zero-shot, few-shot, and guideline PDF-informed prompts. The cases were grounded in Liver Imaging Reporting & Data System (LI-RADS) version 2018, Lung CT (computed tomography) Screening Reporting & Data System (Lung-RADS) version 2022, and Ovarian-Adnexal Reporting & Data System (O-RADS) magnetic resonance imaging, meticulously prepared by board-certified radiologists. Each report underwent 6 assessments. Two blinded reviewers assessed the chatbots' response at patient-level RADS categorization and overall ratings. The agreement across repetitions was assessed using Fleiss κ.

Results: Claude-2 achieved the highest accuracy in overall ratings with few-shot prompts and guideline PDFs (prompt-2), attaining 57% (17/30) average accuracy over 6 runs and 50% (15/30) accuracy with k-pass voting. Without prompt engineering, all chatbots performed poorly. The introduction of a structured exemplar prompt (prompt-1) increased the accuracy of overall ratings for all chatbots. Providing prompt-2 further improved Claude-2's performance, an enhancement not replicated by GPT-4. The interrun agreement was substantial for Claude-2 (k=0.66 for overall rating and k=0.69 for RADS categorization), fair for GPT-4 (k=0.39 for both), and fair for GPT-3.5 (k=0.21 for overall rating and k=0.39 for RADS categorization). All chatbots showed significantly higher accuracy with LI-RADS version 2018 than with Lung-RADS version 2022 and O-RADS (P<.05); with prompt-2, Claude-2 achieved the highest overall rating accuracy of 75% (45/60) in LI-RADS version 2018.

Conclusions: When equipped with structured prompts and guideline PDFs, Claude-2 demonstrated potential in assigning RADS categories to radiology cases according to established criteria such as LI-RADS version 2018. However, the current generation of chatbots lags in accurately categorizing cases based on more recent RADS criteria.

背景:大型语言模型有望改善放射学工作流程,但它们在报告和数据系统(RADS)分类等结构化放射学任务中的表现仍有待探索:本研究旨在评估 3 个大型语言模型聊天机器人--Claude-2、GPT-3.5 和 GPT-4--在为放射学报告分配 RADS 类别方面的表现,并评估不同提示策略的影响:这项横断面研究使用 30 份放射学报告(每份 RADS 标准 10 份)对 3 个聊天机器人进行了比较,并使用了 3 级提示策略:零镜头、少镜头和指南 PDF 信息提示。这些病例基于肝脏成像报告和数据系统(LI-RADS)2018 年版、肺部 CT(计算机断层扫描)筛查报告和数据系统(Lung-RADS)2022 年版以及卵巢-附件报告和数据系统(O-RADS)磁共振成像,由经委员会认证的放射科医生精心准备。每份报告都经过 6 次评估。两名盲审员评估聊天机器人在患者级别 RADS 分类和总体评分方面的反应。结果:结果:Claude-2 在使用很少的提示和指南 PDF(提示-2)进行总体评分时达到了最高的准确率,6 次运行的平均准确率为 57%(17/30),使用 k-pass 投票的准确率为 50%(15/30)。如果没有提示工程,所有聊天机器人的表现都很差。引入结构化示例提示(提示-1)后,所有聊天机器人的总体评分准确率都有所提高。提示-2进一步提高了Claude-2的表现,而GPT-4没有复制这种提高。Claude-2 的运行间一致性很高(总体评分 k=0.66,RADS 分类 k=0.69),GPT-4 的运行间一致性一般(两者的 k=0.39),GPT-3.5 的运行间一致性一般(总体评分 k=0.21,RADS 分类 k=0.39)。所有聊天机器人对LI-RADS 2018版的准确率都明显高于Lung-RADS 2022版和O-RADS(PConclusions:当配备结构化提示和指南 PDF 时,Claude-2 在根据既定标准(如 LI-RADS 2018 版)为放射病例分配 RADS 类别方面表现出了潜力。然而,目前的聊天机器人在根据最新的 RADS 标准对病例进行准确分类方面还比较落后。
{"title":"Evaluating Large Language Models for Automated Reporting and Data Systems Categorization: Cross-Sectional Study.","authors":"Qingxia Wu, Qingxia Wu, Huali Li, Yan Wang, Yan Bai, Yaping Wu, Xuan Yu, Xiaodong Li, Pei Dong, Jon Xue, Dinggang Shen, Meiyun Wang","doi":"10.2196/55799","DOIUrl":"10.2196/55799","url":null,"abstract":"<p><strong>Background: </strong>Large language models show promise for improving radiology workflows, but their performance on structured radiological tasks such as Reporting and Data Systems (RADS) categorization remains unexplored.</p><p><strong>Objective: </strong>This study aims to evaluate 3 large language model chatbots-Claude-2, GPT-3.5, and GPT-4-on assigning RADS categories to radiology reports and assess the impact of different prompting strategies.</p><p><strong>Methods: </strong>This cross-sectional study compared 3 chatbots using 30 radiology reports (10 per RADS criteria), using a 3-level prompting strategy: zero-shot, few-shot, and guideline PDF-informed prompts. The cases were grounded in Liver Imaging Reporting & Data System (LI-RADS) version 2018, Lung CT (computed tomography) Screening Reporting & Data System (Lung-RADS) version 2022, and Ovarian-Adnexal Reporting & Data System (O-RADS) magnetic resonance imaging, meticulously prepared by board-certified radiologists. Each report underwent 6 assessments. Two blinded reviewers assessed the chatbots' response at patient-level RADS categorization and overall ratings. The agreement across repetitions was assessed using Fleiss κ.</p><p><strong>Results: </strong>Claude-2 achieved the highest accuracy in overall ratings with few-shot prompts and guideline PDFs (prompt-2), attaining 57% (17/30) average accuracy over 6 runs and 50% (15/30) accuracy with k-pass voting. Without prompt engineering, all chatbots performed poorly. The introduction of a structured exemplar prompt (prompt-1) increased the accuracy of overall ratings for all chatbots. Providing prompt-2 further improved Claude-2's performance, an enhancement not replicated by GPT-4. The interrun agreement was substantial for Claude-2 (k=0.66 for overall rating and k=0.69 for RADS categorization), fair for GPT-4 (k=0.39 for both), and fair for GPT-3.5 (k=0.21 for overall rating and k=0.39 for RADS categorization). All chatbots showed significantly higher accuracy with LI-RADS version 2018 than with Lung-RADS version 2022 and O-RADS (P<.05); with prompt-2, Claude-2 achieved the highest overall rating accuracy of 75% (45/60) in LI-RADS version 2018.</p><p><strong>Conclusions: </strong>When equipped with structured prompts and guideline PDFs, Claude-2 demonstrated potential in assigning RADS categories to radiology cases according to established criteria such as LI-RADS version 2018. However, the current generation of chatbots lags in accurately categorizing cases based on more recent RADS criteria.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e55799"},"PeriodicalIF":3.1,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11292156/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141629385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diagnostic Accuracy of Artificial Intelligence in Endoscopy: Umbrella Review 人工智能在内窥镜检查中的诊断准确性:综述
IF 3.2 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-15 DOI: 10.2196/56361
Bowen Zha, Angshu Cai, Guiqi Wang
Background: Some research has already reported the diagnostic value of artificial intelligence (AI) in different endoscopy outcomes. However, the evidence is confusing and of varying quality. Objective: To comprehensively evaluate the credibility of the evidence of the diagnostic accuracy of artificial intelligence in endoscopy. Methods: Before the study began, the protocol was registered in the International prospective register of systematic reviews (CRD42023483073). Firstly, two researchers searched PubMed, Web of Science, Embase, and Cochrane Library using comprehensive search terms. The deadline is November 2023. Then, researchers conduct screening research and extract information. We use A Measurement Tool to Assess Systematic Reviews 2 (AMSTAR2) to evaluate the quality of the article. We choose the research with higher quality evaluation for the same outcome for further analysis. In order to ensure the reliability of the conclusion, we have calculated each outcome again. Finally, the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) is used to evaluate the credibility of the outcome. Results: A total of 21 studies were included for analysis. Through AMSTAR2, it was found that eight research methodologies were of moderate quality, while other studies were regarded as low or critical low. The sensitivity and specificity of 17 different outcomes were analyzed. There are four different outcomes related to the esophagus, stomach, and colorectal, respectively. Two outcomes are associated with capsule endoscopy and laryngoscope, respectively. While the other is related to ultrasonic endoscopy. In terms of sensitivity, gastroesophageal reflux disease has the highest accuracy rate, reaching 97%, while the invasion depth of colon neoplasia has the lowest accuracy rate, only 71%. On the other hand, the specificity of colorectal cancer is the highest, reaching 98%, while the gastrointestinal stromal tumor has the lowest, only 80%. The GRADE evaluation suggests that the reliability of most outcomes are evaluated as low or very low. Conclusions: AI shows the value of diagnosis in endoscopy, especially in esophageal and colorectal diseases. These findings provide a theoretical basis for the development and evaluation of the use of AI-assisted systems, which are aimed at assisting endoscopists to carry out examinations to improve human health. However, it is worth noting further high-quality research is needed in the future.
背景:一些研究已经报告了人工智能(AI)在不同内窥镜检查结果中的诊断价值。然而,这些证据混乱且质量参差不齐。目的全面评估人工智能在内窥镜检查中诊断准确性证据的可信度。研究方法研究开始前,研究方案已在国际前瞻性系统综述注册中心(CRD42023483073)注册。首先,两名研究人员使用综合检索词检索了 PubMed、Web of Science、Embase 和 Cochrane Library。截止日期为 2023 年 11 月。然后,研究人员进行筛选研究并提取信息。我们使用 "评估系统性综述的测量工具 2"(AMSTAR2)来评价文章的质量。我们会选择对相同结果评价质量较高的研究进行进一步分析。为了确保结论的可靠性,我们对每项结果都进行了重新计算。最后,我们采用建议评估、发展和评价分级法(GRADE)来评价结果的可信度。结果:共纳入 21 项研究进行分析。通过 AMSTAR2,发现有 8 项研究方法的质量为中等,其他研究则被视为低质量或临界低质量。分析了 17 种不同结果的敏感性和特异性。有四种不同的结果分别与食道、胃和结肠直肠有关。两种结果分别与胶囊内窥镜和喉镜有关。另一种则与超声波内窥镜检查有关。在灵敏度方面,胃食管反流病的准确率最高,达到 97%,而结肠肿瘤侵犯深度的准确率最低,仅为 71%。另一方面,结直肠癌的特异性最高,达到 98%,而胃肠道间质瘤的特异性最低,仅为 80%。GRADE 评估表明,大多数结果的可靠性被评为较低或非常低。结论人工智能显示了内窥镜诊断的价值,尤其是在食道和结直肠疾病方面。这些发现为开发和评估人工智能辅助系统的使用提供了理论依据,这些系统旨在协助内镜医师进行检查,以改善人类健康。不过,值得注意的是,今后还需要进一步开展高质量的研究。
{"title":"Diagnostic Accuracy of Artificial Intelligence in Endoscopy: Umbrella Review","authors":"Bowen Zha, Angshu Cai, Guiqi Wang","doi":"10.2196/56361","DOIUrl":"https://doi.org/10.2196/56361","url":null,"abstract":"Background: Some research has already reported the diagnostic value of artificial intelligence (AI) in different endoscopy outcomes. However, the evidence is confusing and of varying quality. Objective: To comprehensively evaluate the credibility of the evidence of the diagnostic accuracy of artificial intelligence in endoscopy. Methods: Before the study began, the protocol was registered in the International prospective register of systematic reviews (CRD42023483073). Firstly, two researchers searched PubMed, Web of Science, Embase, and Cochrane Library using comprehensive search terms. The deadline is November 2023. Then, researchers conduct screening research and extract information. We use A Measurement Tool to Assess Systematic Reviews 2 (AMSTAR2) to evaluate the quality of the article. We choose the research with higher quality evaluation for the same outcome for further analysis. In order to ensure the reliability of the conclusion, we have calculated each outcome again. Finally, the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) is used to evaluate the credibility of the outcome. Results: A total of 21 studies were included for analysis. Through AMSTAR2, it was found that eight research methodologies were of moderate quality, while other studies were regarded as low or critical low. The sensitivity and specificity of 17 different outcomes were analyzed. There are four different outcomes related to the esophagus, stomach, and colorectal, respectively. Two outcomes are associated with capsule endoscopy and laryngoscope, respectively. While the other is related to ultrasonic endoscopy. In terms of sensitivity, gastroesophageal reflux disease has the highest accuracy rate, reaching 97%, while the invasion depth of colon neoplasia has the lowest accuracy rate, only 71%. On the other hand, the specificity of colorectal cancer is the highest, reaching 98%, while the gastrointestinal stromal tumor has the lowest, only 80%. The GRADE evaluation suggests that the reliability of most outcomes are evaluated as low or very low. Conclusions: AI shows the value of diagnosis in endoscopy, especially in esophageal and colorectal diseases. These findings provide a theoretical basis for the development and evaluation of the use of AI-assisted systems, which are aimed at assisting endoscopists to carry out examinations to improve human health. However, it is worth noting further high-quality research is needed in the future.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"90 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141719753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating Clinical Data and Medical Imaging in Lung Cancer: Feasibility Study Using the Observational Medical Outcomes Partnership Common Data Model Extension. 整合肺癌临床数据和医学影像:使用观察性医疗结果伙伴关系通用数据模型扩展的可行性研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-12 DOI: 10.2196/59187
Hyerim Ji, Seok Kim, Leonard Sunwoo, Sowon Jang, Ho-Young Lee, Sooyoung Yoo

Background: Digital transformation, particularly the integration of medical imaging with clinical data, is vital in personalized medicine. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) standardizes health data. However, integrating medical imaging remains a challenge.

Objective: This study proposes a method for combining medical imaging data with the OMOP CDM to improve multimodal research.

Methods: Our approach included the analysis and selection of digital imaging and communications in medicine header tags, validation of data formats, and alignment according to the OMOP CDM framework. The Fast Healthcare Interoperability Resources ImagingStudy profile guided our consistency in column naming and definitions. Imaging Common Data Model (I-CDM), constructed using the entity-attribute-value model, facilitates scalable and efficient medical imaging data management. For patients with lung cancer diagnosed between 2010 and 2017, we introduced 4 new tables-IMAGING_STUDY, IMAGING_SERIES, IMAGING_ANNOTATION, and FILEPATH-to standardize various imaging-related data and link to clinical data.

Results: This framework underscores the effectiveness of I-CDM in enhancing our understanding of lung cancer diagnostics and treatment strategies. The implementation of the I-CDM tables enabled the structured organization of a comprehensive data set, including 282,098 IMAGING_STUDY, 5,674,425 IMAGING_SERIES, and 48,536 IMAGING_ANNOTATION records, illustrating the extensive scope and depth of the approach. A scenario-based analysis using actual data from patients with lung cancer underscored the feasibility of our approach. A data quality check applying 44 specific rules confirmed the high integrity of the constructed data set, with all checks successfully passed, underscoring the reliability of our findings.

Conclusions: These findings indicate that I-CDM can improve the integration and analysis of medical imaging and clinical data. By addressing the challenges in data standardization and management, our approach contributes toward enhancing diagnostics and treatment strategies. Future research should expand the application of I-CDM to diverse disease populations and explore its wide-ranging utility for medical conditions.

背景介绍数字化转型,尤其是医学影像与临床数据的整合,对个性化医疗至关重要。观察性医疗结果合作组织(OMOP)通用数据模型(CDM)实现了健康数据的标准化。然而,整合医学影像仍然是一项挑战:本研究提出了一种将医学影像数据与 OMOP CDM 相结合的方法,以改进多模态研究:方法:我们的方法包括分析和选择医学数字成像和通信标题标签,验证数据格式,并根据 OMOP CDM 框架进行调整。快速医疗保健互操作性资源成像研究(Fast Healthcare Interoperability Resources ImagingStudy profile)指导我们保持列命名和定义的一致性。成像通用数据模型(I-CDM)采用实体-属性-值模型构建,有利于可扩展和高效的医学成像数据管理。对于 2010 年至 2017 年期间确诊的肺癌患者,我们引入了 4 个新表--IMAGING_STUDY、IMAGING_SERIES、IMAGING_ANNOTATION 和 FILEPATH--以标准化各种影像相关数据并链接到临床数据:该框架强调了 I-CDM 在增强我们对肺癌诊断和治疗策略的理解方面的有效性。通过实施 I-CDM 表格,我们可以结构化地组织一个综合数据集,其中包括 282,098 个 IMAGING_STUDY、5,674,425 个 IMAGING_SERIES 和 48,536 个 IMAGING_ANNOTATION 记录,这说明了该方法的广度和深度。利用肺癌患者的实际数据进行的情景分析强调了我们方法的可行性。应用 44 条特定规则进行的数据质量检查证实了所构建数据集的高度完整性,所有检查均顺利通过,从而强调了我们研究结果的可靠性:这些研究结果表明,I-CDM 可以改进医学成像和临床数据的整合与分析。通过应对数据标准化和管理方面的挑战,我们的方法有助于改进诊断和治疗策略。未来的研究应将 I-CDM 的应用扩展到不同的疾病人群,并探索其在医疗条件方面的广泛用途。
{"title":"Integrating Clinical Data and Medical Imaging in Lung Cancer: Feasibility Study Using the Observational Medical Outcomes Partnership Common Data Model Extension.","authors":"Hyerim Ji, Seok Kim, Leonard Sunwoo, Sowon Jang, Ho-Young Lee, Sooyoung Yoo","doi":"10.2196/59187","DOIUrl":"10.2196/59187","url":null,"abstract":"<p><strong>Background: </strong>Digital transformation, particularly the integration of medical imaging with clinical data, is vital in personalized medicine. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) standardizes health data. However, integrating medical imaging remains a challenge.</p><p><strong>Objective: </strong>This study proposes a method for combining medical imaging data with the OMOP CDM to improve multimodal research.</p><p><strong>Methods: </strong>Our approach included the analysis and selection of digital imaging and communications in medicine header tags, validation of data formats, and alignment according to the OMOP CDM framework. The Fast Healthcare Interoperability Resources ImagingStudy profile guided our consistency in column naming and definitions. Imaging Common Data Model (I-CDM), constructed using the entity-attribute-value model, facilitates scalable and efficient medical imaging data management. For patients with lung cancer diagnosed between 2010 and 2017, we introduced 4 new tables-IMAGING_STUDY, IMAGING_SERIES, IMAGING_ANNOTATION, and FILEPATH-to standardize various imaging-related data and link to clinical data.</p><p><strong>Results: </strong>This framework underscores the effectiveness of I-CDM in enhancing our understanding of lung cancer diagnostics and treatment strategies. The implementation of the I-CDM tables enabled the structured organization of a comprehensive data set, including 282,098 IMAGING_STUDY, 5,674,425 IMAGING_SERIES, and 48,536 IMAGING_ANNOTATION records, illustrating the extensive scope and depth of the approach. A scenario-based analysis using actual data from patients with lung cancer underscored the feasibility of our approach. A data quality check applying 44 specific rules confirmed the high integrity of the constructed data set, with all checks successfully passed, underscoring the reliability of our findings.</p><p><strong>Conclusions: </strong>These findings indicate that I-CDM can improve the integration and analysis of medical imaging and clinical data. By addressing the challenges in data standardization and management, our approach contributes toward enhancing diagnostics and treatment strategies. Future research should expand the application of I-CDM to diverse disease populations and explore its wide-ranging utility for medical conditions.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e59187"},"PeriodicalIF":3.1,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11282389/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141602289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1