首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
Evaluating the Accuracy of the Frysian Questionnaire for Differentiation of Musculoskeletal Complaints for Triage of Musculoskeletal Diseases: Algorithm Development and Validation Study. 评估肌肉骨骼疾病分诊中肌肉骨骼主诉的Frysian问卷的准确性:算法开发与验证研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-17 DOI: 10.2196/77345
Tjardo Daniël Maarseveen, Floor Reimann, Ahmed Al Hasan, Annemarie Schilder, Dan Zhang, Freke Wink, Lidy Hendriks, Rachel Knevel, Reinhard Bos
<p><strong>Background: </strong>Inflammatory rheumatic diseases (IRDs) affect 5% of the general population, whereas 35% of the population experiences musculoskeletal concerns. IRDs cause early disability, reduced life expectancy, and considerable health care costs. Early diagnosis is essential to prevent long-term damage. Similarly important is the early identification of patients with musculoskeletal concerns without IRDs to prevent unnecessary health care expenses. Of the population referred to the rheumatologist, 60% have noninflammatory musculoskeletal concerns, whereas only 20% of patients with an IRD see a rheumatologist within 3 months of symptom onset. The need for digital predictive (triage) tools for rheumatic and musculoskeletal diseases led to the development of the Frysian Questionnaire for Differentiation of Musculoskeletal Complaints (FRYQ).</p><p><strong>Objective: </strong>This study aimed to assess whether the FRYQ can distinguish IRD from noninflammatory musculoskeletal concerns in general, and rheumatoid arthritis and fibromyalgia specifically, in newly referred patients.</p><p><strong>Methods: </strong>The FRYQ is an 87-item tool (20 open-ended and 67 closed-ended questions) used to triage new rheumatology patients at Frisius Medical Center in the Netherlands. We analyzed data from 2 sources: dataset A with 728 outpatient clinic patients and dataset B with 373 patients from the Joint Pain Assessment Scoring Tool study. We built a classifier using Extreme Gradient Boosting to distinguish inflammatory from noninflammatory conditions based on closed-ended questions. Using elastic net regularization, we identified the most informative questions. We evaluated classification using receiver operating characteristic curve analysis and assessed feature importance through Shapley Additive Explanation analysis. To test generalizability, we replicated our analysis on dataset B. Finally, we examined whether the questions of the FRYQ could be used to identify specific conditions beyond the general categories of IRD and non-IRD, specifically for detecting fibromyalgia and rheumatoid arthritis.</p><p><strong>Results: </strong>Feature selection reduced the questionnaire from 67 to 28 items while maintaining discriminative power. After initial development, the model achieved an area under the receiver operating characteristic curve (AUC-ROC) of 0.72 (95% CI 0.67-0.78) for distinguishing inflammatory from noninflammatory conditions in an external validation set. Using a probability threshold of 0.30, the model achieved 71% sensitivity and 56% specificity on external validation. The FRYQ demonstrated stronger performance in identifying specific diagnoses such as fibromyalgia (AUC-ROC=0.81) and rheumatoid arthritis (AUC-ROC=0.77). Key discriminating features included symptom duration, pain response to movement, and anti-inflammatory medication effectiveness.</p><p><strong>Conclusions: </strong>The FRYQ effectively distinguishes inflammatory from
背景:炎性风湿病(IRDs)影响5%的普通人群,而35%的人群有肌肉骨骼问题。IRDs会导致早期残疾、预期寿命缩短和大量医疗保健费用。早期诊断对于预防长期损害至关重要。同样重要的是,在没有ird的情况下早期识别患有肌肉骨骼问题的患者,以防止不必要的医疗保健费用。在向风湿病学家求诊的人群中,60%有非炎症性肌肉骨骼问题,而只有20%的IRD患者在症状出现后3个月内求诊。对风湿病和肌肉骨骼疾病的数字预测(分诊)工具的需求导致了肌肉骨骼疾病鉴别弗里斯调查问卷(FRYQ)的发展。目的:本研究旨在评估FRYQ是否可以在新转诊的患者中区分IRD与非炎症性肌肉骨骼问题,特别是类风湿性关节炎和纤维肌痛。方法:FRYQ是一个87项工具(20个开放式问题和67个封闭式问题),用于对荷兰Frisius医学中心的新风湿病患者进行分类。我们分析了来自两个来源的数据:数据集A有728名门诊患者,数据集B有373名来自关节疼痛评估评分工具研究的患者。我们建立了一个基于封闭问题的分类器,使用极端梯度增强来区分炎症和非炎症状况。使用弹性网正则化,我们确定了信息量最大的问题。我们使用受试者工作特征曲线分析评估分类,并通过Shapley加性解释分析评估特征重要性。为了测试通用性,我们在数据集b上重复了我们的分析。最后,我们检查了FRYQ的问题是否可以用于识别IRD和非IRD的一般类别之外的特定条件,特别是用于检测纤维肌痛和类风湿性关节炎。结果:特征选择使问卷从67个条目减少到28个条目,同时保持了判别能力。经过初步开发,该模型在外部验证集中区分炎症和非炎症情况的受试者工作特征曲线下面积(AUC-ROC)为0.72 (95% CI 0.67-0.78)。使用0.30的概率阈值,该模型在外部验证中获得了71%的灵敏度和56%的特异性。FRYQ在识别纤维肌痛(AUC-ROC=0.81)和类风湿关节炎(AUC-ROC=0.77)等特定诊断方面表现出更强的性能。关键的鉴别特征包括症状持续时间、疼痛对运动的反应和抗炎药物的有效性。结论:FRYQ在专家会诊前能有效区分炎症性和非炎症性风湿病,在识别纤维肌痛和类风湿性关节炎方面表现出特别的优势。该工具可以通过优先考虑具有高IRD可能性的转诊进行早期风湿病学家评估,同时指导其他患者使用适当的替代资源,从而改善风湿病分诊。需要前瞻性研究来确定FRYQ对临床结果和卫生保健效率的影响。
{"title":"Evaluating the Accuracy of the Frysian Questionnaire for Differentiation of Musculoskeletal Complaints for Triage of Musculoskeletal Diseases: Algorithm Development and Validation Study.","authors":"Tjardo Daniël Maarseveen, Floor Reimann, Ahmed Al Hasan, Annemarie Schilder, Dan Zhang, Freke Wink, Lidy Hendriks, Rachel Knevel, Reinhard Bos","doi":"10.2196/77345","DOIUrl":"10.2196/77345","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Inflammatory rheumatic diseases (IRDs) affect 5% of the general population, whereas 35% of the population experiences musculoskeletal concerns. IRDs cause early disability, reduced life expectancy, and considerable health care costs. Early diagnosis is essential to prevent long-term damage. Similarly important is the early identification of patients with musculoskeletal concerns without IRDs to prevent unnecessary health care expenses. Of the population referred to the rheumatologist, 60% have noninflammatory musculoskeletal concerns, whereas only 20% of patients with an IRD see a rheumatologist within 3 months of symptom onset. The need for digital predictive (triage) tools for rheumatic and musculoskeletal diseases led to the development of the Frysian Questionnaire for Differentiation of Musculoskeletal Complaints (FRYQ).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aimed to assess whether the FRYQ can distinguish IRD from noninflammatory musculoskeletal concerns in general, and rheumatoid arthritis and fibromyalgia specifically, in newly referred patients.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;The FRYQ is an 87-item tool (20 open-ended and 67 closed-ended questions) used to triage new rheumatology patients at Frisius Medical Center in the Netherlands. We analyzed data from 2 sources: dataset A with 728 outpatient clinic patients and dataset B with 373 patients from the Joint Pain Assessment Scoring Tool study. We built a classifier using Extreme Gradient Boosting to distinguish inflammatory from noninflammatory conditions based on closed-ended questions. Using elastic net regularization, we identified the most informative questions. We evaluated classification using receiver operating characteristic curve analysis and assessed feature importance through Shapley Additive Explanation analysis. To test generalizability, we replicated our analysis on dataset B. Finally, we examined whether the questions of the FRYQ could be used to identify specific conditions beyond the general categories of IRD and non-IRD, specifically for detecting fibromyalgia and rheumatoid arthritis.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Feature selection reduced the questionnaire from 67 to 28 items while maintaining discriminative power. After initial development, the model achieved an area under the receiver operating characteristic curve (AUC-ROC) of 0.72 (95% CI 0.67-0.78) for distinguishing inflammatory from noninflammatory conditions in an external validation set. Using a probability threshold of 0.30, the model achieved 71% sensitivity and 56% specificity on external validation. The FRYQ demonstrated stronger performance in identifying specific diagnoses such as fibromyalgia (AUC-ROC=0.81) and rheumatoid arthritis (AUC-ROC=0.77). Key discriminating features included symptom duration, pain response to movement, and anti-inflammatory medication effectiveness.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;The FRYQ effectively distinguishes inflammatory from ","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e77345"},"PeriodicalIF":3.8,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12622856/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145544266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Methods for Addressing Missingness in Electronic Health Record Data for Clinical Prediction Models: Comparative Evaluation. 解决临床预测模型中电子健康记录数据缺失的方法:比较评价。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-14 DOI: 10.2196/79307
Jean Digitale, Deborah Franzon, Mark J Pletcher, Charles E McCulloch, Efstathios D Gennatas

Background: Missing data are a common challenge in electronic health record (EHR)-based prediction modeling. Traditional imputation methods may not suit prediction or machine learning models, and real-world use requires workflows that are implementable for both model development and real-time prediction.

Objective: We evaluated methods for handling missing data when using EHR data to build clinical prediction models for patients admitted to the pediatric intensive care unit (PICU).

Methods: Using EHR data containing missing values from an academic medical center PICU, we generated a synthetic complete dataset. From this, we created 300 datasets with missing data under varying mechanisms and proportions of missingness for the outcomes of (1) successful extubation (binary) and (2) blood pressure (continuous). We assessed strategies to address missing data including simple methods (eg, last observation carried forward [LOCF]), complex methods (eg, random forest multiple imputation), and native support for missing values in outcome prediction models.

Results: Across 886 patients and 1220 intubation events, 18.2% of original data were missing. LOCF had the lowest imputation error, followed by random forest imputation (average mean squared error [MSE] improvement over mean imputation: 0.41 [range: 0.30, 0.50] and 0.33 [0.21, 0.43], respectively). LOCF generally outperformed other imputation methods across outcome metrics and models (mean improvement: 1.28% [range: -0.07%, 7.2%]). Imputation methods showed more performance variability for the binary outcome (balanced accuracy coefficient of variation: 0.042) than the continuous outcome (mean squared error coefficient of variation: 0.001).

Conclusions: Traditional imputation methods for inferential statistics, such as multiple imputation, may not be optimal for prediction models. The amount of missingness influenced performance more than the missingness mechanism. In datasets with frequent measurements, LOCF and native support for missing values in machine learning models offer reasonable performance for handling missingness at minimal computational cost in predictive analyses.

背景:在基于电子健康记录(EHR)的预测建模中,数据缺失是一个常见的挑战。传统的输入方法可能不适合预测或机器学习模型,并且现实世界的使用需要可实现模型开发和实时预测的工作流。目的:我们评估在使用电子病历数据建立儿科重症监护病房(PICU)患者临床预测模型时处理缺失数据的方法。方法:利用某学术医疗中心PICU中包含缺失值的EHR数据,我们生成了一个合成的完整数据集。由此,我们创建了300个数据集,其中包含不同机制和缺失比例下的缺失数据,用于(1)成功拔管(二元)和(2)血压(连续)的结果。我们评估了解决缺失数据的策略,包括简单方法(例如,最后一次观测结转[LOCF])、复杂方法(例如,随机森林多重插值)和结果预测模型中缺失值的本地支持。结果:在886例患者和1220例插管事件中,18.2%的原始数据丢失。LOCF的估计误差最低,其次是随机森林估计(平均均方误差[MSE]比平均估计分别提高了0.41[范围:0.30,0.50]和0.33[0.21,0.43])。在结果指标和模型上,LOCF总体上优于其他估算方法(平均改进:1.28%[范围:-0.07%,7.2%])。与连续结果(均方误差变异系数:0.001)相比,二元结果(平衡精度变异系数:0.042)的归算方法表现出更多的性能可变性。结论:传统的推断统计方法,如多重插值,可能不适合预测模型。缺失量对性能的影响大于缺失机制。在频繁测量的数据集中,LOCF和机器学习模型中对缺失值的本地支持为在预测分析中以最小的计算成本处理缺失提供了合理的性能。
{"title":"Methods for Addressing Missingness in Electronic Health Record Data for Clinical Prediction Models: Comparative Evaluation.","authors":"Jean Digitale, Deborah Franzon, Mark J Pletcher, Charles E McCulloch, Efstathios D Gennatas","doi":"10.2196/79307","DOIUrl":"10.2196/79307","url":null,"abstract":"<p><strong>Background: </strong>Missing data are a common challenge in electronic health record (EHR)-based prediction modeling. Traditional imputation methods may not suit prediction or machine learning models, and real-world use requires workflows that are implementable for both model development and real-time prediction.</p><p><strong>Objective: </strong>We evaluated methods for handling missing data when using EHR data to build clinical prediction models for patients admitted to the pediatric intensive care unit (PICU).</p><p><strong>Methods: </strong>Using EHR data containing missing values from an academic medical center PICU, we generated a synthetic complete dataset. From this, we created 300 datasets with missing data under varying mechanisms and proportions of missingness for the outcomes of (1) successful extubation (binary) and (2) blood pressure (continuous). We assessed strategies to address missing data including simple methods (eg, last observation carried forward [LOCF]), complex methods (eg, random forest multiple imputation), and native support for missing values in outcome prediction models.</p><p><strong>Results: </strong>Across 886 patients and 1220 intubation events, 18.2% of original data were missing. LOCF had the lowest imputation error, followed by random forest imputation (average mean squared error [MSE] improvement over mean imputation: 0.41 [range: 0.30, 0.50] and 0.33 [0.21, 0.43], respectively). LOCF generally outperformed other imputation methods across outcome metrics and models (mean improvement: 1.28% [range: -0.07%, 7.2%]). Imputation methods showed more performance variability for the binary outcome (balanced accuracy coefficient of variation: 0.042) than the continuous outcome (mean squared error coefficient of variation: 0.001).</p><p><strong>Conclusions: </strong>Traditional imputation methods for inferential statistics, such as multiple imputation, may not be optimal for prediction models. The amount of missingness influenced performance more than the missingness mechanism. In datasets with frequent measurements, LOCF and native support for missing values in machine learning models offer reasonable performance for handling missingness at minimal computational cost in predictive analyses.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e79307"},"PeriodicalIF":3.8,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12617989/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145524941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Named Entity Recognition for Chinese Cancer Electronic Health Records-Development and Evaluation of a Domain-Specific BERT Model: Quantitative Study. 中国癌症电子病历的命名实体识别——特定领域BERT模型的开发与评价:定量研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-14 DOI: 10.2196/76912
Junbai Chen, Butian Zhao, Xiaohan Tian, Zhengkai Zou, Ruojia Wang, Jiarui Wu, Songxing Du, Fengying Guo

Background: The unstructured data of Chinese cancer electronic health records (EHRs) contains valuable medical expertise. Accurate medical entity recognition is crucial for building a medical-assisted decision system. Named entity recognition (NER) in cancer EHRs typically uses general models designed for English medical records. There is a lack of specialized handling for cancer-specific records and limited application to Chinese medical records.

Objective: This study aims to propose a specific NER model to enhance the recognition of medical entities in Chinese cancer EHRs.

Methods: Desensitized inpatient EHRs related to breast cancer were collected from a leading hospital in Beijing. Building upon the MC Bidirectional Encoder Representations from Transformers (BERT) foundation, the study further incorporated a Chinese cancer corpus for pretraining, resulting in the construction of the ChCancerBERT pretrained model. In conjunction with dilated-gated convolutional neural networks, bidirectional long short-term memory, multihead attention mechanism, and a conditional random field, this model forms a multimodel, multilevel integrated NER approach.

Results: This approach effectively extracts medical entity features related to symptoms, signs, tests, treatments, and time in Chinese breast cancer EHRs. The entity recognition performance of the proposed model surpasses that of the baseline model and other models compared in the experiment. The F1-score reached 86.93%, precision reached 87.24%, and recall reached 86.61%. The model introduced in this study demonstrates exceptional performance on the CCKS2019 dataset, attaining a precision rate of 87.26%, a recall rate of 87.27%, and an impressive F1-score of 87.26%, surpassing that of existing models.

Conclusions: The experiments demonstrate that the approach proposed in this study exhibits excellent performance in NER within breast cancer EHRs. This advancement will further contribute to clinical decision support for cancer treatment and research. In addition, the study reveals that incorporating domain-specific corpora in clinical NER tasks can further enhance the performance of BERT models in specialized domains.

背景:中国癌症电子健康档案(EHRs)的非结构化数据包含了宝贵的医学知识。准确的医疗实体识别是构建医疗辅助决策系统的关键。癌症电子病历中的命名实体识别(NER)通常使用针对英语医疗记录设计的通用模型。缺乏对癌症特定记录的专门处理,对中国医疗记录的应用也有限。目的:本研究旨在提出一个特定的NER模型,以提高中国癌症电子病历对医疗实体的识别。方法:收集北京市某知名医院脱敏乳腺癌住院患者的电子病历。在BERT (MC Bidirectional Encoder Representations from Transformers)的基础上,进一步纳入中文癌症语料库进行预训练,构建了ChCancerBERT预训练模型。该模型与扩张型门控卷积神经网络、双向长短期记忆、多头注意机制和条件随机场相结合,形成了一种多模型、多层次的集成NER方法。结果:该方法有效地提取了中国乳腺癌电子病历中与症状、体征、检查、治疗和时间相关的医疗实体特征。该模型的实体识别性能优于基线模型和实验中比较的其他模型。f1得分达到86.93%,准确率达到87.24%,召回率达到86.61%。本研究引入的模型在CCKS2019数据集上表现出色,准确率达到87.26%,召回率达到87.27%,f1得分达到87.26%,超过了现有模型。结论:实验表明,本研究提出的方法在乳腺癌电子病历的NER中表现优异。这一进展将进一步有助于癌症治疗和研究的临床决策支持。此外,研究表明,在临床NER任务中加入特定领域的语料库可以进一步提高BERT模型在专业领域的性能。
{"title":"Named Entity Recognition for Chinese Cancer Electronic Health Records-Development and Evaluation of a Domain-Specific BERT Model: Quantitative Study.","authors":"Junbai Chen, Butian Zhao, Xiaohan Tian, Zhengkai Zou, Ruojia Wang, Jiarui Wu, Songxing Du, Fengying Guo","doi":"10.2196/76912","DOIUrl":"10.2196/76912","url":null,"abstract":"<p><strong>Background: </strong>The unstructured data of Chinese cancer electronic health records (EHRs) contains valuable medical expertise. Accurate medical entity recognition is crucial for building a medical-assisted decision system. Named entity recognition (NER) in cancer EHRs typically uses general models designed for English medical records. There is a lack of specialized handling for cancer-specific records and limited application to Chinese medical records.</p><p><strong>Objective: </strong>This study aims to propose a specific NER model to enhance the recognition of medical entities in Chinese cancer EHRs.</p><p><strong>Methods: </strong>Desensitized inpatient EHRs related to breast cancer were collected from a leading hospital in Beijing. Building upon the MC Bidirectional Encoder Representations from Transformers (BERT) foundation, the study further incorporated a Chinese cancer corpus for pretraining, resulting in the construction of the ChCancerBERT pretrained model. In conjunction with dilated-gated convolutional neural networks, bidirectional long short-term memory, multihead attention mechanism, and a conditional random field, this model forms a multimodel, multilevel integrated NER approach.</p><p><strong>Results: </strong>This approach effectively extracts medical entity features related to symptoms, signs, tests, treatments, and time in Chinese breast cancer EHRs. The entity recognition performance of the proposed model surpasses that of the baseline model and other models compared in the experiment. The F1-score reached 86.93%, precision reached 87.24%, and recall reached 86.61%. The model introduced in this study demonstrates exceptional performance on the CCKS2019 dataset, attaining a precision rate of 87.26%, a recall rate of 87.27%, and an impressive F1-score of 87.26%, surpassing that of existing models.</p><p><strong>Conclusions: </strong>The experiments demonstrate that the approach proposed in this study exhibits excellent performance in NER within breast cancer EHRs. This advancement will further contribute to clinical decision support for cancer treatment and research. In addition, the study reveals that incorporating domain-specific corpora in clinical NER tasks can further enhance the performance of BERT models in specialized domains.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e76912"},"PeriodicalIF":3.8,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12620309/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145524902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multisource Coherence Analysis of the First European Multicenter Cohort Study for Cancer Prevention in People Experiencing Homelessness: Data Quality Study. 第一项欧洲多中心队列研究对无家可归者癌症预防的多源一致性分析:数据质量研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-14 DOI: 10.2196/73596
Antonio Blasco-Calafat, Vicent Blanes-Selva, Tobias Fragner, Ascensión Doñate-Martínez, Tamara Alhambra-Borrás, Julia Gawronska, Lee Smith, Juan M Garcia-Gomez, Igor Grabovac, Carlos Sáez

Background: Coherence across sites in multicenter datasets is one substantial data quality dimension for reliable health data reuse, as unexpected heterogeneity in data can lead to biases in data analyses and suboptimal generalization of results.

Objective: This work aims to characterize and label the data coherence across sites in the first European multicenter dataset for cancer prevention in people and early detection among the homeless population in Europe: coadapting and implementing the health navigator model. This dataset emerged to enable research to address disparities in health challenges and health care access due to barriers such as unstable housing, limited resources, and social stigma in people experiencing homelessness.

Methods: The dataset comprises 652 cases: 142 from Austria, 158 from Greece, 197 from Spain, and 155 from the United Kingdom. All participants fit classifications from the European Typology of Homelessness and Housing Exclusion. This longitudinal study collected questionnaires at baseline, after 4 weeks, and at the end of the intervention. The 180-question survey covered sociodemographic data, overall health, mental health, empowerment, and interpersonal communication. Data variability was assessed using information theory and geometric methods to analyze discrepancies in distributions and completeness across the dataset.

Results: Substantial variability was observed among the 4 pilot countries, both in the overall analysis and within specific domains. In particular, measures of health care empowerment, quality of life, and interpersonal communication demonstrated the greatest discrepancies among pilot sites, with the exception of the health domain. Notably, Spain exhibited the most pronounced differences, characterized by a high number of missing values related to interpersonal communication and the use of health care services.

Conclusions: Health data may be comparable across the 4 countries; however, substantial differences were observed in the other questionnaires, requiring independent, country-specific analyses. This study underscores the heterogeneity among people experiencing homelessness and the critical need for data quality assessments to inform future research and policymaking in this field.

背景:在多中心数据集中,跨站点的一致性是可靠的卫生数据重用的一个重要数据质量维度,因为数据中的意外异质性可能导致数据分析中的偏差和结果的次优泛化。目的:本工作旨在描述和标记欧洲第一个多中心数据集中各站点的数据一致性,该数据集用于欧洲人的癌症预防和无家可归人口的早期检测:协调和实施健康导航员模型。该数据集的出现是为了使研究能够解决由于住房不稳定、资源有限和无家可归者的社会耻辱等障碍而导致的健康挑战和医疗保健获取方面的差异。方法:数据集包括652例:奥地利142例,希腊158例,西班牙197例,英国155例。所有参与者都符合欧洲无家可归和住房排斥类型学的分类。这项纵向研究在基线、4周后和干预结束时收集问卷。这项180个问题的调查涵盖了社会人口统计数据、整体健康、心理健康、赋权和人际沟通。使用信息理论和几何方法评估数据变异性,以分析数据集分布和完整性的差异。结果:在4个试点国家之间,无论是在总体分析还是在特定领域,都观察到实质性的差异。特别是,在保健赋权、生活质量和人际交往方面的措施显示,各试验点之间的差异最大,但保健领域除外。值得注意的是,西班牙表现出最明显的差异,其特点是与人际交往和保健服务的使用有关的大量价值缺失。结论:这四个国家的卫生数据可能具有可比性;然而,在其他调查表中观察到重大差异,需要独立的、针对具体国家的分析。这项研究强调了无家可归者之间的异质性,以及对数据质量评估的迫切需要,以便为该领域的未来研究和政策制定提供信息。
{"title":"Multisource Coherence Analysis of the First European Multicenter Cohort Study for Cancer Prevention in People Experiencing Homelessness: Data Quality Study.","authors":"Antonio Blasco-Calafat, Vicent Blanes-Selva, Tobias Fragner, Ascensión Doñate-Martínez, Tamara Alhambra-Borrás, Julia Gawronska, Lee Smith, Juan M Garcia-Gomez, Igor Grabovac, Carlos Sáez","doi":"10.2196/73596","DOIUrl":"10.2196/73596","url":null,"abstract":"<p><strong>Background: </strong>Coherence across sites in multicenter datasets is one substantial data quality dimension for reliable health data reuse, as unexpected heterogeneity in data can lead to biases in data analyses and suboptimal generalization of results.</p><p><strong>Objective: </strong>This work aims to characterize and label the data coherence across sites in the first European multicenter dataset for cancer prevention in people and early detection among the homeless population in Europe: coadapting and implementing the health navigator model. This dataset emerged to enable research to address disparities in health challenges and health care access due to barriers such as unstable housing, limited resources, and social stigma in people experiencing homelessness.</p><p><strong>Methods: </strong>The dataset comprises 652 cases: 142 from Austria, 158 from Greece, 197 from Spain, and 155 from the United Kingdom. All participants fit classifications from the European Typology of Homelessness and Housing Exclusion. This longitudinal study collected questionnaires at baseline, after 4 weeks, and at the end of the intervention. The 180-question survey covered sociodemographic data, overall health, mental health, empowerment, and interpersonal communication. Data variability was assessed using information theory and geometric methods to analyze discrepancies in distributions and completeness across the dataset.</p><p><strong>Results: </strong>Substantial variability was observed among the 4 pilot countries, both in the overall analysis and within specific domains. In particular, measures of health care empowerment, quality of life, and interpersonal communication demonstrated the greatest discrepancies among pilot sites, with the exception of the health domain. Notably, Spain exhibited the most pronounced differences, characterized by a high number of missing values related to interpersonal communication and the use of health care services.</p><p><strong>Conclusions: </strong>Health data may be comparable across the 4 countries; however, substantial differences were observed in the other questionnaires, requiring independent, country-specific analyses. This study underscores the heterogeneity among people experiencing homelessness and the critical need for data quality assessments to inform future research and policymaking in this field.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e73596"},"PeriodicalIF":3.8,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12663700/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145515129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Large Language Models With AI Agents for Chronic Gastritis Management: Comprehensive Comparative Study. 用人工智能智能体增强大型语言模型用于慢性胃炎管理:综合比较研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-13 DOI: 10.2196/73857
Shurui Wang, Qing Ye
<p><strong>Background: </strong>The prevalence of chronic gastritis is high, and if not intervened in a timely manner, it may eventually lead to gastric cancer. Managing chronic gastritis essentially requires comprehensive lifestyle changes. However, the current health care environment does not support continuous follow-up by professional health care providers, making self-management a key component of postdiagnosis care. Increasingly, researchers are exploring the use of large language models (LLMs) for patient management. However, LLMs have limitations, including hallucinations, limited knowledge scope, and lack of timeliness. Artificial intelligence (AI) agents may provide a more effective solution. Nevertheless, it remains uncertain whether AI agents can effectively support postdiagnosis self-management for patients with chronic gastritis.</p><p><strong>Objective: </strong>The purpose of this study was to explore the effectiveness of AI agents in the postdiagnosis management of patients with chronic gastritis from different perspectives.</p><p><strong>Methods: </strong>In this study, we developed an agent framework for the health management of patients with chronic gastritis based on LLMs in conjunction with retrieval-augmented generation and a search engine tool. We collected real questions from patients with chronic gastritis in clinical settings and tested the framework's performance across different difficulty levels and scenarios. We analyzed its safety and robustness and compared it with state-of-the-art models to comprehensively evaluate its effectiveness.</p><p><strong>Results: </strong>Using a dual-evaluation framework comprising automated metrics and expert manual assessments, our results demonstrated that AI agents substantially outperformed LLMs in addressing high-complexity questions (embedding average score: 82.849 for AI agents vs 77.825 for LLMs) and were particularly effective in clinical consultation tasks. Clinical evaluation of safety based on a 5-point Likert scale by physicians indicated that the safety of the agents was 4.98 (SD 0.15; 95% CI 4.96-4.99). After 30 repeated experiments, the mean absolute deviation of the AI agents in the embedding average score and BERTScore metrics were 0.0167 and 0.0387, respectively. Therefore, the safety and robustness analysis confirmed that the AI agents can produce safe, stable, and minimally variable responses. In addition, comparative results with those of advanced medical-domain LLMs (Baichuan-14B-M1 and MedGemma-27B) and general-domain LLMs (Qwen3-32B) also demonstrated that the AI agents in this study performed outstandingly in the field of chronic gastritis. Our findings underscore the superior reliability, interpretability, and practical applicability of AI agents over conventional LLMs in chronic gastritis management, offering a robust foundation for their broader adoption in health care settings.</p><p><strong>Conclusions: </strong>AI agents based on LLMs have high applicat
背景:慢性胃炎患病率高,如果不及时干预,最终可能导致胃癌。治疗慢性胃炎基本上需要全面改变生活方式。然而,目前的卫生保健环境不支持专业卫生保健提供者的持续随访,使自我管理成为诊断后护理的关键组成部分。越来越多的研究人员正在探索使用大型语言模型(llm)进行患者管理。然而,法学硕士也有局限性,包括幻觉、知识范围有限、缺乏时效性。人工智能(AI)代理可能会提供更有效的解决方案。然而,人工智能制剂能否有效支持慢性胃炎患者的诊断后自我管理尚不确定。目的:本研究旨在从不同角度探讨人工智能药物在慢性胃炎患者诊断后管理中的有效性。方法:在本研究中,我们开发了一个基于LLMs的慢性胃炎患者健康管理代理框架,并结合检索增强生成和搜索引擎工具。我们收集了临床慢性胃炎患者的真实问题,并测试了该框架在不同难度水平和场景下的性能。我们分析了其安全性和鲁棒性,并将其与最先进的模型进行了比较,以综合评价其有效性。结果:使用由自动指标和专家手动评估组成的双重评估框架,我们的结果表明,人工智能代理在解决高复杂性问题方面的表现明显优于法学硕士(人工智能代理的嵌入平均分:82.849比法学硕士的77.825),并且在临床咨询任务中特别有效。医生基于5点Likert量表的临床安全性评价表明,药物的安全性为4.98 (SD 0.15; 95% CI 4.96-4.99)。经过30次重复实验,人工智能主体在嵌入平均分和BERTScore指标上的平均绝对偏差分别为0.0167和0.0387。因此,安全性和鲁棒性分析证实了AI代理可以产生安全、稳定和最小变量的响应。此外,与先进医学域LLMs(百川- 14b - m1和MedGemma-27B)和通用域LLMs (Qwen3-32B)的对比结果也表明,本研究中的AI agent在慢性胃炎领域表现突出。我们的研究结果强调了人工智能代理在慢性胃炎管理方面优于传统llm的可靠性、可解释性和实用性,为其在医疗保健领域的广泛应用提供了坚实的基础。结论:基于llm的人工智能药物在慢性胃炎的治疗中具有较高的应用价值。他们可以有效地指导慢性病患者解决常见问题,这可能会减少医生的工作量,提高患者家庭护理的质量。
{"title":"Enhancing Large Language Models With AI Agents for Chronic Gastritis Management: Comprehensive Comparative Study.","authors":"Shurui Wang, Qing Ye","doi":"10.2196/73857","DOIUrl":"10.2196/73857","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;The prevalence of chronic gastritis is high, and if not intervened in a timely manner, it may eventually lead to gastric cancer. Managing chronic gastritis essentially requires comprehensive lifestyle changes. However, the current health care environment does not support continuous follow-up by professional health care providers, making self-management a key component of postdiagnosis care. Increasingly, researchers are exploring the use of large language models (LLMs) for patient management. However, LLMs have limitations, including hallucinations, limited knowledge scope, and lack of timeliness. Artificial intelligence (AI) agents may provide a more effective solution. Nevertheless, it remains uncertain whether AI agents can effectively support postdiagnosis self-management for patients with chronic gastritis.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;The purpose of this study was to explore the effectiveness of AI agents in the postdiagnosis management of patients with chronic gastritis from different perspectives.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;In this study, we developed an agent framework for the health management of patients with chronic gastritis based on LLMs in conjunction with retrieval-augmented generation and a search engine tool. We collected real questions from patients with chronic gastritis in clinical settings and tested the framework's performance across different difficulty levels and scenarios. We analyzed its safety and robustness and compared it with state-of-the-art models to comprehensively evaluate its effectiveness.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Using a dual-evaluation framework comprising automated metrics and expert manual assessments, our results demonstrated that AI agents substantially outperformed LLMs in addressing high-complexity questions (embedding average score: 82.849 for AI agents vs 77.825 for LLMs) and were particularly effective in clinical consultation tasks. Clinical evaluation of safety based on a 5-point Likert scale by physicians indicated that the safety of the agents was 4.98 (SD 0.15; 95% CI 4.96-4.99). After 30 repeated experiments, the mean absolute deviation of the AI agents in the embedding average score and BERTScore metrics were 0.0167 and 0.0387, respectively. Therefore, the safety and robustness analysis confirmed that the AI agents can produce safe, stable, and minimally variable responses. In addition, comparative results with those of advanced medical-domain LLMs (Baichuan-14B-M1 and MedGemma-27B) and general-domain LLMs (Qwen3-32B) also demonstrated that the AI agents in this study performed outstandingly in the field of chronic gastritis. Our findings underscore the superior reliability, interpretability, and practical applicability of AI agents over conventional LLMs in chronic gastritis management, offering a robust foundation for their broader adoption in health care settings.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;AI agents based on LLMs have high applicat","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e73857"},"PeriodicalIF":3.8,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12617012/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145515161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating 10-Year Cardiovascular Disease Risk in Primary Prevention Using UK Electronic Health Records and a Hybrid Multitask BERT Model: Retrospective Cohort Study. 使用英国电子健康记录和混合多任务BERT模型估计初级预防10年心血管疾病风险:回顾性队列研究
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-13 DOI: 10.2196/76659
Tianyi Liu, Lei Lu, Yanzhong Wang, Andrew J Krentz, Vasa Curcin
<p><strong>Background: </strong>Cardiovascular disease (CVD) remains a leading cause of preventable morbidity and mortality, highlighting the need for early risk stratification in primary prevention. Traditional Cox models assume proportional hazards and linear effects, limiting flexibility. While machine learning offers greater expressiveness, many models rely solely on structured data and overlook time-to-event (TTE) information. Integrating structured and textual representations may enhance prediction and support equitable assessment across clinical subgroups.</p><p><strong>Objective: </strong>This study aims to develop a hybrid multitask deep learning model (MT-BERT [multitask Bidirectional Encoder Representations from Transformers]) integrating structured and textual features from electronic health records (EHRs) to predict 10-year CVD risk, enhancing individualized stratification and supporting equitable assessment across diverse demographic groups.</p><p><strong>Methods: </strong>We used data from Clinical Practice Research Datalink (CPRD) Aurum comprising 469,496 patients aged 40-85 years to develop MT-BERT for 10-year CVD risk prediction. Structured EHR variables and their corresponding textual representations were jointly encoded using a multilayer perceptron and a distilled version of the BERT model (DistilBERT), respectively. A fusion layer and stacked multihead attention modules enabled cross-modal interaction modeling. The model generated both binary classification outputs and TTE risk scores, optimized using a custom FocalCoxLoss function with uncertainty-based weighting. Prediction targets encompassed composite and individual CVD outcomes. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), concordance index, and Brier score, with subgroup analyses by ethnicity and deprivation, and heterogeneity assessed using Higgins I² and Cochran Q statistics. Generalizability was assessed via external validation in a held-out London cohort.</p><p><strong>Results: </strong>The MT-BERT model yielded AUROC values of 0.744 (95% CI 0.738-0.749) in males and 0.782 (95% CI 0.768-0.796) in females on the test set (n=711,052), and 0.736 (95% CI 0.729-0.741) and 0.775 (95% CI 0.768-0.780), respectively in "spatial external" validation (n=144,370). Brier scores were 0.130 in males and 0.091 in females. Individuals classified as high-risk (≥40% risk in males and ≥34% in females) demonstrated significantly reduced 10-year event-free survival relative to lower-risk individuals (log-rank P<.001). Model performance was consistently higher in females across all metrics. Subgroup analyses revealed substantial heterogeneity across ethnicity and deprivation (I²>70%), especially among males, with lower AUROC in South Asian and Black ethnic groups. These findings reflect variation in model performance across demographic groups while supporting its applicability to large-scale CVD risk stratification.</p><p><stro
背景:心血管疾病(CVD)仍然是可预防的发病率和死亡率的主要原因,突出了在初级预防中进行早期风险分层的必要性。传统的Cox模型假设风险成比例和线性效应,限制了灵活性。虽然机器学习提供了更强的表达能力,但许多模型仅依赖于结构化数据,而忽略了时间到事件(TTE)信息。整合结构化和文本表示可以增强预测并支持跨临床亚组的公平评估。目的:本研究旨在开发一种混合多任务深度学习模型(MT-BERT[多任务双向编码器表示来自变压器]),整合电子健康记录(EHRs)的结构化和文本特征来预测10年心血管疾病风险,增强个性化分层并支持不同人口群体的公平评估。方法:我们使用临床实践研究数据链(CPRD) Aurum的数据,包括469,496名年龄在40-85岁之间的患者,开发MT-BERT用于10年心血管疾病风险预测。结构化EHR变量及其相应的文本表示分别使用多层感知器和BERT模型的蒸馏版本(蒸馏BERT)进行联合编码。融合层和堆叠的多头注意模块实现了跨模态交互建模。该模型生成二元分类输出和TTE风险评分,并使用基于不确定性加权的自定义FocalCoxLoss函数进行优化。预测目标包括复合和个体CVD结果。采用受试者工作特征曲线下面积(AUROC)、一致性指数和Brier评分评估模型性能,并采用种族和剥夺亚组分析,采用Higgins I²和Cochran Q统计评估异质性。概括性是通过外部验证在一个伦敦的队列评估。结果:MT-BERT模型在测试集(n=711,052)上,男性的AUROC值为0.744 (95% CI 0.738-0.749),女性的AUROC值为0.782 (95% CI 0.768-0.796),在“空间外部”验证(n=144,370)中,其AUROC值分别为0.736 (95% CI 0.729-0.741)和0.775 (95% CI 0.768-0.780)。男性的Brier评分为0.130,女性为0.091。高危人群(男性风险≥40%,女性风险≥34%)的10年无事件生存率明显低于低危人群(log-rank P70%),尤其是男性,南亚和黑人群体的AUROC较低。这些发现反映了模型在不同人口群体中的表现差异,同时支持了其对大规模心血管疾病风险分层的适用性。结论:提出的混合MT-BERT模型通过整合来自电子病历的结构化变量和非结构化临床文本来预测初级预防的10年心血管疾病风险。它的多任务设计有利于个性化风险分层和TTE估计。虽然贫困和少数族裔亚群体的表现略有下降,但这些发现为在日益多样化的卫生保健环境中推进公平意识、数据驱动的预防策略提供了初步支持。
{"title":"Estimating 10-Year Cardiovascular Disease Risk in Primary Prevention Using UK Electronic Health Records and a Hybrid Multitask BERT Model: Retrospective Cohort Study.","authors":"Tianyi Liu, Lei Lu, Yanzhong Wang, Andrew J Krentz, Vasa Curcin","doi":"10.2196/76659","DOIUrl":"10.2196/76659","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Cardiovascular disease (CVD) remains a leading cause of preventable morbidity and mortality, highlighting the need for early risk stratification in primary prevention. Traditional Cox models assume proportional hazards and linear effects, limiting flexibility. While machine learning offers greater expressiveness, many models rely solely on structured data and overlook time-to-event (TTE) information. Integrating structured and textual representations may enhance prediction and support equitable assessment across clinical subgroups.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aims to develop a hybrid multitask deep learning model (MT-BERT [multitask Bidirectional Encoder Representations from Transformers]) integrating structured and textual features from electronic health records (EHRs) to predict 10-year CVD risk, enhancing individualized stratification and supporting equitable assessment across diverse demographic groups.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We used data from Clinical Practice Research Datalink (CPRD) Aurum comprising 469,496 patients aged 40-85 years to develop MT-BERT for 10-year CVD risk prediction. Structured EHR variables and their corresponding textual representations were jointly encoded using a multilayer perceptron and a distilled version of the BERT model (DistilBERT), respectively. A fusion layer and stacked multihead attention modules enabled cross-modal interaction modeling. The model generated both binary classification outputs and TTE risk scores, optimized using a custom FocalCoxLoss function with uncertainty-based weighting. Prediction targets encompassed composite and individual CVD outcomes. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), concordance index, and Brier score, with subgroup analyses by ethnicity and deprivation, and heterogeneity assessed using Higgins I² and Cochran Q statistics. Generalizability was assessed via external validation in a held-out London cohort.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;The MT-BERT model yielded AUROC values of 0.744 (95% CI 0.738-0.749) in males and 0.782 (95% CI 0.768-0.796) in females on the test set (n=711,052), and 0.736 (95% CI 0.729-0.741) and 0.775 (95% CI 0.768-0.780), respectively in \"spatial external\" validation (n=144,370). Brier scores were 0.130 in males and 0.091 in females. Individuals classified as high-risk (≥40% risk in males and ≥34% in females) demonstrated significantly reduced 10-year event-free survival relative to lower-risk individuals (log-rank P&lt;.001). Model performance was consistently higher in females across all metrics. Subgroup analyses revealed substantial heterogeneity across ethnicity and deprivation (I²&gt;70%), especially among males, with lower AUROC in South Asian and Black ethnic groups. These findings reflect variation in model performance across demographic groups while supporting its applicability to large-scale CVD risk stratification.&lt;/p&gt;&lt;p&gt;&lt;stro","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e76659"},"PeriodicalIF":3.8,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12620595/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145515121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Process for Quality Management of Electronic Medical Records-Based Data: Case Study Using Real Colorectal Cancer Data. 基于电子病历的数据质量管理流程:使用真实结直肠癌数据的案例研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-13 DOI: 10.2196/73884
NaYoung Park, Kyungmin Na, Woongsang Sunwoo, Jeong-Heum Baek, Youngho Lee, Suehyun Lee, Hyekyung Woo

Background: As data-driven medical research advances, vast amounts of medical data are being collected, giving researchers access to important information. However, issues such as heterogeneity, complexity, and incompleteness of datasets limit their practical use. Errors and missing data negatively affect artificial intelligence-based predictive models, undermining the reliability of clinical decision-making. Thus, it is important to develop a quality management process (QMP) for clinical data.

Objective: This study aimed to develop a rules-based QMP to address errors and impute missing values in real-world data, establishing high-quality data for clinical research.

Methods: We used clinical data from 6491 patients with colorectal cancer (CRC) collected at Gachon University Gil Medical Center between 2010 and 2022, leveraging the clinical library established within the Korea Clinical Data Use Network for Research Excellence. First, we conducted a literature review on the prognostic prediction of CRC to assess whether the data met our research purposes, comparing selected variables with real-world data. A labeling process was then implemented to extract key variables, which facilitated the creation of an automatic staging library. This library, combined with a rule-based process, allowed for systematic analysis and evaluation.

Results: Theoretically, the tumor, node, metastasis (TNM) stage was identified as an important prognostic factor for CRC, but it was not selected through feature selection in real-world data. After applying the QMP, rates of missing data were reduced from 75.3% to 35.7% for TNM and from 24.3% to 18.5% for surveillance, epidemiology, and end results across 6491 cases, confirming the system's effectiveness. Variable importance analysis through feature selection revealed that TNM stage and detailed code variables, which were previously unselected, were included in the improved model.

Conclusions: In sum, we developed a rules-based QMP to address errors and impute missing values in Korea Clinical Data Use Network for Research Excellence data, enhancing data quality. The applicability of the process to real-world datasets highlights its potential for broader use in clinical studies and cancer research.

背景:随着数据驱动的医学研究的进展,大量的医学数据正在被收集,使研究人员能够获得重要的信息。然而,数据集的异质性、复杂性和不完整性等问题限制了它们的实际应用。错误和缺失的数据会对基于人工智能的预测模型产生负面影响,破坏临床决策的可靠性。因此,制定临床数据质量管理流程(QMP)非常重要。目的:本研究旨在开发一个基于规则的QMP,以解决现实世界数据中的错误和缺失值,为临床研究建立高质量的数据。方法:我们使用了2010年至2022年间在Gachon大学吉尔医学中心收集的6491例结直肠癌(CRC)患者的临床数据,利用了韩国临床数据使用网络中建立的临床图书馆进行卓越研究。首先,我们对CRC的预后预测进行了文献综述,以评估数据是否符合我们的研究目的,并将所选变量与现实数据进行了比较。然后实现了一个标记过程来提取关键变量,这促进了自动staging库的创建。该库与基于规则的过程相结合,可以进行系统的分析和评估。结果:理论上,肿瘤、淋巴结、转移(TNM)分期被确定为结直肠癌的重要预后因素,但在现实数据中并未通过特征选择来选择。应用QMP后,TNM的数据缺失率从75.3%降至35.7%,6491例的监测、流行病学和最终结果的数据缺失率从24.3%降至18.5%,证实了该系统的有效性。通过特征选择进行变量重要性分析,发现改进模型中包含了之前未选择的TNM阶段和详细代码变量。结论:总之,我们开发了一个基于规则的QMP来解决韩国临床数据使用网络中研究卓越数据的错误和缺失值,提高了数据质量。该过程对现实世界数据集的适用性突出了其在临床研究和癌症研究中广泛应用的潜力。
{"title":"Process for Quality Management of Electronic Medical Records-Based Data: Case Study Using Real Colorectal Cancer Data.","authors":"NaYoung Park, Kyungmin Na, Woongsang Sunwoo, Jeong-Heum Baek, Youngho Lee, Suehyun Lee, Hyekyung Woo","doi":"10.2196/73884","DOIUrl":"10.2196/73884","url":null,"abstract":"<p><strong>Background: </strong>As data-driven medical research advances, vast amounts of medical data are being collected, giving researchers access to important information. However, issues such as heterogeneity, complexity, and incompleteness of datasets limit their practical use. Errors and missing data negatively affect artificial intelligence-based predictive models, undermining the reliability of clinical decision-making. Thus, it is important to develop a quality management process (QMP) for clinical data.</p><p><strong>Objective: </strong>This study aimed to develop a rules-based QMP to address errors and impute missing values in real-world data, establishing high-quality data for clinical research.</p><p><strong>Methods: </strong>We used clinical data from 6491 patients with colorectal cancer (CRC) collected at Gachon University Gil Medical Center between 2010 and 2022, leveraging the clinical library established within the Korea Clinical Data Use Network for Research Excellence. First, we conducted a literature review on the prognostic prediction of CRC to assess whether the data met our research purposes, comparing selected variables with real-world data. A labeling process was then implemented to extract key variables, which facilitated the creation of an automatic staging library. This library, combined with a rule-based process, allowed for systematic analysis and evaluation.</p><p><strong>Results: </strong>Theoretically, the tumor, node, metastasis (TNM) stage was identified as an important prognostic factor for CRC, but it was not selected through feature selection in real-world data. After applying the QMP, rates of missing data were reduced from 75.3% to 35.7% for TNM and from 24.3% to 18.5% for surveillance, epidemiology, and end results across 6491 cases, confirming the system's effectiveness. Variable importance analysis through feature selection revealed that TNM stage and detailed code variables, which were previously unselected, were included in the improved model.</p><p><strong>Conclusions: </strong>In sum, we developed a rules-based QMP to address errors and impute missing values in Korea Clinical Data Use Network for Research Excellence data, enhancing data quality. The applicability of the process to real-world datasets highlights its potential for broader use in clinical studies and cancer research.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e73884"},"PeriodicalIF":3.8,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12614659/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145515187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing Data Quality in Heterogeneous Health Care Integration: Simulation Study of the AIDAVA Framework. 异构医疗保健整合中的数据质量评估:AIDAVA框架的仿真研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-12 DOI: 10.2196/75275
Jens Declerck, Ömer Durukan Kılıç, Ensar Emir Erol, Shervin Mehryar, Dipak Kalra, Isabelle de Zegher, Remzi Celebi

Background: Integrated health data are foundational for secondary use, research, and policymaking. However, data quality issues-such as missing values and inconsistencies-are common due to the heterogeneity of health data sources. Existing frameworks often use static, 1-time assessments, which limit their ability to address quality issues across evolving data pipelines.

Objective: This study evaluates the AIDAVA (artificial intelligence-powered data curation and validation) data quality framework, which introduces dynamic, life cycle-based validation of health data using knowledge graph technologies and SHACL (Shapes Constraint Language)-based rules. The framework is assessed for its ability to detect and manage data quality issues-specifically, completeness and consistency-during integration.

Methods: Using the MIMIC-III (Medical Information Mart for Intensive Care-III) dataset, we simulated real-world data quality challenges by introducing structured noise, including missing values and logical inconsistencies. The data was transformed into source knowledge graphs and integrated into a unified personal health knowledge graph. SHACL validation rules were applied iteratively during the integration process, and data quality was assessed under varying noise levels and integration orders.

Results: The AIDAVA framework effectively detected completeness and consistency issues across all scenarios. Completeness was shown to influence the interpretability of consistency scores, and domain-specific attributes (eg, diagnoses and procedures) were more sensitive to integration order and data gaps.

Conclusions: AIDAVA supports dynamic, rule-based validation throughout the data life cycle. By addressing both dimension-specific vulnerabilities and cross-dimensional effects, it lays the groundwork for scalable, high-quality health data integration. Future work should explore deployment in live clinical settings and expand to additional quality dimensions.

背景:综合卫生数据是二次利用、研究和决策的基础。然而,由于卫生数据源的异质性,数据质量问题(如缺失值和不一致)很常见。现有框架通常使用静态的一次性评估,这限制了它们在不断发展的数据管道中解决质量问题的能力。目的:本研究评估了AIDAVA(人工智能驱动的数据管理和验证)数据质量框架,该框架使用知识图谱技术和基于SHACL(形状约束语言)的规则引入了基于生命周期的健康数据动态验证。评估框架在集成过程中检测和管理数据质量问题(特别是完整性和一致性)的能力。方法:使用MIMIC-III(重症监护医疗信息市场- iii)数据集,我们通过引入结构化噪声(包括缺失值和逻辑不一致)来模拟现实世界的数据质量挑战。将数据转化为源知识图,整合成统一的个人健康知识图。在集成过程中迭代应用SHACL验证规则,并在不同噪声水平和集成顺序下评估数据质量。结果:AIDAVA框架有效地检测了所有场景中的完整性和一致性问题。完整性被证明会影响一致性分数的可解释性,并且领域特定属性(例如,诊断和程序)对集成顺序和数据差距更敏感。结论:AIDAVA在整个数据生命周期中支持动态的、基于规则的验证。通过解决特定维度的漏洞和跨维度的影响,它为可扩展的高质量健康数据集成奠定了基础。未来的工作应该探索在现场临床环境中的部署,并扩展到额外的质量维度。
{"title":"Assessing Data Quality in Heterogeneous Health Care Integration: Simulation Study of the AIDAVA Framework.","authors":"Jens Declerck, Ömer Durukan Kılıç, Ensar Emir Erol, Shervin Mehryar, Dipak Kalra, Isabelle de Zegher, Remzi Celebi","doi":"10.2196/75275","DOIUrl":"10.2196/75275","url":null,"abstract":"<p><strong>Background: </strong>Integrated health data are foundational for secondary use, research, and policymaking. However, data quality issues-such as missing values and inconsistencies-are common due to the heterogeneity of health data sources. Existing frameworks often use static, 1-time assessments, which limit their ability to address quality issues across evolving data pipelines.</p><p><strong>Objective: </strong>This study evaluates the AIDAVA (artificial intelligence-powered data curation and validation) data quality framework, which introduces dynamic, life cycle-based validation of health data using knowledge graph technologies and SHACL (Shapes Constraint Language)-based rules. The framework is assessed for its ability to detect and manage data quality issues-specifically, completeness and consistency-during integration.</p><p><strong>Methods: </strong>Using the MIMIC-III (Medical Information Mart for Intensive Care-III) dataset, we simulated real-world data quality challenges by introducing structured noise, including missing values and logical inconsistencies. The data was transformed into source knowledge graphs and integrated into a unified personal health knowledge graph. SHACL validation rules were applied iteratively during the integration process, and data quality was assessed under varying noise levels and integration orders.</p><p><strong>Results: </strong>The AIDAVA framework effectively detected completeness and consistency issues across all scenarios. Completeness was shown to influence the interpretability of consistency scores, and domain-specific attributes (eg, diagnoses and procedures) were more sensitive to integration order and data gaps.</p><p><strong>Conclusions: </strong>AIDAVA supports dynamic, rule-based validation throughout the data life cycle. By addressing both dimension-specific vulnerabilities and cross-dimensional effects, it lays the groundwork for scalable, high-quality health data integration. Future work should explore deployment in live clinical settings and expand to additional quality dimensions.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e75275"},"PeriodicalIF":3.8,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12779104/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145508244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of Polyphonic Alarm Sounds From Medical Devices Using Frequency-Enhanced Deep Learning: Simulation Study. 使用频率增强深度学习检测医疗设备的复调报警声音:仿真研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-12 DOI: 10.2196/35987
Kazumasa Kishimoto, Tadamasa Takemura, Osamu Sugiyama, Ryosuke Kojima, Masahiro Yakami, Goshiro Yamamoto, Tomohiro Kuroda

Background: Although an increasing number of bedside medical devices are equipped with wireless connections for reliable notifications, many nonnetworked devices remain effective at detecting abnormal patient conditions and alerting medical staff through auditory alarms. Staff members, however, can miss these notifications, especially when in distant areas or other private rooms. In contrast, the signal-to-noise ratio of alarm systems for medical devices in the neonatal intensive care unit is 0 dB or higher. A feasible system for automatic sound identification with high accuracy is needed to prevent alarm sounds from being missed by the staff.

Objective: The purpose of this study was to design a method for classifying multiple alarm sounds collected with a monaural microphone in a noisy environment.

Methods: Features of 7 alarm sounds were extracted using a mel filter bank and incorporated into a classifier using convolutional and recurrent neural networks. To estimate its clinical usefulness, the classifier was evaluated with mixtures of up to 7 alarm sounds and hospital ward noise.

Results: The proposed convolutional recurrent neural network model was evaluated using a simulation dataset of 7 alarm sounds mixed with hospital ward noise. At a signal-to-noise ratio of 0 dB, the best-performing model (convolutional neural network 3+bidirectional gate recurrent unit) achieved an event-based F1-score of 0.967, with a precision of 0.944 and a recall of 0.991. When the venous foot pump class was excluded, the classwise recall of the classifier ranged from 0.990 to 1.000.

Conclusions: The proposed classifier was found to be highly accurate in detecting alarm sounds. Although the performance of the proposed classifier in a clinical environment can be improved, the classifier could be incorporated into an alarm sound detection system. The classifier, combined with network connectivity, could improve the notification of abnormal status detected by unconnected medical devices.

背景:尽管越来越多的床边医疗设备配备了无线连接,以提供可靠的通知,但许多非联网设备仍然有效地检测患者的异常情况,并通过听觉警报提醒医务人员。然而,工作人员可能会错过这些通知,特别是在遥远的地方或其他私人房间时。相比之下,新生儿重症监护病房医疗设备报警系统的信噪比为0 dB或更高。需要一种可行的、高精度的声音自动识别系统,防止报警声音被工作人员遗漏。目的:本研究的目的是设计一种在嘈杂环境下用单耳麦克风采集的多个报警声音的分类方法。方法:采用mel滤波器组提取7种报警声音的特征,并结合卷积神经网络和递归神经网络进行分类。为了评估其临床用途,分类器被评估与多达7报警声音和医院病房噪音的混合物。结果:使用混合了医院病房噪声的7个报警声音的模拟数据集对所提出的卷积递归神经网络模型进行了评估。在信噪比为0 dB时,表现最好的模型(卷积神经网络3+双向门循环单元)基于事件的f1得分为0.967,精度为0.944,召回率为0.991。当排除静脉足泵类别时,分类器的分类召回率为0.990 ~ 1.000。结论:所提出的分类器在检测报警声音方面具有较高的准确率。虽然所提出的分类器在临床环境中的性能可以得到改善,但分类器可以纳入报警声音检测系统。该分类器结合网络连通性,可以提高对未连接医疗设备检测到的异常状态的通知。
{"title":"Detection of Polyphonic Alarm Sounds From Medical Devices Using Frequency-Enhanced Deep Learning: Simulation Study.","authors":"Kazumasa Kishimoto, Tadamasa Takemura, Osamu Sugiyama, Ryosuke Kojima, Masahiro Yakami, Goshiro Yamamoto, Tomohiro Kuroda","doi":"10.2196/35987","DOIUrl":"10.2196/35987","url":null,"abstract":"<p><strong>Background: </strong>Although an increasing number of bedside medical devices are equipped with wireless connections for reliable notifications, many nonnetworked devices remain effective at detecting abnormal patient conditions and alerting medical staff through auditory alarms. Staff members, however, can miss these notifications, especially when in distant areas or other private rooms. In contrast, the signal-to-noise ratio of alarm systems for medical devices in the neonatal intensive care unit is 0 dB or higher. A feasible system for automatic sound identification with high accuracy is needed to prevent alarm sounds from being missed by the staff.</p><p><strong>Objective: </strong>The purpose of this study was to design a method for classifying multiple alarm sounds collected with a monaural microphone in a noisy environment.</p><p><strong>Methods: </strong>Features of 7 alarm sounds were extracted using a mel filter bank and incorporated into a classifier using convolutional and recurrent neural networks. To estimate its clinical usefulness, the classifier was evaluated with mixtures of up to 7 alarm sounds and hospital ward noise.</p><p><strong>Results: </strong>The proposed convolutional recurrent neural network model was evaluated using a simulation dataset of 7 alarm sounds mixed with hospital ward noise. At a signal-to-noise ratio of 0 dB, the best-performing model (convolutional neural network 3+bidirectional gate recurrent unit) achieved an event-based F1-score of 0.967, with a precision of 0.944 and a recall of 0.991. When the venous foot pump class was excluded, the classwise recall of the classifier ranged from 0.990 to 1.000.</p><p><strong>Conclusions: </strong>The proposed classifier was found to be highly accurate in detecting alarm sounds. Although the performance of the proposed classifier in a clinical environment can be improved, the classifier could be incorporated into an alarm sound detection system. The classifier, combined with network connectivity, could improve the notification of abnormal status detected by unconnected medical devices.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e35987"},"PeriodicalIF":3.8,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12611226/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145508208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model for Predicting Serious Hematological Adverse Events in Individuals With Ovarian Cancer Receiving Poly (Adenosine Diphosphate Ribose) Polymerase Inhibitor Treatment: Prospective Cohort Study. 预测接受聚二磷酸腺苷核糖聚合酶抑制剂治疗的卵巢癌患者严重血液学不良事件的模型:前瞻性队列研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-12 DOI: 10.2196/72994
Xiaotong Lian, Yu Lei

Background: Predicting serious hematological adverse events (SHAEs) from poly (adenosine diphosphate ribose) polymerase inhibitors (PARPis) would allow us to prioritize patients with ovarian cancer at higher risk for more intensive care, ultimately lowering morbidity and preventing them from premature termination of medication.

Objective: This study aimed to explore the risk factors for SHAEs in patients with ovarian cancer receiving PARPi treatment and develop a risk prediction model for such events.

Methods: Prospective clinical data were collected on patients with ovarian cancer who received PARPi treatment at the Guangxi Medical University Affiliated Tumor Hospital from December 2018 to August 2024. They were divided into a SHAE group and a no-SHAE group based on the occurrence of SHAEs. Variable differences were screened using the chi-square test or Fisher exact test. Multivariate logistic regression was used to determine independent factors influencing SHAEs in patients with ovarian cancer. A predictive model for serious blood-related complications in ovarian cancer treatment was developed from identified independent risk factors using the R software. The model's clinical utility was assessed through decision curve analysis (net benefit), calibration (calibration curve), and discrimination (receiver operating characteristic curve).

Results: A total of 70 patients with ovarian cancer receiving PARPi treatment were included in this study. Of these 70 patients, 16 (23%) experienced SHAEs, with decreases in red blood cell (RBC) count and hemoglobin levels being the most common. Multiple logistic regression analysis identified 4 independent predictors of PARPi-associated SHAEs in patients with ovarian cancer: lymph node metastasis (odds ratio [OR] 6.733, 95% CI 1.197-37.873; P=.03), creatinine clearance rate of ≤60 mL per minute (OR 23.722, 95% CI 3.121-180.303; P=.002), RBC count of ≤3.3×1012 per liter (OR 4.847, 95% CI 1.020-23.041; P=.047), and combination therapy with vascular endothelial growth factor inhibitors (OR 6.749, 95% CI 1.313-34.689; P=.02). The internal validation yielded an area under the curve of 0.874 (95% CI 0.793-0.955), indicating moderate clinical utility and accuracy for the risk prediction model incorporating these predictors.

Conclusions: Lymph node metastasis, creatinine clearance rate of ≤60 mL per minute, RBC count of ≤3.3×1012 per liter, and combination therapy with vascular endothelial growth factor inhibitors are independent risk factors for PARPi SHAEs in patients with ovarian cancer. The risk prediction model established based on these factors demonstrated moderate predictive value.

背景:预测聚腺苷二磷酸核糖聚合酶抑制剂(PARPis)的严重血液学不良事件(SHAEs)将使我们能够优先考虑高风险卵巢癌患者进行更多的重症监护,最终降低发病率并防止过早终止药物治疗。目的:本研究旨在探讨PARPi治疗的卵巢癌患者发生SHAEs的危险因素,并建立SHAEs的风险预测模型。方法:收集2018年12月至2024年8月在广西医科大学附属肿瘤医院接受PARPi治疗的卵巢癌患者的前瞻性临床资料。根据SHAE的发生情况分为SHAE组和无SHAE组。使用卡方检验或Fisher精确检验筛选变量差异。采用多因素logistic回归确定影响卵巢癌患者SHAEs的独立因素。使用R软件从确定的独立危险因素中开发了卵巢癌治疗中严重血液相关并发症的预测模型。通过决策曲线分析(净效益)、校准(校准曲线)和鉴别(受试者工作特征曲线)来评估模型的临床效用。结果:本研究共纳入70例接受PARPi治疗的卵巢癌患者。在这70例患者中,16例(23%)经历了SHAEs,其中最常见的是红细胞(RBC)计数和血红蛋白水平下降。多元logistic回归分析确定了卵巢癌患者parpi相关SHAEs的4个独立预测因素:淋巴结转移(比值比[OR] 6.733, 95% CI 1.197-37.873; P= 0.03)、肌酐清除率≤60 mL / min(比值比[OR] 23.722, 95% CI 3.121-180.303; P= 0.002)、红细胞计数≤3.3×1012 / l(比值比[OR] 4.847, 95% CI 1.020-23.041; P= 0.047)、血管内皮生长因子抑制剂联合治疗(比值比[OR] 6.749, 95% CI 1.313-34.689; P= 0.02)。内部验证的曲线下面积为0.874 (95% CI 0.793-0.955),表明纳入这些预测因子的风险预测模型具有中等的临床实用性和准确性。结论:淋巴结转移、肌酐清除率≤60ml / min、红细胞计数≤3.3×1012 / l、联合血管内皮生长因子抑制剂治疗是卵巢癌PARPi SHAEs的独立危险因素。基于这些因素建立的风险预测模型具有中等的预测价值。
{"title":"Model for Predicting Serious Hematological Adverse Events in Individuals With Ovarian Cancer Receiving Poly (Adenosine Diphosphate Ribose) Polymerase Inhibitor Treatment: Prospective Cohort Study.","authors":"Xiaotong Lian, Yu Lei","doi":"10.2196/72994","DOIUrl":"10.2196/72994","url":null,"abstract":"<p><strong>Background: </strong>Predicting serious hematological adverse events (SHAEs) from poly (adenosine diphosphate ribose) polymerase inhibitors (PARPis) would allow us to prioritize patients with ovarian cancer at higher risk for more intensive care, ultimately lowering morbidity and preventing them from premature termination of medication.</p><p><strong>Objective: </strong>This study aimed to explore the risk factors for SHAEs in patients with ovarian cancer receiving PARPi treatment and develop a risk prediction model for such events.</p><p><strong>Methods: </strong>Prospective clinical data were collected on patients with ovarian cancer who received PARPi treatment at the Guangxi Medical University Affiliated Tumor Hospital from December 2018 to August 2024. They were divided into a SHAE group and a no-SHAE group based on the occurrence of SHAEs. Variable differences were screened using the chi-square test or Fisher exact test. Multivariate logistic regression was used to determine independent factors influencing SHAEs in patients with ovarian cancer. A predictive model for serious blood-related complications in ovarian cancer treatment was developed from identified independent risk factors using the R software. The model's clinical utility was assessed through decision curve analysis (net benefit), calibration (calibration curve), and discrimination (receiver operating characteristic curve).</p><p><strong>Results: </strong>A total of 70 patients with ovarian cancer receiving PARPi treatment were included in this study. Of these 70 patients, 16 (23%) experienced SHAEs, with decreases in red blood cell (RBC) count and hemoglobin levels being the most common. Multiple logistic regression analysis identified 4 independent predictors of PARPi-associated SHAEs in patients with ovarian cancer: lymph node metastasis (odds ratio [OR] 6.733, 95% CI 1.197-37.873; P=.03), creatinine clearance rate of ≤60 mL per minute (OR 23.722, 95% CI 3.121-180.303; P=.002), RBC count of ≤3.3×10<sup>12</sup> per liter (OR 4.847, 95% CI 1.020-23.041; P=.047), and combination therapy with vascular endothelial growth factor inhibitors (OR 6.749, 95% CI 1.313-34.689; P=.02). The internal validation yielded an area under the curve of 0.874 (95% CI 0.793-0.955), indicating moderate clinical utility and accuracy for the risk prediction model incorporating these predictors.</p><p><strong>Conclusions: </strong>Lymph node metastasis, creatinine clearance rate of ≤60 mL per minute, RBC count of ≤3.3×10<sup>12</sup> per liter, and combination therapy with vascular endothelial growth factor inhibitors are independent risk factors for PARPi SHAEs in patients with ovarian cancer. The risk prediction model established based on these factors demonstrated moderate predictive value.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e72994"},"PeriodicalIF":3.8,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12658389/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145508273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1