首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
The State of Practice About Security in Telemedicine Systems in Chile: Exploratory Study. 智利远程医疗系统安全实践现状:探索性研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-21 DOI: 10.2196/77395
Gaston Marquez, Michelle Pacheco, Priscilla Vergara, Felix Liberona, May Chomalí, Eric Rojas

Background: Information security within telemedicine systems is essential to advancing the digital transformation of health care. Telemedicine encompasses diverse modalities, including teleconsultation, telehealth, and remote patient monitoring, all of which depend on digital platforms, secured communication networks, and internet-connected devices. Although these systems have progressed in aligning with information security standards and regulations, there remains a shortage of comprehensive, practice-oriented studies evaluating which aspects of security are effectively addressed and which remain insufficiently managed, particularly within the Chilean context.

Objective: This study aims to examine how effectively telemedicine systems in Chile address the core security attributes of confidentiality, availability, and integrity.

Methods: Data were analyzed from an evaluation tool designed to assess the quality of telemedicine systems in Chile. Over a 6-year period, 25 telemedicine systems from different providers were assessed, and an in-depth examination of how companies manage key information security subcharacteristics within their systems was undertaken.

Results: The findings indicate that 52% (n=13) of telemedicine systems optimally implement cryptographic techniques to protect confidentiality. In contrast, 44% (n=11) lack robust strategies for adapting to, recovering from, and mitigating security-related incidents. Fault tolerance mechanisms are frequently integrated to minimize service disruption caused by system failures. However, the prioritization of data integrity varies: while some companies treat it as a critical requirement, others assign it limited importance.

Conclusions: This study offers an understanding of the security priorities and practices adopted by telemedicine providers. It highlights a prevailing tendency to prioritize security measures over usability, underscoring the need for a balanced approach that safeguards patient information while supporting efficient clinical workflows.

背景:远程医疗系统中的信息安全对于推进医疗保健的数字化转型至关重要。远程医疗包括多种模式,包括远程会诊、远程保健和远程患者监护,所有这些都依赖于数字平台、安全的通信网络和连接互联网的设备。尽管这些系统在与信息安全标准和法规保持一致方面取得了进展,但仍然缺乏全面的、以实践为导向的研究,评估安全的哪些方面得到了有效解决,哪些方面仍然管理不足,特别是在智利的情况下。目的:本研究旨在研究智利远程医疗系统如何有效地解决机密性、可用性和完整性的核心安全属性。方法:利用智利远程医疗系统质量评估工具对数据进行分析。在6年的时间里,对来自不同供应商的25个远程医疗系统进行了评估,并对公司如何管理其系统内的关键信息安全子特征进行了深入研究。结果:研究结果表明,52% (n=13)的远程医疗系统采用了最佳的加密技术来保护机密性。相比之下,44% (n=11)的企业缺乏适应、恢复和减轻安全相关事件的稳健策略。经常集成容错机制,以尽量减少系统故障造成的服务中断。然而,数据完整性的优先级各不相同:一些公司将其视为关键需求,而另一些公司则将其赋予有限的重要性。结论:本研究提供了对远程医疗提供商采用的安全优先级和实践的理解。它强调了优先考虑安全措施而不是可用性的普遍趋势,强调需要一种平衡的方法来保护患者信息,同时支持有效的临床工作流程。
{"title":"The State of Practice About Security in Telemedicine Systems in Chile: Exploratory Study.","authors":"Gaston Marquez, Michelle Pacheco, Priscilla Vergara, Felix Liberona, May Chomalí, Eric Rojas","doi":"10.2196/77395","DOIUrl":"10.2196/77395","url":null,"abstract":"<p><strong>Background: </strong>Information security within telemedicine systems is essential to advancing the digital transformation of health care. Telemedicine encompasses diverse modalities, including teleconsultation, telehealth, and remote patient monitoring, all of which depend on digital platforms, secured communication networks, and internet-connected devices. Although these systems have progressed in aligning with information security standards and regulations, there remains a shortage of comprehensive, practice-oriented studies evaluating which aspects of security are effectively addressed and which remain insufficiently managed, particularly within the Chilean context.</p><p><strong>Objective: </strong>This study aims to examine how effectively telemedicine systems in Chile address the core security attributes of confidentiality, availability, and integrity.</p><p><strong>Methods: </strong>Data were analyzed from an evaluation tool designed to assess the quality of telemedicine systems in Chile. Over a 6-year period, 25 telemedicine systems from different providers were assessed, and an in-depth examination of how companies manage key information security subcharacteristics within their systems was undertaken.</p><p><strong>Results: </strong>The findings indicate that 52% (n=13) of telemedicine systems optimally implement cryptographic techniques to protect confidentiality. In contrast, 44% (n=11) lack robust strategies for adapting to, recovering from, and mitigating security-related incidents. Fault tolerance mechanisms are frequently integrated to minimize service disruption caused by system failures. However, the prioritization of data integrity varies: while some companies treat it as a critical requirement, others assign it limited importance.</p><p><strong>Conclusions: </strong>This study offers an understanding of the security priorities and practices adopted by telemedicine providers. It highlights a prevailing tendency to prioritize security measures over usability, underscoring the need for a balanced approach that safeguards patient information while supporting efficient clinical workflows.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e77395"},"PeriodicalIF":3.8,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12638033/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145574951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Creation and Implementation of an Electronic Sexual Assault Record at the Geneva University Hospital. 在日内瓦大学医院创建和实施电子性侵犯记录。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-20 DOI: 10.2196/66764
Sara Cottler-Casanova, Laurène Rimondi, Monique Lamuela Naulin, Tony Fracasso, Jasmine Abdulcadir

Background: In Switzerland, sexual assault reports have historically been documented on paper, which limited standardization, completeness, and challenges to produce reliable statistics.

Objective: This study describes the development and implementation of an Electronic Sexual Assault Record (eSAR) within Geneva University Hospitals' Electronic Medical Record (EMR) system, with the aim of improving data quality, documentation, and multidisciplinary coordination.

Methods: The eSAR was developed by a multidisciplinary team including forensic doctors, gynecologists, nurses (clinical and informatics), epidemiologists, and IT specialists. Its structure was based on existing hospital protocols and international recommendations. Variables were defined as "essential" or "highly recommended," with structured fields to ensure completeness and comparability. Confidentiality was safeguarded through restricted access and regular audits.

Results: The eSAR was launched in June 2022 and revised in 2023 after user feedback and training. Since implementation, 382 reports have been completed. Data quality improved substantially, with major reductions in missing information. The system also streamlined workflows and strengthened collaboration across specialties.

Conclusions: The eSAR improved documentation and data reliability, providing a replicable model for standardized sexual assault reporting in Switzerland.

背景:在瑞士,性侵犯报告历来以书面形式记录,这限制了标准化、完整性和产生可靠统计数据的挑战。目的:本研究描述了日内瓦大学医院电子病历(EMR)系统中电子性侵犯记录(eSAR)的开发和实施,旨在提高数据质量、文件记录和多学科协调。方法:eSAR是由包括法医、妇科医生、护士(临床和信息学)、流行病学家和IT专家在内的多学科团队开发的。其结构以现有的医院规程和国际建议为基础。变量被定义为“必要的”或“强烈推荐的”,具有结构化字段以确保完整性和可比性。通过限制接触和定期审计来保障机密性。结果:eSAR于2022年6月上线,经过用户反馈和培训,于2023年进行修订。自执行以来,已完成382份报告。数据质量大大提高,缺失信息大大减少。该系统还简化了工作流程,加强了各专业之间的协作。结论:eSAR提高了文件和数据的可靠性,为瑞士的标准化性侵犯报告提供了可复制的模式。
{"title":"Creation and Implementation of an Electronic Sexual Assault Record at the Geneva University Hospital.","authors":"Sara Cottler-Casanova, Laurène Rimondi, Monique Lamuela Naulin, Tony Fracasso, Jasmine Abdulcadir","doi":"10.2196/66764","DOIUrl":"10.2196/66764","url":null,"abstract":"<p><strong>Background: </strong>In Switzerland, sexual assault reports have historically been documented on paper, which limited standardization, completeness, and challenges to produce reliable statistics.</p><p><strong>Objective: </strong>This study describes the development and implementation of an Electronic Sexual Assault Record (eSAR) within Geneva University Hospitals' Electronic Medical Record (EMR) system, with the aim of improving data quality, documentation, and multidisciplinary coordination.</p><p><strong>Methods: </strong>The eSAR was developed by a multidisciplinary team including forensic doctors, gynecologists, nurses (clinical and informatics), epidemiologists, and IT specialists. Its structure was based on existing hospital protocols and international recommendations. Variables were defined as \"essential\" or \"highly recommended,\" with structured fields to ensure completeness and comparability. Confidentiality was safeguarded through restricted access and regular audits.</p><p><strong>Results: </strong>The eSAR was launched in June 2022 and revised in 2023 after user feedback and training. Since implementation, 382 reports have been completed. Data quality improved substantially, with major reductions in missing information. The system also streamlined workflows and strengthened collaboration across specialties.</p><p><strong>Conclusions: </strong>The eSAR improved documentation and data reliability, providing a replicable model for standardized sexual assault reporting in Switzerland.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e66764"},"PeriodicalIF":3.8,"publicationDate":"2025-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12634005/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Machine Learning and Robotic Process Automation to Identify and Convert Unstructured Colonoscopy Results Into Actionable Data: Proof-of-Concept Study. 利用机器学习和机器人过程自动化来识别和转换非结构化结肠镜检查结果为可操作的数据:概念验证研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-20 DOI: 10.2196/73504
Elizabeth R Stevens, Jager Hartman, Paul Testa, Ajay Mansukhani, Casey Monina, Amelia Shunk, David Ranson, Yana Imberg, Ann Cote, Dinesha Prabhu, Adam Szerencsy

Background: With rising patient volumes and a focus on quality, our health system had the objective to create a more efficient way to ensure accurate documentation of colorectal cancer (CRC) screening intervals from inbound colonoscopy reports to ensure timely follow-up. We developed an integrated end-to-end workflow solution using machine learning (ML) and robotic process automation (RPA) to extract and update electronic health record (EHR) follow-up dates from unstructured data.

Objective: This study aimed to automate data extraction from external, free-text colonoscopy reports to identify and document recommended follow-up dates for CRC screening in structured EHR fields.

Methods: As proof of concept, we outline the process development, validity, and implementation of an approach that integrates available tools to automate data retrieval and entry within the EHR of a large academic health system. The health system uses Epic Systems as its EHR platform, and the ML model used was trained on health system patient colonoscopy reports. This proof-of-concept process study consisted of six stages: (1) identification of gaps in documenting recommendations for follow-up CRC screening from external colonoscopy reports, (2) defining process objectives, (3) identification of technologies, (4) creation of process architecture, (5) process validation, and (6) health system-wide implementation. A chart review was performed to validate process outcomes and estimate impact.

Results: We developed an automated process with 3 primary steps leveraging ML and RPA to create a fully orchestrated workflow to update CRC screening recall dates based on colonoscopy reports received from external sources. Process validity was assessed with 690 scanned colonoscopy reports. During process validation, the overall automated process achieved an accuracy of 80.7% (557/690, 95% CI 77.8%-83.7%) for correctly identifying the presence or absence of a valid follow-up date and a follow-up date false negative identification rate of 32.9% (130/395, 95% CI 29.4%-36.4%). From the organization-wide implementation to go-live until December 31, 2024, the system processed 16,563 external colonoscopy reports. Of these, 35.3% (5841/16,563) had a follow-up date meeting the relevant ML model threshold and thus were identified as ready for RPA processing.

Conclusions: Implementation of an automated workflow to extract and update CRC screening follow-up dates from colonoscopy reports is feasible and has the potential to improve accuracy in patient recall while reducing documentation burden. By standardizing data ingestion, extending this approach to various unstructured data types can address deficiencies in structured EHR documentation and solve for a lack of data integration and reporting for quality measures. Automated workflows leveraging ML and RPA offer practical solutions to overc

背景:随着患者数量的增加和对质量的关注,我们的卫生系统的目标是创造一种更有效的方法来确保从入境结肠镜检查报告中准确记录结直肠癌(CRC)筛查间隔,以确保及时随访。我们开发了一个集成的端到端工作流解决方案,使用机器学习(ML)和机器人过程自动化(RPA)从非结构化数据中提取和更新电子健康记录(EHR)随访日期。目的:本研究旨在从外部、自由文本结肠镜检查报告中自动提取数据,以确定和记录结构化电子病历领域中CRC筛查的推荐随访日期。方法:作为概念的证明,我们概述了过程的发展,有效性,并实施了一种方法,该方法集成了可用的工具,以在大型学术卫生系统的电子病历中自动检索和输入数据。该卫生系统使用Epic Systems作为其电子病历平台,所使用的ML模型是根据卫生系统患者结肠镜检查报告进行培训的。这项概念验证过程研究包括六个阶段:(1)确定外部结肠镜检查报告中CRC后续筛查建议文件的差距;(2)确定过程目标;(3)确定技术;(4)创建过程架构;(5)过程验证;(6)卫生系统范围内的实施。进行了图表评审,以验证过程结果并估计影响。结果:我们开发了一个包含3个主要步骤的自动化流程,利用ML和RPA来创建一个完全编排的工作流程,以根据从外部来源收到的结肠镜检查报告更新CRC筛查召回日期。通过690份扫描结肠镜检查报告评估过程效度。在工艺验证过程中,整个自动化过程在正确识别有效随访日期的存在与否方面的准确率为80.7% (557/690,95% CI 77.8%-83.7%),随访日期假阴性识别率为32.9% (130/395,95% CI 29.4%-36.4%)。从整个组织范围的实施到上线,直到2024年12月31日,该系统处理了16,563份外部结肠镜检查报告。其中,35.3%(5841/ 16563)的随访日期符合相关的ML模型阈值,因此被确定为准备好进行RPA处理。结论:从结肠镜检查报告中提取和更新结直肠癌筛查随访日期的自动化工作流程的实施是可行的,并且有可能提高患者回忆的准确性,同时减少文件负担。通过标准化数据摄取,将此方法扩展到各种非结构化数据类型,可以解决结构化EHR文档中的缺陷,并解决缺乏数据集成和质量度量报告的问题。利用ML和RPA的自动化工作流为克服互操作性挑战和医疗保健系统中非结构化数据的使用提供了实用的解决方案。
{"title":"Leveraging Machine Learning and Robotic Process Automation to Identify and Convert Unstructured Colonoscopy Results Into Actionable Data: Proof-of-Concept Study.","authors":"Elizabeth R Stevens, Jager Hartman, Paul Testa, Ajay Mansukhani, Casey Monina, Amelia Shunk, David Ranson, Yana Imberg, Ann Cote, Dinesha Prabhu, Adam Szerencsy","doi":"10.2196/73504","DOIUrl":"10.2196/73504","url":null,"abstract":"<p><strong>Background: </strong>With rising patient volumes and a focus on quality, our health system had the objective to create a more efficient way to ensure accurate documentation of colorectal cancer (CRC) screening intervals from inbound colonoscopy reports to ensure timely follow-up. We developed an integrated end-to-end workflow solution using machine learning (ML) and robotic process automation (RPA) to extract and update electronic health record (EHR) follow-up dates from unstructured data.</p><p><strong>Objective: </strong>This study aimed to automate data extraction from external, free-text colonoscopy reports to identify and document recommended follow-up dates for CRC screening in structured EHR fields.</p><p><strong>Methods: </strong>As proof of concept, we outline the process development, validity, and implementation of an approach that integrates available tools to automate data retrieval and entry within the EHR of a large academic health system. The health system uses Epic Systems as its EHR platform, and the ML model used was trained on health system patient colonoscopy reports. This proof-of-concept process study consisted of six stages: (1) identification of gaps in documenting recommendations for follow-up CRC screening from external colonoscopy reports, (2) defining process objectives, (3) identification of technologies, (4) creation of process architecture, (5) process validation, and (6) health system-wide implementation. A chart review was performed to validate process outcomes and estimate impact.</p><p><strong>Results: </strong>We developed an automated process with 3 primary steps leveraging ML and RPA to create a fully orchestrated workflow to update CRC screening recall dates based on colonoscopy reports received from external sources. Process validity was assessed with 690 scanned colonoscopy reports. During process validation, the overall automated process achieved an accuracy of 80.7% (557/690, 95% CI 77.8%-83.7%) for correctly identifying the presence or absence of a valid follow-up date and a follow-up date false negative identification rate of 32.9% (130/395, 95% CI 29.4%-36.4%). From the organization-wide implementation to go-live until December 31, 2024, the system processed 16,563 external colonoscopy reports. Of these, 35.3% (5841/16,563) had a follow-up date meeting the relevant ML model threshold and thus were identified as ready for RPA processing.</p><p><strong>Conclusions: </strong>Implementation of an automated workflow to extract and update CRC screening follow-up dates from colonoscopy reports is feasible and has the potential to improve accuracy in patient recall while reducing documentation burden. By standardizing data ingestion, extending this approach to various unstructured data types can address deficiencies in structured EHR documentation and solve for a lack of data integration and reporting for quality measures. Automated workflows leveraging ML and RPA offer practical solutions to overc","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e73504"},"PeriodicalIF":3.8,"publicationDate":"2025-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12634012/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of Postoperative Venous Thromboembolism in Patients With Traumatic Brain Injury: Model Development and Validation Study. 外伤性脑损伤患者术后静脉血栓栓塞的预测:模型开发和验证研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-17 DOI: 10.2196/78655
Jiang Zheng, Qiling Jiang, Yusheng Zhan, Yanming Tang, Xiaohui Du, Guangrong Xiang, Yufang Ouyang, Hong Fu

Background: Venous thromboembolism (VTE) remains a critical cause of mortality among patients who are hospitalized. Patients with traumatic brain injury (TBI) are particularly susceptible to VTE due to coagulation abnormalities and immobilization. Despite this elevated risk, no validated predictive model currently exists for postoperative VTE in populations with TBI.

Objective: This study aims to develop machine learning (ML)-based predictive models for VTE in patients with TBI undergoing surgical procedures, with a focus on clinical translatability.

Methods: Data were collected from patients with TBI who underwent surgical treatment at Chongqing University Central Hospital (from October 2016 to December 2024). The dataset was randomly partitioned into a training set and an internal test set in a 7:3 ratio. The recursive feature elimination algorithm was applied for feature selection, followed by the synthetic minority oversampling technique to address class imbalance. Six ML models, including logistic regression (LR), random forest, gradient boosting decision tree, extreme gradient boosting, support vector machine, and categorical boosting, were trained and validated. Model performance was evaluated using receiver operating characteristic analysis, calibration curves (assessing probability-observation alignment), and decision curve analysis to quantify clinical net benefit. For the LR model, clinical utility was enhanced through nomogram construction, with Shapley additive explanation values providing interpretability.

Results: A total of 1806 participants were enrolled in this study, and 257 (14.2%) experienced VTE events. All ML models demonstrated strong predictive performance, with area under the receiver operating characteristic curve values ranging from 0.79 to 0.83. The LR model exhibited the highest discriminatory power (area under the receiver operating characteristic curve 0.83; accuracy 0.80; specificity 0.83). Calibration curves confirmed that the LR model provided well-calibrated probability estimates. Shapley additive explanations analysis identified key contributors to VTE risk and transformed model outputs into individualized risk predictions. A user-friendly postoperative VTE risk prediction nomogram was developed for patients with TBI.

Conclusions: This study successfully developed and validated multiple ML models for postoperative VTE prediction in patients with TBI. The LR-based nomogram, supported by calibration and decision curve validation, offers a clinically actionable tool to guide thromboprophylaxis strategies. Future external validation across diverse populations is warranted to confirm generalizability.

背景:静脉血栓栓塞(VTE)仍然是住院患者死亡的一个重要原因。外伤性脑损伤(TBI)患者由于凝血异常和固定化特别容易发生静脉血栓栓塞。尽管这种风险升高,但目前还没有有效的预测TBI患者术后静脉血栓栓塞的模型。目的:本研究旨在开发基于机器学习(ML)的TBI手术患者静脉血栓栓塞预测模型,重点是临床可翻译性。方法:收集2016年10月至2024年12月重庆大学中心医院手术治疗的TBI患者资料。将数据集按7:3的比例随机划分为训练集和内部测试集。采用递归特征消除算法进行特征选择,采用合成少数派过采样技术解决类不平衡问题。6个ML模型,包括逻辑回归(LR)、随机森林、梯度增强决策树、极端梯度增强、支持向量机和分类增强,进行训练和验证。使用受试者工作特征分析、校准曲线(评估概率-观察对齐)和决策曲线分析来评估模型的性能,以量化临床净效益。对于LR模型,通过nomogram construction, Shapley additive explanation values提供了可解释性,增强了临床实用性。结果:共有1806名参与者参加了这项研究,257名(14.2%)经历了静脉血栓栓塞事件。所有ML模型均表现出较强的预测性能,受试者工作特征曲线下的面积范围为0.79至0.83。LR模型表现出最高的区分能力(受试者工作特征曲线下面积0.83,准确度0.80,特异性0.83)。校准曲线证实LR模型提供了校准良好的概率估计。Shapley加性解释分析确定了静脉血栓栓塞风险的关键因素,并将模型输出转化为个性化的风险预测。为TBI患者开发了一个用户友好的术后静脉血栓栓塞风险预测图。结论:本研究成功开发并验证了用于TBI患者术后VTE预测的多种ML模型。基于lr的nomogram,在校准和决策曲线验证的支持下,为指导血栓预防策略提供了一种临床可行的工具。未来有必要对不同人群进行外部验证,以确认通用性。
{"title":"Prediction of Postoperative Venous Thromboembolism in Patients With Traumatic Brain Injury: Model Development and Validation Study.","authors":"Jiang Zheng, Qiling Jiang, Yusheng Zhan, Yanming Tang, Xiaohui Du, Guangrong Xiang, Yufang Ouyang, Hong Fu","doi":"10.2196/78655","DOIUrl":"10.2196/78655","url":null,"abstract":"<p><strong>Background: </strong>Venous thromboembolism (VTE) remains a critical cause of mortality among patients who are hospitalized. Patients with traumatic brain injury (TBI) are particularly susceptible to VTE due to coagulation abnormalities and immobilization. Despite this elevated risk, no validated predictive model currently exists for postoperative VTE in populations with TBI.</p><p><strong>Objective: </strong>This study aims to develop machine learning (ML)-based predictive models for VTE in patients with TBI undergoing surgical procedures, with a focus on clinical translatability.</p><p><strong>Methods: </strong>Data were collected from patients with TBI who underwent surgical treatment at Chongqing University Central Hospital (from October 2016 to December 2024). The dataset was randomly partitioned into a training set and an internal test set in a 7:3 ratio. The recursive feature elimination algorithm was applied for feature selection, followed by the synthetic minority oversampling technique to address class imbalance. Six ML models, including logistic regression (LR), random forest, gradient boosting decision tree, extreme gradient boosting, support vector machine, and categorical boosting, were trained and validated. Model performance was evaluated using receiver operating characteristic analysis, calibration curves (assessing probability-observation alignment), and decision curve analysis to quantify clinical net benefit. For the LR model, clinical utility was enhanced through nomogram construction, with Shapley additive explanation values providing interpretability.</p><p><strong>Results: </strong>A total of 1806 participants were enrolled in this study, and 257 (14.2%) experienced VTE events. All ML models demonstrated strong predictive performance, with area under the receiver operating characteristic curve values ranging from 0.79 to 0.83. The LR model exhibited the highest discriminatory power (area under the receiver operating characteristic curve 0.83; accuracy 0.80; specificity 0.83). Calibration curves confirmed that the LR model provided well-calibrated probability estimates. Shapley additive explanations analysis identified key contributors to VTE risk and transformed model outputs into individualized risk predictions. A user-friendly postoperative VTE risk prediction nomogram was developed for patients with TBI.</p><p><strong>Conclusions: </strong>This study successfully developed and validated multiple ML models for postoperative VTE prediction in patients with TBI. The LR-based nomogram, supported by calibration and decision curve validation, offers a clinically actionable tool to guide thromboprophylaxis strategies. Future external validation across diverse populations is warranted to confirm generalizability.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e78655"},"PeriodicalIF":3.8,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12670048/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145544234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the Accuracy of the Frysian Questionnaire for Differentiation of Musculoskeletal Complaints for Triage of Musculoskeletal Diseases: Algorithm Development and Validation Study. 评估肌肉骨骼疾病分诊中肌肉骨骼主诉的Frysian问卷的准确性:算法开发与验证研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-17 DOI: 10.2196/77345
Tjardo Daniël Maarseveen, Floor Reimann, Ahmed Al Hasan, Annemarie Schilder, Dan Zhang, Freke Wink, Lidy Hendriks, Rachel Knevel, Reinhard Bos
<p><strong>Background: </strong>Inflammatory rheumatic diseases (IRDs) affect 5% of the general population, whereas 35% of the population experiences musculoskeletal concerns. IRDs cause early disability, reduced life expectancy, and considerable health care costs. Early diagnosis is essential to prevent long-term damage. Similarly important is the early identification of patients with musculoskeletal concerns without IRDs to prevent unnecessary health care expenses. Of the population referred to the rheumatologist, 60% have noninflammatory musculoskeletal concerns, whereas only 20% of patients with an IRD see a rheumatologist within 3 months of symptom onset. The need for digital predictive (triage) tools for rheumatic and musculoskeletal diseases led to the development of the Frysian Questionnaire for Differentiation of Musculoskeletal Complaints (FRYQ).</p><p><strong>Objective: </strong>This study aimed to assess whether the FRYQ can distinguish IRD from noninflammatory musculoskeletal concerns in general, and rheumatoid arthritis and fibromyalgia specifically, in newly referred patients.</p><p><strong>Methods: </strong>The FRYQ is an 87-item tool (20 open-ended and 67 closed-ended questions) used to triage new rheumatology patients at Frisius Medical Center in the Netherlands. We analyzed data from 2 sources: dataset A with 728 outpatient clinic patients and dataset B with 373 patients from the Joint Pain Assessment Scoring Tool study. We built a classifier using Extreme Gradient Boosting to distinguish inflammatory from noninflammatory conditions based on closed-ended questions. Using elastic net regularization, we identified the most informative questions. We evaluated classification using receiver operating characteristic curve analysis and assessed feature importance through Shapley Additive Explanation analysis. To test generalizability, we replicated our analysis on dataset B. Finally, we examined whether the questions of the FRYQ could be used to identify specific conditions beyond the general categories of IRD and non-IRD, specifically for detecting fibromyalgia and rheumatoid arthritis.</p><p><strong>Results: </strong>Feature selection reduced the questionnaire from 67 to 28 items while maintaining discriminative power. After initial development, the model achieved an area under the receiver operating characteristic curve (AUC-ROC) of 0.72 (95% CI 0.67-0.78) for distinguishing inflammatory from noninflammatory conditions in an external validation set. Using a probability threshold of 0.30, the model achieved 71% sensitivity and 56% specificity on external validation. The FRYQ demonstrated stronger performance in identifying specific diagnoses such as fibromyalgia (AUC-ROC=0.81) and rheumatoid arthritis (AUC-ROC=0.77). Key discriminating features included symptom duration, pain response to movement, and anti-inflammatory medication effectiveness.</p><p><strong>Conclusions: </strong>The FRYQ effectively distinguishes inflammatory from
背景:炎性风湿病(IRDs)影响5%的普通人群,而35%的人群有肌肉骨骼问题。IRDs会导致早期残疾、预期寿命缩短和大量医疗保健费用。早期诊断对于预防长期损害至关重要。同样重要的是,在没有ird的情况下早期识别患有肌肉骨骼问题的患者,以防止不必要的医疗保健费用。在向风湿病学家求诊的人群中,60%有非炎症性肌肉骨骼问题,而只有20%的IRD患者在症状出现后3个月内求诊。对风湿病和肌肉骨骼疾病的数字预测(分诊)工具的需求导致了肌肉骨骼疾病鉴别弗里斯调查问卷(FRYQ)的发展。目的:本研究旨在评估FRYQ是否可以在新转诊的患者中区分IRD与非炎症性肌肉骨骼问题,特别是类风湿性关节炎和纤维肌痛。方法:FRYQ是一个87项工具(20个开放式问题和67个封闭式问题),用于对荷兰Frisius医学中心的新风湿病患者进行分类。我们分析了来自两个来源的数据:数据集A有728名门诊患者,数据集B有373名来自关节疼痛评估评分工具研究的患者。我们建立了一个基于封闭问题的分类器,使用极端梯度增强来区分炎症和非炎症状况。使用弹性网正则化,我们确定了信息量最大的问题。我们使用受试者工作特征曲线分析评估分类,并通过Shapley加性解释分析评估特征重要性。为了测试通用性,我们在数据集b上重复了我们的分析。最后,我们检查了FRYQ的问题是否可以用于识别IRD和非IRD的一般类别之外的特定条件,特别是用于检测纤维肌痛和类风湿性关节炎。结果:特征选择使问卷从67个条目减少到28个条目,同时保持了判别能力。经过初步开发,该模型在外部验证集中区分炎症和非炎症情况的受试者工作特征曲线下面积(AUC-ROC)为0.72 (95% CI 0.67-0.78)。使用0.30的概率阈值,该模型在外部验证中获得了71%的灵敏度和56%的特异性。FRYQ在识别纤维肌痛(AUC-ROC=0.81)和类风湿关节炎(AUC-ROC=0.77)等特定诊断方面表现出更强的性能。关键的鉴别特征包括症状持续时间、疼痛对运动的反应和抗炎药物的有效性。结论:FRYQ在专家会诊前能有效区分炎症性和非炎症性风湿病,在识别纤维肌痛和类风湿性关节炎方面表现出特别的优势。该工具可以通过优先考虑具有高IRD可能性的转诊进行早期风湿病学家评估,同时指导其他患者使用适当的替代资源,从而改善风湿病分诊。需要前瞻性研究来确定FRYQ对临床结果和卫生保健效率的影响。
{"title":"Evaluating the Accuracy of the Frysian Questionnaire for Differentiation of Musculoskeletal Complaints for Triage of Musculoskeletal Diseases: Algorithm Development and Validation Study.","authors":"Tjardo Daniël Maarseveen, Floor Reimann, Ahmed Al Hasan, Annemarie Schilder, Dan Zhang, Freke Wink, Lidy Hendriks, Rachel Knevel, Reinhard Bos","doi":"10.2196/77345","DOIUrl":"10.2196/77345","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Inflammatory rheumatic diseases (IRDs) affect 5% of the general population, whereas 35% of the population experiences musculoskeletal concerns. IRDs cause early disability, reduced life expectancy, and considerable health care costs. Early diagnosis is essential to prevent long-term damage. Similarly important is the early identification of patients with musculoskeletal concerns without IRDs to prevent unnecessary health care expenses. Of the population referred to the rheumatologist, 60% have noninflammatory musculoskeletal concerns, whereas only 20% of patients with an IRD see a rheumatologist within 3 months of symptom onset. The need for digital predictive (triage) tools for rheumatic and musculoskeletal diseases led to the development of the Frysian Questionnaire for Differentiation of Musculoskeletal Complaints (FRYQ).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aimed to assess whether the FRYQ can distinguish IRD from noninflammatory musculoskeletal concerns in general, and rheumatoid arthritis and fibromyalgia specifically, in newly referred patients.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;The FRYQ is an 87-item tool (20 open-ended and 67 closed-ended questions) used to triage new rheumatology patients at Frisius Medical Center in the Netherlands. We analyzed data from 2 sources: dataset A with 728 outpatient clinic patients and dataset B with 373 patients from the Joint Pain Assessment Scoring Tool study. We built a classifier using Extreme Gradient Boosting to distinguish inflammatory from noninflammatory conditions based on closed-ended questions. Using elastic net regularization, we identified the most informative questions. We evaluated classification using receiver operating characteristic curve analysis and assessed feature importance through Shapley Additive Explanation analysis. To test generalizability, we replicated our analysis on dataset B. Finally, we examined whether the questions of the FRYQ could be used to identify specific conditions beyond the general categories of IRD and non-IRD, specifically for detecting fibromyalgia and rheumatoid arthritis.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Feature selection reduced the questionnaire from 67 to 28 items while maintaining discriminative power. After initial development, the model achieved an area under the receiver operating characteristic curve (AUC-ROC) of 0.72 (95% CI 0.67-0.78) for distinguishing inflammatory from noninflammatory conditions in an external validation set. Using a probability threshold of 0.30, the model achieved 71% sensitivity and 56% specificity on external validation. The FRYQ demonstrated stronger performance in identifying specific diagnoses such as fibromyalgia (AUC-ROC=0.81) and rheumatoid arthritis (AUC-ROC=0.77). Key discriminating features included symptom duration, pain response to movement, and anti-inflammatory medication effectiveness.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;The FRYQ effectively distinguishes inflammatory from ","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e77345"},"PeriodicalIF":3.8,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12622856/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145544266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Methods for Addressing Missingness in Electronic Health Record Data for Clinical Prediction Models: Comparative Evaluation. 解决临床预测模型中电子健康记录数据缺失的方法:比较评价。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-14 DOI: 10.2196/79307
Jean Digitale, Deborah Franzon, Mark J Pletcher, Charles E McCulloch, Efstathios D Gennatas

Background: Missing data are a common challenge in electronic health record (EHR)-based prediction modeling. Traditional imputation methods may not suit prediction or machine learning models, and real-world use requires workflows that are implementable for both model development and real-time prediction.

Objective: We evaluated methods for handling missing data when using EHR data to build clinical prediction models for patients admitted to the pediatric intensive care unit (PICU).

Methods: Using EHR data containing missing values from an academic medical center PICU, we generated a synthetic complete dataset. From this, we created 300 datasets with missing data under varying mechanisms and proportions of missingness for the outcomes of (1) successful extubation (binary) and (2) blood pressure (continuous). We assessed strategies to address missing data including simple methods (eg, last observation carried forward [LOCF]), complex methods (eg, random forest multiple imputation), and native support for missing values in outcome prediction models.

Results: Across 886 patients and 1220 intubation events, 18.2% of original data were missing. LOCF had the lowest imputation error, followed by random forest imputation (average mean squared error [MSE] improvement over mean imputation: 0.41 [range: 0.30, 0.50] and 0.33 [0.21, 0.43], respectively). LOCF generally outperformed other imputation methods across outcome metrics and models (mean improvement: 1.28% [range: -0.07%, 7.2%]). Imputation methods showed more performance variability for the binary outcome (balanced accuracy coefficient of variation: 0.042) than the continuous outcome (mean squared error coefficient of variation: 0.001).

Conclusions: Traditional imputation methods for inferential statistics, such as multiple imputation, may not be optimal for prediction models. The amount of missingness influenced performance more than the missingness mechanism. In datasets with frequent measurements, LOCF and native support for missing values in machine learning models offer reasonable performance for handling missingness at minimal computational cost in predictive analyses.

背景:在基于电子健康记录(EHR)的预测建模中,数据缺失是一个常见的挑战。传统的输入方法可能不适合预测或机器学习模型,并且现实世界的使用需要可实现模型开发和实时预测的工作流。目的:我们评估在使用电子病历数据建立儿科重症监护病房(PICU)患者临床预测模型时处理缺失数据的方法。方法:利用某学术医疗中心PICU中包含缺失值的EHR数据,我们生成了一个合成的完整数据集。由此,我们创建了300个数据集,其中包含不同机制和缺失比例下的缺失数据,用于(1)成功拔管(二元)和(2)血压(连续)的结果。我们评估了解决缺失数据的策略,包括简单方法(例如,最后一次观测结转[LOCF])、复杂方法(例如,随机森林多重插值)和结果预测模型中缺失值的本地支持。结果:在886例患者和1220例插管事件中,18.2%的原始数据丢失。LOCF的估计误差最低,其次是随机森林估计(平均均方误差[MSE]比平均估计分别提高了0.41[范围:0.30,0.50]和0.33[0.21,0.43])。在结果指标和模型上,LOCF总体上优于其他估算方法(平均改进:1.28%[范围:-0.07%,7.2%])。与连续结果(均方误差变异系数:0.001)相比,二元结果(平衡精度变异系数:0.042)的归算方法表现出更多的性能可变性。结论:传统的推断统计方法,如多重插值,可能不适合预测模型。缺失量对性能的影响大于缺失机制。在频繁测量的数据集中,LOCF和机器学习模型中对缺失值的本地支持为在预测分析中以最小的计算成本处理缺失提供了合理的性能。
{"title":"Methods for Addressing Missingness in Electronic Health Record Data for Clinical Prediction Models: Comparative Evaluation.","authors":"Jean Digitale, Deborah Franzon, Mark J Pletcher, Charles E McCulloch, Efstathios D Gennatas","doi":"10.2196/79307","DOIUrl":"10.2196/79307","url":null,"abstract":"<p><strong>Background: </strong>Missing data are a common challenge in electronic health record (EHR)-based prediction modeling. Traditional imputation methods may not suit prediction or machine learning models, and real-world use requires workflows that are implementable for both model development and real-time prediction.</p><p><strong>Objective: </strong>We evaluated methods for handling missing data when using EHR data to build clinical prediction models for patients admitted to the pediatric intensive care unit (PICU).</p><p><strong>Methods: </strong>Using EHR data containing missing values from an academic medical center PICU, we generated a synthetic complete dataset. From this, we created 300 datasets with missing data under varying mechanisms and proportions of missingness for the outcomes of (1) successful extubation (binary) and (2) blood pressure (continuous). We assessed strategies to address missing data including simple methods (eg, last observation carried forward [LOCF]), complex methods (eg, random forest multiple imputation), and native support for missing values in outcome prediction models.</p><p><strong>Results: </strong>Across 886 patients and 1220 intubation events, 18.2% of original data were missing. LOCF had the lowest imputation error, followed by random forest imputation (average mean squared error [MSE] improvement over mean imputation: 0.41 [range: 0.30, 0.50] and 0.33 [0.21, 0.43], respectively). LOCF generally outperformed other imputation methods across outcome metrics and models (mean improvement: 1.28% [range: -0.07%, 7.2%]). Imputation methods showed more performance variability for the binary outcome (balanced accuracy coefficient of variation: 0.042) than the continuous outcome (mean squared error coefficient of variation: 0.001).</p><p><strong>Conclusions: </strong>Traditional imputation methods for inferential statistics, such as multiple imputation, may not be optimal for prediction models. The amount of missingness influenced performance more than the missingness mechanism. In datasets with frequent measurements, LOCF and native support for missing values in machine learning models offer reasonable performance for handling missingness at minimal computational cost in predictive analyses.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e79307"},"PeriodicalIF":3.8,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12617989/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145524941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Named Entity Recognition for Chinese Cancer Electronic Health Records-Development and Evaluation of a Domain-Specific BERT Model: Quantitative Study. 中国癌症电子病历的命名实体识别——特定领域BERT模型的开发与评价:定量研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-14 DOI: 10.2196/76912
Junbai Chen, Butian Zhao, Xiaohan Tian, Zhengkai Zou, Ruojia Wang, Jiarui Wu, Songxing Du, Fengying Guo

Background: The unstructured data of Chinese cancer electronic health records (EHRs) contains valuable medical expertise. Accurate medical entity recognition is crucial for building a medical-assisted decision system. Named entity recognition (NER) in cancer EHRs typically uses general models designed for English medical records. There is a lack of specialized handling for cancer-specific records and limited application to Chinese medical records.

Objective: This study aims to propose a specific NER model to enhance the recognition of medical entities in Chinese cancer EHRs.

Methods: Desensitized inpatient EHRs related to breast cancer were collected from a leading hospital in Beijing. Building upon the MC Bidirectional Encoder Representations from Transformers (BERT) foundation, the study further incorporated a Chinese cancer corpus for pretraining, resulting in the construction of the ChCancerBERT pretrained model. In conjunction with dilated-gated convolutional neural networks, bidirectional long short-term memory, multihead attention mechanism, and a conditional random field, this model forms a multimodel, multilevel integrated NER approach.

Results: This approach effectively extracts medical entity features related to symptoms, signs, tests, treatments, and time in Chinese breast cancer EHRs. The entity recognition performance of the proposed model surpasses that of the baseline model and other models compared in the experiment. The F1-score reached 86.93%, precision reached 87.24%, and recall reached 86.61%. The model introduced in this study demonstrates exceptional performance on the CCKS2019 dataset, attaining a precision rate of 87.26%, a recall rate of 87.27%, and an impressive F1-score of 87.26%, surpassing that of existing models.

Conclusions: The experiments demonstrate that the approach proposed in this study exhibits excellent performance in NER within breast cancer EHRs. This advancement will further contribute to clinical decision support for cancer treatment and research. In addition, the study reveals that incorporating domain-specific corpora in clinical NER tasks can further enhance the performance of BERT models in specialized domains.

背景:中国癌症电子健康档案(EHRs)的非结构化数据包含了宝贵的医学知识。准确的医疗实体识别是构建医疗辅助决策系统的关键。癌症电子病历中的命名实体识别(NER)通常使用针对英语医疗记录设计的通用模型。缺乏对癌症特定记录的专门处理,对中国医疗记录的应用也有限。目的:本研究旨在提出一个特定的NER模型,以提高中国癌症电子病历对医疗实体的识别。方法:收集北京市某知名医院脱敏乳腺癌住院患者的电子病历。在BERT (MC Bidirectional Encoder Representations from Transformers)的基础上,进一步纳入中文癌症语料库进行预训练,构建了ChCancerBERT预训练模型。该模型与扩张型门控卷积神经网络、双向长短期记忆、多头注意机制和条件随机场相结合,形成了一种多模型、多层次的集成NER方法。结果:该方法有效地提取了中国乳腺癌电子病历中与症状、体征、检查、治疗和时间相关的医疗实体特征。该模型的实体识别性能优于基线模型和实验中比较的其他模型。f1得分达到86.93%,准确率达到87.24%,召回率达到86.61%。本研究引入的模型在CCKS2019数据集上表现出色,准确率达到87.26%,召回率达到87.27%,f1得分达到87.26%,超过了现有模型。结论:实验表明,本研究提出的方法在乳腺癌电子病历的NER中表现优异。这一进展将进一步有助于癌症治疗和研究的临床决策支持。此外,研究表明,在临床NER任务中加入特定领域的语料库可以进一步提高BERT模型在专业领域的性能。
{"title":"Named Entity Recognition for Chinese Cancer Electronic Health Records-Development and Evaluation of a Domain-Specific BERT Model: Quantitative Study.","authors":"Junbai Chen, Butian Zhao, Xiaohan Tian, Zhengkai Zou, Ruojia Wang, Jiarui Wu, Songxing Du, Fengying Guo","doi":"10.2196/76912","DOIUrl":"10.2196/76912","url":null,"abstract":"<p><strong>Background: </strong>The unstructured data of Chinese cancer electronic health records (EHRs) contains valuable medical expertise. Accurate medical entity recognition is crucial for building a medical-assisted decision system. Named entity recognition (NER) in cancer EHRs typically uses general models designed for English medical records. There is a lack of specialized handling for cancer-specific records and limited application to Chinese medical records.</p><p><strong>Objective: </strong>This study aims to propose a specific NER model to enhance the recognition of medical entities in Chinese cancer EHRs.</p><p><strong>Methods: </strong>Desensitized inpatient EHRs related to breast cancer were collected from a leading hospital in Beijing. Building upon the MC Bidirectional Encoder Representations from Transformers (BERT) foundation, the study further incorporated a Chinese cancer corpus for pretraining, resulting in the construction of the ChCancerBERT pretrained model. In conjunction with dilated-gated convolutional neural networks, bidirectional long short-term memory, multihead attention mechanism, and a conditional random field, this model forms a multimodel, multilevel integrated NER approach.</p><p><strong>Results: </strong>This approach effectively extracts medical entity features related to symptoms, signs, tests, treatments, and time in Chinese breast cancer EHRs. The entity recognition performance of the proposed model surpasses that of the baseline model and other models compared in the experiment. The F1-score reached 86.93%, precision reached 87.24%, and recall reached 86.61%. The model introduced in this study demonstrates exceptional performance on the CCKS2019 dataset, attaining a precision rate of 87.26%, a recall rate of 87.27%, and an impressive F1-score of 87.26%, surpassing that of existing models.</p><p><strong>Conclusions: </strong>The experiments demonstrate that the approach proposed in this study exhibits excellent performance in NER within breast cancer EHRs. This advancement will further contribute to clinical decision support for cancer treatment and research. In addition, the study reveals that incorporating domain-specific corpora in clinical NER tasks can further enhance the performance of BERT models in specialized domains.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e76912"},"PeriodicalIF":3.8,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12620309/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145524902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multisource Coherence Analysis of the First European Multicenter Cohort Study for Cancer Prevention in People Experiencing Homelessness: Data Quality Study. 第一项欧洲多中心队列研究对无家可归者癌症预防的多源一致性分析:数据质量研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-14 DOI: 10.2196/73596
Antonio Blasco-Calafat, Vicent Blanes-Selva, Tobias Fragner, Ascensión Doñate-Martínez, Tamara Alhambra-Borrás, Julia Gawronska, Lee Smith, Juan M Garcia-Gomez, Igor Grabovac, Carlos Sáez

Background: Coherence across sites in multicenter datasets is one substantial data quality dimension for reliable health data reuse, as unexpected heterogeneity in data can lead to biases in data analyses and suboptimal generalization of results.

Objective: This work aims to characterize and label the data coherence across sites in the first European multicenter dataset for cancer prevention in people and early detection among the homeless population in Europe: coadapting and implementing the health navigator model. This dataset emerged to enable research to address disparities in health challenges and health care access due to barriers such as unstable housing, limited resources, and social stigma in people experiencing homelessness.

Methods: The dataset comprises 652 cases: 142 from Austria, 158 from Greece, 197 from Spain, and 155 from the United Kingdom. All participants fit classifications from the European Typology of Homelessness and Housing Exclusion. This longitudinal study collected questionnaires at baseline, after 4 weeks, and at the end of the intervention. The 180-question survey covered sociodemographic data, overall health, mental health, empowerment, and interpersonal communication. Data variability was assessed using information theory and geometric methods to analyze discrepancies in distributions and completeness across the dataset.

Results: Substantial variability was observed among the 4 pilot countries, both in the overall analysis and within specific domains. In particular, measures of health care empowerment, quality of life, and interpersonal communication demonstrated the greatest discrepancies among pilot sites, with the exception of the health domain. Notably, Spain exhibited the most pronounced differences, characterized by a high number of missing values related to interpersonal communication and the use of health care services.

Conclusions: Health data may be comparable across the 4 countries; however, substantial differences were observed in the other questionnaires, requiring independent, country-specific analyses. This study underscores the heterogeneity among people experiencing homelessness and the critical need for data quality assessments to inform future research and policymaking in this field.

背景:在多中心数据集中,跨站点的一致性是可靠的卫生数据重用的一个重要数据质量维度,因为数据中的意外异质性可能导致数据分析中的偏差和结果的次优泛化。目的:本工作旨在描述和标记欧洲第一个多中心数据集中各站点的数据一致性,该数据集用于欧洲人的癌症预防和无家可归人口的早期检测:协调和实施健康导航员模型。该数据集的出现是为了使研究能够解决由于住房不稳定、资源有限和无家可归者的社会耻辱等障碍而导致的健康挑战和医疗保健获取方面的差异。方法:数据集包括652例:奥地利142例,希腊158例,西班牙197例,英国155例。所有参与者都符合欧洲无家可归和住房排斥类型学的分类。这项纵向研究在基线、4周后和干预结束时收集问卷。这项180个问题的调查涵盖了社会人口统计数据、整体健康、心理健康、赋权和人际沟通。使用信息理论和几何方法评估数据变异性,以分析数据集分布和完整性的差异。结果:在4个试点国家之间,无论是在总体分析还是在特定领域,都观察到实质性的差异。特别是,在保健赋权、生活质量和人际交往方面的措施显示,各试验点之间的差异最大,但保健领域除外。值得注意的是,西班牙表现出最明显的差异,其特点是与人际交往和保健服务的使用有关的大量价值缺失。结论:这四个国家的卫生数据可能具有可比性;然而,在其他调查表中观察到重大差异,需要独立的、针对具体国家的分析。这项研究强调了无家可归者之间的异质性,以及对数据质量评估的迫切需要,以便为该领域的未来研究和政策制定提供信息。
{"title":"Multisource Coherence Analysis of the First European Multicenter Cohort Study for Cancer Prevention in People Experiencing Homelessness: Data Quality Study.","authors":"Antonio Blasco-Calafat, Vicent Blanes-Selva, Tobias Fragner, Ascensión Doñate-Martínez, Tamara Alhambra-Borrás, Julia Gawronska, Lee Smith, Juan M Garcia-Gomez, Igor Grabovac, Carlos Sáez","doi":"10.2196/73596","DOIUrl":"10.2196/73596","url":null,"abstract":"<p><strong>Background: </strong>Coherence across sites in multicenter datasets is one substantial data quality dimension for reliable health data reuse, as unexpected heterogeneity in data can lead to biases in data analyses and suboptimal generalization of results.</p><p><strong>Objective: </strong>This work aims to characterize and label the data coherence across sites in the first European multicenter dataset for cancer prevention in people and early detection among the homeless population in Europe: coadapting and implementing the health navigator model. This dataset emerged to enable research to address disparities in health challenges and health care access due to barriers such as unstable housing, limited resources, and social stigma in people experiencing homelessness.</p><p><strong>Methods: </strong>The dataset comprises 652 cases: 142 from Austria, 158 from Greece, 197 from Spain, and 155 from the United Kingdom. All participants fit classifications from the European Typology of Homelessness and Housing Exclusion. This longitudinal study collected questionnaires at baseline, after 4 weeks, and at the end of the intervention. The 180-question survey covered sociodemographic data, overall health, mental health, empowerment, and interpersonal communication. Data variability was assessed using information theory and geometric methods to analyze discrepancies in distributions and completeness across the dataset.</p><p><strong>Results: </strong>Substantial variability was observed among the 4 pilot countries, both in the overall analysis and within specific domains. In particular, measures of health care empowerment, quality of life, and interpersonal communication demonstrated the greatest discrepancies among pilot sites, with the exception of the health domain. Notably, Spain exhibited the most pronounced differences, characterized by a high number of missing values related to interpersonal communication and the use of health care services.</p><p><strong>Conclusions: </strong>Health data may be comparable across the 4 countries; however, substantial differences were observed in the other questionnaires, requiring independent, country-specific analyses. This study underscores the heterogeneity among people experiencing homelessness and the critical need for data quality assessments to inform future research and policymaking in this field.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e73596"},"PeriodicalIF":3.8,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12663700/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145515129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Large Language Models With AI Agents for Chronic Gastritis Management: Comprehensive Comparative Study. 用人工智能智能体增强大型语言模型用于慢性胃炎管理:综合比较研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-13 DOI: 10.2196/73857
Shurui Wang, Qing Ye
<p><strong>Background: </strong>The prevalence of chronic gastritis is high, and if not intervened in a timely manner, it may eventually lead to gastric cancer. Managing chronic gastritis essentially requires comprehensive lifestyle changes. However, the current health care environment does not support continuous follow-up by professional health care providers, making self-management a key component of postdiagnosis care. Increasingly, researchers are exploring the use of large language models (LLMs) for patient management. However, LLMs have limitations, including hallucinations, limited knowledge scope, and lack of timeliness. Artificial intelligence (AI) agents may provide a more effective solution. Nevertheless, it remains uncertain whether AI agents can effectively support postdiagnosis self-management for patients with chronic gastritis.</p><p><strong>Objective: </strong>The purpose of this study was to explore the effectiveness of AI agents in the postdiagnosis management of patients with chronic gastritis from different perspectives.</p><p><strong>Methods: </strong>In this study, we developed an agent framework for the health management of patients with chronic gastritis based on LLMs in conjunction with retrieval-augmented generation and a search engine tool. We collected real questions from patients with chronic gastritis in clinical settings and tested the framework's performance across different difficulty levels and scenarios. We analyzed its safety and robustness and compared it with state-of-the-art models to comprehensively evaluate its effectiveness.</p><p><strong>Results: </strong>Using a dual-evaluation framework comprising automated metrics and expert manual assessments, our results demonstrated that AI agents substantially outperformed LLMs in addressing high-complexity questions (embedding average score: 82.849 for AI agents vs 77.825 for LLMs) and were particularly effective in clinical consultation tasks. Clinical evaluation of safety based on a 5-point Likert scale by physicians indicated that the safety of the agents was 4.98 (SD 0.15; 95% CI 4.96-4.99). After 30 repeated experiments, the mean absolute deviation of the AI agents in the embedding average score and BERTScore metrics were 0.0167 and 0.0387, respectively. Therefore, the safety and robustness analysis confirmed that the AI agents can produce safe, stable, and minimally variable responses. In addition, comparative results with those of advanced medical-domain LLMs (Baichuan-14B-M1 and MedGemma-27B) and general-domain LLMs (Qwen3-32B) also demonstrated that the AI agents in this study performed outstandingly in the field of chronic gastritis. Our findings underscore the superior reliability, interpretability, and practical applicability of AI agents over conventional LLMs in chronic gastritis management, offering a robust foundation for their broader adoption in health care settings.</p><p><strong>Conclusions: </strong>AI agents based on LLMs have high applicat
背景:慢性胃炎患病率高,如果不及时干预,最终可能导致胃癌。治疗慢性胃炎基本上需要全面改变生活方式。然而,目前的卫生保健环境不支持专业卫生保健提供者的持续随访,使自我管理成为诊断后护理的关键组成部分。越来越多的研究人员正在探索使用大型语言模型(llm)进行患者管理。然而,法学硕士也有局限性,包括幻觉、知识范围有限、缺乏时效性。人工智能(AI)代理可能会提供更有效的解决方案。然而,人工智能制剂能否有效支持慢性胃炎患者的诊断后自我管理尚不确定。目的:本研究旨在从不同角度探讨人工智能药物在慢性胃炎患者诊断后管理中的有效性。方法:在本研究中,我们开发了一个基于LLMs的慢性胃炎患者健康管理代理框架,并结合检索增强生成和搜索引擎工具。我们收集了临床慢性胃炎患者的真实问题,并测试了该框架在不同难度水平和场景下的性能。我们分析了其安全性和鲁棒性,并将其与最先进的模型进行了比较,以综合评价其有效性。结果:使用由自动指标和专家手动评估组成的双重评估框架,我们的结果表明,人工智能代理在解决高复杂性问题方面的表现明显优于法学硕士(人工智能代理的嵌入平均分:82.849比法学硕士的77.825),并且在临床咨询任务中特别有效。医生基于5点Likert量表的临床安全性评价表明,药物的安全性为4.98 (SD 0.15; 95% CI 4.96-4.99)。经过30次重复实验,人工智能主体在嵌入平均分和BERTScore指标上的平均绝对偏差分别为0.0167和0.0387。因此,安全性和鲁棒性分析证实了AI代理可以产生安全、稳定和最小变量的响应。此外,与先进医学域LLMs(百川- 14b - m1和MedGemma-27B)和通用域LLMs (Qwen3-32B)的对比结果也表明,本研究中的AI agent在慢性胃炎领域表现突出。我们的研究结果强调了人工智能代理在慢性胃炎管理方面优于传统llm的可靠性、可解释性和实用性,为其在医疗保健领域的广泛应用提供了坚实的基础。结论:基于llm的人工智能药物在慢性胃炎的治疗中具有较高的应用价值。他们可以有效地指导慢性病患者解决常见问题,这可能会减少医生的工作量,提高患者家庭护理的质量。
{"title":"Enhancing Large Language Models With AI Agents for Chronic Gastritis Management: Comprehensive Comparative Study.","authors":"Shurui Wang, Qing Ye","doi":"10.2196/73857","DOIUrl":"10.2196/73857","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;The prevalence of chronic gastritis is high, and if not intervened in a timely manner, it may eventually lead to gastric cancer. Managing chronic gastritis essentially requires comprehensive lifestyle changes. However, the current health care environment does not support continuous follow-up by professional health care providers, making self-management a key component of postdiagnosis care. Increasingly, researchers are exploring the use of large language models (LLMs) for patient management. However, LLMs have limitations, including hallucinations, limited knowledge scope, and lack of timeliness. Artificial intelligence (AI) agents may provide a more effective solution. Nevertheless, it remains uncertain whether AI agents can effectively support postdiagnosis self-management for patients with chronic gastritis.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;The purpose of this study was to explore the effectiveness of AI agents in the postdiagnosis management of patients with chronic gastritis from different perspectives.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;In this study, we developed an agent framework for the health management of patients with chronic gastritis based on LLMs in conjunction with retrieval-augmented generation and a search engine tool. We collected real questions from patients with chronic gastritis in clinical settings and tested the framework's performance across different difficulty levels and scenarios. We analyzed its safety and robustness and compared it with state-of-the-art models to comprehensively evaluate its effectiveness.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Using a dual-evaluation framework comprising automated metrics and expert manual assessments, our results demonstrated that AI agents substantially outperformed LLMs in addressing high-complexity questions (embedding average score: 82.849 for AI agents vs 77.825 for LLMs) and were particularly effective in clinical consultation tasks. Clinical evaluation of safety based on a 5-point Likert scale by physicians indicated that the safety of the agents was 4.98 (SD 0.15; 95% CI 4.96-4.99). After 30 repeated experiments, the mean absolute deviation of the AI agents in the embedding average score and BERTScore metrics were 0.0167 and 0.0387, respectively. Therefore, the safety and robustness analysis confirmed that the AI agents can produce safe, stable, and minimally variable responses. In addition, comparative results with those of advanced medical-domain LLMs (Baichuan-14B-M1 and MedGemma-27B) and general-domain LLMs (Qwen3-32B) also demonstrated that the AI agents in this study performed outstandingly in the field of chronic gastritis. Our findings underscore the superior reliability, interpretability, and practical applicability of AI agents over conventional LLMs in chronic gastritis management, offering a robust foundation for their broader adoption in health care settings.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;AI agents based on LLMs have high applicat","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e73857"},"PeriodicalIF":3.8,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12617012/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145515161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating 10-Year Cardiovascular Disease Risk in Primary Prevention Using UK Electronic Health Records and a Hybrid Multitask BERT Model: Retrospective Cohort Study. 使用英国电子健康记录和混合多任务BERT模型估计初级预防10年心血管疾病风险:回顾性队列研究
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-13 DOI: 10.2196/76659
Tianyi Liu, Lei Lu, Yanzhong Wang, Andrew J Krentz, Vasa Curcin
<p><strong>Background: </strong>Cardiovascular disease (CVD) remains a leading cause of preventable morbidity and mortality, highlighting the need for early risk stratification in primary prevention. Traditional Cox models assume proportional hazards and linear effects, limiting flexibility. While machine learning offers greater expressiveness, many models rely solely on structured data and overlook time-to-event (TTE) information. Integrating structured and textual representations may enhance prediction and support equitable assessment across clinical subgroups.</p><p><strong>Objective: </strong>This study aims to develop a hybrid multitask deep learning model (MT-BERT [multitask Bidirectional Encoder Representations from Transformers]) integrating structured and textual features from electronic health records (EHRs) to predict 10-year CVD risk, enhancing individualized stratification and supporting equitable assessment across diverse demographic groups.</p><p><strong>Methods: </strong>We used data from Clinical Practice Research Datalink (CPRD) Aurum comprising 469,496 patients aged 40-85 years to develop MT-BERT for 10-year CVD risk prediction. Structured EHR variables and their corresponding textual representations were jointly encoded using a multilayer perceptron and a distilled version of the BERT model (DistilBERT), respectively. A fusion layer and stacked multihead attention modules enabled cross-modal interaction modeling. The model generated both binary classification outputs and TTE risk scores, optimized using a custom FocalCoxLoss function with uncertainty-based weighting. Prediction targets encompassed composite and individual CVD outcomes. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), concordance index, and Brier score, with subgroup analyses by ethnicity and deprivation, and heterogeneity assessed using Higgins I² and Cochran Q statistics. Generalizability was assessed via external validation in a held-out London cohort.</p><p><strong>Results: </strong>The MT-BERT model yielded AUROC values of 0.744 (95% CI 0.738-0.749) in males and 0.782 (95% CI 0.768-0.796) in females on the test set (n=711,052), and 0.736 (95% CI 0.729-0.741) and 0.775 (95% CI 0.768-0.780), respectively in "spatial external" validation (n=144,370). Brier scores were 0.130 in males and 0.091 in females. Individuals classified as high-risk (≥40% risk in males and ≥34% in females) demonstrated significantly reduced 10-year event-free survival relative to lower-risk individuals (log-rank P<.001). Model performance was consistently higher in females across all metrics. Subgroup analyses revealed substantial heterogeneity across ethnicity and deprivation (I²>70%), especially among males, with lower AUROC in South Asian and Black ethnic groups. These findings reflect variation in model performance across demographic groups while supporting its applicability to large-scale CVD risk stratification.</p><p><stro
背景:心血管疾病(CVD)仍然是可预防的发病率和死亡率的主要原因,突出了在初级预防中进行早期风险分层的必要性。传统的Cox模型假设风险成比例和线性效应,限制了灵活性。虽然机器学习提供了更强的表达能力,但许多模型仅依赖于结构化数据,而忽略了时间到事件(TTE)信息。整合结构化和文本表示可以增强预测并支持跨临床亚组的公平评估。目的:本研究旨在开发一种混合多任务深度学习模型(MT-BERT[多任务双向编码器表示来自变压器]),整合电子健康记录(EHRs)的结构化和文本特征来预测10年心血管疾病风险,增强个性化分层并支持不同人口群体的公平评估。方法:我们使用临床实践研究数据链(CPRD) Aurum的数据,包括469,496名年龄在40-85岁之间的患者,开发MT-BERT用于10年心血管疾病风险预测。结构化EHR变量及其相应的文本表示分别使用多层感知器和BERT模型的蒸馏版本(蒸馏BERT)进行联合编码。融合层和堆叠的多头注意模块实现了跨模态交互建模。该模型生成二元分类输出和TTE风险评分,并使用基于不确定性加权的自定义FocalCoxLoss函数进行优化。预测目标包括复合和个体CVD结果。采用受试者工作特征曲线下面积(AUROC)、一致性指数和Brier评分评估模型性能,并采用种族和剥夺亚组分析,采用Higgins I²和Cochran Q统计评估异质性。概括性是通过外部验证在一个伦敦的队列评估。结果:MT-BERT模型在测试集(n=711,052)上,男性的AUROC值为0.744 (95% CI 0.738-0.749),女性的AUROC值为0.782 (95% CI 0.768-0.796),在“空间外部”验证(n=144,370)中,其AUROC值分别为0.736 (95% CI 0.729-0.741)和0.775 (95% CI 0.768-0.780)。男性的Brier评分为0.130,女性为0.091。高危人群(男性风险≥40%,女性风险≥34%)的10年无事件生存率明显低于低危人群(log-rank P70%),尤其是男性,南亚和黑人群体的AUROC较低。这些发现反映了模型在不同人口群体中的表现差异,同时支持了其对大规模心血管疾病风险分层的适用性。结论:提出的混合MT-BERT模型通过整合来自电子病历的结构化变量和非结构化临床文本来预测初级预防的10年心血管疾病风险。它的多任务设计有利于个性化风险分层和TTE估计。虽然贫困和少数族裔亚群体的表现略有下降,但这些发现为在日益多样化的卫生保健环境中推进公平意识、数据驱动的预防策略提供了初步支持。
{"title":"Estimating 10-Year Cardiovascular Disease Risk in Primary Prevention Using UK Electronic Health Records and a Hybrid Multitask BERT Model: Retrospective Cohort Study.","authors":"Tianyi Liu, Lei Lu, Yanzhong Wang, Andrew J Krentz, Vasa Curcin","doi":"10.2196/76659","DOIUrl":"10.2196/76659","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Cardiovascular disease (CVD) remains a leading cause of preventable morbidity and mortality, highlighting the need for early risk stratification in primary prevention. Traditional Cox models assume proportional hazards and linear effects, limiting flexibility. While machine learning offers greater expressiveness, many models rely solely on structured data and overlook time-to-event (TTE) information. Integrating structured and textual representations may enhance prediction and support equitable assessment across clinical subgroups.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aims to develop a hybrid multitask deep learning model (MT-BERT [multitask Bidirectional Encoder Representations from Transformers]) integrating structured and textual features from electronic health records (EHRs) to predict 10-year CVD risk, enhancing individualized stratification and supporting equitable assessment across diverse demographic groups.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We used data from Clinical Practice Research Datalink (CPRD) Aurum comprising 469,496 patients aged 40-85 years to develop MT-BERT for 10-year CVD risk prediction. Structured EHR variables and their corresponding textual representations were jointly encoded using a multilayer perceptron and a distilled version of the BERT model (DistilBERT), respectively. A fusion layer and stacked multihead attention modules enabled cross-modal interaction modeling. The model generated both binary classification outputs and TTE risk scores, optimized using a custom FocalCoxLoss function with uncertainty-based weighting. Prediction targets encompassed composite and individual CVD outcomes. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), concordance index, and Brier score, with subgroup analyses by ethnicity and deprivation, and heterogeneity assessed using Higgins I² and Cochran Q statistics. Generalizability was assessed via external validation in a held-out London cohort.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;The MT-BERT model yielded AUROC values of 0.744 (95% CI 0.738-0.749) in males and 0.782 (95% CI 0.768-0.796) in females on the test set (n=711,052), and 0.736 (95% CI 0.729-0.741) and 0.775 (95% CI 0.768-0.780), respectively in \"spatial external\" validation (n=144,370). Brier scores were 0.130 in males and 0.091 in females. Individuals classified as high-risk (≥40% risk in males and ≥34% in females) demonstrated significantly reduced 10-year event-free survival relative to lower-risk individuals (log-rank P&lt;.001). Model performance was consistently higher in females across all metrics. Subgroup analyses revealed substantial heterogeneity across ethnicity and deprivation (I²&gt;70%), especially among males, with lower AUROC in South Asian and Black ethnic groups. These findings reflect variation in model performance across demographic groups while supporting its applicability to large-scale CVD risk stratification.&lt;/p&gt;&lt;p&gt;&lt;stro","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e76659"},"PeriodicalIF":3.8,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12620595/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145515121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1