首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
Simulation of Clinical Visits as a Novel Approach to Evaluate Digital Health in Multiple Sclerosis: Simulation Study. 模拟临床访问作为评估多发性硬化症数字健康的新方法:模拟研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-11 DOI: 10.2196/67845
Riley Bove, Luca Capezzuto, Imogen West, Simon Dryden, Saira Ghafur, Jack Halligan, Stanislas Hubeaux, Agne Kazlauskaite
<p><strong>Background: </strong>Efforts are being made to integrate digital health technologies into clinical care for multiple sclerosis (MS) to improve patient monitoring. Efficiently probing how they might impact clinical care could streamline digital tool development. The Floodlight digital tool, comprising 5 smartphone sensor-based tests, was used to generate health-related data on patient function and symptoms in a clinical simulation.</p><p><strong>Objective: </strong>The study had 3 objectives: (1) assess the utility of simulated clinical encounters as a research methodology for exploring the introduction of digital health technologies into clinical practice in MS, (2) confirm the fidelity of the simulated environment and patient cases developed and understand what metrics (eg, workflow, comprehensive evaluation) could be generated, and (3) generate insights into the utility of digitally collected data, including usability, clinical decision contribution, and impact on workflows, in clinical practice.</p><p><strong>Methods: </strong>A total of 2 patient cases consisting of clinical, radiological, and digital health data were developed with clinician input. US-based neurologists prepared for and conducted 2 simulated teleconsultations each, with an actor briefed on case profiles. Floodlight data were available, via the Floodlight MS™ Health Care Professional Portal, for 1 of the 2 consultations. Participant neurologists completed interviews and surveys assessing the fidelity of the cases presented, user experience and workflow metrics, patient concerns identified, care decisions made, and confidence in making decisions.</p><p><strong>Results: </strong>All 10 neurologists indicated that the simulations were high-fidelity representations of real consultations. Using the Floodlight technology for the first time, median time taken to prepare for and conduct the consultation was ~1.7-2 minutes longer, with slightly greater mental effort reported by participants, compared with not using the tool. The Floodlight MS Health Care Professional Portal scored an "above average" 79 on the System Usability Scale and an "acceptable" Net Promoter Score of 10. In total, 6 of the 10 neurologists "strongly agreed" that it was easier and quicker to identify patient concerns when they had access to the patient-generated Floodlight data to prepare for their encounters than when they did not. Overall, more care and management decisions were taken when the digital tool was used (37 vs 29). Of those 37 decisions, Floodlight data were reported as a trigger for 20 decisions, always in combination with other elements including patient history (20/20) and clinical exam findings (9/20).</p><p><strong>Conclusions: </strong>These findings advance our understanding of clinical simulation as a method for evaluating digital tools and other innovative technologies for MS care. High-fidelity patient cases could be provided for the mock teleconsultations, and the simulated clin
背景:人们正在努力将数字健康技术整合到多发性硬化症(MS)的临床护理中,以改善患者的监测。有效地探索它们如何影响临床护理可以简化数字工具的开发。泛光灯数字工具包括5个基于智能手机传感器的测试,用于在临床模拟中生成有关患者功能和症状的健康相关数据。目的:本研究有3个目的:(1)评估模拟临床接触的效用,作为探索将数字健康技术引入MS临床实践的研究方法;(2)确认模拟环境和患者病例的保真度,并了解可以产生哪些指标(例如,工作流,综合评估);(3)对数字收集数据的效用产生见解,包括可用性、临床决策贡献和对工作流程的影响。临床实践中。方法:在临床医生的输入下,编制2例患者的临床、放射学和数字健康资料。美国的神经学家准备并进行了两次模拟远程会诊,并向一名演员简要介绍了病例概况。2次咨询中的1次可通过Floodlight MS™医疗保健专业门户获得泛光灯数据。参与的神经科医生完成了访谈和调查,评估了所呈现病例的保真度、用户体验和工作流程指标、确定的患者关注点、做出的护理决策以及做出决策的信心。结果:所有10名神经科医生表示,模拟是真实咨询的高保真表现。第一次使用泛光灯技术,准备和进行咨询的平均时间约为1.7-2分钟,与不使用该工具相比,参与者报告的心理努力略大。泛光灯MS医疗保健专业门户网站在系统可用性量表上获得了79分“高于平均水平”,净推荐值为10分,“可接受”。总的来说,10位神经科医生中有6位“强烈同意”,当他们可以访问患者生成的泛光灯数据以为他们的遭遇做准备时,比没有访问时更容易、更快地识别患者的担忧。总体而言,当使用数字工具时,采取了更多的谨慎和管理决策(37 vs 29)。在这37项决策中,泛光灯数据被报告为20项决策的触发因素,这些决策总是与其他因素相结合,包括患者病史(20/20)和临床检查结果(9/20)。结论:这些发现促进了我们对临床模拟作为评估MS护理的数字工具和其他创新技术的方法的理解。可以为模拟远程会诊提供高保真的患者病例,模拟临床环境有助于评估新的数字工具的可用性和实用性,从而为神经科医生如何访问和利用数字数据来支持常规MS护理提供初步证据。
{"title":"Simulation of Clinical Visits as a Novel Approach to Evaluate Digital Health in Multiple Sclerosis: Simulation Study.","authors":"Riley Bove, Luca Capezzuto, Imogen West, Simon Dryden, Saira Ghafur, Jack Halligan, Stanislas Hubeaux, Agne Kazlauskaite","doi":"10.2196/67845","DOIUrl":"10.2196/67845","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Efforts are being made to integrate digital health technologies into clinical care for multiple sclerosis (MS) to improve patient monitoring. Efficiently probing how they might impact clinical care could streamline digital tool development. The Floodlight digital tool, comprising 5 smartphone sensor-based tests, was used to generate health-related data on patient function and symptoms in a clinical simulation.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;The study had 3 objectives: (1) assess the utility of simulated clinical encounters as a research methodology for exploring the introduction of digital health technologies into clinical practice in MS, (2) confirm the fidelity of the simulated environment and patient cases developed and understand what metrics (eg, workflow, comprehensive evaluation) could be generated, and (3) generate insights into the utility of digitally collected data, including usability, clinical decision contribution, and impact on workflows, in clinical practice.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;A total of 2 patient cases consisting of clinical, radiological, and digital health data were developed with clinician input. US-based neurologists prepared for and conducted 2 simulated teleconsultations each, with an actor briefed on case profiles. Floodlight data were available, via the Floodlight MS™ Health Care Professional Portal, for 1 of the 2 consultations. Participant neurologists completed interviews and surveys assessing the fidelity of the cases presented, user experience and workflow metrics, patient concerns identified, care decisions made, and confidence in making decisions.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;All 10 neurologists indicated that the simulations were high-fidelity representations of real consultations. Using the Floodlight technology for the first time, median time taken to prepare for and conduct the consultation was ~1.7-2 minutes longer, with slightly greater mental effort reported by participants, compared with not using the tool. The Floodlight MS Health Care Professional Portal scored an \"above average\" 79 on the System Usability Scale and an \"acceptable\" Net Promoter Score of 10. In total, 6 of the 10 neurologists \"strongly agreed\" that it was easier and quicker to identify patient concerns when they had access to the patient-generated Floodlight data to prepare for their encounters than when they did not. Overall, more care and management decisions were taken when the digital tool was used (37 vs 29). Of those 37 decisions, Floodlight data were reported as a trigger for 20 decisions, always in combination with other elements including patient history (20/20) and clinical exam findings (9/20).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;These findings advance our understanding of clinical simulation as a method for evaluating digital tools and other innovative technologies for MS care. High-fidelity patient cases could be provided for the mock teleconsultations, and the simulated clin","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e67845"},"PeriodicalIF":3.8,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12697915/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Intelligent Trial Eligibility Screening Tool Using Natural Language Processing With a Block-Based Visual Programming Interface: Development and Usability Study. 使用自然语言处理和基于块的可视化编程界面的智能试验资格筛选工具:开发和可用性研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-11 DOI: 10.2196/80072
Ya-Han Hu, Yi-Ying Cheng, Chung-Ching Lan, Yu-Hsiang Su, Sheng-Feng Sung
<p><strong>Background: </strong>Clinical trial eligibility screening using electronic medical records (EMRs) is challenging due to the complexity of patient data and the varied clinical terminologies. Manual screening is time-consuming, requires specialized knowledge, and can lead to inconsistent participant selection, potentially compromising patient safety and research outcomes. This is critical in time-sensitive conditions like acute ischemic stroke. While computerized clinical decision support tools offer solutions, most require software engineering expertise to update, limiting their practical utility when eligibility criteria change.</p><p><strong>Objective: </strong>We developed and evaluated the intelligent trial eligibility screening tool (iTEST), which combines natural language processing with a block-based visual programming interface designed to enable clinicians to create and modify eligibility screening rules independently. In this study, we assessed iTEST's rule evaluation module using pre-configured rules and compared its effectiveness with that of standard EMR interfaces.</p><p><strong>Methods: </strong>We conducted an experiment at a tertiary teaching hospital in Taiwan with 12 clinicians using a 2-period crossover design. The clinicians assessed the eligibility of 4 patients with stroke for 2 clinical trials using both standard EMR and iTEST in a counterbalanced order, resulting in 48 evaluation scenarios. The iTEST comprised a rule authoring module using Google Blockly and a rule evaluation module utilizing MetaMap Lite for extracting medical concepts from unstructured EMR documents and structured laboratory data. Primary outcomes included accuracy in determining eligibility. Secondary outcomes measured task completion time, cognitive workload using the National Aeronautics and Space Administration Task Load Index scale (range 0-100, with lower scores indicating a lower cognitive workload), and system usability through the system usability scale (range: 0-100, with higher scores indicating higher system usability).</p><p><strong>Results: </strong>The iTEST significantly improved accuracy scores (from 0.91 to 1.00, P<.001) and reduced completion time (from 3.18 to 2.44 min, P=.004) compared to the standard EMR interface. Users reported lower cognitive workload (National Aeronautics and Space Administration Task Load Index scale, 39.7 vs 62.8, P=.02) and higher system usability scale scores (71.3 vs 46.3, P=.01) with the iTEST. Particularly notable improvements in perceived cognitive workload were observed in temporal demand, effort, and frustration levels.</p><p><strong>Conclusions: </strong>The iTEST demonstrated superior performance in clinical trial eligibility screening, delivering improved accuracy, reduced completion time, lower cognitive workload, and better usability when evaluating preconfigured eligibility rules. The improved accuracy is critical for patient safety, as the misidentification of eligibility criteria cou
背景:由于患者数据的复杂性和临床术语的多样性,使用电子病历(EMRs)进行临床试验资格筛选具有挑战性。人工筛查耗时,需要专业知识,并可能导致参与者选择不一致,可能危及患者安全和研究结果。这在急性缺血性中风等对时间敏感的情况下至关重要。虽然计算机化的临床决策支持工具提供了解决方案,但大多数需要软件工程专业知识来更新,当资格标准发生变化时,限制了它们的实际效用。目的:我们开发并评估了智能试验资格筛选工具(iTEST),该工具将自然语言处理与基于块的可视化编程界面相结合,旨在使临床医生能够独立创建和修改资格筛选规则。在本研究中,我们使用预配置的规则评估了iTEST的规则评估模块,并将其与标准EMR接口的有效性进行了比较。方法:采用2期交叉设计,在台湾某三级教学医院对12名临床医生进行实验。临床医生使用标准EMR和iTEST以平衡顺序评估4例卒中患者参加2项临床试验的资格,产生48个评估方案。iTEST包括一个使用谷歌block的规则编写模块和一个使用MetaMap Lite的规则评估模块,用于从非结构化EMR文档和结构化实验室数据中提取医学概念。主要结局包括确定资格的准确性。次要结果测量任务完成时间,使用美国国家航空航天局任务负荷指数量表(范围0-100,分数越低表明认知负荷越低)的认知负荷,以及通过系统可用性量表(范围:0-100,分数越高表明系统可用性越高)的系统可用性。结果:iTEST显著提高了准确性得分(从0.91提高到1.00)。结论:iTEST在临床试验资格筛选中表现出优异的性能,在评估预配置的资格规则时,提高了准确性,缩短了完成时间,降低了认知工作量,并且更好的可用性。准确性的提高对患者安全至关重要,因为对资格标准的错误识别可能使患者接受不适当的治疗或将他们排除在有益的试验之外。iTEST处理结构化和非结构化数据的适应性和能力使其对时间敏感的场景和不断发展的研究方案很有价值。未来的研究应该评估临床医生使用基于块的编写界面创建和修改资格规则的能力,以及在不同类型的临床试验和卫生保健环境中评估iTEST。
{"title":"An Intelligent Trial Eligibility Screening Tool Using Natural Language Processing With a Block-Based Visual Programming Interface: Development and Usability Study.","authors":"Ya-Han Hu, Yi-Ying Cheng, Chung-Ching Lan, Yu-Hsiang Su, Sheng-Feng Sung","doi":"10.2196/80072","DOIUrl":"10.2196/80072","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Clinical trial eligibility screening using electronic medical records (EMRs) is challenging due to the complexity of patient data and the varied clinical terminologies. Manual screening is time-consuming, requires specialized knowledge, and can lead to inconsistent participant selection, potentially compromising patient safety and research outcomes. This is critical in time-sensitive conditions like acute ischemic stroke. While computerized clinical decision support tools offer solutions, most require software engineering expertise to update, limiting their practical utility when eligibility criteria change.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;We developed and evaluated the intelligent trial eligibility screening tool (iTEST), which combines natural language processing with a block-based visual programming interface designed to enable clinicians to create and modify eligibility screening rules independently. In this study, we assessed iTEST's rule evaluation module using pre-configured rules and compared its effectiveness with that of standard EMR interfaces.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We conducted an experiment at a tertiary teaching hospital in Taiwan with 12 clinicians using a 2-period crossover design. The clinicians assessed the eligibility of 4 patients with stroke for 2 clinical trials using both standard EMR and iTEST in a counterbalanced order, resulting in 48 evaluation scenarios. The iTEST comprised a rule authoring module using Google Blockly and a rule evaluation module utilizing MetaMap Lite for extracting medical concepts from unstructured EMR documents and structured laboratory data. Primary outcomes included accuracy in determining eligibility. Secondary outcomes measured task completion time, cognitive workload using the National Aeronautics and Space Administration Task Load Index scale (range 0-100, with lower scores indicating a lower cognitive workload), and system usability through the system usability scale (range: 0-100, with higher scores indicating higher system usability).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;The iTEST significantly improved accuracy scores (from 0.91 to 1.00, P&lt;.001) and reduced completion time (from 3.18 to 2.44 min, P=.004) compared to the standard EMR interface. Users reported lower cognitive workload (National Aeronautics and Space Administration Task Load Index scale, 39.7 vs 62.8, P=.02) and higher system usability scale scores (71.3 vs 46.3, P=.01) with the iTEST. Particularly notable improvements in perceived cognitive workload were observed in temporal demand, effort, and frustration levels.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;The iTEST demonstrated superior performance in clinical trial eligibility screening, delivering improved accuracy, reduced completion time, lower cognitive workload, and better usability when evaluating preconfigured eligibility rules. The improved accuracy is critical for patient safety, as the misidentification of eligibility criteria cou","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e80072"},"PeriodicalIF":3.8,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12698033/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cost-Benefit Analysis of Preventing Acute Care Use in Oncology Patients Following Systemic Therapy Using Medicare Claims Data: Retrospective Cohort Study. 利用医疗保险索赔数据预防肿瘤患者接受全身治疗后使用急性护理的成本效益分析:回顾性队列研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-11 DOI: 10.2196/77891
Sara Alessandra Keller, Maximilian Schuessler, Behzad Naderalvojoud, Tina Seto, Lu Tian, Mohana Roy, Tina Hernandez-Boussard

Background: Acute care use (ACU) represents a major economic burden in oncology, which can ideally be prevented. Existing models effectively predict such events.

Objective: We aimed to quantify the cost savings achieved by implementing a model to predict ACU in oncology patients undergoing systemic therapy.

Methods: This retrospective cohort study analyzed patients with cancer at an academic medical center from 2010 to 2022. We included patients who received systemic therapy and identified ACU events occurring after treatment initiation, excluding those with known death dates within the study period. Data on ACU-related expenses were gathered from Medicare claims and mapped to service codes in electronic health records, yielding average daily costs for each patient over 180 days following the start of therapy. The exposure was an ACU event.

Results: The main outcome was the average daily cost per patient at the end of the first 180 days of systemic therapy. We observed that expense accumulation flattened earlier and more rapidly among non-ACU patients. This study included 20,556 patients, of whom 3820 (18.58%) experienced at least 1 ACU. The average daily cost per patient for those with and without ACU was US $94.62 (SD US $72.54; 95% CI US $92.32-$96.92) and US $53.28 (SD US $59.92; 95% CI US $52.37-$54.19), respectively. The average total cost per ACU and non-ACU patient was US $17,031.92 (SD US $13,056.63; 95% CI US $16,616.74-$17,445.09) and US $9591.06 (SD US $10,785.83; 95% CI US $9427.64-$9754.48), respectively. To estimate the long-term financial impact of deploying the predictive model, we conducted a cost-benefit analysis based on an annual cohort size of 2177 patients. In the first year alone, the model yielded projected savings of US $910,000. By year 6, projected savings grew to US $9.46 million annually. The cumulative avoided costs over a 6-year deployment period totaled approximately US $31.11 million. These estimates compared the baseline cost model to the intervention model assuming a prevention rate of 35% for preventable ACU events and an average ACU cost of US $17,031.92 (SD US $13,037).

Conclusions: Predictive analytics can significantly reduce costs associated with ACU events, enhancing economic efficiency in cancer care. Further research is needed to explore potential health benefits.

背景:急性护理使用(ACU)是肿瘤学的主要经济负担,理想情况下是可以预防的。现有的模型可以有效地预测这类事件。目的:我们旨在量化通过实施预测肿瘤患者接受全身治疗的ACU模型所实现的成本节约。方法:本回顾性队列研究分析了2010年至2022年在某学术医疗中心就诊的癌症患者。我们纳入了接受全身治疗并确定在治疗开始后发生ACU事件的患者,排除了研究期间已知死亡日期的患者。从医疗保险索赔中收集了与acu相关的费用数据,并将其映射到电子健康记录中的服务代码中,得出每位患者在治疗开始后180天内的平均每日费用。暴露是ACU造成的。结果:主要观察指标为每位患者在前180天全身治疗结束时的平均每日费用。我们观察到,在非acu患者中,费用积累更早、更快地趋于平缓。本研究纳入20,556例患者,其中3820例(18.58%)经历过至少1次ACU。有无ACU患者的平均每日费用分别为94.62美元(SD $72.54; 95% CI为92.32- 96.92美元)和53.28美元(SD $59.92; 95% CI为52.37- 54.19美元)。ACU和非ACU患者的平均总成本分别为17031.92美元(标准差13,056.63美元;95% CI 16,616.74美元- 17,445.09美元)和9591.06美元(标准差10,785.83美元;95% CI 9427.64美元- 9754.48美元)。为了评估部署预测模型的长期财务影响,我们基于每年2177例患者的队列规模进行了成本效益分析。仅在第一年,该模式就预计节省了91万美元。到第6年,预计每年节余增加到946万美元。在6年的部署期间,累计避免的费用总计约为3111万美元。这些估计将基线成本模型与干预模型进行了比较,假设可预防的ACU事件的预防率为35%,平均ACU成本为17031.92美元(SD $13,037)。结论:预测分析可以显著降低与ACU事件相关的成本,提高癌症治疗的经济效率。需要进一步的研究来探索其潜在的健康益处。
{"title":"Cost-Benefit Analysis of Preventing Acute Care Use in Oncology Patients Following Systemic Therapy Using Medicare Claims Data: Retrospective Cohort Study.","authors":"Sara Alessandra Keller, Maximilian Schuessler, Behzad Naderalvojoud, Tina Seto, Lu Tian, Mohana Roy, Tina Hernandez-Boussard","doi":"10.2196/77891","DOIUrl":"10.2196/77891","url":null,"abstract":"<p><strong>Background: </strong>Acute care use (ACU) represents a major economic burden in oncology, which can ideally be prevented. Existing models effectively predict such events.</p><p><strong>Objective: </strong>We aimed to quantify the cost savings achieved by implementing a model to predict ACU in oncology patients undergoing systemic therapy.</p><p><strong>Methods: </strong>This retrospective cohort study analyzed patients with cancer at an academic medical center from 2010 to 2022. We included patients who received systemic therapy and identified ACU events occurring after treatment initiation, excluding those with known death dates within the study period. Data on ACU-related expenses were gathered from Medicare claims and mapped to service codes in electronic health records, yielding average daily costs for each patient over 180 days following the start of therapy. The exposure was an ACU event.</p><p><strong>Results: </strong>The main outcome was the average daily cost per patient at the end of the first 180 days of systemic therapy. We observed that expense accumulation flattened earlier and more rapidly among non-ACU patients. This study included 20,556 patients, of whom 3820 (18.58%) experienced at least 1 ACU. The average daily cost per patient for those with and without ACU was US $94.62 (SD US $72.54; 95% CI US $92.32-$96.92) and US $53.28 (SD US $59.92; 95% CI US $52.37-$54.19), respectively. The average total cost per ACU and non-ACU patient was US $17,031.92 (SD US $13,056.63; 95% CI US $16,616.74-$17,445.09) and US $9591.06 (SD US $10,785.83; 95% CI US $9427.64-$9754.48), respectively. To estimate the long-term financial impact of deploying the predictive model, we conducted a cost-benefit analysis based on an annual cohort size of 2177 patients. In the first year alone, the model yielded projected savings of US $910,000. By year 6, projected savings grew to US $9.46 million annually. The cumulative avoided costs over a 6-year deployment period totaled approximately US $31.11 million. These estimates compared the baseline cost model to the intervention model assuming a prevention rate of 35% for preventable ACU events and an average ACU cost of US $17,031.92 (SD US $13,037).</p><p><strong>Conclusions: </strong>Predictive analytics can significantly reduce costs associated with ACU events, enhancing economic efficiency in cancer care. Further research is needed to explore potential health benefits.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e77891"},"PeriodicalIF":3.8,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12698071/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trends and Trajectories in the Rise of Large Language Models in Radiology: Scoping Review. 放射学中大型语言模型兴起的趋势和轨迹:范围综述。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-09 DOI: 10.2196/78041
Adhari Al Zaabi, Rashid Alshibli, Abdullah AlAmri, Ibrahim AlRuheili, Syaheerah Lebai Lutfi

Background: The use of large language models (LLMs) in radiology is expanding rapidly, offering new possibilities in report generation, decision support, and workflow optimization. However, a comprehensive evaluation of their applications, performance, and limitations across the radiology domain remains limited.

Objective: This review aimed to map current applications of LLMs in radiology, evaluate their performance across key tasks, and identify prevailing limitations and directions for future research.

Methods: A scoping review was conducted in accordance with the framework by Arksey and O'Malley framework and the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines. Three databases-PubMed, ScopusCOPUS, and IEEE Xplore-were searched for peer-reviewed studies published between January 2022 and December 2024. Eligible studies included empirical evaluations of LLMs applied to radiological data or workflows. Commentaries, reviews, and technical model proposals without evaluation were excluded. Two reviewers independently screened studies and extracted data on study characteristics, LLM type, radiological use case, data modality, and evaluation metrics. A thematic synthesis was used to identify key domains of application. No formal risk-of-bias assessment was performed, but a narrative appraisal of dataset representativeness and study quality was included.

Results: A total of 67 studies were included. (n/N, %)GPT-4 was the most frequently used model (n=28, 42%), with text-based corpora as the primary type of data used (n=43, 64%). Identified use cases fell into three thematic domains: (1) decision support (n=39, 58%), (2) report generation and summarization (n=16, 24%), and (3) workflow optimization (n=12, 18%). While LLMs demonstrated strong performance in structured-text tasks (eg, report simplification with >94% accuracy), diagnostic performance varied widely (16%-86%) and was limited by dataset bias, lack of fine tuning, and minimal clinical validation. Most studies (n=53, 79.1%) had single-center, proof-of-concept designs with limited generalizability.

Conclusions: LLMs show strong potential for augmenting radiological workflows, particularly for structured reporting, summarization, and educational tasks. However, their diagnostic performance remains inconsistent, and current implementations lack robust external validation. Future work should prioritize prospective, multicenter validation of domain-adapted and multimodal models to support safe clinical integration.

背景:大型语言模型(llm)在放射学中的应用正在迅速扩大,为报告生成、决策支持和工作流程优化提供了新的可能性。然而,对它们在放射学领域的应用、性能和局限性的综合评估仍然有限。目的:本综述旨在绘制llm在放射学中的当前应用,评估其在关键任务中的表现,并确定当前的局限性和未来研究的方向。方法:根据Arksey和O'Malley框架和PRISMA-ScR(系统评价和荟萃分析扩展范围评价的首选报告项目)指南的框架进行范围评价。在pubmed、ScopusCOPUS和IEEE explore三个数据库中检索了2022年1月至2024年12月间发表的同行评议研究。合格的研究包括法学硕士应用于放射学数据或工作流程的实证评估。没有评估的评论、评论和技术模型建议被排除在外。两位审稿人独立筛选研究并提取研究特征、LLM类型、放射学用例、数据模式和评估指标方面的数据。专题综合用于确定关键的应用领域。没有进行正式的偏倚风险评估,但包括对数据集代表性和研究质量的叙述性评估。结果:共纳入67项研究。(n/ n, %)GPT-4是最常用的模型(n=28, 42%),基于文本的语料库是使用的主要数据类型(n=43, 64%)。确定的用例分为三个主题领域:(1)决策支持(n=39, 58%),(2)报告生成和总结(n=16, 24%),以及(3)工作流优化(n=12, 18%)。虽然llm在结构化文本任务中表现出强大的性能(例如,报告简化的准确率为60% - 94%),但诊断性能差异很大(16%-86%),并且受到数据集偏差,缺乏微调和最小临床验证的限制。大多数研究(n=53, 79.1%)采用单中心、概念验证设计,通用性有限。结论:法学硕士显示出增强放射学工作流程的强大潜力,特别是在结构化报告、总结和教育任务方面。然而,它们的诊断性能仍然不一致,并且当前的实现缺乏健壮的外部验证。未来的工作应优先考虑前瞻性、多中心的领域适应性和多模态模型验证,以支持安全的临床整合。
{"title":"Trends and Trajectories in the Rise of Large Language Models in Radiology: Scoping Review.","authors":"Adhari Al Zaabi, Rashid Alshibli, Abdullah AlAmri, Ibrahim AlRuheili, Syaheerah Lebai Lutfi","doi":"10.2196/78041","DOIUrl":"10.2196/78041","url":null,"abstract":"<p><strong>Background: </strong>The use of large language models (LLMs) in radiology is expanding rapidly, offering new possibilities in report generation, decision support, and workflow optimization. However, a comprehensive evaluation of their applications, performance, and limitations across the radiology domain remains limited.</p><p><strong>Objective: </strong>This review aimed to map current applications of LLMs in radiology, evaluate their performance across key tasks, and identify prevailing limitations and directions for future research.</p><p><strong>Methods: </strong>A scoping review was conducted in accordance with the framework by Arksey and O'Malley framework and the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines. Three databases-PubMed, ScopusCOPUS, and IEEE Xplore-were searched for peer-reviewed studies published between January 2022 and December 2024. Eligible studies included empirical evaluations of LLMs applied to radiological data or workflows. Commentaries, reviews, and technical model proposals without evaluation were excluded. Two reviewers independently screened studies and extracted data on study characteristics, LLM type, radiological use case, data modality, and evaluation metrics. A thematic synthesis was used to identify key domains of application. No formal risk-of-bias assessment was performed, but a narrative appraisal of dataset representativeness and study quality was included.</p><p><strong>Results: </strong>A total of 67 studies were included. (n/N, %)GPT-4 was the most frequently used model (n=28, 42%), with text-based corpora as the primary type of data used (n=43, 64%). Identified use cases fell into three thematic domains: (1) decision support (n=39, 58%), (2) report generation and summarization (n=16, 24%), and (3) workflow optimization (n=12, 18%). While LLMs demonstrated strong performance in structured-text tasks (eg, report simplification with >94% accuracy), diagnostic performance varied widely (16%-86%) and was limited by dataset bias, lack of fine tuning, and minimal clinical validation. Most studies (n=53, 79.1%) had single-center, proof-of-concept designs with limited generalizability.</p><p><strong>Conclusions: </strong>LLMs show strong potential for augmenting radiological workflows, particularly for structured reporting, summarization, and educational tasks. However, their diagnostic performance remains inconsistent, and current implementations lack robust external validation. Future work should prioritize prospective, multicenter validation of domain-adapted and multimodal models to support safe clinical integration.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e78041"},"PeriodicalIF":3.8,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12688054/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145716933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coagulation Risk Prediction in Patients With Liver Failure: Integrated Meta-Analysis and Machine Learning Model Study. 肝衰竭患者凝血风险预测:综合meta分析和机器学习模型研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-08 DOI: 10.2196/76348
Hao Wang, Tao He, Liang Ren, Tingjun Zhang

Background: Liver failure often results in significant coagulation dysfunction, which is a major complication. Artificial liver support systems (ALSS) have been used to ameliorate coagulation parameters, but the dynamic nature of these improvements and the development of predictive models remain insufficiently explored.

Objective: This study aimed to evaluate the effects of ALSS on coagulation function and to develop a dynamic prediction model using machine learning techniques to predict the improvement trends of coagulation parameters.

Methods: A systematic search was conducted in PubMed, Embase, and other databases to identify relevant studies, resulting in 18 studies comprising 1771 patients. A meta-analysis was performed to assess the impact of ALSS on coagulation parameters, including international normalized ratio (INR), prothrombin time (PT), activated partial thromboplastin time (APTT), and fibrinogen levels. In addition, clinical data from the Medical Information Mart for Intensive Care database were used to construct prediction models using logistic regression, extreme gradient boosting, random forest, and long short-term memory networks.

Results: Meta-analysis results showed that ALSS significantly improved INR, PT, APTT, and fibrinogen levels (all P<.05), with the treatment efficacy varying by modality. Among the machine learning models, the random forest model demonstrated the best performance, achieving an area under the curve of 92.12%. Dynamic INR was identified as the key predictor for coagulation abnormalities.

Conclusions: This study systematically evaluated the effects of ALSS on coagulation function in patients with liver failure, demonstrating significant improvements in key parameters such as INR, PT, and APTT, with efficacy varying across different treatment modalities. Simultaneously, a machine learning model built using intensive care unit clinical data exhibited strong predictive capability for identifying the risk of coagulation dysfunction, particularly useful in supporting early-stage clinical recognition of high-risk patients and guiding personalized coagulation management strategies. It is important to emphasize that this model is positioned as a dynamic risk alert and assessment tool, intended to assist clinical baseline evaluation and nursing interventions, rather than serving as direct validation of ALSS therapeutic efficacy.

背景:肝功能衰竭常导致明显的凝血功能障碍,这是主要的并发症。人工肝支持系统(ALSS)已被用于改善凝血参数,但这些改善的动态性质和预测模型的发展仍未得到充分探讨。目的:本研究旨在评价ALSS对凝血功能的影响,并利用机器学习技术建立动态预测模型,预测凝血参数的改善趋势。方法:系统检索PubMed、Embase等数据库,确定相关研究,共纳入18项研究,1771例患者。荟萃分析评估ALSS对凝血参数的影响,包括国际标准化比率(INR)、凝血酶原时间(PT)、活化部分凝血活素时间(APTT)和纤维蛋白原水平。此外,利用重症监护医学信息市场数据库中的临床数据,利用逻辑回归、极端梯度增强、随机森林和长短期记忆网络构建预测模型。结果:荟萃分析结果显示,ALSS可显著改善INR、PT、APTT和纤维蛋白原水平(均为p)。结论:本研究系统评估了ALSS对肝功能衰竭患者凝血功能的影响,显示出INR、PT、APTT等关键参数均有显著改善,且不同治疗方式的疗效不同。同时,利用重症监护室临床数据建立的机器学习模型在识别凝血功能障碍风险方面表现出很强的预测能力,特别是在支持高危患者的早期临床识别和指导个性化凝血管理策略方面。需要强调的是,该模型定位为动态风险预警和评估工具,旨在协助临床基线评估和护理干预,而不是直接验证ALSS的治疗效果。
{"title":"Coagulation Risk Prediction in Patients With Liver Failure: Integrated Meta-Analysis and Machine Learning Model Study.","authors":"Hao Wang, Tao He, Liang Ren, Tingjun Zhang","doi":"10.2196/76348","DOIUrl":"10.2196/76348","url":null,"abstract":"<p><strong>Background: </strong>Liver failure often results in significant coagulation dysfunction, which is a major complication. Artificial liver support systems (ALSS) have been used to ameliorate coagulation parameters, but the dynamic nature of these improvements and the development of predictive models remain insufficiently explored.</p><p><strong>Objective: </strong>This study aimed to evaluate the effects of ALSS on coagulation function and to develop a dynamic prediction model using machine learning techniques to predict the improvement trends of coagulation parameters.</p><p><strong>Methods: </strong>A systematic search was conducted in PubMed, Embase, and other databases to identify relevant studies, resulting in 18 studies comprising 1771 patients. A meta-analysis was performed to assess the impact of ALSS on coagulation parameters, including international normalized ratio (INR), prothrombin time (PT), activated partial thromboplastin time (APTT), and fibrinogen levels. In addition, clinical data from the Medical Information Mart for Intensive Care database were used to construct prediction models using logistic regression, extreme gradient boosting, random forest, and long short-term memory networks.</p><p><strong>Results: </strong>Meta-analysis results showed that ALSS significantly improved INR, PT, APTT, and fibrinogen levels (all P<.05), with the treatment efficacy varying by modality. Among the machine learning models, the random forest model demonstrated the best performance, achieving an area under the curve of 92.12%. Dynamic INR was identified as the key predictor for coagulation abnormalities.</p><p><strong>Conclusions: </strong>This study systematically evaluated the effects of ALSS on coagulation function in patients with liver failure, demonstrating significant improvements in key parameters such as INR, PT, and APTT, with efficacy varying across different treatment modalities. Simultaneously, a machine learning model built using intensive care unit clinical data exhibited strong predictive capability for identifying the risk of coagulation dysfunction, particularly useful in supporting early-stage clinical recognition of high-risk patients and guiding personalized coagulation management strategies. It is important to emphasize that this model is positioned as a dynamic risk alert and assessment tool, intended to assist clinical baseline evaluation and nursing interventions, rather than serving as direct validation of ALSS therapeutic efficacy.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e76348"},"PeriodicalIF":3.8,"publicationDate":"2025-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12723362/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145709516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Speech Analysis for Screening and Monitoring Bipolar Depression: Machine Learning Model Development and Interpretation Study. 筛选和监测双相抑郁症的自动语音分析:机器学习模型开发和解释研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-04 DOI: 10.2196/79093
Sooyeon Min, Tae-Sung Yeum, Daun Shin, Sang Jin Rhee, Hyunju Lee, Han-Sung Lee, Seongmin Park, Jihwa Lee, Yong Min Ahn
<p><strong>Background: </strong>Depressive episodes in bipolar disorder are frequent, prolonged, and contribute substantially to functional impairment and reduced quality of life. Therefore, early and objective detection of bipolar depression is critical for timely intervention and improved outcomes. Multimodal speech analyses hold promise for capturing psychomotor, cognitive, and affective changes associated with bipolar depression.</p><p><strong>Objective: </strong>This study aims to develop between- and within-person classifiers to screen for bipolar depression and monitor longitudinal changes to detect depressive recurrence in patients with bipolar disorder. A secondary objective was to compare the predictive performance across speech modalities.</p><p><strong>Methods: </strong>We collected 304 voice audio recordings obtained during semistructured interviews with 92 patients diagnosed with bipolar disorder over a 1-year period. Depression severity was assessed using the Hamilton Depression Rating Scale. Acoustic features were extracted using the openSMILE toolkit, and linguistic features were extracted using the Linguistic Inquiry and Word Count frameworks following automatic speech recognition and machine translation. Mixed-effects multivariate linear regression evaluated the associations between speech markers and Hamilton Depression Rating Scale scores adjusting for demographic variables, diagnosis, and feature-specific covariates. Extreme gradient boosting and light gradient boosting were used as base learners. We developed a between-person classifier to detect moderate to severe depression and a within-person classifier to detect recurrence. Hyperparameter tuning and 95% CI estimation were performed using a bootstrap bias-corrected cross-validation (k=5) approach combined with a grid search. Feature contributions were interpreted using Shapley additive explanations.</p><p><strong>Results: </strong>Patients with depression showed reduced energy modulation, prolonged monotony, and more frequent use of words related to death and negative emotions. The between-person classifier combining acoustic and linguistic features detected moderate to severe depression with an area under the curve of 0.76 compared to 0.54 for the demographic model. The within-person classifier based on speech features detected depression recurrence with an area under the curve of 0.70 compared to 0.55 for the demographic model.</p><p><strong>Conclusions: </strong>Between- and within-person comparisons of speech markers can be leveraged in detecting and monitoring bipolar depression. We demonstrate the feasibility of applying Linguistic Inquiry and Word Count-based psycholinguistic analysis to machine-transcribed and translated speech, supporting the replicability of this approach across languages. Automated multimodal voice analysis can be integrated into digital health platforms, providing a scalable and effective approach for accessing mental health monitoring and ca
背景:双相情感障碍中的抑郁发作是频繁的、持续的,并且在很大程度上导致功能障碍和生活质量下降。因此,早期客观发现双相抑郁对于及时干预和改善预后至关重要。多模态言语分析有望捕获与双相抑郁症相关的精神运动、认知和情感变化。目的:本研究旨在建立人与人之间和人与人之间的分类器来筛查双相情感障碍和监测纵向变化,以检测双相情感障碍患者的抑郁复发。第二个目标是比较不同语音模式的预测性能。方法:我们收集了92名双相情感障碍患者在1年半结构化访谈中获得的304份语音录音。采用汉密尔顿抑郁评定量表评估抑郁严重程度。声学特征提取使用openSMILE工具包,语言特征提取使用语言查询和单词计数框架,然后进行自动语音识别和机器翻译。混合效应多变量线性回归评估了语音标记与汉密尔顿抑郁评定量表评分之间的关系,调整了人口统计学变量、诊断和特征特异性协变量。使用极端梯度增强和光梯度增强作为基础学习器。我们开发了一个人与人之间的分类器来检测中度到重度抑郁症和一个人与人之间的分类器来检测复发。使用自举偏差校正交叉验证(k=5)方法结合网格搜索进行超参数调整和95% CI估计。特征贡献用Shapley加性解释解释。结果:抑郁症患者表现出能量调节减弱、单调感延长、死亡和负面情绪相关词汇使用频率增加。结合声学和语言特征的人之间分类器检测到中度至重度抑郁症,曲线下面积为0.76,而人口统计学模型的曲线下面积为0.54。基于语音特征的人内分类器检测到抑郁症复发的曲线下面积为0.70,而人口统计学模型的曲线下面积为0.55。结论:人与人之间的言语标记比较可用于检测和监测双相抑郁症。我们展示了将语言探究和基于词计数的心理语言学分析应用于机器转录和翻译语音的可行性,支持这种方法在不同语言之间的可复制性。自动化多模态语音分析可以集成到数字健康平台中,为获取精神健康监测和护理提供可扩展和有效的方法。
{"title":"Automated Speech Analysis for Screening and Monitoring Bipolar Depression: Machine Learning Model Development and Interpretation Study.","authors":"Sooyeon Min, Tae-Sung Yeum, Daun Shin, Sang Jin Rhee, Hyunju Lee, Han-Sung Lee, Seongmin Park, Jihwa Lee, Yong Min Ahn","doi":"10.2196/79093","DOIUrl":"10.2196/79093","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Depressive episodes in bipolar disorder are frequent, prolonged, and contribute substantially to functional impairment and reduced quality of life. Therefore, early and objective detection of bipolar depression is critical for timely intervention and improved outcomes. Multimodal speech analyses hold promise for capturing psychomotor, cognitive, and affective changes associated with bipolar depression.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aims to develop between- and within-person classifiers to screen for bipolar depression and monitor longitudinal changes to detect depressive recurrence in patients with bipolar disorder. A secondary objective was to compare the predictive performance across speech modalities.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We collected 304 voice audio recordings obtained during semistructured interviews with 92 patients diagnosed with bipolar disorder over a 1-year period. Depression severity was assessed using the Hamilton Depression Rating Scale. Acoustic features were extracted using the openSMILE toolkit, and linguistic features were extracted using the Linguistic Inquiry and Word Count frameworks following automatic speech recognition and machine translation. Mixed-effects multivariate linear regression evaluated the associations between speech markers and Hamilton Depression Rating Scale scores adjusting for demographic variables, diagnosis, and feature-specific covariates. Extreme gradient boosting and light gradient boosting were used as base learners. We developed a between-person classifier to detect moderate to severe depression and a within-person classifier to detect recurrence. Hyperparameter tuning and 95% CI estimation were performed using a bootstrap bias-corrected cross-validation (k=5) approach combined with a grid search. Feature contributions were interpreted using Shapley additive explanations.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Patients with depression showed reduced energy modulation, prolonged monotony, and more frequent use of words related to death and negative emotions. The between-person classifier combining acoustic and linguistic features detected moderate to severe depression with an area under the curve of 0.76 compared to 0.54 for the demographic model. The within-person classifier based on speech features detected depression recurrence with an area under the curve of 0.70 compared to 0.55 for the demographic model.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;Between- and within-person comparisons of speech markers can be leveraged in detecting and monitoring bipolar depression. We demonstrate the feasibility of applying Linguistic Inquiry and Word Count-based psycholinguistic analysis to machine-transcribed and translated speech, supporting the replicability of this approach across languages. Automated multimodal voice analysis can be integrated into digital health platforms, providing a scalable and effective approach for accessing mental health monitoring and ca","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e79093"},"PeriodicalIF":3.8,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12715464/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145678683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Involving Health, Technology, and Financial Stakeholders in Co-Designing Digital Pathways for Value-Based Care. 让健康、技术和财务利益相关者共同设计基于价值的护理的数字途径。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-04 DOI: 10.2196/84885
Pieter Vandekerckhove, Benjamin H L Harris, Louis J Koizia, Steven Howard
{"title":"Involving Health, Technology, and Financial Stakeholders in Co-Designing Digital Pathways for Value-Based Care.","authors":"Pieter Vandekerckhove, Benjamin H L Harris, Louis J Koizia, Steven Howard","doi":"10.2196/84885","DOIUrl":"10.2196/84885","url":null,"abstract":"","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e84885"},"PeriodicalIF":3.8,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12677727/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145678722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Clinical Calculator for Predicting 28-Day Mortality in Older Adult Patients With Sepsis-Associated Encephalopathy: Retrospective Study Using MIMIC-IV. 预测老年败血症相关脑病患者28天死亡率的在线临床计算器:使用MIMIC-IV的回顾性研究
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-04 DOI: 10.2196/76417
Guangyong Jin, Menglu Zhou, Jiayi Chen, Mengyuan Diao, Wei Hu
<p><strong>Background: </strong>Sepsis-associated encephalopathy (SAE) represents a critical complication of sepsis, especially among older adults. Despite its clinical relevance, there remains a lack of accessible and practical tools specifically designed to predict 28-day mortality in this vulnerable population.</p><p><strong>Objective: </strong>We aimed to enhance the practical applicability of the model by creating a web-based tool that allows real-time, individualized mortality risk prediction, facilitating early intervention and informed decision-making in clinical practice.</p><p><strong>Methods: </strong>Using data extracted from the MIMIC-IV (Medical Information Mart for Intensive Care IV) database, we identified older patients (≥65 years) with SAE (n=2165) and divided them into a development cohort (n=1531) and a validation cohort (n=634). Key risk factors associated with 28-day mortality were identified, and a predictive nomogram was constructed. Model performance was evaluated using the concordance index, integrated discrimination improvement, net reclassification index, and calibration curve analysis. Clinical applicability was assessed through decision curve analysis and benchmarked against traditional intensive care unit (ICU) scoring systems. Furthermore, the nomogram was deployed as a web-based application, enabling clinicians to input data and generate individualized mortality predictions.</p><p><strong>Results: </strong>A total of 2165 older patients with SAE were included, among whom 290 (13.4%) died within 28 days of ICU admission. Multivariable logistic regression identified lower body weight (odds ratio [OR] 0.985, 95% CI 0.975-0.994; P=.001), lower systolic blood pressure (OR 0.972, 95% CI 0.957-0.986; P<.001), lower hemoglobin (OR 0.984, 95% CI 0.974-0.995; P=.005), lower PaO2 (OR 0.996, 95% CI 0.994-0.997; P<.001), and lower Glasgow Coma Scale score (OR 0.825, 95% CI 0.786-0.864; P<.001) as mortality risk factors. Higher respiratory rate (OR 1.083, 95% CI 1.029-1.141; P=.002), increased anion gap (OR 1.081, 95% CI 1.031-1.135; P=.001), elevated blood urea nitrogen (OR 1.045, 95% CI 1.016-1.076; P=.002), prolonged partial thromboplastin time (OR 1.033, 95% CI 1.016-1.050; P<.001), and reduced urine output (OR>0.99, 95% CI 0.999-1.000; P=.002) were also predictive. Patients admitted to "other" ICU types had lower mortality compared with the medical ICU reference group (OR 0.327, 95% CI 0.176-0.609; P<.001). The nomogram achieved concordance index values of 0.899 (development) and 0.897 (validation), outperforming sequential organ failure assessment (0.692), Acute Physiology Score III (0.804), Logistic Organ Dysfunction System (0.771), Simplified Acute Physiology Score II (0.704), and Oxford Acute Severity of Illness Score (0.753), with significant integrated discrimination improvement and net reclassification index improvements (all P<.001). Calibration curves confirmed good agreement between predicted and observed outcome
背景:脓毒症相关脑病(SAE)是脓毒症的一种重要并发症,尤其是在老年人中。尽管它具有临床意义,但仍然缺乏专门用于预测这一弱势群体28天死亡率的可获得和实用的工具。目的:我们旨在通过创建一个基于网络的工具来增强模型的实际适用性,该工具可以实时、个性化地预测死亡风险,促进临床实践中的早期干预和知情决策。方法:使用从MIMIC-IV(重症医疗信息市场IV)数据库中提取的数据,我们确定老年SAE患者(n=2165),并将其分为发展队列(n=1531)和验证队列(n=634)。确定与28天死亡率相关的关键危险因素,并构建预测nomogram。采用一致性指数、综合判别改进、净重分类指数和校准曲线分析对模型性能进行评价。通过决策曲线分析评估临床适用性,并以传统的重症监护病房(ICU)评分系统为基准。此外,nomogram作为一个基于网络的应用程序部署,使临床医生能够输入数据并生成个性化的死亡率预测。结果:共纳入2165例老年SAE患者,其中290例(13.4%)在入院后28天内死亡。多变量logistic回归发现,较低的体重(比值比[OR] 0.985, 95% CI 0.975-0.994; P=.001)、较低的收缩压(OR 0.972, 95% CI 0.957-0.986; P0.99, 95% CI 0.999-1.000; P=.002)也是预测因素。与内科ICU参照组相比,入住“其他”ICU类型的患者死亡率更低(OR 0.327, 95% CI 0.176-0.609)。结论:本研究结合常规临床数据,提出了一种新的、有效的预测老年SAE患者28天死亡率的nomogram。该模型作为数字工具的部署增强了其可访问性和可用性,为临床医生提供了风险分层和个性化患者管理的实用资源。
{"title":"Online Clinical Calculator for Predicting 28-Day Mortality in Older Adult Patients With Sepsis-Associated Encephalopathy: Retrospective Study Using MIMIC-IV.","authors":"Guangyong Jin, Menglu Zhou, Jiayi Chen, Mengyuan Diao, Wei Hu","doi":"10.2196/76417","DOIUrl":"10.2196/76417","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Sepsis-associated encephalopathy (SAE) represents a critical complication of sepsis, especially among older adults. Despite its clinical relevance, there remains a lack of accessible and practical tools specifically designed to predict 28-day mortality in this vulnerable population.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;We aimed to enhance the practical applicability of the model by creating a web-based tool that allows real-time, individualized mortality risk prediction, facilitating early intervention and informed decision-making in clinical practice.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;Using data extracted from the MIMIC-IV (Medical Information Mart for Intensive Care IV) database, we identified older patients (≥65 years) with SAE (n=2165) and divided them into a development cohort (n=1531) and a validation cohort (n=634). Key risk factors associated with 28-day mortality were identified, and a predictive nomogram was constructed. Model performance was evaluated using the concordance index, integrated discrimination improvement, net reclassification index, and calibration curve analysis. Clinical applicability was assessed through decision curve analysis and benchmarked against traditional intensive care unit (ICU) scoring systems. Furthermore, the nomogram was deployed as a web-based application, enabling clinicians to input data and generate individualized mortality predictions.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;A total of 2165 older patients with SAE were included, among whom 290 (13.4%) died within 28 days of ICU admission. Multivariable logistic regression identified lower body weight (odds ratio [OR] 0.985, 95% CI 0.975-0.994; P=.001), lower systolic blood pressure (OR 0.972, 95% CI 0.957-0.986; P&lt;.001), lower hemoglobin (OR 0.984, 95% CI 0.974-0.995; P=.005), lower PaO2 (OR 0.996, 95% CI 0.994-0.997; P&lt;.001), and lower Glasgow Coma Scale score (OR 0.825, 95% CI 0.786-0.864; P&lt;.001) as mortality risk factors. Higher respiratory rate (OR 1.083, 95% CI 1.029-1.141; P=.002), increased anion gap (OR 1.081, 95% CI 1.031-1.135; P=.001), elevated blood urea nitrogen (OR 1.045, 95% CI 1.016-1.076; P=.002), prolonged partial thromboplastin time (OR 1.033, 95% CI 1.016-1.050; P&lt;.001), and reduced urine output (OR&gt;0.99, 95% CI 0.999-1.000; P=.002) were also predictive. Patients admitted to \"other\" ICU types had lower mortality compared with the medical ICU reference group (OR 0.327, 95% CI 0.176-0.609; P&lt;.001). The nomogram achieved concordance index values of 0.899 (development) and 0.897 (validation), outperforming sequential organ failure assessment (0.692), Acute Physiology Score III (0.804), Logistic Organ Dysfunction System (0.771), Simplified Acute Physiology Score II (0.704), and Oxford Acute Severity of Illness Score (0.753), with significant integrated discrimination improvement and net reclassification index improvements (all P&lt;.001). Calibration curves confirmed good agreement between predicted and observed outcome","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e76417"},"PeriodicalIF":3.8,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12715468/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145679434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning-Based Prediction of In-Hospital Falls in Adult Inpatients: Retrospective Observational Multicenter Study. 基于机器学习的成人住院患者住院跌倒预测:回顾性观察性多中心研究
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-04 DOI: 10.2196/75958
Takuya Nishino, Kotone Matsuyama, Yasuo Miyagi, Nari Tanabe, Fumiko Yamaguchi, Hiroki Ito, Shizuka Soh, Ayako Yano, Masako Mizuno, Katsuhito Kato, Hiroshige Jinnouchi, Chol Kim, Yosuke Ishii, Hiroki Yamaguchi, Yukihiro Kondo

Background: Falls among hospitalized patients are a critical issue that often leads to prolonged hospital stays and increased health care costs. Traditional fall risk assessments typically rely on standardized scoring systems; however, these may fail to capture the complex and multifactorial nature of fall risk factors.

Objective: This retrospective observational multicenter study aimed to develop and validate a machine learning-based model to predict in-hospital falls and to evaluate its performance in terms of discrimination and calibration.

Methods: We analyzed the data of 83,917 inpatients aged 65 years and older with a hospital stay of at least 3 days. Using Diagnosis Procedure Combination data and laboratory results, we extracted demographic, clinical, functional, and pharmacological variables. Following the selection of 30 key features, 4 predictive models were constructed: logistic regression, extreme gradient boosting, light gradient boosting machine (LGBM), and categorical boosting (CatBoost). The synthetic minority oversampling technique and isotonic regression calibration were applied to improve the prediction quality and address class imbalance.

Results: Falls occurred in 2173 (2.6%) patients. CatBoost achieved the highest F1-score (0.189, 95% CI 0.162-0.215) and area under the precision-recall curve (0.112, 95% CI 0.091-0.136), whereas LGBM had the best calibration slope (0.964, 95% CI 0.858-1.070) with good discrimination (F1-score 0.182, 95% CI 0.156-0.209; area under the precision-recall curve 0.094, 95% CI 0.078-0.113). Logistic regression had the lowest discrimination (F1-score 0.120, 95% CI 0.100-0.143). Shapley Additive Explanations analysis consistently identified low albumin, impaired transfer ability, and the use of sedative-hypnotics or diabetes medications as major contributors to fall risk. In incident report analysis (n=435), 49.2% of falls were toileting-related, peaking between 4 and 6 AM, with bedside falls predominating in high or very high risk groups.

Conclusions: CatBoost and LGBM offer clinically valuable prediction performance, with CatBoost favored for high-risk patient identification and LGBM for probability-based intervention thresholds. Integrating such models into electronic health records could enable real-time risk scoring and trigger targeted interventions (eg, toileting assistance and mobility support). Future work should incorporate dynamic, time-varying patient data to improve real-time risk prediction.

背景:住院患者跌倒是一个关键问题,经常导致住院时间延长和医疗保健费用增加。传统的跌倒风险评估通常依赖于标准化的评分系统;然而,这些可能无法捕捉到跌倒危险因素的复杂性和多因素性质。目的:本回顾性观察性多中心研究旨在开发和验证基于机器学习的住院跌倒预测模型,并评估其在判别和校准方面的性能。方法:对83917例65岁及以上住院3天以上患者资料进行分析。使用诊断程序组合数据和实验室结果,我们提取了人口学、临床、功能和药理学变量。在选取30个关键特征的基础上,构建了逻辑回归、极端梯度增强、轻梯度增强机(LGBM)和分类增强(CatBoost) 4个预测模型。采用合成少数过采样技术和等渗回归校正,提高了预测质量,解决了类不平衡问题。结果:2173例(2.6%)患者发生跌倒。CatBoost具有最高的f1评分(0.189,95% CI 0.162 ~ 0.215)和精确召回曲线下面积(0.112,95% CI 0.091 ~ 0.136),而LGBM具有最佳的校准斜率(0.964,95% CI 0.858 ~ 1.070),具有良好的判别性(f1评分0.182,95% CI 0.156 ~ 0.209;精确召回曲线下面积0.094,95% CI 0.078 ~ 0.113)。Logistic回归的鉴别性最低(f1评分0.120,95% CI 0.100-0.143)。Shapley加性解释分析一致认为,低白蛋白、转运能力受损、镇静催眠药或糖尿病药物的使用是导致跌倒风险的主要因素。在事件报告分析(n=435)中,49.2%的跌倒与如厕有关,高峰发生在早上4点至6点之间,在高或极高风险人群中,床边跌倒占主导地位。结论:CatBoost和LGBM具有具有临床价值的预测性能,CatBoost适用于高风险患者识别,LGBM适用于基于概率的干预阈值。将这些模型纳入电子健康记录可以实现实时风险评分并触发有针对性的干预措施(例如,如厕协助和行动支助)。未来的工作应纳入动态的、时变的患者数据,以提高实时风险预测。
{"title":"Machine Learning-Based Prediction of In-Hospital Falls in Adult Inpatients: Retrospective Observational Multicenter Study.","authors":"Takuya Nishino, Kotone Matsuyama, Yasuo Miyagi, Nari Tanabe, Fumiko Yamaguchi, Hiroki Ito, Shizuka Soh, Ayako Yano, Masako Mizuno, Katsuhito Kato, Hiroshige Jinnouchi, Chol Kim, Yosuke Ishii, Hiroki Yamaguchi, Yukihiro Kondo","doi":"10.2196/75958","DOIUrl":"10.2196/75958","url":null,"abstract":"<p><strong>Background: </strong>Falls among hospitalized patients are a critical issue that often leads to prolonged hospital stays and increased health care costs. Traditional fall risk assessments typically rely on standardized scoring systems; however, these may fail to capture the complex and multifactorial nature of fall risk factors.</p><p><strong>Objective: </strong>This retrospective observational multicenter study aimed to develop and validate a machine learning-based model to predict in-hospital falls and to evaluate its performance in terms of discrimination and calibration.</p><p><strong>Methods: </strong>We analyzed the data of 83,917 inpatients aged 65 years and older with a hospital stay of at least 3 days. Using Diagnosis Procedure Combination data and laboratory results, we extracted demographic, clinical, functional, and pharmacological variables. Following the selection of 30 key features, 4 predictive models were constructed: logistic regression, extreme gradient boosting, light gradient boosting machine (LGBM), and categorical boosting (CatBoost). The synthetic minority oversampling technique and isotonic regression calibration were applied to improve the prediction quality and address class imbalance.</p><p><strong>Results: </strong>Falls occurred in 2173 (2.6%) patients. CatBoost achieved the highest F<sub>1</sub>-score (0.189, 95% CI 0.162-0.215) and area under the precision-recall curve (0.112, 95% CI 0.091-0.136), whereas LGBM had the best calibration slope (0.964, 95% CI 0.858-1.070) with good discrimination (F<sub>1</sub>-score 0.182, 95% CI 0.156-0.209; area under the precision-recall curve 0.094, 95% CI 0.078-0.113). Logistic regression had the lowest discrimination (F<sub>1</sub>-score 0.120, 95% CI 0.100-0.143). Shapley Additive Explanations analysis consistently identified low albumin, impaired transfer ability, and the use of sedative-hypnotics or diabetes medications as major contributors to fall risk. In incident report analysis (n=435), 49.2% of falls were toileting-related, peaking between 4 and 6 AM, with bedside falls predominating in high or very high risk groups.</p><p><strong>Conclusions: </strong>CatBoost and LGBM offer clinically valuable prediction performance, with CatBoost favored for high-risk patient identification and LGBM for probability-based intervention thresholds. Integrating such models into electronic health records could enable real-time risk scoring and trigger targeted interventions (eg, toileting assistance and mobility support). Future work should incorporate dynamic, time-varying patient data to improve real-time risk prediction.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e75958"},"PeriodicalIF":3.8,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12715471/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145679326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Authors' Reply: Involving Health, Technology, and Financial Stakeholders in Co-Designing Digital Pathways for Value-Based Care. 作者回复:让健康、技术和财务利益相关者共同设计基于价值的护理的数字途径。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-04 DOI: 10.2196/86837
Jinsong Chen, Christopher Bullen, Lan Zhang
{"title":"Authors' Reply: Involving Health, Technology, and Financial Stakeholders in Co-Designing Digital Pathways for Value-Based Care.","authors":"Jinsong Chen, Christopher Bullen, Lan Zhang","doi":"10.2196/86837","DOIUrl":"10.2196/86837","url":null,"abstract":"","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e86837"},"PeriodicalIF":3.8,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12714545/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145783853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1