首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
Evaluating Multiple Input Strategies of Large Language Models for Gallbladder Polyps on Ultrasound: Comparative Study. 超声评价胆囊息肉大语言模型的多种输入策略:比较研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-23 DOI: 10.2196/71178
Lin Jiang, Jiaqian Yao, Zebang Yang, Fuqiu Tang, Xin Zheng, Xiaoer Zhang, Xiaoyan Xie, Ming Xu, Tongyi Huang
<p><strong>Background: </strong>Gallbladder polyps have a high prevalence and are predominantly benign lesions, often detected via ultrasound. They impose diagnostic burdens on radiologists while generating substantial patient demand for report interpretation. Benign polyps include nonneoplastic polyps without malignant potential and premalignant adenomas that require cholecystectomy. Current guidelines recommending surgery for polyps ≥1.0 cm may lead to unnecessary interventions. Advanced multimodal large language models (LLMs) such as ChatGPT-4o (OpenAI) and Claude 3.5 Sonnet (Anthropic PBC) demonstrate emerging capabilities in medical image analysis. Implementing LLMs in gallbladder polyp ultrasound evaluation can potentially alleviate radiologists' workload, provide patient-accessible consultation platforms, and even reduce overtreatment.</p><p><strong>Objective: </strong>We aimed to analyze the feasibility and conduct an early-stage evaluation of using LLMs for differentiating between adenomatous and nonneoplastic gallbladder polyps (≥1.0 cm) based on ChatGPT-4o and Claude 3.5 Sonnet, compared to assessments by radiologists and the guideline.</p><p><strong>Methods: </strong>Ultrasound images and reports of gallbladder polyps ≥1.0 cm with pathology were retrospectively collected from a hospital between January 2011 and January 2022. LLM performance was evaluated using three input strategies: (1) direct image analysis (LLMs-image), (2) feature-based text analysis (LLMs-text), and (3) scoring model-based text analysis (LLMs-model). Both intra- and interreader agreement and diagnostic performance of LLMs were evaluated for all three strategies. The diagnostic performance metrics-including sensitivity, specificity, accuracy, area under the receiver operating characteristic curve, and unnecessary resection rate of nonneoplastic polyps of LLMs in the three strategies were compared with the guideline. Additionally, the strategy LLMs-model was specifically compared with radiologists using the same scoring system (strategy readers-model).</p><p><strong>Results: </strong>This study included 223 patients (aged 18-72 years; 132/223, 59.2% female) as the initial cohort, with 48 adenomatous polyps and 175 nonneoplastic polyps. The external test set comprised 100 patients. The intrareader agreement coefficients for strategy LLMs-model were significantly higher than those for strategy LLMs-image and LLMs-text (all P<.01). The interreader agreement of the three diagnostic strategies was ranked as LLMs-model>LLMs-text>LLMs-image. The sensitivity of strategies LLMs-image and LLMs-text was significantly lower than that of the guideline (all P<.001). When applying a scoring model (readers/LLMs-model strategy), both radiologists and the LLMs achieved a significantly higher accuracy compared to the guideline (0.34, 0.35, and 0.34 vs 0.22, all P<.01), and the unnecessary resection rate of nonneoplastic polyps was significantly lower (82%, 83%, and 83% vs 100%, all P
背景:胆囊息肉发病率高,主要为良性病变,常通过超声检测。它们给放射科医生增加了诊断负担,同时产生了大量患者对报告解释的需求。良性息肉包括无恶性潜能的非肿瘤性息肉和需要胆囊切除术的癌前腺瘤。目前的指南建议对≥1.0 cm的息肉进行手术可能导致不必要的干预。先进的多模态大型语言模型(llm),如chatgpt - 40 (OpenAI)和Claude 3.5 Sonnet (Anthropic PBC),展示了医学图像分析的新兴能力。在胆囊息肉超声评估中实施llm可以潜在地减轻放射科医生的工作量,为患者提供方便的咨询平台,甚至减少过度治疗。目的:对比放射科医师的评估和指南,我们旨在分析基于chatgpt - 40和Claude 3.5 Sonnet的LLMs鉴别腺瘤性和非肿瘤性胆囊息肉(≥1.0 cm)的可行性并进行早期评估。方法:回顾性收集我院2011年1月~ 2022年1月收治的≥1.0 cm胆囊息肉的超声影像及病理报告。使用三种输入策略对LLM性能进行评估:(1)直接图像分析(LLMs-image),(2)基于特征的文本分析(LLMs-text)和(3)基于评分模型的文本分析(LLMs-model)。对所有三种策略的llm的内部和解读者一致性和诊断性能进行了评估。比较三种策略的诊断性能指标,包括敏感性、特异性、准确性、受者工作特征曲线下面积、LLMs非肿瘤性息肉的不必要切除率。此外,使用相同的评分系统(策略阅读者模型),专门将llms策略模型与放射科医生进行比较。结果:本研究纳入223例患者(18-72岁;132/223,女性59.2%)作为初始队列,其中腺瘤性息肉48例,非肿瘤性息肉175例。外部测试组包括100名患者。策略LLMs-model的读者内部一致系数显著高于策略LLMs-image和策略LLMs-text(均为plms -text>LLMs-image)。LLMs-image和LLMs-text策略的敏感性显著低于指南(p < 0.05)。gpt模型和claude模型的所有诊断绩效指标与放射科医师无显著差异(P < 0.05)。结论:LLMs对医学图像的识别和解读能力有待进一步提高。带有评分系统的文本策略是目前llm最合适的诊断策略。
{"title":"Evaluating Multiple Input Strategies of Large Language Models for Gallbladder Polyps on Ultrasound: Comparative Study.","authors":"Lin Jiang, Jiaqian Yao, Zebang Yang, Fuqiu Tang, Xin Zheng, Xiaoer Zhang, Xiaoyan Xie, Ming Xu, Tongyi Huang","doi":"10.2196/71178","DOIUrl":"10.2196/71178","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Gallbladder polyps have a high prevalence and are predominantly benign lesions, often detected via ultrasound. They impose diagnostic burdens on radiologists while generating substantial patient demand for report interpretation. Benign polyps include nonneoplastic polyps without malignant potential and premalignant adenomas that require cholecystectomy. Current guidelines recommending surgery for polyps ≥1.0 cm may lead to unnecessary interventions. Advanced multimodal large language models (LLMs) such as ChatGPT-4o (OpenAI) and Claude 3.5 Sonnet (Anthropic PBC) demonstrate emerging capabilities in medical image analysis. Implementing LLMs in gallbladder polyp ultrasound evaluation can potentially alleviate radiologists' workload, provide patient-accessible consultation platforms, and even reduce overtreatment.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;We aimed to analyze the feasibility and conduct an early-stage evaluation of using LLMs for differentiating between adenomatous and nonneoplastic gallbladder polyps (≥1.0 cm) based on ChatGPT-4o and Claude 3.5 Sonnet, compared to assessments by radiologists and the guideline.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;Ultrasound images and reports of gallbladder polyps ≥1.0 cm with pathology were retrospectively collected from a hospital between January 2011 and January 2022. LLM performance was evaluated using three input strategies: (1) direct image analysis (LLMs-image), (2) feature-based text analysis (LLMs-text), and (3) scoring model-based text analysis (LLMs-model). Both intra- and interreader agreement and diagnostic performance of LLMs were evaluated for all three strategies. The diagnostic performance metrics-including sensitivity, specificity, accuracy, area under the receiver operating characteristic curve, and unnecessary resection rate of nonneoplastic polyps of LLMs in the three strategies were compared with the guideline. Additionally, the strategy LLMs-model was specifically compared with radiologists using the same scoring system (strategy readers-model).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;This study included 223 patients (aged 18-72 years; 132/223, 59.2% female) as the initial cohort, with 48 adenomatous polyps and 175 nonneoplastic polyps. The external test set comprised 100 patients. The intrareader agreement coefficients for strategy LLMs-model were significantly higher than those for strategy LLMs-image and LLMs-text (all P&lt;.01). The interreader agreement of the three diagnostic strategies was ranked as LLMs-model&gt;LLMs-text&gt;LLMs-image. The sensitivity of strategies LLMs-image and LLMs-text was significantly lower than that of the guideline (all P&lt;.001). When applying a scoring model (readers/LLMs-model strategy), both radiologists and the LLMs achieved a significantly higher accuracy compared to the guideline (0.34, 0.35, and 0.34 vs 0.22, all P&lt;.01), and the unnecessary resection rate of nonneoplastic polyps was significantly lower (82%, 83%, and 83% vs 100%, all P","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e71178"},"PeriodicalIF":3.8,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12777648/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145822236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the Body Roundness Index as a Novel Digital Biomarker for Psoriasis Risk Prediction: Cross-Sectional Study. 评估身体圆度指数作为牛皮癣风险预测的新型数字生物标志物:横断面研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-23 DOI: 10.2196/75727
Pengfei Wen, Xiaoyan Wang, Xiaoxue Zhuo, Siliang Xue

Background: Psoriasis is a chronic inflammatory skin disorder that has been increasingly linked to metabolic imbalances, particularly obesity. Conventional anthropometric indicators such as BMI and waist circumference (WC) may not sufficiently capture body fat distribution or reflect metabolic risk. The body roundness index (BRI), which integrates both height and waist measurements, has emerged as a potentially superior metric, though its relevance to psoriasis risk remains underexplored.

Objective: This study aimed to investigate the use of BRI as a digital biomarker for assessing psoriasis risk and to compare its predictive strength against BMI and WC across various demographic and metabolic subgroups using data from a nationally representative sample.

Methods: A cross-sectional analysis was conducted using data from 13,798 adults aged 20 to 59 years who participated in the National Health and Nutrition Examination Survey between 2003 and 2006 as well as between 2009 and 2014. Psoriasis status was self-reported. Anthropometric measures (BRI, BMI, and WC) were calculated from standardized physical assessments. Weighted multivariable logistic regression models and restricted cubic spline analyses were used to examine associations while adjusting for demographic, metabolic, and lifestyle variables. A nomogram was constructed to quantify the relative predictive contributions of each metric.

Results: BRI exhibited a strong linear association with psoriasis risk (odds ratio [OR] 1.11 per unit increase, 95% CI 1.05-1.17; P<.001), outperforming BMI (OR 1.03) and WC (OR 1.01). Tertile analysis revealed a 1.73-fold increased risk of psoriasis in the highest BRI group (P=.003). Subgroup analyses confirmed consistent associations across age, sex, race or ethnicity, and metabolic status (P for interaction >.05). The nomogram highlighted BRI as the most influential predictor, indicated by its broad scoring range.

Conclusions: BRI shows stronger and more consistent associations with psoriasis risk than BMI or WC, supporting its potential role as a digital biomarker for early risk stratification. Incorporating BRI into clinical decision-making tools may enhance personalized approaches to psoriasis prevention and management.

背景:牛皮癣是一种慢性炎症性皮肤病,与代谢失衡,尤其是肥胖的关系越来越密切。传统的人体测量指标,如BMI和腰围(WC)可能不能充分捕捉身体脂肪分布或反映代谢风险。身体圆度指数(BRI)结合了身高和腰围,已经成为一种潜在的优越指标,尽管它与牛皮癣风险的相关性仍未得到充分研究。目的:本研究旨在研究BRI作为评估牛皮癣风险的数字生物标志物的使用,并使用来自全国代表性样本的数据,比较其与BMI和WC在不同人口统计学和代谢亚组中的预测强度。方法:对2003 - 2006年和2009 - 2014年参加全国健康与营养检查调查的13798名20 - 59岁成年人的数据进行横断面分析。牛皮癣状况自行报告。人体测量(BRI、BMI和WC)是根据标准化的身体评估计算的。加权多变量logistic回归模型和限制三次样条分析用于检验在调整人口统计学、代谢和生活方式变量时的相关性。构建了一个模态图来量化每个指标的相对预测贡献。结果:BRI与牛皮癣风险呈强线性相关(比值比[OR] 1.11 /单位增加,95% CI 1.05-1.17; P.05)。nomogram强调BRI是最具影响力的预测指标,其广泛的评分范围表明。结论:与BMI或WC相比,BRI与牛皮癣风险的相关性更强、更一致,支持其作为早期风险分层的数字生物标志物的潜在作用。将BRI纳入临床决策工具可以增强牛皮癣预防和管理的个性化方法。
{"title":"Evaluating the Body Roundness Index as a Novel Digital Biomarker for Psoriasis Risk Prediction: Cross-Sectional Study.","authors":"Pengfei Wen, Xiaoyan Wang, Xiaoxue Zhuo, Siliang Xue","doi":"10.2196/75727","DOIUrl":"10.2196/75727","url":null,"abstract":"<p><strong>Background: </strong>Psoriasis is a chronic inflammatory skin disorder that has been increasingly linked to metabolic imbalances, particularly obesity. Conventional anthropometric indicators such as BMI and waist circumference (WC) may not sufficiently capture body fat distribution or reflect metabolic risk. The body roundness index (BRI), which integrates both height and waist measurements, has emerged as a potentially superior metric, though its relevance to psoriasis risk remains underexplored.</p><p><strong>Objective: </strong>This study aimed to investigate the use of BRI as a digital biomarker for assessing psoriasis risk and to compare its predictive strength against BMI and WC across various demographic and metabolic subgroups using data from a nationally representative sample.</p><p><strong>Methods: </strong>A cross-sectional analysis was conducted using data from 13,798 adults aged 20 to 59 years who participated in the National Health and Nutrition Examination Survey between 2003 and 2006 as well as between 2009 and 2014. Psoriasis status was self-reported. Anthropometric measures (BRI, BMI, and WC) were calculated from standardized physical assessments. Weighted multivariable logistic regression models and restricted cubic spline analyses were used to examine associations while adjusting for demographic, metabolic, and lifestyle variables. A nomogram was constructed to quantify the relative predictive contributions of each metric.</p><p><strong>Results: </strong>BRI exhibited a strong linear association with psoriasis risk (odds ratio [OR] 1.11 per unit increase, 95% CI 1.05-1.17; P<.001), outperforming BMI (OR 1.03) and WC (OR 1.01). Tertile analysis revealed a 1.73-fold increased risk of psoriasis in the highest BRI group (P=.003). Subgroup analyses confirmed consistent associations across age, sex, race or ethnicity, and metabolic status (P for interaction >.05). The nomogram highlighted BRI as the most influential predictor, indicated by its broad scoring range.</p><p><strong>Conclusions: </strong>BRI shows stronger and more consistent associations with psoriasis risk than BMI or WC, supporting its potential role as a digital biomarker for early risk stratification. Incorporating BRI into clinical decision-making tools may enhance personalized approaches to psoriasis prevention and management.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e75727"},"PeriodicalIF":3.8,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12724066/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145812282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning Approaches for Classifying Children With and Without Autism Spectrum Disorder Using Inertial Measurement Unit Hand Tracking Data: Comparative Study. 基于惯性测量单元手部跟踪数据的深度学习方法对自闭症谱系障碍儿童进行分类:比较研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-22 DOI: 10.2196/73440
John Mutersbaugh, Wan-Chun Su, Anjana Bhat, Amir Gandjbakhche

Background: Autism spectrum disorder (ASD) is a prevalent neurodevelopmental condition that can be quite difficult to diagnose due to a lack of objective diagnostic methods in the currently used behavioral assessments. Recent work has shown that children with ASD have a higher incidence of motor control differences. A compilation of studies indicates that between 50% and 88% of the children with ASD have issues with movement control based on standardized motor assessments or parent-reported questionnaires.

Objective: In this study, we assess a variety of deep learning approaches for the classification of ASD, utilizing data collected via inertial measurement unit (IMU) hand tracking during goal-directed arm movements.

Methods: IMU hand tracking data were recorded from 41 school-aged children both with and without an ASD diagnosis to track their arm movements during a reach-to-clean up task. The IMU data were then preprocessed using a moving average and z score normalization to prepare the data for deep learning models. We evaluated the effectiveness of different deep learning models using the preprocessed data and a k-fold validation approach, as well as a patient-separated approach.

Results: The best result was achieved with a convolutional autoencoder combined with long short-term memory layers, reaching an accuracy of 90.21% and an F1-score of 90.02%. Once the convolutional autoencoder+long short-term memory was determined to be the most effective model for this datatype, it was retrained and evaluated with a patient-separated dataset to assess the generalization capability of the model, achieving an accuracy of 91.87% and an F1-score of 93.66%.

Conclusions: Our deep learning approach demonstrates that our models hold potential for facilitating ASD diagnosis in clinical settings. This work validates that there are significant differences between the physical movements of typically developing children and children with ASD, and these differences can be identified by analyzing hand-eye coordination skills. Additionally, we have validated that small-scale models can still achieve a high accuracy and good generalization when classifying medical data, opening the door for future research into diagnostic models that may not require massive amounts of data.

背景:自闭症谱系障碍(ASD)是一种普遍存在的神经发育疾病,由于目前使用的行为评估缺乏客观的诊断方法,因此很难诊断。最近的研究表明,自闭症儿童在运动控制方面的差异发生率更高。一项研究汇编表明,根据标准化的运动评估或家长报告的问卷,50%至88%的自闭症儿童存在运动控制问题。目的:在本研究中,我们利用惯性测量单元(IMU)手部跟踪在目标定向手臂运动中收集的数据,评估了各种用于ASD分类的深度学习方法。方法:记录41名有或没有ASD诊断的学龄儿童的IMU手部跟踪数据,以跟踪他们在伸手清理任务中的手臂运动。然后使用移动平均线和z分数归一化对IMU数据进行预处理,为深度学习模型准备数据。我们使用预处理数据和k-fold验证方法以及患者分离方法评估了不同深度学习模型的有效性。结果:结合长短期记忆层的卷积自编码器的准确率为90.21%,f1评分为90.02%,效果最好。一旦确定卷积自编码器+长短期记忆是该数据类型最有效的模型,则使用患者分离的数据集对其进行重新训练和评估,以评估模型的泛化能力,准确率为91.87%,f1得分为93.66%。结论:我们的深度学习方法表明,我们的模型在临床环境中具有促进ASD诊断的潜力。这项工作证实了正常发育儿童和自闭症儿童的身体运动存在显著差异,这些差异可以通过分析手眼协调技能来识别。此外,我们已经验证了小规模模型在对医疗数据进行分类时仍然可以达到很高的准确性和良好的泛化,这为未来可能不需要大量数据的诊断模型的研究打开了大门。
{"title":"Deep Learning Approaches for Classifying Children With and Without Autism Spectrum Disorder Using Inertial Measurement Unit Hand Tracking Data: Comparative Study.","authors":"John Mutersbaugh, Wan-Chun Su, Anjana Bhat, Amir Gandjbakhche","doi":"10.2196/73440","DOIUrl":"10.2196/73440","url":null,"abstract":"<p><strong>Background: </strong>Autism spectrum disorder (ASD) is a prevalent neurodevelopmental condition that can be quite difficult to diagnose due to a lack of objective diagnostic methods in the currently used behavioral assessments. Recent work has shown that children with ASD have a higher incidence of motor control differences. A compilation of studies indicates that between 50% and 88% of the children with ASD have issues with movement control based on standardized motor assessments or parent-reported questionnaires.</p><p><strong>Objective: </strong>In this study, we assess a variety of deep learning approaches for the classification of ASD, utilizing data collected via inertial measurement unit (IMU) hand tracking during goal-directed arm movements.</p><p><strong>Methods: </strong>IMU hand tracking data were recorded from 41 school-aged children both with and without an ASD diagnosis to track their arm movements during a reach-to-clean up task. The IMU data were then preprocessed using a moving average and z score normalization to prepare the data for deep learning models. We evaluated the effectiveness of different deep learning models using the preprocessed data and a k-fold validation approach, as well as a patient-separated approach.</p><p><strong>Results: </strong>The best result was achieved with a convolutional autoencoder combined with long short-term memory layers, reaching an accuracy of 90.21% and an F1-score of 90.02%. Once the convolutional autoencoder+long short-term memory was determined to be the most effective model for this datatype, it was retrained and evaluated with a patient-separated dataset to assess the generalization capability of the model, achieving an accuracy of 91.87% and an F1-score of 93.66%.</p><p><strong>Conclusions: </strong>Our deep learning approach demonstrates that our models hold potential for facilitating ASD diagnosis in clinical settings. This work validates that there are significant differences between the physical movements of typically developing children and children with ASD, and these differences can be identified by analyzing hand-eye coordination skills. Additionally, we have validated that small-scale models can still achieve a high accuracy and good generalization when classifying medical data, opening the door for future research into diagnostic models that may not require massive amounts of data.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e73440"},"PeriodicalIF":3.8,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12721220/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145806481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable Big Data Platform With End-to-End Traceability for Health Data Monitoring in Older Adults: Development and Performance Evaluation. 具有端到端可追溯性的可扩展大数据平台,用于老年人健康数据监测:开发和绩效评估。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-22 DOI: 10.2196/81701
Ander Cejudo, Yone Tellechea, Amaia Calvo, Aitor Almeida, Cristina Martín, Andoni Beristain
<p><strong>Background: </strong>The increasing use of real-time health data from wearable devices and self-reported questionnaires offers significant opportunities for preventive care in aging populations. However, current health data platforms often lack built-in mechanisms for data and model traceability, version control, and coordinated management of heterogeneous data streams, which are essential for clinical accountability, regulatory compliance, and reproducibility. The absence of these features limits the reuse of health data and the reproducibility of analytical workflows across research and clinical environments.</p><p><strong>Objective: </strong>This work presents DeltaTrace, a unified big data health platform designed with traceability as a key architectural feature. The platform integrates end-to-end tracking of data and model versions with real-time and batch processing capabilities. Built entirely on open source technologies, DeltaTrace combines components for data management, model management, orchestration, and visualization. The main objective is to demonstrate that embedding traceability within the architecture enables scalable, auditable, and version-controlled processing of health data, thereby facilitating reproducible analytics and long-term maintenance of health monitoring systems.</p><p><strong>Methods: </strong>DeltaTrace adopts a medallion architecture implemented with Delta Lake to ensure atomic and version-controlled data transformations. Apache Spark is used for distributed computation, Apache Kafka for continuous data ingestion, and Apache Airflow for orchestration of batch and streaming workflows. MLflow manages the lifecycle and versioning of machine learning models, while Grafana provides visualization dashboards for real-time and aggregated data inspection. The platform is evaluated using continuous physiological signals from wearable devices and batch-ingested questionnaire data, combining synthetic and real data from the LifeSnaps dataset. Performance tests are conducted on central processing unit-only servers with 8-core and 24-core configurations to assess ingestion, aggregation, visualization, and anomaly detection latency.</p><p><strong>Results: </strong>DeltaTrace supports continuous processing for approximately 1500 users with end-to-end delays below 10 minutes. Ingestion and visualization tasks operate between mean 4.9 (SD 0.12) and 7.5 (SD 0.28) minutes, while aggregation and anomaly detection required less than mean 5.6 (SD 0.04) and 10.5 (SD 1.70) minutes, respectively. Increasing from 8 to 24 cores improved ingestion and cleaning latency by up to 25% and anomaly detection performance by up to 50%. The system maintains consistent performance across different data types, processing modes, and loads.</p><p><strong>Conclusions: </strong>DeltaTrace provides a scalable and modular architecture that incorporates traceability as a core component together with functions for model management, orchestration, an
背景:越来越多地使用来自可穿戴设备和自我报告问卷的实时健康数据,为老年人的预防保健提供了重要的机会。然而,当前的健康数据平台通常缺乏数据和模型可追溯性、版本控制和异构数据流协调管理的内置机制,这些机制对于临床问责制、法规遵从性和可重复性至关重要。这些特性的缺失限制了健康数据的重用以及研究和临床环境中分析工作流程的可重复性。目的:本文介绍了一个统一的大数据健康平台DeltaTrace,该平台以可追溯性为主要架构特征。该平台将数据和模型版本的端到端跟踪与实时和批处理功能集成在一起。DeltaTrace完全建立在开源技术之上,它结合了数据管理、模型管理、编排和可视化的组件。主要目标是证明在体系结构中嵌入可追溯性可以实现对健康数据的可伸缩、可审计和版本控制处理,从而促进健康监测系统的可重复分析和长期维护。方法:DeltaTrace采用Delta Lake实现的纪念章架构,以确保原子和版本控制的数据转换。Apache Spark用于分布式计算,Apache Kafka用于连续数据摄取,Apache Airflow用于批处理和流工作流程的编排。MLflow管理机器学习模型的生命周期和版本控制,而Grafana为实时和聚合数据检查提供可视化仪表板。该平台使用来自可穿戴设备的连续生理信号和批量摄取的问卷数据进行评估,并结合来自LifeSnaps数据集的合成和真实数据。性能测试在8核和24核配置的仅中央处理单元的服务器上进行,以评估摄取、聚合、可视化和异常检测延迟。结果:DeltaTrace支持大约1500个用户的连续处理,端到端延迟低于10分钟。摄取和可视化任务的平均操作时间在4.9 (SD 0.12)和7.5 (SD 0.28)分钟之间,而聚合和异常检测所需的时间分别低于平均5.6 (SD 0.04)和10.5 (SD 1.70)分钟。从8核增加到24核将摄取和清理延迟提高了25%,异常检测性能提高了50%。系统在不同的数据类型、处理模式和负载之间保持一致的性能。结论:DeltaTrace提供了一个可伸缩和模块化的体系结构,它将可追溯性作为核心组件与模型管理、编排和可视化的功能结合在一起。该平台支持跨数据和模型的完整版本控制,并在有限的硬件条件下保持性能。这些特性支持可重复和可审计的健康数据处理,使DeltaTrace适合于老龄化人口的持续监测和预防性保健。
{"title":"Scalable Big Data Platform With End-to-End Traceability for Health Data Monitoring in Older Adults: Development and Performance Evaluation.","authors":"Ander Cejudo, Yone Tellechea, Amaia Calvo, Aitor Almeida, Cristina Martín, Andoni Beristain","doi":"10.2196/81701","DOIUrl":"10.2196/81701","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;The increasing use of real-time health data from wearable devices and self-reported questionnaires offers significant opportunities for preventive care in aging populations. However, current health data platforms often lack built-in mechanisms for data and model traceability, version control, and coordinated management of heterogeneous data streams, which are essential for clinical accountability, regulatory compliance, and reproducibility. The absence of these features limits the reuse of health data and the reproducibility of analytical workflows across research and clinical environments.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This work presents DeltaTrace, a unified big data health platform designed with traceability as a key architectural feature. The platform integrates end-to-end tracking of data and model versions with real-time and batch processing capabilities. Built entirely on open source technologies, DeltaTrace combines components for data management, model management, orchestration, and visualization. The main objective is to demonstrate that embedding traceability within the architecture enables scalable, auditable, and version-controlled processing of health data, thereby facilitating reproducible analytics and long-term maintenance of health monitoring systems.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;DeltaTrace adopts a medallion architecture implemented with Delta Lake to ensure atomic and version-controlled data transformations. Apache Spark is used for distributed computation, Apache Kafka for continuous data ingestion, and Apache Airflow for orchestration of batch and streaming workflows. MLflow manages the lifecycle and versioning of machine learning models, while Grafana provides visualization dashboards for real-time and aggregated data inspection. The platform is evaluated using continuous physiological signals from wearable devices and batch-ingested questionnaire data, combining synthetic and real data from the LifeSnaps dataset. Performance tests are conducted on central processing unit-only servers with 8-core and 24-core configurations to assess ingestion, aggregation, visualization, and anomaly detection latency.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;DeltaTrace supports continuous processing for approximately 1500 users with end-to-end delays below 10 minutes. Ingestion and visualization tasks operate between mean 4.9 (SD 0.12) and 7.5 (SD 0.28) minutes, while aggregation and anomaly detection required less than mean 5.6 (SD 0.04) and 10.5 (SD 1.70) minutes, respectively. Increasing from 8 to 24 cores improved ingestion and cleaning latency by up to 25% and anomaly detection performance by up to 50%. The system maintains consistent performance across different data types, processing modes, and loads.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;DeltaTrace provides a scalable and modular architecture that incorporates traceability as a core component together with functions for model management, orchestration, an","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e81701"},"PeriodicalIF":3.8,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12721222/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145806460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Machine Learning Model Based on Clinical Factors to Predict the Efficacy of First-Line Immunochemotherapy for Patients With Advanced Gastric Cancer: Retrospective Study. 基于临床因素的机器学习模型预测晚期胃癌患者一线免疫化疗疗效:回顾性研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-22 DOI: 10.2196/82533
Xu Cheng, Ping Li, Enqing Meng, Xinyi Wu, Hao Wu
<p><strong>Background: </strong>The development of immunotherapy has provided new hope for patients with advanced gastric cancer (AGC). However, due to the high heterogeneity of the disease, the efficacy of first-line immunochemotherapy varies among patients. There is still a lack of simple and effective models to predict the efficacy of immunochemotherapy in this setting.</p><p><strong>Objective: </strong>This study aimed to identify critical factors and develop predictive models to evaluate the efficacy of first-line immunochemotherapy in patients with AGC using clinically available data. The goal was to offer evidence-based guidance for clinical practice and enable personalized treatment strategies.</p><p><strong>Methods: </strong>To evaluate the effectiveness of first-line immunochemotherapy in AGC, we retrospectively collected clinical data from The First Affiliated Hospital of Nanjing Medical University between January 2018 and October 2023. The data collected were divided into a training set (168/240, 70%) and an internal validation set (72/240, 30%). Additionally, a temporal validation cohort of 76 patients recruited from November 2023 to September 2024 was assembled to further evaluate the predictive performance of the models. We used univariate and multivariate Cox regression analyses, along with the least absolute shrinkage and selection operator (LASSO) regression, and integrated clinical expertise to identify key predictors of treatment efficacy and to construct the LASSO-Cox model. We developed 4 models (LASSO-Cox, random survival forest [RSF], extreme gradient boosting, and survival support vector machine) and evaluated their performance using the C-index, area under the curve (AUC), calibration curves, and decision curve analysis. The optimal model was interpreted using Shapley additive explanations, and its risk scores were used to stratify patients for Kaplan-Meier survival analysis.</p><p><strong>Results: </strong>Among the 4 prognostic models developed in this study, the RSF model demonstrated superior predictive accuracy and discrimination for progression-free survival, as evidenced by its higher AUC, concordance index, continuous AUC curves, and calibration curves compared with the other 3 models. Additionally, decision curve analysis showed that the RSF model offered greater net clinical benefit. The Shapley additive explanations results identified that age, histological subtype, the proportion of CD19<sup>+</sup> B cells, CD16<sup>+</sup>CD56<sup>+</sup> natural killer cells, and the presence of liver metastasis were key prognostic factors influencing patient outcomes. Patients in the low-risk group, as determined by the RSF model's risk score, exhibited a significantly higher progression-free survival rate than those in the high-risk group, further validating the value of the RSF model for risk stratification.</p><p><strong>Conclusions: </strong>This study is the first to use machine learning algorithms to develop a predi
背景:免疫疗法的发展为晚期胃癌患者提供了新的希望。然而,由于疾病的高度异质性,一线免疫化疗的疗效因患者而异。目前仍缺乏简单有效的模型来预测免疫化疗在这种情况下的疗效。目的:本研究旨在利用临床数据,确定关键因素并建立预测模型,以评估一线免疫化疗对AGC患者的疗效。目标是为临床实践提供循证指导,并实现个性化治疗策略。方法:为评价一线免疫化疗治疗AGC的有效性,回顾性收集南京医科大学第一附属医院2018年1月至2023年10月的临床资料。将收集到的数据分为训练集(168/ 240,70%)和内部验证集(72/ 240,30%)。此外,还收集了2023年11月至2024年9月期间招募的76名患者的时间验证队列,以进一步评估模型的预测性能。我们使用单变量和多变量Cox回归分析,以及最小绝对收缩和选择算子(LASSO)回归,并整合临床专业知识来确定治疗效果的关键预测因素,并构建LASSO-Cox模型。我们开发了LASSO-Cox、随机生存森林(RSF)、极端梯度增强(extreme gradient boosting)和生存支持向量机(survival support vector machine) 4个模型,并使用c指数、曲线下面积(AUC)、校准曲线和决策曲线分析来评估它们的性能。使用Shapley加性解释解释最优模型,并使用其风险评分对患者进行分层,进行Kaplan-Meier生存分析。结果:在本研究建立的4种预后模型中,RSF模型的AUC、一致性指数、连续AUC曲线和校准曲线均高于其他3种模型,对无进展生存期的预测准确性和判别性均优于其他3种模型。此外,决策曲线分析显示,RSF模型提供了更大的净临床效益。Shapley加性解释结果发现,年龄、组织学亚型、CD19+ B细胞、CD16+CD56+自然杀伤细胞的比例以及是否存在肝转移是影响患者预后的关键因素。根据RSF模型的风险评分,低危组患者的无进展生存率明显高于高危组,进一步验证了RSF模型在风险分层中的价值。结论:本研究首次使用机器学习算法建立了一线免疫化疗治疗AGC疗效的预测模型,并确定了治疗结果的关键预测因素。结果表明,RSF模型不仅可以对可能受益的患者进行精确分层,更重要的是,为个性化临床策略提供可量化的决策支持,强调了其在临床决策中的潜在价值。
{"title":"A Machine Learning Model Based on Clinical Factors to Predict the Efficacy of First-Line Immunochemotherapy for Patients With Advanced Gastric Cancer: Retrospective Study.","authors":"Xu Cheng, Ping Li, Enqing Meng, Xinyi Wu, Hao Wu","doi":"10.2196/82533","DOIUrl":"10.2196/82533","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;The development of immunotherapy has provided new hope for patients with advanced gastric cancer (AGC). However, due to the high heterogeneity of the disease, the efficacy of first-line immunochemotherapy varies among patients. There is still a lack of simple and effective models to predict the efficacy of immunochemotherapy in this setting.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aimed to identify critical factors and develop predictive models to evaluate the efficacy of first-line immunochemotherapy in patients with AGC using clinically available data. The goal was to offer evidence-based guidance for clinical practice and enable personalized treatment strategies.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;To evaluate the effectiveness of first-line immunochemotherapy in AGC, we retrospectively collected clinical data from The First Affiliated Hospital of Nanjing Medical University between January 2018 and October 2023. The data collected were divided into a training set (168/240, 70%) and an internal validation set (72/240, 30%). Additionally, a temporal validation cohort of 76 patients recruited from November 2023 to September 2024 was assembled to further evaluate the predictive performance of the models. We used univariate and multivariate Cox regression analyses, along with the least absolute shrinkage and selection operator (LASSO) regression, and integrated clinical expertise to identify key predictors of treatment efficacy and to construct the LASSO-Cox model. We developed 4 models (LASSO-Cox, random survival forest [RSF], extreme gradient boosting, and survival support vector machine) and evaluated their performance using the C-index, area under the curve (AUC), calibration curves, and decision curve analysis. The optimal model was interpreted using Shapley additive explanations, and its risk scores were used to stratify patients for Kaplan-Meier survival analysis.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Among the 4 prognostic models developed in this study, the RSF model demonstrated superior predictive accuracy and discrimination for progression-free survival, as evidenced by its higher AUC, concordance index, continuous AUC curves, and calibration curves compared with the other 3 models. Additionally, decision curve analysis showed that the RSF model offered greater net clinical benefit. The Shapley additive explanations results identified that age, histological subtype, the proportion of CD19&lt;sup&gt;+&lt;/sup&gt; B cells, CD16&lt;sup&gt;+&lt;/sup&gt;CD56&lt;sup&gt;+&lt;/sup&gt; natural killer cells, and the presence of liver metastasis were key prognostic factors influencing patient outcomes. Patients in the low-risk group, as determined by the RSF model's risk score, exhibited a significantly higher progression-free survival rate than those in the high-risk group, further validating the value of the RSF model for risk stratification.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;This study is the first to use machine learning algorithms to develop a predi","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e82533"},"PeriodicalIF":3.8,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12770927/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145812326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Copy Tools in the Electronic Health Record: Perceptions, Implications, and Future Directions. 电子健康记录中的复制工具:感知、含义和未来方向。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-19 DOI: 10.2196/78502
Sky Corby, Joan S Ash, Rebecca M Jungbauer, Gretchen Scholl, Sarah Florig, Vishnu Mohan, Jeffrey A Gold

Background: Electronic health records (EHRs) can aid in provider efficiency, but may also lead to unintended consequences, such as documentation burden and increased length of notes. To combat issues related to documentation, copying and pasting (CP) and copying or carrying forward (CF) are tools that have been used to aid in documentation burden. Multiple studies have identified the benefits and challenges of using these tools; however, few studies have identified the unintended consequences of CP and CF, and how the adoption of these tools may affect users.

Objective: The objective was to describe providers' perceptions and use of copying tools available in the EHR and describe their suggestions for improvement on these copying tools.

Methods: Research team members conducted semistructured interviews with faculty members, advanced practice providers, residents or fellow trainees, and medical students at a single academic health sciences center. The Diffusion of Innovations Theory of Unintended Consequences guided the analysis and interpretation of interview results.

Results: A total of 22 semistructured interviews were conducted in 2023 and analyzed during 2024. The findings showed that respondents use and value these tools for efficiency and communication purposes. The negative unintended consequences include inaccuracies and errors in documentation and increased patient safety risks. Some respondents experience inner angst or moral injury related to using CP/CF, but they feel that they must use them to satisfy organizational requirements surrounding documentation. The respondents suggested that artificial intelligence will likely help improve documentation tools, as would further training around these types of documentation tools.

Conclusions: Some respondents noted feeling both internal and external pressures that influenced when and how they use CP/CF. Respondents noted that they value EHR copying tools for efficiency purposes, but they also understand the risks involved. This tension may lead to moral angst or moral injury. They offered numerous suggestions for lowering the risk, especially by improving the documentation capabilities of the EHR through artificial intelligence. Future research should investigate both technical and educational solutions to relieve the documentation burden and moral angst they are experiencing.

背景:电子健康记录(EHRs)有助于提高提供者的效率,但也可能导致意想不到的后果,例如文件负担和说明的长度增加。为了解决与文档相关的问题,复制和粘贴(CP)和复制或转移(CF)是用来帮助减轻文档负担的工具。多项研究已经确定了使用这些工具的好处和挑战;然而,很少有研究确定了CP和CF的意外后果,以及采用这些工具可能对用户产生的影响。目的:目的是描述提供者对电子病历中可用的复制工具的看法和使用情况,并描述他们对这些复制工具的改进建议。方法:研究团队成员对单个学术健康科学中心的教职员工、高级执业医师、住院医师或实习生以及医学生进行了半结构化访谈。意外后果的创新扩散理论指导了访谈结果的分析和解释。结果:2023年共进行了22次半结构化访谈,并对2024年进行了分析。调查结果表明,受访者使用和重视这些工具的效率和沟通的目的。负面的意想不到的后果包括文件的不准确和错误以及增加的患者安全风险。一些受访者经历了与使用CP/CF相关的内心焦虑或道德伤害,但他们觉得必须使用它们来满足组织对文档的要求。受访者表示,人工智能可能有助于改进文档工具,围绕这些类型的文档工具进行进一步培训也是如此。结论:一些受访者指出,内部和外部压力影响了他们何时以及如何使用CP/CF。受访者指出,出于效率的考虑,他们重视电子病历复制工具,但他们也了解其中的风险。这种紧张关系可能导致道德焦虑或道德伤害。他们提出了许多降低风险的建议,特别是通过人工智能提高电子病历的记录能力。未来的研究应该调查技术和教育的解决方案,以减轻他们正在经历的文件负担和道德焦虑。
{"title":"Copy Tools in the Electronic Health Record: Perceptions, Implications, and Future Directions.","authors":"Sky Corby, Joan S Ash, Rebecca M Jungbauer, Gretchen Scholl, Sarah Florig, Vishnu Mohan, Jeffrey A Gold","doi":"10.2196/78502","DOIUrl":"10.2196/78502","url":null,"abstract":"<p><strong>Background: </strong>Electronic health records (EHRs) can aid in provider efficiency, but may also lead to unintended consequences, such as documentation burden and increased length of notes. To combat issues related to documentation, copying and pasting (CP) and copying or carrying forward (CF) are tools that have been used to aid in documentation burden. Multiple studies have identified the benefits and challenges of using these tools; however, few studies have identified the unintended consequences of CP and CF, and how the adoption of these tools may affect users.</p><p><strong>Objective: </strong>The objective was to describe providers' perceptions and use of copying tools available in the EHR and describe their suggestions for improvement on these copying tools.</p><p><strong>Methods: </strong>Research team members conducted semistructured interviews with faculty members, advanced practice providers, residents or fellow trainees, and medical students at a single academic health sciences center. The Diffusion of Innovations Theory of Unintended Consequences guided the analysis and interpretation of interview results.</p><p><strong>Results: </strong>A total of 22 semistructured interviews were conducted in 2023 and analyzed during 2024. The findings showed that respondents use and value these tools for efficiency and communication purposes. The negative unintended consequences include inaccuracies and errors in documentation and increased patient safety risks. Some respondents experience inner angst or moral injury related to using CP/CF, but they feel that they must use them to satisfy organizational requirements surrounding documentation. The respondents suggested that artificial intelligence will likely help improve documentation tools, as would further training around these types of documentation tools.</p><p><strong>Conclusions: </strong>Some respondents noted feeling both internal and external pressures that influenced when and how they use CP/CF. Respondents noted that they value EHR copying tools for efficiency purposes, but they also understand the risks involved. This tension may lead to moral angst or moral injury. They offered numerous suggestions for lowering the risk, especially by improving the documentation capabilities of the EHR through artificial intelligence. Future research should investigate both technical and educational solutions to relieve the documentation burden and moral angst they are experiencing.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e78502"},"PeriodicalIF":3.8,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12759298/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145795363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Machine Learning Approach to Predicting Mortality Risk in Chemotherapy-Treated Lung Cancer: Machine Learning Model Development and Validation. 预测化疗肺癌死亡风险的机器学习方法:机器学习模型的开发和验证。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-18 DOI: 10.2196/72424
Jianjun Zou, Jinyi Huang, Katie Lu, Ao Lin, Chen Xie, Jinrong Zhang, Boqi Rao, Zhi Li, Dongming Xie, Ling Lu, Feng Luo, Jinbin Chen, Lei Yang, Fuman Qiu, Xin Zhang, Yibin Deng, Jiachun Lu

Background: Accurately predicting the survival outcomes of patients with lung cancer receiving chemotherapy remains challenging.

Objective: To improve clinical management of this population, this study developed a multivariate machine learning (ML) model to assess all-cause mortality risk in chemotherapy-treated patients with lung cancer.

Methods: This study retrospectively recruited 1278 postchemotherapy patients with lung cancer from Guangzhou Chest Hospital between 2017 and 2019. Candidate features such as demographic characteristics, environmental exposures, clinical information, and patient-reported symptoms were collected via questionnaires and the electronic medical record system. The survival status and the deceased date were investigated twice a year. A total of 84 predictive models were constructed on the training set using 5 ML algorithms either individually or in pairwise combinations. The concordance index was used to identify the optimal model on the testing set, with performance validated via receiver operating characteristic curves, calibration curves, and decision curve analysis. Additionally, Shapley Additive Explanations and restricted cubic splines were applied for feature attribution analysis.

Results: The optimal model ultimately retained 21 prognosis-association features, including age, sex, BMI, smoking status, environmental smoke, the MD Anderson Symptom Inventory for Lung Cancer total score trajectories, cluster of differentiation 56, TNM stage, histology, and prechemotherapy blood biomarkers. On the testing set, the model acquired a concordance index of 0.702 (95% CI 0.652-0.753). The decision curves demonstrated positive clinical benefit when the risk thresholds were 0.40-0.69, 0.62-0.99, and 0.72-0.99 for 1-, 3-, and 5-year mortality predictions, respectively. The calibration curves showed that the predicted mortality probabilities fluctuated around the observed probabilities, and the Brier scores for 1-, 3-, and 5-year predictions were 0.20, 0.18, and 0.11, respectively. The area under the curve of the model was 0.740, 0.777, and 0.915 for 1-, 3-, and 5-year mortality predictions, respectively. Interpretability feature attribution analysis revealed that the significant features could predict all-cause mortality risk in chemotherapy-treated patients with lung cancer.

Conclusions: Our ML models exhibited acceptable discrimination, calibration, and clinical benefit in predicting the mortality risk of chemotherapy-treated patients with lung cancer, which could help clinicians in personalized prognostic management.

背景:准确预测接受化疗的肺癌患者的生存结果仍然具有挑战性。目的:为了改善这一人群的临床管理,本研究建立了一个多变量机器学习(ML)模型来评估化疗肺癌患者的全因死亡率风险。方法:本研究回顾性招募2017 - 2019年广州胸科医院肺癌化疗后患者1278例。候选特征,如人口统计学特征、环境暴露、临床信息和患者报告的症状,通过问卷调查和电子病历系统收集。每年调查两次患者的生存状况和死亡日期。在训练集上,使用5种ML算法单独或成对组合构建了84个预测模型。使用一致性指数在测试集上识别最优模型,并通过受试者工作特征曲线、校准曲线和决策曲线分析验证其性能。此外,应用Shapley加性解释和受限三次样条进行特征归因分析。结果:最佳模型最终保留了21个预后相关特征,包括年龄、性别、BMI、吸烟状况、环境烟雾、MD安德森肺癌症状量表总分轨迹、分化聚类56、TNM分期、组织学和化疗前血液生物标志物。在检验集上,模型的一致性指数为0.702 (95% CI 0.652-0.753)。当1年、3年和5年死亡率预测的风险阈值分别为0.40-0.69、0.62-0.99和0.72-0.99时,决策曲线显示出积极的临床获益。校正曲线显示,预测的死亡率概率在观测概率周围波动,1年、3年和5年预测的Brier评分分别为0.20、0.18和0.11。1年、3年和5年死亡率预测的曲线下面积分别为0.740、0.777和0.915。可解释性特征归因分析显示,显著性特征可以预测化疗肺癌患者的全因死亡风险。结论:我们的ML模型在预测化疗肺癌患者死亡风险方面表现出可接受的区分、校准和临床益处,可以帮助临床医生进行个性化预后管理。
{"title":"A Machine Learning Approach to Predicting Mortality Risk in Chemotherapy-Treated Lung Cancer: Machine Learning Model Development and Validation.","authors":"Jianjun Zou, Jinyi Huang, Katie Lu, Ao Lin, Chen Xie, Jinrong Zhang, Boqi Rao, Zhi Li, Dongming Xie, Ling Lu, Feng Luo, Jinbin Chen, Lei Yang, Fuman Qiu, Xin Zhang, Yibin Deng, Jiachun Lu","doi":"10.2196/72424","DOIUrl":"10.2196/72424","url":null,"abstract":"<p><strong>Background: </strong>Accurately predicting the survival outcomes of patients with lung cancer receiving chemotherapy remains challenging.</p><p><strong>Objective: </strong>To improve clinical management of this population, this study developed a multivariate machine learning (ML) model to assess all-cause mortality risk in chemotherapy-treated patients with lung cancer.</p><p><strong>Methods: </strong>This study retrospectively recruited 1278 postchemotherapy patients with lung cancer from Guangzhou Chest Hospital between 2017 and 2019. Candidate features such as demographic characteristics, environmental exposures, clinical information, and patient-reported symptoms were collected via questionnaires and the electronic medical record system. The survival status and the deceased date were investigated twice a year. A total of 84 predictive models were constructed on the training set using 5 ML algorithms either individually or in pairwise combinations. The concordance index was used to identify the optimal model on the testing set, with performance validated via receiver operating characteristic curves, calibration curves, and decision curve analysis. Additionally, Shapley Additive Explanations and restricted cubic splines were applied for feature attribution analysis.</p><p><strong>Results: </strong>The optimal model ultimately retained 21 prognosis-association features, including age, sex, BMI, smoking status, environmental smoke, the MD Anderson Symptom Inventory for Lung Cancer total score trajectories, cluster of differentiation 56, TNM stage, histology, and prechemotherapy blood biomarkers. On the testing set, the model acquired a concordance index of 0.702 (95% CI 0.652-0.753). The decision curves demonstrated positive clinical benefit when the risk thresholds were 0.40-0.69, 0.62-0.99, and 0.72-0.99 for 1-, 3-, and 5-year mortality predictions, respectively. The calibration curves showed that the predicted mortality probabilities fluctuated around the observed probabilities, and the Brier scores for 1-, 3-, and 5-year predictions were 0.20, 0.18, and 0.11, respectively. The area under the curve of the model was 0.740, 0.777, and 0.915 for 1-, 3-, and 5-year mortality predictions, respectively. Interpretability feature attribution analysis revealed that the significant features could predict all-cause mortality risk in chemotherapy-treated patients with lung cancer.</p><p><strong>Conclusions: </strong>Our ML models exhibited acceptable discrimination, calibration, and clinical benefit in predicting the mortality risk of chemotherapy-treated patients with lung cancer, which could help clinicians in personalized prognostic management.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e72424"},"PeriodicalIF":3.8,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12757710/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145776708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large Separable Kernel Attention-Driven Multidimensional Feature Cross-Level Fusion Classification Network of Knee Cartilage Injury: Algorithm Development and Validation. 膝关节软骨损伤的大可分离核注意驱动多维特征交叉融合分类网络:算法开发与验证。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-17 DOI: 10.2196/79748
Lirong Zhang, Hang Yu, Yating Yang

Background: Knee cartilage injury (KCI) poses significant challenges in the early clinical diagnosis process, primarily due to its high incidence, the complexity of healing, and the limited sensitivity of initial imaging modalities.

Objective: This study aims to employ magnetic resonance imaging and machine learning methods to enhance the classification accuracy of the classifier for KCI, improve the existing network structure, and demonstrate important clinical application value.

Methods: The proposed methodology is a multidimensional feature cross-level fusion classification network driven by the large separable kernel attention, which enables high-precision hierarchical diagnosis of KCI through deep learning. The network first fuses shallow high-resolution features with deep semantic features via the cross-level fusion module. Then, the large separable kernel attention module is embedded in the YOLOv8 network. This network utilizes the combined optimization of depth-separable and point-by-point convolutions to enhance features at multiple scales, thereby dramatically improving the hierarchical characterization of cartilage damage. Finally, five classifications of knee cartilage injuries are performed by classifiers.

Results: To overcome the limitations of network models trained with single-plane images, this study presents the first hospital-based multidimensional magnetic resonance imaging real dataset for KCI, on which the classification accuracy is 99.7%, the Kappa statistic is 99.6%, the F-measure is 99.7%, the sensitivity is 99.7%, and the specificity is 99.9%. The experimental results validate the feasibility of the proposed method.

Conclusions: The experimental outcomes confirm that the proposed methodology not only achieves exceptional performance in classifying knee cartilage injuries but also offers substantial improvements over existing techniques. This underscores its potential for clinical deployment in enhancing diagnostic precision and efficiency.

背景:膝关节软骨损伤(KCI)在早期临床诊断过程中提出了重大挑战,主要是由于其发病率高,愈合复杂,以及初始成像方式的灵敏度有限。目的:本研究旨在利用磁共振成像和机器学习方法提高KCI分类器的分类准确率,改进现有网络结构,展现重要的临床应用价值。方法:提出的方法是由大可分离核注意驱动的多维特征跨层次融合分类网络,通过深度学习实现KCI的高精度分层诊断。该网络首先通过跨层融合模块融合浅层高分辨率特征和深层语义特征。然后,在YOLOv8网络中嵌入大型可分离核关注模块。该网络利用深度可分卷积和逐点卷积的组合优化,在多个尺度上增强特征,从而显著提高软骨损伤的分层表征。最后,用分类器对膝关节软骨损伤进行五种分类。结果:为克服单平面图像训练网络模型的局限性,本文建立了首个基于医院的多维磁共振成像KCI真实数据集,分类准确率为99.7%,Kappa统计量为99.6%,F-measure为99.7%,灵敏度为99.7%,特异性为99.9%。实验结果验证了该方法的可行性。结论:实验结果证实,所提出的方法不仅在分类膝关节软骨损伤方面取得了优异的成绩,而且比现有技术有了实质性的改进。这强调了其在提高诊断精度和效率方面的临床应用潜力。
{"title":"Large Separable Kernel Attention-Driven Multidimensional Feature Cross-Level Fusion Classification Network of Knee Cartilage Injury: Algorithm Development and Validation.","authors":"Lirong Zhang, Hang Yu, Yating Yang","doi":"10.2196/79748","DOIUrl":"10.2196/79748","url":null,"abstract":"<p><strong>Background: </strong>Knee cartilage injury (KCI) poses significant challenges in the early clinical diagnosis process, primarily due to its high incidence, the complexity of healing, and the limited sensitivity of initial imaging modalities.</p><p><strong>Objective: </strong>This study aims to employ magnetic resonance imaging and machine learning methods to enhance the classification accuracy of the classifier for KCI, improve the existing network structure, and demonstrate important clinical application value.</p><p><strong>Methods: </strong>The proposed methodology is a multidimensional feature cross-level fusion classification network driven by the large separable kernel attention, which enables high-precision hierarchical diagnosis of KCI through deep learning. The network first fuses shallow high-resolution features with deep semantic features via the cross-level fusion module. Then, the large separable kernel attention module is embedded in the YOLOv8 network. This network utilizes the combined optimization of depth-separable and point-by-point convolutions to enhance features at multiple scales, thereby dramatically improving the hierarchical characterization of cartilage damage. Finally, five classifications of knee cartilage injuries are performed by classifiers.</p><p><strong>Results: </strong>To overcome the limitations of network models trained with single-plane images, this study presents the first hospital-based multidimensional magnetic resonance imaging real dataset for KCI, on which the classification accuracy is 99.7%, the Kappa statistic is 99.6%, the F-measure is 99.7%, the sensitivity is 99.7%, and the specificity is 99.9%. The experimental results validate the feasibility of the proposed method.</p><p><strong>Conclusions: </strong>The experimental outcomes confirm that the proposed methodology not only achieves exceptional performance in classifying knee cartilage injuries but also offers substantial improvements over existing techniques. This underscores its potential for clinical deployment in enhancing diagnostic precision and efficiency.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e79748"},"PeriodicalIF":3.8,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12711134/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145776752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incorporating Generative AI Into a Health Informatics Curriculum to Build 21st Century Competencies: Multisite Pre-Post Study. 将生成式人工智能纳入健康信息学课程以建立21世纪的能力:多站点前后研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-16 DOI: 10.2196/76507
Freddie Seba, Miriam Isola, Laura Mills, Mohan Zalake, Jacob Krive

Background: We designed learning assignments for students to develop knowledge, skills, and professional attitudes about generative artificial intelligence (AI) in 2 different Master's level courses in health informatics. Our innovative approach assumed that the students had no technical background or experience in using generative AI tools.

Objective: This study aims to offer generalizable methods and experiences on integration and assessment of generative AI content into the higher education's health informatics curricula. The study's central driver is the preparation of graduate students with generative AI tools, skills, ethical discernment, and critical thinking capacities aligned with the rapidly shifting job-market requirements, independent of graduate students' backgrounds and technical expertise.

Methods: During the semester, students completed a pretest and posttest to assess knowledge about generative AI. Reflections explored their expectations and experiences using generative AI to complete their assignments and projects during the semester. Strong emphasis was placed on building skills and professional attitudes by using generative AI. Student engagement in behavioral, emotional, and cognitive domains was explored via detailed analysis of student reflections by faculty.

Results: Students at the University of Illinois Chicago increased their knowledge about generative AI from 81% to 93% through research of the basic generative AI concepts, as evidenced from outcomes of the open-book pre-and posttests given at the beginning and end of the capstone course. University of San Francisco students also improved from 77% to 80% by the end of the semester. Faculty analysis of student reflections upon completion of the course revealed primary interests in the essentials of generative AI, AI transformations to information and knowledge, and organizational changes influenced by AI adoption in the health care organizations, with ethics being a primary driver of students' interests and engagement.

Conclusions: Data from student reflections provided insight into generative AI skills that students developed and that health informatics programs can consider incorporating into their curricula. Building competencies in generative AI will prepare students for the 21st century workforce and enable them to build skills employers are seeking in the new digital health environment.

背景:我们设计了学习作业,让学生在两个不同的健康信息学硕士课程中发展关于生成式人工智能(AI)的知识、技能和专业态度。我们的创新方法假设学生没有使用生成式人工智能工具的技术背景或经验。目的:本研究旨在为高等教育健康信息学课程整合与评估生成式人工智能内容提供可推广的方法和经验。该研究的核心驱动力是培养具有生成式人工智能工具、技能、道德洞察力和批判性思维能力的研究生,使其与快速变化的就业市场需求保持一致,独立于研究生的背景和技术专长。方法:在学期中,学生通过前测和后测来评估生成式人工智能的知识。反思探讨了他们在学期中使用生成式人工智能完成作业和项目的期望和经验。重点是通过使用生成式人工智能来培养技能和专业态度。通过教师对学生反映的详细分析,探讨了学生在行为、情感和认知领域的参与情况。结果:通过对生成式人工智能基本概念的研究,伊利诺伊大学芝加哥分校的学生对生成式人工智能的了解从81%增加到93%,这一点从顶点课程开始和结束时进行的开卷前测和后测的结果可以看出。到学期末,旧金山大学(University of San Francisco)的学生也从77%提高到了80%。教师对学生完成课程后反思的分析显示,他们对生成式人工智能的基本要素、人工智能向信息和知识的转化以及卫生保健组织中人工智能采用所影响的组织变革感兴趣,而道德是学生兴趣和参与的主要驱动力。结论:来自学生反思的数据提供了对学生发展的生成式人工智能技能的见解,健康信息学专业可以考虑将其纳入课程。在生成式人工智能方面培养能力将使学生为21世纪的劳动力做好准备,并使他们能够掌握雇主在新的数字健康环境中所寻求的技能。
{"title":"Incorporating Generative AI Into a Health Informatics Curriculum to Build 21st Century Competencies: Multisite Pre-Post Study.","authors":"Freddie Seba, Miriam Isola, Laura Mills, Mohan Zalake, Jacob Krive","doi":"10.2196/76507","DOIUrl":"10.2196/76507","url":null,"abstract":"<p><strong>Background: </strong>We designed learning assignments for students to develop knowledge, skills, and professional attitudes about generative artificial intelligence (AI) in 2 different Master's level courses in health informatics. Our innovative approach assumed that the students had no technical background or experience in using generative AI tools.</p><p><strong>Objective: </strong>This study aims to offer generalizable methods and experiences on integration and assessment of generative AI content into the higher education's health informatics curricula. The study's central driver is the preparation of graduate students with generative AI tools, skills, ethical discernment, and critical thinking capacities aligned with the rapidly shifting job-market requirements, independent of graduate students' backgrounds and technical expertise.</p><p><strong>Methods: </strong>During the semester, students completed a pretest and posttest to assess knowledge about generative AI. Reflections explored their expectations and experiences using generative AI to complete their assignments and projects during the semester. Strong emphasis was placed on building skills and professional attitudes by using generative AI. Student engagement in behavioral, emotional, and cognitive domains was explored via detailed analysis of student reflections by faculty.</p><p><strong>Results: </strong>Students at the University of Illinois Chicago increased their knowledge about generative AI from 81% to 93% through research of the basic generative AI concepts, as evidenced from outcomes of the open-book pre-and posttests given at the beginning and end of the capstone course. University of San Francisco students also improved from 77% to 80% by the end of the semester. Faculty analysis of student reflections upon completion of the course revealed primary interests in the essentials of generative AI, AI transformations to information and knowledge, and organizational changes influenced by AI adoption in the health care organizations, with ethics being a primary driver of students' interests and engagement.</p><p><strong>Conclusions: </strong>Data from student reflections provided insight into generative AI skills that students developed and that health informatics programs can consider incorporating into their curricula. Building competencies in generative AI will prepare students for the 21st century workforce and enable them to build skills employers are seeking in the new digital health environment.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e76507"},"PeriodicalIF":3.8,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12707436/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145769877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting Metabolic Dysfunction-Associated Fatty Liver Disease Phenotypes Among Adults: 2-Stage Contrastive Learning Method. 预测成人代谢功能障碍相关脂肪性肝病表型:两阶段对比学习方法
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-12-12 DOI: 10.2196/75747
Sizhe Jasmine Chen, Da Xu, Derek K Hu, Paul Jen-Hwa Hu, Ting-Shuo Huang

Background: Metabolic dysfunction-associated fatty liver disease (MAFLD) is a leading cause of chronic disease and can progress to liver fibrosis or hepatocellular carcinoma. Its subtypes-obese, diabetic, and lean-are associated with varying degrees of fibrotic burden and different complications, yet the existing analytics methods often overlook its multisystem nature, intraphenotype variability, and disease dynamics. These limitations hinder accurate risk stratification and restrict personalized intervention planning.

Objective: This study developed a novel, 2-stage, contrastive learning-based method to predict the phenotype of MAFLD among adults. This method leverages multiview contrastive learning; it models individual heterogeneities and important relationships in clinical and survey-based data to predict phenotypes among adults, thus supporting clinical decision-making and personalized care.

Methods: Demographic, clinical, lifestyle, and genetic family history data of 4408 adults revealed how capturing essential relationships in patient data from different sources can transform individual-level representations into multiple, complementary views. Evaluation of the predictive efficacy of the proposed method in comparison with 8 prevalent methods relied on recall, precision, F1-score, and area under the curve values. Moreover, a Shapley additive explanation analysis was performed for interpretability.

Results: The proposed method consistently and significantly outperformed all benchmark methods. It attained the highest F1-score, showing a 32.8% improvement for nondiabetic MAFLD (0.531 vs 0.400) and 30.4% improvement for diabetic MAFLD (0.519 vs 0.398) over the respective best-performing benchmark. The results underscore the clinical value and utility of integrating clinical and survey-based data in the prediction of MAFLD phenotypes among adults.

Conclusions: The proposed method is a viable approach for MAFLD phenotype prediction. It is more effective in identifying at-risk adults than many prevalent data-driven analytics methods and thereby can enhance clinical decision-making and support patient-centric care and management.

背景:代谢功能障碍相关脂肪性肝病(MAFLD)是慢性疾病的主要原因,可发展为肝纤维化或肝细胞癌。其亚型(肥胖、糖尿病和消瘦)与不同程度的纤维化负担和不同的并发症相关,但现有的分析方法往往忽略了其多系统性质、表型内变异性和疾病动力学。这些限制阻碍了准确的风险分层,并限制了个性化的干预计划。目的:本研究开发了一种新的、两阶段的、基于对比学习的方法来预测成人MAFLD的表型。这种方法利用了多视角对比学习;它模拟了临床和基于调查的数据中的个体异质性和重要关系,以预测成人的表型,从而支持临床决策和个性化护理。方法:4408名成年人的人口统计、临床、生活方式和遗传家族史数据揭示了如何从不同来源的患者数据中捕捉基本关系,将个人层面的表征转化为多个互补的观点。与8种流行方法相比,该方法的预测效果评估依赖于召回率、精度、f1评分和曲线下面积值。此外,对可解释性进行了Shapley加性解释分析。结果:所提方法一致且显著优于所有基准方法。它获得了最高的f1评分,与各自的最佳表现基准相比,非糖尿病性MAFLD改善了32.8% (0.531 vs 0.400),糖尿病性MAFLD改善了30.4% (0.519 vs 0.398)。研究结果强调了综合临床和基于调查的数据预测成人MAFLD表型的临床价值和效用。结论:该方法是一种预测MAFLD表型的可行方法。它比许多流行的数据驱动分析方法更有效地识别有风险的成年人,从而可以增强临床决策并支持以患者为中心的护理和管理。
{"title":"Predicting Metabolic Dysfunction-Associated Fatty Liver Disease Phenotypes Among Adults: 2-Stage Contrastive Learning Method.","authors":"Sizhe Jasmine Chen, Da Xu, Derek K Hu, Paul Jen-Hwa Hu, Ting-Shuo Huang","doi":"10.2196/75747","DOIUrl":"10.2196/75747","url":null,"abstract":"<p><strong>Background: </strong>Metabolic dysfunction-associated fatty liver disease (MAFLD) is a leading cause of chronic disease and can progress to liver fibrosis or hepatocellular carcinoma. Its subtypes-obese, diabetic, and lean-are associated with varying degrees of fibrotic burden and different complications, yet the existing analytics methods often overlook its multisystem nature, intraphenotype variability, and disease dynamics. These limitations hinder accurate risk stratification and restrict personalized intervention planning.</p><p><strong>Objective: </strong>This study developed a novel, 2-stage, contrastive learning-based method to predict the phenotype of MAFLD among adults. This method leverages multiview contrastive learning; it models individual heterogeneities and important relationships in clinical and survey-based data to predict phenotypes among adults, thus supporting clinical decision-making and personalized care.</p><p><strong>Methods: </strong>Demographic, clinical, lifestyle, and genetic family history data of 4408 adults revealed how capturing essential relationships in patient data from different sources can transform individual-level representations into multiple, complementary views. Evaluation of the predictive efficacy of the proposed method in comparison with 8 prevalent methods relied on recall, precision, F1-score, and area under the curve values. Moreover, a Shapley additive explanation analysis was performed for interpretability.</p><p><strong>Results: </strong>The proposed method consistently and significantly outperformed all benchmark methods. It attained the highest F1-score, showing a 32.8% improvement for nondiabetic MAFLD (0.531 vs 0.400) and 30.4% improvement for diabetic MAFLD (0.519 vs 0.398) over the respective best-performing benchmark. The results underscore the clinical value and utility of integrating clinical and survey-based data in the prediction of MAFLD phenotypes among adults.</p><p><strong>Conclusions: </strong>The proposed method is a viable approach for MAFLD phenotype prediction. It is more effective in identifying at-risk adults than many prevalent data-driven analytics methods and thereby can enhance clinical decision-making and support patient-centric care and management.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e75747"},"PeriodicalIF":3.8,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12702840/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145758554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1