首页 > 最新文献

NPJ Digital Medicine最新文献

英文 中文
AI driven decision support reduces antibiotic mismatches and inappropriate use in outpatient urinary tract infections
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-01-27 DOI: 10.1038/s41746-024-01400-5
Shirley Shapiro Ben David, Roni Romano, Daniella Rahamim-Cohen, Joseph Azuri, Shira Greenfeld, Ben Gedassi, Uri Lerner

Urinary tract infections (UTIs) often prompt empiric outpatient antibiotic prescriptions, risking mismatches. This study evaluates the impact of “UTI Smart-Set” (UTIS), an AI-driven decision-support tool, on prescribing patterns and mismatches in a large outpatient organization. UTIS integrates machine learning forecasts of antibiotic resistance, patient data, and guidelines into a user-friendly order set for UTI management. From 6/1/2021–8/31/2022, 171,010 UTI diagnoses were recorded, with UTIS used in 75,630 cases involving antibiotic prescriptions. Overall acceptance rate of UTIS recommendations was 66.0%. Among 19,287 cases with urine cultures, antibiotic mismatch rate was significantly lower when UTIS recommendations were followed (8.9% vs. 14.2%, p < 0.0001). Among women over 18, mismatch rate was 47.5% lower, and among women over 50, 55.6% lower (p < 0.001). Additionally, an overall reduction of 80.5% in ciprofloxacin usage (6.4% vs 32.9%, p < 0.0001) was observed. UTIS improved prescribing accuracy, reduced mismatches, and minimized quinolone use, highlighting AI’s potential for personalized infection management.

{"title":"AI driven decision support reduces antibiotic mismatches and inappropriate use in outpatient urinary tract infections","authors":"Shirley Shapiro Ben David, Roni Romano, Daniella Rahamim-Cohen, Joseph Azuri, Shira Greenfeld, Ben Gedassi, Uri Lerner","doi":"10.1038/s41746-024-01400-5","DOIUrl":"https://doi.org/10.1038/s41746-024-01400-5","url":null,"abstract":"<p>Urinary tract infections (UTIs) often prompt empiric outpatient antibiotic prescriptions, risking mismatches. This study evaluates the impact of “UTI Smart-Set” (UTIS), an AI-driven decision-support tool, on prescribing patterns and mismatches in a large outpatient organization. UTIS integrates machine learning forecasts of antibiotic resistance, patient data, and guidelines into a user-friendly order set for UTI management. From 6/1/2021–8/31/2022, 171,010 UTI diagnoses were recorded, with UTIS used in 75,630 cases involving antibiotic prescriptions. Overall acceptance rate of UTIS recommendations was 66.0%. Among 19,287 cases with urine cultures, antibiotic mismatch rate was significantly lower when UTIS recommendations were followed (8.9% vs. 14.2%, <i>p</i> &lt; 0.0001). Among women over 18, mismatch rate was 47.5% lower, and among women over 50, 55.6% lower (<i>p</i> &lt; 0.001). Additionally, an overall reduction of 80.5% in ciprofloxacin usage (6.4% vs 32.9%, <i>p</i> &lt; 0.0001) was observed. UTIS improved prescribing accuracy, reduced mismatches, and minimized quinolone use, highlighting AI’s potential for personalized infection management.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"22 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143044108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extensive benchmarking of a method that estimates external model performance from limited statistical characteristics
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-01-27 DOI: 10.1038/s41746-024-01414-z
Tal El-Hay, Jenna M. Reps, Chen Yanover

Predictive model performance may deteriorate when applied to data sources that were not used for training, thus, external validation is a key step in successful model deployment. As access to patient-level external data sources is typically limited, we recently proposed a method that estimates external model performance using only external summary statistics. Here, we benchmark the proposed method on multiple tasks using five large heterogeneous US data sources, where each, in turn, plays the role of an internal source and the remaining—external. Results showed accurate estimations for all metrics: 95th error percentiles for the area under the receiver operating characteristics (discrimination), calibration-in-the-large (calibration), Brier and scaled Brier scores (overall accuracy) of 0.03, 0.08, 0.0002, and 0.07, respectively. These results demonstrate the feasibility of estimating the transportability of prediction models using an internal cohort and external statistics. It may become an important accelerator of model deployment.

{"title":"Extensive benchmarking of a method that estimates external model performance from limited statistical characteristics","authors":"Tal El-Hay, Jenna M. Reps, Chen Yanover","doi":"10.1038/s41746-024-01414-z","DOIUrl":"https://doi.org/10.1038/s41746-024-01414-z","url":null,"abstract":"<p>Predictive model performance may deteriorate when applied to data sources that were not used for training, thus, external validation is a key step in successful model deployment. As access to patient-level external data sources is typically limited, we recently proposed a method that estimates external model performance using only external summary statistics. Here, we benchmark the proposed method on multiple tasks using five large heterogeneous US data sources, where each, in turn, plays the role of an internal source and the remaining—external. Results showed accurate estimations for all metrics: 95th error percentiles for the area under the receiver operating characteristics (discrimination), calibration-in-the-large (calibration), Brier and scaled Brier scores (overall accuracy) of 0.03, 0.08, 0.0002, and 0.07, respectively. These results demonstrate the feasibility of estimating the transportability of prediction models using an internal cohort and external statistics. It may become an important accelerator of model deployment.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"20 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143044106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A scoping review of privacy and utility metrics in medical synthetic data
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-01-27 DOI: 10.1038/s41746-024-01359-3
Bayrem Kaabachi, Jérémie Despraz, Thierry Meurers, Karen Otte, Mehmed Halilovic, Bogdan Kulynych, Fabian Prasser, Jean Louis Raisaro

The use of synthetic data is a promising solution to facilitate the sharing and reuse of health-related data beyond its initial collection while addressing privacy concerns. However, there is still no consensus on a standardized approach for systematically evaluating the privacy and utility of synthetic data, impeding its broader adoption. In this work, we present a comprehensive review and systematization of current methods for evaluating synthetic health-related data, focusing on both privacy and utility aspects. Our findings suggest that there are a variety of methods for assessing the utility of synthetic data, but no consensus on which method is optimal in which scenario. Moreover, we found that most studies included in this review do not evaluate the privacy protection provided by synthetic data, and those that do often significantly underestimate the risks.

{"title":"A scoping review of privacy and utility metrics in medical synthetic data","authors":"Bayrem Kaabachi, Jérémie Despraz, Thierry Meurers, Karen Otte, Mehmed Halilovic, Bogdan Kulynych, Fabian Prasser, Jean Louis Raisaro","doi":"10.1038/s41746-024-01359-3","DOIUrl":"https://doi.org/10.1038/s41746-024-01359-3","url":null,"abstract":"<p>The use of synthetic data is a promising solution to facilitate the sharing and reuse of health-related data beyond its initial collection while addressing privacy concerns. However, there is still no consensus on a standardized approach for systematically evaluating the privacy and utility of synthetic data, impeding its broader adoption. In this work, we present a comprehensive review and systematization of current methods for evaluating synthetic health-related data, focusing on both privacy and utility aspects. Our findings suggest that there are a variety of methods for assessing the utility of synthetic data, but no consensus on which method is optimal in which scenario. Moreover, we found that most studies included in this review do not evaluate the privacy protection provided by synthetic data, and those that do often significantly underestimate the risks.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"113 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143044110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal machine learning enables AI chatbot to diagnose ophthalmic diseases and provide high-quality medical responses
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-01-27 DOI: 10.1038/s41746-025-01461-0
Ruiqi Ma, Qian Cheng, Jing Yao, Zhiyu Peng, Mingxu Yan, Jie Lu, Jingjing Liao, Lejin Tian, Wenjun Shu, Yunqiu Zhang, Jinghan Wang, Pengfei Jiang, Weiyi Xia, Xiaofeng Li, Lu Gan, Yue Zhao, Jiang Zhu, Bing Qin, Qin Jiang, Xiawei Wang, Xintong Lin, Haifeng Chen, Weifang Zhu, Dehui Xiang, Baoqing Nie, Jingtao Wang, Jie Guo, Kang Xue, Hongguang Cui, Jinwei Cheng, Xiangjia Zhu, Jiaxu Hong, Fei Shi, Rui Zhang, Xinjian Chen, Chen Zhao

Chatbot-based multimodal AI holds promise for collecting medical histories and diagnosing ophthalmic diseases using textual and imaging data. This study developed and evaluated the ChatGPT-powered Intelligent Ophthalmic Multimodal Interactive Diagnostic System (IOMIDS) to enable patient self-diagnosis and self-triage. IOMIDS included a text model and three multimodal models (text + slit-lamp, text + smartphone, text + slit-lamp + smartphone). The performance was evaluated through a two-stage cross-sectional study across three medical centers involving 10 subspecialties and 50 diseases. Using 15640 data entries, IOMIDS actively collected and analyzed medical history alongside slit-lamp and/or smartphone images. The text + smartphone model showed the highest diagnostic accuracy (internal: 79.6%, external: 81.1%), while other multimodal models underperformed or matched the text model (internal: 69.6%, external: 72.5%). Moreover, triage accuracy was consistent across models. Multimodal approaches enhanced response quality and reduced misinformation. This proof-of-concept study highlights the potential of chatbot-based multimodal AI for self-diagnosis and self-triage. (The clinical trial was registered on June 26, 2023, on ClinicalTrials.gov under the registration number NCT05930444.).

{"title":"Multimodal machine learning enables AI chatbot to diagnose ophthalmic diseases and provide high-quality medical responses","authors":"Ruiqi Ma, Qian Cheng, Jing Yao, Zhiyu Peng, Mingxu Yan, Jie Lu, Jingjing Liao, Lejin Tian, Wenjun Shu, Yunqiu Zhang, Jinghan Wang, Pengfei Jiang, Weiyi Xia, Xiaofeng Li, Lu Gan, Yue Zhao, Jiang Zhu, Bing Qin, Qin Jiang, Xiawei Wang, Xintong Lin, Haifeng Chen, Weifang Zhu, Dehui Xiang, Baoqing Nie, Jingtao Wang, Jie Guo, Kang Xue, Hongguang Cui, Jinwei Cheng, Xiangjia Zhu, Jiaxu Hong, Fei Shi, Rui Zhang, Xinjian Chen, Chen Zhao","doi":"10.1038/s41746-025-01461-0","DOIUrl":"https://doi.org/10.1038/s41746-025-01461-0","url":null,"abstract":"<p>Chatbot-based multimodal AI holds promise for collecting medical histories and diagnosing ophthalmic diseases using textual and imaging data. This study developed and evaluated the ChatGPT-powered Intelligent Ophthalmic Multimodal Interactive Diagnostic System (IOMIDS) to enable patient self-diagnosis and self-triage. IOMIDS included a text model and three multimodal models (text + slit-lamp, text + smartphone, text + slit-lamp + smartphone). The performance was evaluated through a two-stage cross-sectional study across three medical centers involving 10 subspecialties and 50 diseases. Using 15640 data entries, IOMIDS actively collected and analyzed medical history alongside slit-lamp and/or smartphone images. The text + smartphone model showed the highest diagnostic accuracy (internal: 79.6%, external: 81.1%), while other multimodal models underperformed or matched the text model (internal: 69.6%, external: 72.5%). Moreover, triage accuracy was consistent across models. Multimodal approaches enhanced response quality and reduced misinformation. This proof-of-concept study highlights the potential of chatbot-based multimodal AI for self-diagnosis and self-triage. (The clinical trial was registered on June 26, 2023, on ClinicalTrials.gov under the registration number NCT05930444.).</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"25 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143049985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards evaluating and building versatile large language models for medicine
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-01-27 DOI: 10.1038/s41746-024-01390-4
Chaoyi Wu, Pengcheng Qiu, Jinxin Liu, Hongfei Gu, Na Li, Ya Zhang, Yanfeng Wang, Weidi Xie

In this study, we present MedS-Bench, a comprehensive benchmark to evaluate large language models (LLMs) in clinical contexts, MedS-Bench, spanning 11 high-level clinical tasks. We evaluate nine leading LLMs, e.g., MEDITRON, Llama 3, Mistral, GPT-4, Claude-3.5, etc. and found that most models struggle with these complex tasks. To address these limitations, we developed MedS-Ins, a large-scale instruction-tuning dataset for medicine. MedS-Ins comprises 58 medically oriented language corpora, totaling 5M instances with 19K instructions, across 122 tasks. To demonstrate the dataset’s utility, we conducted a proof-of-concept experiment by performing instruction tuning on a lightweight, open-source medical language model. The resulting model, MMedIns-Llama 3, significantly outperformed existing models on various clinical tasks. To promote further advancements, we have made MedS-Ins fully accessible and invite the research community to contribute to its expansion. Additionally, we have launched a dynamic leaderboard for MedS-Bench, to track the development progress of medical LLMs.

{"title":"Towards evaluating and building versatile large language models for medicine","authors":"Chaoyi Wu, Pengcheng Qiu, Jinxin Liu, Hongfei Gu, Na Li, Ya Zhang, Yanfeng Wang, Weidi Xie","doi":"10.1038/s41746-024-01390-4","DOIUrl":"https://doi.org/10.1038/s41746-024-01390-4","url":null,"abstract":"<p>In this study, we present <b>MedS-Bench</b>, a comprehensive benchmark to evaluate large language models (LLMs) in clinical contexts, <b>MedS-Bench</b>, spanning 11 high-level clinical tasks. We evaluate nine leading LLMs, <i>e.g</i>., MEDITRON, Llama 3, Mistral, GPT-4, Claude-3.5, <i>etc</i>. and found that most models struggle with these complex tasks. To address these limitations, we developed <b>MedS-Ins</b>, a large-scale instruction-tuning dataset for medicine. <b>MedS-Ins</b> comprises 58 medically oriented language corpora, totaling 5M instances with 19K instructions, across 122 tasks. To demonstrate the dataset’s utility, we conducted a proof-of-concept experiment by performing instruction tuning on a lightweight, open-source medical language model. The resulting model, <b>MMedIns-Llama 3</b>, significantly outperformed existing models on various clinical tasks. To promote further advancements, we have made <b>MedS-Ins</b> fully accessible and invite the research community to contribute to its expansion. Additionally, we have launched a dynamic leaderboard for <b>MedS-Bench</b>, to track the development progress of medical LLMs.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"45 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143044104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A systematic review of portable technologies for the early assessment of motor development in infants
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-01-27 DOI: 10.1038/s41746-025-01450-3
Weiyang Deng, Megan K. O’Brien, Rachel A. Andersen, Richa Rai, Erin Jones, Arun Jayaraman

Early screening and evaluation of infant motor development are crucial for detecting motor deficits and enabling timely interventions. Traditional clinical assessments are often subjective, without fully capturing infants’ “real-world” behavior. This has sparked interest in portable, low-cost technologies to objectively and precisely measure infant motion at home, with a goal of enhancing ecological validity. In this systematic review, we explored the current landscape of portable, technology-based solutions to assess early motor development (within the first year), outlining the prevailing challenges and future directions. We reviewed 66 publications, which utilized video, sensors, or a combination of technologies. There were three key applications of these technologies: (1) automating clinical assessments, (2) illuminating new measures of motor development, and (3) predicting developmental outcomes. There was a promising trend toward earlier and more accurate detection using portable technologies. Additional research and demographic diversity are needed to develop fully automated, robust, and user-friendly tools. Registration & Protocol OSF Registries https://doi.org/10.17605/OSF.IO/R6JAE.

{"title":"A systematic review of portable technologies for the early assessment of motor development in infants","authors":"Weiyang Deng, Megan K. O’Brien, Rachel A. Andersen, Richa Rai, Erin Jones, Arun Jayaraman","doi":"10.1038/s41746-025-01450-3","DOIUrl":"https://doi.org/10.1038/s41746-025-01450-3","url":null,"abstract":"<p>Early screening and evaluation of infant motor development are crucial for detecting motor deficits and enabling timely interventions. Traditional clinical assessments are often subjective, without fully capturing infants’ “real-world” behavior. This has sparked interest in portable, low-cost technologies to objectively and precisely measure infant motion at home, with a goal of enhancing ecological validity. In this systematic review, we explored the current landscape of portable, technology-based solutions to assess early motor development (within the first year), outlining the prevailing challenges and future directions. We reviewed 66 publications, which utilized video, sensors, or a combination of technologies. There were three key applications of these technologies: (1) automating clinical assessments, (2) illuminating new measures of motor development, and (3) predicting developmental outcomes. There was a promising trend toward earlier and more accurate detection using portable technologies. Additional research and demographic diversity are needed to develop fully automated, robust, and user-friendly tools. Registration &amp; Protocol OSF Registries https://doi.org/10.17605/OSF.IO/R6JAE.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"54 41 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143044107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A systematic review of passive data for remote monitoring in psychosis and schizophrenia
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-01-27 DOI: 10.1038/s41746-025-01451-2
Siân Bladon, Emily Eisner, Sandra Bucci, Anuoluwapo Oluwatayo, Glen P. Martin, Matthew Sperrin, John Ainsworth, Sophie Faulkner

There is increasing use of digital tools to monitor people with psychosis and schizophrenia remotely, but using this type of data is challenging. This systematic review aimed to summarise how studies processed and analysed data collected through digital devices. In total, 203 articles collecting passive data through smartphones or wearable devices, from participants with psychosis or schizophrenia were included in the review. Accelerometers were the most common device (n = 115 studies), followed by smartphones (n = 46). The most commonly derived features were sleep duration (n = 50) and time spent sedentary (n = 41). Thirty studies assessed data quality and another 69 applied data quantity thresholds. Mixed effects models were used in 21 studies and time-series and machine-learning methods were used in 18 studies. Reporting of methods to process and analyse data was inconsistent, highlighting a need to improve the standardisation of methods and reporting in this area of research.

{"title":"A systematic review of passive data for remote monitoring in psychosis and schizophrenia","authors":"Siân Bladon, Emily Eisner, Sandra Bucci, Anuoluwapo Oluwatayo, Glen P. Martin, Matthew Sperrin, John Ainsworth, Sophie Faulkner","doi":"10.1038/s41746-025-01451-2","DOIUrl":"https://doi.org/10.1038/s41746-025-01451-2","url":null,"abstract":"<p>There is increasing use of digital tools to monitor people with psychosis and schizophrenia remotely, but using this type of data is challenging. This systematic review aimed to summarise how studies processed and analysed data collected through digital devices. In total, 203 articles collecting passive data through smartphones or wearable devices, from participants with psychosis or schizophrenia were included in the review. Accelerometers were the most common device (<i>n</i> = 115 studies), followed by smartphones (<i>n</i> = 46). The most commonly derived features were sleep duration (<i>n</i> = 50) and time spent sedentary (<i>n</i> = 41). Thirty studies assessed data quality and another 69 applied data quantity thresholds. Mixed effects models were used in 21 studies and time-series and machine-learning methods were used in 18 studies. Reporting of methods to process and analyse data was inconsistent, highlighting a need to improve the standardisation of methods and reporting in this area of research.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"58 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143044109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multicenter study of neurofibromatosis type 1 utilizing deep learning for whole body tumor identification
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-01-26 DOI: 10.1038/s41746-025-01454-z
Cheng-Jiang Wei, Yan Tang, Yang-Bai Sun, Tie-Long Yang, Cheng Yan, Hui Liu, Jun Liu, Jing-Ning Huang, Ming-Han Wang, Zhen-Wei Yao, Ji-Long Yang, Zhi-Chao Wang, Qing-Feng Li

Deep-learning models have shown promise in differentiating between benign and malignant lesions. Previous studies have primarily focused on specific anatomical regions, overlooking tumors occurring throughout the body with highly heterogeneous whole-body backgrounds. Using neurofibromatosis type 1 (NF1) as an example, this study developed highly accurate MRI-based deep-learning models for the early automated screening of malignant peripheral nerve sheath tumors (MPNSTs) against complex whole-body background. In a Chinese seven-center cohort, data from 347 subjects were analyzed. Our one-step model incorporated normal tissue/organ labels to provide contextual information, offering a solution for tumors with complex backgrounds. To address privacy concerns, we utilized a lightweight deep neural network suitable for hospital deployment. The final model achieved an accuracy of 85.71% for MPNST diagnosis in the validation cohort and 84.75% accuracy in the independent test set, outperforming another classic two-step model. This success suggests potential for AI models in screening other whole-body primary/metastatic tumors.

{"title":"A multicenter study of neurofibromatosis type 1 utilizing deep learning for whole body tumor identification","authors":"Cheng-Jiang Wei, Yan Tang, Yang-Bai Sun, Tie-Long Yang, Cheng Yan, Hui Liu, Jun Liu, Jing-Ning Huang, Ming-Han Wang, Zhen-Wei Yao, Ji-Long Yang, Zhi-Chao Wang, Qing-Feng Li","doi":"10.1038/s41746-025-01454-z","DOIUrl":"https://doi.org/10.1038/s41746-025-01454-z","url":null,"abstract":"<p>Deep-learning models have shown promise in differentiating between benign and malignant lesions. Previous studies have primarily focused on specific anatomical regions, overlooking tumors occurring throughout the body with highly heterogeneous whole-body backgrounds. Using neurofibromatosis type 1 (NF1) as an example, this study developed highly accurate MRI-based deep-learning models for the early automated screening of malignant peripheral nerve sheath tumors (MPNSTs) against complex whole-body background. In a Chinese seven-center cohort, data from 347 subjects were analyzed. Our one-step model incorporated normal tissue/organ labels to provide contextual information, offering a solution for tumors with complex backgrounds. To address privacy concerns, we utilized a lightweight deep neural network suitable for hospital deployment. The final model achieved an accuracy of 85.71% for MPNST diagnosis in the validation cohort and 84.75% accuracy in the independent test set, outperforming another classic two-step model. This success suggests potential for AI models in screening other whole-body primary/metastatic tumors.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"14 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143034970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning approach for noninvasive intracranial pressure estimation using pulsatile cranial expansion waveforms 利用脉动颅骨扩张波形估测无创颅内压的机器学习方法
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-01-26 DOI: 10.1038/s41746-025-01463-y
Gustavo Frigieri, Sérgio Brasil, Danilo Cardim, Marek Czosnyka, Matheus Ferreira, Wellingson S. Paiva, Xiao Hu

Noninvasive methods for intracranial pressure (ICP) monitoring have emerged, but none has successfully replaced invasive techniques. This observational study developed and tested a machine learning (ML) model to estimate ICP using waveforms from a cranial extensometer device (brain4care [B4C] System). The model explored multiple waveform parameters to optimize mean ICP estimation. Data from 112 neurocritical patients with acute brain injuries were used, with 92 patients randomly assigned to training and testing, and 20 reserved for independent validation. The ML model achieved a mean absolute error of 3.00 mmHg, with a 95% confidence interval within ±7.5 mmHg. Approximately 72% of estimates from the validation sample were within 0-4 mmHg of invasive ICP values. This proof-of-concept study demonstrates that noninvasive ICP estimation via the B4C System and ML is feasible. Prospective studies are needed to validate the model’s clinical utility across diverse settings.

{"title":"Machine learning approach for noninvasive intracranial pressure estimation using pulsatile cranial expansion waveforms","authors":"Gustavo Frigieri, Sérgio Brasil, Danilo Cardim, Marek Czosnyka, Matheus Ferreira, Wellingson S. Paiva, Xiao Hu","doi":"10.1038/s41746-025-01463-y","DOIUrl":"https://doi.org/10.1038/s41746-025-01463-y","url":null,"abstract":"<p>Noninvasive methods for intracranial pressure (ICP) monitoring have emerged, but none has successfully replaced invasive techniques. This observational study developed and tested a machine learning (ML) model to estimate ICP using waveforms from a cranial extensometer device (brain4care [B4C] System). The model explored multiple waveform parameters to optimize mean ICP estimation. Data from 112 neurocritical patients with acute brain injuries were used, with 92 patients randomly assigned to training and testing, and 20 reserved for independent validation. The ML model achieved a mean absolute error of 3.00 mmHg, with a 95% confidence interval within ±7.5 mmHg. Approximately 72% of estimates from the validation sample were within 0-4 mmHg of invasive ICP values. This proof-of-concept study demonstrates that noninvasive ICP estimation via the B4C System and ML is feasible. Prospective studies are needed to validate the model’s clinical utility across diverse settings.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"25 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143034968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable vision transformer for automatic visual sleep staging on multimodal PSG signals
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-01-25 DOI: 10.1038/s41746-024-01378-0
Hyojin Lee, You Rim Choi, Hyun Kyung Lee, Jaemin Jeong, Joopyo Hong, Hyun-Woo Shin, Hyung-Sin Kim

Polysomnography (PSG) is crucial for diagnosing sleep disorders, but manual scoring of PSG is time-consuming and subjective, leading to high variability. While machine-learning models have improved PSG scoring, their clinical use is hindered by the ‘black-box’ nature. In this study, we present SleepXViT, an automatic sleep staging system using Vision Transformer (ViT) that provides intuitive, consistent explanations by mimicking human ‘visual scoring’. Tested on KISS–a PSG image dataset from 7745 patients across four hospitals–SleepXViT achieved a Macro F1 score of 81.94%, outperforming baseline models and showing robust performances on public datasets SHHS1 and SHHS2. Furthermore, SleepXViT offers well-calibrated confidence scores, enabling expert review for low-confidence predictions, alongside high-resolution heatmaps highlighting essential features and relevance scores for adjacent epochs’ influence on sleep stage predictions. Together, these explanations reinforce the scoring consistency of SleepXViT, making it both reliable and interpretable, thereby facilitating the synergy between the AI model and human scorers in clinical settings.

{"title":"Explainable vision transformer for automatic visual sleep staging on multimodal PSG signals","authors":"Hyojin Lee, You Rim Choi, Hyun Kyung Lee, Jaemin Jeong, Joopyo Hong, Hyun-Woo Shin, Hyung-Sin Kim","doi":"10.1038/s41746-024-01378-0","DOIUrl":"https://doi.org/10.1038/s41746-024-01378-0","url":null,"abstract":"<p>Polysomnography (PSG) is crucial for diagnosing sleep disorders, but manual scoring of PSG is time-consuming and subjective, leading to high variability. While machine-learning models have improved PSG scoring, their clinical use is hindered by the ‘black-box’ nature. In this study, we present <i>SleepXViT</i>, an automatic sleep staging system using Vision Transformer (ViT) that provides intuitive, consistent explanations by mimicking human ‘visual scoring’. Tested on KISS–a PSG image dataset from 7745 patients across four hospitals–<i>SleepXViT</i> achieved a Macro F1 score of 81.94%, outperforming baseline models and showing robust performances on public datasets SHHS1 and SHHS2. Furthermore, <i>SleepXViT</i> offers well-calibrated confidence scores, enabling expert review for low-confidence predictions, alongside high-resolution heatmaps highlighting essential features and relevance scores for adjacent epochs’ influence on sleep stage predictions. Together, these explanations reinforce the scoring consistency of <i>SleepXViT</i>, making it both reliable and interpretable, thereby facilitating the synergy between the AI model and human scorers in clinical settings.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"47 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143030921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
NPJ Digital Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1