Avijit Mitra, Kun Chen, Weisong Liu, Ronald C Kessler, Hong Yu
{"title":"Post-discharge suicide prediction among US veterans using natural language processing-enriched social and behavioral determinants of health.","authors":"Avijit Mitra, Kun Chen, Weisong Liu, Ronald C Kessler, Hong Yu","doi":"10.1038/s44184-025-00120-2","DOIUrl":null,"url":null,"abstract":"<p><p>Despite the established association between social and behavioral determinants of health (SBDH) and suicide risk, SBDHs from unstructured electronic health record notes for suicide prediction remain underutilized. This study investigates the impact of SBDH identified from both structured and unstructured data utilizing a natural language processing (NLP) system on suicide prediction at 7, 30, 90, and 180 days post-discharge. Using data from 2,987,006 US Veterans between 1 October 2009, and 30 September 2015, we designed a case-control study demonstrating that structured and NLP-extracted SBDH significantly enhance distinct prediction models' performance. For example, the random forest model improved its 180-day post-discharge prediction with an area under the receiver operating characteristic curve increase from 83.57% to 84.25% (95% CI = 0.63%-0.98%, p val < 0.001) and area under the precision-recall curve increase from 57.38% to 59.87% (95% CI = 3.86%-4.82%, p val < 0.001) after integrating NLP-extracted SBDH. These findings underscore the potential of NLP-extracted SBDH in advancing suicide prediction.</p>","PeriodicalId":74321,"journal":{"name":"Npj mental health research","volume":"4 1","pages":"8"},"PeriodicalIF":0.0000,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11846906/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Npj mental health research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s44184-025-00120-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Despite the established association between social and behavioral determinants of health (SBDH) and suicide risk, SBDHs from unstructured electronic health record notes for suicide prediction remain underutilized. This study investigates the impact of SBDH identified from both structured and unstructured data utilizing a natural language processing (NLP) system on suicide prediction at 7, 30, 90, and 180 days post-discharge. Using data from 2,987,006 US Veterans between 1 October 2009, and 30 September 2015, we designed a case-control study demonstrating that structured and NLP-extracted SBDH significantly enhance distinct prediction models' performance. For example, the random forest model improved its 180-day post-discharge prediction with an area under the receiver operating characteristic curve increase from 83.57% to 84.25% (95% CI = 0.63%-0.98%, p val < 0.001) and area under the precision-recall curve increase from 57.38% to 59.87% (95% CI = 3.86%-4.82%, p val < 0.001) after integrating NLP-extracted SBDH. These findings underscore the potential of NLP-extracted SBDH in advancing suicide prediction.