V Zhu, L Lenert, B Bunnell, J Obeid, M Jefferson, C H Halbert
{"title":"Automatically Identifying Financial Stress Information from Clinical Notes for Patients with Prostate Cancer.","authors":"V Zhu, L Lenert, B Bunnell, J Obeid, M Jefferson, C H Halbert","doi":"10.61545/crr-1-102","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Financial stress, one of the social determinants, is common among cancer patients because of high out-ofpocket costs for treatment, as well as indirect costs. The National Academy of Medicine (NAM) has advised providers to recognize and discuss cost concerns with patients in order to enhance shared decision-making for treatment and exploration of financial assistant programs. However, financial stress is rarely assessed in clinical practice or research, thus, under-coded and under-documented in clinical practice. Natural language processing (NLP) offers great potential that can automatically extract and process data on financial stress from clinical free text existing in the patient electronic health record (EHR).</p><p><strong>Methods: </strong>We developed and evaluated an NLP approach to identify financial stress from clinical narratives for patients with prostate cancer. Of 4,195 eligible prostate cancer patients, we randomly sampled 3,138 patients (75%) as a training dataset (150,990 documents) to develop a financial stress lexicon and NLP algorithms iteratively. The remaining 1,057 patients (25%) were used as a test dataset (55,516 documents) to evaluate the NLP algorithm performance. The common terms representing financial stress were \"financial concerns,\" \"unable to afford,\" \"insurance issue,\" \"unemployed,\" and \"financial assistance.\" Negations were used to exclude false mentions of financial stress.</p><p><strong>Results: </strong>Applying both pre- and post-negation, the NLP algorithm identified 209 patients (6.0%) from the training sample and 66 patients (6.2%) with 161 notes from the test sample as having documented financial stress. Two independent domain experts manually reviewed all 161 notes with NLP identified positives and randomly selected 161 notes with NLP-identified negatives, the NLP algorithm yielded 0.86 for precision, 1 for recall, and 0.9.2 for F-score.</p><p><strong>Conclusions: </strong>Financial stress information is not commonly documented in the EHR, neither in structured format nor in clinical narratives. However, natural language processing can accurately extract financial stress information from clinical notes when such narrative information is available.</p>","PeriodicalId":516857,"journal":{"name":"Cancer research and reports","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10840090/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer research and reports","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.61545/crr-1-102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Financial stress, one of the social determinants, is common among cancer patients because of high out-ofpocket costs for treatment, as well as indirect costs. The National Academy of Medicine (NAM) has advised providers to recognize and discuss cost concerns with patients in order to enhance shared decision-making for treatment and exploration of financial assistant programs. However, financial stress is rarely assessed in clinical practice or research, thus, under-coded and under-documented in clinical practice. Natural language processing (NLP) offers great potential that can automatically extract and process data on financial stress from clinical free text existing in the patient electronic health record (EHR).
Methods: We developed and evaluated an NLP approach to identify financial stress from clinical narratives for patients with prostate cancer. Of 4,195 eligible prostate cancer patients, we randomly sampled 3,138 patients (75%) as a training dataset (150,990 documents) to develop a financial stress lexicon and NLP algorithms iteratively. The remaining 1,057 patients (25%) were used as a test dataset (55,516 documents) to evaluate the NLP algorithm performance. The common terms representing financial stress were "financial concerns," "unable to afford," "insurance issue," "unemployed," and "financial assistance." Negations were used to exclude false mentions of financial stress.
Results: Applying both pre- and post-negation, the NLP algorithm identified 209 patients (6.0%) from the training sample and 66 patients (6.2%) with 161 notes from the test sample as having documented financial stress. Two independent domain experts manually reviewed all 161 notes with NLP identified positives and randomly selected 161 notes with NLP-identified negatives, the NLP algorithm yielded 0.86 for precision, 1 for recall, and 0.9.2 for F-score.
Conclusions: Financial stress information is not commonly documented in the EHR, neither in structured format nor in clinical narratives. However, natural language processing can accurately extract financial stress information from clinical notes when such narrative information is available.