{"title":"自然语言处理和大型语言模型在健康社会决定因素中的应用:系统评价方案。","authors":"Swati Rajwal, Ziyuan Zhang, Yankai Chen, Hannah Rogers, Abeed Sarker, Yunyu Xiao","doi":"10.2196/66094","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>In recent years, the intersection of natural language processing (NLP) and public health has opened innovative pathways for investigating social determinants of health (SDOH) in textual datasets. Despite the promise of NLP in the SDOH domain, the literature is dispersed across various disciplines, and there is a need to consolidate existing knowledge, identify knowledge gaps in the literature, and inform future research directions in this emerging field.</p><p><strong>Objective: </strong>This research protocol describes a systematic review to identify and highlight NLP techniques, including large language models, used for SDOH-related studies.</p><p><strong>Methods: </strong>A search strategy will be executed across PubMed, Web of Science, IEEE Xplore, Scopus, PsycINFO, HealthSource: Academic Nursing, and ACL Anthology to find studies published in English between 2014 and 2024. Three reviewers (SR, ZZ, and YC) will independently screen the studies to avoid voting bias, and two (AS and YX) additional reviewers will resolve any conflicts during the screening process. We will further screen studies that cited the included studies (forward search). Following the title abstract and full-text screening, the characteristics and main findings of the included studies and resources will be tabulated, visualized, and summarized.</p><p><strong>Results: </strong>The search strategy was formulated and run across the 7 databases in August 2024. We expect the results to be submitted for peer review publication in early 2025. As of December 2024, the title and abstract screening was underway.</p><p><strong>Conclusions: </strong>This systematic review aims to provide a comprehensive study of existing research on the application of NLP for various SDOH tasks across multiple textual datasets. By rigorously evaluating the methodologies, tools, and outcomes of eligible studies, the review will identify gaps in current knowledge and suggest directions for future research in the form of specific research questions. The findings will be instrumental in developing more effective NLP models for SDOH, ultimately contributing to improved health outcomes and a better understanding of social determinants in diverse populations.</p><p><strong>International registered report identifier (irrid): </strong>DERR1-10.2196/66094.</p>","PeriodicalId":14755,"journal":{"name":"JMIR Research Protocols","volume":"14 ","pages":"e66094"},"PeriodicalIF":1.4000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11795155/pdf/","citationCount":"0","resultStr":"{\"title\":\"Applications of Natural Language Processing and Large Language Models for Social Determinants of Health: Protocol for a Systematic Review.\",\"authors\":\"Swati Rajwal, Ziyuan Zhang, Yankai Chen, Hannah Rogers, Abeed Sarker, Yunyu Xiao\",\"doi\":\"10.2196/66094\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>In recent years, the intersection of natural language processing (NLP) and public health has opened innovative pathways for investigating social determinants of health (SDOH) in textual datasets. Despite the promise of NLP in the SDOH domain, the literature is dispersed across various disciplines, and there is a need to consolidate existing knowledge, identify knowledge gaps in the literature, and inform future research directions in this emerging field.</p><p><strong>Objective: </strong>This research protocol describes a systematic review to identify and highlight NLP techniques, including large language models, used for SDOH-related studies.</p><p><strong>Methods: </strong>A search strategy will be executed across PubMed, Web of Science, IEEE Xplore, Scopus, PsycINFO, HealthSource: Academic Nursing, and ACL Anthology to find studies published in English between 2014 and 2024. Three reviewers (SR, ZZ, and YC) will independently screen the studies to avoid voting bias, and two (AS and YX) additional reviewers will resolve any conflicts during the screening process. We will further screen studies that cited the included studies (forward search). Following the title abstract and full-text screening, the characteristics and main findings of the included studies and resources will be tabulated, visualized, and summarized.</p><p><strong>Results: </strong>The search strategy was formulated and run across the 7 databases in August 2024. We expect the results to be submitted for peer review publication in early 2025. As of December 2024, the title and abstract screening was underway.</p><p><strong>Conclusions: </strong>This systematic review aims to provide a comprehensive study of existing research on the application of NLP for various SDOH tasks across multiple textual datasets. By rigorously evaluating the methodologies, tools, and outcomes of eligible studies, the review will identify gaps in current knowledge and suggest directions for future research in the form of specific research questions. The findings will be instrumental in developing more effective NLP models for SDOH, ultimately contributing to improved health outcomes and a better understanding of social determinants in diverse populations.</p><p><strong>International registered report identifier (irrid): </strong>DERR1-10.2196/66094.</p>\",\"PeriodicalId\":14755,\"journal\":{\"name\":\"JMIR Research Protocols\",\"volume\":\"14 \",\"pages\":\"e66094\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11795155/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR Research Protocols\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2196/66094\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Research Protocols","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/66094","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
摘要
背景:近年来,自然语言处理(NLP)和公共卫生的交叉研究为在文本数据集中研究健康的社会决定因素(SDOH)开辟了创新途径。尽管NLP在SDOH领域的前景光明,但文献分散在各个学科中,需要巩固现有知识,识别文献中的知识空白,并为这一新兴领域的未来研究方向提供信息。目的:本研究方案描述了一项系统综述,以识别和突出用于sdoh相关研究的NLP技术,包括大型语言模型。方法:在PubMed、Web of Science、IEEE explore、Scopus、PsycINFO、HealthSource: Academic Nursing和ACL Anthology上执行搜索策略,查找2014年至2024年间发表的英文研究。三名审稿人(SR、ZZ和YC)将独立筛选研究以避免投票偏倚,另外两名审稿人(AS和YX)将解决筛选过程中的任何冲突。我们将进一步筛选引用纳入研究的研究(前向检索)。在标题摘要和全文筛选之后,将对纳入的研究和资源的特征和主要发现进行制表、可视化和总结。结果:制定了检索策略,并于2024年8月在7个数据库中运行。我们预计研究结果将于2025年初提交同行评议。截至2024年12月,片名和摘要放映正在进行中。结论:本系统综述旨在对NLP在不同文本数据集上应用于各种SDOH任务的现有研究进行全面研究。通过严格评估方法、工具和合格研究的结果,本综述将确定当前知识的差距,并以具体研究问题的形式为未来的研究提出方向。这些发现将有助于为SDOH开发更有效的NLP模型,最终有助于改善健康结果和更好地了解不同人群的社会决定因素。国际注册报告标识符(irrid): DERR1-10.2196/66094。
Applications of Natural Language Processing and Large Language Models for Social Determinants of Health: Protocol for a Systematic Review.
Background: In recent years, the intersection of natural language processing (NLP) and public health has opened innovative pathways for investigating social determinants of health (SDOH) in textual datasets. Despite the promise of NLP in the SDOH domain, the literature is dispersed across various disciplines, and there is a need to consolidate existing knowledge, identify knowledge gaps in the literature, and inform future research directions in this emerging field.
Objective: This research protocol describes a systematic review to identify and highlight NLP techniques, including large language models, used for SDOH-related studies.
Methods: A search strategy will be executed across PubMed, Web of Science, IEEE Xplore, Scopus, PsycINFO, HealthSource: Academic Nursing, and ACL Anthology to find studies published in English between 2014 and 2024. Three reviewers (SR, ZZ, and YC) will independently screen the studies to avoid voting bias, and two (AS and YX) additional reviewers will resolve any conflicts during the screening process. We will further screen studies that cited the included studies (forward search). Following the title abstract and full-text screening, the characteristics and main findings of the included studies and resources will be tabulated, visualized, and summarized.
Results: The search strategy was formulated and run across the 7 databases in August 2024. We expect the results to be submitted for peer review publication in early 2025. As of December 2024, the title and abstract screening was underway.
Conclusions: This systematic review aims to provide a comprehensive study of existing research on the application of NLP for various SDOH tasks across multiple textual datasets. By rigorously evaluating the methodologies, tools, and outcomes of eligible studies, the review will identify gaps in current knowledge and suggest directions for future research in the form of specific research questions. The findings will be instrumental in developing more effective NLP models for SDOH, ultimately contributing to improved health outcomes and a better understanding of social determinants in diverse populations.
International registered report identifier (irrid): DERR1-10.2196/66094.