自然语言处理对照顾者策略进行分类,支持患有颅面显微畸形和其他儿童发病残疾的儿童和青年的参与。

IF 5.4 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Journal of healthcare informatics research Pub Date : 2023-09-18 eCollection Date: 2023-12-01 DOI:10.1007/s41666-023-00149-y
Vera C Kaelin, Andrew D Boyd, Martha M Werler, Natalie Parde, Mary A Khetani
{"title":"自然语言处理对照顾者策略进行分类,支持患有颅面显微畸形和其他儿童发病残疾的儿童和青年的参与。","authors":"Vera C Kaelin, Andrew D Boyd, Martha M Werler, Natalie Parde, Mary A Khetani","doi":"10.1007/s41666-023-00149-y","DOIUrl":null,"url":null,"abstract":"<p><p>Customizing participation-focused pediatric rehabilitation interventions is an important but also complex and potentially resource intensive process, which may benefit from automated and simplified steps. This research aimed at applying natural language processing to develop and identify a best performing predictive model that classifies caregiver strategies into participation-related constructs, while filtering out non-strategies. We created a dataset including 1,576 caregiver strategies obtained from 236 families of children and youth (11-17 years) with craniofacial microsomia or other childhood-onset disabilities. These strategies were annotated to four participation-related constructs and a non-strategy class. We experimented with manually created features (i.e., speech and dependency tags, predefined likely sets of words, dense lexicon features (i.e., Unified Medical Language System (UMLS) concepts)) and three classical methods (i.e., logistic regression, naïve Bayes, support vector machines (SVM)). We tested a series of binary and multinomial classification tasks applying 10-fold cross-validation on the training set (80%) to test the best performing model on the held-out test set (20%). SVM using term frequency-inverse document frequency (TF-IDF) was the best performing model for all four classification tasks, with accuracy ranging from 78.10 to 94.92% and a macro-averaged F1-score ranging from 0.58 to 0.83. Manually created features only increased model performance when filtering out non-strategies. Results suggest pipelined classification tasks (i.e., filtering out non-strategies; classification into intrinsic and extrinsic strategies; classification into participation-related constructs) for implementation into participation-focused pediatric rehabilitation interventions like Participation and Environment Measure Plus (PEM+) among caregivers who complete the Participation and Environment Measure for Children and Youth (PEM-CY).</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s41666-023-00149-y.</p>","PeriodicalId":101413,"journal":{"name":"Journal of healthcare informatics research","volume":null,"pages":null},"PeriodicalIF":5.4000,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10620347/pdf/","citationCount":"0","resultStr":"{\"title\":\"Natural Language Processing to Classify Caregiver Strategies Supporting Participation Among Children and Youth with Craniofacial Microsomia and Other Childhood-Onset Disabilities.\",\"authors\":\"Vera C Kaelin, Andrew D Boyd, Martha M Werler, Natalie Parde, Mary A Khetani\",\"doi\":\"10.1007/s41666-023-00149-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Customizing participation-focused pediatric rehabilitation interventions is an important but also complex and potentially resource intensive process, which may benefit from automated and simplified steps. This research aimed at applying natural language processing to develop and identify a best performing predictive model that classifies caregiver strategies into participation-related constructs, while filtering out non-strategies. We created a dataset including 1,576 caregiver strategies obtained from 236 families of children and youth (11-17 years) with craniofacial microsomia or other childhood-onset disabilities. These strategies were annotated to four participation-related constructs and a non-strategy class. We experimented with manually created features (i.e., speech and dependency tags, predefined likely sets of words, dense lexicon features (i.e., Unified Medical Language System (UMLS) concepts)) and three classical methods (i.e., logistic regression, naïve Bayes, support vector machines (SVM)). We tested a series of binary and multinomial classification tasks applying 10-fold cross-validation on the training set (80%) to test the best performing model on the held-out test set (20%). SVM using term frequency-inverse document frequency (TF-IDF) was the best performing model for all four classification tasks, with accuracy ranging from 78.10 to 94.92% and a macro-averaged F1-score ranging from 0.58 to 0.83. Manually created features only increased model performance when filtering out non-strategies. Results suggest pipelined classification tasks (i.e., filtering out non-strategies; classification into intrinsic and extrinsic strategies; classification into participation-related constructs) for implementation into participation-focused pediatric rehabilitation interventions like Participation and Environment Measure Plus (PEM+) among caregivers who complete the Participation and Environment Measure for Children and Youth (PEM-CY).</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s41666-023-00149-y.</p>\",\"PeriodicalId\":101413,\"journal\":{\"name\":\"Journal of healthcare informatics research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.4000,\"publicationDate\":\"2023-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10620347/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of healthcare informatics research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s41666-023-00149-y\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/12/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of healthcare informatics research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s41666-023-00149-y","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/12/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

定制以参与为重点的儿科康复干预措施是一个重要但也复杂且潜在的资源密集型过程,这可能受益于自动化和简化的步骤。这项研究旨在应用自然语言处理来开发和确定一个性能最佳的预测模型,该模型将照顾者策略分类为参与相关的结构,同时过滤掉非策略。我们创建了一个数据集,其中包括从236个患有颅面侏儒症或其他儿童期残疾的儿童和青少年(11-17岁)家庭获得的1576项护理策略。这些策略被注释为四个与参与相关的结构和一个非策略类。我们实验了手动创建的特征(即语音和依赖标签、预定义的可能单词集、密集词典特征(即统一医学语言系统(UMLS)概念)和三种经典方法(即逻辑回归、天真贝叶斯、支持向量机(SVM))。我们在训练集(80%)上应用10倍交叉验证测试了一系列二进制和多项式分类任务,以在保留的测试集(20%)上测试性能最好的模型。使用术语频率逆文档频率(TF-IDF)的SVM是所有四个分类任务中性能最好的模型,准确率在78.10%到94.92%之间,宏观平均F1得分在0.58到0.83之间。手动创建的特征只会在过滤掉非策略时提高模型性能。结果表明,在完成参与与环境的护理人员中,流水线式的分类任务(即过滤非策略;分类为内在和外在策略;分类至参与相关结构)可用于实施以参与为重点的儿科康复干预措施,如参与与环境测量+(PEM+)儿童和青年措施(PEM-CY)。补充信息:在线版本包含补充材料,可访问10.1007/s41666-023-00149-y。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Natural Language Processing to Classify Caregiver Strategies Supporting Participation Among Children and Youth with Craniofacial Microsomia and Other Childhood-Onset Disabilities.

Customizing participation-focused pediatric rehabilitation interventions is an important but also complex and potentially resource intensive process, which may benefit from automated and simplified steps. This research aimed at applying natural language processing to develop and identify a best performing predictive model that classifies caregiver strategies into participation-related constructs, while filtering out non-strategies. We created a dataset including 1,576 caregiver strategies obtained from 236 families of children and youth (11-17 years) with craniofacial microsomia or other childhood-onset disabilities. These strategies were annotated to four participation-related constructs and a non-strategy class. We experimented with manually created features (i.e., speech and dependency tags, predefined likely sets of words, dense lexicon features (i.e., Unified Medical Language System (UMLS) concepts)) and three classical methods (i.e., logistic regression, naïve Bayes, support vector machines (SVM)). We tested a series of binary and multinomial classification tasks applying 10-fold cross-validation on the training set (80%) to test the best performing model on the held-out test set (20%). SVM using term frequency-inverse document frequency (TF-IDF) was the best performing model for all four classification tasks, with accuracy ranging from 78.10 to 94.92% and a macro-averaged F1-score ranging from 0.58 to 0.83. Manually created features only increased model performance when filtering out non-strategies. Results suggest pipelined classification tasks (i.e., filtering out non-strategies; classification into intrinsic and extrinsic strategies; classification into participation-related constructs) for implementation into participation-focused pediatric rehabilitation interventions like Participation and Environment Measure Plus (PEM+) among caregivers who complete the Participation and Environment Measure for Children and Youth (PEM-CY).

Supplementary information: The online version contains supplementary material available at 10.1007/s41666-023-00149-y.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
CliqueFluxNet: Unveiling EHR Insights with Stochastic Edge Fluxing and Maximal Clique Utilisation Using Graph Neural Networks. Analyzing Healthcare Processes with Incremental Process Discovery: Practical Insights from a Real-World Application. MELEP: A Novel Predictive Measure of Transferability in Multi-label ECG Diagnosis. DDE: Deep Dynamic Epidemiological Modeling for Infectious Illness Development Forecasting in Multi-level Geographic Entities. Analyzing the Impact of Personalization on Fairness in Federated Learning for Healthcare.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1