首页 > 最新文献

Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology最新文献

英文 中文
An Investigation of Deep Learning Systems for Suicide Risk Assessment 自杀风险评估的深度学习系统研究
Michelle Morales, Prajjalita Dey, T. Theisen, Daniel Belitz, N. Chernova
This work presents the systems explored as part of the CLPsych 2019 Shared Task. More specifically, this work explores the promise of deep learning systems for suicide risk assessment.
这项工作展示了作为CLPsych 2019共享任务的一部分所探索的系统。更具体地说,这项工作探讨了深度学习系统在自杀风险评估方面的前景。
{"title":"An Investigation of Deep Learning Systems for Suicide Risk Assessment","authors":"Michelle Morales, Prajjalita Dey, T. Theisen, Daniel Belitz, N. Chernova","doi":"10.18653/v1/W19-3023","DOIUrl":"https://doi.org/10.18653/v1/W19-3023","url":null,"abstract":"This work presents the systems explored as part of the CLPsych 2019 Shared Task. More specifically, this work explores the promise of deep learning systems for suicide risk assessment.","PeriodicalId":201097,"journal":{"name":"Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116463382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Identifying therapist conversational actions across diverse psychotherapeutic approaches 在不同的心理治疗方法中识别治疗师的对话行为
Fei-Tzin Lee, Derrick Hull, Jacob Levine, Bonnie Ray, K. McKeown
While conversation in therapy sessions can vary widely in both topic and style, an understanding of the underlying techniques used by therapists can provide valuable insights into how therapists best help clients of different types. Dialogue act classification aims to identify the conversational “action” each speaker takes at each utterance, such as sympathizing, problem-solving or assumption checking. We propose to apply dialogue act classification to therapy transcripts, using a therapy-specific labeling scheme, in order to gain a high-level understanding of the flow of conversation in therapy sessions. We present a novel annotation scheme that spans multiple psychotherapeutic approaches, apply it to a large and diverse corpus of psychotherapy transcripts, and present and discuss classification results obtained using both SVM and neural network-based models. The results indicate that identifying the structure and flow of therapeutic actions is an obtainable goal, opening up the opportunity in the future to provide therapeutic recommendations tailored to specific client situations.
虽然治疗过程中的谈话在主题和风格上都有很大的不同,但对治疗师使用的潜在技术的理解可以为治疗师如何最好地帮助不同类型的客户提供有价值的见解。对话行为分类旨在识别每个说话者在每次说话时所采取的对话“行动”,如同情、解决问题或检查假设。我们建议将对话行为分类应用于治疗记录,使用治疗特定的标签方案,以便对治疗过程中的对话流程有一个高层次的理解。我们提出了一种跨越多种心理治疗方法的新型注释方案,将其应用于大量不同的心理治疗记录语料库,并展示和讨论了使用支持向量机和基于神经网络的模型获得的分类结果。结果表明,确定治疗行动的结构和流程是一个可实现的目标,为未来提供针对特定客户情况的治疗建议提供了机会。
{"title":"Identifying therapist conversational actions across diverse psychotherapeutic approaches","authors":"Fei-Tzin Lee, Derrick Hull, Jacob Levine, Bonnie Ray, K. McKeown","doi":"10.18653/v1/W19-3002","DOIUrl":"https://doi.org/10.18653/v1/W19-3002","url":null,"abstract":"While conversation in therapy sessions can vary widely in both topic and style, an understanding of the underlying techniques used by therapists can provide valuable insights into how therapists best help clients of different types. Dialogue act classification aims to identify the conversational “action” each speaker takes at each utterance, such as sympathizing, problem-solving or assumption checking. We propose to apply dialogue act classification to therapy transcripts, using a therapy-specific labeling scheme, in order to gain a high-level understanding of the flow of conversation in therapy sessions. We present a novel annotation scheme that spans multiple psychotherapeutic approaches, apply it to a large and diverse corpus of psychotherapy transcripts, and present and discuss classification results obtained using both SVM and neural network-based models. The results indicate that identifying the structure and flow of therapeutic actions is an obtainable goal, opening up the opportunity in the future to provide therapeutic recommendations tailored to specific client situations.","PeriodicalId":201097,"journal":{"name":"Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115453566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Analyzing the use of existing systems for the CLPsych 2019 Shared Task 分析CLPsych 2019共享任务中现有系统的使用情况
A. Hevia, R. Menéndez, Daniel Gayo-Avello
In this paper we describe the UniOvi-WESO classification systems proposed for the 2019 Computational Linguistics and Clinical Psychology (CLPsych) Shared Task. We explore the use of two systems trained with ReachOut data from the 2016 CLPsych task, and compare them to a baseline system trained with the data provided for this task. All the classifiers were trained with features extracted just from the text of each post, without using any other metadata. We found out that the baseline system performs slightly better than the pretrained systems, mainly due to the differences in labeling between the two tasks. However, they still work reasonably well and can detect if a user is at risk of suicide or not.
在本文中,我们描述了为2019计算语言学和临床心理学(CLPsych)共享任务提出的UniOvi-WESO分类系统。我们探索了使用来自2016 CLPsych任务的ReachOut数据训练的两个系统的使用,并将它们与使用该任务提供的数据训练的基线系统进行比较。所有分类器都使用从每篇文章的文本中提取的特征进行训练,而不使用任何其他元数据。我们发现基线系统的表现略好于预训练系统,主要是由于两个任务之间标记的差异。然而,他们仍然工作得相当好,可以检测用户是否有自杀的风险。
{"title":"Analyzing the use of existing systems for the CLPsych 2019 Shared Task","authors":"A. Hevia, R. Menéndez, Daniel Gayo-Avello","doi":"10.18653/v1/W19-3017","DOIUrl":"https://doi.org/10.18653/v1/W19-3017","url":null,"abstract":"In this paper we describe the UniOvi-WESO classification systems proposed for the 2019 Computational Linguistics and Clinical Psychology (CLPsych) Shared Task. We explore the use of two systems trained with ReachOut data from the 2016 CLPsych task, and compare them to a baseline system trained with the data provided for this task. All the classifiers were trained with features extracted just from the text of each post, without using any other metadata. We found out that the baseline system performs slightly better than the pretrained systems, mainly due to the differences in labeling between the two tasks. However, they still work reasonably well and can detect if a user is at risk of suicide or not.","PeriodicalId":201097,"journal":{"name":"Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122342482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Temporal Analysis of the Semantic Verbal Fluency Task in Persons with Subjective and Mild Cognitive Impairment 主观和轻度认知障碍患者语义语言流畅性任务的时间分析
N. Linz, Kristina Lundholm Fors, Hali Lindsay, M. Eckerström, J. Alexandersson, D. Kokkinakis
The Semantic Verbal Fluency (SVF) task is a classical neuropsychological assessment where persons are asked to produce words belonging to a semantic category (e.g., animals) in a given time. This paper introduces a novel method of temporal analysis for SVF tasks utilizing time intervals and applies it to a corpus of elderly Swedish subjects (mild cognitive impairment, subjective cognitive impairment and healthy controls). A general decline in word count and lexical frequency over the course of the task is revealed, as well as an increase in word transition times. Persons with subjective cognitive impairment had a higher word count during the last intervals, but produced words of the same lexical frequencies. Persons with MCI had a steeper decline in both word count and lexical frequencies during the third interval. Additional correlations with neuropsychological scores suggest these findings are linked to a person’s overall vocabulary size and processing speed, respectively. Classification results improved when adding the novel features (AUC=0.72), supporting their diagnostic value.
语义语言流畅性(SVF)任务是一种经典的神经心理学评估,要求人们在给定的时间内说出属于语义类别的单词(例如,动物)。本文介绍了一种利用时间间隔对SVF任务进行时间分析的新方法,并将其应用于瑞典老年受试者(轻度认知障碍、主观认知障碍和健康对照)的语料库。在任务过程中,字数和词汇频率普遍下降,同时单词转换时间增加。有主观认知障碍的人在最后一段时间内字数较高,但词汇频率相同。患有轻度认知障碍的人在第三个间隔期间的字数和词汇频率下降幅度更大。与神经心理学分数的其他相关性表明,这些发现分别与一个人的整体词汇量和处理速度有关。添加新特征后,分类结果得到改善(AUC=0.72),支持其诊断价值。
{"title":"Temporal Analysis of the Semantic Verbal Fluency Task in Persons with Subjective and Mild Cognitive Impairment","authors":"N. Linz, Kristina Lundholm Fors, Hali Lindsay, M. Eckerström, J. Alexandersson, D. Kokkinakis","doi":"10.18653/v1/W19-3012","DOIUrl":"https://doi.org/10.18653/v1/W19-3012","url":null,"abstract":"The Semantic Verbal Fluency (SVF) task is a classical neuropsychological assessment where persons are asked to produce words belonging to a semantic category (e.g., animals) in a given time. This paper introduces a novel method of temporal analysis for SVF tasks utilizing time intervals and applies it to a corpus of elderly Swedish subjects (mild cognitive impairment, subjective cognitive impairment and healthy controls). A general decline in word count and lexical frequency over the course of the task is revealed, as well as an increase in word transition times. Persons with subjective cognitive impairment had a higher word count during the last intervals, but produced words of the same lexical frequencies. Persons with MCI had a steeper decline in both word count and lexical frequencies during the third interval. Additional correlations with neuropsychological scores suggest these findings are linked to a person’s overall vocabulary size and processing speed, respectively. Classification results improved when adding the novel features (AUC=0.72), supporting their diagnostic value.","PeriodicalId":201097,"journal":{"name":"Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128767863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
CLPsych2019 Shared Task: Predicting Suicide Risk Level from Reddit Posts on Multiple Forums CLPsych2019共享任务:从多个论坛上的Reddit帖子预测自杀风险水平
V. Ruiz, Lingyun Shi, Wei Quan, N. Ryan, C. Biernesser, D. Brent, R. Tsui
We aimed to predict an individual suicide risk level from longitudinal posts on Reddit discussion forums. Through participating in a shared task competition hosted by CLPsych2019, we received two annotated datasets: a training dataset with 496 users (31,553 posts) and a test dataset with 125 users (9610 posts). We submitted results from our three best-performing machine-learning models: SVM, Naïve Bayes, and an ensemble model. Each model provided a user’s suicide risk level in four categories, i.e., no risk, low risk, moderate risk, and severe risk. Among the three models, the ensemble model had the best macro-averaged F1 score 0.379 when tested on the holdout test dataset. The NB model had the best performance in two additional binary-classification tasks, i.e., no risk vs. flagged risk (any risk level other than no risk) with F1 score 0.836 and no or low risk vs. urgent risk (moderate or severe risk) with F1 score 0.736. We conclude that the NB model may serve as a tool for identifying users with flagged or urgent suicide risk based on longitudinal posts on Reddit discussion forums.
我们旨在通过Reddit论坛上的纵向帖子预测个人自杀风险水平。通过参加CLPsych2019主办的共享任务竞赛,我们收到了两个带注释的数据集:一个有496个用户(31,553个帖子)的训练数据集和一个有125个用户(9610个帖子)的测试数据集。我们提交了三个表现最好的机器学习模型的结果:SVM、Naïve贝叶斯和一个集成模型。每个模型都提供了用户的自杀风险水平,分为无风险、低风险、中等风险和严重风险四类。在三个模型中,集成模型在holdout测试数据集上的宏观平均F1得分为0.379。NB模型在另外两个二元分类任务中表现最好,即无风险vs标记风险(除无风险之外的任何风险水平)F1得分为0.836,无风险或低风险vs紧急风险(中度或重度风险)F1得分为0.736。我们的结论是,NB模型可以作为一种工具,根据Reddit论坛上的纵向帖子,识别有标记或紧急自杀风险的用户。
{"title":"CLPsych2019 Shared Task: Predicting Suicide Risk Level from Reddit Posts on Multiple Forums","authors":"V. Ruiz, Lingyun Shi, Wei Quan, N. Ryan, C. Biernesser, D. Brent, R. Tsui","doi":"10.18653/v1/W19-3020","DOIUrl":"https://doi.org/10.18653/v1/W19-3020","url":null,"abstract":"We aimed to predict an individual suicide risk level from longitudinal posts on Reddit discussion forums. Through participating in a shared task competition hosted by CLPsych2019, we received two annotated datasets: a training dataset with 496 users (31,553 posts) and a test dataset with 125 users (9610 posts). We submitted results from our three best-performing machine-learning models: SVM, Naïve Bayes, and an ensemble model. Each model provided a user’s suicide risk level in four categories, i.e., no risk, low risk, moderate risk, and severe risk. Among the three models, the ensemble model had the best macro-averaged F1 score 0.379 when tested on the holdout test dataset. The NB model had the best performance in two additional binary-classification tasks, i.e., no risk vs. flagged risk (any risk level other than no risk) with F1 score 0.836 and no or low risk vs. urgent risk (moderate or severe risk) with F1 score 0.736. We conclude that the NB model may serve as a tool for identifying users with flagged or urgent suicide risk based on longitudinal posts on Reddit discussion forums.","PeriodicalId":201097,"journal":{"name":"Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology","volume":"356 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131873620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Computational Linguistics for Enhancing Scientific Reproducibility and Reducing Healthcare Inequities 提高科学再现性和减少医疗保健不公平的计算语言学
J. Parish-Morris
Computational linguistics holds promise for improving scientific integrity in clinical psychology, and for reducing longstanding inequities in healthcare access and quality. This paper describes how computational linguistics approaches could address the “reproducibility crisis” facing social science, particularly with regards to reliable diagnosis of neurodevelopmental and psychiatric conditions including autism spectrum disorder (ASD). It is argued that these improvements in scientific integrity are poised to naturally reduce persistent healthcare inequities in neglected subpopulations, such as verbally fluent girls and women with ASD, but that concerted attention to this issue is necessary to avoid reproducing biases built into training data. Finally, it is suggested that computational linguistics is just one component of an emergent digital phenotyping toolkit that could ultimately be used for clinical decision support, to improve clinical care via precision medicine (i.e., personalized intervention planning), granular treatment response monitoring (including remotely), and for gene-brain-behavior studies aiming to pinpoint the underlying biological etiology of otherwise behaviorally-defined conditions like ASD.
计算语言学有望提高临床心理学的科学完整性,并减少长期以来在医疗保健获取和质量方面的不平等。本文描述了计算语言学方法如何解决社会科学面临的“可重复性危机”,特别是在神经发育和精神疾病(包括自闭症谱系障碍(ASD))的可靠诊断方面。有人认为,这些科学完整性的提高自然会减少被忽视的亚群体中持续存在的医疗不平等,例如语言流利的女孩和患有自闭症的妇女,但对这一问题的一致关注是必要的,以避免再现训练数据中的偏见。最后,计算语言学只是新兴数字表型工具包的一个组成部分,最终可用于临床决策支持,通过精准医学(即个性化干预计划)改善临床护理,颗粒治疗反应监测(包括远程),以及基因-大脑-行为研究,旨在查明潜在的生物学病因,否则行为定义的条件,如ASD。
{"title":"Computational Linguistics for Enhancing Scientific Reproducibility and Reducing Healthcare Inequities","authors":"J. Parish-Morris","doi":"10.18653/v1/W19-3011","DOIUrl":"https://doi.org/10.18653/v1/W19-3011","url":null,"abstract":"Computational linguistics holds promise for improving scientific integrity in clinical psychology, and for reducing longstanding inequities in healthcare access and quality. This paper describes how computational linguistics approaches could address the “reproducibility crisis” facing social science, particularly with regards to reliable diagnosis of neurodevelopmental and psychiatric conditions including autism spectrum disorder (ASD). It is argued that these improvements in scientific integrity are poised to naturally reduce persistent healthcare inequities in neglected subpopulations, such as verbally fluent girls and women with ASD, but that concerted attention to this issue is necessary to avoid reproducing biases built into training data. Finally, it is suggested that computational linguistics is just one component of an emergent digital phenotyping toolkit that could ultimately be used for clinical decision support, to improve clinical care via precision medicine (i.e., personalized intervention planning), granular treatment response monitoring (including remotely), and for gene-brain-behavior studies aiming to pinpoint the underlying biological etiology of otherwise behaviorally-defined conditions like ASD.","PeriodicalId":201097,"journal":{"name":"Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133355605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
CLPsych 2019 Shared Task: Predicting the Degree of Suicide Risk in Reddit Posts CLPsych 2019共享任务:预测Reddit帖子中的自杀风险程度
Ayah Zirikly, P. Resnik, Özlem Uzuner, Kristy Hollingshead
The shared task for the 2019 Workshop on Computational Linguistics and Clinical Psychology (CLPsych’19) introduced an assessment of suicide risk based on social media postings, using data from Reddit to identify users at no, low, moderate, or severe risk. Two variations of the task focused on users whose posts to the r/SuicideWatch subreddit indicated they might be at risk; a third task looked at screening users based only on their more everyday (non-SuicideWatch) posts. We received submissions from 15 different teams, and the results provide progress and insight into the value of language signal in helping to predict risk level.
2019年计算语言学和临床心理学研讨会(CLPsych ' 19)的共同任务介绍了基于社交媒体帖子的自杀风险评估,使用来自Reddit的数据来识别无、低、中等或严重风险的用户。该任务的两种变体侧重于那些在reddit的r/SuicideWatch子版块上发帖表明自己可能处于危险中的用户;第三项任务是仅根据用户的日常(非自杀性观察)帖子来筛选用户。我们收到了来自15个不同团队的提交,结果为语言信号在帮助预测风险水平方面的价值提供了进展和见解。
{"title":"CLPsych 2019 Shared Task: Predicting the Degree of Suicide Risk in Reddit Posts","authors":"Ayah Zirikly, P. Resnik, Özlem Uzuner, Kristy Hollingshead","doi":"10.18653/v1/W19-3003","DOIUrl":"https://doi.org/10.18653/v1/W19-3003","url":null,"abstract":"The shared task for the 2019 Workshop on Computational Linguistics and Clinical Psychology (CLPsych’19) introduced an assessment of suicide risk based on social media postings, using data from Reddit to identify users at no, low, moderate, or severe risk. Two variations of the task focused on users whose posts to the r/SuicideWatch subreddit indicated they might be at risk; a third task looked at screening users based only on their more everyday (non-SuicideWatch) posts. We received submissions from 15 different teams, and the results provide progress and insight into the value of language signal in helping to predict risk level.","PeriodicalId":201097,"journal":{"name":"Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124098721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 156
Using Contextual Representations for Suicide Risk Assessment from Internet Forums 网络论坛自杀风险评估的语境表征
Ashwin Karthik Ambalavanan, P. D. Jagtap, Soumya Adhya, M. Devarakonda
Social media posts may yield clues to the subject’s (usually, the writer’s) suicide risk and intent, which can be used for timely intervention. This research, motivated by the CLPsych 2019 shared task, developed neural network-based methods for analyzing posts in one or more Reddit forums to assess the subject’s suicide risk. One of the technical challenges this task poses is the large amount of text from multiple posts of a single user. Our neural network models use the advanced multi-headed Attention-based autoencoder architecture, called Bidirectional Encoder Representations from Transformers (BERT). Our system achieved the 2nd best performance of 0.477 macro averaged F measure on Task A of the challenge. Among the three different alternatives we developed for the challenge, the single BERT model that processed all of a user’s posts performed the best on all three Tasks.
社交媒体上的帖子可能会提供有关主题(通常是作者)自杀风险和意图的线索,这些线索可以用于及时干预。在CLPsych 2019共享任务的推动下,这项研究开发了基于神经网络的方法,用于分析一个或多个Reddit论坛上的帖子,以评估主题的自杀风险。此任务带来的技术挑战之一是来自单个用户的多个帖子的大量文本。我们的神经网络模型使用先进的多头基于注意力的自编码器架构,称为变形金刚的双向编码器表示(BERT)。我们的系统在挑战的任务A上取得了0.477宏观平均F测量的第二好成绩。在我们为挑战开发的三个不同的替代方案中,处理所有用户帖子的单一BERT模型在所有三个任务上都表现最好。
{"title":"Using Contextual Representations for Suicide Risk Assessment from Internet Forums","authors":"Ashwin Karthik Ambalavanan, P. D. Jagtap, Soumya Adhya, M. Devarakonda","doi":"10.18653/v1/W19-3022","DOIUrl":"https://doi.org/10.18653/v1/W19-3022","url":null,"abstract":"Social media posts may yield clues to the subject’s (usually, the writer’s) suicide risk and intent, which can be used for timely intervention. This research, motivated by the CLPsych 2019 shared task, developed neural network-based methods for analyzing posts in one or more Reddit forums to assess the subject’s suicide risk. One of the technical challenges this task poses is the large amount of text from multiple posts of a single user. Our neural network models use the advanced multi-headed Attention-based autoencoder architecture, called Bidirectional Encoder Representations from Transformers (BERT). Our system achieved the 2nd best performance of 0.477 macro averaged F measure on Task A of the challenge. Among the three different alternatives we developed for the challenge, the single BERT model that processed all of a user’s posts performed the best on all three Tasks.","PeriodicalId":201097,"journal":{"name":"Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology","volume":"166 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114733123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Coherence models in schizophrenia 精神分裂症的连贯性模型
S. Just, Erik Haegert, Nora Kořánová, Anna-Lena Bröcker, Ivan Nenchev, J. Funcke, C. Montag, Manfred Stede
Incoherent discourse in schizophrenia has long been recognized as a dominant symptom of the mental disorder (Bleuler, 1911/1950). Recent studies have used modern sentence and word embeddings to compute coherence metrics for spontaneous speech in schizophrenia. While clinical ratings always have a subjective element, computational linguistic methodology allows quantification of speech abnormalities. Clinical and empirical knowledge from psychiatry provide the theoretical and conceptual basis for modelling. Our study is an interdisciplinary attempt at improving coherence models in schizophrenia. Speech samples were obtained from healthy controls and patients with a diagnosis of schizophrenia or schizoaffective disorder and different severity of positive formal thought disorder. Interviews were transcribed and coherence metrics derived from different embeddings. One model found higher coherence metrics for controls than patients. All other models remained non-significant. More detailed analysis of the data motivates different approaches to improving coherence models in schizophrenia, e.g. by assessing referential abnormalities.
精神分裂症的语无伦次一直被认为是精神障碍的主要症状(Bleuler, 1911/1950)。最近的研究使用现代句子和词嵌入来计算精神分裂症患者自发言语的连贯度量。虽然临床评分总是有主观因素,但计算语言学方法允许对言语异常进行量化。来自精神病学的临床和经验知识为建模提供了理论和概念基础。我们的研究是一项跨学科的尝试,旨在改善精神分裂症的连贯模型。语言样本来自健康对照和诊断为精神分裂症或分裂情感性障碍和不同严重程度的积极形式思维障碍的患者。对访谈进行转录,并从不同的嵌入中得出一致性指标。一个模型发现对照组的一致性指标高于患者。所有其他模型仍然不显著。更详细的数据分析激发了不同的方法来改善精神分裂症的一致性模型,例如通过评估参照异常。
{"title":"Coherence models in schizophrenia","authors":"S. Just, Erik Haegert, Nora Kořánová, Anna-Lena Bröcker, Ivan Nenchev, J. Funcke, C. Montag, Manfred Stede","doi":"10.18653/v1/W19-3015","DOIUrl":"https://doi.org/10.18653/v1/W19-3015","url":null,"abstract":"Incoherent discourse in schizophrenia has long been recognized as a dominant symptom of the mental disorder (Bleuler, 1911/1950). Recent studies have used modern sentence and word embeddings to compute coherence metrics for spontaneous speech in schizophrenia. While clinical ratings always have a subjective element, computational linguistic methodology allows quantification of speech abnormalities. Clinical and empirical knowledge from psychiatry provide the theoretical and conceptual basis for modelling. Our study is an interdisciplinary attempt at improving coherence models in schizophrenia. Speech samples were obtained from healthy controls and patients with a diagnosis of schizophrenia or schizoaffective disorder and different severity of positive formal thought disorder. Interviews were transcribed and coherence metrics derived from different embeddings. One model found higher coherence metrics for controls than patients. All other models remained non-significant. More detailed analysis of the data motivates different approaches to improving coherence models in schizophrenia, e.g. by assessing referential abnormalities.","PeriodicalId":201097,"journal":{"name":"Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology","volume":"345 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121518123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Reviving a psychometric measure: Classification and prediction of the Operant Motive Test 复兴心理测量:操作性动机测验的分类与预测
Dirk Johannßen, Chris Biemann, D. Scheffer
Implicit motives allow for the characterization of behavior, subsequent success and long-term development. While this has been operationalized in the operant motive test, research on motives has declined mainly due to labor-intensive and costly human annotation. In this study, we analyze over 200,000 labeled data items from 40,000 participants and utilize them for engineering features for training a logistic model tree machine learning model. It captures manually assigned motives well with an F-score of 80%, coming close to the pairwise annotator intraclass correlation coefficient of r = .85. In addition, we found a significant correlation of r = .2 between subsequent academic success and data automatically labeled with our model in an extrinsic evaluation.
内隐动机允许行为特征,随后的成功和长期发展。虽然这已经在操作性动机测试中实现了,但对动机的研究一直在下降,主要是由于人工注释的劳动密集型和成本高。在本研究中,我们分析了来自40,000名参与者的200,000多个标记数据项,并将其用于训练逻辑模型树机器学习模型的工程特征。它很好地捕获了手动分配的动机,f值为80%,接近成对注释器类内相关系数r = 0.85。此外,我们发现在随后的学业成功和在外部评估中自动标记为我们模型的数据之间存在r = 0.2的显著相关性。
{"title":"Reviving a psychometric measure: Classification and prediction of the Operant Motive Test","authors":"Dirk Johannßen, Chris Biemann, D. Scheffer","doi":"10.18653/v1/W19-3014","DOIUrl":"https://doi.org/10.18653/v1/W19-3014","url":null,"abstract":"Implicit motives allow for the characterization of behavior, subsequent success and long-term development. While this has been operationalized in the operant motive test, research on motives has declined mainly due to labor-intensive and costly human annotation. In this study, we analyze over 200,000 labeled data items from 40,000 participants and utilize them for engineering features for training a logistic model tree machine learning model. It captures manually assigned motives well with an F-score of 80%, coming close to the pairwise annotator intraclass correlation coefficient of r = .85. In addition, we found a significant correlation of r = .2 between subsequent academic success and data automatically labeled with our model in an extrinsic evaluation.","PeriodicalId":201097,"journal":{"name":"Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126534269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1