Knowledge-aware Assessment of Severity of Suicide Risk for Early Intervention

The World Wide Web Conference Pub Date : 2019-05-13 DOI:10.1145/3308558.3313698

Manas Gaur, Amanuel Alambo, Joy Prakash Sain, Ugur Kursuncu, K. Thirunarayan, Ramakanth Kavuluru, A. Sheth, R. Welton, Jyotishman Pathak

{"title":"Knowledge-aware Assessment of Severity of Suicide Risk for Early Intervention","authors":"Manas Gaur, Amanuel Alambo, Joy Prakash Sain, Ugur Kursuncu, K. Thirunarayan, Ramakanth Kavuluru, A. Sheth, R. Welton, Jyotishman Pathak","doi":"10.1145/3308558.3313698","DOIUrl":null,"url":null,"abstract":"Mental health illness such as depression is a significant risk factor for suicide ideation, behaviors, and attempts. A report by Substance Abuse and Mental Health Services Administration (SAMHSA) shows that 80% of the patients suffering from Borderline Personality Disorder (BPD) have suicidal behavior, 5-10% of whom commit suicide. While multiple initiatives have been developed and implemented for suicide prevention, a key challenge has been the social stigma associated with mental disorders, which deters patients from seeking help or sharing their experiences directly with others including clinicians. This is particularly true for teenagers and younger adults where suicide is the second highest cause of death in the US. Prior research involving surveys and questionnaires (e.g. PHQ-9) for suicide risk prediction failed to provide a quantitative assessment of risk that informed timely clinical decision-making for intervention. Our interdisciplinary study concerns the use of Reddit as an unobtrusive data source for gleaning information about suicidal tendencies and other related mental health conditions afflicting depressed users. We provide details of our learning framework that incorporates domain-specific knowledge to predict the severity of suicide risk for an individual. Our approach involves developing a suicide risk severity lexicon using medical knowledge bases and suicide ontology to detect cues relevant to suicidal thoughts and actions. We also use language modeling, medical entity recognition and normalization and negation detection to create a dataset of 2181 redditors that have discussed or implied suicidal ideation, behavior, or attempt. Given the importance of clinical knowledge, our gold standard dataset of 500 redditors (out of 2181) was developed by four practicing psychiatrists following the guidelines outlined in Columbia Suicide Severity Rating Scale (C-SSRS), with the pairwise annotator agreement of 0.79 and group-wise agreement of 0.73. Compared to the existing four-label classification scheme (no risk, low risk, moderate risk, and high risk), our proposed C-SSRS-based 5-label classification scheme distinguishes people who are supportive, from those who show different severity of suicidal tendency. Our 5-label classification scheme outperforms the state-of-the-art schemes by improving the graded recall by 4.2% and reducing the perceived risk measure by 12.5%. Convolutional neural network (CNN) provided the best performance in our scheme due to the discriminative features and use of domain-specific knowledge resources, in comparison to SVM-L that has been used in the state-of-the-art tools over similar dataset.","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"107","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The World Wide Web Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3308558.3313698","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 107

Abstract

Mental health illness such as depression is a significant risk factor for suicide ideation, behaviors, and attempts. A report by Substance Abuse and Mental Health Services Administration (SAMHSA) shows that 80% of the patients suffering from Borderline Personality Disorder (BPD) have suicidal behavior, 5-10% of whom commit suicide. While multiple initiatives have been developed and implemented for suicide prevention, a key challenge has been the social stigma associated with mental disorders, which deters patients from seeking help or sharing their experiences directly with others including clinicians. This is particularly true for teenagers and younger adults where suicide is the second highest cause of death in the US. Prior research involving surveys and questionnaires (e.g. PHQ-9) for suicide risk prediction failed to provide a quantitative assessment of risk that informed timely clinical decision-making for intervention. Our interdisciplinary study concerns the use of Reddit as an unobtrusive data source for gleaning information about suicidal tendencies and other related mental health conditions afflicting depressed users. We provide details of our learning framework that incorporates domain-specific knowledge to predict the severity of suicide risk for an individual. Our approach involves developing a suicide risk severity lexicon using medical knowledge bases and suicide ontology to detect cues relevant to suicidal thoughts and actions. We also use language modeling, medical entity recognition and normalization and negation detection to create a dataset of 2181 redditors that have discussed or implied suicidal ideation, behavior, or attempt. Given the importance of clinical knowledge, our gold standard dataset of 500 redditors (out of 2181) was developed by four practicing psychiatrists following the guidelines outlined in Columbia Suicide Severity Rating Scale (C-SSRS), with the pairwise annotator agreement of 0.79 and group-wise agreement of 0.73. Compared to the existing four-label classification scheme (no risk, low risk, moderate risk, and high risk), our proposed C-SSRS-based 5-label classification scheme distinguishes people who are supportive, from those who show different severity of suicidal tendency. Our 5-label classification scheme outperforms the state-of-the-art schemes by improving the graded recall by 4.2% and reducing the perceived risk measure by 12.5%. Convolutional neural network (CNN) provided the best performance in our scheme due to the discriminative features and use of domain-specific knowledge resources, in comparison to SVM-L that has been used in the state-of-the-art tools over similar dataset.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

早期干预自杀风险严重程度的知识意识评估

精神疾病，如抑郁症，是自杀意念、行为和企图的重要危险因素。药物滥用和精神健康服务管理局(SAMHSA)的一份报告显示，80%的边缘型人格障碍(BPD)患者有自杀行为，其中5-10%的人自杀。虽然为预防自杀制定和实施了多项举措，但一个关键挑战是与精神障碍相关的社会污名，这阻碍了患者寻求帮助或直接与包括临床医生在内的其他人分享他们的经验。对于青少年和年轻人来说尤其如此，自杀是美国第二大死亡原因。先前的研究包括调查和问卷(如PHQ-9)自杀风险预测，但未能提供定量的风险评估，为及时的临床干预决策提供信息。我们的跨学科研究关注的是使用Reddit作为一个不引人注目的数据源来收集有关自杀倾向和其他相关心理健康状况的信息，这些信息折磨着抑郁的用户。我们提供了我们的学习框架的细节，该框架结合了特定领域的知识来预测个人自杀风险的严重程度。我们的方法包括使用医学知识库和自杀本体开发自杀风险严重程度词典，以检测与自杀想法和行为相关的线索。我们还使用语言建模、医疗实体识别和规范化以及否定检测来创建一个包含2181名reddit用户的数据集，这些用户讨论或暗示了自杀念头、行为或企图。考虑到临床知识的重要性，我们的500名redditor(共2181人)的金标准数据集是由四位执业精神科医生根据哥伦比亚自杀严重程度评定量表(C-SSRS)中概述的指导原则开发的，双注释者一致性为0.79，组一致度为0.73。与现有的四标签分类方案(无风险、低风险、中风险和高风险)相比，我们提出的基于c - ssrs的五标签分类方案将支持者与表现出不同严重程度自杀倾向的人区分开来。我们的5标签分类方案优于最先进的方案，将分级召回率提高了4.2%，将感知风险度量降低了12.5%。与SVM-L相比，卷积神经网络(CNN)在我们的方案中提供了最好的性能，因为它具有判别特征和使用特定领域的知识资源，SVM-L已经在类似的数据集上使用了最先进的工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助