Zhong Ding , Yang Zhou , An-Jie Dai , Chen Qian , Bao-Liang Zhong , Chen-Ling Liu , Zhen-Tao Liu
{"title":"Speech based suicide risk recognition for crisis intervention hotlines using explainable multi-task learning","authors":"Zhong Ding , Yang Zhou , An-Jie Dai , Chen Qian , Bao-Liang Zhong , Chen-Ling Liu , Zhen-Tao Liu","doi":"10.1016/j.jad.2024.11.022","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Crisis Intervention Hotline can effectively reduce suicide risk, but suffer from low connectivity rates and untimely crisis response. By integrating speech signals and deep learning to assist in crisis assessment, it is expected to enhanced the effectiveness of crisis intervention hotlines.</div></div><div><h3>Methods</h3><div>In this study, a crisis intervention hotline suicide risk speech dataset was constructed, and the speech was labeled based on the Modified Suicide Risk Scale. On the dataset, the variability of speech duration between different callers and different speech high-level features were explored across callers. Finally, this study proposed a data-theoretically dual-driven, gender-assisted speech crisis recognition method based on multi-tasking and deep learning, and the results of the model were obtained through five-fold cross-validation.</div></div><div><h3>Results</h3><div>Analysis of the dataset demonstrated gender differences in callers, with male callers speaking more in crisis calls compared to females. Feature analysis revealed significant differences between crisis callers in terms of emotional intensity of speech, speech rate and texture. The proposed method outperformed other methods with an F1 score of 96 % on the validation data, and feature visualization of the model also demonstrated the validity of the method.</div></div><div><h3>Limitations</h3><div>The sample size of this study was limited and ignored information from other modalities.</div></div><div><h3>Conclusion</h3><div>These findings demonstrated the effectiveness of the proposed model in speech crisis recognition, and the statistical data analysis enhanced the Interpretability of the model, while showing that the integration of data and theoretical knowledge facilitates the effectiveness of the method.</div></div>","PeriodicalId":14963,"journal":{"name":"Journal of affective disorders","volume":"370 ","pages":"Pages 392-400"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of affective disorders","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016503272401855X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Crisis Intervention Hotline can effectively reduce suicide risk, but suffer from low connectivity rates and untimely crisis response. By integrating speech signals and deep learning to assist in crisis assessment, it is expected to enhanced the effectiveness of crisis intervention hotlines.
Methods
In this study, a crisis intervention hotline suicide risk speech dataset was constructed, and the speech was labeled based on the Modified Suicide Risk Scale. On the dataset, the variability of speech duration between different callers and different speech high-level features were explored across callers. Finally, this study proposed a data-theoretically dual-driven, gender-assisted speech crisis recognition method based on multi-tasking and deep learning, and the results of the model were obtained through five-fold cross-validation.
Results
Analysis of the dataset demonstrated gender differences in callers, with male callers speaking more in crisis calls compared to females. Feature analysis revealed significant differences between crisis callers in terms of emotional intensity of speech, speech rate and texture. The proposed method outperformed other methods with an F1 score of 96 % on the validation data, and feature visualization of the model also demonstrated the validity of the method.
Limitations
The sample size of this study was limited and ignored information from other modalities.
Conclusion
These findings demonstrated the effectiveness of the proposed model in speech crisis recognition, and the statistical data analysis enhanced the Interpretability of the model, while showing that the integration of data and theoretical knowledge facilitates the effectiveness of the method.
期刊介绍:
The Journal of Affective Disorders publishes papers concerned with affective disorders in the widest sense: depression, mania, mood spectrum, emotions and personality, anxiety and stress. It is interdisciplinary and aims to bring together different approaches for a diverse readership. Top quality papers will be accepted dealing with any aspect of affective disorders, including neuroimaging, cognitive neurosciences, genetics, molecular biology, experimental and clinical neurosciences, pharmacology, neuroimmunoendocrinology, intervention and treatment trials.