Bayesian Optimization With Tree Ensembles to Improve Depression Screening on Textual Datasets

IF 9.8 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Affective Computing Pub Date : 2024-08-13 DOI:10.1109/TAFFC.2024.3442557

Tingting Zhao;ML Tlachac

{"title":"Bayesian Optimization With Tree Ensembles to Improve Depression Screening on Textual Datasets","authors":"Tingting Zhao;ML Tlachac","doi":"10.1109/TAFFC.2024.3442557","DOIUrl":null,"url":null,"abstract":"Improving digital depression screening is important for combating the global mental health crisis. Textual data are promising for depression screening due to their many origins, but the variety presents screening challenges. To improve depression screening with textual data, we propose eXtreme Gradient Boosting (XGBoost) with Bayesian Optimization (BO). We experiment with three different objective functions to optimize our models. We apply our models to screen for depression with three disparate textual datasets containing features extracted from transcripts, SMS text messages, and typed replies. When compared to seven other machine learning methods, our XGBoost with BO models demonstrated impressive generalizability across the datasets, achieving average balanced accuracy scores of 0.60, 0.67, and 0.69 with transcripts, SMS text messages, and typed replies, respectively. Our feature importance assessment revealed that the most important features for these three text types were respectively negative emotion, youth, and love lexical category frequencies. Overall, our research presents a promising depression screening method that offers generalizability across text types, explainability, and computational efficiency.","PeriodicalId":13131,"journal":{"name":"IEEE Transactions on Affective Computing","volume":"16 2","pages":"573-585"},"PeriodicalIF":9.8000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Affective Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10634776/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Improving digital depression screening is important for combating the global mental health crisis. Textual data are promising for depression screening due to their many origins, but the variety presents screening challenges. To improve depression screening with textual data, we propose eXtreme Gradient Boosting (XGBoost) with Bayesian Optimization (BO). We experiment with three different objective functions to optimize our models. We apply our models to screen for depression with three disparate textual datasets containing features extracted from transcripts, SMS text messages, and typed replies. When compared to seven other machine learning methods, our XGBoost with BO models demonstrated impressive generalizability across the datasets, achieving average balanced accuracy scores of 0.60, 0.67, and 0.69 with transcripts, SMS text messages, and typed replies, respectively. Our feature importance assessment revealed that the most important features for these three text types were respectively negative emotion, youth, and love lexical category frequencies. Overall, our research presents a promising depression screening method that offers generalizability across text types, explainability, and computational efficiency.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用树集合进行贝叶斯优化，改进文本数据集的抑郁症筛查

改善数字抑郁症筛查对于应对全球精神卫生危机非常重要。文本数据是有希望的抑郁症筛查，由于他们的许多来源，但多样性提出了筛选的挑战。为了改进文本数据的抑郁筛选，我们提出了基于贝叶斯优化（BO）的极限梯度增强（XGBoost）方法。我们尝试了三种不同的目标函数来优化我们的模型。我们用三个不同的文本数据集来筛选抑郁症，这些数据集包含从抄本、短信和打字回复中提取的特征。与其他7种机器学习方法相比，我们的带有BO模型的XGBoost在数据集上表现出了令人印象深刻的泛化性，在转录本、短信和打字回复上分别实现了0.60、0.67和0.69的平均平衡准确率得分。我们的特征重要性评估显示，这三种文本类型最重要的特征分别是消极情绪、青春和爱情词汇类别频率。总的来说，我们的研究提出了一种很有前途的抑郁症筛查方法，它提供了跨文本类型的通用性，可解释性和计算效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Affective Computing COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, CYBERNETICS

CiteScore

15.00

自引率

6.20%

发文量

174

期刊介绍： The IEEE Transactions on Affective Computing is an international and interdisciplinary journal. Its primary goal is to share research findings on the development of systems capable of recognizing, interpreting, and simulating human emotions and related affective phenomena. The journal publishes original research on the underlying principles and theories that explain how and why affective factors shape human-technology interactions. It also focuses on how techniques for sensing and simulating affect can enhance our understanding of human emotions and processes. Additionally, the journal explores the design, implementation, and evaluation of systems that prioritize the consideration of affect in their usability. We also welcome surveys of existing work that provide new perspectives on the historical and future directions of this field.