{"title":"Bayesian Optimization With Tree Ensembles to Improve Depression Screening on Textual Datasets","authors":"Tingting Zhao;ML Tlachac","doi":"10.1109/TAFFC.2024.3442557","DOIUrl":null,"url":null,"abstract":"Improving digital depression screening is important for combating the global mental health crisis. Textual data are promising for depression screening due to their many origins, but the variety presents screening challenges. To improve depression screening with textual data, we propose eXtreme Gradient Boosting (XGBoost) with Bayesian Optimization (BO). We experiment with three different objective functions to optimize our models. We apply our models to screen for depression with three disparate textual datasets containing features extracted from transcripts, SMS text messages, and typed replies. When compared to seven other machine learning methods, our XGBoost with BO models demonstrated impressive generalizability across the datasets, achieving average balanced accuracy scores of 0.60, 0.67, and 0.69 with transcripts, SMS text messages, and typed replies, respectively. Our feature importance assessment revealed that the most important features for these three text types were respectively negative emotion, youth, and love lexical category frequencies. Overall, our research presents a promising depression screening method that offers generalizability across text types, explainability, and computational efficiency.","PeriodicalId":13131,"journal":{"name":"IEEE Transactions on Affective Computing","volume":"16 2","pages":"573-585"},"PeriodicalIF":9.8000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Affective Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10634776/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Improving digital depression screening is important for combating the global mental health crisis. Textual data are promising for depression screening due to their many origins, but the variety presents screening challenges. To improve depression screening with textual data, we propose eXtreme Gradient Boosting (XGBoost) with Bayesian Optimization (BO). We experiment with three different objective functions to optimize our models. We apply our models to screen for depression with three disparate textual datasets containing features extracted from transcripts, SMS text messages, and typed replies. When compared to seven other machine learning methods, our XGBoost with BO models demonstrated impressive generalizability across the datasets, achieving average balanced accuracy scores of 0.60, 0.67, and 0.69 with transcripts, SMS text messages, and typed replies, respectively. Our feature importance assessment revealed that the most important features for these three text types were respectively negative emotion, youth, and love lexical category frequencies. Overall, our research presents a promising depression screening method that offers generalizability across text types, explainability, and computational efficiency.
期刊介绍:
The IEEE Transactions on Affective Computing is an international and interdisciplinary journal. Its primary goal is to share research findings on the development of systems capable of recognizing, interpreting, and simulating human emotions and related affective phenomena. The journal publishes original research on the underlying principles and theories that explain how and why affective factors shape human-technology interactions. It also focuses on how techniques for sensing and simulating affect can enhance our understanding of human emotions and processes. Additionally, the journal explores the design, implementation, and evaluation of systems that prioritize the consideration of affect in their usability. We also welcome surveys of existing work that provide new perspectives on the historical and future directions of this field.