A Novel Cognitive Behavioral Therapy-Based Generative AI Tool (Socrates 2.0) to Facilitate Socratic Dialogue: Protocol for a Mixed Methods Feasibility Study.

IF 1.4 Q3 HEALTH CARE SCIENCES & SERVICES JMIR Research Protocols Pub Date : 2024-10-10 DOI:10.2196/58195
Philip Held, Sarah A Pridgen, Yaozhong Chen, Zuhaib Akhtar, Darpan Amin, Sean Pohorence
{"title":"A Novel Cognitive Behavioral Therapy-Based Generative AI Tool (Socrates 2.0) to Facilitate Socratic Dialogue: Protocol for a Mixed Methods Feasibility Study.","authors":"Philip Held, Sarah A Pridgen, Yaozhong Chen, Zuhaib Akhtar, Darpan Amin, Sean Pohorence","doi":"10.2196/58195","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Digital mental health tools, designed to augment traditional mental health treatments, are becoming increasingly important due to a wide range of barriers to accessing mental health care, including a growing shortage of clinicians. Most existing tools use rule-based algorithms, often leading to interactions that feel unnatural compared with human therapists. Large language models (LLMs) offer a solution for the development of more natural, engaging digital tools. In this paper, we detail the development of Socrates 2.0, which was designed to engage users in Socratic dialogue surrounding unrealistic or unhelpful beliefs, a core technique in cognitive behavioral therapies. The multiagent LLM-based tool features an artificial intelligence (AI) therapist, Socrates, which receives automated feedback from an AI supervisor and an AI rater. The combination of multiple agents appeared to help address common LLM issues such as looping, and it improved the overall dialogue experience. Initial user feedback from individuals with lived experiences of mental health problems as well as cognitive behavioral therapists has been positive. Moreover, tests in approximately 500 scenarios showed that Socrates 2.0 engaged in harmful responses in under 1% of cases, with the AI supervisor promptly correcting the dialogue each time. However, formal feasibility studies with potential end users are needed.</p><p><strong>Objective: </strong>This mixed methods study examines the feasibility of Socrates 2.0.</p><p><strong>Methods: </strong>On the basis of the initial data, we devised a formal feasibility study of Socrates 2.0 to gather qualitative and quantitative data about users' and clinicians' experience of interacting with the tool. Using a mixed method approach, the goal is to gather feasibility and acceptability data from 100 users and 50 clinicians to inform the eventual implementation of generative AI tools, such as Socrates 2.0, in mental health treatment. We designed this study to better understand how users and clinicians interact with the tool, including the frequency, length, and time of interactions, users' satisfaction with the tool overall, quality of each dialogue and individual responses, as well as ways in which the tool should be improved before it is used in efficacy trials. Descriptive and inferential analyses will be performed on data from validated usability measures. Thematic analysis will be performed on the qualitative data.</p><p><strong>Results: </strong>Recruitment will begin in February 2024 and is expected to conclude by February 2025. As of September 25, 2024, overall, 55 participants have been recruited.</p><p><strong>Conclusions: </strong>The development of Socrates 2.0 and the outlined feasibility study are important first steps in applying generative AI to mental health treatment delivery and lay the foundation for formal feasibility studies.</p><p><strong>International registered report identifier (irrid): </strong>DERR1-10.2196/58195.</p>","PeriodicalId":14755,"journal":{"name":"JMIR Research Protocols","volume":null,"pages":null},"PeriodicalIF":1.4000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11502974/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Research Protocols","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/58195","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Digital mental health tools, designed to augment traditional mental health treatments, are becoming increasingly important due to a wide range of barriers to accessing mental health care, including a growing shortage of clinicians. Most existing tools use rule-based algorithms, often leading to interactions that feel unnatural compared with human therapists. Large language models (LLMs) offer a solution for the development of more natural, engaging digital tools. In this paper, we detail the development of Socrates 2.0, which was designed to engage users in Socratic dialogue surrounding unrealistic or unhelpful beliefs, a core technique in cognitive behavioral therapies. The multiagent LLM-based tool features an artificial intelligence (AI) therapist, Socrates, which receives automated feedback from an AI supervisor and an AI rater. The combination of multiple agents appeared to help address common LLM issues such as looping, and it improved the overall dialogue experience. Initial user feedback from individuals with lived experiences of mental health problems as well as cognitive behavioral therapists has been positive. Moreover, tests in approximately 500 scenarios showed that Socrates 2.0 engaged in harmful responses in under 1% of cases, with the AI supervisor promptly correcting the dialogue each time. However, formal feasibility studies with potential end users are needed.

Objective: This mixed methods study examines the feasibility of Socrates 2.0.

Methods: On the basis of the initial data, we devised a formal feasibility study of Socrates 2.0 to gather qualitative and quantitative data about users' and clinicians' experience of interacting with the tool. Using a mixed method approach, the goal is to gather feasibility and acceptability data from 100 users and 50 clinicians to inform the eventual implementation of generative AI tools, such as Socrates 2.0, in mental health treatment. We designed this study to better understand how users and clinicians interact with the tool, including the frequency, length, and time of interactions, users' satisfaction with the tool overall, quality of each dialogue and individual responses, as well as ways in which the tool should be improved before it is used in efficacy trials. Descriptive and inferential analyses will be performed on data from validated usability measures. Thematic analysis will be performed on the qualitative data.

Results: Recruitment will begin in February 2024 and is expected to conclude by February 2025. As of September 25, 2024, overall, 55 participants have been recruited.

Conclusions: The development of Socrates 2.0 and the outlined feasibility study are important first steps in applying generative AI to mental health treatment delivery and lay the foundation for formal feasibility studies.

International registered report identifier (irrid): DERR1-10.2196/58195.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于认知行为疗法的新型生成式人工智能工具(苏格拉底 2.0)促进苏格拉底式对话:混合方法可行性研究协议》。
背景介绍数字心理健康工具旨在增强传统的心理健康治疗,由于获得心理健康护理的各种障碍,包括临床医生的日益短缺,数字心理健康工具正变得越来越重要。现有的大多数工具都使用基于规则的算法,与人类治疗师相比,往往导致交互感觉不自然。大型语言模型(LLM)为开发更自然、更吸引人的数字工具提供了解决方案。在本文中,我们详细介绍了苏格拉底 2.0 的开发过程,它旨在让用户围绕不现实或无益的信念进行苏格拉底式对话,这是认知行为疗法的核心技术。这款基于多代理 LLM 的工具以人工智能(AI)治疗师苏格拉底(Socrates)为特色,苏格拉底(Socrates)会接收来自 AI 监督员和 AI 评分员的自动反馈。多代理的组合似乎有助于解决常见的 LLM 问题,如循环问题,并改善了整体对话体验。有心理健康问题亲身经历的个人以及认知行为治疗师的初步用户反馈都很积极。此外,在大约 500 个场景中进行的测试表明,苏格拉底 2.0 在不到 1% 的情况下做出了有害的反应,而人工智能主管每次都会及时纠正对话。不过,还需要对潜在的最终用户进行正式的可行性研究:本混合方法研究探讨了苏格拉底 2.0 的可行性:在初步数据的基础上,我们设计了苏格拉底 2.0 的正式可行性研究,以收集有关用户和临床医生与该工具交互体验的定性和定量数据。采用混合方法,我们的目标是从 100 名用户和 50 名临床医生那里收集可行性和可接受性数据,为最终在心理健康治疗中使用苏格拉底 2.0 等生成式人工智能工具提供参考。我们设计这项研究的目的是为了更好地了解用户和临床医生如何与该工具互动,包括互动的频率、长度和时间,用户对工具的总体满意度,每次对话的质量和个人反应,以及该工具在用于疗效试验之前应如何改进。将对经过验证的可用性测量数据进行描述性和推论性分析。将对定性数据进行主题分析:招募工作将于 2024 年 2 月开始,预计于 2025 年 2 月结束。截至 2024 年 9 月 25 日,共招募了 55 名参与者:苏格拉底 2.0 的开发和概述的可行性研究是将生成式人工智能应用于心理健康治疗的重要第一步,并为正式的可行性研究奠定了基础:DERR1-10.2196/58195。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.40
自引率
5.90%
发文量
414
审稿时长
12 weeks
期刊最新文献
Noninvasive, Multimodal Inflammatory Biomarker Discovery for Systemic Inflammation (NOVA Study): Protocol for a Cross-Sectional Study. The CHALO! 2.0 mHealth-Based Multilevel Intervention to Promote HIV Testing and Linkage-to-Care Among Men Who Have Sex with Men in Mumbai, India: Protocol for a Randomized Controlled Trial. The COSMOS Registry of CytoSorb Hemoadsorption Therapy in Critically Ill Patients: Protocol for an International, Prospective Registry. The Effect of Effort During a Resistance Exercise Session on Glycemic Control in Individuals Living With Prediabetes or Type 2 Diabetes: Protocol for a Crossover Randomized Controlled Trial. Influence of Distraction Factors on Performance in Laparoscopic Surgery in Immersive Virtual Reality: Study Protocol of a Cross-Over Trial in Medical Students and Residents-DisLapVR.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1