Perspective on Harnessing Large Language Models to Uncover Insights in Diabetes Wearable Data

medRxiv - Health Informatics Pub Date : 2024-07-31 DOI:10.1101/2024.07.29.24310315

Arash Alavi, Kexin Cha, Delara P Esfarjani, Bhavesh Patel, Jennifer Li Pook Than, Aaron Y Lee, Camille Nebeker, Michael Snyder, Amir Bahmani

{"title":"Perspective on Harnessing Large Language Models to Uncover Insights in Diabetes Wearable Data","authors":"Arash Alavi, Kexin Cha, Delara P Esfarjani, Bhavesh Patel, Jennifer Li Pook Than, Aaron Y Lee, Camille Nebeker, Michael Snyder, Amir Bahmani","doi":"10.1101/2024.07.29.24310315","DOIUrl":null,"url":null,"abstract":"Large Language Models (LLMs) have gained significant attention and are increasingly used by researchers. Concurrently, publicly accessible datasets containing individual-level health information are becoming more available. Some of these datasets, such as the recently released Artificial Intelligence Ready and Equitable Atlas for Diabetes Insights (AI-READI) dataset, include individual-level data from digital wearable technologies. The application of LLMs to gain insights about health from wearable sensor data specific to diabetes is underexplored. This study presents a comprehensive evaluation of multiple LLMs, including GPT-3.5, GPT-4, GPT-4o, Gemini, Gemini 1.5 Pro, and Claude 3 Sonnet, on various diabetes research tasks using diverse prompting methods to evaluate their performance and gain new insights into diabetes and glucose dysregulation. Notably, GPT-4o showed promising performance across tasks with a chain-of-thought prompt design (aggregate performance score of 95.5%). Moreover, using this model, we identified new insights from the dataset, such as the heightened sensitivity to stress among diabetic participants during glucose level fluctuations, which underscores the complex interplay between metabolic and psychological factors. These results demonstrate that LLMs can enhance the pace of discovery and also enable automated interpretation of data for users of wearable devices, including both the research team and the individual wearing the device. Meanwhile, we also emphasize the critical limitations, such as privacy and ethical risks and dataset biases, that must be resolved for real-world application in diabetes health settings. This study highlights the potential and challenges of integrating LLMs into diabetes research and, more broadly, wearables, paving the way for future healthcare advancements, particularly in disadvantaged communities.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"149 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.07.29.24310315","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Large Language Models (LLMs) have gained significant attention and are increasingly used by researchers. Concurrently, publicly accessible datasets containing individual-level health information are becoming more available. Some of these datasets, such as the recently released Artificial Intelligence Ready and Equitable Atlas for Diabetes Insights (AI-READI) dataset, include individual-level data from digital wearable technologies. The application of LLMs to gain insights about health from wearable sensor data specific to diabetes is underexplored. This study presents a comprehensive evaluation of multiple LLMs, including GPT-3.5, GPT-4, GPT-4o, Gemini, Gemini 1.5 Pro, and Claude 3 Sonnet, on various diabetes research tasks using diverse prompting methods to evaluate their performance and gain new insights into diabetes and glucose dysregulation. Notably, GPT-4o showed promising performance across tasks with a chain-of-thought prompt design (aggregate performance score of 95.5%). Moreover, using this model, we identified new insights from the dataset, such as the heightened sensitivity to stress among diabetic participants during glucose level fluctuations, which underscores the complex interplay between metabolic and psychological factors. These results demonstrate that LLMs can enhance the pace of discovery and also enable automated interpretation of data for users of wearable devices, including both the research team and the individual wearing the device. Meanwhile, we also emphasize the critical limitations, such as privacy and ethical risks and dataset biases, that must be resolved for real-world application in diabetes health settings. This study highlights the potential and challenges of integrating LLMs into diabetes research and, more broadly, wearables, paving the way for future healthcare advancements, particularly in disadvantaged communities.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

透视利用大型语言模型挖掘糖尿病可穿戴数据中的洞察力

大型语言模型（LLMs）已受到广泛关注，并越来越多地被研究人员使用。与此同时，包含个人健康信息的可公开访问的数据集也越来越多。其中一些数据集，如最近发布的 "人工智能就绪与糖尿病洞察公平图集"（AI-READI）数据集，包含了来自数字可穿戴技术的个人层面数据。应用 LLM 从专门针对糖尿病的可穿戴传感器数据中获取健康洞察力的研究还很欠缺。本研究采用不同的提示方法，对多种 LLMs（包括 GPT-3.5、GPT-4、GPT-4o、Gemini、Gemini 1.5 Pro 和 Claude 3 Sonnet）在各种糖尿病研究任务中的表现进行了综合评估，以评价它们的性能，并获得有关糖尿病和血糖失调的新见解。值得注意的是，在采用思维链提示设计的任务中，GPT-4o 表现出色（总分 95.5%）。此外，利用该模型，我们还从数据集中发现了新的见解，例如糖尿病患者在血糖水平波动期间对压力的敏感性增强，这凸显了代谢和心理因素之间复杂的相互作用。这些结果表明，LLM 可以加快发现的速度，还能为可穿戴设备的用户（包括研究团队和佩戴设备的个人）自动解读数据。同时，我们也强调了在糖尿病健康环境中实际应用时必须解决的关键限制，如隐私和伦理风险以及数据集偏差。这项研究强调了将 LLMs 纳入糖尿病研究以及更广泛的可穿戴设备的潜力和挑战，为未来医疗保健的进步铺平了道路，尤其是在弱势群体中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

medRxiv - Health Informatics

自引率

0.00%

发文量