首页 > 最新文献

Digital Biomarkers最新文献

英文 中文
How Much Speech Data Is Needed for Tracking Language Change in Alzheimer's Disease? A Comparison of Random Length, 5-Min, and 1-Min Spontaneous Speech Samples. 追踪阿尔茨海默病患者的语言变化需要多少语音数据?随机长度、5分钟和1分钟自发语音样本的比较。
Q1 Computer Science Pub Date : 2023-11-24 eCollection Date: 2023-01-01 DOI: 10.1159/000533423
Ulla Petti, Simon Baker, Anna Korhonen, Jessica Robin

Introduction: Changes in speech can act as biomarkers of cognitive decline in Alzheimer's disease (AD). While shorter speech samples would promote data collection and analysis, the minimum length of informative speech samples remains debated. This study aims to provide insight into the effect of sample length in analyzing longitudinal recordings of spontaneous speech in AD by comparing the original random length, 5- and 1-minute-long samples. We hope to understand whether capping the audio improves the accuracy of the analysis, and whether an extra 4 min conveys necessary information.

Methods: 110 spontaneous speech samples were collected from decades of Youtube videos of 17 public figures, 9 of whom eventually developed AD. 456 language features were extracted and their text-length-sensitivity, comparability, and ability to capture change over time were analyzed across three different sample lengths.

Results: Capped audio files had advantages over the random length ones. While most extracted features were statistically comparable or highly correlated across the datasets, potential effects of sample length should be acknowledged for some features. The 5-min dataset presented the highest reliability in tracking the evolution of the disease, suggesting that the 4 extra minutes do convey informative data.

Conclusion: Sample length seems to play an important role in extracting the language feature values from speech and tracking disease progress over time. We highlight the importance of further research into optimal sample length and standardization of methods when studying speech in AD.

语言变化可以作为阿尔茨海默病(AD)认知能力下降的生物标志物。虽然较短的语音样本可以促进数据收集和分析,但信息语音样本的最小长度仍然存在争议。本研究旨在通过比较原始随机长度、5分钟和1分钟的样本,深入了解样本长度对AD自发性语音纵向记录分析的影响。我们希望了解限制音频是否能够提高分析的准确性,以及额外的4分钟是否能够传达必要的信息。方法:从17位公众人物数十年的Youtube视频中收集110个自发语音样本,其中9位最终发展为AD。提取了456种语言特征,并在三种不同的样本长度上分析了它们的文本长度敏感性、可比性和捕获随时间变化的能力。结果:上限音频文件优于随机长度音频文件。虽然大多数提取的特征在数据集之间具有统计可比性或高度相关性,但对于某些特征,应该承认样本长度的潜在影响。5分钟的数据集在追踪疾病演变方面表现出最高的可靠性,这表明额外的4分钟确实传达了信息丰富的数据。结论:样本长度在提取语言特征值和追踪疾病进展中起着重要作用。我们强调了在研究AD语音时进一步研究最佳样本长度和标准化方法的重要性。
{"title":"How Much Speech Data Is Needed for Tracking Language Change in Alzheimer's Disease? A Comparison of Random Length, 5-Min, and 1-Min Spontaneous Speech Samples.","authors":"Ulla Petti, Simon Baker, Anna Korhonen, Jessica Robin","doi":"10.1159/000533423","DOIUrl":"10.1159/000533423","url":null,"abstract":"<p><strong>Introduction: </strong>Changes in speech can act as biomarkers of cognitive decline in Alzheimer's disease (AD). While shorter speech samples would promote data collection and analysis, the minimum length of informative speech samples remains debated. This study aims to provide insight into the effect of sample length in analyzing longitudinal recordings of spontaneous speech in AD by comparing the original random length, 5- and 1-minute-long samples. We hope to understand whether capping the audio improves the accuracy of the analysis, and whether an extra 4 min conveys necessary information.</p><p><strong>Methods: </strong>110 spontaneous speech samples were collected from decades of Youtube videos of 17 public figures, 9 of whom eventually developed AD. 456 language features were extracted and their text-length-sensitivity, comparability, and ability to capture change over time were analyzed across three different sample lengths.</p><p><strong>Results: </strong>Capped audio files had advantages over the random length ones. While most extracted features were statistically comparable or highly correlated across the datasets, potential effects of sample length should be acknowledged for some features. The 5-min dataset presented the highest reliability in tracking the evolution of the disease, suggesting that the 4 extra minutes do convey informative data.</p><p><strong>Conclusion: </strong>Sample length seems to play an important role in extracting the language feature values from speech and tracking disease progress over time. We highlight the importance of further research into optimal sample length and standardization of methods when studying speech in AD.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"157-166"},"PeriodicalIF":0.0,"publicationDate":"2023-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673351/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138458504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of a Prototype Video Head Impulse Test System Using an iPhone for Screening of Peripheral Vestibular Dysfunction. 使用iPhone开发用于筛查外周前庭功能障碍的视频头部脉冲测试系统原型。
Q1 Computer Science Pub Date : 2023-11-02 eCollection Date: 2023-01-01 DOI: 10.1159/000534543
Tatsuaki Kuroda, Kazuhiro Kuroda, Hiroaki Fushiki

Introduction: Head impulse, nystagmus, and test of skew (HINTS) is more accurate for the early diagnosis of occipital fossa stroke than magnetic resonance imaging. However, the head impulse test (HIT) is relatively challenging to perform, as it is subjective. Herein, we developed a prototype video HIT (vHIT) system using an iPhone (Apple, Cupertino, CA, USA) that is compact, easy to operate, and analyzable by our iPhone application.

Methods: The iPhone-vHIT and a vHIT using EyeSeeCam (Interacoustics, Eden Prairie, NM, USA) were performed on a healthy man in his 30s and on a patient with vestibular neuritis who visited the Mejiro University Ear Institute Clinic. For the iPhone-vHIT, eye movements were detected by analyzing high-speed videos captured using an iPhone camera, and head movements were followed using an iPhone gyro sensor. An iPhone fixation brace was used to capture the video without any blurring.

Results: The iPhone-vHIT system obtained vHIT waveforms similar to those of the EyeSeeCam-vHIT system in the healthy man and the patient with vestibular neuritis. The iPhone-vHIT system effectively detected the reduced vestibulo-ocular reflex gain in patients with vestibular neuritis. The iPhone-vHIT system at 120 frames per second was less sensitive to catch-up saccades than the EyeSeeCam.

Conclusion: vHIT systems using a smartphone have been reported but are currently unavailable. At present, the iPhone-vHIT application in this study is the only available smartphone-based vHIT system for screening of peripheral vestibular dysfunction. We believe that the prototype iPhone-vHIT with a high-speed camera will be clinically used to perform the vHIT, even though it only examines the lateral semicircular canal.

引言:与磁共振成像相比,头部冲动、眼球震颤和偏斜测试(HINTS)对枕窝卒中的早期诊断更准确。然而,头部冲击测试(HIT)相对来说具有挑战性,因为它是主观的。在此,我们使用iPhone(Apple,Cupertino,CA,USA)开发了一个原型视频HIT(vHIT)系统,该系统结构紧凑,易于操作,可通过我们的iPhone应用程序进行分析。方法:使用EyeSeeCam(Interacoustics,Eden Prairie,NM,USA)对一名30多岁的健康男性和一名到访梅吉罗大学耳朵研究所诊所的前庭神经炎患者进行iPhone vHIT和vHIT。对于iPhone vHIT,通过分析使用iPhone相机拍摄的高速视频来检测眼球运动,并使用iPhone陀螺仪传感器跟踪头部运动。一个iPhone固定支架被用来拍摄视频,没有任何模糊。结果:iPhone vHIT系统在健康男性和前庭神经炎患者中获得了与EyeSeeCam vHIT类似的vHIT波形。iPhone vHIT系统有效地检测到前庭神经炎患者前庭-眼反射增益的降低。每秒120帧的iPhone vHIT系统对追赶扫视的敏感度不如EyeSeeCam。结论:使用智能手机的vHIT系统已被报道,但目前不可用。目前,本研究中的iPhone vHIT应用程序是唯一可用的基于智能手机的vHIT系统,用于筛查外周前庭功能障碍。我们相信,带有高速摄像头的iPhone vHIT原型将在临床上用于进行vHIT,尽管它只检查侧半规管。
{"title":"Development of a Prototype Video Head Impulse Test System Using an iPhone for Screening of Peripheral Vestibular Dysfunction.","authors":"Tatsuaki Kuroda,&nbsp;Kazuhiro Kuroda,&nbsp;Hiroaki Fushiki","doi":"10.1159/000534543","DOIUrl":"https://doi.org/10.1159/000534543","url":null,"abstract":"<p><strong>Introduction: </strong>Head impulse, nystagmus, and test of skew (HINTS) is more accurate for the early diagnosis of occipital fossa stroke than magnetic resonance imaging. However, the head impulse test (HIT) is relatively challenging to perform, as it is subjective. Herein, we developed a prototype video HIT (vHIT) system using an iPhone (Apple, Cupertino, CA, USA) that is compact, easy to operate, and analyzable by our iPhone application.</p><p><strong>Methods: </strong>The iPhone-vHIT and a vHIT using EyeSeeCam (Interacoustics, Eden Prairie, NM, USA) were performed on a healthy man in his 30s and on a patient with vestibular neuritis who visited the Mejiro University Ear Institute Clinic. For the iPhone-vHIT, eye movements were detected by analyzing high-speed videos captured using an iPhone camera, and head movements were followed using an iPhone gyro sensor. An iPhone fixation brace was used to capture the video without any blurring.</p><p><strong>Results: </strong>The iPhone-vHIT system obtained vHIT waveforms similar to those of the EyeSeeCam-vHIT system in the healthy man and the patient with vestibular neuritis. The iPhone-vHIT system effectively detected the reduced vestibulo-ocular reflex gain in patients with vestibular neuritis. The iPhone-vHIT system at 120 frames per second was less sensitive to catch-up saccades than the EyeSeeCam.</p><p><strong>Conclusion: </strong>vHIT systems using a smartphone have been reported but are currently unavailable. At present, the iPhone-vHIT application in this study is the only available smartphone-based vHIT system for screening of peripheral vestibular dysfunction. We believe that the prototype iPhone-vHIT with a high-speed camera will be clinically used to perform the vHIT, even though it only examines the lateral semicircular canal.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"150-156"},"PeriodicalIF":0.0,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10622167/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71479318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Usability and Acceptability of a Corneal-Plane α-Opic Light Logger in a 24-h Field Trial. 角膜平面α-Opic光记录仪在24小时现场试验中的可用性和可接受性。
Q1 Computer Science Pub Date : 2023-09-19 eCollection Date: 2023-01-01 DOI: 10.1159/000531404
Eljoh Balajadia, Sophie Garcia, Janine Stampfli, Björn Schrader, Carolina Guidolin, Manuel Spitschan

Introduction: Exposure to light fundamentally influences human physiology and behaviour by synchronising our biological clock to the external light-dark cycle and controlling melatonin production. In addition to well-controlled laboratory studies, more naturalistic approaches to examining these "non-visual" effects of light have been developed in recent years. As naturalistic light exposure is quite unlike well-controlled stimulus conditions in the laboratory, it is critical to measure light exposure in a person-referenced way, the "spectral diet." To this end, light loggers have been developed to capture personalised light exposure. As an alternative to light sensors integrated into wrist-worn actimeters, pendants, or brooch-based light loggers, a recently developed wearable light logger laterally attached to spectacle frames enables the measurement of biologically relevant quantities in the corneal plane.

Methods: Here, we examine the usability and acceptability of using the light logger in an undergraduate student sample (n = 18, mean±1SD: 20.1 ± 1.7 years; 9 female; Oxford, UK) in real-world conditions during a 24-h measurement period. We probed the acceptability of the light logger using rating questionnaires and open-ended questions.

Results: Our quantitative results show a modest acceptability of the light logger. A thematic analysis of the open-ended questions reveals that the form factor of the device, in particular, size, weight, and stability, and reactions from other people to the wearer of the light logger, were commonly mentioned aspects.

Conclusion: In sum, the results indicate the miniaturisation of light loggers and "invisible" integration into extant everyday objects as key areas for future technological development, facilitating the availability of light exposure data for developing personalised intervention strategies in both research, clinical and consumer contexts.

简介:暴露在光下通过使我们的生物钟与外部明暗循环同步并控制褪黑激素的产生,从根本上影响人类的生理和行为。除了控制良好的实验室研究外,近年来还开发了更自然的方法来检查光的这些“非视觉”效果。由于自然光照与实验室中控制良好的刺激条件截然不同,因此以一种以人为参考的方式测量光照至关重要,即“光谱饮食”。为此,已经开发了光记录仪来捕捉个性化的光照。作为集成在手腕佩戴的活动计、吊坠或胸针式光记录仪中的光传感器的替代品,最近开发的一种横向连接到眼镜架上的可穿戴光记录仪能够测量角膜平面中的生物相关量。方法:在这里,我们检查了在24小时的测量期间,在真实世界条件下使用光记录仪的本科生样本(n=18,平均±1SD:20.1±1.7岁;9名女性;英国牛津)的可用性和可接受性。我们使用评分问卷和开放式问题来探讨光记录仪的可接受性。结果:我们的定量结果显示了光记录仪的适度可接受性。对开放式问题的主题分析表明,设备的形状因素,特别是尺寸、重量和稳定性,以及其他人对光记录仪佩戴者的反应,是经常提到的方面。结论:总之,研究结果表明,光记录仪的小型化和与现存日常物品的“隐形”集成是未来技术发展的关键领域,有助于获得光暴露数据,以便在研究、临床和消费者环境中制定个性化干预策略。
{"title":"Usability and Acceptability of a Corneal-Plane α-Opic Light Logger in a 24-h Field Trial.","authors":"Eljoh Balajadia, Sophie Garcia, Janine Stampfli, Björn Schrader, Carolina Guidolin, Manuel Spitschan","doi":"10.1159/000531404","DOIUrl":"10.1159/000531404","url":null,"abstract":"<p><strong>Introduction: </strong>Exposure to light fundamentally influences human physiology and behaviour by synchronising our biological clock to the external light-dark cycle and controlling melatonin production. In addition to well-controlled laboratory studies, more naturalistic approaches to examining these \"non-visual\" effects of light have been developed in recent years. As naturalistic light exposure is quite unlike well-controlled stimulus conditions in the laboratory, it is critical to measure light exposure in a person-referenced way, the \"spectral diet.\" To this end, light loggers have been developed to capture personalised light exposure. As an alternative to light sensors integrated into wrist-worn actimeters, pendants, or brooch-based light loggers, a recently developed wearable light logger laterally attached to spectacle frames enables the measurement of biologically relevant quantities in the corneal plane.</p><p><strong>Methods: </strong>Here, we examine the usability and acceptability of using the light logger in an undergraduate student sample (<i>n</i> = 18, mean±1SD: 20.1 ± 1.7 years; 9 female; Oxford, UK) in real-world conditions during a 24-h measurement period. We probed the acceptability of the light logger using rating questionnaires and open-ended questions.</p><p><strong>Results: </strong>Our quantitative results show a modest acceptability of the light logger. A thematic analysis of the open-ended questions reveals that the form factor of the device, in particular, size, weight, and stability, and reactions from other people to the wearer of the light logger, were commonly mentioned aspects.</p><p><strong>Conclusion: </strong>In sum, the results indicate the miniaturisation of light loggers and \"invisible\" integration into extant everyday objects as key areas for future technological development, facilitating the availability of light exposure data for developing personalised intervention strategies in both research, clinical and consumer contexts.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"139-149"},"PeriodicalIF":0.0,"publicationDate":"2023-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601946/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71411123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regulatory Qualification of a Cross-Disease Digital Measure: Benefits and Challenges from the Perspective of IMI Consortium IDEA-FAST. 跨疾病数字测量的监管资格:IMI联盟IDEA-FAST视角下的利益和挑战。
Q1 Computer Science Pub Date : 2023-09-19 eCollection Date: 2023-01-01 DOI: 10.1159/000533189
David Nobbs, Wojciech Piwko, Christopher Bull, Francesca Cormack, Teemu Ahmaniemi, Sebastian C Holst, Meenakshi Chatterjee, Walter Maetzler, Stefan Avey, Wan Fai Ng

Background: Innovative Medicines Initiative (IMI) consortium IDEA-FAST is developing novel digital measures of fatigue, sleep quality, and impact of sleep disturbances for neurodegenerative diseases and immune-mediated inflammatory diseases. In 2022, the consortium met with the European Medicines Agency (EMA) to receive advice on its plans for regulatory qualification of the measures. This viewpoint reviews the IDEA-FAST perspective on developing digital measures for multiple diseases and the advice provided by the EMA.

Summary: The EMA considered a cross-disease measure an interesting and arguably feasible concept. Developers should account for the need for a strong rationale that the clinical features to be measured are similar across diseases. In addition, they may expect increased complexity of study design, challenges when managing differences within and between disease populations, and the need for validation in both heterogeneous and homogeneous populations.

Key messages: EMA highlighted the challenges teams may encounter when developing a cross-disease measure, though benefits potentially include reduced resources for the technology developer and health authority, faster access to innovation across different therapeutic fields, and feasibility of cross-disease comparisons. The insights included here can be used by project teams to guide them in the development of cross-disease digital measures intended for regulatory qualification.

背景:创新药物倡议(IMI)联盟IDEA-FAST正在开发新的疲劳、睡眠质量和睡眠障碍对神经退行性疾病和免疫介导的炎症性疾病影响的数字测量方法。2022年,该财团与欧洲药品管理局(EMA)会面,就其措施的监管资格计划征求意见。这一观点回顾了IDEA-FAST对开发多种疾病的数字测量的观点以及EMA提供的建议。总结:EMA认为跨疾病测量是一个有趣且可行的概念。开发人员应该考虑到需要一个强有力的理由,即要测量的临床特征在不同疾病中是相似的。此外,他们可能预计研究设计的复杂性会增加,在管理疾病人群内部和之间的差异时会遇到挑战,以及需要在异质和同质人群中进行验证。关键信息:EMA强调了团队在制定跨疾病措施时可能遇到的挑战,尽管其好处可能包括减少技术开发人员和卫生当局的资源,更快地获得不同治疗领域的创新,以及跨疾病比较的可行性。项目团队可以使用此处包含的见解来指导他们制定用于监管资格的跨疾病数字措施。
{"title":"Regulatory Qualification of a Cross-Disease Digital Measure: Benefits and Challenges from the Perspective of IMI Consortium IDEA-FAST.","authors":"David Nobbs, Wojciech Piwko, Christopher Bull, Francesca Cormack, Teemu Ahmaniemi, Sebastian C Holst, Meenakshi Chatterjee, Walter Maetzler, Stefan Avey, Wan Fai Ng","doi":"10.1159/000533189","DOIUrl":"10.1159/000533189","url":null,"abstract":"<p><strong>Background: </strong>Innovative Medicines Initiative (IMI) consortium IDEA-FAST is developing novel digital measures of fatigue, sleep quality, and impact of sleep disturbances for neurodegenerative diseases and immune-mediated inflammatory diseases. In 2022, the consortium met with the European Medicines Agency (EMA) to receive advice on its plans for regulatory qualification of the measures. This viewpoint reviews the IDEA-FAST perspective on developing digital measures for multiple diseases and the advice provided by the EMA.</p><p><strong>Summary: </strong>The EMA considered a cross-disease measure an interesting and arguably feasible concept. Developers should account for the need for a strong rationale that the clinical features to be measured are similar across diseases. In addition, they may expect increased complexity of study design, challenges when managing differences within and between disease populations, and the need for validation in both heterogeneous and homogeneous populations.</p><p><strong>Key messages: </strong>EMA highlighted the challenges teams may encounter when developing a cross-disease measure, though benefits potentially include reduced resources for the technology developer and health authority, faster access to innovation across different therapeutic fields, and feasibility of cross-disease comparisons. The insights included here can be used by project teams to guide them in the development of cross-disease digital measures intended for regulatory qualification.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"132-138"},"PeriodicalIF":0.0,"publicationDate":"2023-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601930/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71411121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Case for the Patient-Centric Development of Novel Digital Sleep Assessment Tools in Major Depressive Disorder. 以患者为中心开发新型数字睡眠评估工具治疗重度抑郁症的案例。
Q1 Computer Science Pub Date : 2023-09-11 eCollection Date: 2023-01-01 DOI: 10.1159/000533523
Nele Peerenboom, Suvekshya Aryal, Jennifer M Blankenship, Tracy Swibas, Yaya Zhai, Ieuan Clay, Kate Lyden

Background: Depression imposes a major burden on public health as the leading cause of disability worldwide. Sleep disturbance is a core symptom of depression that affects the vast majority of patients. Nonetheless, it is frequently not resolved by depression treatment and may even be worsened through some pharmaceutical interventions. Disturbed sleep negatively impact patients' quality of life, and persistent sleep disturbance increases the risk of recurrence, relapse, and even suicide. However, the development of novel treatments that might improve sleep problems is hindered by the lack of reliable low-burden objective measures that can adequately assess disturbed sleep in this population.

Summary: Developing improved digital measurement tools that are fit for use in clinical trials for major depressive disorder could promote the inclusion of sleep as a focus for treatment, clinical drug development, and research. This perspective piece explores the path toward the development of novel digital measures, reviews the existing evidence on the meaningfulness of sleep in depression, and summarizes existing methods of sleep assessments, including the use of digital health technologies.

Key messages: Our objective was to make a clear call to action and path forward for the qualification of new digital outcome measures which would enable assessment of sleep disturbance as an aspect of health that truly matters to patients, promoting sleep as an important outcome for clinical development, and ultimately ensure that disturbed sleep will not remain the forgotten symptom of depression.

背景:抑郁症作为全球残疾的主要原因,给公共卫生带来了重大负担。睡眠障碍是影响绝大多数患者的抑郁症的核心症状。尽管如此,它通常无法通过抑郁症治疗得到解决,甚至可能通过一些药物干预而恶化。睡眠障碍会对患者的生活质量产生负面影响,持续的睡眠障碍会增加复发、复发甚至自杀的风险。然而,由于缺乏可靠的低负担客观指标来充分评估这一人群的睡眠障碍,可能改善睡眠问题的新疗法的开发受到了阻碍。总结:开发适用于重度抑郁症临床试验的改进的数字测量工具,可以促进将睡眠作为治疗、临床药物开发和研究的重点。这篇前瞻性文章探索了新的数字测量方法的发展道路,回顾了关于抑郁症患者睡眠意义的现有证据,并总结了现有的睡眠评估方法,包括数字健康技术的使用。关键信息:我们的目标是明确呼吁采取行动并确定新的数字结果指标的资格,这将使评估睡眠障碍成为对患者真正重要的健康方面,促进睡眠成为临床发展的重要结果,并最终确保睡眠紊乱不会成为被遗忘的抑郁症症状。
{"title":"The Case for the Patient-Centric Development of Novel Digital Sleep Assessment Tools in Major Depressive Disorder.","authors":"Nele Peerenboom, Suvekshya Aryal, Jennifer M Blankenship, Tracy Swibas, Yaya Zhai, Ieuan Clay, Kate Lyden","doi":"10.1159/000533523","DOIUrl":"10.1159/000533523","url":null,"abstract":"<p><strong>Background: </strong>Depression imposes a major burden on public health as the leading cause of disability worldwide. Sleep disturbance is a core symptom of depression that affects the vast majority of patients. Nonetheless, it is frequently not resolved by depression treatment and may even be worsened through some pharmaceutical interventions. Disturbed sleep negatively impact patients' quality of life, and persistent sleep disturbance increases the risk of recurrence, relapse, and even suicide. However, the development of novel treatments that might improve sleep problems is hindered by the lack of reliable low-burden objective measures that can adequately assess disturbed sleep in this population.</p><p><strong>Summary: </strong>Developing improved digital measurement tools that are fit for use in clinical trials for major depressive disorder could promote the inclusion of sleep as a focus for treatment, clinical drug development, and research. This perspective piece explores the path toward the development of novel digital measures, reviews the existing evidence on the meaningfulness of sleep in depression, and summarizes existing methods of sleep assessments, including the use of digital health technologies.</p><p><strong>Key messages: </strong>Our objective was to make a clear call to action and path forward for the qualification of new digital outcome measures which would enable assessment of sleep disturbance as an aspect of health that truly matters to patients, promoting sleep as an important outcome for clinical development, and ultimately ensure that disturbed sleep will not remain the forgotten symptom of depression.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"124-131"},"PeriodicalIF":0.0,"publicationDate":"2023-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601929/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71411122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Validation of an Automated Speech Analysis of Cognitive Tasks within a Semiautomated Phone Assessment. 半自动电话评估中认知任务的自动语音分析的验证。
Q1 Computer Science Pub Date : 2023-08-31 eCollection Date: 2023-01-01 DOI: 10.1159/000533188
Daphne Ter Huurne, Nina Possemis, Leonie Banning, Angélique Gruters, Alexandra König, Nicklas Linz, Johannes Tröger, Kai Langel, Frans Verhey, Marjolein de Vugt, Inez Ramakers

Introduction: We studied the accuracy of the automatic speech recognition (ASR) software by comparing ASR scores with manual scores from a verbal learning test (VLT) and a semantic verbal fluency (SVF) task in a semiautomated phone assessment in a memory clinic population. Furthermore, we examined the differentiating value of these tests between participants with subjective cognitive decline (SCD) and mild cognitive impairment (MCI). We also investigated whether the automatically calculated speech and linguistic features had an additional value compared to the commonly used total scores in a semiautomated phone assessment.

Methods: We included 94 participants from the memory clinic of the Maastricht University Medical Center+ (SCD N = 56 and MCI N = 38). The test leader guided the participant through a semiautomated phone assessment. The VLT and SVF were audio recorded and processed via a mobile application. The recall count and speech and linguistic features were automatically extracted. The diagnostic groups were classified by training machine learning classifiers to differentiate SCD and MCI participants.

Results: The intraclass correlation for inter-rater reliability between the manual and the ASR total word count was 0.89 (95% CI 0.09-0.97) for the VLT immediate recall, 0.94 (95% CI 0.68-0.98) for the VLT delayed recall, and 0.93 (95% CI 0.56-0.97) for the SVF. The full model including the total word count and speech and linguistic features had an area under the curve of 0.81 and 0.77 for the VLT immediate and delayed recall, respectively, and 0.61 for the SVF.

Conclusion: There was a high agreement between the ASR and manual scores, keeping the broad confidence intervals in mind. The phone-based VLT was able to differentiate between SCD and MCI and can have opportunities for clinical trial screening.

引言:我们在记忆诊所人群的半自动电话评估中,通过将ASR分数与语言学习测试(VLT)和语义语言流利性(SVF)任务的手动分数进行比较,研究了自动语音识别(ASR)软件的准确性。此外,我们还检验了这些测试在主观认知能力下降(SCD)和轻度认知障碍(MCI)参与者之间的区分价值。我们还调查了在半自动电话评估中,与常用的总分相比,自动计算的语音和语言特征是否具有附加值。方法:我们纳入了来自马斯特里赫特大学医学中心+记忆诊所的94名参与者(SCD N=56,MCI N=38)。测试负责人指导参与者进行半自动电话评估。VLT和SVF通过移动应用程序进行音频记录和处理。自动提取回忆次数以及语音和语言特征。通过训练机器学习分类器对诊断组进行分类,以区分SCD和MCI参与者。结果:对于VLT即时回忆,手册和ASR总字数之间的评分者间信度的组内相关性为0.89(95%CI 0.09-0.97),对于VLT延迟回忆为0.94(95%CI 0.68-0.98),对于SVF为0.93(95%CI 0.56-0.97)。包括总字数、语音和语言特征在内的完整模型的VLT立即回忆和延迟回忆的曲线下面积分别为0.81和0.77,SVF的曲线下区域为0.61。结论:ASR和手动评分之间有很高的一致性,记住了广泛的置信区间。基于手机的VLT能够区分SCD和MCI,并有机会进行临床试验筛查。
{"title":"Validation of an Automated Speech Analysis of Cognitive Tasks within a Semiautomated Phone Assessment.","authors":"Daphne Ter Huurne, Nina Possemis, Leonie Banning, Angélique Gruters, Alexandra König, Nicklas Linz, Johannes Tröger, Kai Langel, Frans Verhey, Marjolein de Vugt, Inez Ramakers","doi":"10.1159/000533188","DOIUrl":"10.1159/000533188","url":null,"abstract":"<p><strong>Introduction: </strong>We studied the accuracy of the automatic speech recognition (ASR) software by comparing ASR scores with manual scores from a verbal learning test (VLT) and a semantic verbal fluency (SVF) task in a semiautomated phone assessment in a memory clinic population. Furthermore, we examined the differentiating value of these tests between participants with subjective cognitive decline (SCD) and mild cognitive impairment (MCI). We also investigated whether the automatically calculated speech and linguistic features had an additional value compared to the commonly used total scores in a semiautomated phone assessment.</p><p><strong>Methods: </strong>We included 94 participants from the memory clinic of the Maastricht University Medical Center+ (SCD <i>N</i> = 56 and MCI <i>N</i> = 38). The test leader guided the participant through a semiautomated phone assessment. The VLT and SVF were audio recorded and processed via a mobile application. The recall count and speech and linguistic features were automatically extracted. The diagnostic groups were classified by training machine learning classifiers to differentiate SCD and MCI participants.</p><p><strong>Results: </strong>The intraclass correlation for inter-rater reliability between the manual and the ASR total word count was 0.89 (95% CI 0.09-0.97) for the VLT immediate recall, 0.94 (95% CI 0.68-0.98) for the VLT delayed recall, and 0.93 (95% CI 0.56-0.97) for the SVF. The full model including the total word count and speech and linguistic features had an area under the curve of 0.81 and 0.77 for the VLT immediate and delayed recall, respectively, and 0.61 for the SVF.</p><p><strong>Conclusion: </strong>There was a high agreement between the ASR and manual scores, keeping the broad confidence intervals in mind. The phone-based VLT was able to differentiate between SCD and MCI and can have opportunities for clinical trial screening.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"115-123"},"PeriodicalIF":0.0,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601928/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71411124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the Potential of Apple SensorKit and Digital Phenotyping Data as New Digital Biomarkers for Mental Health Research. 探索Apple SensorKit和数字表型数据作为心理健康研究新数字生物标志物的潜力。
Q1 Computer Science Pub Date : 2023-08-25 eCollection Date: 2023-01-01 DOI: 10.1159/000530698
Carsten Langholm, Tobias Kowatsch, Sandra Bucci, Andrea Cipriani, John Torous

The use of digital phenotyping continues to expand across all fields of health. By collecting quantitative data in real-time using devices such as smartphones or smartwatches, researchers and clinicians can develop a profile of a wide range of conditions. Smartphones contain sensors that collect data, such as GPS or accelerometer data, which can inform secondary metrics such as time spent at home, location entropy, or even sleep duration. These metrics, when used as digital biomarkers, are not only used to investigate the relationship between behavior and health symptoms but can also be used to support personalized and preventative care. Successful phenotyping requires consistent long-term collection of relevant and high-quality data. In this paper, we present the potential of newly available, for approved research, opt-in SensorKit sensors on iOS devices in improving the accuracy of digital phenotyping. We collected opt-in sensor data over 1 week from a single person with depression using the open-source mindLAMP app developed by the Division of Digital Psychiatry at Beth Israel Deaconess Medical Center. Five sensors from SensorKit were included. The names of the sensors, as listed in official documentation, include the following: phone usage, messages usage, visits, device usage, and ambient light. We compared data from these five new sensors from SensorKit to our current digital phenotyping data collection sensors to assess similarity and differences in both raw and processed data. We present sample data from all five of these new sensors. We also present sample data from current digital phenotyping sources and compare these data to SensorKit sensors when applicable. SensorKit offers great potential for health research. Many SensorKit sensors improve upon previously accessible features and produce data that appears clinically relevant. SensorKit sensors will likely play a substantial role in digital phenotyping. However, using these data requires advanced health app infrastructure and the ability to securely store high-frequency data.

数字表型的使用继续扩展到卫生的所有领域。通过使用智能手机或智能手表等设备实时收集定量数据,研究人员和临床医生可以了解各种情况。智能手机包含收集数据的传感器,如GPS或加速度计数据,这些数据可以告知次要指标,如在家的时间、位置熵,甚至睡眠时间。这些指标用作数字生物标志物时,不仅用于研究行为和健康症状之间的关系,还可用于支持个性化和预防性护理。成功的表型分析需要长期收集相关的高质量数据。在这篇论文中,我们展示了新获得的、用于批准研究的、在iOS设备上选择SensorKit传感器在提高数字表型准确性方面的潜力。我们使用贝斯以色列女执事医疗中心数字精神病学部门开发的开源mindLAMP应用程序,从一名抑郁症患者身上收集了一周多的选择加入传感器数据。包括SensorKit的五个传感器。官方文档中列出的传感器名称包括以下内容:电话使用、信息使用、访问、设备使用和环境光照。我们将SensorKit的这五个新传感器的数据与我们目前的数字表型数据收集传感器进行了比较,以评估原始数据和处理数据的相似性和差异性。我们展示了所有五个新传感器的样本数据。我们还提供了来自当前数字表型来源的样本数据,并在适用时将这些数据与SensorKit传感器进行比较。SensorKit为健康研究提供了巨大的潜力。许多SensorKit传感器改进了以前可访问的功能,并产生了与临床相关的数据。SensorKit传感器可能在数字表型中发挥重要作用。然而,使用这些数据需要先进的健康应用程序基础设施和安全存储高频数据的能力。
{"title":"Exploring the Potential of Apple SensorKit and Digital Phenotyping Data as New Digital Biomarkers for Mental Health Research.","authors":"Carsten Langholm, Tobias Kowatsch, Sandra Bucci, Andrea Cipriani, John Torous","doi":"10.1159/000530698","DOIUrl":"10.1159/000530698","url":null,"abstract":"<p><p>The use of digital phenotyping continues to expand across all fields of health. By collecting quantitative data in real-time using devices such as smartphones or smartwatches, researchers and clinicians can develop a profile of a wide range of conditions. Smartphones contain sensors that collect data, such as GPS or accelerometer data, which can inform secondary metrics such as time spent at home, location entropy, or even sleep duration. These metrics, when used as digital biomarkers, are not only used to investigate the relationship between behavior and health symptoms but can also be used to support personalized and preventative care. Successful phenotyping requires consistent long-term collection of relevant and high-quality data. In this paper, we present the potential of newly available, for approved research, opt-in SensorKit sensors on iOS devices in improving the accuracy of digital phenotyping. We collected opt-in sensor data over 1 week from a single person with depression using the open-source mindLAMP app developed by the Division of Digital Psychiatry at Beth Israel Deaconess Medical Center. Five sensors from SensorKit were included. The names of the sensors, as listed in official documentation, include the following: <i>phone usage</i>, <i>messages usage</i>, <i>visits</i>, <i>device usage</i>, and <i>ambient light</i>. We compared data from these five new sensors from SensorKit to our current digital phenotyping data collection sensors to assess similarity and differences in both raw and processed data. We present sample data from all five of these new sensors. We also present sample data from current digital phenotyping sources and compare these data to SensorKit sensors when applicable. SensorKit offers great potential for health research. Many SensorKit sensors improve upon previously accessible features and produce data that appears clinically relevant. SensorKit sensors will likely play a substantial role in digital phenotyping. However, using these data requires advanced health app infrastructure and the ability to securely store high-frequency data.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"104-114"},"PeriodicalIF":0.0,"publicationDate":"2023-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601905/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71411120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Real-World Video Analysis of Sit-to-Stand Transitions Predicts Parkinson's Disease Severity. 自动真实世界坐立转换视频分析可预测帕金森病的严重程度
Q1 Computer Science Pub Date : 2023-08-14 eCollection Date: 2023-01-01 DOI: 10.1159/000530953
Catherine Morgan, Alessandro Masullo, Majid Mirmehdi, Hanna Kristiina Isotalus, Ferdian Jovan, Ryan McConville, Emma L Tonkin, Alan Whone, Ian Craddock

Introduction: Technology holds the potential to track disease progression and response to neuroprotective therapies in Parkinson's disease (PD). The sit-to-stand (STS) transition is a frequently occurring event which is important to people with PD. The aim of this study was to demonstrate an automatic approach to quantify STS duration and speed using a real-world free-living dataset and look at clinical correlations of the outcomes, including whether STS parameters change when someone withholds PD medications.

Methods: Eighty-five hours of video data were collected from 24 participants staying in pairs for 5-day periods in a naturalistic setting. Skeleton joints were extracted from the video data; the head trajectory was estimated and used to estimate the STS parameters of duration and speed.

Results: 3.14 STS transitions were seen per hour per person on average. Significant correlations were seen between automatic and manual STS duration (Pearson rho - 0.419, p = 0.042) and between automatic STS speed and manual STS duration (Pearson rho - 0.780, p < 0.001). Significant and strong correlations were seen between the gold-standard clinical rating scale scores and both STS duration and STS speed; these correlations were not seen in the STS transitions when the participants were carrying something in their hand(s). Significant differences were seen at the cohort level between control and PD participants' ON medications' STS duration (U = 6,263, p = 0.018) and speed (U = 9,965, p < 0.001). At an individual level, only two participants with PD became significantly slower to STS when they were OFF medications; withholding medications did not significantly change STS duration at an individual level in any participant.

Conclusion: We demonstrate a novel approach to automatically quantify and ecologically validate two STS parameters which correlate with gold-standard clinical tools measuring disease severity in PD.

简介:技术有望跟踪帕金森病(PD)的病情发展和对神经保护疗法的反应。坐立转换(STS)是一个经常发生的过程,对帕金森病患者非常重要。本研究旨在展示一种自动方法,利用真实世界的自由生活数据集量化坐立转换的持续时间和速度,并研究结果的临床相关性,包括当患者停用帕金森病药物时,坐立转换参数是否会发生变化:方法: 在自然环境中,收集了 24 名参与者 5 天内两人一组的 85 小时视频数据。从视频数据中提取骨骼关节;估算头部轨迹并用于估算持续时间和速度的 STS 参数:结果:平均每人每小时看到 3.14 次 STS 过渡。自动 STS 持续时间与手动 STS 持续时间之间存在显著相关性(Pearson rho - 0.419,p = 0.042),自动 STS 速度与手动 STS 持续时间之间也存在显著相关性(Pearson rho - 0.780,p < 0.001)。金标准临床评分量表得分与STS持续时间和STS速度之间存在显著的强相关性;但在参与者手持物品时的STS转换中则未发现这些相关性。在组群水平上,对照组和帕金森病患者服用 ON 药物时的 STS 持续时间(U = 6,263, p = 0.018)和速度(U = 9,965, p < 0.001)之间存在显著差异。在个体水平上,只有两名帕金森氏症患者在停药后STS明显变慢;在个体水平上,停药并没有显著改变任何参与者的STS持续时间:我们展示了一种自动量化和生态验证两个 STS 参数的新方法,这两个参数与衡量帕金森病疾病严重程度的黄金标准临床工具相关。
{"title":"Automated Real-World Video Analysis of Sit-to-Stand Transitions Predicts Parkinson's Disease Severity.","authors":"Catherine Morgan, Alessandro Masullo, Majid Mirmehdi, Hanna Kristiina Isotalus, Ferdian Jovan, Ryan McConville, Emma L Tonkin, Alan Whone, Ian Craddock","doi":"10.1159/000530953","DOIUrl":"10.1159/000530953","url":null,"abstract":"<p><strong>Introduction: </strong>Technology holds the potential to track disease progression and response to neuroprotective therapies in Parkinson's disease (PD). The sit-to-stand (STS) transition is a frequently occurring event which is important to people with PD. The aim of this study was to demonstrate an automatic approach to quantify STS duration and speed using a real-world free-living dataset and look at clinical correlations of the outcomes, including whether STS parameters change when someone withholds PD medications.</p><p><strong>Methods: </strong>Eighty-five hours of video data were collected from 24 participants staying in pairs for 5-day periods in a naturalistic setting. Skeleton joints were extracted from the video data; the head trajectory was estimated and used to estimate the STS parameters of duration and speed.</p><p><strong>Results: </strong>3.14 STS transitions were seen per hour per person on average. Significant correlations were seen between automatic and manual STS duration (Pearson rho - 0.419, <i>p</i> = 0.042) and between automatic STS speed and manual STS duration (Pearson rho - 0.780, <i>p</i> < 0.001). Significant and strong correlations were seen between the gold-standard clinical rating scale scores and both STS duration and STS speed; these correlations were not seen in the STS transitions when the participants were carrying something in their hand(s). Significant differences were seen at the cohort level between control and PD participants' ON medications' STS duration (U = 6,263, <i>p</i> = 0.018) and speed (U = 9,965, <i>p</i> < 0.001). At an individual level, only two participants with PD became significantly slower to STS when they were OFF medications; withholding medications did not significantly change STS duration at an individual level in any participant.</p><p><strong>Conclusion: </strong>We demonstrate a novel approach to automatically quantify and ecologically validate two STS parameters which correlate with gold-standard clinical tools measuring disease severity in PD.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"92-103"},"PeriodicalIF":0.0,"publicationDate":"2023-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/40/91/dib-2023-0007-0001-530953.PMC10425718.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10022613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clinical Validation of Novel Digital Measures: Statistical Methods for Reliability Evaluation. 新型数字测量的临床验证:可靠性评估统计方法。
Q1 Computer Science Pub Date : 2023-08-09 eCollection Date: 2023-01-01 DOI: 10.1159/000531054
Bohdana Ratitch, Andrew Trigg, Madhurima Majumder, Vanja Vlajnic, Nicole Rethemeier, Richard Nkulikiyinka

Background: Assessment of reliability is one of the key components of the validation process designed to demonstrate that a novel clinical measure assessed by a digital health technology tool is fit-for-purpose in clinical research, care, and decision-making. Reliability assessment contributes to characterization of the signal-to-noise ratio and measurement error and is the first indicator of potential usefulness of the proposed clinical measure.

Summary: Methodologies for reliability analyses are scattered across literature on validation of PROs, wet biomarkers, etc., yet are equally useful for digital clinical measures. We review a general modeling framework and statistical metrics typically used for reliability assessments as part of the clinical validation. We also present methods for the assessment of agreement and measurement error, alongside modified approaches for categorical measures. We illustrate the discussed techniques using physical activity data from a wearable device with an accelerometer sensor collected in clinical trial participants.

Key messages: This paper provides statisticians and data scientists, involved in development and validation of novel digital clinical measures, an overview of the statistical methodologies and analytical tools for reliability assessment.

背景:可靠性评估是验证过程的关键组成部分之一,旨在证明由数字医疗技术工具评估的新型临床测量方法适合临床研究、护理和决策。可靠性评估有助于确定信噪比和测量误差的特征,是衡量拟议临床指标潜在有用性的首要指标。摘要:可靠性分析方法散见于有关PROs、湿生物标记物等验证的文献中,但对数字临床指标同样有用。我们回顾了作为临床验证的一部分,通常用于可靠性评估的一般建模框架和统计指标。我们还介绍了评估一致性和测量误差的方法,以及针对分类测量的改进方法。我们使用从临床试验参与者身上采集的带有加速度传感器的可穿戴设备的体力活动数据来说明所讨论的技术:本文为参与开发和验证新型数字临床测量的统计学家和数据科学家提供了可靠性评估的统计方法和分析工具概览。
{"title":"Clinical Validation of Novel Digital Measures: Statistical Methods for Reliability Evaluation.","authors":"Bohdana Ratitch, Andrew Trigg, Madhurima Majumder, Vanja Vlajnic, Nicole Rethemeier, Richard Nkulikiyinka","doi":"10.1159/000531054","DOIUrl":"10.1159/000531054","url":null,"abstract":"<p><strong>Background: </strong>Assessment of reliability is one of the key components of the validation process designed to demonstrate that a novel clinical measure assessed by a digital health technology tool is fit-for-purpose in clinical research, care, and decision-making. Reliability assessment contributes to characterization of the signal-to-noise ratio and measurement error and is the first indicator of potential usefulness of the proposed clinical measure.</p><p><strong>Summary: </strong>Methodologies for reliability analyses are scattered across literature on validation of PROs, wet biomarkers, etc., yet are equally useful for digital clinical measures. We review a general modeling framework and statistical metrics typically used for reliability assessments as part of the clinical validation. We also present methods for the assessment of agreement and measurement error, alongside modified approaches for categorical measures. We illustrate the discussed techniques using physical activity data from a wearable device with an accelerometer sensor collected in clinical trial participants.</p><p><strong>Key messages: </strong>This paper provides statisticians and data scientists, involved in development and validation of novel digital clinical measures, an overview of the statistical methodologies and analytical tools for reliability assessment.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"74-91"},"PeriodicalIF":0.0,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/90/b8/dib-2023-0007-0001-531054.PMC10425717.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10017660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development and Assessment of an Artificial Intelligence-Based Tool for Ptosis Measurement in Adult Myasthenia Gravis Patients Using Selfie Video Clips Recorded on Smartphones. 基于人工智能的上睑下垂测量工具的开发与评估--使用智能手机录制的自拍视频剪辑测量成年肌无力患者的上睑下垂。
Q1 Computer Science Pub Date : 2023-07-28 eCollection Date: 2023-01-01 DOI: 10.1159/000531224
Meelis Lootus, Lulu Beatson, Lucas Atwood, Theo Bourdais, Sandra Steyaert, Chethan Sarabu, Zeenia Framroze, Harriet Dickinson, Jean-Christophe Steels, Emily Lewis, Nirav R Shah, Francesca Rinaldo

Introduction: Myasthenia gravis (MG) is a rare autoimmune disease characterized by muscle weakness and fatigue. Ptosis (eyelid drooping) occurs due to fatigue of the muscles for eyelid elevation and is one symptom widely used by patients and healthcare providers to track progression of the disease. Margin reflex distance 1 (MRD1) is an accepted clinical measure of ptosis and is typically assessed using a hand-held ruler. In this work, we develop an AI model that enables automated measurement of MRD1 in self-recorded video clips collected using patient smartphones.

Methods: A 3-month prospective observational study collected a dataset of video clips from patients with MG. Study participants were asked to perform an eyelid fatigability exercise to elicit ptosis while filming "selfie" videos on their smartphones. These images were collected in nonclinical settings, with no in-person training. The dataset was annotated by non-clinicians for (1) eye landmarks to establish ground truth MRD1 and (2) the quality of the video frames. The ground truth MRD1 (in millimeters, mm) was calculated from eye landmark annotations in the video frames using a standard conversion factor, the horizontal visible iris diameter of the human eye. To develop the model, we trained a neural network for eye landmark detection consisting of a ResNet50 backbone plus two dense layers of 78 dimensions on publicly available datasets. Only the ResNet50 backbone was used, discarding the last two layers. The embeddings from the ResNet50 were used as features for a support vector regressor (SVR) using a linear kernel, for regression to MRD1, in mm. The SVR was trained on data collected remotely from MG patients in the prospective study, split into training and development folds. The model's performance for MRD1 estimation was evaluated on a separate test fold from the study dataset.

Results: On the full test fold (N = 664 images), the correlation between the ground truth and predicted MRD1 values was strong (r = 0.732). The mean absolute error was 0.822 mm; the mean of differences was -0.256 mm; and 95% limits of agreement (LOA) were -0.214-1.768 mm. Model performance showed no improvement when test data were gated to exclude "poor" quality images.

Conclusions: On data generated under highly challenging real-world conditions from a variety of different smartphone devices, the model predicts MRD1 with a strong correlation (r = 0.732) between ground truth and predicted MRD1.

简介重症肌无力(MG)是一种罕见的自身免疫性疾病,以肌肉无力和疲劳为特征。上睑下垂(眼睑下垂)是由于抬高眼睑的肌肉疲劳所致,是患者和医疗服务提供者广泛用于追踪疾病进展的症状之一。边缘反射距离 1(MRD1)是一种公认的眼睑下垂临床测量方法,通常使用手持尺进行评估。在这项工作中,我们开发了一个人工智能模型,可以自动测量使用患者智能手机收集的自我记录视频片段中的 MRD1:一项为期 3 个月的前瞻性观察研究收集了一组 MG 患者的视频片段。研究参与者在用智能手机拍摄 "自拍 "视频的同时,被要求进行眼睑疲劳练习,以诱发上睑下垂。这些图像是在非临床环境中收集的,没有经过现场培训。非临床医生对数据集进行了注释:(1) 眼睛地标,以建立地面实况 MRD1;(2) 视频帧的质量。基本真实值 MRD1(单位:毫米,mm)是根据视频帧中的眼部地标注释,使用标准换算系数(人眼水平可见虹膜直径)计算得出的。为了开发模型,我们在公开数据集上训练了一个用于眼部地标检测的神经网络,该网络由一个 ResNet50 主干网和两个 78 维的密集层组成。我们只使用了 ResNet50 主干网,舍弃了最后两层。ResNet50 的嵌入作为支持向量回归器(SVR)的特征,使用线性核对 MRD1 进行回归,单位为毫米。SVR 是根据前瞻性研究中从 MG 患者处远程收集的数据进行训练的,分为训练褶皱和开发褶皱。在研究数据集的一个单独测试折叠上评估了模型的 MRD1 估计性能:结果:在完整的测试折叠(N = 664 张图像)上,地面实况值和预测的 MRD1 值之间的相关性很强(r = 0.732)。平均绝对误差为 0.822 毫米;平均差异为 -0.256 毫米;95% 的一致度 (LOA) 为 -0.214-1.768 毫米。当测试数据被选中以排除 "劣质 "图像时,模型性能没有得到改善:在各种不同的智能手机设备在极具挑战性的真实世界条件下生成的数据上,该模型可以预测 MRD1,地面实况与预测的 MRD1 之间具有很强的相关性(r = 0.732)。
{"title":"Development and Assessment of an Artificial Intelligence-Based Tool for Ptosis Measurement in Adult Myasthenia Gravis Patients Using Selfie Video Clips Recorded on Smartphones.","authors":"Meelis Lootus, Lulu Beatson, Lucas Atwood, Theo Bourdais, Sandra Steyaert, Chethan Sarabu, Zeenia Framroze, Harriet Dickinson, Jean-Christophe Steels, Emily Lewis, Nirav R Shah, Francesca Rinaldo","doi":"10.1159/000531224","DOIUrl":"10.1159/000531224","url":null,"abstract":"<p><strong>Introduction: </strong>Myasthenia gravis (MG) is a rare autoimmune disease characterized by muscle weakness and fatigue. Ptosis (eyelid drooping) occurs due to fatigue of the muscles for eyelid elevation and is one symptom widely used by patients and healthcare providers to track progression of the disease. Margin reflex distance 1 (MRD1) is an accepted clinical measure of ptosis and is typically assessed using a hand-held ruler. In this work, we develop an AI model that enables automated measurement of MRD1 in self-recorded video clips collected using patient smartphones.</p><p><strong>Methods: </strong>A 3-month prospective observational study collected a dataset of video clips from patients with MG. Study participants were asked to perform an eyelid fatigability exercise to elicit ptosis while filming \"selfie\" videos on their smartphones. These images were collected in nonclinical settings, with no in-person training. The dataset was annotated by non-clinicians for (1) eye landmarks to establish ground truth MRD1 and (2) the quality of the video frames. The ground truth MRD1 (in millimeters, mm) was calculated from eye landmark annotations in the video frames using a standard conversion factor, the horizontal visible iris diameter of the human eye. To develop the model, we trained a neural network for eye landmark detection consisting of a ResNet50 backbone plus two dense layers of 78 dimensions on publicly available datasets. Only the ResNet50 backbone was used, discarding the last two layers. The embeddings from the ResNet50 were used as features for a support vector regressor (SVR) using a linear kernel, for regression to MRD1, in mm. The SVR was trained on data collected remotely from MG patients in the prospective study, split into training and development folds. The model's performance for MRD1 estimation was evaluated on a separate test fold from the study dataset.</p><p><strong>Results: </strong>On the full test fold (<i>N</i> = 664 images), the correlation between the ground truth and predicted MRD1 values was strong (<i>r</i> = 0.732). The mean absolute error was 0.822 mm; the mean of differences was -0.256 mm; and 95% limits of agreement (LOA) were -0.214-1.768 mm. Model performance showed no improvement when test data were gated to exclude \"poor\" quality images.</p><p><strong>Conclusions: </strong>On data generated under highly challenging real-world conditions from a variety of different smartphone devices, the model predicts MRD1 with a strong correlation (<i>r</i> = 0.732) between ground truth and predicted MRD1.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"63-73"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/5b/23/dib-2023-0007-0001-531224.PMC10399113.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9954353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Digital Biomarkers
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1