Junghyun Namkung, Seok Min Kim, Won Ik Cho, So Young Yoo, Beomjun Min, Sang Yool Lee, Ji-Hye Lee, Heyeon Park, Soyoung Baik, Je-Yeon Yun, Nam Soo Kim, Jeong-Hyun Kim
{"title":"Novel Deep Learning-Based Vocal Biomarkers for Stress Detection in Koreans.","authors":"Junghyun Namkung, Seok Min Kim, Won Ik Cho, So Young Yoo, Beomjun Min, Sang Yool Lee, Ji-Hye Lee, Heyeon Park, Soyoung Baik, Je-Yeon Yun, Nam Soo Kim, Jeong-Hyun Kim","doi":"10.30773/pi.2024.0131","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>The rapid societal changes have underscored the importance of effective stress detection and management. Chronic mental stress significantly contributes to both physical and psychological illnesses. However, many individuals often remain unaware of their stress levels until they face physical health issues, highlighting the necessity for regular stress monitoring. This study aimed to investigate the effectiveness of vocal biomarkers in detecting stress levels among healthy Korean employees and to contribute to digital healthcare solutions.</p><p><strong>Methods: </strong>We conducted a multi-center clinical study by collecting voice recordings from 115 healthy Korean employees under both relaxed and stress-induced conditions. Stress was induced using the socially evaluated cold pressor test. The Emphasized Channel Attention, Propagation and Aggregation in Time delay neural network (ECAPA-TDNN) deep learning architecture, renowned for its advanced capabilities in analyzing person-specific voice features, was employed to develop stress prediction scores.</p><p><strong>Results: </strong>The proposed model achieved a 70% accuracy rate in detecting stress. This performance underscores the potential of vocal biomarkers as a convenient and effective tool for individuals to self-monitor and manage their stress levels within digital healthcare frameworks.</p><p><strong>Conclusion: </strong>The findings emphasize the promise of voice-based mental stress assessments within the Korean population and the importance of continued research on vocal biomarkers across diverse linguistic demographics.</p>","PeriodicalId":21164,"journal":{"name":"Psychiatry Investigation","volume":"21 11","pages":"1228-1237"},"PeriodicalIF":1.8000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11611465/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychiatry Investigation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.30773/pi.2024.0131","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/18 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: The rapid societal changes have underscored the importance of effective stress detection and management. Chronic mental stress significantly contributes to both physical and psychological illnesses. However, many individuals often remain unaware of their stress levels until they face physical health issues, highlighting the necessity for regular stress monitoring. This study aimed to investigate the effectiveness of vocal biomarkers in detecting stress levels among healthy Korean employees and to contribute to digital healthcare solutions.
Methods: We conducted a multi-center clinical study by collecting voice recordings from 115 healthy Korean employees under both relaxed and stress-induced conditions. Stress was induced using the socially evaluated cold pressor test. The Emphasized Channel Attention, Propagation and Aggregation in Time delay neural network (ECAPA-TDNN) deep learning architecture, renowned for its advanced capabilities in analyzing person-specific voice features, was employed to develop stress prediction scores.
Results: The proposed model achieved a 70% accuracy rate in detecting stress. This performance underscores the potential of vocal biomarkers as a convenient and effective tool for individuals to self-monitor and manage their stress levels within digital healthcare frameworks.
Conclusion: The findings emphasize the promise of voice-based mental stress assessments within the Korean population and the importance of continued research on vocal biomarkers across diverse linguistic demographics.
期刊介绍:
The Psychiatry Investigation is published on the 25th day of every month in English by the Korean Neuropsychiatric Association (KNPA). The Journal covers the whole range of psychiatry and neuroscience. Both basic and clinical contributions are encouraged from all disciplines and research areas relevant to the pathophysiology and management of neuropsychiatric disorders and symptoms, as well as researches related to cross cultural psychiatry and ethnic issues in psychiatry. The Journal publishes editorials, review articles, original articles, brief reports, viewpoints and correspondences. All research articles are peer reviewed. Contributions are accepted for publication on the condition that their substance has not been published or submitted for publication elsewhere. Authors submitting papers to the Journal (serially or otherwise) with a common theme or using data derived from the same sample (or a subset thereof) must send details of all relevant previous publications and simultaneous submissions. The Journal is not responsible for statements made by contributors. Material in the Journal does not necessarily reflect the views of the Editor or of the KNPA. Manuscripts accepted for publication are copy-edited to improve readability and to ensure conformity with house style.