{"title":"中国抑郁症患者阅读言语的声学特征分析","authors":"Yuan Jia, Yuzhu Liang, T. Zhu","doi":"10.1109/O-COCOSDA50338.2020.9295039","DOIUrl":null,"url":null,"abstract":"This paper investigates acoustic features of depression patients in voice quality and formants, from the perspective of experimental phonetics. The analysis on voice quality based on large samples shows that jitter, shimmer and HNR can distinguish the patients with different degrees of depression, while F0, standard deviation of F0 and HNR can distinguish depression patients from non-patients. These features indicate that the voice of patients tends to be hoarse and rough, with a lower pitch falling into a narrower range. The analysis on formants shows that depression patients tend to centralize monophthongs and simplify diphthongs, reflected by a lower opening degree and slower movement of tongue. Moreover, the patients tend to show a lower spectrum energy than healthy people. Finally, our analysis results suggest that these acoustic features can be used as objective markers for recognition of depression.","PeriodicalId":385266,"journal":{"name":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Analysis of Acoustic Features in Reading Speech from Chinese Patients with Depression\",\"authors\":\"Yuan Jia, Yuzhu Liang, T. Zhu\",\"doi\":\"10.1109/O-COCOSDA50338.2020.9295039\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper investigates acoustic features of depression patients in voice quality and formants, from the perspective of experimental phonetics. The analysis on voice quality based on large samples shows that jitter, shimmer and HNR can distinguish the patients with different degrees of depression, while F0, standard deviation of F0 and HNR can distinguish depression patients from non-patients. These features indicate that the voice of patients tends to be hoarse and rough, with a lower pitch falling into a narrower range. The analysis on formants shows that depression patients tend to centralize monophthongs and simplify diphthongs, reflected by a lower opening degree and slower movement of tongue. Moreover, the patients tend to show a lower spectrum energy than healthy people. Finally, our analysis results suggest that these acoustic features can be used as objective markers for recognition of depression.\",\"PeriodicalId\":385266,\"journal\":{\"name\":\"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/O-COCOSDA50338.2020.9295039\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/O-COCOSDA50338.2020.9295039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Analysis of Acoustic Features in Reading Speech from Chinese Patients with Depression
This paper investigates acoustic features of depression patients in voice quality and formants, from the perspective of experimental phonetics. The analysis on voice quality based on large samples shows that jitter, shimmer and HNR can distinguish the patients with different degrees of depression, while F0, standard deviation of F0 and HNR can distinguish depression patients from non-patients. These features indicate that the voice of patients tends to be hoarse and rough, with a lower pitch falling into a narrower range. The analysis on formants shows that depression patients tend to centralize monophthongs and simplify diphthongs, reflected by a lower opening degree and slower movement of tongue. Moreover, the patients tend to show a lower spectrum energy than healthy people. Finally, our analysis results suggest that these acoustic features can be used as objective markers for recognition of depression.