Creating a Spanish Speech Corpus to Develop Digital Dementia Biomarkers Using Machine Learning

L. Cabrera-Leyva, Jesús Favela Vara, Dagoberto Cruz-Sandoval, Diana Leticia Paniagua Santos, Maricruz Huerta Jauregui
{"title":"Creating a Spanish Speech Corpus to Develop Digital Dementia Biomarkers Using Machine Learning","authors":"L. Cabrera-Leyva, Jesús Favela Vara, Dagoberto Cruz-Sandoval, Diana Leticia Paniagua Santos, Maricruz Huerta Jauregui","doi":"10.1109/ENC56672.2022.9882903","DOIUrl":null,"url":null,"abstract":"Dementia is one of the most prevalent diseases affecting older adults in Mexico. There has been increasing interest in the development of digital biomarkers of dementia based on the analysis of speech. The availability of high-quality speech corpus is important to advance this line of research. However, there are no publicly available dataset in Spanish for this purpose. Therefore, we describe a protocol to capture Spanish audio from older adults for dementia research. We describe the lessons learned and adjustments to the protocol that emerged from a pilot study.","PeriodicalId":145622,"journal":{"name":"2022 IEEE Mexican International Conference on Computer Science (ENC)","volume":"99 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Mexican International Conference on Computer Science (ENC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ENC56672.2022.9882903","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Dementia is one of the most prevalent diseases affecting older adults in Mexico. There has been increasing interest in the development of digital biomarkers of dementia based on the analysis of speech. The availability of high-quality speech corpus is important to advance this line of research. However, there are no publicly available dataset in Spanish for this purpose. Therefore, we describe a protocol to capture Spanish audio from older adults for dementia research. We describe the lessons learned and adjustments to the protocol that emerged from a pilot study.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用机器学习创建西班牙语语料库以开发数字痴呆症生物标志物
痴呆症是影响墨西哥老年人的最普遍疾病之一。基于语言分析的痴呆症数字生物标志物的开发越来越受到关注。高质量语音语料库的可用性对于推进这方面的研究非常重要。然而,没有西班牙语的公开可用数据集用于此目的。因此,我们描述了一种从老年人中获取西班牙语音频用于痴呆症研究的方案。我们描述了从试点研究中获得的经验教训和对方案的调整。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Uso de plataforma de videojuegos de conducción para analizar el desempeño visual de los conductores: estudio piloto Design, development, and evaluation of a medical system for estimating dosimetry levels in a public hospital Characterization of the environment of teachers, students and parents of basic education based on the GQM Quality Model Detection of Atypical Data in Point Cloud of Technical Vision System using Digital Filtering Creation of a Dataset for personality and professional interest recognition
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1