Vowel Sound Synthesis from Electroencephalography during Listening and Recalling

Wataru Akashi, H. Kambara, Yousuke Ogata, Y. Koike, L. Minati, N. Yoshimura
{"title":"Vowel Sound Synthesis from Electroencephalography during Listening and Recalling","authors":"Wataru Akashi, H. Kambara, Yousuke Ogata, Y. Koike, L. Minati, N. Yoshimura","doi":"10.1002/aisy.202000164","DOIUrl":null,"url":null,"abstract":"Recent advances in brain imaging technology have furthered our knowledge of the neural basis of auditory and speech processing, often via contributions from invasive brain signal recording and stimulation studies conducted intraoperatively. Herein, an approach for synthesizing vowel sounds straightforwardly from scalp‐recorded electroencephalography (EEG), a noninvasive neurophysiological recording method is demonstrated. Given cortical current signals derived from the EEG acquired while human participants listen to and recall (i.e., imagined) two vowels, /a/ and /i/, sound parameters are estimated by a convolutional neural network (CNN). The speech synthesized from the estimated parameters is sufficiently natural to achieve recognition rates >85% during a subsequent sound discrimination task. Notably, the CNN identifies the involvement of the brain areas mediating the “what” auditory stream, namely the superior, middle temporal, and Heschl's gyri, demonstrating the efficacy of the computational method in extracting auditory‐related information from neuroelectrical activity. Differences in cortical sound representation between listening versus recalling are further revealed, such that the fusiform, calcarine, and anterior cingulate gyri contributes during listening, whereas the inferior occipital gyrus is engaged during recollection. The proposed approach can expand the scope of EEG in decoding auditory perception that requires high spatial and temporal resolution.","PeriodicalId":7187,"journal":{"name":"Advanced Intelligent Systems","volume":"27 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/aisy.202000164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Recent advances in brain imaging technology have furthered our knowledge of the neural basis of auditory and speech processing, often via contributions from invasive brain signal recording and stimulation studies conducted intraoperatively. Herein, an approach for synthesizing vowel sounds straightforwardly from scalp‐recorded electroencephalography (EEG), a noninvasive neurophysiological recording method is demonstrated. Given cortical current signals derived from the EEG acquired while human participants listen to and recall (i.e., imagined) two vowels, /a/ and /i/, sound parameters are estimated by a convolutional neural network (CNN). The speech synthesized from the estimated parameters is sufficiently natural to achieve recognition rates >85% during a subsequent sound discrimination task. Notably, the CNN identifies the involvement of the brain areas mediating the “what” auditory stream, namely the superior, middle temporal, and Heschl's gyri, demonstrating the efficacy of the computational method in extracting auditory‐related information from neuroelectrical activity. Differences in cortical sound representation between listening versus recalling are further revealed, such that the fusiform, calcarine, and anterior cingulate gyri contributes during listening, whereas the inferior occipital gyrus is engaged during recollection. The proposed approach can expand the scope of EEG in decoding auditory perception that requires high spatial and temporal resolution.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
听和回忆过程中脑电图的元音合成
脑成像技术的最新进展进一步加深了我们对听觉和言语处理的神经基础的认识,这通常是通过侵入性脑信号记录和术中进行的刺激研究来实现的。本文展示了一种直接从头皮记录的脑电图(EEG)合成元音的方法,这是一种无创的神经生理学记录方法。当受试者听和回忆(即想象)两个元音/a/和/i/时,获得脑电皮层电流信号,通过卷积神经网络(CNN)估计声音参数。根据估计的参数合成的语音足够自然,可以在随后的声音识别任务中实现>85%的识别率。值得注意的是,CNN识别了参与“什么”听觉流的大脑区域,即上颞叶、中颞叶和Heschl’s gyri,证明了计算方法在从神经电活动中提取听觉相关信息方面的有效性。进一步揭示了倾听和回忆在皮层声音表征上的差异,如纺锤状回、肌动回和前扣带回在倾听过程中起作用,而枕下回在回忆过程中起作用。该方法可以扩大脑电在高时空分辨率听觉感知解码中的应用范围。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Dynamic Tactile Synthetic Tissue: from Soft Robotics to Hybrid Surgical Simulators Maximizing the Synaptic Efficiency of Ferroelectric Tunnel Junction Devices Using a Switching Mechanism Hidden in an Identical Pulse Programming Learning Scheme Enhancing Sensitivity across Scales with Highly Sensitive Hall Effect‐Based Auxetic Tactile Sensors 3D Printed Swordfish‐Like Wireless Millirobot Widened Attention‐Enhanced Atrous Convolutional Network for Efficient Embedded Vision Applications under Resource Constraints
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1