使用隐性语音的交流范式:将大脑信号转化为语音

Augmented Human Research Pub Date : 2016-10-10 DOI:10.1007/s41133-016-0001-z

Kusuma Mohanchandra, Snehanshu Saha

{"title":"使用隐性语音的交流范式:将大脑信号转化为语音","authors":"Kusuma Mohanchandra, Snehanshu Saha","doi":"10.1007/s41133-016-0001-z","DOIUrl":null,"url":null,"abstract":"<div><p>Recent science and technology studies in neuroscience, rehabilitation, and machine learning have focused attention on the EEG-based brain–computer interface (BCI) as an exciting field of research. Though the primary goal of the BCI has been to restore communication in the severely paralyzed, BCI for speech communication has acquired recognition in a variety of non-medical fields. These fields include silent speech communication, cognitive biometrics, and synthetic telepathy, to name a few. Though potentially a very sensitive issue on various counts, it is likely to revolutionize the whole system of communication. Considering the wide range of application, this paper presents innovative research on BCI for speech communication. Since imagined speech suffers from quite a few factors, we have chosen to focus on subvocalized speech for the current work. The current work is considered to be the first to utilize the subvocal verbalization for EEG-based BCI in speech communication. The electrical signals generated by the human brain during subvocalized speech are captured, analyzed, and interpreted as speech. Further, the processed EEG signals are used to drive a speech synthesizer, enabling communication and acoustical feedback for the user. We attempt to demonstrate and justify that the BCI is capable of providing good results. The basis of this effort is the presumption that, whether the speech is overt or covert, it always originates in the mind. The scalp maps provide evidence that subvocal speech prediction, from the neurological signals, is achievable. The statistical results obtained from the current study demonstrate that speech prediction is possible. EEG signals suffer from the curse of dimensionality due to the intrinsic biological and electromagnetic complexities. Therefore, in the current work, the subset selection method, using pairwise cross-correlation, is proposed to reduce the size of the data while minimizing loss of information. The prominent variances obtained from the SSM, based on principal representative features, were deployed to analyze multiclass EEG signals. A multiclass support vector machine is used for the classification of EEG signals of five subvocalized words extracted from scalp electrodes. Though the current work identifies many challenges, the promise of this technology is exhibited.</p></div>","PeriodicalId":100147,"journal":{"name":"Augmented Human Research","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2016-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s41133-016-0001-z","citationCount":"24","resultStr":"{\"title\":\"A Communication Paradigm Using Subvocalized Speech: Translating Brain Signals into Speech\",\"authors\":\"Kusuma Mohanchandra, Snehanshu Saha\",\"doi\":\"10.1007/s41133-016-0001-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Recent science and technology studies in neuroscience, rehabilitation, and machine learning have focused attention on the EEG-based brain–computer interface (BCI) as an exciting field of research. Though the primary goal of the BCI has been to restore communication in the severely paralyzed, BCI for speech communication has acquired recognition in a variety of non-medical fields. These fields include silent speech communication, cognitive biometrics, and synthetic telepathy, to name a few. Though potentially a very sensitive issue on various counts, it is likely to revolutionize the whole system of communication. Considering the wide range of application, this paper presents innovative research on BCI for speech communication. Since imagined speech suffers from quite a few factors, we have chosen to focus on subvocalized speech for the current work. The current work is considered to be the first to utilize the subvocal verbalization for EEG-based BCI in speech communication. The electrical signals generated by the human brain during subvocalized speech are captured, analyzed, and interpreted as speech. Further, the processed EEG signals are used to drive a speech synthesizer, enabling communication and acoustical feedback for the user. We attempt to demonstrate and justify that the BCI is capable of providing good results. The basis of this effort is the presumption that, whether the speech is overt or covert, it always originates in the mind. The scalp maps provide evidence that subvocal speech prediction, from the neurological signals, is achievable. The statistical results obtained from the current study demonstrate that speech prediction is possible. EEG signals suffer from the curse of dimensionality due to the intrinsic biological and electromagnetic complexities. Therefore, in the current work, the subset selection method, using pairwise cross-correlation, is proposed to reduce the size of the data while minimizing loss of information. The prominent variances obtained from the SSM, based on principal representative features, were deployed to analyze multiclass EEG signals. A multiclass support vector machine is used for the classification of EEG signals of five subvocalized words extracted from scalp electrodes. Though the current work identifies many challenges, the promise of this technology is exhibited.</p></div>\",\"PeriodicalId\":100147,\"journal\":{\"name\":\"Augmented Human Research\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1007/s41133-016-0001-z\",\"citationCount\":\"24\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Augmented Human Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s41133-016-0001-z\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Augmented Human Research","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s41133-016-0001-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 24

摘要

近年来，神经科学、康复和机器学习等领域的科学技术研究都将基于脑电图的脑机接口(BCI)作为一个令人兴奋的研究领域。虽然脑机接口的主要目的是恢复重度瘫痪患者的交流，但脑机接口在言语交流方面的应用已在各种非医学领域得到认可。这些领域包括无声语言交流，认知生物识别和合成心灵感应，仅举几例。尽管从各个方面来看，这都是一个非常敏感的问题，但它可能会彻底改变整个通信系统。鉴于脑机接口在语音通信中的广泛应用，本文提出了脑机接口在语音通信中的创新研究。由于想象语音受到很多因素的影响，因此我们选择在当前的工作中重点研究次发声语音。本研究被认为是首次将声下言语化应用于基于脑电图的脑机接口语言交流。人脑在默语过程中产生的电信号被捕捉、分析并解释为语音。此外，处理后的脑电图信号用于驱动语音合成器，为用户实现通信和声学反馈。我们试图证明并证明BCI能够提供良好的结果。这种努力的基础是这样一种假设，即无论言语是公开的还是隐蔽的，它总是源于心灵。头皮图提供的证据表明，从神经信号中预测语音是可以实现的。本研究的统计结果表明，语音预测是可能的。脑电图信号由于其固有的生物复杂性和电磁复杂性而受到维数诅咒。因此，在当前的工作中，我们提出了使用成对相互关联的子集选择方法，以减少数据的大小，同时最大限度地减少信息的损失。基于主代表特征，利用SSM得到的显著方差分析多类脑电信号。利用多类支持向量机对头皮电极提取的5个次发声词的脑电信号进行分类。尽管目前的工作发现了许多挑战，但这项技术的前景已经展现出来。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A Communication Paradigm Using Subvocalized Speech: Translating Brain Signals into Speech

Recent science and technology studies in neuroscience, rehabilitation, and machine learning have focused attention on the EEG-based brain–computer interface (BCI) as an exciting field of research. Though the primary goal of the BCI has been to restore communication in the severely paralyzed, BCI for speech communication has acquired recognition in a variety of non-medical fields. These fields include silent speech communication, cognitive biometrics, and synthetic telepathy, to name a few. Though potentially a very sensitive issue on various counts, it is likely to revolutionize the whole system of communication. Considering the wide range of application, this paper presents innovative research on BCI for speech communication. Since imagined speech suffers from quite a few factors, we have chosen to focus on subvocalized speech for the current work. The current work is considered to be the first to utilize the subvocal verbalization for EEG-based BCI in speech communication. The electrical signals generated by the human brain during subvocalized speech are captured, analyzed, and interpreted as speech. Further, the processed EEG signals are used to drive a speech synthesizer, enabling communication and acoustical feedback for the user. We attempt to demonstrate and justify that the BCI is capable of providing good results. The basis of this effort is the presumption that, whether the speech is overt or covert, it always originates in the mind. The scalp maps provide evidence that subvocal speech prediction, from the neurological signals, is achievable. The statistical results obtained from the current study demonstrate that speech prediction is possible. EEG signals suffer from the curse of dimensionality due to the intrinsic biological and electromagnetic complexities. Therefore, in the current work, the subset selection method, using pairwise cross-correlation, is proposed to reduce the size of the data while minimizing loss of information. The prominent variances obtained from the SSM, based on principal representative features, were deployed to analyze multiclass EEG signals. A multiclass support vector machine is used for the classification of EEG signals of five subvocalized words extracted from scalp electrodes. Though the current work identifies many challenges, the promise of this technology is exhibited.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Augmented Human Research

自引率

0.00%

发文量