X4无人机存在下的语音清晰度

M. Miesikowska
{"title":"X4无人机存在下的语音清晰度","authors":"M. Miesikowska","doi":"10.23919/SPA.2018.8563410","DOIUrl":null,"url":null,"abstract":"The main purpose of this work was to obtain background sound levels and speech intelligibility as well as to evaluate classification of speech commands in the presence of an unmanned aerial vehicle (UAV) equipped with four rotating propellers. Speech intelligibility was assessed using speech interference level (SIL) parameter according to ISO 9921. The UAV background sound levels were recorded in laboratory conditions using Norsonic140 sound analyzer in the absence of the UAV and in the presence of the UAV. The classification of speech commands/left, right, up, down, forward, backward, start, stop/recorded with Olympus LS-11 was evaluated in laboratory condition based on Mel-frequency cepstral coefficients and discriminant function analysis. The UAV was hovering at 1.5m during recordings. The A-weighted sound level obtained in the presence of the UAV was 70.5 dB(A). Speech intelligibility rating was poor in the presence of the UAV. Discriminant analysis based on Mel-frequency cepstral coefficients showed very successful classification of speech commands equal to 100%. Evaluated speech intelligibility did not exclude verbal communication with the UAV. The successful classification of speech commands in the presence of the UAV can enable the control of the UAV using voice commands and general communication with the UAV using speech.","PeriodicalId":265587,"journal":{"name":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Speech Intelligibility in the presence of X4 Unmanned Aerial Vehicle\",\"authors\":\"M. Miesikowska\",\"doi\":\"10.23919/SPA.2018.8563410\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The main purpose of this work was to obtain background sound levels and speech intelligibility as well as to evaluate classification of speech commands in the presence of an unmanned aerial vehicle (UAV) equipped with four rotating propellers. Speech intelligibility was assessed using speech interference level (SIL) parameter according to ISO 9921. The UAV background sound levels were recorded in laboratory conditions using Norsonic140 sound analyzer in the absence of the UAV and in the presence of the UAV. The classification of speech commands/left, right, up, down, forward, backward, start, stop/recorded with Olympus LS-11 was evaluated in laboratory condition based on Mel-frequency cepstral coefficients and discriminant function analysis. The UAV was hovering at 1.5m during recordings. The A-weighted sound level obtained in the presence of the UAV was 70.5 dB(A). Speech intelligibility rating was poor in the presence of the UAV. Discriminant analysis based on Mel-frequency cepstral coefficients showed very successful classification of speech commands equal to 100%. Evaluated speech intelligibility did not exclude verbal communication with the UAV. The successful classification of speech commands in the presence of the UAV can enable the control of the UAV using voice commands and general communication with the UAV using speech.\",\"PeriodicalId\":265587,\"journal\":{\"name\":\"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/SPA.2018.8563410\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/SPA.2018.8563410","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

这项工作的主要目的是获得背景声级和语音可理解性,以及在配备四个旋转螺旋桨的无人机(UAV)存在的情况下评估语音命令的分类。根据ISO 9921标准,使用语音干扰水平(SIL)参数评估语音清晰度。在没有无人机和有无人机的情况下,在实验室条件下使用Norsonic140声音分析仪记录无人机背景声级。基于mel频倒谱系数和判别函数分析,在实验室条件下对Olympus LS-11录制的语音命令/左、右、上、下、前、后、开始、停止的分类进行评价。在录制过程中,无人机在1.5米的高度悬停。在无人机存在下获得的A加权声级为70.5 dB(A)。在无人机的存在下,语音清晰度评级很差。基于mel频率倒谱系数的判别分析表明,语音命令的分类成功率为100%。评估的语音清晰度不排除与无人机的口头交流。在UAV存在下语音命令的成功分类能够使UAV使用语音命令的控制和使用语音与UAV的一般通信成为可能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Speech Intelligibility in the presence of X4 Unmanned Aerial Vehicle
The main purpose of this work was to obtain background sound levels and speech intelligibility as well as to evaluate classification of speech commands in the presence of an unmanned aerial vehicle (UAV) equipped with four rotating propellers. Speech intelligibility was assessed using speech interference level (SIL) parameter according to ISO 9921. The UAV background sound levels were recorded in laboratory conditions using Norsonic140 sound analyzer in the absence of the UAV and in the presence of the UAV. The classification of speech commands/left, right, up, down, forward, backward, start, stop/recorded with Olympus LS-11 was evaluated in laboratory condition based on Mel-frequency cepstral coefficients and discriminant function analysis. The UAV was hovering at 1.5m during recordings. The A-weighted sound level obtained in the presence of the UAV was 70.5 dB(A). Speech intelligibility rating was poor in the presence of the UAV. Discriminant analysis based on Mel-frequency cepstral coefficients showed very successful classification of speech commands equal to 100%. Evaluated speech intelligibility did not exclude verbal communication with the UAV. The successful classification of speech commands in the presence of the UAV can enable the control of the UAV using voice commands and general communication with the UAV using speech.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Vehicle detector training with labels derived from background subtraction algorithms in video surveillance Automatic 3D segmentation of MRI data for detection of head and neck cancerous lymph nodes Centerline-Radius Polygonal-Mesh Modeling of Bifurcated Blood Vessels in 3D Images using Conformal Mapping Active elimination of tonal components in acoustic signals An adaptive transmission algorithm for an inertial motion capture system in the aspect of energy saving
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1