Sound source localization using binaural difference for hose-shaped rescue robot

2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) Pub Date : 2017-12-01 DOI:10.1109/APSIPA.2017.8282292

Narumi Mae, Yoshiki Mitsui, S. Makino, Daichi Kitamura, Nobutaka Ono, Takeshi Yamada, H. Saruwatari

{"title":"Sound source localization using binaural difference for hose-shaped rescue robot","authors":"Narumi Mae, Yoshiki Mitsui, S. Makino, Daichi Kitamura, Nobutaka Ono, Takeshi Yamada, H. Saruwatari","doi":"10.1109/APSIPA.2017.8282292","DOIUrl":null,"url":null,"abstract":"Rescue robots have been developed for search and rescue operations in times of large-scale disasters. Such a robot is used to search for survivors in disaster sites by capturing their voices with its microphone array. However, since the robot has many vibration motors, ego noise is mixed with voices, and it is difficult to differentiate the ego noise from a call for help from a disaster survivor. In our previous works, an ego noise reduction technique that combines a method of blind source separation called independent low-rank matrix analysis and postprocessing for noise cancellation was proposed. In the practical use of this robot, to determine the precise location of survivors, the direction of the observed voice should be estimated after the ego noise reduction process. To achieve this objective, in this study, a new hose-shaped rescue robot with microphone arrays was developed. Moreover, we adapt postfilter called MOSIE to our previous noise reduction method to listen to stereo sound because this robot can record stereo sound. By performing in a simulated disaster site, we confirm that the operator can perceive the direction of a survivor's location by applying a speech enhancement technique combining independent low-rank matrix analysis, noise cancellation, and postfiltering to the observed multichannel noisy signals.","PeriodicalId":142091,"journal":{"name":"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIPA.2017.8282292","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Rescue robots have been developed for search and rescue operations in times of large-scale disasters. Such a robot is used to search for survivors in disaster sites by capturing their voices with its microphone array. However, since the robot has many vibration motors, ego noise is mixed with voices, and it is difficult to differentiate the ego noise from a call for help from a disaster survivor. In our previous works, an ego noise reduction technique that combines a method of blind source separation called independent low-rank matrix analysis and postprocessing for noise cancellation was proposed. In the practical use of this robot, to determine the precise location of survivors, the direction of the observed voice should be estimated after the ego noise reduction process. To achieve this objective, in this study, a new hose-shaped rescue robot with microphone arrays was developed. Moreover, we adapt postfilter called MOSIE to our previous noise reduction method to listen to stereo sound because this robot can record stereo sound. By performing in a simulated disaster site, we confirm that the operator can perceive the direction of a survivor's location by applying a speech enhancement technique combining independent low-rank matrix analysis, noise cancellation, and postfiltering to the observed multichannel noisy signals.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于双耳差分的软管型救援机器人声源定位

救援机器人是为大规模灾害时的搜救行动而开发的。这种机器人通过麦克风阵列捕捉幸存者的声音，在灾难现场寻找幸存者。但是，由于机器人有许多振动马达，因此自我噪音与声音混杂在一起，很难将自我噪音与灾难幸存者的求救信号区分开来。在我们之前的工作中，我们提出了一种自我降噪技术，该技术结合了一种称为独立低秩矩阵分析的盲源分离方法和噪声消除的后处理。在本机器人的实际使用中，为了确定幸存者的精确位置，需要在自我降噪过程后估计观察到的声音的方向。为了实现这一目标，本研究开发了一种带有麦克风阵列的新型软管状救援机器人。此外，由于该机器人可以录制立体声，因此我们在之前的降噪方法上采用了称为MOSIE的后滤波来收听立体声。通过模拟灾难现场，我们证实了操作员可以通过将独立的低秩矩阵分析、噪声消除和后滤波相结合的语音增强技术对观察到的多通道噪声信号感知幸存者位置的方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

自引率

0.00%

发文量