基于感知的机器环境声音异构信息显著性特征融合方法

Jingyu Wang, Ke Zhang, K. Madani, C. Sabourin
{"title":"基于感知的机器环境声音异构信息显著性特征融合方法","authors":"Jingyu Wang, Ke Zhang, K. Madani, C. Sabourin","doi":"10.1109/ICAWST.2013.6765433","DOIUrl":null,"url":null,"abstract":"Human beings are more intelligent in dealing with sound which occurred in everyday life than robots or other kind of unmanned ground vehicles because of the instinct of \"sense\" or \"awareness\", which is an ability to distinguish the most salient sound, object or events in the surrounding environment. Inspired by the biological acoustic awareness of human hearing system and the visual saliency talent of human vision, a heterogeneous information saliency feature fusion (HISFF) approach which simulates human awareness of environment sound for machine's awareness is proposed in this paper. The sound signal is visualized by using the Short-Time Fourier Transform (STFT) algorithm in order to convert the acoustic saliency into visual saliency, and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the human acoustic awareness. The proposed HISFF approach is tested by using the environment sound data which collected from the real world of both indoor and outdoor environment. The results show that this approach is able to extract the saliency signal from both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for machine's environment sounds based awareness.","PeriodicalId":68697,"journal":{"name":"炎黄地理","volume":"44 1","pages":"197-205"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Heterogeneous information saliency features' fusion approach for machine's environment sounds based awareness\",\"authors\":\"Jingyu Wang, Ke Zhang, K. Madani, C. Sabourin\",\"doi\":\"10.1109/ICAWST.2013.6765433\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human beings are more intelligent in dealing with sound which occurred in everyday life than robots or other kind of unmanned ground vehicles because of the instinct of \\\"sense\\\" or \\\"awareness\\\", which is an ability to distinguish the most salient sound, object or events in the surrounding environment. Inspired by the biological acoustic awareness of human hearing system and the visual saliency talent of human vision, a heterogeneous information saliency feature fusion (HISFF) approach which simulates human awareness of environment sound for machine's awareness is proposed in this paper. The sound signal is visualized by using the Short-Time Fourier Transform (STFT) algorithm in order to convert the acoustic saliency into visual saliency, and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the human acoustic awareness. The proposed HISFF approach is tested by using the environment sound data which collected from the real world of both indoor and outdoor environment. The results show that this approach is able to extract the saliency signal from both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for machine's environment sounds based awareness.\",\"PeriodicalId\":68697,\"journal\":{\"name\":\"炎黄地理\",\"volume\":\"44 1\",\"pages\":\"197-205\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"炎黄地理\",\"FirstCategoryId\":\"1089\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAWST.2013.6765433\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"炎黄地理","FirstCategoryId":"1089","ListUrlMain":"https://doi.org/10.1109/ICAWST.2013.6765433","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

人类在处理日常生活中发生的声音时比机器人或其他无人驾驶地面车辆更聪明,因为人类具有“感觉”或“意识”的本能,即区分周围环境中最显著的声音、物体或事件的能力。本文以人类听觉系统的生物声感知和人类视觉的视觉显著性天赋为灵感,提出了一种模拟人类对环境声音的感知以实现机器感知的异构信息显著性特征融合(HISFF)方法。利用短时傅里叶变换(STFT)算法对声音信号进行可视化处理,将声音显著性转化为视觉显著性,并利用Mel-Frequency倒频谱系数(MFCC)表征人的声意识。利用室内和室外环境的真实环境声数据对所提出的HISFF方法进行了测试。结果表明,该方法能够成功且清晰地从长期和短期声音信号中提取显著性信号,为机器基于环境声音的感知提供了非常容易区分的特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Heterogeneous information saliency features' fusion approach for machine's environment sounds based awareness
Human beings are more intelligent in dealing with sound which occurred in everyday life than robots or other kind of unmanned ground vehicles because of the instinct of "sense" or "awareness", which is an ability to distinguish the most salient sound, object or events in the surrounding environment. Inspired by the biological acoustic awareness of human hearing system and the visual saliency talent of human vision, a heterogeneous information saliency feature fusion (HISFF) approach which simulates human awareness of environment sound for machine's awareness is proposed in this paper. The sound signal is visualized by using the Short-Time Fourier Transform (STFT) algorithm in order to convert the acoustic saliency into visual saliency, and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the human acoustic awareness. The proposed HISFF approach is tested by using the environment sound data which collected from the real world of both indoor and outdoor environment. The results show that this approach is able to extract the saliency signal from both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for machine's environment sounds based awareness.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
784
期刊最新文献
Make decision boundary smoother by transition learning Neurophysiological evidence of the cognitive cycle and the emergence of awareness An efficient implementation of normalized cross-correlation image matching based on pyramid A hybrid recommender system based non-common items in social media "Canderoid": A mobile system to remotely monitor travelling status of the elderly with dementia
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1