利用从机器特定滤波器库中提取的频谱-时态调制表示法进行机器异常声音检测

Kai Li, Khalid Zaman, Xingfeng Li, Masato Akagi, Masashi Unoki
{"title":"利用从机器特定滤波器库中提取的频谱-时态调制表示法进行机器异常声音检测","authors":"Kai Li, Khalid Zaman, Xingfeng Li, Masato Akagi, Masashi Unoki","doi":"arxiv-2409.05319","DOIUrl":null,"url":null,"abstract":"Early detection of factory machinery malfunctions is crucial in industrial\napplications. In machine anomalous sound detection (ASD), different machines\nexhibit unique vibration-frequency ranges based on their physical properties.\nMeanwhile, the human auditory system is adept at tracking both temporal and\nspectral dynamics of machine sounds. Consequently, integrating the\ncomputational auditory models of the human auditory system with\nmachine-specific properties can be an effective approach to machine ASD. We\nfirst quantified the frequency importances of four types of machines using the\nFisher ratio (F-ratio). The quantified frequency importances were then used to\ndesign machine-specific non-uniform filterbanks (NUFBs), which extract the log\nnon-uniform spectrum (LNS) feature. The designed NUFBs have a narrower\nbandwidth and higher filter distribution density in frequency regions with\nrelatively high F-ratios. Finally, spectral and temporal modulation\nrepresentations derived from the LNS feature were proposed. These proposed LNS\nfeature and modulation representations are input into an autoencoder\nneural-network-based detector for ASD. The quantification results from the\ntraining set of the Malfunctioning Industrial Machine Investigation and\nInspection dataset with a signal-to-noise (SNR) of 6 dB reveal that the\ndistinguishing information between normal and anomalous sounds of different\nmachines is encoded non-uniformly in the frequency domain. By highlighting\nthese important frequency regions using NUFBs, the LNS feature can\nsignificantly enhance performance using the metric of AUC (area under the\nreceiver operating characteristic curve) under various SNR conditions.\nFurthermore, modulation representations can further improve performance.\nSpecifically, temporal modulation is effective for fans, pumps, and sliders,\nwhile spectral modulation is particularly effective for valves.","PeriodicalId":501178,"journal":{"name":"arXiv - CS - Sound","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks\",\"authors\":\"Kai Li, Khalid Zaman, Xingfeng Li, Masato Akagi, Masashi Unoki\",\"doi\":\"arxiv-2409.05319\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Early detection of factory machinery malfunctions is crucial in industrial\\napplications. In machine anomalous sound detection (ASD), different machines\\nexhibit unique vibration-frequency ranges based on their physical properties.\\nMeanwhile, the human auditory system is adept at tracking both temporal and\\nspectral dynamics of machine sounds. Consequently, integrating the\\ncomputational auditory models of the human auditory system with\\nmachine-specific properties can be an effective approach to machine ASD. We\\nfirst quantified the frequency importances of four types of machines using the\\nFisher ratio (F-ratio). The quantified frequency importances were then used to\\ndesign machine-specific non-uniform filterbanks (NUFBs), which extract the log\\nnon-uniform spectrum (LNS) feature. The designed NUFBs have a narrower\\nbandwidth and higher filter distribution density in frequency regions with\\nrelatively high F-ratios. Finally, spectral and temporal modulation\\nrepresentations derived from the LNS feature were proposed. These proposed LNS\\nfeature and modulation representations are input into an autoencoder\\nneural-network-based detector for ASD. The quantification results from the\\ntraining set of the Malfunctioning Industrial Machine Investigation and\\nInspection dataset with a signal-to-noise (SNR) of 6 dB reveal that the\\ndistinguishing information between normal and anomalous sounds of different\\nmachines is encoded non-uniformly in the frequency domain. By highlighting\\nthese important frequency regions using NUFBs, the LNS feature can\\nsignificantly enhance performance using the metric of AUC (area under the\\nreceiver operating characteristic curve) under various SNR conditions.\\nFurthermore, modulation representations can further improve performance.\\nSpecifically, temporal modulation is effective for fans, pumps, and sliders,\\nwhile spectral modulation is particularly effective for valves.\",\"PeriodicalId\":501178,\"journal\":{\"name\":\"arXiv - CS - Sound\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Sound\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.05319\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Sound","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05319","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

工厂机械故障的早期检测在工业应用中至关重要。在机器异常声音检测(ASD)中,不同的机器会根据其物理特性表现出独特的振动频率范围。因此,将人类听觉系统的计算听觉模型与机器的特定属性相结合,不失为一种有效的机器自动识别方法。我们首先使用菲舍尔比率(F-ratio)量化了四种机器的频率输入。然后,我们利用量化的频率导入值设计了针对特定机器的非均匀滤波器库(NUFB),从而提取出了对数均匀频谱(LNS)特征。所设计的 NUFB 在 F 比相对较高的频率区域具有更窄的带宽和更高的滤波器分布密度。最后,还提出了从 LNS 特征得出的频谱和时间调制表示法。这些提出的 LNS 特征和调制表示被输入到基于自动编码器神经网络的 ASD 检测器中。从信噪比(SNR)为 6 dB 的故障工业机器调查和检测数据集训练集的量化结果显示,不同机器的正常声音和异常声音之间的区分信息在频域中编码不均匀。通过使用 NUFBs 突出显示这些重要的频率区域,LNS 特征可以在各种信噪比条件下显著提高 AUC(接收器工作特性曲线下的面积)指标的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks
Early detection of factory machinery malfunctions is crucial in industrial applications. In machine anomalous sound detection (ASD), different machines exhibit unique vibration-frequency ranges based on their physical properties. Meanwhile, the human auditory system is adept at tracking both temporal and spectral dynamics of machine sounds. Consequently, integrating the computational auditory models of the human auditory system with machine-specific properties can be an effective approach to machine ASD. We first quantified the frequency importances of four types of machines using the Fisher ratio (F-ratio). The quantified frequency importances were then used to design machine-specific non-uniform filterbanks (NUFBs), which extract the log non-uniform spectrum (LNS) feature. The designed NUFBs have a narrower bandwidth and higher filter distribution density in frequency regions with relatively high F-ratios. Finally, spectral and temporal modulation representations derived from the LNS feature were proposed. These proposed LNS feature and modulation representations are input into an autoencoder neural-network-based detector for ASD. The quantification results from the training set of the Malfunctioning Industrial Machine Investigation and Inspection dataset with a signal-to-noise (SNR) of 6 dB reveal that the distinguishing information between normal and anomalous sounds of different machines is encoded non-uniformly in the frequency domain. By highlighting these important frequency regions using NUFBs, the LNS feature can significantly enhance performance using the metric of AUC (area under the receiver operating characteristic curve) under various SNR conditions. Furthermore, modulation representations can further improve performance. Specifically, temporal modulation is effective for fans, pumps, and sliders, while spectral modulation is particularly effective for valves.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Benchmarking Sub-Genre Classification For Mainstage Dance Music PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification Evaluation of real-time transcriptions using end-to-end ASR models Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks Harmonic Reasoning in Large Language Models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1