
Acoustical Science and Technology: Latest Articles

Joint analysis of acoustic scenes and sound events based on multitask learning with dynamic weight adaptation
Q4 ACOUSTICS Pub Date : 2023-05-01 DOI: 10.1250/ast.44.167
Kayo Nada, Keisuke Imoto, Takao Tsuchiya
Acoustic scene classification (ASC) and sound event detection (SED) are major topics in environmental sound analysis. Considering that acoustic scenes and sound events are closely related to each other, the joint analysis of acoustic scenes and sound events using multitask learning (MTL)-based neural networks was proposed in some previous works. Conventional methods train MTL-based models using a linear combination of ASC and SED loss functions with constant weights. However, the performance of conventional MTL-based methods depends strongly on the weights of the ASC and SED losses, and it is difficult to determine the appropriate balance between the constant weights of the losses of MTL of ASC and SED. In this paper, we thus propose dynamic weight adaptation methods for MTL of ASC and SED based on dynamic weight average (DWA) and multi-focal loss (MFL) to adjust the learning weights automatically. By comparing the two methods, we then clarify how the dynamic adaptation of the loss weights, rather than specific methods of DWA and MFL, generally benefits the joint analysis of ASC and SED based on MTL. Moreover, we investigate how the training of the joint ASC and SED model dynamically progresses and disclose how the loss weights affect their performance.
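The abstract does not spell out the weighting rule, so the following is only a minimal sketch of the commonly cited dynamic weight average (DWA) formulation, in which each task weight is a softmax over the ratio of that task's two most recent losses. The function name `dwa_weights`, the temperature value of 2.0, and the loss values are illustrative assumptions, not taken from the paper.

```python
import math

def dwa_weights(prev_losses, prev_prev_losses, temperature=2.0):
    """Return one loss weight per task using the dynamic weight average rule."""
    # Ratio of the two most recent losses: closer to 1 means slower descent.
    ratios = [l1 / l2 for l1, l2 in zip(prev_losses, prev_prev_losses)]
    exps = [math.exp(r / temperature) for r in ratios]
    total = sum(exps)
    k = len(prev_losses)
    # Softmax scaled by the number of tasks, so the weights sum to k.
    return [k * e / total for e in exps]

# Hypothetical epoch-level losses, ordered as (ASC, SED).
history = [(1.20, 0.80), (0.90, 0.76)]
w_asc, w_sed = dwa_weights(history[-1], history[-2])

# Weighted multitask loss for the next epoch (placeholder loss values).
next_asc_loss, next_sed_loss = 0.85, 0.74
total_loss = w_asc * next_asc_loss + w_sed * next_sed_loss
```

In this toy history the SED loss is falling more slowly than the ASC loss, so the rule assigns SED the larger weight. A constant-weight baseline would correspond to fixing `w_asc` and `w_sed` by hand.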
Citations: 0
Formant estimation of high-pitched noisy speech using homomorphic deconvolution of higher-order group delay spectrum
IF 0.7 Q4 ACOUSTICS Pub Date : 2023-03-01 DOI: 10.1250/ast.44.84
Husne Ara Chowdhury, Mohammad Shahidur Rahman
Citations: 0
Heavy-weight floor impact sound of a pure framed structure by field measurement and numerical calculation
IF 0.7 Q4 ACOUSTICS Pub Date : 2023-03-01 DOI: 10.1250/ast.44.120
Tomoaki Uemura, N. Hashimoto, Yasuyuki Kondo
Citations: 0
Speech source separation avoiding initial value dependency by cepstral-basis-decomposed nonnegative matrix factorization
IF 0.7 Q4 ACOUSTICS Pub Date : 2023-03-01 DOI: 10.1250/ast.44.137
Fuga Oshima, M. Nakayama
Citations: 0
Automatic generation of stage data for music games with sparse target density
IF 0.7 Q4 ACOUSTICS Pub Date : 2023-03-01 DOI: 10.1250/ast.44.49
Atsuhito Udo, N. Aoki, Y. Dobashi
Citations: 0
Bit rate required for mono audio object in object-based audio program compressed with MPEG-H 3D Audio
IF 0.7 Q4 ACOUSTICS Pub Date : 2023-03-01 DOI: 10.1250/ast.44.93
T. Sugimoto
Citations: 0
Effect of varying corridor parameters on signal-to-noise ratio in classrooms
IF 0.7 Q4 ACOUSTICS Pub Date : 2023-03-01 DOI: 10.1250/ast.44.110
Hengling Song
Citations: 0
Two-dimensional finite-difference time-domain simulation of moving sound source and receiver with directivity
Q4 ACOUSTICS Pub Date : 2023-03-01 DOI: 10.1250/ast.44.101
Takao Tsuchiya, Yusuke Makino, Yu Teshima, Shizuko Hiryu
This paper reports on the implementation of a moving sound source and receiver with directivity in the two-dimensional finite-difference time-domain (FDTD) method. A two-dimensional fundamental solution of a moving monopole source is theoretically derived. Then, a fundamental solution of a moving dipole source is obtained by differentiating the fundamental solution of a monopole source in space. Finally, the directivity of moving monopole, dipole, and cardioid sources is theoretically derived. Numerical experiments performed on the two-dimensional sound field showed that the effect of moving velocity on amplitude differs for the monopole and dipole sources. Furthermore, it was found that directivity characteristics of dipole and cardioid sources vary depending on the beam steering angle and moving direction. The present method can be accurately applied to the moving sound source and receiver with directivity.
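For orientation, a moving monopole on a two-dimensional grid can be sketched with a plain second-order scalar-wave FDTD update. This is not the formulation derived in the paper: the bilinear spreading of the source over neighbouring nodes, the pressure-release (p = 0) boundary, and every parameter value are assumptions chosen only to keep the example small and stable.

```python
import math

def simulate(n=61, steps=60, c=340.0, dx=0.05, speed=20.0, freq=1000.0):
    """Advance a 2D pressure field driven by a monopole moving along +x."""
    dt = 0.99 * dx / (c * math.sqrt(2.0))    # just inside the 2D CFL limit
    lam2 = (c * dt / dx) ** 2                # squared Courant number
    prev = [[0.0] * n for _ in range(n)]
    cur = [[0.0] * n for _ in range(n)]
    for step in range(steps):
        t = step * dt
        nxt = [[0.0] * n for _ in range(n)]  # edge nodes stay at p = 0
        for i in range(1, n - 1):
            for j in range(1, n - 1):
                lap = (cur[i + 1][j] + cur[i - 1][j] + cur[i][j + 1]
                       + cur[i][j - 1] - 4.0 * cur[i][j])
                nxt[i][j] = 2.0 * cur[i][j] - prev[i][j] + lam2 * lap
        # Source position in grid units; it generally falls between nodes.
        fx = 20.0 + speed * t / dx
        fy = float(n // 2)
        i0, j0 = int(fx), int(fy)
        ax, ay = fx - i0, fy - j0
        s = math.sin(2.0 * math.pi * freq * t)  # source term, arbitrary units
        # Spread the source over the four surrounding nodes (bilinear weights).
        for di, wx in ((0, 1.0 - ax), (1, ax)):
            for dj, wy in ((0, 1.0 - ay), (1, ay)):
                nxt[i0 + di][j0 + dj] += wx * wy * s
        prev, cur = cur, nxt
    return cur, dt

field, dt = simulate()
peak = max(abs(v) for row in field for v in row)
```

A dipole could be approximated in the same spirit by injecting two such monopoles of opposite sign a short distance apart, which mirrors the abstract's remark that the dipole solution follows from differentiating the monopole solution in space.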
Citations: 1
Lamb wave pulse compression in airborne ultrasound excitation
IF 0.7 Q4 ACOUSTICS Pub Date : 2023-03-01 DOI: 10.1250/ast.44.141
K. Shimizu, A. Osumi, Youichi Ito
Citations: 0
Abstracts of Papers in the Journal of the Acoustical Society of Japan (J)
IF 0.7 Q4 ACOUSTICS Pub Date : 2023-03-01 DOI: 10.1250/ast.44.155
Citations: 0