{"title":"Synthetic Transaural Audio Rendering (STAR): Extension to Full 3D Spatialization","authors":"Sylvain Marchand, Eric Meaux","doi":"10.17743/jaes.2022.0098","DOIUrl":"https://doi.org/10.17743/jaes.2022.0098","url":null,"abstract":"","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135886027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Joshua Mannall, Lauri Savioja, Paul Calamia, Russell Mason, Enzo De Sena
{"title":"Efficient Diffraction Modeling Using Neural Networks and Infinite Impulse Response Filters","authors":"Joshua Mannall, Lauri Savioja, Paul Calamia, Russell Mason, Enzo De Sena","doi":"10.17743/jaes.2022.0107","DOIUrl":"https://doi.org/10.17743/jaes.2022.0107","url":null,"abstract":"","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135886026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
François Salmon, Frédéric Changenet, Tom Colas, C. Verron, M. Paquier
{"title":"A Comparative Study of Multichannel Microphone Arrays Used in Classical Music Recording","authors":"François Salmon, Frédéric Changenet, Tom Colas, C. Verron, M. Paquier","doi":"10.17743/jaes.2022.0091","DOIUrl":"https://doi.org/10.17743/jaes.2022.0091","url":null,"abstract":"","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49142966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Audio-Driven Talking Face Generation: A Review","authors":"Shiguang Liu","doi":"10.17743/jaes.2022.0081","DOIUrl":"https://doi.org/10.17743/jaes.2022.0081","url":null,"abstract":".","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41963721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A High-Resolution Boundary Element Method Suitable Full Torso Mesh of KEMAR","authors":"Kat Young, G. Kearney","doi":"10.17743/jaes.2022.0085","DOIUrl":"https://doi.org/10.17743/jaes.2022.0085","url":null,"abstract":"","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43558198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jakob Bergner, Daphne Schössow, Stephan Preihs, J. Peissig
This work is motivated by the question of whether different loudspeaker-based multichannel playback methods can be robustly characterized by measurable acoustic properties. For that, underlying acoustic dimensions were identified that allow for a discriminative sound field analysis within a music reproduction scenario. The subject of investigation is a set of different musical pieces available in different multichannel playback formats. Re-recordings of the stimuli at a listening position using a spherical microphone array enable a sound field analysis that includes, in total, 237 signal-based indicators in the categories of loudness, quality, spaciousness, and time. The indicators are fed to a factor and time series analysis to identify the most relevant acoustic dimensions that reflect and explain significant parts of the variance within the acoustical data. The results show that of the eight relevant dimensions, the dimensions “High-Frequency Diffusivity,” “Elevational Diffusivity,” and “Mid-Frequency Diffusivity” are capable of identifying statistically significant differences between the loudspeaker setups. The presented approach leads to plausible results that are in accordance with the expected differences between the loudspeaker configurations used. The findings may be used for a better understanding of the effects of different loudspeaker configurations on human perception and emotional response when listening to music.
{"title":"Identification of Discriminative Acoustic Dimensions in Stereo, Surround and 3D Music Reproduction","authors":"Jakob Bergner, Daphne Schössow, Stephan Preihs, J. Peissig","doi":"10.17743/jaes.2022.0071","DOIUrl":"https://doi.org/10.17743/jaes.2022.0071","url":null,"abstract":"This work is motivated by the question of whether different loudspeaker-based multichannel playback methods can be robustly characterized by measurable acoustic properties. For that, underlying acoustic dimensions were identified that allow for a discriminative sound field analysis within a music reproduction scenario. The subject of investigation is a set of different musical pieces available in different multichannel playback formats. Re-recordings of the stimuli at a listening position using a spherical microphone array enable a sound field analysis that includes, in total, 237 signal-based indicators in the categories of loudness, quality, spaciousness, and time. The indicators are fed to a factor and time series analysis to identify the most relevant acoustic dimensions that reflect and explain significant parts of the variance within the acoustical data. The results show that of the eight relevant dimensions, the dimensions “High-Frequency Diffusivity,” “Elevational Diffusivity,” and “Mid-Frequency Diffusivity” are capable of identifying statistically significant differences between the loudspeaker setups. The presented approach leads to plausible results that are in accordance with the expected differences between the loudspeaker configurations used. The findings may be used for a better understanding of the effects of different loudspeaker configurations on human perception and emotional response when listening to music.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67642486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
F. Brinkmann, W. Kreuzer, J. Thomsen, Sergejs Dombrovskis, K. Pollack, S. Weinzierl, P. Majdak
Mesh2HRTF 1.x is an open-source and fully scriptable end-to-end pipeline for the numerical calculation of head-related transfer functions (HRTFs). The calculations are based on 3D meshes of listener’s body parts such as the head, pinna, and torso. The numerical core of Mesh2HRTF is written in C++ and employs the boundary-element method for solving the Helmholtz equation. It is accelerated by a multilevel fast multipole method and can easily be parallelized to further speed up the computations. The recently refactored framework of Mesh2HRTF 1.x contains tools for preparing the meshes as well as specific post-processing and inspection of the calculated HRTFs. The resulting HRTFs are saved in the spatially oriented format for acoustics being directly applicable in virtual and augmented reality applications and psychoacoustic research. The Mesh2HRTF 1.x code is automatically tested to assure high quality and reliability. A comprehensive online documentation enables easy access for users without in-depth knowledge of acoustic simulations.
{"title":"Recent Advances in an Open Software for Numerical HRTF Calculation","authors":"F. Brinkmann, W. Kreuzer, J. Thomsen, Sergejs Dombrovskis, K. Pollack, S. Weinzierl, P. Majdak","doi":"10.17743/jaes.2022.0078","DOIUrl":"https://doi.org/10.17743/jaes.2022.0078","url":null,"abstract":"Mesh2HRTF 1.x is an open-source and fully scriptable end-to-end pipeline for the numerical calculation of head-related transfer functions (HRTFs). The calculations are based on 3D meshes of listener’s body parts such as the head, pinna, and torso. The numerical core of Mesh2HRTF is written in C++ and employs the boundary-element method for solving the Helmholtz equation. It is accelerated by a multilevel fast multipole method and can easily be parallelized to further speed up the computations. The recently refactored framework of Mesh2HRTF 1.x contains tools for preparing the meshes as well as specific post-processing and inspection of the calculated HRTFs. The resulting HRTFs are saved in the spatially oriented format for acoustics being directly applicable in virtual and augmented reality applications and psychoacoustic research. The Mesh2HRTF 1.x code is automatically tested to assure high quality and reliability. A comprehensive online documentation enables easy access for users without in-depth knowledge of acoustic simulations.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42230053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Bivariate Mixture Space: A Compact Spectral Representation of Bivariate Signals","authors":"G. Presti","doi":"10.17743/jaes.2022.0090","DOIUrl":"https://doi.org/10.17743/jaes.2022.0090","url":null,"abstract":"","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47417680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Listener envelopment refers to the sensation of being surrounded by sound, either by multiple direct sound events or by a diffuse reverberant sound field. More recently, a specific attribute for the sensation of being covered by sound from elevated directions has been proposed by Sazdov et al. and was termed listener engulfment. The first experiment presented here investigates how the temporal and directional density of sound events affects listener envelopment. The second experiment studies how elevated loudspeaker layers affect envelopment versus engulfment. A spatial granular synthesis technique is used to precisely control the temporal and directional density of sound events. Experimental results indicate that a directionally uniform distribution of sound events at time intervals Δt < 20 ms is required to elicit a sensation of diffuse envelopment, whereas longer time intervals lead to localized auditory events. It shows that elevated loudspeaker layers do not increase envelopment but contribute specifically to listener engulfment. Low-pass-filtered stimuli enhance envelopment in directionally sparse conditions, but impede control over engulfment due to a reduction of height localization cues. The results can be exploited in the technical design and creative application of spatial sound synthesis and reverberation algorithms.
{"title":"The Effect of Temporal and Directional Density on Listener Envelopment","authors":"Stefan Riedel, M. Frank, F. Zotter","doi":"10.17743/jaes.2022.0088","DOIUrl":"https://doi.org/10.17743/jaes.2022.0088","url":null,"abstract":"Listener envelopment refers to the sensation of being surrounded by sound, either by multiple direct sound events or by a diffuse reverberant sound field. More recently, a specific attribute for the sensation of being covered by sound from elevated directions has been proposed by Sazdov et al. and was termed listener engulfment. The first experiment presented here investigates how the temporal and directional density of sound events affects listener envelopment. The second experiment studies how elevated loudspeaker layers affect envelopment versus engulfment. A spatial granular synthesis technique is used to precisely control the temporal and directional density of sound events. Experimental results indicate that a directionally uniform distribution of sound events at time intervals Δt < 20 ms is required to elicit a sensation of diffuse envelopment, whereas longer time intervals lead to localized auditory events. It shows that elevated loudspeaker layers do not increase envelopment but contribute specifically to listener engulfment. Low-pass-filtered stimuli enhance envelopment in directionally sparse conditions, but impede control over engulfment due to a reduction of height localization cues. The results can be exploited in the technical design and creative application of spatial sound synthesis and reverberation algorithms.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41563209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-Layered Architecture for Efficient and Accurate HRTF Rendering","authors":"Mick Marchan, Andrew Allen","doi":"10.17743/jaes.2022.0075","DOIUrl":"https://doi.org/10.17743/jaes.2022.0075","url":null,"abstract":"","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47853480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}