Pub Date : 2012-02-01DOI: 10.1109/TASL.2011.2177574
T. Arai, Nao Hodoshima, K. Yasu
In the above titled paper (ibid., vol. 18, no. 7, pp. 1775-1780, Sep. 2010), several IPA fonts were displayed incorrectly. The correct fonts are presented here.
在上述标题论文(同上,第十八卷,第。7, pp. 1775-1780, 2010年9月),一些国际音标字体显示不正确。这里给出了正确的字体。
{"title":"Errata to \"Using Steady-State Suppression to Improve Speech Intelligibility in Reverberant Environments for Elderly Listeners\"","authors":"T. Arai, Nao Hodoshima, K. Yasu","doi":"10.1109/TASL.2011.2177574","DOIUrl":"https://doi.org/10.1109/TASL.2011.2177574","url":null,"abstract":"In the above titled paper (ibid., vol. 18, no. 7, pp. 1775-1780, Sep. 2010), several IPA fonts were displayed incorrectly. The correct fonts are presented here.","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"7 1","pages":"709"},"PeriodicalIF":0.0,"publicationDate":"2012-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73731987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2011-07-01DOI: 10.1109/TASL.2010.2082533
Aastha Gupta, T. Abhayapala
Three-dimensional spatial sound field reproduction enables enhanced immersive acoustic experience for a listener. Recreating an arbitrary 3-D spatial sound field using a practically realizable array of loudspeakers is a challenging problem in acoustic signal processing. This paper exploits the underlying characteristics of wavefield propagation to devise a strategy for accurate 3-D sound field reproduction inside a 3-D region of interest with practical array geometries. Specifically, we use the properties of the associated Legendre functions and the spherical Hankel functions, which are part of the solution to the wave equation in spherical coordinates, for loudspeaker placement on a set of multiple circular arrays and provide a technique for spherical harmonic mode-selection to control the reproduced sound field. We also analyze the artifacts of spatial aliasing due to the use of discrete loudspeaker arrays in the region of interest. As an illustration, we design a third-order reproduction system to operate at a frequency of 500 Hz with 18 loudspeakers arranged in a practically realizable configuration.
{"title":"Three-Dimensional Sound Field Reproduction Using Multiple Circular Loudspeaker Arrays","authors":"Aastha Gupta, T. Abhayapala","doi":"10.1109/TASL.2010.2082533","DOIUrl":"https://doi.org/10.1109/TASL.2010.2082533","url":null,"abstract":"Three-dimensional spatial sound field reproduction enables enhanced immersive acoustic experience for a listener. Recreating an arbitrary 3-D spatial sound field using a practically realizable array of loudspeakers is a challenging problem in acoustic signal processing. This paper exploits the underlying characteristics of wavefield propagation to devise a strategy for accurate 3-D sound field reproduction inside a 3-D region of interest with practical array geometries. Specifically, we use the properties of the associated Legendre functions and the spherical Hankel functions, which are part of the solution to the wave equation in spherical coordinates, for loudspeaker placement on a set of multiple circular arrays and provide a technique for spherical harmonic mode-selection to control the reproduced sound field. We also analyze the artifacts of spatial aliasing due to the use of discrete loudspeaker arrays in the region of interest. As an illustration, we design a third-order reproduction system to operate at a frequency of 500 Hz with 18 loudspeakers arranged in a practically realizable configuration.","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"267 1","pages":"1149-1159"},"PeriodicalIF":0.0,"publicationDate":"2011-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79566481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2010-09-01DOI: 10.1109/TASL.2010.2062590
T. Nakatani, Walter Kellermann, P. Naylor, M. Miyoshi, B. Juang
The 17 papers in this special issue focus on the methodologies and applications of processing reverberant speech. The issue highlights some major aspects of the recent progress in the field.
{"title":"Introduction to the Special Issue on Processing Reverberant Speech: Methodologies and Applications","authors":"T. Nakatani, Walter Kellermann, P. Naylor, M. Miyoshi, B. Juang","doi":"10.1109/TASL.2010.2062590","DOIUrl":"https://doi.org/10.1109/TASL.2010.2062590","url":null,"abstract":"The 17 papers in this special issue focus on the methodologies and applications of processing reverberant speech. The issue highlights some major aspects of the recent progress in the field.","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"62 1","pages":"1673-1675"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74348423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2010-07-01DOI: 10.1109/TASL.2010.2051826
Y. Stylianou, T. Toda, C. Wu, A. Kain, O. Rosec
{"title":"Introduction to the Special Section on Voice Transformation","authors":"Y. Stylianou, T. Toda, C. Wu, A. Kain, O. Rosec","doi":"10.1109/TASL.2010.2051826","DOIUrl":"https://doi.org/10.1109/TASL.2010.2051826","url":null,"abstract":"","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"49 1","pages":"909-911"},"PeriodicalIF":0.0,"publicationDate":"2010-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73987799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2010-07-01DOI: 10.1109/TASL.2009.2031286
S. Abdallah
This paper examined a number of problems with the derivation of the approximation of the model-IR, including an algebraic error, some inconsistent or untenable assumptions, and an unsupported approximation. These problems make it difficult to judge the significance of the results presented therein.
{"title":"Comment on \"Unified View of Prediction and Repetition Structure in Audio Signals With Application to Interest Point Detection\"","authors":"S. Abdallah","doi":"10.1109/TASL.2009.2031286","DOIUrl":"https://doi.org/10.1109/TASL.2009.2031286","url":null,"abstract":"This paper examined a number of problems with the derivation of the approximation of the model-IR, including an algebraic error, some inconsistent or untenable assumptions, and an unsupported approximation. These problems make it difficult to judge the significance of the results presented therein.","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"106 1","pages":"1063-1065"},"PeriodicalIF":0.0,"publicationDate":"2010-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75661025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2010-05-01DOI: 10.1109/TASL.2010.2046449
V. Välimäki, F. Fontana, J. Smith, U. Zölzer
The 16 papers in this special issue focus on virtual audio effects and musical instruments.
这期特刊的16篇论文聚焦于虚拟音频效果和乐器。
{"title":"Introduction to the Special Issue on Virtual Analog Audio Effects and Musical Instruments","authors":"V. Välimäki, F. Fontana, J. Smith, U. Zölzer","doi":"10.1109/TASL.2010.2046449","DOIUrl":"https://doi.org/10.1109/TASL.2010.2046449","url":null,"abstract":"The 16 papers in this special issue focus on virtual audio effects and musical instruments.","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"9 1","pages":"713-714"},"PeriodicalIF":0.0,"publicationDate":"2010-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75927426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2010-03-01DOI: 10.1109/TASL.2010.2042225
B. David, Masataka Goto, L. Daudet, P. Smaragdis
The 23 papers in this special issue focus on signal models and representations of musical and environmental sounds.
这期特刊中的23篇论文集中在信号模型和音乐和环境声音的表示上。
{"title":"Editorial for the Special Issue on Signal Models and Representations of Musical and Environmental Sounds","authors":"B. David, Masataka Goto, L. Daudet, P. Smaragdis","doi":"10.1109/TASL.2010.2042225","DOIUrl":"https://doi.org/10.1109/TASL.2010.2042225","url":null,"abstract":"The 23 papers in this special issue focus on signal models and representations of musical and environmental sounds.","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"20 1","pages":"417-419"},"PeriodicalIF":0.0,"publicationDate":"2010-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73145396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2009-07-01DOI: 10.1109/TASL.2009.2023314
R. Sarikaya, K. Kirchhoff, Tanja Schultz, Dilek Z. Hakkani-Tür
The 12 papers in this special issue span a variety of speech and language processing applications highlighting the challenges and providing solutions for dealing with the morphological complexity of different languages.
{"title":"Introduction to the Special Issue on Processing Morphologically Rich Languages","authors":"R. Sarikaya, K. Kirchhoff, Tanja Schultz, Dilek Z. Hakkani-Tür","doi":"10.1109/TASL.2009.2023314","DOIUrl":"https://doi.org/10.1109/TASL.2009.2023314","url":null,"abstract":"The 12 papers in this special issue span a variety of speech and language processing applications highlighting the challenges and providing solutions for dealing with the morphological complexity of different languages.","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"44 1","pages":"861-862"},"PeriodicalIF":0.0,"publicationDate":"2009-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83899971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}