Journal on Audio Speech and Music Processing: Latest Articles

Microphone utility estimation in acoustic sensor networks using single-channel signal features

IF 2.4, Zone 3, Computer Science

Journal on Audio Speech and Music Processing

Pub Date : 2022-01-24 DOI: 10.1186/s13636-023-00294-7

M. Gunther, Andreas Brendel, Walter Kellermann

Citations: 1

Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling

IF 2.4, Zone 3, Computer Science

Journal on Audio Speech and Music Processing

Pub Date : 2022-01-12 DOI: 10.1186/s13636-021-00233-4

Siqing Qin, Longbiao Wang, Sheng Li, J. Dang, Lixin Pan

Citations: 6

Auxiliary function-based algorithm for blind extraction of a moving speaker

IF 2.4, Zone 3, Computer Science

Journal on Audio Speech and Music Processing

Pub Date : 2022-01-04 DOI: 10.1186/s13636-021-00231-6

Jakub Janský, Zbyněk Koldovský, J. Málek, Tomás Kounovský, Jaroslav Cmejla

Citations: 16

On the selection of the number of beamformers in beamforming-based binaural reproduction.

IF 2.4, Zone 3, Computer Science

Journal on Audio Speech and Music Processing

Pub Date : 2022-01-01 Epub Date: 2022-03-30 DOI: 10.1186/s13636-022-00238-7

Itay Ifergan, Boaz Rafaely

In recent years, spatial audio reproduction has been widely researched, with many studies focusing on headphone-based spatial reproduction. A popular format for spatial audio is higher order Ambisonics (HOA), where a spherical microphone array is typically used to obtain the HOA signals. When a spherical array is not available, beamforming-based binaural reproduction (BFBR) can be used, where signals are captured with arrays of a general configuration. While BFBR has been shown to be useful, no comprehensive study of it has been presented, so its limitations and other design aspects are not well understood. This paper takes an initial step towards developing a theory for BFBR and develops guidelines for selecting the number of beamformers. In particular, the average directivity factor of the microphone array is proposed as a measure for supporting this selection. The effect of head-related transfer function (HRTF) order truncation that occurs when using too many beamformer directions is presented and studied. In addition, the relation between HOA-based binaural reproduction and BFBR is discussed through analysis based on a spherical array. A simulation study is then presented, based on both a spherical and a planar array, demonstrating the proposed guidelines. A listening test verifies the perceptual attributes of the methods presented in this study. These results can be used for more informed beamformer design for BFBR.


Volume 2022 1, page 6. Publication type: Journal Article. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8965231/pdf/

Citations: 0

Anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit

IF 2.4, Zone 3, Computer Science

Journal on Audio Speech and Music Processing

Pub Date : 2021-12-01 DOI: 10.1186/s13636-021-00234-3

Jiacheng Yao, J. Zhang, Jiafeng Li, L. Zhuo

Citations: 0

Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation

IF 2.4, Zone 3, Computer Science

Journal on Audio Speech and Music Processing

Pub Date : 2021-12-01 DOI: 10.1186/s13636-021-00225-4

Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, N. Kitaoka

Citations: 1

Spherical harmonic covariance and magnitude function encodings for beamformer design

IF 2.4, Zone 3, Computer Science

Journal on Audio Speech and Music Processing

Pub Date : 2021-12-01 DOI: 10.1186/s13636-021-00230-7

Yuancheng Luo

Citations: 0

U2-VC: one-shot voice conversion using two-level nested U-structure

IF 2.4, Zone 3, Computer Science

Journal on Audio Speech and Music Processing

Pub Date : 2021-11-24 DOI: 10.1186/s13636-021-00226-3

Fangkun Liu, Hui Wang, Renhua Peng, C. Zheng, Xiaodong Li

Citations: 1

dEchorate: a calibrated room impulse response dataset for echo-aware signal processing

IF 2.4, Zone 3, Computer Science

Journal on Audio Speech and Music Processing

Pub Date : 2021-11-23 DOI: 10.1186/s13636-021-00229-0

D. Carlo, Pinchas Tandeitnik, C. Foy, N. Bertin, Antoine Deleforge, S. Gannot

Citations: 17

Robust single- and multi-loudspeaker least-squares-based equalization for hearing devices

IF 2.4, Zone 3, Computer Science

Journal on Audio Speech and Music Processing

Pub Date : 2021-09-09 DOI: 10.1186/s13636-022-00247-6

H. Schepker, Florian Denk, B. Kollmeier, S. Doclo

Citations: 3
