首页 > 最新文献

Journal on Audio Speech and Music Processing最新文献

英文 中文
Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones 更正:集成MVDR波束形成器,用于使用本地麦克风阵列和外部麦克风进行语音增强
IF 2.4 3区 计算机科学 Pub Date : 2021-04-06 DOI: 10.1186/s13636-021-00202-x
Randall Ali, T. van Waterschoot, M. Moonen
{"title":"Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones","authors":"Randall Ali, T. van Waterschoot, M. Moonen","doi":"10.1186/s13636-021-00202-x","DOIUrl":"https://doi.org/10.1186/s13636-021-00202-x","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-021-00202-x","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Adversarial joint training with self-attention mechanism for robust end-to-end speech recognition 具有自注意机制的对抗性联合训练用于鲁棒的端到端语音识别
IF 2.4 3区 计算机科学 Pub Date : 2021-04-03 DOI: 10.1186/s13636-021-00215-6
Lujun Li, Yikai Kang, Yucheng Shi, Ludwig Kürzinger, Tobias Watzel, G. Rigoll
{"title":"Adversarial joint training with self-attention mechanism for robust end-to-end speech recognition","authors":"Lujun Li, Yikai Kang, Yucheng Shi, Ludwig Kürzinger, Tobias Watzel, G. Rigoll","doi":"10.1186/s13636-021-00215-6","DOIUrl":"https://doi.org/10.1186/s13636-021-00215-6","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-021-00215-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48650245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain 多说话人到达方向估计的nmf加权SRP:对空间混叠的鲁棒性同时利用原子时域的稀疏性
IF 2.4 3区 计算机科学 Pub Date : 2021-03-03 DOI: 10.1186/s13636-021-00201-y
S. Thakallapalli, S. Gangashetty, N. Madhu
{"title":"NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain","authors":"S. Thakallapalli, S. Gangashetty, N. Madhu","doi":"10.1186/s13636-021-00201-y","DOIUrl":"https://doi.org/10.1186/s13636-021-00201-y","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-021-00201-y","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of transition cost and model parameters in speaker diarization for meetings 会议发言者配置的转换成本及模型参数分析
IF 2.4 3区 计算机科学 Pub Date : 2021-02-24 DOI: 10.1186/s13636-021-00196-6
Beatriz Martínez-González, J. Pardo, J. A. Vallejo-Pinto, R. San-Segundo, J. Ferreiros
{"title":"Analysis of transition cost and model parameters in speaker diarization for meetings","authors":"Beatriz Martínez-González, J. Pardo, J. A. Vallejo-Pinto, R. San-Segundo, J. Ferreiros","doi":"10.1186/s13636-021-00196-6","DOIUrl":"https://doi.org/10.1186/s13636-021-00196-6","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-021-00196-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Comparison of semi-supervised deep learning algorithms for audio classification 音频分类的半监督深度学习算法比较
IF 2.4 3区 计算机科学 Pub Date : 2021-02-16 DOI: 10.1186/s13636-022-00255-6
Léo Cances, E. Labbé, Thomas Pellegrini
{"title":"Comparison of semi-supervised deep learning algorithms for audio classification","authors":"Léo Cances, E. Labbé, Thomas Pellegrini","doi":"10.1186/s13636-022-00255-6","DOIUrl":"https://doi.org/10.1186/s13636-022-00255-6","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2022 1","pages":"1-16"},"PeriodicalIF":2.4,"publicationDate":"2021-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43872465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones 一种集成MVDR波束形成器,用于使用本地麦克风阵列和外部麦克风进行语音增强
IF 2.4 3区 计算机科学 Pub Date : 2021-02-10 DOI: 10.1186/s13636-020-00192-2
Randall Ali, T. van Waterschoot, M. Moonen
{"title":"An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones","authors":"Randall Ali, T. van Waterschoot, M. Moonen","doi":"10.1186/s13636-020-00192-2","DOIUrl":"https://doi.org/10.1186/s13636-020-00192-2","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-020-00192-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A CNN-based approach to identification of degradations in speech signals 基于cnn的语音信号退化识别方法
IF 2.4 3区 计算机科学 Pub Date : 2021-02-05 DOI: 10.1186/s13636-021-00198-4
Yuki Saishu, A. H. Poorjam, M. G. Christensen
{"title":"A CNN-based approach to identification of degradations in speech signals","authors":"Yuki Saishu, A. H. Poorjam, M. G. Christensen","doi":"10.1186/s13636-021-00198-4","DOIUrl":"https://doi.org/10.1186/s13636-021-00198-4","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-021-00198-4","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Dynamic out-of-vocabulary word registration to language model for speech recognition 面向语音识别的动态词汇外词配准语言模型
IF 2.4 3区 计算机科学 Pub Date : 2021-01-25 DOI: 10.1186/s13636-020-00193-1
N. Kitaoka, Bohan Chen, Yuya Obashi
{"title":"Dynamic out-of-vocabulary word registration to language model for speech recognition","authors":"N. Kitaoka, Bohan Chen, Yuya Obashi","doi":"10.1186/s13636-020-00193-1","DOIUrl":"https://doi.org/10.1186/s13636-020-00193-1","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-020-00193-1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A simulation study on optimal scores for speaker recognition 说话人识别最优分数的仿真研究
IF 2.4 3区 计算机科学 Pub Date : 2020-11-25 DOI: 10.1186/s13636-020-00183-3
Dong Wang
{"title":"A simulation study on optimal scores for speaker recognition","authors":"Dong Wang","doi":"10.1186/s13636-020-00183-3","DOIUrl":"https://doi.org/10.1186/s13636-020-00183-3","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2020 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2020-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-020-00183-3","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization DOANet:一种用于无人机嵌入声源定位搜救的深度扩张卷积神经网络方法
IF 2.4 3区 计算机科学 Pub Date : 2020-11-05 DOI: 10.1186/s13636-020-00184-2
Alif Bin Abdul Qayyum, K. M. N. Hassan, Adrita Anika, Md. Farhan Shadiq, M. Rahman, Md. Tariqul Islam, Sheikh Asif Imran, Shahruk Hossain, M. A. Haque
{"title":"DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization","authors":"Alif Bin Abdul Qayyum, K. M. N. Hassan, Adrita Anika, Md. Farhan Shadiq, M. Rahman, Md. Tariqul Islam, Sheikh Asif Imran, Shahruk Hossain, M. A. Haque","doi":"10.1186/s13636-020-00184-2","DOIUrl":"https://doi.org/10.1186/s13636-020-00184-2","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2020 1","pages":"1-18"},"PeriodicalIF":2.4,"publicationDate":"2020-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-020-00184-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49585126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
Journal on Audio Speech and Music Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1