在预训练声学模型的背景下训练基于滤波器的耳蜗模型

Acoustics Pub Date : 2024-05-17 DOI:10.3390/acoustics6020025
Louise Coppieters de Gibson, Philip N. Garner
{"title":"在预训练声学模型的背景下训练基于滤波器的耳蜗模型","authors":"Louise Coppieters de Gibson, Philip N. Garner","doi":"10.3390/acoustics6020025","DOIUrl":null,"url":null,"abstract":"Auditory research aims in general to lead to understanding of physiological processes. By contrast, the state of the art in automatic speech processing (notably recognition) is dominated by large pre-trained models that are meant to be used as black-boxes. In this work, we integrate a physiologically plausible (albeit simple filter-based) model of the cochlea into a much larger pre-trained acoustic model for speech recognition. We show that the hybrid system can be trained and evaluated with various combinations of fine-tuning and self-supervision. The results broadly show that the system automatically yields structures that are known to work well. Moreover, these structures lack artifacts that were apparent in (our) previous work using less sophisticated neural models. We conclude that the hybrid structure is an appropriate way to proceed in auditory research, more generally allowing the work to take advantage of larger models and databases from which it would not otherwise benefit.","PeriodicalId":502373,"journal":{"name":"Acoustics","volume":"2 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models\",\"authors\":\"Louise Coppieters de Gibson, Philip N. Garner\",\"doi\":\"10.3390/acoustics6020025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Auditory research aims in general to lead to understanding of physiological processes. By contrast, the state of the art in automatic speech processing (notably recognition) is dominated by large pre-trained models that are meant to be used as black-boxes. In this work, we integrate a physiologically plausible (albeit simple filter-based) model of the cochlea into a much larger pre-trained acoustic model for speech recognition. We show that the hybrid system can be trained and evaluated with various combinations of fine-tuning and self-supervision. The results broadly show that the system automatically yields structures that are known to work well. Moreover, these structures lack artifacts that were apparent in (our) previous work using less sophisticated neural models. We conclude that the hybrid structure is an appropriate way to proceed in auditory research, more generally allowing the work to take advantage of larger models and databases from which it would not otherwise benefit.\",\"PeriodicalId\":502373,\"journal\":{\"name\":\"Acoustics\",\"volume\":\"2 4\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Acoustics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/acoustics6020025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acoustics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/acoustics6020025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

听觉研究的总体目标是了解生理过程。相比之下,自动语音处理(尤其是识别)领域的最新研究成果则是以大型预训练模型为主导,这些模型被当作黑盒子使用。在这项工作中,我们将一个生理上可信的(尽管是基于简单滤波器的)耳蜗模型集成到一个更大的预训练声学模型中,用于语音识别。我们的研究表明,该混合系统可以通过微调和自我监督的不同组合进行训练和评估。结果广泛表明,该系统能自动生成已知效果良好的结构。此外,这些结构缺乏(我们)以前使用不太复杂的神经模型所产生的假象。我们的结论是,混合结构是进行听觉研究的一种合适方法,它能让研究工作更广泛地利用大型模型和数据库,而这些模型和数据库是无法从中获益的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models
Auditory research aims in general to lead to understanding of physiological processes. By contrast, the state of the art in automatic speech processing (notably recognition) is dominated by large pre-trained models that are meant to be used as black-boxes. In this work, we integrate a physiologically plausible (albeit simple filter-based) model of the cochlea into a much larger pre-trained acoustic model for speech recognition. We show that the hybrid system can be trained and evaluated with various combinations of fine-tuning and self-supervision. The results broadly show that the system automatically yields structures that are known to work well. Moreover, these structures lack artifacts that were apparent in (our) previous work using less sophisticated neural models. We conclude that the hybrid structure is an appropriate way to proceed in auditory research, more generally allowing the work to take advantage of larger models and databases from which it would not otherwise benefit.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Acoustic Analysis of a Hybrid Propulsion System for Drone Applications A Study on Adaptive Implicit–Explicit and Explicit–Explicit Time Integration Procedures for Wave Propagation Analyses Silent Neonatal Incubators, Prototype Nica+ The Effect of an Emotionalizing Sound Design on the Driver’s Choice of Headway in a Driving Simulator Atmospheric Sound Propagation over Rough Sea: Numerical Evaluation of Equivalent Acoustic Impedance of Varying Sea States
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1