Bidirectional generative adversarial representation learning for natural stimulus synthesis.

IF 2.1 3区 医学 Q3 NEUROSCIENCES Journal of neurophysiology Pub Date : 2024-10-01 Epub Date: 2024-08-28 DOI:10.1152/jn.00421.2023
Johnny Reilly, John D Goodwin, Sihao Lu, Andriy S Kozlov
{"title":"Bidirectional generative adversarial representation learning for natural stimulus synthesis.","authors":"Johnny Reilly, John D Goodwin, Sihao Lu, Andriy S Kozlov","doi":"10.1152/jn.00421.2023","DOIUrl":null,"url":null,"abstract":"<p><p>Thousands of species use vocal signals to communicate with one another. Vocalizations carry rich information, yet characterizing and analyzing these complex, high-dimensional signals is difficult and prone to human bias. Moreover, animal vocalizations are ethologically relevant stimuli whose representation by auditory neurons is an important subject of research in sensory neuroscience. A method that can efficiently generate naturalistic vocalization waveforms would offer an unlimited supply of stimuli with which to probe neuronal computations. Although unsupervised learning methods allow for the projection of vocalizations into low-dimensional latent spaces learned from the waveforms themselves, and generative modeling allows for the synthesis of novel vocalizations for use in downstream tasks, we are not aware of any model that combines these tasks to synthesize naturalistic vocalizations in the waveform domain for stimulus playback. In this paper, we demonstrate BiWaveGAN: a bidirectional generative adversarial network (GAN) capable of learning a latent representation of ultrasonic vocalizations (USVs) from mice. We show that BiWaveGAN can be used to generate, and interpolate between, realistic vocalization waveforms. We then use these synthesized stimuli along with natural USVs to probe the sensory input space of mouse auditory cortical neurons. We show that stimuli generated from our method evoke neuronal responses as effectively as real vocalizations, and produce receptive fields with the same predictive power. BiWaveGAN is not restricted to mouse USVs but can be used to synthesize naturalistic vocalizations of any animal species and interpolate between vocalizations of the same or different species, which could be useful for probing categorical boundaries in representations of ethologically relevant auditory signals.<b>NEW & NOTEWORTHY</b> A new type of artificial neural network is presented that can be used to generate animal vocalization waveforms and interpolate between them to create new vocalizations. We find that our synthetic naturalistic stimuli drive auditory cortical neurons in the mouse equally well and produce receptive field features with the same predictive power as those obtained with natural mouse vocalizations, confirming the quality of the stimuli produced by the neural network.</p>","PeriodicalId":16563,"journal":{"name":"Journal of neurophysiology","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of neurophysiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1152/jn.00421.2023","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/28 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Thousands of species use vocal signals to communicate with one another. Vocalizations carry rich information, yet characterizing and analyzing these complex, high-dimensional signals is difficult and prone to human bias. Moreover, animal vocalizations are ethologically relevant stimuli whose representation by auditory neurons is an important subject of research in sensory neuroscience. A method that can efficiently generate naturalistic vocalization waveforms would offer an unlimited supply of stimuli with which to probe neuronal computations. Although unsupervised learning methods allow for the projection of vocalizations into low-dimensional latent spaces learned from the waveforms themselves, and generative modeling allows for the synthesis of novel vocalizations for use in downstream tasks, we are not aware of any model that combines these tasks to synthesize naturalistic vocalizations in the waveform domain for stimulus playback. In this paper, we demonstrate BiWaveGAN: a bidirectional generative adversarial network (GAN) capable of learning a latent representation of ultrasonic vocalizations (USVs) from mice. We show that BiWaveGAN can be used to generate, and interpolate between, realistic vocalization waveforms. We then use these synthesized stimuli along with natural USVs to probe the sensory input space of mouse auditory cortical neurons. We show that stimuli generated from our method evoke neuronal responses as effectively as real vocalizations, and produce receptive fields with the same predictive power. BiWaveGAN is not restricted to mouse USVs but can be used to synthesize naturalistic vocalizations of any animal species and interpolate between vocalizations of the same or different species, which could be useful for probing categorical boundaries in representations of ethologically relevant auditory signals.NEW & NOTEWORTHY A new type of artificial neural network is presented that can be used to generate animal vocalization waveforms and interpolate between them to create new vocalizations. We find that our synthetic naturalistic stimuli drive auditory cortical neurons in the mouse equally well and produce receptive field features with the same predictive power as those obtained with natural mouse vocalizations, confirming the quality of the stimuli produced by the neural network.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于自然刺激合成的双向生成对抗表征学习。
发声蕴含着丰富的信息,但描述和分析这些高维信号十分困难,而且容易受到人类偏见的影响。此外,动物发声是与人种学相关的刺激,其听觉神经元的表征是感觉神经科学研究的重要课题。一种能有效生成自然发声波形的方法将为探测神经元计算提供无限的刺激。虽然无监督学习方法可以将发声投射到从波形本身学习到的低维潜在空间,生成模型可以合成新颖的发声用于下游任务,但目前还没有一种方法可以将这些任务结合起来,生成自然的发声波形用于刺激回放。在这里,我们展示了 BiWaveGAN:一种双向生成对抗网络(GAN),能够学习小鼠超声发声(USV)的潜在表示。我们展示了 BiWaveGAN 可用于生成逼真的发声波形,并在这些波形之间进行插值。然后,我们使用这些合成刺激和自然 USV 来探测小鼠听觉皮层神经元的感觉输入空间。我们的研究表明,用我们的方法生成的刺激能像真实发声一样有效地唤起神经元的反应,并产生具有相同预测能力的感受野。BiWaveGAN 并不局限于小鼠的 USV,它可以用来合成任何动物物种的自然发声,并在同一物种或不同物种的发声之间进行插值,这对于探测与人种学相关的听觉信号表征中的分类界限非常有用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of neurophysiology
Journal of neurophysiology 医学-神经科学
CiteScore
4.80
自引率
8.00%
发文量
255
审稿时长
2-3 weeks
期刊介绍: The Journal of Neurophysiology publishes original articles on the function of the nervous system. All levels of function are included, from the membrane and cell to systems and behavior. Experimental approaches include molecular neurobiology, cell culture and slice preparations, membrane physiology, developmental neurobiology, functional neuroanatomy, neurochemistry, neuropharmacology, systems electrophysiology, imaging and mapping techniques, and behavioral analysis. Experimental preparations may be invertebrate or vertebrate species, including humans. Theoretical studies are acceptable if they are tied closely to the interpretation of experimental data and elucidate principles of broad interest.
期刊最新文献
A 5-week centrifuge-based G training with feedback on the magnitude of G force, does not improve the perception of roll tilt during simulated coordinated turns. ALTERED CONTROL OF BREATHING IN A RAT MODEL OF ALLERGIC LOWER AIRWAY INFLAMMATION. Ictal and interictal epileptic networks of 34 patients with Hypothalamic Hamartoma on scalp electroencephalography. Investigating premotor corticospinal excitability in fast and slow voluntary contractions of the elbow flexors. Rat movements reflect internal decision dynamics in an evidence accumulation task.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1