首页 > 最新文献

JASA express letters最新文献

英文 中文
A metric to quantify the time-varying characteristics of underwater acoustic communication channels. 量化水下声学通信信道时变特性的指标。
IF 1.2 Q3 ACOUSTICS Pub Date : 2024-07-01 DOI: 10.1121/10.0026601
Xianpeng Li, Yupeng Tai, Haibin Wang, Jun Wang, Shuo Jia, Yonglin Zhang, Weiming Gan

Underwater acoustic communication signals suffer from time dispersion due to time-varying multipath propagation in the ocean. This leads to intersymbol interference, which in turn degrades the performance of the communication system. Typically, the channel correlation functions are employed to describe these characteristics. In this paper, a metric called the channel average correlation coefficient (CACC) is proposed from the correlation function to quantify the time-varying characteristics. It has a theoretical negative relationship with communication performance. Comparative analysis involving simulations and experimental data processing highlights the superior effectiveness of CACC over the traditional metric, the channel coherence time.

由于海洋中的时变多径传播,水下声学通信信号受到时间色散的影响。这会导致符号间干扰,进而降低通信系统的性能。通常情况下,采用信道相关函数来描述这些特性。本文从相关函数出发,提出了一种称为信道平均相关系数(CACC)的指标来量化时变特性。理论上,它与通信性能呈负相关。通过模拟和实验数据处理的对比分析,CACC 比传统指标--信道相干时间--更加有效。
{"title":"A metric to quantify the time-varying characteristics of underwater acoustic communication channels.","authors":"Xianpeng Li, Yupeng Tai, Haibin Wang, Jun Wang, Shuo Jia, Yonglin Zhang, Weiming Gan","doi":"10.1121/10.0026601","DOIUrl":"https://doi.org/10.1121/10.0026601","url":null,"abstract":"<p><p>Underwater acoustic communication signals suffer from time dispersion due to time-varying multipath propagation in the ocean. This leads to intersymbol interference, which in turn degrades the performance of the communication system. Typically, the channel correlation functions are employed to describe these characteristics. In this paper, a metric called the channel average correlation coefficient (CACC) is proposed from the correlation function to quantify the time-varying characteristics. It has a theoretical negative relationship with communication performance. Comparative analysis involving simulations and experimental data processing highlights the superior effectiveness of CACC over the traditional metric, the channel coherence time.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"4 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141560404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrast factor for standing-wave radiation forces on spheres: Series expansion in powers of sphere radius. 球体上驻波辐射力的对比系数:球体半径幂级数展开。
IF 1.2 Q3 ACOUSTICS Pub Date : 2024-07-01 DOI: 10.1121/10.0027928
Philip L Marston

Recently researchers often normalize the radiation force on spheres in standing waves in inviscid fluids using an acoustic contrast factor (typically denoted by Φ) that is independent of kR where k is the wave number and R is the sphere radius. An alternative normalization uses a function Ys that depends on kR. Here, standard results for Φ are extended as a power series in kR using prior Ys results. Also, new terms are found for fluid spheres and applied to the kR dependence of Φ for strongly responsive and weakly responsive examples. Partial-wave phase shifts are used in the derivation.

最近,研究人员经常使用与 kR 无关的声学对比系数(通常用 Φ 表示)对无粘性流体中驻波对球体的辐射力进行归一化,其中 k 是波数,R 是球体半径。另一种归一化方法是使用取决于 kR 的函数 Ys。在此,利用先前的 Ys 结果,将 Φ 的标准结果扩展为 kR 的幂级数。此外,还发现了流体球体的新项,并将其应用于强响应和弱响应示例的 Φ 的 kR 依赖性。在推导过程中使用了部分波相移。
{"title":"Contrast factor for standing-wave radiation forces on spheres: Series expansion in powers of sphere radius.","authors":"Philip L Marston","doi":"10.1121/10.0027928","DOIUrl":"https://doi.org/10.1121/10.0027928","url":null,"abstract":"<p><p>Recently researchers often normalize the radiation force on spheres in standing waves in inviscid fluids using an acoustic contrast factor (typically denoted by Φ) that is independent of kR where k is the wave number and R is the sphere radius. An alternative normalization uses a function Ys that depends on kR. Here, standard results for Φ are extended as a power series in kR using prior Ys results. Also, new terms are found for fluid spheres and applied to the kR dependence of Φ for strongly responsive and weakly responsive examples. Partial-wave phase shifts are used in the derivation.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"4 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141617765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speech intelligibility and talker identification with non-telephone frequencies. 使用非电话频率的语音清晰度和通话者识别。
IF 1.2 Q3 ACOUSTICS Pub Date : 2024-07-01 DOI: 10.1121/10.0027938
Xianhui Wang, Jonathan Ge, Leo Meller, Ye Yang, Fan-Gang Zeng

Although the telephone band (0.3-3 kHz) provides sufficient information for speech recognition, the contribution of the non-telephone band (<0.3 and >3 kHz) is unclear. To investigate its contribution, speech intelligibility and talker identification were evaluated using consonants, vowels, and sentences. The non-telephone band produced relatively good intelligibility for consonants (76.0%) and sentences (77.4%), but not vowels (11.5%). The non-telephone band supported good talker identification only with sentences (74.5%), but not vowels (45.8%) or consonants (10.8%). Furthermore, the non-telephone band cannot produce satisfactory speech intelligibility in noise at the sentence level, suggesting the importance of full-band access in realistic listening.

虽然电话频段(0.3-3 kHz)为语音识别提供了足够的信息,但非电话频段(3 kHz)的贡献尚不清楚。为了研究非电话频段的贡献,我们使用辅音、元音和句子对语音清晰度和通话者识别进行了评估。非耳机频段对辅音(76.0%)和句子(77.4%)的可懂度相对较好,但对元音(11.5%)的可懂度较差。非耳机频段只在句子(74.5%)、元音(45.8%)和辅音(10.8%)方面支持良好的说话者识别。此外,非耳机频段无法在噪音中产生令人满意的句子级语音清晰度,这表明全频段接入在实际听力中的重要性。
{"title":"Speech intelligibility and talker identification with non-telephone frequencies.","authors":"Xianhui Wang, Jonathan Ge, Leo Meller, Ye Yang, Fan-Gang Zeng","doi":"10.1121/10.0027938","DOIUrl":"10.1121/10.0027938","url":null,"abstract":"<p><p>Although the telephone band (0.3-3 kHz) provides sufficient information for speech recognition, the contribution of the non-telephone band (<0.3 and >3 kHz) is unclear. To investigate its contribution, speech intelligibility and talker identification were evaluated using consonants, vowels, and sentences. The non-telephone band produced relatively good intelligibility for consonants (76.0%) and sentences (77.4%), but not vowels (11.5%). The non-telephone band supported good talker identification only with sentences (74.5%), but not vowels (45.8%) or consonants (10.8%). Furthermore, the non-telephone band cannot produce satisfactory speech intelligibility in noise at the sentence level, suggesting the importance of full-band access in realistic listening.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"4 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141763039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The tonotopic cochlea puzzle: A resonant transmission line with a "non-resonant" response peak. 声调耳蜗之谜:具有 "非共振 "响应峰的共振传输线
IF 1.2 Q3 ACOUSTICS Pub Date : 2024-07-01 DOI: 10.1121/10.0028020
Renata Sisto, Arturo Moleti

The peaked cochlear tonotopic response does not show the typical phenomenology of a resonant system. Simulations of a 2 D viscous model show that the position of the peak is determined by the competition between a sharp pressure boost due to the increase in the real part of the wavenumber as the forward wave enters the short-wave region, and a sudden increase in the viscous losses, partly counteracted by the input power provided by the outer hair cells. This viewpoint also explains the peculiar experimental behavior of the cochlear admittance (broadly tuned and almost level-independent) in the peak region.

峰值耳蜗声调反应并不显示共振系统的典型现象。对 2 D 粘滞模型的模拟表明,峰值的位置是由前向波进入短波区时,由于实际波数的增加而产生的急剧压力提升和粘滞损耗的突然增加之间的竞争决定的,其中部分被外毛细胞提供的输入功率所抵消。这一观点也解释了耳蜗导纳在峰值区域的特殊实验行为(宽调谐且几乎与电平无关)。
{"title":"The tonotopic cochlea puzzle: A resonant transmission line with a \"non-resonant\" response peak.","authors":"Renata Sisto, Arturo Moleti","doi":"10.1121/10.0028020","DOIUrl":"10.1121/10.0028020","url":null,"abstract":"<p><p>The peaked cochlear tonotopic response does not show the typical phenomenology of a resonant system. Simulations of a 2 D viscous model show that the position of the peak is determined by the competition between a sharp pressure boost due to the increase in the real part of the wavenumber as the forward wave enters the short-wave region, and a sudden increase in the viscous losses, partly counteracted by the input power provided by the outer hair cells. This viewpoint also explains the peculiar experimental behavior of the cochlear admittance (broadly tuned and almost level-independent) in the peak region.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"4 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141728397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Information geometry analysis example for absolute and relative transmission loss in a shallow ocean. 浅海绝对和相对传输损耗的信息几何分析示例。
IF 1.2 Q3 ACOUSTICS Pub Date : 2024-07-01 DOI: 10.1121/10.0026449
Jay C Spendlove, Tracianne B Neilsen, Mark K Transtrum

The model manifold, an information geometry tool, is a geometric representation of a model that can quantify the expected information content of modeling parameters. For a normal-mode sound propagation model in a shallow ocean environment, transmission loss (TL) is calculated for a vertical line array and model manifolds are constructed for both absolute and relative TL. For the example presented in this paper, relative TL yields more compact model manifolds with seabed environments that are less statistically distinguishable than manifolds of absolute TL. This example illustrates how model manifolds can be used to improve experimental design for inverse problems.

模型流形是一种信息几何工具,是模型的几何表示,可以量化建模参数的预期信息含量。对于浅海环境中的常模声传播模型,要计算垂直线阵列的传输损耗(TL),并构建绝对和相对 TL 的模型流形。在本文介绍的例子中,相对 TL 得到的模型流形更紧凑,海底环境的统计区分度低于绝对 TL 的流形。这个例子说明了如何利用模型流形来改进反演问题的实验设计。
{"title":"Information geometry analysis example for absolute and relative transmission loss in a shallow ocean.","authors":"Jay C Spendlove, Tracianne B Neilsen, Mark K Transtrum","doi":"10.1121/10.0026449","DOIUrl":"https://doi.org/10.1121/10.0026449","url":null,"abstract":"<p><p>The model manifold, an information geometry tool, is a geometric representation of a model that can quantify the expected information content of modeling parameters. For a normal-mode sound propagation model in a shallow ocean environment, transmission loss (TL) is calculated for a vertical line array and model manifolds are constructed for both absolute and relative TL. For the example presented in this paper, relative TL yields more compact model manifolds with seabed environments that are less statistically distinguishable than manifolds of absolute TL. This example illustrates how model manifolds can be used to improve experimental design for inverse problems.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"4 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141473265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A blueprint for truncation resonance placement in elastic diatomic lattices with unit cell asymmetrya). 具有单元格不对称的弹性二原子晶格中的截断共振位置蓝图a)。
IF 1.2 Q3 ACOUSTICS Pub Date : 2024-07-01 DOI: 10.1121/10.0027939
Hasan B Al Ba'ba'a, Hosam Yousef, Mostafa Nouh

Elastic periodic lattices act as mechanical filters of incident vibrations. By and large, they forbid wave propagation within bandgaps and resonate outside them. However, they often encounter "truncation resonances" (TRs) inside bandgaps when certain conditions are met. In this study, we show that the extent of unit cell asymmetry, its mass and stiffness contrasts, and the boundary conditions all play a role in the TR location and wave profile. The work is experimentally supported via two examples that validate the methodology, and a set of design charts is provided as a blueprint for selective TR placement in diatomic lattices.

弹性周期晶格是入射振动的机械过滤器。总体而言,它们禁止波在带隙内传播,并在带隙外产生共鸣。然而,当满足某些条件时,它们经常会在带隙内遇到 "截断共振"(TRs)。在这项研究中,我们证明了单位晶胞的不对称程度、其质量和刚度对比以及边界条件都会对 TR 位置和波形产生影响。我们通过两个实例对这一方法进行了实验验证,并提供了一套设计图表,作为在二原子晶格中选择性放置 TR 的蓝图。
{"title":"A blueprint for truncation resonance placement in elastic diatomic lattices with unit cell asymmetrya).","authors":"Hasan B Al Ba'ba'a, Hosam Yousef, Mostafa Nouh","doi":"10.1121/10.0027939","DOIUrl":"https://doi.org/10.1121/10.0027939","url":null,"abstract":"<p><p>Elastic periodic lattices act as mechanical filters of incident vibrations. By and large, they forbid wave propagation within bandgaps and resonate outside them. However, they often encounter \"truncation resonances\" (TRs) inside bandgaps when certain conditions are met. In this study, we show that the extent of unit cell asymmetry, its mass and stiffness contrasts, and the boundary conditions all play a role in the TR location and wave profile. The work is experimentally supported via two examples that validate the methodology, and a set of design charts is provided as a blueprint for selective TR placement in diatomic lattices.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"4 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141735897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Age and masking effects on acoustic cues for vowel categorizationa). 年龄和掩蔽对元音分类声学线索的影响a)。
IF 1.2 Q3 ACOUSTICS Pub Date : 2024-06-01 DOI: 10.1121/10.0026371
Mishaela DiNino

Age-related changes in auditory processing may reduce physiological coding of acoustic cues, contributing to older adults' difficulty perceiving speech in background noise. This study investigated whether older adults differed from young adults in patterns of acoustic cue weighting for categorizing vowels in quiet and in noise. All participants relied primarily on spectral quality to categorize /ɛ/ and /æ/ sounds under both listening conditions. However, relative to young adults, older adults exhibited greater reliance on duration and less reliance on spectral quality. These results suggest that aging alters patterns of perceptual cue weights that may influence speech recognition abilities.

听觉处理过程中与年龄有关的变化可能会减少对声音线索的生理编码,从而导致老年人难以感知背景噪声中的语音。本研究调查了老年人在对安静和噪音中的元音进行分类时,其声学线索加权模式是否与年轻人不同。在这两种听力条件下,所有参与者都主要依靠频谱质量来对/ɛ/和/æ/进行分类。然而,与年轻人相比,老年人对持续时间的依赖程度更高,对频谱质量的依赖程度更低。这些结果表明,衰老会改变知觉线索权重的模式,从而影响语音识别能力。
{"title":"Age and masking effects on acoustic cues for vowel categorizationa).","authors":"Mishaela DiNino","doi":"10.1121/10.0026371","DOIUrl":"10.1121/10.0026371","url":null,"abstract":"<p><p>Age-related changes in auditory processing may reduce physiological coding of acoustic cues, contributing to older adults' difficulty perceiving speech in background noise. This study investigated whether older adults differed from young adults in patterns of acoustic cue weighting for categorizing vowels in quiet and in noise. All participants relied primarily on spectral quality to categorize /ɛ/ and /æ/ sounds under both listening conditions. However, relative to young adults, older adults exhibited greater reliance on duration and less reliance on spectral quality. These results suggest that aging alters patterns of perceptual cue weights that may influence speech recognition abilities.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"4 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141332651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Who is singing? Voice recognition from spoken versus sung speech. 谁在唱歌?语音识别口语和唱歌。
IF 1.2 Q3 ACOUSTICS Pub Date : 2024-06-01 DOI: 10.1121/10.0026385
Angela Cooper, Matthew Eitel, Natalie Fecher, Elizabeth Johnson, Laura K Cirelli

Singing is socially important but constrains voice acoustics, potentially masking certain aspects of vocal identity. Little is known about how well listeners extract talker details from sung speech or identify talkers across the sung and spoken modalities. Here, listeners (n = 149) were trained to recognize sung or spoken voices and then tested on their identification of these voices in both modalities. Learning vocal identities was initially easier through speech than song. At test, cross-modality voice recognition was above chance, but weaker than within-modality recognition. We conclude that talker information is accessible in sung speech, despite acoustic constraints in song.

唱歌具有重要的社交意义,但却限制了嗓音声学,可能会掩盖声音特征的某些方面。关于听者如何从唱歌语音中提取说话者的细节,或如何在唱歌和说话两种模式中识别说话者,人们知之甚少。在这里,听者(n = 149)接受了识别歌唱或口语声音的训练,然后测试他们在两种模式下对这些声音的识别能力。最初,通过语音学习声音识别比通过歌曲学习更容易。在测试中,跨模态声音识别率高于偶然识别率,但低于模态内识别率。我们的结论是,尽管在歌曲中存在声学限制,但在歌唱语音中可以获得说话者的信息。
{"title":"Who is singing? Voice recognition from spoken versus sung speech.","authors":"Angela Cooper, Matthew Eitel, Natalie Fecher, Elizabeth Johnson, Laura K Cirelli","doi":"10.1121/10.0026385","DOIUrl":"10.1121/10.0026385","url":null,"abstract":"<p><p>Singing is socially important but constrains voice acoustics, potentially masking certain aspects of vocal identity. Little is known about how well listeners extract talker details from sung speech or identify talkers across the sung and spoken modalities. Here, listeners (n = 149) were trained to recognize sung or spoken voices and then tested on their identification of these voices in both modalities. Learning vocal identities was initially easier through speech than song. At test, cross-modality voice recognition was above chance, but weaker than within-modality recognition. We conclude that talker information is accessible in sung speech, despite acoustic constraints in song.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"4 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141422096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The perception and production of Korean stops in second dialect acquisition. 第二方言习得中韩语停顿的感知和产生。
IF 1.2 Q3 ACOUSTICS Pub Date : 2024-06-01 DOI: 10.1121/10.0026374
Hyunjung Lee, Eun Jong Kong, Jeffrey J Holliday

This study investigated the acoustic cue weighting of the Korean stop contrast in the perception and production of speakers who moved from a nonstandard dialect region to the standard dialect region, Seoul. Through comparing these mobile speakers with data from nonmobile speakers in Seoul and their home region, it was found that the speakers shifted their cue weighting in perception and production to some degree, but also retained some subphonemic features of their home dialect in production. The implications of these results for the role of dialect prestige and awareness in second dialect acquisition are discussed.

本研究调查了从非标准方言区移居到标准方言区(首尔)的说话者在感知和发音中韩语停止对比的声学线索权重。通过将这些流动说话者的数据与首尔及其家乡地区的非流动说话者的数据进行比较,发现这些说话者在一定程度上改变了其感知和发声中的线索权重,但在发声中也保留了家乡方言的一些次音位特征。本文讨论了这些结果对方言声望和意识在第二方言习得中的作用的影响。
{"title":"The perception and production of Korean stops in second dialect acquisition.","authors":"Hyunjung Lee, Eun Jong Kong, Jeffrey J Holliday","doi":"10.1121/10.0026374","DOIUrl":"10.1121/10.0026374","url":null,"abstract":"<p><p>This study investigated the acoustic cue weighting of the Korean stop contrast in the perception and production of speakers who moved from a nonstandard dialect region to the standard dialect region, Seoul. Through comparing these mobile speakers with data from nonmobile speakers in Seoul and their home region, it was found that the speakers shifted their cue weighting in perception and production to some degree, but also retained some subphonemic features of their home dialect in production. The implications of these results for the role of dialect prestige and awareness in second dialect acquisition are discussed.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"4 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141312448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of phonation types in singing voice using wavelet scattering network-based features. 利用基于小波散射网络的特征对歌声中的发音类型进行分类。
Q3 ACOUSTICS Pub Date : 2024-06-01 DOI: 10.1121/10.0026241
Kiran Reddy Mittapalle, Paavo Alku

The automatic classification of phonation types in singing voice is essential for tasks such as identification of singing style. In this study, it is proposed to use wavelet scattering network (WSN)-based features for classification of phonation types in singing voice. WSN, which has a close similarity with auditory physiological models, generates acoustic features that greatly characterize the information related to pitch, formants, and timbre. Hence, the WSN-based features can effectively capture the discriminative information across phonation types in singing voice. The experimental results show that the proposed WSN-based features improved phonation classification accuracy by at least 9% compared to state-of-the-art features.

对歌声中的发音类型进行自动分类对于识别歌声风格等任务至关重要。本研究建议使用基于小波散射网络(WSN)的特征对歌唱声音的发音类型进行分类。小波散射网络(WSN)与听觉生理模型十分相似,它生成的声学特征能极大地表征与音高、声母和音色相关的信息。因此,基于 WSN 的特征能有效捕捉歌声中不同发音类型的辨别信息。实验结果表明,与最先进的特征相比,基于 WSN 特征的语音分类准确率至少提高了 9%。
{"title":"Classification of phonation types in singing voice using wavelet scattering network-based features.","authors":"Kiran Reddy Mittapalle, Paavo Alku","doi":"10.1121/10.0026241","DOIUrl":"https://doi.org/10.1121/10.0026241","url":null,"abstract":"<p><p>The automatic classification of phonation types in singing voice is essential for tasks such as identification of singing style. In this study, it is proposed to use wavelet scattering network (WSN)-based features for classification of phonation types in singing voice. WSN, which has a close similarity with auditory physiological models, generates acoustic features that greatly characterize the information related to pitch, formants, and timbre. Hence, the WSN-based features can effectively capture the discriminative information across phonation types in singing voice. The experimental results show that the proposed WSN-based features improved phonation classification accuracy by at least 9% compared to state-of-the-art features.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"4 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141285490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JASA express letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1