四种语言-音乐区分分类器的比较:哥斯达黎加无线电广播的第一个案例研究

IF 0.1 Q4 MULTIDISCIPLINARY SCIENCES Tecnologia en Marcha Pub Date : 2022-11-16 DOI:10.18845/tm.v35i8.6463
Joseline Sánchez-Solís, Marvin Coto-Jiménez
{"title":"四种语言-音乐区分分类器的比较:哥斯达黎加无线电广播的第一个案例研究","authors":"Joseline Sánchez-Solís, Marvin Coto-Jiménez","doi":"10.18845/tm.v35i8.6463","DOIUrl":null,"url":null,"abstract":"During the past decades, a vast amount of audio data has be- come available in most languages and regions of the world. The efficient organization and manipulation of this data are important for tasks such as data classification, searching for information, diarization among many others, but also can be relevant for building corpora for training models for automatic speech recognition or building speech synthesis systems. Several of those tasks require extensive testing and data for specific languages and accents, especially when the development of communication systems with machines is a goal. In this work, we explore the application of several classifiers for the task of discriminating speech and music in Costa Rican radio broadcast. This discrimination is a first task in the exploration of a large corpus, to determine whether or not the available information is useful for particular research areas. The main contribution of this exploratory work is the general procedure and selection of algorithms for the Costa Rican radio corpus, which can lead to the extensive use of this source of data in many own applications and systems.","PeriodicalId":42957,"journal":{"name":"Tecnologia en Marcha","volume":null,"pages":null},"PeriodicalIF":0.1000,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparison of four classifiers for speech-music discrimination: a first case study for costa rican radio broadcasting\",\"authors\":\"Joseline Sánchez-Solís, Marvin Coto-Jiménez\",\"doi\":\"10.18845/tm.v35i8.6463\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"During the past decades, a vast amount of audio data has be- come available in most languages and regions of the world. The efficient organization and manipulation of this data are important for tasks such as data classification, searching for information, diarization among many others, but also can be relevant for building corpora for training models for automatic speech recognition or building speech synthesis systems. Several of those tasks require extensive testing and data for specific languages and accents, especially when the development of communication systems with machines is a goal. In this work, we explore the application of several classifiers for the task of discriminating speech and music in Costa Rican radio broadcast. This discrimination is a first task in the exploration of a large corpus, to determine whether or not the available information is useful for particular research areas. The main contribution of this exploratory work is the general procedure and selection of algorithms for the Costa Rican radio corpus, which can lead to the extensive use of this source of data in many own applications and systems.\",\"PeriodicalId\":42957,\"journal\":{\"name\":\"Tecnologia en Marcha\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.1000,\"publicationDate\":\"2022-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tecnologia en Marcha\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18845/tm.v35i8.6463\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tecnologia en Marcha","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18845/tm.v35i8.6463","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

在过去的几十年里,大量的音频数据以世界上大多数语言和地区的形式出现。有效地组织和操作这些数据对于数据分类、搜索信息、分类等任务非常重要,但也可以用于构建用于自动语音识别或构建语音合成系统的训练模型的语料库。其中一些任务需要针对特定语言和口音进行广泛的测试和数据,特别是当开发机器通信系统是一个目标时。在这项工作中,我们探索了几种分类器在哥斯达黎加广播中区分语音和音乐的应用。这种区分是探索大型语料库的首要任务,以确定可用信息是否对特定研究领域有用。这项探索性工作的主要贡献是哥斯达黎加无线电语料库的一般程序和算法选择,这可以导致在许多自己的应用程序和系统中广泛使用这一数据来源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Comparison of four classifiers for speech-music discrimination: a first case study for costa rican radio broadcasting
During the past decades, a vast amount of audio data has be- come available in most languages and regions of the world. The efficient organization and manipulation of this data are important for tasks such as data classification, searching for information, diarization among many others, but also can be relevant for building corpora for training models for automatic speech recognition or building speech synthesis systems. Several of those tasks require extensive testing and data for specific languages and accents, especially when the development of communication systems with machines is a goal. In this work, we explore the application of several classifiers for the task of discriminating speech and music in Costa Rican radio broadcast. This discrimination is a first task in the exploration of a large corpus, to determine whether or not the available information is useful for particular research areas. The main contribution of this exploratory work is the general procedure and selection of algorithms for the Costa Rican radio corpus, which can lead to the extensive use of this source of data in many own applications and systems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Tecnologia en Marcha
Tecnologia en Marcha MULTIDISCIPLINARY SCIENCES-
自引率
0.00%
发文量
93
审稿时长
28 weeks
期刊最新文献
Gestión de Residuos en proyectos de construcción de viviendas en Costa Rica: teoría versus práctica Enseñanza del ordenamiento territorial como herramienta en la gestión de proyectos de obra pública Transferencia de conocimiento desde las universidades a las empresas Virtualidad en la enseñanza de investigación en la maestría en Gerencia de Proyectos del Tecnológico de Costa Rica Impacto de la metodología BIM en la gestión de proyectos de construcción
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1