四种语言-音乐区分分类器的比较:哥斯达黎加无线电广播的第一个案例研究

IF 0.1 Q4 MULTIDISCIPLINARY SCIENCES Tecnologia en Marcha Pub Date : 2022-11-16 DOI:10.18845/tm.v35i8.6463

Joseline Sánchez-Solís, Marvin Coto-Jiménez

{"title":"四种语言-音乐区分分类器的比较:哥斯达黎加无线电广播的第一个案例研究","authors":"Joseline Sánchez-Solís, Marvin Coto-Jiménez","doi":"10.18845/tm.v35i8.6463","DOIUrl":null,"url":null,"abstract":"During the past decades, a vast amount of audio data has be- come available in most languages and regions of the world. The efficient organization and manipulation of this data are important for tasks such as data classification, searching for information, diarization among many others, but also can be relevant for building corpora for training models for automatic speech recognition or building speech synthesis systems. Several of those tasks require extensive testing and data for specific languages and accents, especially when the development of communication systems with machines is a goal. In this work, we explore the application of several classifiers for the task of discriminating speech and music in Costa Rican radio broadcast. This discrimination is a first task in the exploration of a large corpus, to determine whether or not the available information is useful for particular research areas. The main contribution of this exploratory work is the general procedure and selection of algorithms for the Costa Rican radio corpus, which can lead to the extensive use of this source of data in many own applications and systems.","PeriodicalId":42957,"journal":{"name":"Tecnologia en Marcha","volume":"122 1","pages":""},"PeriodicalIF":0.1000,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparison of four classifiers for speech-music discrimination: a first case study for costa rican radio broadcasting\",\"authors\":\"Joseline Sánchez-Solís, Marvin Coto-Jiménez\",\"doi\":\"10.18845/tm.v35i8.6463\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"During the past decades, a vast amount of audio data has be- come available in most languages and regions of the world. The efficient organization and manipulation of this data are important for tasks such as data classification, searching for information, diarization among many others, but also can be relevant for building corpora for training models for automatic speech recognition or building speech synthesis systems. Several of those tasks require extensive testing and data for specific languages and accents, especially when the development of communication systems with machines is a goal. In this work, we explore the application of several classifiers for the task of discriminating speech and music in Costa Rican radio broadcast. This discrimination is a first task in the exploration of a large corpus, to determine whether or not the available information is useful for particular research areas. The main contribution of this exploratory work is the general procedure and selection of algorithms for the Costa Rican radio corpus, which can lead to the extensive use of this source of data in many own applications and systems.\",\"PeriodicalId\":42957,\"journal\":{\"name\":\"Tecnologia en Marcha\",\"volume\":\"122 1\",\"pages\":\"\"},\"PeriodicalIF\":0.1000,\"publicationDate\":\"2022-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tecnologia en Marcha\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18845/tm.v35i8.6463\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tecnologia en Marcha","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18845/tm.v35i8.6463","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

在过去的几十年里，大量的音频数据以世界上大多数语言和地区的形式出现。有效地组织和操作这些数据对于数据分类、搜索信息、分类等任务非常重要，但也可以用于构建用于自动语音识别或构建语音合成系统的训练模型的语料库。其中一些任务需要针对特定语言和口音进行广泛的测试和数据，特别是当开发机器通信系统是一个目标时。在这项工作中，我们探索了几种分类器在哥斯达黎加广播中区分语音和音乐的应用。这种区分是探索大型语料库的首要任务，以确定可用信息是否对特定研究领域有用。这项探索性工作的主要贡献是哥斯达黎加无线电语料库的一般程序和算法选择，这可以导致在许多自己的应用程序和系统中广泛使用这一数据来源。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Comparison of four classifiers for speech-music discrimination: a first case study for costa rican radio broadcasting

During the past decades, a vast amount of audio data has be- come available in most languages and regions of the world. The efficient organization and manipulation of this data are important for tasks such as data classification, searching for information, diarization among many others, but also can be relevant for building corpora for training models for automatic speech recognition or building speech synthesis systems. Several of those tasks require extensive testing and data for specific languages and accents, especially when the development of communication systems with machines is a goal. In this work, we explore the application of several classifiers for the task of discriminating speech and music in Costa Rican radio broadcast. This discrimination is a first task in the exploration of a large corpus, to determine whether or not the available information is useful for particular research areas. The main contribution of this exploratory work is the general procedure and selection of algorithms for the Costa Rican radio corpus, which can lead to the extensive use of this source of data in many own applications and systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Tecnologia en Marcha MULTIDISCIPLINARY SCIENCES-

自引率

0.00%

发文量

审稿时长

28 weeks