SYSTEM OF AUTOMATIC SEGMENTATION OF PAUSES IN PHONOGRAMS ON THE BASIS OF NEURON NETWORKS OF THE DEEP LEARNING

Viktor I. Soloviev, O. Rybalsky, V. Zhuravel
{"title":"SYSTEM OF AUTOMATIC SEGMENTATION OF PAUSES IN PHONOGRAMS ON THE BASIS OF NEURON NETWORKS OF THE DEEP LEARNING","authors":"Viktor I. Soloviev, O. Rybalsky, V. Zhuravel","doi":"10.34229/0572-2691-2021-1-8","DOIUrl":null,"url":null,"abstract":"The use of neuron networks of the deep learning for the construction of tool for realization of examinations of materials and apparatus of the digital audio recording allows to solve the «frigging» problem of such examination — problem of exposure of tracks of editing in digital phonograms. These networks provide high probability of exposure of such tracks in the pauses of speech information writtenin on a phonogram. Before man-hunting of tracks of editing in the investigated phonogram it is necessary to distinguish pauses (to perform its segmentation), and tool built on the basis of neuron networks of the deep learning, requires its work to be done in automatic mode. The basic requirement of automatic segmentation is high efficiency of selection of pauses in the conditions of permanent change of level of noises in phonograms. It is determined by probability of errors of І and ІІ kinds. It is offered on the basis of neuron networks of the deep learning to create CAS of segmentation of phonograms, possessing high efficiency of selection of pauses in speech information. Thus the system must be independent of level of noises in every concrete pause, and also language, context and announcer, whose speech is fixed in a phonogram. It is suggested to examine pauses as one of the types of voice information, which characteristics differ from characteristics of speech information fixed in a phonogram. For educating of such network it was required to create the primary base of these sounds and pauses. On its basis three arrays of the data, intended for learning, testing and determination of the crooked errors of І and ІІ kinds, are created. After learning and testing the system passed verification on the real phonograms. As a result taking into account some features of speech on the neuron networks of deep learning there has been built the system providing effective segmentation of pauses in phonograms in the automatics mode. The obtained results suit examination that is conformed by given curves over of errors of І and ІІ kinds.","PeriodicalId":54874,"journal":{"name":"Journal of Automation and Information Sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Automation and Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34229/0572-2691-2021-1-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0

Abstract

The use of neuron networks of the deep learning for the construction of tool for realization of examinations of materials and apparatus of the digital audio recording allows to solve the «frigging» problem of such examination — problem of exposure of tracks of editing in digital phonograms. These networks provide high probability of exposure of such tracks in the pauses of speech information writtenin on a phonogram. Before man-hunting of tracks of editing in the investigated phonogram it is necessary to distinguish pauses (to perform its segmentation), and tool built on the basis of neuron networks of the deep learning, requires its work to be done in automatic mode. The basic requirement of automatic segmentation is high efficiency of selection of pauses in the conditions of permanent change of level of noises in phonograms. It is determined by probability of errors of І and ІІ kinds. It is offered on the basis of neuron networks of the deep learning to create CAS of segmentation of phonograms, possessing high efficiency of selection of pauses in speech information. Thus the system must be independent of level of noises in every concrete pause, and also language, context and announcer, whose speech is fixed in a phonogram. It is suggested to examine pauses as one of the types of voice information, which characteristics differ from characteristics of speech information fixed in a phonogram. For educating of such network it was required to create the primary base of these sounds and pauses. On its basis three arrays of the data, intended for learning, testing and determination of the crooked errors of І and ІІ kinds, are created. After learning and testing the system passed verification on the real phonograms. As a result taking into account some features of speech on the neuron networks of deep learning there has been built the system providing effective segmentation of pauses in phonograms in the automatics mode. The obtained results suit examination that is conformed by given curves over of errors of І and ІІ kinds.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度学习神经元网络的留声图停顿自动分割系统
利用深度学习的神经元网络构建工具来实现对数字录音材料和设备的检查,可以解决这种检查的“恼人”问题-数字录音中编辑轨迹的暴露问题。这些网络提供了在写在音图上的语音信息的停顿中暴露这些轨迹的高概率。在对被调查的音图进行编辑音轨的人工搜索之前,需要对停顿进行区分(进行分割),而基于深度学习的神经元网络构建的工具要求其工作在自动模式下完成。自动分割的基本要求是在留声机噪声水平不断变化的情况下,能够高效地选择停顿。它是由І和ІІ类的误差概率决定的。在深度学习神经元网络的基础上,提出了一种基于语音图分割的CAS,具有语音信息中停顿选择的高效率。因此,该系统必须独立于每个具体停顿中的噪声水平,以及语言、上下文和播音员(其讲话固定在音图中)。建议将停顿作为语音信息的一种,其特征不同于语音图中固定的语音信息特征。为了教育这种网络,需要创造这些声音和停顿的基本基础。在此基础上,创建了三个数据数组,用于学习、测试和确定І和ІІ类型的弯曲误差。经过学习和测试,该系统通过了在真实录音机上的验证。因此,考虑到深度学习神经元网络中语音的一些特征,建立了在自动模式下对留声图中的停顿进行有效分割的系统。所得结果符合І和ІІ两种误差曲线的检验结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Automation and Information Sciences
Journal of Automation and Information Sciences AUTOMATION & CONTROL SYSTEMS-
自引率
0.00%
发文量
0
审稿时长
6-12 weeks
期刊介绍: This journal contains translations of papers from the Russian-language bimonthly "Mezhdunarodnyi nauchno-tekhnicheskiy zhurnal "Problemy upravleniya i informatiki". Subjects covered include information sciences such as pattern recognition, forecasting, identification and evaluation of complex systems, information security, fault diagnosis and reliability. In addition, the journal also deals with such automation subjects as adaptive, stochastic and optimal control, control and identification under uncertainty, robotics, and applications of user-friendly computers in management of economic, industrial, biological, and medical systems. The Journal of Automation and Information Sciences will appeal to professionals in control systems, communications, computers, engineering in biology and medicine, instrumentation and measurement, and those interested in the social implications of technology.
期刊最新文献
Validity and Reliability Study of Turkish Version of Clinical Assessment Interview for Negative Symptoms (CAINS). OPTIMIZATION OF THE TRAJECTORY OF SENSORS MOTION TAKING INTO ACCOUNT THE IMPORTANCE OF THE AREAS OF THE MONITORING AREA SEGMENTS AND THE PROBABILITY OF DETECTION OF OBJECTS ALGORITHMIC AND HARDWARE TOOLS FOR MOVING TARGETS DETECTION ON THE PROJECTION SCREEN FROM THE LASER EMITTER OF THE MULTIMEDIA TRAINER ON DEGENARATE CASE OF THREE-DIMENTIONAL PROBLEM OF THICK ELASTIC PLATESʼ DYNAMICS VALIDATION OF LAND DEGRADATION CARDS ON THE BASIS OF GEOSPATIAL DATA
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1