用于视频和音频数据分类的卷积网络

Marcel Nikmon, Roman Budjac, Daniel Kuchár, Peter Schreiber, D. Janácová
{"title":"用于视频和音频数据分类的卷积网络","authors":"Marcel Nikmon, Roman Budjac, Daniel Kuchár, Peter Schreiber, D. Janácová","doi":"10.2478/rput-2019-0034","DOIUrl":null,"url":null,"abstract":"Abstract Deep learning is a kind of machine learning, and machine learning is a kind of artificial intelligence. Machine learning depicts groups of various technologies, and deep learning is one of them. The use of deep learning is an integral part of the current data classification practice in today’s world. This paper introduces the possibilities of classification using convolutional networks. Experiments focused on audio and video data show different approaches to data classification. Most experiments use the well-known pre-trained AlexNet network with various pre-processing types of input data. However, there are also comparisons of other neural network architectures, and we also show the results of training on small and larger datasets. The paper comprises description of eight different kinds of experiments. Several training sessions were conducted in each experiment with different aspects that were monitored. The focus was put on the effect of batch size on the accuracy of deep learning, including many other parameters that affect deep learning [1].","PeriodicalId":21013,"journal":{"name":"Research Papers Faculty of Materials Science and Technology Slovak University of Technology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Convolutional Networks Used to Classify Video and Audio Data\",\"authors\":\"Marcel Nikmon, Roman Budjac, Daniel Kuchár, Peter Schreiber, D. Janácová\",\"doi\":\"10.2478/rput-2019-0034\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Deep learning is a kind of machine learning, and machine learning is a kind of artificial intelligence. Machine learning depicts groups of various technologies, and deep learning is one of them. The use of deep learning is an integral part of the current data classification practice in today’s world. This paper introduces the possibilities of classification using convolutional networks. Experiments focused on audio and video data show different approaches to data classification. Most experiments use the well-known pre-trained AlexNet network with various pre-processing types of input data. However, there are also comparisons of other neural network architectures, and we also show the results of training on small and larger datasets. The paper comprises description of eight different kinds of experiments. Several training sessions were conducted in each experiment with different aspects that were monitored. The focus was put on the effect of batch size on the accuracy of deep learning, including many other parameters that affect deep learning [1].\",\"PeriodicalId\":21013,\"journal\":{\"name\":\"Research Papers Faculty of Materials Science and Technology Slovak University of Technology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research Papers Faculty of Materials Science and Technology Slovak University of Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2478/rput-2019-0034\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research Papers Faculty of Materials Science and Technology Slovak University of Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/rput-2019-0034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

深度学习是机器学习的一种,机器学习是人工智能的一种。机器学习描述了各种技术的组合,深度学习就是其中之一。深度学习的使用是当今世界当前数据分类实践的一个组成部分。本文介绍了使用卷积网络进行分类的可能性。针对音频和视频数据的实验显示了不同的数据分类方法。大多数实验使用众所周知的预训练AlexNet网络和各种预处理类型的输入数据。然而,也有其他神经网络架构的比较,我们也展示了小数据集和大数据集的训练结果。本文包括八种不同实验的描述。在每个实验中都进行了几次培训,对不同方面进行了监测。重点放在批大小对深度学习准确性的影响上,包括影响深度学习的许多其他参数[1]。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Convolutional Networks Used to Classify Video and Audio Data
Abstract Deep learning is a kind of machine learning, and machine learning is a kind of artificial intelligence. Machine learning depicts groups of various technologies, and deep learning is one of them. The use of deep learning is an integral part of the current data classification practice in today’s world. This paper introduces the possibilities of classification using convolutional networks. Experiments focused on audio and video data show different approaches to data classification. Most experiments use the well-known pre-trained AlexNet network with various pre-processing types of input data. However, there are also comparisons of other neural network architectures, and we also show the results of training on small and larger datasets. The paper comprises description of eight different kinds of experiments. Several training sessions were conducted in each experiment with different aspects that were monitored. The focus was put on the effect of batch size on the accuracy of deep learning, including many other parameters that affect deep learning [1].
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Analysis of the Development of Postal Services in European Countries, with a Special Focus on Serbia and Slovakia The Impact of the Covid-19 Pandemic on Human Resource Management Priorities Gender Equality Perception in Industrial Enterprises Under the Conditions of Industry 4.0 Sustainability Reporting and Earnings Management of Listed Non-Financial Firms in Nigeria Agile Manufacturing vs. Lean Manufacturing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1