首页 > 最新文献

2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)最新文献

英文 中文
Efficient Quantization Techniques for Deep Neural Networks 深度神经网络的有效量化技术
Pub Date : 2021-11-01 DOI: 10.1109/CONF-SPML54095.2021.00059
Chutian Jiang
As model prediction becomes more and more accurate and the network becomes deeper and deeper, the amount of memory consumed by the neural network becomes a problem, especially on mobile devices. It is also very difficult to balance the tradeoff between computational cost and battery life, which makes mobile devices very hard as well to become smarter. Model quantification techniques provide the opportunity to tackle this tradeoff by reducing the memory bandwidth and storage and improving the system throughput and latency. This paper discusses and compares the state-of-the-art methods of neural network quantification methodologies including Post Training Quantization (PTQ) and Quantization Aware Training (QAT). PTQ directly quantizes the trained floating-point model. The implementation process is simple and does not require quantization during the training phase. QAT requires us to use simulated quantization operations to model the effect of the quantization, and forward and backward passes are usually performed in the floating-point model. Finally, as discussed in the experiments in this paper, we conclude that with the evolution of the quantization techniques, the accuracy gap between PTQ and QAT is shrinking.
随着模型预测越来越准确,网络越来越深入,神经网络消耗的内存量成为一个问题,特别是在移动设备上。在计算成本和电池寿命之间取得平衡也非常困难,这使得移动设备也很难变得更智能。模型量化技术提供了通过减少内存带宽和存储以及改进系统吞吐量和延迟来解决这种权衡的机会。本文讨论并比较了目前最先进的神经网络量化方法,包括训练后量化(PTQ)和量化感知训练(QAT)。PTQ直接量化训练的浮点模型。实施过程简单,在培训阶段不需要量化。QAT要求我们使用模拟量化操作来模拟量化的效果,并且通常在浮点模型中进行正向和向后传递。最后,根据本文的实验讨论,我们得出结论,随着量化技术的发展,PTQ和QAT之间的精度差距正在缩小。
{"title":"Efficient Quantization Techniques for Deep Neural Networks","authors":"Chutian Jiang","doi":"10.1109/CONF-SPML54095.2021.00059","DOIUrl":"https://doi.org/10.1109/CONF-SPML54095.2021.00059","url":null,"abstract":"As model prediction becomes more and more accurate and the network becomes deeper and deeper, the amount of memory consumed by the neural network becomes a problem, especially on mobile devices. It is also very difficult to balance the tradeoff between computational cost and battery life, which makes mobile devices very hard as well to become smarter. Model quantification techniques provide the opportunity to tackle this tradeoff by reducing the memory bandwidth and storage and improving the system throughput and latency. This paper discusses and compares the state-of-the-art methods of neural network quantification methodologies including Post Training Quantization (PTQ) and Quantization Aware Training (QAT). PTQ directly quantizes the trained floating-point model. The implementation process is simple and does not require quantization during the training phase. QAT requires us to use simulated quantization operations to model the effect of the quantization, and forward and backward passes are usually performed in the floating-point model. Finally, as discussed in the experiments in this paper, we conclude that with the evolution of the quantization techniques, the accuracy gap between PTQ and QAT is shrinking.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134005528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Training Method For VideoPose3D with Ideology of Action Recognition 基于动作识别思想的VideoPose3D训练方法
Pub Date : 2021-11-01 DOI: 10.1109/CONF-SPML54095.2021.00041
Hao Bai
Action recognition and pose estimation from videos are closely related to understand human motions, but more literature focuses on how to solve pose estimation tasks alone from action recognition. This research shows a faster and more flexible training method for VideoPose3D which is based on action recognition. This model is fed with the same type of action as the type that will be estimated, and different types of actions can be trained separately. Evidence has shown that, for common pose-estimation tasks, this model requires a relatively small amount of data to carry out similar results with the original research, and for action-oriented tasks, it outperforms the original research by 4.5% with a limited receptive field size and training epoch on Velocity Error of MPJPE. This model can handle both action-oriented and common pose-estimation problems.
视频中的动作识别和姿态估计与理解人体运动密切相关,但更多的文献关注如何从动作识别中单独解决姿态估计任务。本研究提出了一种基于动作识别的更快、更灵活的VideoPose3D训练方法。该模型被输入与将要估计的动作类型相同的动作类型,并且不同类型的动作可以单独训练。有证据表明,对于常见的姿态估计任务,该模型需要相对较少的数据量来实现与原始研究相似的结果,对于面向动作的任务,该模型在MPJPE的速度误差上的接受野大小和训练历元有限的情况下,其性能优于原始研究4.5%。这个模型既可以处理面向动作的问题,也可以处理常见的姿态估计问题。
{"title":"A Training Method For VideoPose3D with Ideology of Action Recognition","authors":"Hao Bai","doi":"10.1109/CONF-SPML54095.2021.00041","DOIUrl":"https://doi.org/10.1109/CONF-SPML54095.2021.00041","url":null,"abstract":"Action recognition and pose estimation from videos are closely related to understand human motions, but more literature focuses on how to solve pose estimation tasks alone from action recognition. This research shows a faster and more flexible training method for VideoPose3D which is based on action recognition. This model is fed with the same type of action as the type that will be estimated, and different types of actions can be trained separately. Evidence has shown that, for common pose-estimation tasks, this model requires a relatively small amount of data to carry out similar results with the original research, and for action-oriented tasks, it outperforms the original research by 4.5% with a limited receptive field size and training epoch on Velocity Error of MPJPE. This model can handle both action-oriented and common pose-estimation problems.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121858853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Q-Learning to Personalize Pedagogical Policies for Addition Problems 运用Q-Learning实现加法问题教学策略的个性化
Pub Date : 2021-11-01 DOI: 10.1109/CONF-SPML54095.2021.00043
Danyating Shen, Takara E. Truong, C. Weintz
The prevalence of COVID-19 has illuminated the need for practical digital education tools over the past year. With students studying from home, teachers have struggled to provide their students with adequately challenging coursework. Our project aims to solve this issue in the context of math. More specifically, our goal is to encourage thoughtful learning by supplying students with personalized two-number addition problems that take time to solve but expect the student to answer correctly still. Our solution is to model the process of selecting a math problem to give a student as a Markov Decision Process (MDP) and then use Q-learning to determine the best policy for arriving at the most optimally challenging two-number addition problem for that student. The project creates three student simulators based on group member data. We show that it took student one: $(162 pm 134)$ iterations to give appropriate level problems where the first entry is mean and the second is the standard deviation. Student two took $(230 pm 205)$ iterations, and student three took $(247 pm 236)$ iterations. Lastly, we demonstrate that pre-training our model on students two and three and testing on student one showed a significant improvement from $(162 pm 134)$ iterations to $(35 pm 44)$ iterations.
过去一年,COVID-19的流行凸显了对实用数字教育工具的需求。由于学生在家学习,老师们一直在努力为他们的学生提供足够有挑战性的课程。我们的项目旨在在数学的背景下解决这个问题。更具体地说,我们的目标是通过为学生提供个性化的两数加法问题来鼓励深思熟虑的学习,这些问题需要时间来解决,但希望学生仍然能正确回答。我们的解决方案是将选择数学问题的过程建模为马尔可夫决策过程(MDP),然后使用Q-learning来确定最佳策略,以达到对该学生最具挑战性的两数加法问题。该项目基于小组成员数据创建了三个学生模拟器。我们展示了学生1:$(162 pm 134)$迭代来给出适当级别的问题,其中第一个条目是平均值,第二个是标准差。学生2获得$(230 pm 205)$迭代,学生3获得$(247 pm 236)$迭代。最后,我们证明了在学生2和3上预训练我们的模型并在学生1上进行测试显示了从$(162 pm 134)$迭代到$(35 pm 44)$迭代的显着改进。
{"title":"Using Q-Learning to Personalize Pedagogical Policies for Addition Problems","authors":"Danyating Shen, Takara E. Truong, C. Weintz","doi":"10.1109/CONF-SPML54095.2021.00043","DOIUrl":"https://doi.org/10.1109/CONF-SPML54095.2021.00043","url":null,"abstract":"The prevalence of COVID-19 has illuminated the need for practical digital education tools over the past year. With students studying from home, teachers have struggled to provide their students with adequately challenging coursework. Our project aims to solve this issue in the context of math. More specifically, our goal is to encourage thoughtful learning by supplying students with personalized two-number addition problems that take time to solve but expect the student to answer correctly still. Our solution is to model the process of selecting a math problem to give a student as a Markov Decision Process (MDP) and then use Q-learning to determine the best policy for arriving at the most optimally challenging two-number addition problem for that student. The project creates three student simulators based on group member data. We show that it took student one: $(162 pm 134)$ iterations to give appropriate level problems where the first entry is mean and the second is the standard deviation. Student two took $(230 pm 205)$ iterations, and student three took $(247 pm 236)$ iterations. Lastly, we demonstrate that pre-training our model on students two and three and testing on student one showed a significant improvement from $(162 pm 134)$ iterations to $(35 pm 44)$ iterations.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"2010 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123923812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Application of Machine Learning Algorithms in Speech Emotion Recognition 机器学习算法在语音情感识别中的应用
Pub Date : 2021-11-01 DOI: 10.1109/CONF-SPML54095.2021.00031
Junyi Cao
Speech emotion recognition has been widely used in recent years and has become a heated topic for research. Focused on the convolutional neural network model using spectrograms as input, the CNN-LSTM model based on feature vectors, original speech signal and Log-mel spectrograms, the performance of different models is compared as well as analyzed. The study found that there are some common problems existing in the classification performance of the model. The features and algorithms currently used can effectively distinguish emotions with varied “arousal”, but it is difficult to identify the feelings with similar arousal, among the models. The CNN-LSTM model with Log-mel spectrograms as input achieved the highest accuracy.
语音情感识别近年来得到了广泛的应用,成为研究的热点。重点对以频谱图为输入的卷积神经网络模型、基于特征向量、原始语音信号和Log-mel频谱图的CNN-LSTM模型进行了性能比较和分析。研究发现,该模型的分类性能存在一些共性问题。目前使用的特征和算法可以有效区分具有不同“唤醒”的情绪,但难以在模型中识别具有相似唤醒的情绪。以Log-mel谱图为输入的CNN-LSTM模型获得了最高的精度。
{"title":"Application of Machine Learning Algorithms in Speech Emotion Recognition","authors":"Junyi Cao","doi":"10.1109/CONF-SPML54095.2021.00031","DOIUrl":"https://doi.org/10.1109/CONF-SPML54095.2021.00031","url":null,"abstract":"Speech emotion recognition has been widely used in recent years and has become a heated topic for research. Focused on the convolutional neural network model using spectrograms as input, the CNN-LSTM model based on feature vectors, original speech signal and Log-mel spectrograms, the performance of different models is compared as well as analyzed. The study found that there are some common problems existing in the classification performance of the model. The features and algorithms currently used can effectively distinguish emotions with varied “arousal”, but it is difficult to identify the feelings with similar arousal, among the models. The CNN-LSTM model with Log-mel spectrograms as input achieved the highest accuracy.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"388 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132495963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integral Sliding Mode Trajectory Tracking Method for 1-DOP Manipulator Systems driven by Pneumatic Muscles on the Basis of the Nonlinear Extended State Observer 基于非线性扩展状态观测器的气动肌肉驱动1-DOP机械臂系统积分滑模轨迹跟踪方法
Pub Date : 2021-11-01 DOI: 10.1109/CONF-SPML54095.2021.00072
Yixin Liu
This paper aims to get good trajectory tracking performance for 1-DOP(degrees of freedom) manipulator system driven by pneumatic muscles. However, it is difficult for achieving wonderful trajectory tracking performance due to nonlinearity of the 1-DOP manipulator system. The integral sliding mode trajectory tracking method is shown on 1-DOP manipulator system within the paper. A nonlinear extended state observer is proposed for estimating the nonlinearity of 1-DOP manipulator system. Moreover, an integral sliding mode controller on the basis of nonlinear extended state observer is adopted for getting good trajectory tracking performance in 1-DOP manipulator system. Finally, results of the simulation show that good trajectory tracking performance is achieved by proposed integral sliding mode trajectory tracking method within the paper.
本文旨在为气动肌肉驱动的1-DOP(自由度)机械手系统提供良好的轨迹跟踪性能。然而,由于1-DOP机械臂系统的非线性,难以获得良好的轨迹跟踪性能。文中给出了1-DOP机械手系统的积分滑模轨迹跟踪方法。为了估计1-DOP机械臂系统的非线性,提出了一种非线性扩展状态观测器。为了使1-DOP机械臂系统具有良好的轨迹跟踪性能,采用了基于非线性扩展状态观测器的积分滑模控制器。最后,仿真结果表明,本文提出的积分滑模轨迹跟踪方法具有良好的轨迹跟踪性能。
{"title":"Integral Sliding Mode Trajectory Tracking Method for 1-DOP Manipulator Systems driven by Pneumatic Muscles on the Basis of the Nonlinear Extended State Observer","authors":"Yixin Liu","doi":"10.1109/CONF-SPML54095.2021.00072","DOIUrl":"https://doi.org/10.1109/CONF-SPML54095.2021.00072","url":null,"abstract":"This paper aims to get good trajectory tracking performance for 1-DOP(degrees of freedom) manipulator system driven by pneumatic muscles. However, it is difficult for achieving wonderful trajectory tracking performance due to nonlinearity of the 1-DOP manipulator system. The integral sliding mode trajectory tracking method is shown on 1-DOP manipulator system within the paper. A nonlinear extended state observer is proposed for estimating the nonlinearity of 1-DOP manipulator system. Moreover, an integral sliding mode controller on the basis of nonlinear extended state observer is adopted for getting good trajectory tracking performance in 1-DOP manipulator system. Finally, results of the simulation show that good trajectory tracking performance is achieved by proposed integral sliding mode trajectory tracking method within the paper.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116868265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Overview of Recommender Systems and Its Next Generation: Context-Aware Recommender Systems 推荐系统及其下一代概述:上下文感知推荐系统
Pub Date : 2021-11-01 DOI: 10.1109/CONF-SPML54095.2021.00015
Jiahao Liang
Recommender Systems have been generally utilized in different areas including motion pictures, news, music with an intend to give the most important recommendations to clients from an assortment of accessible alternatives. Recommender Systems are planned utilizing procedures from numerous fields, some of which are: AI, data recovery, information mining, direct variable based math and man-made consciousness. However, in typical commodity applications, due to the huge user and project library and just few evaluations (Sparsity issue), and at the point when the client is new to Recommender Frameworks, the framework can’t prescribe things that are applicable to clients in light of absence of past data about the client as well as the client thing rating history that assists with deciding the clients’ preferences (cold start). What’s more, presently there’s an innovation called Context-aware Recommender Systems (CARS), which utilizing setting information (location, time, peer, etc.) during the time spent proposal. In this work, we present an outline of some of noticeable customary RS and the high level CARS. We discuss the advantages and disadvantages of them. Furthermore, we reveal some inherent problems in RS. At last, we make a conclusion and give some challenges in current works.
推荐系统通常用于不同的领域,包括电影,新闻,音乐,旨在从各种可访问的替代品中向客户提供最重要的推荐。推荐系统是利用许多领域的程序来规划的,其中一些是:人工智能、数据恢复、信息挖掘、基于直接变量的数学和人工意识。然而,在典型的商品应用程序中,由于庞大的用户和项目库以及很少的评估(稀疏性问题),并且在客户端对推荐框架不熟悉的情况下,框架无法规定适用于客户端的东西,因为没有关于客户端的过去数据以及帮助决定客户偏好的客户端事物评级历史(冷启动)。更重要的是,目前有一种创新被称为上下文感知推荐系统(CARS),它利用设置信息(地点,时间,同伴等)在花费的时间建议。在这项工作中,我们提出了一些值得注意的习惯RS和高级car的大纲。我们讨论了它们的优点和缺点。在此基础上,揭示了RS研究中存在的一些问题。最后,对本文进行了总结,并提出了当前工作中存在的一些问题。
{"title":"An Overview of Recommender Systems and Its Next Generation: Context-Aware Recommender Systems","authors":"Jiahao Liang","doi":"10.1109/CONF-SPML54095.2021.00015","DOIUrl":"https://doi.org/10.1109/CONF-SPML54095.2021.00015","url":null,"abstract":"Recommender Systems have been generally utilized in different areas including motion pictures, news, music with an intend to give the most important recommendations to clients from an assortment of accessible alternatives. Recommender Systems are planned utilizing procedures from numerous fields, some of which are: AI, data recovery, information mining, direct variable based math and man-made consciousness. However, in typical commodity applications, due to the huge user and project library and just few evaluations (Sparsity issue), and at the point when the client is new to Recommender Frameworks, the framework can’t prescribe things that are applicable to clients in light of absence of past data about the client as well as the client thing rating history that assists with deciding the clients’ preferences (cold start). What’s more, presently there’s an innovation called Context-aware Recommender Systems (CARS), which utilizing setting information (location, time, peer, etc.) during the time spent proposal. In this work, we present an outline of some of noticeable customary RS and the high level CARS. We discuss the advantages and disadvantages of them. Furthermore, we reveal some inherent problems in RS. At last, we make a conclusion and give some challenges in current works.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115674781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Mechatronics: System Analysis Based on Software Simulation and Programming 机电一体化:基于软件仿真与编程的系统分析
Pub Date : 2021-11-01 DOI: 10.1109/CONF-SPML54095.2021.00022
Xuanang Chen
Mechatronics system design is more than just integrating electronic, software, and mechanical design, the additional features must be contingent on the ability of the mechatronic designer to optimize a design solution through these disparate fields. The basic idea of Mechatronics modeling is to divide the system into several subsystems according to the mathematical or physical characteristics of the system and the functions to be realized, and then use a variety of domain means to model according to the various links involved in these subsystems. This paper will introduce a common linear mechatronics modeling method based on a variety of simulation software.
机电一体化系统设计不仅仅是集成电子、软件和机械设计,附加功能必须取决于机电一体化设计师通过这些不同领域优化设计解决方案的能力。机电一体化建模的基本思想是根据系统的数学或物理特性和要实现的功能,将系统划分为若干个子系统,然后根据这些子系统所涉及的各个环节,利用各种领域手段进行建模。本文将介绍一种常用的基于各种仿真软件的线性机电一体化建模方法。
{"title":"Mechatronics: System Analysis Based on Software Simulation and Programming","authors":"Xuanang Chen","doi":"10.1109/CONF-SPML54095.2021.00022","DOIUrl":"https://doi.org/10.1109/CONF-SPML54095.2021.00022","url":null,"abstract":"Mechatronics system design is more than just integrating electronic, software, and mechanical design, the additional features must be contingent on the ability of the mechatronic designer to optimize a design solution through these disparate fields. The basic idea of Mechatronics modeling is to divide the system into several subsystems according to the mathematical or physical characteristics of the system and the functions to be realized, and then use a variety of domain means to model according to the various links involved in these subsystems. This paper will introduce a common linear mechatronics modeling method based on a variety of simulation software.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126083166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Neural Network Based Image and Video Denoising 基于神经网络的图像和视频去噪研究
Pub Date : 2021-11-01 DOI: 10.1109/CONF-SPML54095.2021.00023
Z. Luo
Noises are inevitable in images and videos. Many denoising algorithms have been proposed. As the acquirement of image and video qualities gets higher, algorithms with high performances and easy implementation are the new trend. With the development of deep learning, neural network has been applied in denoising algorithms. These methods represent better performances and obtain video of high quality. In this paper, we will discuss several denoising methods for both images and videos on their architectures, analyze their features and comment. Finally, we will carefully forecast what neural network based denoising modules can be built in the future study.
在图像和视频中,噪音是不可避免的。人们提出了许多去噪算法。随着对图像和视频质量的要求越来越高,高性能、易于实现的算法成为新的发展趋势。随着深度学习的发展,神经网络在去噪算法中得到了应用。这些方法具有较好的性能,可以获得高质量的视频。在本文中,我们将讨论图像和视频的几种去噪方法,分析它们的特征并进行评论。最后,我们将仔细预测在未来的研究中可以建立哪些基于神经网络的去噪模块。
{"title":"Research on Neural Network Based Image and Video Denoising","authors":"Z. Luo","doi":"10.1109/CONF-SPML54095.2021.00023","DOIUrl":"https://doi.org/10.1109/CONF-SPML54095.2021.00023","url":null,"abstract":"Noises are inevitable in images and videos. Many denoising algorithms have been proposed. As the acquirement of image and video qualities gets higher, algorithms with high performances and easy implementation are the new trend. With the development of deep learning, neural network has been applied in denoising algorithms. These methods represent better performances and obtain video of high quality. In this paper, we will discuss several denoising methods for both images and videos on their architectures, analyze their features and comment. Finally, we will carefully forecast what neural network based denoising modules can be built in the future study.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128291966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative Study of the Optimization of the Multi-prime RSA Algorithm 多素数RSA算法优化的比较研究
Pub Date : 2021-11-01 DOI: 10.1109/CONF-SPML54095.2021.00039
Ziyuan Ma
With the continuous development of computer technology, the amount of data shared on the Internet has increased significantly. Increasing demand for big data continues to grow, facing many security challenges. In today’s information era, data on the Internet is vulnerable to various attacks, and everyone wants to protect their privacy. Therefore, maintaining the security of user-class data has become the current research hotspot. This article integrates four methods, the traditional two-prime RSA, two-prime mixed CRT, and three-prime CRT-RSA and traditional tetraprime and optimized quatertraprime methods operate on the same message summaries to record their respective time-consuming behaviors. optimization is of great significance.
随着计算机技术的不断发展,互联网上共享的数据量显著增加。大数据需求持续增长,面临诸多安全挑战。在当今的信息时代,互联网上的数据容易受到各种攻击,每个人都希望保护自己的隐私。因此,维护用户类数据的安全已成为当前的研究热点。本文将传统的二素数RSA、二素数混合CRT、三素数CRT-RSA以及传统的四素数和优化的四素数方法在同一消息摘要上操作,记录各自的耗时行为。优化具有重要意义。
{"title":"Comparative Study of the Optimization of the Multi-prime RSA Algorithm","authors":"Ziyuan Ma","doi":"10.1109/CONF-SPML54095.2021.00039","DOIUrl":"https://doi.org/10.1109/CONF-SPML54095.2021.00039","url":null,"abstract":"With the continuous development of computer technology, the amount of data shared on the Internet has increased significantly. Increasing demand for big data continues to grow, facing many security challenges. In today’s information era, data on the Internet is vulnerable to various attacks, and everyone wants to protect their privacy. Therefore, maintaining the security of user-class data has become the current research hotspot. This article integrates four methods, the traditional two-prime RSA, two-prime mixed CRT, and three-prime CRT-RSA and traditional tetraprime and optimized quatertraprime methods operate on the same message summaries to record their respective time-consuming behaviors. optimization is of great significance.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128493264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Unsupervised Side Information for Zero-Shot Learning 零射击学习的无监督侧信息学习
Pub Date : 2021-11-01 DOI: 10.1109/CONF-SPML54095.2021.00070
Fan Zhang
Zero-Shot Learning aims to recognize unseen class images that do not appear in training, which is attracting more and more research interests in recently years. Side information is an important key to ZSL since it transfers the knowledge between seen and unseen classes. Human annotated attribute, as the most popular side information, need much human effort and time consumption during data collection. While unsupervised side information such as word2vec is not performing well since they lack the representation ability for visual information. In this paper, we propose to use CLIP features, which is learned with image and natural language pairs without human efforts, to perform ZSL. Extensive experiments on two benchmark datasets, AWA2 and CUB, demonstrates that our method is achieving impressive accuracy gain over word2vec, even beats human attributes in some circumstances.
Zero-Shot Learning旨在识别未在训练中出现的未见的类图像,近年来引起了越来越多的研究兴趣。副信息是ZSL的一个重要关键,因为它在可见类和不可见类之间传递知识。人工标注属性作为最流行的侧信息,在数据收集过程中需要耗费大量人力和时间。而word2vec等无监督侧信息由于缺乏对视觉信息的表示能力,表现不佳。在本文中,我们提出使用CLIP特征来执行ZSL,该特征是通过图像和自然语言对学习而来的,无需人工。在两个基准数据集(AWA2和CUB)上进行的大量实验表明,我们的方法比word2vec获得了令人印象深刻的精度增益,在某些情况下甚至超过了人类属性。
{"title":"Learning Unsupervised Side Information for Zero-Shot Learning","authors":"Fan Zhang","doi":"10.1109/CONF-SPML54095.2021.00070","DOIUrl":"https://doi.org/10.1109/CONF-SPML54095.2021.00070","url":null,"abstract":"Zero-Shot Learning aims to recognize unseen class images that do not appear in training, which is attracting more and more research interests in recently years. Side information is an important key to ZSL since it transfers the knowledge between seen and unseen classes. Human annotated attribute, as the most popular side information, need much human effort and time consumption during data collection. While unsupervised side information such as word2vec is not performing well since they lack the representation ability for visual information. In this paper, we propose to use CLIP features, which is learned with image and natural language pairs without human efforts, to perform ZSL. Extensive experiments on two benchmark datasets, AWA2 and CUB, demonstrates that our method is achieving impressive accuracy gain over word2vec, even beats human attributes in some circumstances.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124596953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1