首页 > 最新文献

J. Inf. Process. Syst.最新文献

英文 中文
Optimal Two-Section Layouts for the Two-Dimensional Cutting Problem 二维切削问题的最优两段布局
Pub Date : 2021-04-01 DOI: 10.3745/JIPS.01.0066
Jun Ji, Dun-hua Huang, Feifei Xing, Yaodong Cui
When generating layout schemes, both the material usage and practicality of the cutting process should be considered. This paper presents a two-section algorithm for generating guillotine-cutting schemes of rectangular blanks. It simplifies the cutting process by allowing only one size of blanks to appear in any rectangular block. The algorithm uses an implicit enumeration and a linear programming optimal cutting scheme to maximize the material usage. The algorithm was tested on some benchmark problems in the literature, and compared with the three types of layout scheme algorithm. The experimental results show that the algorithm is effective both in computation time and in material usage
在生成布置方案时,既要考虑材料的使用,又要考虑切割工艺的实用性。提出了一种生成矩形坯料断头台切割方案的两段算法。它简化了切割过程,只允许一种尺寸的毛坯出现在任何矩形块。该算法采用隐式枚举法和线性规划最优切割方案实现材料利用率最大化。在文献中的一些基准问题上对算法进行了测试,并与三种类型的布局方案算法进行了比较。实验结果表明,该算法在计算时间和材料利用率方面都是有效的
{"title":"Optimal Two-Section Layouts for the Two-Dimensional Cutting Problem","authors":"Jun Ji, Dun-hua Huang, Feifei Xing, Yaodong Cui","doi":"10.3745/JIPS.01.0066","DOIUrl":"https://doi.org/10.3745/JIPS.01.0066","url":null,"abstract":"When generating layout schemes, both the material usage and practicality of the cutting process should be considered. This paper presents a two-section algorithm for generating guillotine-cutting schemes of rectangular blanks. It simplifies the cutting process by allowing only one size of blanks to appear in any rectangular block. The algorithm uses an implicit enumeration and a linear programming optimal cutting scheme to maximize the material usage. The algorithm was tested on some benchmark problems in the literature, and compared with the three types of layout scheme algorithm. The experimental results show that the algorithm is effective both in computation time and in material usage","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122626849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reference Architecture and Operation Model for PPP (Public-Private-Partnership) Cloud PPP (Public-Private-Partnership)云的参考架构与运营模式
Pub Date : 2021-04-01 DOI: 10.3745/JIPS.04.0212
Youngkon Lee, Ukhyun Lee
The cloud has already become the core infrastructure of information systems, and government institutions are rapidly migrating information systems to the cloud. Government institutions in several countries use private clouds in their closed networks. However, because of the advantages of public clouds over private clouds, the demand for public clouds is increasing, and government institutions are expected to gradually switch to public clouds. When all data from government institutions are managed in the public cloud, the biggest concern for government institutions is the leakage of confidential data. The public-private-partnership (PPP) cloud provides a solution to this problem. PPP cloud is a form participation in a public cloud infrastructure and the building of a closed network data center. The PPP cloud prevents confidential data leakage and leverages the benefits of the public cloud to build a cloud quickly and easily maintain the cloud. In this paper, based on the case of the PPP cloud applied to the Korean government, the concept, architecture, operation model, and contract method of the PPP cloud are presented.
云已经成为信息系统的核心基础设施,政府机构正在迅速将信息系统迁移到云上。一些国家的政府机构在其封闭的网络中使用私有云。然而,由于公有云相对于私有云的优势,对公有云的需求越来越大,政府机构有望逐步转向公有云。当政府机构的所有数据都在公有云上进行管理时,政府机构最大的担忧就是机密数据的泄露。公私合作(PPP)云为这个问题提供了一个解决方案。PPP云是一种参与公共云基础设施和封闭网络数据中心建设的形式。PPP云可以防止机密数据泄露,利用公有云的优势,快速构建云,方便维护云。本文以PPP云应用于韩国政府的案例为基础,介绍了PPP云的概念、架构、运营模式和合同方式。
{"title":"Reference Architecture and Operation Model for PPP (Public-Private-Partnership) Cloud","authors":"Youngkon Lee, Ukhyun Lee","doi":"10.3745/JIPS.04.0212","DOIUrl":"https://doi.org/10.3745/JIPS.04.0212","url":null,"abstract":"The cloud has already become the core infrastructure of information systems, and government institutions are rapidly migrating information systems to the cloud. Government institutions in several countries use private clouds in their closed networks. However, because of the advantages of public clouds over private clouds, the demand for public clouds is increasing, and government institutions are expected to gradually switch to public clouds. When all data from government institutions are managed in the public cloud, the biggest concern for government institutions is the leakage of confidential data. The public-private-partnership (PPP) cloud provides a solution to this problem. PPP cloud is a form participation in a public cloud infrastructure and the building of a closed network data center. The PPP cloud prevents confidential data leakage and leverages the benefits of the public cloud to build a cloud quickly and easily maintain the cloud. In this paper, based on the case of the PPP cloud applied to the Korean government, the concept, architecture, operation model, and contract method of the PPP cloud are presented.","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115776561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Industrial Process Monitoring and Fault Diagnosis Based on Temporal Attention Augmented Deep Network 基于时间关注增强深度网络的工业过程监测与故障诊断
Pub Date : 2021-04-01 DOI: 10.3745/JIPS.04.0211
Ke Mu, Lin Luo, Qiao Wang, Fushun Mao
Following the intuition that the local information in time instances is hardly incorporated into the posterior sequence in long short-term memory (LSTM), this paper proposes an attention augmented mechanism for fault diagnosis of the complex chemical process data. Unlike conventional fault diagnosis and classification methods, an attention mechanism layer architecture is introduced to detect and focus on local temporal information. The augmented deep network results preserve each local instance’s importance and contribution and allow the interpretable feature representation and classification simultaneously. The comprehensive comparative analyses demonstrate that the developed model has a high-quality fault classification rate of 95.49%, on average. The results are comparable to those obtained using various other techniques for the Tennessee Eastman benchmark process.
针对长短期记忆(LSTM)中时间实例的局部信息难以融入后验序列的直观认识,提出了一种用于复杂化工过程数据故障诊断的注意力增强机制。与传统的故障诊断和分类方法不同,该方法引入了一种关注机制层架构来检测和关注局部时间信息。增强的深度网络结果保留了每个局部实例的重要性和贡献,同时允许可解释的特征表示和分类。综合对比分析表明,所建立的模型平均具有95.49%的高质量故障分类率。结果与使用田纳西伊士曼基准过程的各种其他技术获得的结果相当。
{"title":"Industrial Process Monitoring and Fault Diagnosis Based on Temporal Attention Augmented Deep Network","authors":"Ke Mu, Lin Luo, Qiao Wang, Fushun Mao","doi":"10.3745/JIPS.04.0211","DOIUrl":"https://doi.org/10.3745/JIPS.04.0211","url":null,"abstract":"Following the intuition that the local information in time instances is hardly incorporated into the posterior sequence in long short-term memory (LSTM), this paper proposes an attention augmented mechanism for fault diagnosis of the complex chemical process data. Unlike conventional fault diagnosis and classification methods, an attention mechanism layer architecture is introduced to detect and focus on local temporal information. The augmented deep network results preserve each local instance’s importance and contribution and allow the interpretable feature representation and classification simultaneously. The comprehensive comparative analyses demonstrate that the developed model has a high-quality fault classification rate of 95.49%, on average. The results are comparable to those obtained using various other techniques for the Tennessee Eastman benchmark process.","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129407531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Personalized Product Recommendation Method for Analyzing User Behavior Using DeepFM 使用深度调频分析用户行为的个性化产品推荐方法
Pub Date : 2021-04-01 DOI: 10.3745/JIPS.01.0069
Jianqiang Xu, Zhujiao Hu, Junzhong Zou
In a personalized product recommendation system, when the amount of log data is large or sparse, the accuracy of model recommendation will be greatly affected. To solve this problem, a personalized product recommendation method using deep factorization machine (DeepFM) to analyze user behavior is proposed. Firstly, the K-means clustering algorithm is used to cluster the original log data from the perspective of similarity to reduce the data dimension. Then, through the DeepFM parameter sharing strategy, the relationship between low-and high-order feature combinations is learned from log data, and the click rate prediction model is constructed. Finally, based on the predicted click-through rate, products are recommended to users in sequence and fed back. The area under the curve (AUC) and Logloss of the proposed method are 0.8834 and 0.0253, respectively, on the Criteo dataset, and 0.7836 and 0.0348 on the KDD2012 Cup dataset, respectively. Compared with other newer recommendation methods, the proposed method can achieve better recommendation effect.
在个性化产品推荐系统中,当日志数据量较大或稀疏时,模型推荐的准确性会受到很大影响。为了解决这一问题,提出了一种利用深度分解机(DeepFM)分析用户行为的个性化产品推荐方法。首先,采用K-means聚类算法,从相似度角度对原始日志数据进行聚类,降低数据维数;然后,通过DeepFM参数共享策略,从日志数据中学习高低阶特征组合之间的关系,构建点击率预测模型;最后,根据预测的点击率,依次向用户推荐产品并进行反馈。该方法的曲线下面积(AUC)和Logloss在Criteo数据集上分别为0.8834和0.0253,在KDD2012 Cup数据集上分别为0.7836和0.0348。与其他较新的推荐方法相比,该方法可以获得更好的推荐效果。
{"title":"Personalized Product Recommendation Method for Analyzing User Behavior Using DeepFM","authors":"Jianqiang Xu, Zhujiao Hu, Junzhong Zou","doi":"10.3745/JIPS.01.0069","DOIUrl":"https://doi.org/10.3745/JIPS.01.0069","url":null,"abstract":"In a personalized product recommendation system, when the amount of log data is large or sparse, the accuracy of model recommendation will be greatly affected. To solve this problem, a personalized product recommendation method using deep factorization machine (DeepFM) to analyze user behavior is proposed. Firstly, the K-means clustering algorithm is used to cluster the original log data from the perspective of similarity to reduce the data dimension. Then, through the DeepFM parameter sharing strategy, the relationship between low-and high-order feature combinations is learned from log data, and the click rate prediction model is constructed. Finally, based on the predicted click-through rate, products are recommended to users in sequence and fed back. The area under the curve (AUC) and Logloss of the proposed method are 0.8834 and 0.0253, respectively, on the Criteo dataset, and 0.7836 and 0.0348 on the KDD2012 Cup dataset, respectively. Compared with other newer recommendation methods, the proposed method can achieve better recommendation effect.","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121408409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
An Efficient Load Balancing Scheme for Gaming Server Using Proximal Policy Optimization Algorithm 基于近端策略优化算法的高效游戏服务器负载均衡方案
Pub Date : 2021-04-01 DOI: 10.3745/JIPS.03.0158
Hye-young Kim
Large amount of data is being generated in gaming servers due to the increase in the number of users and the variety of game services being provided. In particular, load balancing schemes for gaming servers are crucial consideration. The existing literature proposes algorithms that distribute loads in servers by mostly concentrating on load balancing and cooperative offloading. However, many proposed schemes impose heavy restrictions and assumptions, and such a limited service classification method is not enough to satisfy the wide range of service requirements. We propose a load balancing agent that combines the dynamic allocation programming method, a type of greedy algorithm, and proximal policy optimization, a reinforcement learning. Also, we compare performances of our proposed scheme and those of a scheme from previous literature, ProGreGA, by running a simulation.
由于用户数量的增加和游戏服务的多样化,游戏服务器中产生了大量的数据。特别是,游戏服务器的负载平衡方案是至关重要的考虑因素。现有文献提出的算法主要集中在负载均衡和协同卸载的服务器上分配负载。然而,许多提出的方案施加了很大的限制和假设,这种有限的服务分类方法不足以满足广泛的服务需求。我们提出了一种负载平衡代理,它结合了动态分配规划方法(一种贪婪算法)和近端策略优化(一种强化学习)。此外,我们通过运行仿真比较了我们提出的方案与先前文献中的方案ProGreGA的性能。
{"title":"An Efficient Load Balancing Scheme for Gaming Server Using Proximal Policy Optimization Algorithm","authors":"Hye-young Kim","doi":"10.3745/JIPS.03.0158","DOIUrl":"https://doi.org/10.3745/JIPS.03.0158","url":null,"abstract":"Large amount of data is being generated in gaming servers due to the increase in the number of users and the variety of game services being provided. In particular, load balancing schemes for gaming servers are crucial consideration. The existing literature proposes algorithms that distribute loads in servers by mostly concentrating on load balancing and cooperative offloading. However, many proposed schemes impose heavy restrictions and assumptions, and such a limited service classification method is not enough to satisfy the wide range of service requirements. We propose a load balancing agent that combines the dynamic allocation programming method, a type of greedy algorithm, and proximal policy optimization, a reinforcement learning. Also, we compare performances of our proposed scheme and those of a scheme from previous literature, ProGreGA, by running a simulation.","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"116 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123892177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
POI Recommendation Method Based on Multi-Source Information Fusion Using Deep Learning in Location-Based Social Networks 位置社交网络中基于深度学习的多源信息融合POI推荐方法
Pub Date : 2021-04-01 DOI: 10.3745/JIPS.01.0068
Liqiang Sun
Sign-in point of interest (POI) are extremely sparse in location-based social networks, hindering recommendation systems from capturing users’ deep-level preferences. To solve this problem, we propose a content-aware POI recommendation algorithm based on a convolutional neural network. First, using convolutional neural networks to process comment text information, we model location POI and user latent factors. Subsequently, the objective function is constructed by fusing users’ geographical information and obtaining the emotional category information. In addition, the objective function comprises matrix decomposition and maximisation of the probability objective function. Finally, we solve the objective function efficiently. The prediction rate and F1 value on the Instagram-NewYork dataset are 78.32% and 76.37%, respectively, and those on the Instagram-Chicago dataset are 85.16% and 83.29%, respectively. Comparative experiments show that the proposed method can obtain a higher precision rate than several other newer recommended methods.
在基于位置的社交网络中,登录兴趣点(POI)极其稀少,阻碍了推荐系统捕捉用户的深层次偏好。为了解决这一问题,我们提出了一种基于卷积神经网络的内容感知POI推荐算法。首先,利用卷积神经网络对评论文本信息进行处理,对位置POI和用户潜在因素进行建模。随后,通过融合用户地理信息,获取情感类信息,构建目标函数。此外,目标函数还包括矩阵分解和概率目标函数的最大化。最后,有效地求解了目标函数。在instagram -纽约数据集上预测率和F1值分别为78.32%和76.37%,在instagram -芝加哥数据集上预测率和F1值分别为85.16%和83.29%。对比实验表明,该方法比其他几种较新的推荐方法具有更高的精度。
{"title":"POI Recommendation Method Based on Multi-Source Information Fusion Using Deep Learning in Location-Based Social Networks","authors":"Liqiang Sun","doi":"10.3745/JIPS.01.0068","DOIUrl":"https://doi.org/10.3745/JIPS.01.0068","url":null,"abstract":"Sign-in point of interest (POI) are extremely sparse in location-based social networks, hindering recommendation systems from capturing users’ deep-level preferences. To solve this problem, we propose a content-aware POI recommendation algorithm based on a convolutional neural network. First, using convolutional neural networks to process comment text information, we model location POI and user latent factors. Subsequently, the objective function is constructed by fusing users’ geographical information and obtaining the emotional category information. In addition, the objective function comprises matrix decomposition and maximisation of the probability objective function. Finally, we solve the objective function efficiently. The prediction rate and F1 value on the Instagram-NewYork dataset are 78.32% and 76.37%, respectively, and those on the Instagram-Chicago dataset are 85.16% and 83.29%, respectively. Comparative experiments show that the proposed method can obtain a higher precision rate than several other newer recommended methods.","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127834565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Towards a Redundant Response Avoidance for Intelligent Chatbot 智能聊天机器人冗余响应避免方法研究
Pub Date : 2021-04-01 DOI: 10.3745/JIPS.04.0213
Hyuck-Moo Gwon, Yeong-Seok Seo
Smartphones are one of the most widely used mobile devices allowing users to communicate with each other. With the development of mobile apps, many companies now provide various services for their customers by studying interactive systems in the form of mobile messengers for business marketing and commercial promotion. Such interactive systems are called “chatbots.” In this paper, we propose a method of avoiding the redundant responses of chatbots, according to the utterances entered by the user. In addition, the redundant patterns of chatbot responses are classified into three categories for the first time. In order to verify the proposed method, a chatbot is implemented using Telegram, an open source messenger. By comparing the proposed method with an existent method for each pattern, it is confirmed that the proposed method significantly improves the redundancy avoidance rate. Furthermore, response performance and variation analysis of the proposed method are investigated in our experiment.
智能手机是使用最广泛的移动设备之一,允许用户相互交流。随着移动应用程序的发展,现在许多公司通过研究移动信使形式的交互系统为客户提供各种服务,用于商业营销和商业推广。这种互动系统被称为“聊天机器人”。在本文中,我们提出了一种方法来避免聊天机器人的冗余响应,根据用户输入的话语。此外,首次将聊天机器人响应的冗余模式划分为三类。为了验证所提出的方法,使用开源信使Telegram实现了一个聊天机器人。通过对每种模式与已有方法的比较,证实了所提方法显著提高了冗余避免率。此外,本文还对该方法的响应性能和变异分析进行了实验研究。
{"title":"Towards a Redundant Response Avoidance for Intelligent Chatbot","authors":"Hyuck-Moo Gwon, Yeong-Seok Seo","doi":"10.3745/JIPS.04.0213","DOIUrl":"https://doi.org/10.3745/JIPS.04.0213","url":null,"abstract":"Smartphones are one of the most widely used mobile devices allowing users to communicate with each other. With the development of mobile apps, many companies now provide various services for their customers by studying interactive systems in the form of mobile messengers for business marketing and commercial promotion. Such interactive systems are called “chatbots.” In this paper, we propose a method of avoiding the redundant responses of chatbots, according to the utterances entered by the user. In addition, the redundant patterns of chatbot responses are classified into three categories for the first time. In order to verify the proposed method, a chatbot is implemented using Telegram, an open source messenger. By comparing the proposed method with an existent method for each pattern, it is confirmed that the proposed method significantly improves the redundancy avoidance rate. Furthermore, response performance and variation analysis of the proposed method are investigated in our experiment.","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"410 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124362171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Facial Expression Recognition Method Using Two-Stream Convolutional Networks in Natural Scenes 自然场景中基于双流卷积网络的面部表情识别方法
Pub Date : 2021-04-01 DOI: 10.3745/JIPS.01.0070
Lixing Zhao
Aiming at the problem that complex external variables in natural scenes have a greater impact on facial expression recognition results, a facial expression recognition method based on two-stream convolutional neural network is proposed. The model introduces exponentially enhanced shared input weights before each level of convolution input, and uses soft attention mechanism modules on the space-time features of the combination of static and dynamic streams. This enables the network to autonomously find areas that are more relevant to the expression category and pay more attention to these areas. Through these means, the information of irrelevant interference areas is suppressed. In order to solve the problem of poor local robustness caused by lighting and expression changes, this paper also performs lighting preprocessing with the lighting preprocessing chain algorithm to eliminate most of the lighting effects. Experimental results on AFEW6.0 and Multi-PIE datasets show that the recognition rates of this method are 95.05% and 61.40%, respectively, which are better than other comparison methods.
针对自然场景中复杂的外部变量对面部表情识别结果影响较大的问题,提出了一种基于双流卷积神经网络的面部表情识别方法。该模型在每一层卷积输入前引入指数增强的共享输入权重,并对静态流和动态流结合的时空特征使用软注意机制模块。这使得网络能够自主地找到与表达类别更相关的区域,并更加关注这些区域。通过这些手段,可以抑制无关干扰区域的信息。为了解决光照和表情变化带来的局部鲁棒性差的问题,本文还采用光照预处理链算法进行光照预处理,消除大部分光照效果。在AFEW6.0和Multi-PIE数据集上的实验结果表明,该方法的识别率分别为95.05%和61.40%,优于其他比较方法。
{"title":"A Facial Expression Recognition Method Using Two-Stream Convolutional Networks in Natural Scenes","authors":"Lixing Zhao","doi":"10.3745/JIPS.01.0070","DOIUrl":"https://doi.org/10.3745/JIPS.01.0070","url":null,"abstract":"Aiming at the problem that complex external variables in natural scenes have a greater impact on facial expression recognition results, a facial expression recognition method based on two-stream convolutional neural network is proposed. The model introduces exponentially enhanced shared input weights before each level of convolution input, and uses soft attention mechanism modules on the space-time features of the combination of static and dynamic streams. This enables the network to autonomously find areas that are more relevant to the expression category and pay more attention to these areas. Through these means, the information of irrelevant interference areas is suppressed. In order to solve the problem of poor local robustness caused by lighting and expression changes, this paper also performs lighting preprocessing with the lighting preprocessing chain algorithm to eliminate most of the lighting effects. Experimental results on AFEW6.0 and Multi-PIE datasets show that the recognition rates of this method are 95.05% and 61.40%, respectively, which are better than other comparison methods.","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125809205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion 基于时空递归神经网络和特征融合的视频表情识别方法
Pub Date : 2021-04-01 DOI: 10.3745/JIPS.01.0067
Xuan Zhou
Automatically recognizing facial expressions in video sequences is a challenging task because there is little direct correlation between facial features and subjective emotions in video. To overcome the problem, a video facial expression recognition method using spatiotemporal recurrent neural network and feature fusion is proposed. Firstly, the video is preprocessed. Then, the double-layer cascade structure is used to detect a face in a video image. In addition, two deep convolutional neural networks are used to extract the time-domain and airspace facial features in the video. The spatial convolutional neural network is used to extract the spatial information features from each frame of the static expression images in the video. The temporal convolutional neural network is used to extract the dynamic information features from the optical flow information from multiple frames of expression images in the video. A multiplication fusion is performed with the spatiotemporal features learned by the two deep convolutional neural networks. Finally, the fused features are input to the support vector machine to realize the facial expression classification task. The experimental results on cNTERFACE, RML, and AFEW6.0 datasets show that the recognition rates obtained by the proposed method are as high as 88.67%, 70.32%, and 63.84%, respectively. Comparative experiments show that the proposed method obtains higher recognition accuracy than other recently reported methods.
由于视频中的面部特征与主观情绪之间没有直接的相关性,因此自动识别视频序列中的面部表情是一项具有挑战性的任务。为了克服这一问题,提出了一种基于时空递归神经网络和特征融合的视频面部表情识别方法。首先,对视频进行预处理。然后,采用双层级联结构对视频图像中的人脸进行检测。此外,利用两个深度卷积神经网络提取视频中的时域和空域面部特征。利用空间卷积神经网络从视频中静态表情图像的每一帧中提取空间信息特征。利用时间卷积神经网络从视频中多帧表情图像的光流信息中提取动态信息特征。利用两个深度卷积神经网络学习到的时空特征进行乘法融合。最后,将融合后的特征输入到支持向量机中,实现面部表情分类任务。在cNTERFACE、RML和AFEW6.0数据集上的实验结果表明,该方法的识别率分别高达88.67%、70.32%和63.84%。对比实验表明,该方法比目前报道的其他方法具有更高的识别精度。
{"title":"Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion","authors":"Xuan Zhou","doi":"10.3745/JIPS.01.0067","DOIUrl":"https://doi.org/10.3745/JIPS.01.0067","url":null,"abstract":"Automatically recognizing facial expressions in video sequences is a challenging task because there is little direct correlation between facial features and subjective emotions in video. To overcome the problem, a video facial expression recognition method using spatiotemporal recurrent neural network and feature fusion is proposed. Firstly, the video is preprocessed. Then, the double-layer cascade structure is used to detect a face in a video image. In addition, two deep convolutional neural networks are used to extract the time-domain and airspace facial features in the video. The spatial convolutional neural network is used to extract the spatial information features from each frame of the static expression images in the video. The temporal convolutional neural network is used to extract the dynamic information features from the optical flow information from multiple frames of expression images in the video. A multiplication fusion is performed with the spatiotemporal features learned by the two deep convolutional neural networks. Finally, the fused features are input to the support vector machine to realize the facial expression classification task. The experimental results on cNTERFACE, RML, and AFEW6.0 datasets show that the recognition rates obtained by the proposed method are as high as 88.67%, 70.32%, and 63.84%, respectively. Comparative experiments show that the proposed method obtains higher recognition accuracy than other recently reported methods.","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124512945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
RAVIP: Real-Time AI Vision Platform for Heterogeneous Multi-Channel Video Stream RAVIP:异构多通道视频流实时AI视觉平台
Pub Date : 2021-04-01 DOI: 10.3745/JIPS.02.0154
Jeong-Hun Lee, Kwang-il Hwang
Object detection techniques based on deep learning such as YOLO have high detection performance and precision in a single channel video stream. In order to expand to multiple channel object detection in real-time, however, high-performance hardware is required. In this paper, we propose a novel back-end server framework, a real-time AI vision platform (RAVIP), which can extend the object detection function from single channel to simultaneous multi-channels, which can work well even in low-end server hardware. RAVIP assembles appropriate component modules from the RODEM (real-time object detection module) Base to create perchannel instances for each channel, enabling efficient parallelization of object detection instances on limited hardware resources through continuous monitoring with respect to resource utilization. Through practical experiments, RAVIP shows that it is possible to optimize CPU, GPU, and memory utilization while performing object detection service in a multi-channel situation. In addition, it has been proven that RAVIP can provide object detection services with 25 FPS for all 16 channels at the same time.
基于深度学习的目标检测技术如YOLO在单通道视频流中具有较高的检测性能和精度。然而,为了扩展到实时的多通道目标检测,需要高性能的硬件。本文提出了一种新颖的后端服务器框架——实时人工智能视觉平台(RAVIP),它可以将目标检测功能从单通道扩展到同时多通道,即使在低端服务器硬件上也能很好地工作。RAVIP从RODEM(实时目标检测模块)库中组装适当的组件模块,为每个通道创建跨通道实例,通过对资源利用率的持续监控,在有限的硬件资源上实现目标检测实例的高效并行化。通过实际实验,RAVIP表明,在多通道情况下,在执行目标检测服务时,可以优化CPU、GPU和内存利用率。此外,已经证明RAVIP可以同时为所有16个通道提供25 FPS的目标检测服务。
{"title":"RAVIP: Real-Time AI Vision Platform for Heterogeneous Multi-Channel Video Stream","authors":"Jeong-Hun Lee, Kwang-il Hwang","doi":"10.3745/JIPS.02.0154","DOIUrl":"https://doi.org/10.3745/JIPS.02.0154","url":null,"abstract":"Object detection techniques based on deep learning such as YOLO have high detection performance and precision in a single channel video stream. In order to expand to multiple channel object detection in real-time, however, high-performance hardware is required. In this paper, we propose a novel back-end server framework, a real-time AI vision platform (RAVIP), which can extend the object detection function from single channel to simultaneous multi-channels, which can work well even in low-end server hardware. RAVIP assembles appropriate component modules from the RODEM (real-time object detection module) Base to create perchannel instances for each channel, enabling efficient parallelization of object detection instances on limited hardware resources through continuous monitoring with respect to resource utilization. Through practical experiments, RAVIP shows that it is possible to optimize CPU, GPU, and memory utilization while performing object detection service in a multi-channel situation. In addition, it has been proven that RAVIP can provide object detection services with 25 FPS for all 16 channels at the same time.","PeriodicalId":415161,"journal":{"name":"J. Inf. Process. Syst.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117295943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
J. Inf. Process. Syst.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1