首页 > 最新文献

2022 International Joint Conference on Neural Networks (IJCNN)最新文献

英文 中文
Dirichlet Mixture Model of Hawkes Processes Based Patent User Role Discovery Model 基于Hawkes过程的专利用户角色发现Dirichlet混合模型
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892056
Weidong Liu, Quanping Zhang, Wenbo Qiao
With the complexity of patent transformation scenarios, the roles of users have become more diverse. Therefore, how to discover the roles of different users in the patent transformation scenarios has become a hot issue. In the process of patent transformation, the behaviors of each user are regular, historical behavior has an impact on the current behavior. Because the Hawkes processes can take into account the characteristic of self-exciting among behaviors, we explored the Dirichlet Mixture model of Hawkes Processes based on variational inference to cluster users for user roles discovery. In this model, different Hawkes processes correspond to different user types. Dirichlet distribution is used as the prior distribution of user clusters. The dependence of current behavior on historical behavior is expressed as intensity function. The variational inference is used to learn the model. The model is evaluated by Precision, Recall and F-measure, which shows that our model has good accuracy.
随着专利转化场景的复杂化,用户的角色也变得更加多样化。因此,如何发现不同用户在专利转化场景中的角色成为一个热点问题。在专利转化过程中,每个用户的行为都是有规律的,历史行为对当前行为产生影响。由于Hawkes过程可以考虑到行为之间的自激特性,我们探索了基于变分推理的Hawkes过程的Dirichlet混合模型,以聚类用户进行用户角色发现。在该模型中,不同的Hawkes进程对应不同的用户类型。使用Dirichlet分布作为用户簇的先验分布。当前行为对历史行为的依赖关系表示为强度函数。采用变分推理对模型进行学习。通过Precision、Recall和F-measure对模型进行了评价,结果表明模型具有较好的准确性。
{"title":"Dirichlet Mixture Model of Hawkes Processes Based Patent User Role Discovery Model","authors":"Weidong Liu, Quanping Zhang, Wenbo Qiao","doi":"10.1109/IJCNN55064.2022.9892056","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892056","url":null,"abstract":"With the complexity of patent transformation scenarios, the roles of users have become more diverse. Therefore, how to discover the roles of different users in the patent transformation scenarios has become a hot issue. In the process of patent transformation, the behaviors of each user are regular, historical behavior has an impact on the current behavior. Because the Hawkes processes can take into account the characteristic of self-exciting among behaviors, we explored the Dirichlet Mixture model of Hawkes Processes based on variational inference to cluster users for user roles discovery. In this model, different Hawkes processes correspond to different user types. Dirichlet distribution is used as the prior distribution of user clusters. The dependence of current behavior on historical behavior is expressed as intensity function. The variational inference is used to learn the model. The model is evaluated by Precision, Recall and F-measure, which shows that our model has good accuracy.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115715719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Meta Proximal Policy Optimization for Cooperative Multi-Agent Continuous Control 协同多智能体连续控制的元近端策略优化
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892004
Boli Fang, Zhenghao Peng, Hao Sun, Qin Zhang
In this paper we propose Multi-Agent Proxy Proximal Policy Optimization (MA3PO), a novel multi-agent deep reinforcement learning algorithm that tackles the challenge of cooperative continuous multi-agent control. Our method is driven by the observation that most existing multi-agent reinforcement learning algorithms mainly focus on discrete state/action spaces and are thus computationally infeasible when extended to environments with continuous state/action spaces. To address the issue of computational complexity and to better model intra-agent collaboration, we make use of the recently successful Proximal Policy Optimization algorithm that effectively explores of continuous action spaces, and incorporate the notion of intrinsic motivation via meta-gradient methods so as to stimulate the behavior of individual agents in cooperative multi-agent settings. Towards these ends, we design proxy rewards to quantify the effect of individual agent-level intrinsic motivation onto the team-level reward, and apply meta-gradient methods to leverage such an addition so that our algorithm can learn the team-level cumulative reward effectively. Experiments on various multi-agent reinforcement learning benchmark environments with continuous action spaces demonstrate that our algorithm is not only comparable with the existing state-of-the-art benchmarks, but also significantly reduces training time complexity.
本文提出了一种新的多智能体深度强化学习算法——多智能体代理近端策略优化算法(MA3PO),解决了多智能体协作连续控制的难题。我们的方法是由观察到大多数现有的多智能体强化学习算法主要关注离散状态/动作空间,因此当扩展到具有连续状态/动作空间的环境时,计算上是不可行的。为了解决计算复杂性问题并更好地模拟智能体内部协作,我们利用最近成功的邻域策略优化算法,该算法有效地探索了连续的动作空间,并通过元梯度方法引入了内在动机的概念,从而在多智能体协作设置中刺激个体智能体的行为。为此,我们设计代理奖励来量化个体代理级内在动机对团队级奖励的影响,并应用元梯度方法来利用这种附加,以便我们的算法可以有效地学习团队级累积奖励。在具有连续动作空间的各种多智能体强化学习基准环境中进行的实验表明,我们的算法不仅可以与现有的最先进的基准相媲美,而且可以显着降低训练时间复杂度。
{"title":"Meta Proximal Policy Optimization for Cooperative Multi-Agent Continuous Control","authors":"Boli Fang, Zhenghao Peng, Hao Sun, Qin Zhang","doi":"10.1109/IJCNN55064.2022.9892004","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892004","url":null,"abstract":"In this paper we propose Multi-Agent Proxy Proximal Policy Optimization (MA3PO), a novel multi-agent deep reinforcement learning algorithm that tackles the challenge of cooperative continuous multi-agent control. Our method is driven by the observation that most existing multi-agent reinforcement learning algorithms mainly focus on discrete state/action spaces and are thus computationally infeasible when extended to environments with continuous state/action spaces. To address the issue of computational complexity and to better model intra-agent collaboration, we make use of the recently successful Proximal Policy Optimization algorithm that effectively explores of continuous action spaces, and incorporate the notion of intrinsic motivation via meta-gradient methods so as to stimulate the behavior of individual agents in cooperative multi-agent settings. Towards these ends, we design proxy rewards to quantify the effect of individual agent-level intrinsic motivation onto the team-level reward, and apply meta-gradient methods to leverage such an addition so that our algorithm can learn the team-level cumulative reward effectively. Experiments on various multi-agent reinforcement learning benchmark environments with continuous action spaces demonstrate that our algorithm is not only comparable with the existing state-of-the-art benchmarks, but also significantly reduces training time complexity.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124407428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aspect-aware Asymmetric Representation Learning Network for Review-based Recommendation 基于评论的推荐的面向方面的非对称表示学习网络
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892559
Hao Liu, H. Qiao, Xiaoyu Shi, Mingsheng Shang
Recently, user-provided reviews have been identified as an essential resource to improve user and item representation in recommender systems. Previous methods focus on the review-based recommender typically leverages symmetric networks to process user and item reviews. However, in reality, these two sets of reviews are markedly different: a user's reviews reflect the experience of buying diverse items and show their heterogeneous interests. In contrast, an item's reviews emphasize the quality of the specific item. Thus an item's reviews are usually homogeneous. This paper seeks to explore the aspect of review difference in the review-based recommendation framework. We propose a novel asymmetric neural network model that accurately learns the user and item representation by identifying this critical difference. We focus on capturing the dynamic change of user interest for the user-aspect reviews via modeling the temporal information into the conventional neural network(CNN). On the other side, we try to identify a specific item's essential yet essential features by utilizing the self-attention neural network. Finally, a factorization machine (FM) is adopted to finish the rating prediction task, where the user and item IDs are encoded as supplementary review embedding. We conduct comprehensive experiments on four Amazon datasets, and the experimental results show that our proposed model consistently outperforms several state-of-the-art methods.
最近,用户提供的评论被认为是改善推荐系统中用户和项目表示的重要资源。以前的方法主要关注基于评论的推荐,通常利用对称网络来处理用户和项目评论。然而,在现实中,这两组评论是明显不同的:用户的评论反映了购买不同商品的体验,并显示了他们的异质兴趣。相比之下,一个项目的评论强调的是具体项目的质量。因此,一个项目的评论通常是同质的。本文旨在探讨基于评论的推荐框架中评论差异的方面。我们提出了一种新的非对称神经网络模型,通过识别这种关键差异来准确地学习用户和物品的表示。我们的重点是通过将时间信息建模到传统神经网络(CNN)中来捕捉用户方面评论的用户兴趣的动态变化。另一方面,我们试图利用自注意神经网络来识别特定物品的本质特征。最后,采用因子分解机(FM)完成评分预测任务,其中用户id和项目id被编码为补充评论嵌入。我们在四个Amazon数据集上进行了全面的实验,实验结果表明,我们提出的模型始终优于几种最先进的方法。
{"title":"Aspect-aware Asymmetric Representation Learning Network for Review-based Recommendation","authors":"Hao Liu, H. Qiao, Xiaoyu Shi, Mingsheng Shang","doi":"10.1109/IJCNN55064.2022.9892559","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892559","url":null,"abstract":"Recently, user-provided reviews have been identified as an essential resource to improve user and item representation in recommender systems. Previous methods focus on the review-based recommender typically leverages symmetric networks to process user and item reviews. However, in reality, these two sets of reviews are markedly different: a user's reviews reflect the experience of buying diverse items and show their heterogeneous interests. In contrast, an item's reviews emphasize the quality of the specific item. Thus an item's reviews are usually homogeneous. This paper seeks to explore the aspect of review difference in the review-based recommendation framework. We propose a novel asymmetric neural network model that accurately learns the user and item representation by identifying this critical difference. We focus on capturing the dynamic change of user interest for the user-aspect reviews via modeling the temporal information into the conventional neural network(CNN). On the other side, we try to identify a specific item's essential yet essential features by utilizing the self-attention neural network. Finally, a factorization machine (FM) is adopted to finish the rating prediction task, where the user and item IDs are encoded as supplementary review embedding. We conduct comprehensive experiments on four Amazon datasets, and the experimental results show that our proposed model consistently outperforms several state-of-the-art methods.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123079732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NAM: What Does a Neural Network See? NAM:神经网络能看到什么?
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892442
Katarzyna Filus, J. Domańska
Deep Convolutional Neural Networks (CNNs) still lack interpretability and are often treated as miraculous blackbox machines. Therefore, when an intelligent system fails, it is usually difficult to troubleshoot the problems. Among others, these issues can be caused by incorrect decisions of the CNN classifier. The other reason can be selective “blindness” of the CNN - caused by an insufficient generalization of the convolutional feature extractor. To better understand the CNN decisions, methods from the Class Activation Mapping (CAM) family have been introduced. In contrast to CAM techniques, which focus on the model's predictions (thus a classifier), we propose a simple yet informative way to visualize network activation - Network Activation Mapping (NAM). Our method targets the most important part of the CNN - a convolutional feature extractor. Opposed to CAM methods, NAM is class-and classifier-independent and provides insight into what the neural network focuses on during the feature extraction process and what features it finds the most prominent in the examined image. Due to the classifier-independence, it can be used with all CNN models. In our experiments, we demonstrate how the performance of a convolutional feature extractor can be preliminarily evaluated using NAM. We also present results obtained for a simple NAM-based visual attention mechanism, which allows us to filter out less informative regions of the image and facilitates the decision making process.
深度卷积神经网络(cnn)仍然缺乏可解释性,通常被视为神奇的黑箱机器。因此,当智能系统出现故障时,通常很难排除故障。其中,这些问题可能是由CNN分类器的错误决策引起的。另一个原因可能是CNN的选择性“盲目性”——由卷积特征提取器泛化不足引起的。为了更好地理解CNN决策,引入了类激活映射(CAM)家族的方法。与CAM技术相反,CAM技术侧重于模型的预测(因此是分类器),我们提出了一种简单但信息丰富的方法来可视化网络激活-网络激活映射(NAM)。我们的方法针对的是CNN最重要的部分——卷积特征提取器。与CAM方法相反,NAM是独立于类和分类器的,并提供了神经网络在特征提取过程中关注的内容以及它在检查的图像中发现的最突出的特征。由于与分类器无关,它可以用于所有CNN模型。在我们的实验中,我们演示了如何使用NAM初步评估卷积特征提取器的性能。我们还介绍了一个简单的基于nam的视觉注意机制的结果,该机制允许我们过滤掉图像中信息较少的区域,从而促进决策过程。
{"title":"NAM: What Does a Neural Network See?","authors":"Katarzyna Filus, J. Domańska","doi":"10.1109/IJCNN55064.2022.9892442","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892442","url":null,"abstract":"Deep Convolutional Neural Networks (CNNs) still lack interpretability and are often treated as miraculous blackbox machines. Therefore, when an intelligent system fails, it is usually difficult to troubleshoot the problems. Among others, these issues can be caused by incorrect decisions of the CNN classifier. The other reason can be selective “blindness” of the CNN - caused by an insufficient generalization of the convolutional feature extractor. To better understand the CNN decisions, methods from the Class Activation Mapping (CAM) family have been introduced. In contrast to CAM techniques, which focus on the model's predictions (thus a classifier), we propose a simple yet informative way to visualize network activation - Network Activation Mapping (NAM). Our method targets the most important part of the CNN - a convolutional feature extractor. Opposed to CAM methods, NAM is class-and classifier-independent and provides insight into what the neural network focuses on during the feature extraction process and what features it finds the most prominent in the examined image. Due to the classifier-independence, it can be used with all CNN models. In our experiments, we demonstrate how the performance of a convolutional feature extractor can be preliminarily evaluated using NAM. We also present results obtained for a simple NAM-based visual attention mechanism, which allows us to filter out less informative regions of the image and facilitates the decision making process.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116658823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Class Distribution Monitoring for Concept Drift Detection 概念漂移检测的类分布监测
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892772
Diego Stucchi, Luca Frittoli, G. Boracchi
We introduce Class Distribution Monitoring (CDM), an effective concept-drift detection scheme that monitors the class-conditional distributions of a datastream. In particular, our solution leverages multiple instances of an online and nonparametric change-detection algorithm based on QuantTree. CDM reports a concept drift after detecting a distribution change in any class, thus identifying which classes are affected by the concept drift. This can be precious information for diagnostics and adaptation. Our experiments on synthetic and real-world datastreams show that when the concept drift affects a few classes, CDM outperforms algorithms monitoring the overall data distribution, while achieving similar detection delays when the drift affects all the classes. Moreover, CDM outperforms comparable approaches that monitor the classification error, particularly when the change is not very apparent. Finally, we demonstrate that CDM inherits the properties of the underlying change detector, yielding an effective control over the expected time before a false alarm, or Average Run Length (ARL0).
我们介绍了类分布监控(CDM),这是一种有效的概念漂移检测方案,用于监控数据流的类条件分布。特别是,我们的解决方案利用了基于quantreree的在线和非参数变化检测算法的多个实例。CDM在检测到任何类中的分布变化后报告概念漂移,从而确定哪些类受到概念漂移的影响。这可能是诊断和适应的宝贵信息。我们在合成数据流和真实数据流上的实验表明,当概念漂移影响几个类时,CDM优于监控整体数据分布的算法,同时在漂移影响所有类时实现类似的检测延迟。此外,CDM优于监视分类错误的类似方法,特别是在变化不是很明显的情况下。最后,我们演示了CDM继承了底层变更检测器的属性,在出现假警报之前产生对预期时间或平均运行长度(ARL0)的有效控制。
{"title":"Class Distribution Monitoring for Concept Drift Detection","authors":"Diego Stucchi, Luca Frittoli, G. Boracchi","doi":"10.1109/IJCNN55064.2022.9892772","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892772","url":null,"abstract":"We introduce Class Distribution Monitoring (CDM), an effective concept-drift detection scheme that monitors the class-conditional distributions of a datastream. In particular, our solution leverages multiple instances of an online and nonparametric change-detection algorithm based on QuantTree. CDM reports a concept drift after detecting a distribution change in any class, thus identifying which classes are affected by the concept drift. This can be precious information for diagnostics and adaptation. Our experiments on synthetic and real-world datastreams show that when the concept drift affects a few classes, CDM outperforms algorithms monitoring the overall data distribution, while achieving similar detection delays when the drift affects all the classes. Moreover, CDM outperforms comparable approaches that monitor the classification error, particularly when the change is not very apparent. Finally, we demonstrate that CDM inherits the properties of the underlying change detector, yielding an effective control over the expected time before a false alarm, or Average Run Length (ARL0).","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120856232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Fractional-Order Learning Systems 分数阶学习系统
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892971
S. Talebi, Stefan Werner, D. Mandic
From the inaugural steps of McCulloch and Pitts to put forth a composition for an electrical brain, that combined with the conception of an adaptive leaning mechanism by Widrow and Hoff has given rise to the phenomena of intelligent machines, machine learning techniques have gained the status of a miracle solution in a myriad of scientific fields. At the heart of these techniques lies iterative optimisation processes that are derived based on first, and in some cases, second-order derivatives. This manuscript, however, aims to expand the mentioned framework to that of using fractional-order derivatives. The entire format of adaptation is revised form the perspective of fractional-order calculus and the appropriate framework for taking full advantage of the fractional-order calculus in learning and adaptation paradigms is formulated. For rigour, the structure of behavioural analysis and performance prediction of this novel class of learning machines is also forged.
从麦卡洛克(McCulloch)和皮茨(Pitts)首次提出电子大脑的构成,再加上Widrow和Hoff提出的自适应学习机制的概念,智能机器的现象应运而生,机器学习技术已经在无数科学领域获得了奇迹解决方案的地位。这些技术的核心是基于一阶导数的迭代优化过程,在某些情况下是二阶导数。然而,本文的目的是将上述框架扩展到使用分数阶导数的框架。从分数阶演算的角度对整个适应范式进行了修正,并提出了在学习和适应范式中充分利用分数阶演算的适当框架。严格来说,这种新型学习机的行为分析和性能预测结构也是锻造出来的。
{"title":"Fractional-Order Learning Systems","authors":"S. Talebi, Stefan Werner, D. Mandic","doi":"10.1109/IJCNN55064.2022.9892971","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892971","url":null,"abstract":"From the inaugural steps of McCulloch and Pitts to put forth a composition for an electrical brain, that combined with the conception of an adaptive leaning mechanism by Widrow and Hoff has given rise to the phenomena of intelligent machines, machine learning techniques have gained the status of a miracle solution in a myriad of scientific fields. At the heart of these techniques lies iterative optimisation processes that are derived based on first, and in some cases, second-order derivatives. This manuscript, however, aims to expand the mentioned framework to that of using fractional-order derivatives. The entire format of adaptation is revised form the perspective of fractional-order calculus and the appropriate framework for taking full advantage of the fractional-order calculus in learning and adaptation paradigms is formulated. For rigour, the structure of behavioural analysis and performance prediction of this novel class of learning machines is also forged.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120878411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trans-SBLGCN: A Transfer Learning Model for Event Logic Knowledge Graph Construction of Fault Diagnosis transsblgcn:一种故障诊断事件逻辑知识图构建的迁移学习模型
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892075
R. Lin, Lianglun Cheng, Tao Wang, Jianfeng Deng
Taking fault diagnosis corpus as the research object, an event logic knowledge graph construction method is proposed in this paper. Firstly, we propose a data labeling strategy based on a constructed event logic ontology model, then collect large-scale robot transmission system fault diagnosis corpus, and label part of the data according to the strategy. Secondly, we propose a transfer learning model called Trans-SBLGCN for event argument entity and event argument relation joint extraction. A language model is trained based on large-scale unlabeled fault diagnosis corpus and transferred to a model based on stacked bidirectional long short term memory (BiLSTM) and bidirectional graph convolutional network (BiGCN). Experimental results show that the method is superior to other methods. Finally, an event logic knowledge graph of robot transmission system fault diagnosis is constructed to provide decision support for autonomous robot transmission system fault diagnosis.
以故障诊断语料库为研究对象,提出了一种事件逻辑知识图的构建方法。首先,在构建事件逻辑本体模型的基础上提出数据标注策略,然后收集大型机器人传动系统故障诊断语料库,并根据该策略对部分数据进行标注。其次,我们提出了一种transsblgcn迁移学习模型,用于事件参数实体和事件参数关系联合抽取。基于大规模无标记故障诊断语料库训练语言模型,并将其转化为基于堆叠双向长短期记忆(BiLSTM)和双向图卷积网络(BiGCN)的模型。实验结果表明,该方法优于其他方法。最后,构建了机器人传动系统故障诊断的事件逻辑知识图,为自主机器人传动系统故障诊断提供决策支持。
{"title":"Trans-SBLGCN: A Transfer Learning Model for Event Logic Knowledge Graph Construction of Fault Diagnosis","authors":"R. Lin, Lianglun Cheng, Tao Wang, Jianfeng Deng","doi":"10.1109/IJCNN55064.2022.9892075","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892075","url":null,"abstract":"Taking fault diagnosis corpus as the research object, an event logic knowledge graph construction method is proposed in this paper. Firstly, we propose a data labeling strategy based on a constructed event logic ontology model, then collect large-scale robot transmission system fault diagnosis corpus, and label part of the data according to the strategy. Secondly, we propose a transfer learning model called Trans-SBLGCN for event argument entity and event argument relation joint extraction. A language model is trained based on large-scale unlabeled fault diagnosis corpus and transferred to a model based on stacked bidirectional long short term memory (BiLSTM) and bidirectional graph convolutional network (BiGCN). Experimental results show that the method is superior to other methods. Finally, an event logic knowledge graph of robot transmission system fault diagnosis is constructed to provide decision support for autonomous robot transmission system fault diagnosis.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120930424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TRICAN: Multi-Modal Hateful Memes Detection with Triplet-Relation Information Cross-Attention Network 基于三重关系信息交叉注意网络的多模态仇恨模因检测
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892164
Xiaolin Liang, Yajuan Huang, Wen Liu, He Zhu, Zhao Liang, Libo Chen
Memes are spreading on social networking. Most are created to be humorous, while some become hateful with the combination of images and words, conveying negative information to people. The hateful memes detection poses an interesting multimodal fusion problem, unlike traditional multi-modal tasks, the majority of memos have images and text that are only weakly consistent or even uncorrelated, so various modalities contained in the data play an important role in predicting its results. In this paper, we attempt to work on the Facebook Meme challenge, which solves the binary classification task of predicting a meme's hatefulness or not. We extract triplet-relation information from origin OCR text features, image content features and image caption features and proposed a novel cross-attention network to address this task. TRICAN leverages object detection and image caption models to explore visual modalities to obtain “actual captions” and then combines combine origin OCR text with the multi-modal representation to perform hateful memes detection. These meme-related features are then reconstructed and fused into one feature vector for prediction. We have performed extensively experimental on multi-modal memory datasets. Experimental results demonstrate the effectiveness of TRICAN and the usefulness of triplet-relation information.
表情包在社交网络上传播。大多数是为了幽默而创作的,而有些则是通过图像和文字的结合而变得可恨,向人们传达负面信息。仇恨模因检测提出了一个有趣的多模态融合问题,与传统的多模态任务不同,大多数备忘录的图像和文本只有弱一致性甚至不相关,因此数据中包含的各种模态在预测其结果中起着重要作用。在本文中,我们试图研究Facebook Meme挑战,该挑战解决了预测Meme是否可恨的二元分类任务。我们从原始OCR文本特征、图像内容特征和图像标题特征中提取三重关系信息,并提出了一种新的交叉注意网络来解决这一问题。TRICAN利用目标检测和图像标题模型探索视觉模态,获得“实际标题”,然后将组合原始OCR文本与多模态表示相结合,进行仇恨模因检测。然后将这些模因相关的特征重构并融合成一个特征向量进行预测。我们在多模态记忆数据集上进行了大量的实验。实验结果证明了TRICAN的有效性和三重关系信息的实用性。
{"title":"TRICAN: Multi-Modal Hateful Memes Detection with Triplet-Relation Information Cross-Attention Network","authors":"Xiaolin Liang, Yajuan Huang, Wen Liu, He Zhu, Zhao Liang, Libo Chen","doi":"10.1109/IJCNN55064.2022.9892164","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892164","url":null,"abstract":"Memes are spreading on social networking. Most are created to be humorous, while some become hateful with the combination of images and words, conveying negative information to people. The hateful memes detection poses an interesting multimodal fusion problem, unlike traditional multi-modal tasks, the majority of memos have images and text that are only weakly consistent or even uncorrelated, so various modalities contained in the data play an important role in predicting its results. In this paper, we attempt to work on the Facebook Meme challenge, which solves the binary classification task of predicting a meme's hatefulness or not. We extract triplet-relation information from origin OCR text features, image content features and image caption features and proposed a novel cross-attention network to address this task. TRICAN leverages object detection and image caption models to explore visual modalities to obtain “actual captions” and then combines combine origin OCR text with the multi-modal representation to perform hateful memes detection. These meme-related features are then reconstructed and fused into one feature vector for prediction. We have performed extensively experimental on multi-modal memory datasets. Experimental results demonstrate the effectiveness of TRICAN and the usefulness of triplet-relation information.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120986433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CatchPhish: Model for detecting homographic attacks on phishing pages CatchPhish:用于检测网络钓鱼页面上的同形攻击的模型
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892525
Lucas Candeia Teixeira, Júlio César Gomes De Barros, Bruno José Torres Fernandes, Carlo Marcelo Revoredo da Silva
The growth in the numbers of phishing attacks, along with the volume of successful frauds, demonstrates vul-nerabilities of the protection tools and exposes the advance in the refinement of the attacks. In more than 70% of cases, the improvements rely on the presence of homographic terms as a mechanism to embed reliability in malicious pages. In this scenario, the present study proposes an intelligent approach denominated CatchPhish, which, through the attack target brand identification, can infer the veracity of the page evaluated. CatchPhish uses a Siamese neural network capable of identifying the presence of typosquatting mentions in phishing pages. In the experiments, the proposed approach achieved 99.30% of assertiveness. In addition, the proposed approach stands out for its ability to produce terms for training, so, instead of providing the tool with a high amount of distorted terms, it provides the mark preceded by the correct spelling, which circumvents a strong obstacle in the construction of protection mechanisms.
网络钓鱼攻击数量的增长,以及成功欺诈的数量,表明了保护工具的漏洞,并暴露了攻击改进的进步。在超过70%的情况下,改进依赖于同形词的存在,作为在恶意页面中嵌入可靠性的机制。在这种情况下,本研究提出了一种名为CatchPhish的智能方法,该方法通过攻击目标品牌识别,可以推断被评估页面的真实性。CatchPhish使用连体神经网络,能够识别钓鱼页面中出现的打字错误。在实验中,所提出的方法达到了99.30%的自信。此外,该方法的突出之处在于它能够生成用于训练的术语,因此,它不会为工具提供大量扭曲的术语,而是提供拼写正确的标记,这规避了构建保护机制中的一个强大障碍。
{"title":"CatchPhish: Model for detecting homographic attacks on phishing pages","authors":"Lucas Candeia Teixeira, Júlio César Gomes De Barros, Bruno José Torres Fernandes, Carlo Marcelo Revoredo da Silva","doi":"10.1109/IJCNN55064.2022.9892525","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892525","url":null,"abstract":"The growth in the numbers of phishing attacks, along with the volume of successful frauds, demonstrates vul-nerabilities of the protection tools and exposes the advance in the refinement of the attacks. In more than 70% of cases, the improvements rely on the presence of homographic terms as a mechanism to embed reliability in malicious pages. In this scenario, the present study proposes an intelligent approach denominated CatchPhish, which, through the attack target brand identification, can infer the veracity of the page evaluated. CatchPhish uses a Siamese neural network capable of identifying the presence of typosquatting mentions in phishing pages. In the experiments, the proposed approach achieved 99.30% of assertiveness. In addition, the proposed approach stands out for its ability to produce terms for training, so, instead of providing the tool with a high amount of distorted terms, it provides the mark preceded by the correct spelling, which circumvents a strong obstacle in the construction of protection mechanisms.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127107585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Model for COVID-19 Prediction Based on Spatio-temporal Convolutional Network 基于时空卷积网络的COVID-19预测模型
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892790
Zhengkai Wang, Weiyu Zhang, Zhongxiu Xia, Wenpeng Lu
COVID-19 has become a worldwide epidemic. Prediction of COVID-19 is an effective way to control its spread. Recently, some research efforts have made great progress on this task. However, these works rarely combine both the temporal and spatial domains for case number prediction. Moreover, most of them are only suitable for short-term prediction tasks, which cannot achieve good long-term predicting effects. Therefore, we use a method that combines human-mobility factors and time-series factors - the Spatio-temporal convolutional network (G-TCN) to deal with these problems. Firstly, we use data on the mobility of people between regions to generate graphs of regional relationships. Secondly, to process the spatial information at each moment, we apply multi-layer graph convolutional neural networks (GCNs) to aggregate multi-layer neighborhood information. And we input the information obtained by GCNs at different moments into temporal convolutional networks (TCNs), which are used to process the time-series information. Finally, we tested the proposed G-TCN method using datasets from four countries. The experimental results show that G-TCN has lower prediction errors than other comparison methods and can better fit the trend of COVID-19 development.
COVID-19已成为全球性流行病。预测新冠肺炎疫情是控制疫情传播的有效手段。近年来,一些研究工作在这方面取得了很大进展。然而,这些工作很少结合时间和空间域来预测病例数。而且,它们大多只适用于短期预测任务,无法达到较好的长期预测效果。因此,我们使用一种结合人类迁移因素和时间序列因素的方法-时空卷积网络(G-TCN)来处理这些问题。首先,我们使用区域间人口流动数据生成区域关系图。其次,利用多层图卷积神经网络(GCNs)对多层邻域信息进行聚合,处理每一时刻的空间信息;然后将GCNs在不同时刻获取的信息输入到时序卷积网络(temporal convolutional networks, TCNs)中,由TCNs对时间序列信息进行处理。最后,我们使用来自四个国家的数据集测试了所提出的G-TCN方法。实验结果表明,与其他对比方法相比,G-TCN预测误差更小,能更好地拟合COVID-19的发展趋势。
{"title":"A Model for COVID-19 Prediction Based on Spatio-temporal Convolutional Network","authors":"Zhengkai Wang, Weiyu Zhang, Zhongxiu Xia, Wenpeng Lu","doi":"10.1109/IJCNN55064.2022.9892790","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892790","url":null,"abstract":"COVID-19 has become a worldwide epidemic. Prediction of COVID-19 is an effective way to control its spread. Recently, some research efforts have made great progress on this task. However, these works rarely combine both the temporal and spatial domains for case number prediction. Moreover, most of them are only suitable for short-term prediction tasks, which cannot achieve good long-term predicting effects. Therefore, we use a method that combines human-mobility factors and time-series factors - the Spatio-temporal convolutional network (G-TCN) to deal with these problems. Firstly, we use data on the mobility of people between regions to generate graphs of regional relationships. Secondly, to process the spatial information at each moment, we apply multi-layer graph convolutional neural networks (GCNs) to aggregate multi-layer neighborhood information. And we input the information obtained by GCNs at different moments into temporal convolutional networks (TCNs), which are used to process the time-series information. Finally, we tested the proposed G-TCN method using datasets from four countries. The experimental results show that G-TCN has lower prediction errors than other comparison methods and can better fit the trend of COVID-19 development.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127128545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2022 International Joint Conference on Neural Networks (IJCNN)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1