首页 > 最新文献

2022 International Joint Conference on Neural Networks (IJCNN)最新文献

英文 中文
MoStress: a Sequence Model for Stress Classification 最大应力:应力分类的序列模型
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892953
Arturo de Souza, M. Melchiades, S. Rigo, G. D. O. Ramos
Mental disorders affect a large number of people worldwide. In response to the increasing number of people affected by such illnesses, there has been an increased interest in the use of state-of-the-art technologies to mitigate its effects. This paper presents a Sequence Model for Stress Classification (MoStress), which is a novel pipeline for pre-processing physio-logical data collected from wearable devices and for identifying stress sequences using a recurrent neural network (RNN). Using the WESAD dataset, the RNN model achieved accuracy of 86% in a three-class classification problem (baseline vs. stress vs. amusement). When only considering the presence of stress or not, we achieved an accuracy of 96.5% as well as precision, recall, and f'1-score of 96%, 93%, and 94%, respectively. Those results are close to other papers using the same dataset, however, the neural network used on MoStress, is considerable simpler.
精神障碍影响着全世界很多人。由于受这类疾病影响的人越来越多,人们对使用最先进的技术来减轻其影响的兴趣越来越大。本文提出了一种应力分类序列模型(MoStress),它是一种新的管道,用于预处理从可穿戴设备收集的生理数据,并使用递归神经网络(RNN)识别应力序列。使用WESAD数据集,RNN模型在三类分类问题(基线、压力、娱乐)中达到了86%的准确率。当只考虑压力或不考虑压力时,我们的准确率为96.5%,准确率、召回率和f'1得分分别为96%、93%和94%。这些结果与使用相同数据集的其他论文接近,然而,在MoStress上使用的神经网络要简单得多。
{"title":"MoStress: a Sequence Model for Stress Classification","authors":"Arturo de Souza, M. Melchiades, S. Rigo, G. D. O. Ramos","doi":"10.1109/IJCNN55064.2022.9892953","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892953","url":null,"abstract":"Mental disorders affect a large number of people worldwide. In response to the increasing number of people affected by such illnesses, there has been an increased interest in the use of state-of-the-art technologies to mitigate its effects. This paper presents a Sequence Model for Stress Classification (MoStress), which is a novel pipeline for pre-processing physio-logical data collected from wearable devices and for identifying stress sequences using a recurrent neural network (RNN). Using the WESAD dataset, the RNN model achieved accuracy of 86% in a three-class classification problem (baseline vs. stress vs. amusement). When only considering the presence of stress or not, we achieved an accuracy of 96.5% as well as precision, recall, and f'1-score of 96%, 93%, and 94%, respectively. Those results are close to other papers using the same dataset, however, the neural network used on MoStress, is considerable simpler.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114570826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Attention-based Single Image Dehazing Using Improved CycleGAN 基于注意力的改进CycleGAN单幅图像去雾
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892628
R. S. Jaisurya, Snehasis Mukherjee
Single image dehazing is a popular research topic among the researchers in computer vision, machine learning, image processing, and graphics. Most of the recent methods for single image dehazing are based upon supervised learning set up. However, supervised methods require annotation of the data, which often makes the dehazing methods biased towards the manual annotation errors. Unsupervised methods are more likely to produce realistic, clear images. However, fewer efforts are found in the literature for single image dehazing in unsupervised set up. We propose an enhanced CycleGAN architecture for Unpaired single image dehazing, with an attention-based transformer architecture embedded in the generator. The proposed transformer comprises three components: 1) A Feature Attention (FA) block combining channel attention and pixel attention mechanism, 2) A Dynamic feature enhancement block for dynamically capturing the spatial structured features and 3) An adaptive mix-up module to preserve the flow of shallow features from downsampling. Experiments on the benchmark datasets show the efficacy of the proposed method. Codes for this work are available in the link: https://github.com/rsjai47/Attention-Based-CycleDehaze.
单幅图像去雾是计算机视觉、机器学习、图像处理和图形学等领域的热门研究课题。目前大多数单幅图像去雾的方法都是基于监督学习建立的。然而,有监督的方法需要对数据进行标注,这往往使除雾方法偏向于人工标注的错误。无监督的方法更有可能产生逼真、清晰的图像。然而,文献中对无监督环境下的单幅图像去雾的研究较少。我们提出了一种用于非配对单幅图像去雾的增强CycleGAN架构,在生成器中嵌入了一个基于注意力的变压器架构。该变压器由三个部分组成:1)结合通道注意和像素注意机制的特征注意(FA)块,2)动态捕获空间结构特征的动态特征增强块,3)自适应混合模块,以保持下采样时浅层特征的流动。在基准数据集上的实验表明了该方法的有效性。这项工作的代码可在链接:https://github.com/rsjai47/Attention-Based-CycleDehaze。
{"title":"Attention-based Single Image Dehazing Using Improved CycleGAN","authors":"R. S. Jaisurya, Snehasis Mukherjee","doi":"10.1109/IJCNN55064.2022.9892628","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892628","url":null,"abstract":"Single image dehazing is a popular research topic among the researchers in computer vision, machine learning, image processing, and graphics. Most of the recent methods for single image dehazing are based upon supervised learning set up. However, supervised methods require annotation of the data, which often makes the dehazing methods biased towards the manual annotation errors. Unsupervised methods are more likely to produce realistic, clear images. However, fewer efforts are found in the literature for single image dehazing in unsupervised set up. We propose an enhanced CycleGAN architecture for Unpaired single image dehazing, with an attention-based transformer architecture embedded in the generator. The proposed transformer comprises three components: 1) A Feature Attention (FA) block combining channel attention and pixel attention mechanism, 2) A Dynamic feature enhancement block for dynamically capturing the spatial structured features and 3) An adaptive mix-up module to preserve the flow of shallow features from downsampling. Experiments on the benchmark datasets show the efficacy of the proposed method. Codes for this work are available in the link: https://github.com/rsjai47/Attention-Based-CycleDehaze.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114605207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Evaluating Adversarial Attacks and Defences in Infrared Deep Learning Monitoring Systems 评估红外深度学习监测系统中的对抗性攻击和防御
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9891997
Flaminia Spasiano, Gabriele Gennaro, Simone Scardapane
This paper studies adversarial attacks and defences against deep learning models trained on infrared data to classify the presence of humans and detect their bounding boxes, which differently from the standard RGB case is an open research problem with multiple consequences related to safety and secure artificial intelligence applications. The paper has two major contributions. Firstly, we study the effectiveness of the Projected Gradient Descent (PGD) adversarial attack against Convolutional Neural Networks (CNNs) trained exclusively on infrared data, and the effectiveness of adversarial training as a possible defense against the attack. Secondly, we study the response of an object detection model trained on infrared images under adversarial attacks. In particular, we propose and empirically evaluate two attacks: one classical attack from the literature on object detection, and a new hybrid attack which exploits a common CNN base architecture of the classifier and the object detector. We show for the first time that adversarial attacks weaken the performance of classification and detection models trained on infrared images only. We also prove that the defense adversarial training optimized for the infinity norm increases the robustness of different classification models trained on infrared data.
本文研究了针对红外数据训练的深度学习模型的对抗性攻击和防御,以对人类的存在进行分类并检测其边界框,这与标准RGB案例不同,是一个开放的研究问题,具有与安全和可靠的人工智能应用相关的多重后果。这篇论文有两个主要贡献。首先,我们研究了投影梯度下降(PGD)对抗性攻击对专门训练红外数据的卷积神经网络(cnn)的有效性,以及对抗性训练作为一种可能防御攻击的有效性。其次,研究了基于红外图像训练的目标检测模型在对抗性攻击下的响应。特别是,我们提出并经验评估了两种攻击:一种是来自对象检测文献的经典攻击,另一种是利用分类器和对象检测器的通用CNN基础架构的新的混合攻击。我们首次表明,对抗性攻击削弱了仅在红外图像上训练的分类和检测模型的性能。我们还证明了针对无穷范数优化的防御对抗训练提高了红外数据训练的不同分类模型的鲁棒性。
{"title":"Evaluating Adversarial Attacks and Defences in Infrared Deep Learning Monitoring Systems","authors":"Flaminia Spasiano, Gabriele Gennaro, Simone Scardapane","doi":"10.1109/IJCNN55064.2022.9891997","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9891997","url":null,"abstract":"This paper studies adversarial attacks and defences against deep learning models trained on infrared data to classify the presence of humans and detect their bounding boxes, which differently from the standard RGB case is an open research problem with multiple consequences related to safety and secure artificial intelligence applications. The paper has two major contributions. Firstly, we study the effectiveness of the Projected Gradient Descent (PGD) adversarial attack against Convolutional Neural Networks (CNNs) trained exclusively on infrared data, and the effectiveness of adversarial training as a possible defense against the attack. Secondly, we study the response of an object detection model trained on infrared images under adversarial attacks. In particular, we propose and empirically evaluate two attacks: one classical attack from the literature on object detection, and a new hybrid attack which exploits a common CNN base architecture of the classifier and the object detector. We show for the first time that adversarial attacks weaken the performance of classification and detection models trained on infrared images only. We also prove that the defense adversarial training optimized for the infinity norm increases the robustness of different classification models trained on infrared data.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"2010 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122016207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards interpretable machine learning for clinical decision support 面向临床决策支持的可解释机器学习
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892114
Bradley Walters, S. Ortega-Martorell, I. Olier, Paulo J. G. Lisboa
A major challenge in delivering reliable and trustworthy computational intelligence for practical applications in clinical medicine is interpretability. This aspect of machine learning is a major distinguishing factor compared with traditional statistical models for the stratification of patients, which typically use rules or a risk score identified by logistic regression. We show how functions of one and two variables can be extracted from pre-trained machine learning models using anchored Analysis of Variance (ANOVA) decompositions. This enables complex interaction terms to be filtered out by aggressive regularisation using the Least Absolute Shrinkage and Selection Operator (LASSO) resulting in a sparse model with comparable or even better performance than the original pre-trained black-box. Besides being theoretically well-founded, the decomposition of a black-box multivariate probabilistic binary classifier into a General Additive Model (GAM) comprising a linear combination of non-linear functions of one or two variables provides full interpretability. In effect this extends logistic regression into non-linear modelling without the need for manual intervention by way of variable transformations, using the pre-trained model as a seed. The application of the proposed methodology to existing machine learning models is demonstrated using the Multi-Layer Perceptron (MLP), Support Vector Machine (SVM), Random Forests (RF) and Gradient Boosting Machines (GBM), to model a data frame from a well-known benchmark dataset available from Physionet, the Medical Information Mart for Intensive Care (MIMIC-III). Both the classification performance and plausibility of clinical interpretation compare favourably with other state-of-the-art sparse models namely Sparse Additive Models (SAM) and the Explainable Boosting Machine (EBM).
为临床医学的实际应用提供可靠和可信的计算智能的一个主要挑战是可解释性。与传统的患者分层统计模型相比,机器学习的这一方面是一个主要的区别因素,传统的患者分层统计模型通常使用规则或由逻辑回归确定的风险评分。我们展示了如何使用锚定方差分析(ANOVA)分解从预训练的机器学习模型中提取一个和两个变量的函数。这使得复杂的交互项可以通过使用最小绝对收缩和选择算子(LASSO)的积极正则化来过滤掉,从而产生具有与原始预训练黑箱相当甚至更好性能的稀疏模型。除了理论基础良好外,将黑盒多元概率二元分类器分解为包含一个或两个变量的非线性函数的线性组合的一般可加模型(GAM)提供了充分的可解释性。实际上,这将逻辑回归扩展到非线性建模,而不需要通过变量转换的方式进行人工干预,使用预训练模型作为种子。将提出的方法应用于现有的机器学习模型,使用多层感知器(MLP)、支持向量机(SVM)、随机森林(RF)和梯度增强机(GBM),对来自Physionet、重症监护医疗信息市场(MIMIC-III)的知名基准数据集的数据框架进行建模。临床解释的分类性能和合理性都优于其他最先进的稀疏模型,即稀疏加性模型(SAM)和可解释增强机(EBM)。
{"title":"Towards interpretable machine learning for clinical decision support","authors":"Bradley Walters, S. Ortega-Martorell, I. Olier, Paulo J. G. Lisboa","doi":"10.1109/IJCNN55064.2022.9892114","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892114","url":null,"abstract":"A major challenge in delivering reliable and trustworthy computational intelligence for practical applications in clinical medicine is interpretability. This aspect of machine learning is a major distinguishing factor compared with traditional statistical models for the stratification of patients, which typically use rules or a risk score identified by logistic regression. We show how functions of one and two variables can be extracted from pre-trained machine learning models using anchored Analysis of Variance (ANOVA) decompositions. This enables complex interaction terms to be filtered out by aggressive regularisation using the Least Absolute Shrinkage and Selection Operator (LASSO) resulting in a sparse model with comparable or even better performance than the original pre-trained black-box. Besides being theoretically well-founded, the decomposition of a black-box multivariate probabilistic binary classifier into a General Additive Model (GAM) comprising a linear combination of non-linear functions of one or two variables provides full interpretability. In effect this extends logistic regression into non-linear modelling without the need for manual intervention by way of variable transformations, using the pre-trained model as a seed. The application of the proposed methodology to existing machine learning models is demonstrated using the Multi-Layer Perceptron (MLP), Support Vector Machine (SVM), Random Forests (RF) and Gradient Boosting Machines (GBM), to model a data frame from a well-known benchmark dataset available from Physionet, the Medical Information Mart for Intensive Care (MIMIC-III). Both the classification performance and plausibility of clinical interpretation compare favourably with other state-of-the-art sparse models namely Sparse Additive Models (SAM) and the Explainable Boosting Machine (EBM).","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116735577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Assembly of Echo State Networks Driven by Segregated Low Dimensional Signals 分离低维信号驱动的回声状态网络装配
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892881
T. Iinuma, S. Nobukawa, S. Yamaguchi
An echo state network (ESN), consisting of an input layer, reservoir, and output layer, provides a higher learning-efficient approach than other recurrent neural networks (RNNs). In the design of ESNs, a sufficiently large number of reservoir neurons is required compared to the dimension of the input signal. Thus, the number of neurons must be increased for high-dimensional input to achieve good performance. However, an increase in the number of neurons increases the computational load. To solve this problem, we propose an assembly ESN (AESN) architecture comprising a feature extraction part that uses multiple sub-ESNs with segregated components of high-dimensional input and a feature integration part. To validate the effectiveness of the proposed AESN, we investigated and compared the conventional ESN with the AESN under high-dimensional input. The results show that the AESN is possibly superior to the conventional ESN in accuracy, memory performance, and computational load. We believe that the AESN also has a correct integration function. Therefore, the proposed method is expected to solve high-dimensional problems with improved accuracy.
回声状态网络(ESN)由输入层、存储层和输出层组成,提供了比其他递归神经网络(rnn)更高的学习效率。在ESNs的设计中,与输入信号的维数相比,需要足够多的存储神经元。因此,为了获得良好的性能,必须增加高维输入的神经元数量。然而,神经元数量的增加增加了计算负荷。为了解决这个问题,我们提出了一种装配ESN (asesn)架构,该架构包括一个特征提取部分,该部分使用多个具有高维输入的分离组件的子ESN和一个特征集成部分。为了验证所提出的回声状态网络的有效性,我们研究并比较了高维输入下的传统回声状态网络和回声状态网络。结果表明,该方法在准确率、记忆性能和计算量等方面都优于传统的回声状态网络。我们认为asesn也具有正确的积分功能。因此,该方法有望以更高的精度解决高维问题。
{"title":"Assembly of Echo State Networks Driven by Segregated Low Dimensional Signals","authors":"T. Iinuma, S. Nobukawa, S. Yamaguchi","doi":"10.1109/IJCNN55064.2022.9892881","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892881","url":null,"abstract":"An echo state network (ESN), consisting of an input layer, reservoir, and output layer, provides a higher learning-efficient approach than other recurrent neural networks (RNNs). In the design of ESNs, a sufficiently large number of reservoir neurons is required compared to the dimension of the input signal. Thus, the number of neurons must be increased for high-dimensional input to achieve good performance. However, an increase in the number of neurons increases the computational load. To solve this problem, we propose an assembly ESN (AESN) architecture comprising a feature extraction part that uses multiple sub-ESNs with segregated components of high-dimensional input and a feature integration part. To validate the effectiveness of the proposed AESN, we investigated and compared the conventional ESN with the AESN under high-dimensional input. The results show that the AESN is possibly superior to the conventional ESN in accuracy, memory performance, and computational load. We believe that the AESN also has a correct integration function. Therefore, the proposed method is expected to solve high-dimensional problems with improved accuracy.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129507351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MaxEnt Dreamer: Maximum Entropy Reinforcement Learning with World Model MaxEnt做梦者:世界模型的最大熵强化学习
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892381
Hongying Ma, Wuyang Xue, R. Ying, Peilin Liu
Model-based reinforcement learning algorithms can alleviate the low sample efficiency problem compared with modelfree methods for control tasks. However, the learned policy's performance often lags behind the best model-free algorithms since its weak exploration ability. Existing model-based reinforcement learning algorithms learn policy by interacting with the learned world model and then use the learned policy to guide a new round of world model learning. Due to weak policy exploration ability, the learned world model has a large bias. As a result, it fails to learn the globally optimal policy on such a world model. This paper improves the learned world model by maximizing both the reward and the corresponding policy entropy in the framework of maximum entropy reinforcement learning. The effectiveness of applying the maximum entropy approach to model-based reinforcement learning is supported by the better performance of our algorithm on several complex mujoco and deepmind control suite tasks.
与无模型方法相比,基于模型的强化学习算法可以缓解控制任务的低样本效率问题。然而,由于学习策略的探索能力较弱,其性能往往落后于最佳的无模型算法。现有的基于模型的强化学习算法通过与学习到的世界模型交互来学习策略,然后利用学习到的策略来指导新一轮的世界模型学习。由于政策探索能力较弱,学习世界模型存在较大偏差。因此,它无法在这样一个世界模型上学习全局最优策略。本文在最大熵强化学习的框架下,通过最大化奖励和相应的策略熵来改进学习世界模型。我们的算法在几个复杂的mujoco和deepmind控制套件任务上的更好性能支持了将最大熵方法应用于基于模型的强化学习的有效性。
{"title":"MaxEnt Dreamer: Maximum Entropy Reinforcement Learning with World Model","authors":"Hongying Ma, Wuyang Xue, R. Ying, Peilin Liu","doi":"10.1109/IJCNN55064.2022.9892381","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892381","url":null,"abstract":"Model-based reinforcement learning algorithms can alleviate the low sample efficiency problem compared with modelfree methods for control tasks. However, the learned policy's performance often lags behind the best model-free algorithms since its weak exploration ability. Existing model-based reinforcement learning algorithms learn policy by interacting with the learned world model and then use the learned policy to guide a new round of world model learning. Due to weak policy exploration ability, the learned world model has a large bias. As a result, it fails to learn the globally optimal policy on such a world model. This paper improves the learned world model by maximizing both the reward and the corresponding policy entropy in the framework of maximum entropy reinforcement learning. The effectiveness of applying the maximum entropy approach to model-based reinforcement learning is supported by the better performance of our algorithm on several complex mujoco and deepmind control suite tasks.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129602656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple Kernel Learning for Modeling Resting State EEG Connectomes using Structural Connectivity of the Brain 基于脑结构连通性的静息状态脑电连接体多核学习建模
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892022
Ammar Ahmed, Archi Yadav, Avinash Sharma, R. Bapi
An active area of research in cognitive science is characterizing the relationship between brain structure and the observed functional activations. Recent graph diffusion models have had great success in mapping whole-brain, resting-state dynamics measured using functional Magnetic Resonance Imaging (fMRI) to the brain structure derived using diffusion and T1 brain imaging. Here we test the application of one such graph diffusion method called the Multiple Kernel Learning (MKL) model. MKL model, formulated as a reaction-diffusion system using Wilson-Cowan equations, combines multiple diffusion kernels at different scales to predict functional connectome (FC) arising from a fixed structural connectome (SC). Our simulation results demonstrate that the MKL model successfully mapped the relationship between SC and FC from five different Electroen-cephalogram (EEG) bands (delta, theta, alpha, beta, and gamma). We used simultaneously acquired EEG-fMRI and NODDI dataset of 17 participants. The correlation between predicted FC and ground truth FC was higher for EEG bands than for fMRI data. The prediction accuracy peaked for the alpha band, and the highest frequency band, gamma had the lowest prediction accuracy. To the best of our knowledge, this is the first such end-to-end application of multiple kernel graph diffusion framework for modeling EEG data. One of the important features of MKL model is its ability to incorporate structural connectivity features into the generative model that predicts the EEG functional connectivity.
认知科学中一个活跃的研究领域是描述大脑结构与观察到的功能激活之间的关系。最近的图扩散模型在绘制全脑图方面取得了巨大的成功,使用功能性磁共振成像(fMRI)测量的静息状态动态到使用扩散和T1脑成像得出的脑结构。在这里,我们测试了一种称为多核学习(MKL)模型的图扩散方法的应用。MKL模型采用Wilson-Cowan方程,将不同尺度的多个扩散核结合在一起,预测固定结构连接组(SC)产生的功能连接组(FC)。我们的模拟结果表明,MKL模型成功地从五个不同的脑电图(delta, theta, alpha, beta和gamma)波段映射出SC和FC之间的关系。我们使用同时获得的17名参与者的EEG-fMRI和NODDI数据集。脑电波段预测FC与真实FC之间的相关性高于功能磁共振数据。α波段预测精度最高,γ波段预测精度最低。据我们所知,这是首个将多核图扩散框架用于EEG数据建模的端到端应用。MKL模型的一个重要特征是能够将结构连通性特征融入到生成模型中,从而预测脑电功能的连通性。
{"title":"Multiple Kernel Learning for Modeling Resting State EEG Connectomes using Structural Connectivity of the Brain","authors":"Ammar Ahmed, Archi Yadav, Avinash Sharma, R. Bapi","doi":"10.1109/IJCNN55064.2022.9892022","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892022","url":null,"abstract":"An active area of research in cognitive science is characterizing the relationship between brain structure and the observed functional activations. Recent graph diffusion models have had great success in mapping whole-brain, resting-state dynamics measured using functional Magnetic Resonance Imaging (fMRI) to the brain structure derived using diffusion and T1 brain imaging. Here we test the application of one such graph diffusion method called the Multiple Kernel Learning (MKL) model. MKL model, formulated as a reaction-diffusion system using Wilson-Cowan equations, combines multiple diffusion kernels at different scales to predict functional connectome (FC) arising from a fixed structural connectome (SC). Our simulation results demonstrate that the MKL model successfully mapped the relationship between SC and FC from five different Electroen-cephalogram (EEG) bands (delta, theta, alpha, beta, and gamma). We used simultaneously acquired EEG-fMRI and NODDI dataset of 17 participants. The correlation between predicted FC and ground truth FC was higher for EEG bands than for fMRI data. The prediction accuracy peaked for the alpha band, and the highest frequency band, gamma had the lowest prediction accuracy. To the best of our knowledge, this is the first such end-to-end application of multiple kernel graph diffusion framework for modeling EEG data. One of the important features of MKL model is its ability to incorporate structural connectivity features into the generative model that predicts the EEG functional connectivity.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129806487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of bearing failures using wavelet transformation and machine learning approach 基于小波变换和机器学习方法的轴承故障检测
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892755
Maciej Golgowski, S. Osowski
The paper analyzes and compares two forms of wavelet transformation: discrete (DWT) and continuous (CWT) combined with machine learning in the analysis of the bearing failure. It presents the automatic system to detect the anomaly in the rolling bearing based on wavelet analysis of vibration waveforms combined with the set of classical and deep classifiers. The wavelet transformation is used in the stage of pre-processing of the signal for generating the input attributes in the final classification system. The considered structures of the classifiers include 6 classical machine learning tools integrated into an ensemble and a combination of a few deep Convolutional Neural Networks (CNN) to develop the most accurate diagnostics of the bearing. The calculations have been done in Python and Matlab. The results of both approaches DWT and CWT are discussed and compared. They show the high effectiveness of the approach based on the cooperation of wavelet transform and machine learning methods.
本文分析比较了结合机器学习的两种小波变换形式:离散小波变换和连续小波变换在轴承故障分析中的应用。提出了一种基于振动波形小波分析与经典分类器和深度分类器相结合的滚动轴承异常自动检测系统。在信号预处理阶段使用小波变换生成最终分类系统的输入属性。分类器的考虑结构包括集成到集成中的6个经典机器学习工具和几个深度卷积神经网络(CNN)的组合,以开发最准确的轴承诊断。在Python和Matlab中进行了计算。对两种方法的结果进行了讨论和比较。结果表明,基于小波变换和机器学习方法相结合的方法具有很高的有效性。
{"title":"Detection of bearing failures using wavelet transformation and machine learning approach","authors":"Maciej Golgowski, S. Osowski","doi":"10.1109/IJCNN55064.2022.9892755","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892755","url":null,"abstract":"The paper analyzes and compares two forms of wavelet transformation: discrete (DWT) and continuous (CWT) combined with machine learning in the analysis of the bearing failure. It presents the automatic system to detect the anomaly in the rolling bearing based on wavelet analysis of vibration waveforms combined with the set of classical and deep classifiers. The wavelet transformation is used in the stage of pre-processing of the signal for generating the input attributes in the final classification system. The considered structures of the classifiers include 6 classical machine learning tools integrated into an ensemble and a combination of a few deep Convolutional Neural Networks (CNN) to develop the most accurate diagnostics of the bearing. The calculations have been done in Python and Matlab. The results of both approaches DWT and CWT are discussed and compared. They show the high effectiveness of the approach based on the cooperation of wavelet transform and machine learning methods.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129815476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimising hyperparameter search in a visual thalamocortical pathway model 在视觉丘脑皮质通路模型中优化超参数搜索
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892380
Swapna Sasi, Taher Yunus Lilywala, B. Bhattacharya
We have made a comparative study of three optimisation algorithms viz. Random Search (RS), Grid Search (GS) and Bayesian Optimization (BO) to find optimal hyperparameter combinations in an existing brain-inspired thalamocortical model that can simulate brain signals such as local field potentials (lfp) and electroencephalogram (eeg). The layout and parameters for the model are sourced from anatomical and physiological data. However, there is a lot of missing data in such sources due to obvious constraints in wet-lab experimental studies. In our previous work, the missing data are set by trial and error. As the scale of the model gets larger though, the combinatorics of the hyperparameters explode and manual parameter tuning gets non-trivial. The goal of this study is to identify the optimisation algorithm (among the three abovementioned) that gives the best performance at minimal computational costs; performance is evaluated by setting an objective, which is to search for hyperparameter combinations that can simulate theta (4 – 8 Hz), alpha (8 – 13 Hz) and beta (13 – 30 Hz) rhythms, which are typically observed in eeg and lfp. Each optimisation algorithm is tested on a small model (thalamus only) with eight hyperparameters and a large model (thalamocortical) with maximum of fifteen hyperparameters. The performance metric for each algorithm is measured by the number of times the objective is achieved during a fixed number of trials. Our results demonstrate that BO performs the best in reaching the objective with a 30.5% better performance compared to GS and 13% better than RS. In comparison, GS performance is lower with an exponential increase in time with increasing grid size. Overall, our study demonstrates the suitability of using the BO for optimising hyperparameter search in our thalamocortical network model of the visual pathway.
我们对随机搜索(RS)、网格搜索(GS)和贝叶斯优化(BO)三种优化算法进行了比较研究,以在现有的大脑启发的丘脑皮质模型中找到最佳的超参数组合,该模型可以模拟大脑信号,如局部场电位(lfp)和脑电图(eeg)。模型的布局和参数来源于解剖学和生理学数据。然而,由于湿室实验研究的明显限制,这些来源中存在大量缺失数据。在我们以前的工作中,缺失的数据是通过试错来确定的。然而,随着模型的规模越来越大,超参数的组合会爆炸,手动参数调优变得不平凡。本研究的目标是确定以最小计算成本获得最佳性能的优化算法(在上述三种算法中);通过设定目标来评估性能,该目标是搜索可以模拟theta (4 - 8 Hz), alpha (8 - 13 Hz)和beta (13 - 30 Hz)节奏的超参数组合,这些节奏通常在eeg和lfp中观察到。每个优化算法在一个具有8个超参数的小模型(仅丘脑)和一个具有最多15个超参数的大模型(丘脑皮质)上进行测试。每个算法的性能指标是通过在固定次数的试验中实现目标的次数来衡量的。我们的研究结果表明,BO在达到目标方面表现最好,比GS的性能好30.5%,比RS的性能好13%。相比之下,随着网格大小的增加,GS的性能随时间呈指数增长而降低。总的来说,我们的研究证明了在视觉通路的丘脑皮质网络模型中使用BO来优化超参数搜索的适用性。
{"title":"Optimising hyperparameter search in a visual thalamocortical pathway model","authors":"Swapna Sasi, Taher Yunus Lilywala, B. Bhattacharya","doi":"10.1109/IJCNN55064.2022.9892380","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892380","url":null,"abstract":"We have made a comparative study of three optimisation algorithms viz. Random Search (RS), Grid Search (GS) and Bayesian Optimization (BO) to find optimal hyperparameter combinations in an existing brain-inspired thalamocortical model that can simulate brain signals such as local field potentials (lfp) and electroencephalogram (eeg). The layout and parameters for the model are sourced from anatomical and physiological data. However, there is a lot of missing data in such sources due to obvious constraints in wet-lab experimental studies. In our previous work, the missing data are set by trial and error. As the scale of the model gets larger though, the combinatorics of the hyperparameters explode and manual parameter tuning gets non-trivial. The goal of this study is to identify the optimisation algorithm (among the three abovementioned) that gives the best performance at minimal computational costs; performance is evaluated by setting an objective, which is to search for hyperparameter combinations that can simulate theta (4 – 8 Hz), alpha (8 – 13 Hz) and beta (13 – 30 Hz) rhythms, which are typically observed in eeg and lfp. Each optimisation algorithm is tested on a small model (thalamus only) with eight hyperparameters and a large model (thalamocortical) with maximum of fifteen hyperparameters. The performance metric for each algorithm is measured by the number of times the objective is achieved during a fixed number of trials. Our results demonstrate that BO performs the best in reaching the objective with a 30.5% better performance compared to GS and 13% better than RS. In comparison, GS performance is lower with an exponential increase in time with increasing grid size. Overall, our study demonstrates the suitability of using the BO for optimising hyperparameter search in our thalamocortical network model of the visual pathway.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129859508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Early Detection of Parkinson's Disease using Spiral Test and Echo State Networks 利用螺旋试验和回声状态网络早期检测帕金森病
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9891917
Lerina Aversano, M. Bernardi, Marta Cimitile, Martina Iammarino, Chiara Verdone
Parkinson's disease is one of the most prevalent neurodegenerative diseases in the world, usually occurring after the age of 50, but in some cases, also affects younger people. It is a disease that affects movement, coordination, and muscle control, all of which cause a range of symptoms that affect patients' writing and drawing skills. Diagnosis is clinical, so it occurs mainly through the evaluation of the patient's movements, coordination, and muscle control. Therefore, the analysis of micrographic models can introduce a new methodology of investigation in the diagnosis and monitoring of Parkinson's disease. This study proposes an approach based on artificial intelligence in combination with the spiral test, which consists in asking the patient to draw a spiral, thanks to which it is possible to make the early diagnosis of Parkinson's disease. The classification is performed with a combination of an Echo State Network and an MLP layer. To validate the approach, several classification algorithms belonging to two macro groups (boosting decision trees based) were used as baseline. The results obtained are very satisfactory with the ESN-based classifier exhibiting an F-Score of 97.8%. The very encouraging results indicate that the proposed approach may be an effective contribution to improving Parkinson's diagnostics.
帕金森氏症是世界上最常见的神经退行性疾病之一,通常发生在50岁以后,但在某些情况下,也会影响年轻人。这是一种影响运动、协调和肌肉控制的疾病,所有这些都会导致一系列症状,影响患者的写作和绘画技能。诊断是临床的,因此主要是通过对患者的运动、协调和肌肉控制的评估来进行的。因此,显微模型的分析可以为帕金森病的诊断和监测提供一种新的研究方法。该研究提出了一种基于人工智能的方法,该方法与螺旋测试相结合,即让患者画螺旋,从而可以早期诊断帕金森病。分类是通过回声状态网络和MLP层的组合来完成的。为了验证该方法,使用了属于两个宏观组(基于增强决策树)的几种分类算法作为基线。结果非常令人满意,基于esn的分类器的F-Score为97.8%。令人鼓舞的结果表明,所提出的方法可能是改善帕金森病诊断的有效贡献。
{"title":"Early Detection of Parkinson's Disease using Spiral Test and Echo State Networks","authors":"Lerina Aversano, M. Bernardi, Marta Cimitile, Martina Iammarino, Chiara Verdone","doi":"10.1109/IJCNN55064.2022.9891917","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9891917","url":null,"abstract":"Parkinson's disease is one of the most prevalent neurodegenerative diseases in the world, usually occurring after the age of 50, but in some cases, also affects younger people. It is a disease that affects movement, coordination, and muscle control, all of which cause a range of symptoms that affect patients' writing and drawing skills. Diagnosis is clinical, so it occurs mainly through the evaluation of the patient's movements, coordination, and muscle control. Therefore, the analysis of micrographic models can introduce a new methodology of investigation in the diagnosis and monitoring of Parkinson's disease. This study proposes an approach based on artificial intelligence in combination with the spiral test, which consists in asking the patient to draw a spiral, thanks to which it is possible to make the early diagnosis of Parkinson's disease. The classification is performed with a combination of an Echo State Network and an MLP layer. To validate the approach, several classification algorithms belonging to two macro groups (boosting decision trees based) were used as baseline. The results obtained are very satisfactory with the ESN-based classifier exhibiting an F-Score of 97.8%. The very encouraging results indicate that the proposed approach may be an effective contribution to improving Parkinson's diagnostics.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128482437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2022 International Joint Conference on Neural Networks (IJCNN)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1