
Advances in computational intelligence: latest articles

Towards edge devices implementation: deep learning model with visualization for COVID-19 prediction from chest X-ray
Pub Date : 2022-09-28 DOI: 10.1007/s43674-022-00044-w
Shaline Jia Thean Koh, Marwan Nafea, Hermawan Nugroho

The global outbreak of COVID-19 has left countries around the world facing shortages of resources (e.g., testing kits and medicine). Because the disease is highly contagious and spreads easily, quickly diagnosing COVID-19 and isolating patients are crucial to curbing the pandemic, especially in rural areas. To assist doctors, several studies have proposed initial detection of COVID-19 cases from radiological images. In this paper, we propose an alternative method for analyzing chest X-ray images that provides an efficient and accurate diagnosis of COVID-19 and can run on edge devices. The approach acts as an enabler for deploying the deep learning model in practical applications. Convolutional neural network models, fine-tuned to predict COVID-19 and pneumonia infection from chest X-ray images, are developed using transfer learning. The developed model achieved an accuracy of 98.13%, a sensitivity of 97.7%, and a specificity of 99.1%. To highlight the regions of the X-ray images that direct the model to its prediction, we adopted Gradient-weighted Class Activation Mapping (Grad-CAM). The heat maps generated by Grad-CAM were then compared with the annotated X-ray images by board-certified radiologists, and the results showed that the findings correlate strongly with clinical evidence. For practical deployment, we implemented the trained model on edge devices (NCS2), which improved inference speed by 90% compared to a CPU. This shows that the developed model has the potential to be deployed on the edge, for example in primary care clinics and rural areas that are not well equipped or lack access to stable internet connections.
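
The three figures quoted in the abstract are all ratios of confusion-matrix counts. The sketch below shows that arithmetic for a two-class case; the function name `binary_metrics` and the counts are hypothetical and are not taken from the paper, which reports a task that also includes pneumonia.

```python
def binary_metrics(tp, fp, tn, fn):
    """Return accuracy, sensitivity, and specificity from confusion-matrix counts."""
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / (tp + fn),  # true-positive rate
        "specificity": tn / (tn + fp),  # true-negative rate
    }


# Hypothetical counts, chosen only to illustrate the arithmetic.
metrics = binary_metrics(tp=90, fp=5, tn=95, fn=10)
print(metrics)
```

Sensitivity and specificity are reported alongside accuracy because accuracy alone can look high on an imbalanced test set even when most positive cases are missed.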

Volume 2, issue 5. PDF: https://link.springer.com/content/pdf/10.1007/s43674-022-00044-w.pdf
Citations: 1
Canola and soybean oil price forecasts via neural networks
Pub Date : 2022-09-15 DOI: 10.1007/s43674-022-00045-9
Xiaojie Xu, Yun Zhang

Forecasts of commodity prices are vital to market participants and policy-makers. Those for cooking oil are no exception, considering its importance as one of the main food resources. In the present study, we assess the forecast problem using weekly wholesale price indices of canola and soybean oil in China from January 1, 2010 to January 3, 2020, employing the non-linear auto-regressive neural network as the forecast tool. We evaluate the forecast performance of different model settings over algorithms, delays, hidden neurons, and data splitting ratios in arriving at the final models for the two commodities, which are relatively simple and lead to accurate and stable results. In particular, the model for the price index of canola oil generates relative root mean square errors of 2.66%, 1.46%, and 2.17% for training, validation, and testing, respectively, and the model for the price index of soybean oil generates relative root mean square errors of 2.33%, 1.96%, and 1.98% for training, validation, and testing, respectively. Through the analysis, we show the usefulness of the neural network technique for commodity price forecasts. Our results might serve as technical forecasts on a standalone basis or be combined with other fundamental forecasts to form perspectives on price trends and to support corresponding policy analysis.
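
An auto-regressive model predicts each value from a fixed number of previous values (the "delays"). The sketch below shows how a series might be turned into lagged inputs and how a relative RMSE could be computed. The helper names, the price values, and the definition of relative RMSE as RMSE divided by the mean of the observed values are all assumptions; the abstract does not spell them out, and a naive last-value predictor stands in for the trained network.

```python
import math


def make_lagged(series, delays):
    """Turn a series into (inputs, targets): each input holds the previous `delays` values."""
    inputs = [series[t - delays:t] for t in range(delays, len(series))]
    targets = series[delays:]
    return inputs, targets


def relative_rmse(actual, predicted):
    """RMSE divided by the mean of the actual values, in percent (assumed definition)."""
    mse = sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    mean_actual = sum(actual) / len(actual)
    return 100.0 * math.sqrt(mse) / mean_actual


# Hypothetical weekly index values.
prices = [100.0, 102.0, 101.0, 103.0, 104.0, 106.0]
X, y = make_lagged(prices, delays=2)

# Naive stand-in for a trained network: predict the most recent lag.
naive = [row[-1] for row in X]
print(X[0], y[0], relative_rmse(y, naive))
```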

Volume 2, issue 5. PDF: https://link.springer.com/content/pdf/10.1007/s43674-022-00045-9.pdf
Citations: 18
Attributed community search based on seed replacement and joint random walk
Pub Date : 2022-09-01 DOI: 10.1007/s43674-022-00041-z
Ju Li, Huifang Ma

Community search enables personalized community discovery and has wide applications in real-life scenarios. Existing attributed community search algorithms use the personalized information provided by attributes to locate the desired community. Although they achieve promising results, existing works suffer from two major limitations: (i) the precision of the algorithm decreases significantly when the seed comes from the boundary region of the community; (ii) most attributed community search methods take the attribute information as edge weights to reveal semantic strength (e.g., attribute similarity or attribute distance), but largely ignore that attributes may serve as heterogeneous vertices. To make up for these deficiencies, we propose a novel two-stage attributed community search method with seed replacement and joint random walk (SRRW). Specifically, in the seed replacement stage, we replace the initial query node with a core node; in the random walk stage, attributes are taken as heterogeneous nodes and the augmented graph is modeled based on the affiliation of the attributes via an overlapping clustering algorithm. Finally, a joint random walk is performed on the augmented graph to explore the desired local community. We conduct extensive experiments on both synthetic and real-world benchmarks, demonstrating the method's effectiveness for attributed community search.
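
The idea of treating attributes as extra vertices can be illustrated with a generic random walk with restart. This is not the SRRW algorithm itself: the seed replacement stage and the overlapping clustering step are omitted, and the function names, the toy graph, and the restart probability are assumptions made for illustration.

```python
from collections import defaultdict


def augment(edges, attributes):
    """Build an undirected adjacency map with one extra node per attribute value."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    for node, attrs in attributes.items():
        for a in attrs:
            attr_node = ("attr", a)  # attribute modelled as its own vertex
            adj[node].add(attr_node)
            adj[attr_node].add(node)
    return adj


def random_walk_with_restart(adj, seed, restart=0.15, iterations=50):
    """Iterate the visit probabilities of a walk that jumps back to the seed."""
    scores = {n: 0.0 for n in adj}
    scores[seed] = 1.0
    for _ in range(iterations):
        nxt = {n: 0.0 for n in adj}
        for n, p in scores.items():
            share = (1.0 - restart) * p / len(adj[n])
            for m in adj[n]:
                nxt[m] += share
        nxt[seed] += restart
        scores = nxt
    return scores


# Hypothetical graph: two triangles joined by the edge (3, 4), one attribute per triangle.
edges = [(1, 2), (2, 3), (1, 3), (3, 4), (4, 5), (5, 6), (4, 6)]
attributes = {1: ["ml"], 2: ["ml"], 3: ["ml"], 4: ["db"], 5: ["db"], 6: ["db"]}
scores = random_walk_with_restart(augment(edges, attributes), seed=1)
print(sorted(scores, key=scores.get, reverse=True))
```

Nodes that share both edges and attribute vertices with the seed accumulate more probability mass, which is what makes the ranking usable as a local community indicator.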

Volume 2, issue 5.
Citations: 0
Detection of cyber attacks on smart grids
Pub Date : 2022-08-31 DOI: 10.1007/s43674-022-00042-y
Aditi Kar Gangopadhyay, Tanay Sheth, Tanmoy Kanti Das, Sneha Chauhan

The paper analyzes observations using a logic-based numerical methodology in Python. Logical Analysis of Data (LAD) specializes in selecting a minimal number of features and finding unique patterns within them to distinguish ‘positive’ from ‘negative’ observations. The Python implementation of the classification model is further improved by introducing adaptations to pattern generation techniques. Finally, a case study of the Power Attack Systems Dataset, used to improve Smart Grid technology, is performed to explore real-life applications of the classification model and to analyze its performance against commonly used techniques.
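
In LAD, a pattern is a conjunction of conditions on binarized features that holds for some observations of one class and for none of the other. The sketch below checks that property for a single candidate pattern; the representation (a dict from feature index to required value), the function names, and the observations are assumptions, not the authors' implementation.

```python
def covers(pattern, observation):
    """A pattern is a conjunction of literals: feature index -> required binary value."""
    return all(observation[i] == v for i, v in pattern.items())


def is_positive_pattern(pattern, positives, negatives):
    """Positive pattern: covers at least one positive and no negative observation."""
    return any(covers(pattern, o) for o in positives) and not any(
        covers(pattern, o) for o in negatives
    )


# Hypothetical binarized observations (three binary features each).
positives = [(1, 0, 1), (1, 1, 1)]
negatives = [(0, 0, 1), (1, 0, 0)]
pattern = {0: 1, 2: 1}  # feature 0 is 1 AND feature 2 is 1
print(is_positive_pattern(pattern, positives, negatives))
```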

Volume 2, issue 5.
Citations: 0
New models of classifier learning curves
Pub Date : 2022-07-16 DOI: 10.1007/s43674-022-00040-0
Vincent Berthiaume

In machine learning, a classifier has a certain learning curve, i.e., the curve of the error/success probability as a function of the training set size. Finding the learning curve over a large interval of sizes takes a lot of processing time. A better method is to estimate the error probabilities for only a few small sizes and use the (size, estimate) pairs as data points to model the learning curve. Researchers have tested different models. These models have certain parameters and are conceived from curves that have only the general shape of a real learning curve. In this paper, we propose two new models that have more parameters and are conceived from real learning curves of nearest neighbour classifiers. These two main differences increase the chance that the new models fit the learning curve better. We test these new models on one-input, two-class nearest neighbour classifiers.
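
Fitting a parametric curve to a few (size, estimate) pairs and extrapolating can be sketched with a two-parameter power law, err(n) = b * n^(-c), fitted by least squares in log-log space. The power law is a commonly used simple form and is an assumption here; it is not one of the two new models the paper proposes, and the data points are synthetic.

```python
import math


def fit_power_law(sizes, errors):
    """Least-squares fit of log(err) = log(b) - c * log(n); returns (b, c)."""
    xs = [math.log(n) for n in sizes]
    ys = [math.log(e) for e in errors]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / sum(
        (x - x_mean) ** 2 for x in xs
    )
    intercept = y_mean - slope * x_mean
    return math.exp(intercept), -slope


def predict_error(b, c, n):
    """Extrapolate the fitted curve to training-set size n."""
    return b * n ** (-c)


# Synthetic error estimates at a few small training-set sizes (exact power law).
sizes = [10, 20, 40, 80]
errors = [0.5 * n ** -0.5 for n in sizes]
b, c = fit_power_law(sizes, errors)
print(b, c, predict_error(b, c, 1000))
```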

Volume 2, issue 4. PDF: https://link.springer.com/content/pdf/10.1007/s43674-022-00040-0.pdf
Citations: 0
Resolvent and new activation functions for linear programming kernel sparse learning
Pub Date : 2022-06-29 DOI: 10.1007/s43674-022-00038-8
Zhao Lu, Haoda Fu, William R. Prucka

The resolvent operator and the corresponding Green’s function occupy a central position in differential and integral equations, operator theory, and in particular modern physics. However, in machine learning, when confronted with complex and highly challenging real-world learning tasks, the power of the Green’s function of the resolvent is rarely explored or exploited. This paper aims to innovate on the conventional translation-invariant and rotation-invariant kernels through a theoretical investigation into a new way of constructing kernel functions by means of the resolvent operator and its Green’s function. From a practical perspective, the newly developed kernel functions are applied to robust signal recovery from noise-corrupted data in the setting of linear programming support vector learning. In particular, monotonic and non-monotonic activation functions are used in kernel design to improve representation capability. In this manner, a new dimension is given to kernel-based robust sparse learning in two respects: first, a new theoretical framework that bridges the gap between the mathematical subtleties of the resolvent operator and Green’s function theory on the one hand and kernel construction on the other; second, a concrete realization of the fusion between activation function design in neural networks and nonlinear kernel design. Finally, the experimental study demonstrates the potential and superiority of the newly developed kernel functions in robust signal recovery and multiscale sparse modeling, as one step towards removing the apparent boundaries between modern signal processing and computational intelligence.
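
A textbook case shows how a resolvent's Green's function can act as a kernel. The operator and the parameter $\lambda$ below are chosen for illustration and are not taken from the paper: for $L = -\mathrm{d}^2/\mathrm{d}x^2$ on the real line, the resolvent at $z = -\lambda^2$ is $(L + \lambda^2)^{-1}$, and its Green's function is the familiar exponential (Laplacian) kernel.

```latex
\[
  \left(-\frac{\mathrm{d}^2}{\mathrm{d}x^2} + \lambda^2\right) G(x, y) = \delta(x - y),
  \qquad
  G(x, y) = \frac{1}{2\lambda}\, e^{-\lambda\,|x - y|}, \quad \lambda > 0 .
\]
% Check: for x \neq y, G'' = \lambda^2 G, so the left-hand side vanishes;
% at x = y the derivative of G jumps from +1/2 to -1/2, which yields \delta(x - y).
\[
  \widehat{G}(\omega) = \frac{1}{\omega^2 + \lambda^2} > 0 .
\]
```

Because $G$ depends only on $x - y$ and its Fourier transform is positive, it is a translation-invariant positive-definite kernel by Bochner's theorem.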

Volume 2, issue 4.
Citations: 0
Multi-agent-based dynamic railway scheduling and optimization: a coloured petri-net model
Pub Date : 2022-06-16 DOI: 10.1007/s43674-022-00039-7
Poulami Dalapati, Kaushik Paul

This paper addresses the rescheduling of a static timetable in case of a disaster in a large and complex railway network. The proposed approach modifies the existing schedule to minimise the overall delay of trains. The rescheduling problem is represented as a Petri net, and the highly uncertain disaster recovery time in this model is handled as a Markov decision process (MDP). To solve the rescheduling problem, a distributed constraint optimisation (DCOP)-based strategy involving autonomous agents is used to generate the desired schedule. The proposed approach is evaluated on a real-time data set taken from the Eastern Railways, India, by constructing various disaster scenarios using the Java Agent DEvelopment Framework (JADE). Compared to existing approaches, the proposed framework substantially reduces the delay of trains after rescheduling.
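
A Petri net models resources as tokens in places, and a transition can fire only when all of its input places hold enough tokens. The sketch below implements that firing rule for a plain (uncoloured) net. The place names and the single `depart` transition are hypothetical; the paper's coloured net, its MDP treatment of recovery time, and the DCOP agents are not reproduced.

```python
def enabled(marking, inputs):
    """A transition is enabled when every input place holds enough tokens."""
    return all(marking.get(place, 0) >= need for place, need in inputs.items())


def fire(marking, inputs, outputs):
    """Return the new marking after firing; raise if the transition is not enabled."""
    if not enabled(marking, inputs):
        raise ValueError("transition not enabled")
    new = dict(marking)
    for place, need in inputs.items():
        new[place] = new.get(place, 0) - need  # consume input tokens
    for place, gain in outputs.items():
        new[place] = new.get(place, 0) + gain  # produce output tokens
    return new


# Hypothetical fragment: a train may leave station A only if the track is free.
marking = {"train_at_A": 1, "track_free": 1, "train_on_track": 0}
depart = ({"train_at_A": 1, "track_free": 1}, {"train_on_track": 1})
after = fire(marking, *depart)
print(after)
```

Once the track token has been consumed, a second departure is blocked until some other transition returns it, which is how the net encodes the track as a shared resource.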

Volume 2, issue 4. PDF: https://link.springer.com/content/pdf/10.1007/s43674-022-00039-7.pdf
Citations: 1
An unsupervised autonomous learning framework for goal-directed behaviours in dynamic contexts
Pub Date : 2022-06-02 DOI: 10.1007/s43674-022-00037-9
Chinedu Pascal Ezenkwu, Andrew Starkey

Because they depend on a task-specific reward function, reinforcement learning agents are ineffective at responding to a dynamic goal or environment. This paper seeks to overcome this limitation of traditional reinforcement learning through a task-agnostic, self-organising autonomous agent framework. The proposed algorithm is a hybrid of TMGWR, for self-adaptive learning of sensorimotor maps, and value iteration, for goal-directed planning. TMGWR has previously been demonstrated to overcome the problems associated with competing sensorimotor techniques such as SOM, GNG, and GWR; these problems include difficulty in setting a suitable number of neurons for a task, inflexibility, the inability to cope with non-Markovian environments, challenges with noise, and inappropriate joint representation of sensory observations and actions. However, the binary sensorimotor-link implementation in the original TMGWR leads to catastrophic forgetting when the agent experiences changes in the task, and it is therefore not suitable for self-adaptive learning. A new sensorimotor-link update rule is presented in this paper to enable the sensorimotor map to adapt to new experiences. This paper demonstrates that the TMGWR-based algorithm has better sample efficiency than model-free reinforcement learning and better self-adaptivity than both model-free and traditional model-based reinforcement learning algorithms. Moreover, the algorithm is shown to give the lowest overall computational cost when compared to traditional reinforcement learning algorithms.
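
The planning half of the hybrid, value iteration over a learned map, can be sketched on a small deterministic transition table. The table below is written by hand and stands in for a sensorimotor map that TMGWR would learn; the state names, the discount factor, and the reward of 1 for entering the goal are assumptions.

```python
def value_iteration(transitions, goal, gamma=0.9, sweeps=100):
    """transitions[state][action] -> next state (deterministic, assumed known).

    Reward is 1 for entering the goal and 0 otherwise, so a new goal only
    requires re-running this function on the same map.
    """
    values = {s: 0.0 for s in transitions}
    for _ in range(sweeps):
        for s, actions in transitions.items():
            if s == goal or not actions:
                continue  # the goal is terminal; dead ends keep value 0
            values[s] = max(
                (1.0 if nxt == goal else 0.0) + gamma * values[nxt]
                for nxt in actions.values()
            )
    return values


def greedy_action(transitions, values, state, goal, gamma=0.9):
    """Pick the action with the best one-step lookahead value."""
    return max(
        transitions[state],
        key=lambda a: (1.0 if transitions[state][a] == goal else 0.0)
        + gamma * values[transitions[state][a]],
    )


# Hypothetical four-state corridor: s0 - s1 - s2 - s3.
transitions = {
    "s0": {"right": "s1"},
    "s1": {"left": "s0", "right": "s2"},
    "s2": {"left": "s1", "right": "s3"},
    "s3": {"left": "s2"},
}
values = value_iteration(transitions, goal="s3")
print(greedy_action(transitions, values, "s1", goal="s3"))
```

Calling `value_iteration(transitions, goal="s0")` on the same table reverses the preferred direction without any relearning of the map, which is the kind of goal change the abstract has in mind.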

Volume 2, issue 3. PDF: https://link.springer.com/content/pdf/10.1007/s43674-022-00037-9.pdf
Citations: 0
Machine learning cutting forces in milling processes of functionally graded materials
Pub Date : 2022-05-27 DOI: 10.1007/s43674-022-00036-w
Xiaojie Xu, Yun Zhang, Yunlu Li, Yunyao Li

Machine learning approaches can serve as powerful tools in the machining optimization process. Criteria such as accuracy and stability are important to consider when choosing among different models. For industrial applications, it is also essential to balance cost, applicability, and ease of implementation. Here, we develop Gaussian process regression models for predicting the main cutting force ($R$) and its components in the three directions of the coordinate system ($F_{x}$, $F_{y}$, and $F_{z}$) based on two predictors: the depth of cut ($a_{p}$) and the feed rate ($f$) in milling processes of functionally graded materials. The models show high accuracy and stability, and are thus promising for estimating the cutting force and its components in a fast, cost-effective, and robust fashion.

Citations: 4
Comparative analysis of super-resolution reconstructed images for micro-expression recognition
Pub Date : 2022-05-14 DOI: 10.1007/s43674-022-00035-x
Pratikshya Sharma, Sonya Coleman, Pratheepan Yogarajah, Laurence Taggart, Pradeepa Samarasinghe

It is an established fact that the genuineness of facial micro-expressions makes them an effective means for estimating concealed emotions (Li et al. in Micro-expression recognition under low-resolution cases. SciTePress, Science and Technology Publications, Setúbal, 2019). Conventionally, these expressions have been analysed using high-resolution images, which represent ideal cases. In real-world scenarios, however, capturing expressions at high resolution may not always be possible, particularly with low-cost surveillance cameras. Faces captured using such cameras are often very small and of poor resolution. Because discriminative features are lost, these images may be of little use, particularly for identifying minute facial details. To make these images useful, enhancing the textural information becomes essential, and super-resolution algorithms are well suited to this. In this work, we utilize algorithms based on deep learning and generative adversarial networks to transform low-resolution micro-expression images into super-resolution images and examine their fitness for micro-expression recognition. The proposed approach is tested on a simulated dataset obtained from two popular spontaneous micro-expression datasets, namely CASME II and SMIC-VIS; the experimental results demonstrate that the method achieved favourable results, with the best recognition performance recorded as 61.63%. The significance of this work is twofold: first, it thoroughly investigates the reconstruction performance of several deep learning super-resolution algorithms on simulated low-quality micro-expression images; second, it provides a comprehensive analysis of the results obtained with these reconstructed images to determine their contribution to addressing image quality issues specifically for micro-expression recognition.

Citations: 1