2021 IEEE Symposium Series on Computational Intelligence (SSCI)最新文献

英文中文

Automated Behavior-based Malice Scoring of Ransomware Using Genetic Programming 基于自动行为的基于遗传编程的勒索软件恶意评分

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

Pub Date : 2021-12-05 DOI: 10.1109/SSCI50451.2021.9660009

Muhammad Shabbir Abbasi, Harith Al-Sahaf, I. Welch

Malice or severity scoring models are a technique for detection of maliciousness. A few ransomware detection studies utilise malice scoring models for detection of ransomware-like behavior. These models rely on the weighted sum of some manually chosen features and their weights by a domain expert. To automate the modelling of malice scoring for ransomware detection, we propose a method based on Genetic Programming (GP) that automatically evolves a behavior-based malice scoring model by selecting appropriate features and functions from the input feature and operator sets. The experimental results show that the best-evolved model correctly assigned a malice score, below the threshold value to over 85% of the unseen goodware instances, and over the threshold value to more than 99% of the unseen ransomware instances.

恶意或严重性评分模型是一种检测恶意的技术。一些勒索软件检测研究利用恶意评分模型来检测类似勒索软件的行为。这些模型依赖于领域专家手动选择的一些特征及其权重的加权和。为了实现勒索软件检测恶意评分的自动化建模，我们提出了一种基于遗传规划(GP)的方法，该方法通过从输入特征和算子集中选择适当的特征和函数，自动进化出基于行为的恶意评分模型。实验结果表明，进化最好的模型正确地分配了恶意分数，低于阈值的恶意分数超过85%的未见恶意软件实例，高于阈值的恶意分数超过99%的未见勒索软件实例。

引用次数: 3

Classification of Artificial and Real Objects Using Faster Region-Based Convolutional Neural Networks 基于更快区域的卷积神经网络的人工和真实物体分类

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

Pub Date : 2021-12-05 DOI: 10.1109/SSCI50451.2021.9660105

Ritvik Sai Teegavarapu, Debojit Biswas

Object detection and classification tasks can be addressed effectively using machine learning (ML) methods that use convolutional neural networks (CNNs) and region-based convolutional neural networks (R-CNNs). In this study, the ability of R-CNNs to distinguish between digital images of artificial and real objects is evaluated. A single-shot detection (SSD) network is also developed to serve as a baseline approach and for comparative evaluation. Experiments are designed using several images of real and artificial leaves as inputs to the R-CNNs that are trained and tested with different proposal areas of the images. The performances of R-CNNs and SSDs are evaluated using mean average precision (mAP) measure. Results from this study indicate that trained R-CNN s perform well in classification of real and artificial leaves and are robust in performance against changes in many of the experimental factors including minimal training data and resolution of the images. R-CNNs have also performed better than SSDs in the classification tasks with higher values of mAP. The performance of R-CNNs is affected by the proposal area, or the number of subsections the R-CNNs utilizes to determine distinct characteristics of the objects (i.e., leaves) presented. Results based on limited experiments from this study indicate the R-CNNs and their variants are ideally suited for object classification tasks with numerous real-world applications.

对象检测和分类任务可以使用使用卷积神经网络(cnn)和基于区域的卷积神经网络(r - cnn)的机器学习(ML)方法有效地解决。在本研究中，评估了r - cnn区分人造物体和真实物体的数字图像的能力。还开发了单次检测(SSD)网络，作为基线方法和比较评估。实验设计使用几张真实和人造树叶的图像作为r - cnn的输入，这些r - cnn使用图像的不同建议区域进行训练和测试。采用平均精度(mAP)方法对r - cnn和ssd的性能进行了评价。本研究的结果表明，训练后的R-CNN在真实叶和人工叶的分类中表现良好，并且对包括最小训练数据和图像分辨率在内的许多实验因素的变化具有鲁棒性。在mAP值较高的分类任务中，r - cnn的表现也优于ssd。r - cnn的性能受到提议区域的影响，或者r - cnn用来确定所呈现对象(即叶子)的不同特征的子部分数量的影响。基于本研究有限实验的结果表明，r - cnn及其变体非常适合具有大量实际应用的对象分类任务。

{"title":"Classification of Artificial and Real Objects Using Faster Region-Based Convolutional Neural Networks","authors":"Ritvik Sai Teegavarapu, Debojit Biswas","doi":"10.1109/SSCI50451.2021.9660105","DOIUrl":"https://doi.org/10.1109/SSCI50451.2021.9660105","url":null,"abstract":"Object detection and classification tasks can be addressed effectively using machine learning (ML) methods that use convolutional neural networks (CNNs) and region-based convolutional neural networks (R-CNNs). In this study, the ability of R-CNNs to distinguish between digital images of artificial and real objects is evaluated. A single-shot detection (SSD) network is also developed to serve as a baseline approach and for comparative evaluation. Experiments are designed using several images of real and artificial leaves as inputs to the R-CNNs that are trained and tested with different proposal areas of the images. The performances of R-CNNs and SSDs are evaluated using mean average precision (mAP) measure. Results from this study indicate that trained R-CNN s perform well in classification of real and artificial leaves and are robust in performance against changes in many of the experimental factors including minimal training data and resolution of the images. R-CNNs have also performed better than SSDs in the classification tasks with higher values of mAP. The performance of R-CNNs is affected by the proposal area, or the number of subsections the R-CNNs utilizes to determine distinct characteristics of the objects (i.e., leaves) presented. Results based on limited experiments from this study indicate the R-CNNs and their variants are ideally suited for object classification tasks with numerous real-world applications.","PeriodicalId":255763,"journal":{"name":"2021 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123080652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Intelligent Strategies to Combine Move Heuristics in Selection Hyper-heuristics for Real-World Fibre Network Design Optimisation 结合移动启发式选择的智能策略-现实世界光纤网络设计优化的超启发式

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

Pub Date : 2021-12-05 DOI: 10.1109/SSCI50451.2021.9659994

Anil Arpaci, Jun Chen, J. Drake, Tim Glover

Increasing competition in today's telecommunication industry drives the need for more cost effective services. In order to reduce the cost of designing a fibre network with low capital expenditure, automation and optimisation of network design has become crucial. British Telecom's network design software, BT NetDesign, has been developed for the purpose of network design and optimisation using a rich set of network/graph-based heuristics and the simulated annealing (SA) search method. Although NetDesign provides several different ways of navigating the search space via different move heuristics, the existing search method (SA) does not consistently reach the near-global optimum as the size of network increases. To deal with larger networks, this study utilises an intelligent approach based on the well-known Luby sequence to combine move heuristics, using two separate learning schemes: frequency based and bigram statistics. These two strategies are rigorously evaluated on network instances of different sizes. Experimental results on real-world case studies indicate that a bigram scheme with a longer warm-up period to learn heuristic combinations can reach high quality solutions for large networks.

当今电信行业日益激烈的竞争促使人们需要更具成本效益的服务。为了降低光纤网络的设计成本，降低资本支出，网络设计的自动化和优化变得至关重要。英国电信的网络设计软件BT NetDesign是为网络设计和优化而开发的，使用了一套丰富的基于网络/图的启发式和模拟退火(SA)搜索方法。尽管NetDesign通过不同的移动启发式提供了几种不同的导航搜索空间的方法，但随着网络规模的增加，现有的搜索方法(SA)并不能始终达到近全局最优。为了处理更大的网络，本研究利用了一种基于著名的Luby序列的智能方法来结合移动启发式，使用两种独立的学习方案:基于频率和双图统计。这两种策略在不同规模的网络实例上进行了严格的评估。实际案例研究的实验结果表明，具有较长预热期的双图方案可以获得大型网络的高质量解。

{"title":"Intelligent Strategies to Combine Move Heuristics in Selection Hyper-heuristics for Real-World Fibre Network Design Optimisation","authors":"Anil Arpaci, Jun Chen, J. Drake, Tim Glover","doi":"10.1109/SSCI50451.2021.9659994","DOIUrl":"https://doi.org/10.1109/SSCI50451.2021.9659994","url":null,"abstract":"Increasing competition in today's telecommunication industry drives the need for more cost effective services. In order to reduce the cost of designing a fibre network with low capital expenditure, automation and optimisation of network design has become crucial. British Telecom's network design software, BT NetDesign, has been developed for the purpose of network design and optimisation using a rich set of network/graph-based heuristics and the simulated annealing (SA) search method. Although NetDesign provides several different ways of navigating the search space via different move heuristics, the existing search method (SA) does not consistently reach the near-global optimum as the size of network increases. To deal with larger networks, this study utilises an intelligent approach based on the well-known Luby sequence to combine move heuristics, using two separate learning schemes: frequency based and bigram statistics. These two strategies are rigorously evaluated on network instances of different sizes. Experimental results on real-world case studies indicate that a bigram scheme with a longer warm-up period to learn heuristic combinations can reach high quality solutions for large networks.","PeriodicalId":255763,"journal":{"name":"2021 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125322672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Scalable embedding of multiple perspectives for indefinite life-science data analysis 无限生命科学数据分析的多视角可扩展嵌入

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

Pub Date : 2021-12-05 DOI: 10.1109/SSCI50451.2021.9659914

Maximilian Münch, Simon Heilig, Philipp Väth, Frank-Michael Schleif

Life science data analysis frequently encounters particular challenges that cannot be solved with classical techniques from data analytics or machine learning domains. The complex inherent structure of the data and especially the encoding in non-standard ways, e.g., as genome- or protein-sequences, graph structure or histograms, often limit the development of appropriate classification models. To address these limitations, the application of domain-specific expert similarity measures has gained a lot of attention in the past. However, the use of such expert measures suffers from two major drawbacks: (a) there is not one outstanding similarity measure that guarantees success in all application scenarios, and (b) such similarity functions often lead to indefinite data that cannot be processed by classical machine learning methods. In order to tackle both of these limitations, this paper presents a method to embed indefinite life science data with various similarity measures at the same time into a complex-valued vector space. We test our approach on various life science data sets and evaluate the performance against other competitive methods to show its efficiency.

生命科学数据分析经常遇到数据分析或机器学习领域的经典技术无法解决的特殊挑战。数据复杂的固有结构，特别是以非标准方式编码，如基因组或蛋白质序列、图形结构或直方图，往往限制了适当分类模型的发展。为了解决这些限制，特定领域的专家相似度度量的应用在过去得到了很多关注。然而，这种专家度量的使用有两个主要缺点:(a)没有一个突出的相似性度量保证在所有应用场景中成功，(b)这种相似性函数通常会导致不确定的数据，无法通过经典的机器学习方法处理。为了解决这两个问题，本文提出了一种将具有多种相似度量的不确定生命科学数据同时嵌入复值向量空间的方法。我们在各种生命科学数据集上测试了我们的方法，并与其他竞争方法进行了性能评估，以显示其效率。

{"title":"Scalable embedding of multiple perspectives for indefinite life-science data analysis","authors":"Maximilian Münch, Simon Heilig, Philipp Väth, Frank-Michael Schleif","doi":"10.1109/SSCI50451.2021.9659914","DOIUrl":"https://doi.org/10.1109/SSCI50451.2021.9659914","url":null,"abstract":"Life science data analysis frequently encounters particular challenges that cannot be solved with classical techniques from data analytics or machine learning domains. The complex inherent structure of the data and especially the encoding in non-standard ways, e.g., as genome- or protein-sequences, graph structure or histograms, often limit the development of appropriate classification models. To address these limitations, the application of domain-specific expert similarity measures has gained a lot of attention in the past. However, the use of such expert measures suffers from two major drawbacks: (a) there is not one outstanding similarity measure that guarantees success in all application scenarios, and (b) such similarity functions often lead to indefinite data that cannot be processed by classical machine learning methods. In order to tackle both of these limitations, this paper presents a method to embed indefinite life science data with various similarity measures at the same time into a complex-valued vector space. We test our approach on various life science data sets and evaluate the performance against other competitive methods to show its efficiency.","PeriodicalId":255763,"journal":{"name":"2021 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116165849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Discrete Collective Estimation in Swarm Robotics with Ranked Voting Systems 基于排序投票系统的群体机器人离散集体估计

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

Pub Date : 2021-12-05 DOI: 10.1109/SSCI50451.2021.9659868

Qihao Shan, Alexander Heck, Sanaz Mostaghim

The best-of-n problem has been a popular research topic for understanding collective decision-making in recent years. Researchers aim to enable a swarm of agents to collectively converge to a single opinion out of a series of potential options, using only local interactions. In this paper, we investigate the viability of decision-making via majority rule using ranked voting systems in multi-option scenarios where n>2. We focus on two ranked voting systems, single transferable vote (STV) and Borda count (BC). The proposed algorithms are tested in a discrete collective estimation scenario, and compared against two benchmark algorithms, direct comparison (DC) and majority rule using first-past-the-post voting (FPTP). We have analyzed the experimental results, focusing on the trade-off between accuracy and speed in decision-making. We have concluded that ranked voting systems can significantly improve the performances of collective decision-making strategies in multi-option scenarios. Our experiments show that BC is the best performing algorithm in the studied scenario.

近年来，最优问题一直是理解集体决策的热门研究课题。研究人员的目标是使一群智能体仅使用局部交互，就能从一系列潜在选项中集体收敛到一个单一的意见。在本文中，我们研究了在n>2的多选项场景下，使用排名投票系统进行多数决决策的可行性。我们重点研究了两种排名投票系统，即单一可转移投票(STV)和博尔达计数(BC)。提出的算法在一个离散的集体估计场景中进行了测试，并与两种基准算法进行了比较，直接比较(DC)和使用简单多数制投票(FPTP)的多数决规则。我们对实验结果进行了分析，重点关注决策的准确性和速度之间的权衡。我们的结论是，在多选项场景下，排名投票系统可以显著提高集体决策策略的性能。我们的实验表明，BC是研究场景中性能最好的算法。

引用次数: 2

A Study of Transfer Learning in a Generation Constructive Hyper-Heuristic for One Dimensional Bin Packing 一维装箱生成超启发式迁移学习研究

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

Pub Date : 2021-12-05 DOI: 10.1109/SSCI50451.2021.9660092

Darius Scheepers, N. Pillay

The research presented in this paper investigates the use of transfer learning in a genetic programming generation constructive hyper-heuristic for discrete optimisation, namely, the one dimensional bin packing problem (1BPP). The source hyper-heuristic solves easy and medium problem instances from the Scholl benchmark set and the target hyper-heuristic solves the hard problem instances in the same benchmark set. Performance is assessed in terms of objective value, i.e. the number of bins, computational effort and generality of the hyper-heuristic. This study firstly compares the performance of two transfer learning approaches previously shown to be effective for generation constructive hyper-heuristics, for the one dimensional bin packing problem. Both these approaches performed better than not using transfer learning, with the approach transferring the best elements from each generation of the source hyper-heuristic to the target hyper-heuristic (TL2) producing the best results. The study then investigated transferring knowledge on an area of the search space rather than a point in the search space. Three approaches were developed and evaluated for this purpose. Two of these approaches were able to improve the performance of TL2 on three of the ten problem instances with respect to objective value.

本文研究了迁移学习在离散优化的遗传规划生成建设性超启发式中的应用，即一维装箱问题(1BPP)。源超启发式算法求解Scholl基准集中的简单和中等问题实例，目标超启发式算法求解同一基准集中的困难问题实例。性能是根据客观值来评估的，即箱的数量，计算工作量和超启发式的通用性。本研究首先比较了两种迁移学习方法的性能，这两种迁移学习方法先前被证明是有效的，用于生成建设性的超启发式，用于一维装箱问题。这两种方法都比不使用迁移学习表现得更好，该方法将每一代源超启发式的最佳元素转移到目标超启发式(TL2)中，产生最佳结果。然后研究了在搜索空间的一个区域而不是搜索空间中的一个点上转移知识。为此目的制定和评价了三种方法。其中两种方法能够在10个问题实例中的3个上提高TL2的客观值性能。

{"title":"A Study of Transfer Learning in a Generation Constructive Hyper-Heuristic for One Dimensional Bin Packing","authors":"Darius Scheepers, N. Pillay","doi":"10.1109/SSCI50451.2021.9660092","DOIUrl":"https://doi.org/10.1109/SSCI50451.2021.9660092","url":null,"abstract":"The research presented in this paper investigates the use of transfer learning in a genetic programming generation constructive hyper-heuristic for discrete optimisation, namely, the one dimensional bin packing problem (1BPP). The source hyper-heuristic solves easy and medium problem instances from the Scholl benchmark set and the target hyper-heuristic solves the hard problem instances in the same benchmark set. Performance is assessed in terms of objective value, i.e. the number of bins, computational effort and generality of the hyper-heuristic. This study firstly compares the performance of two transfer learning approaches previously shown to be effective for generation constructive hyper-heuristics, for the one dimensional bin packing problem. Both these approaches performed better than not using transfer learning, with the approach transferring the best elements from each generation of the source hyper-heuristic to the target hyper-heuristic (TL2) producing the best results. The study then investigated transferring knowledge on an area of the search space rather than a point in the search space. Three approaches were developed and evaluated for this purpose. Two of these approaches were able to improve the performance of TL2 on three of the ten problem instances with respect to objective value.","PeriodicalId":255763,"journal":{"name":"2021 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114145836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

AHPSO: Altruistic Heterogeneous Particle Swarm Optimisation Algorithm for Global Optimisation 全局优化的利他异构粒子群优化算法

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

Pub Date : 2021-12-05 DOI: 10.1109/SSCI50451.2021.9660149

Fevzi Tugrul Varna, P. Husbands

This paper introduces a new particle swarm optimisation variant: the altruistic heterogeneous particle swarm optimisation algorithm (AHPSO). The algorithm conceptualises particles as energy-driven agents with bio-inspired altruistic behaviour. In our approach, particles possess a current energy level and an activation threshold and are in one of two possible states (active or inactive) depending on their energy levels at time t. The idea of altruism is used to form lending-borrowing relationships among particles to change an agent's state from inactive to active, and the main search mechanism exploits this idea. Diversity in the swarm, which prevent premature convergence, is maintained via agent states and the level of altruistic behaviour particles exhibit. The performance of AHPSO was compared with 11 metaheuristics and 12 state-of-the-art PSO variants using the CEC'17 and CEC'05 test suites at 30 and 50 dimensions. The AHPSO algorithm outperformed all 23 comparison algorithms on both benchmark test suites at both 30 and 50 dimensions.

本文介绍了一种新的粒子群优化算法:利他异构粒子群优化算法(AHPSO)。该算法将粒子概念化为具有生物启发的利他行为的能量驱动代理。在我们的方法中，粒子拥有当前的能量水平和激活阈值，并且根据它们在时刻t的能量水平处于两种可能的状态(活跃或不活跃)之一。利他主义的思想用于形成粒子之间的借贷关系，以将代理的状态从不活跃改变为活跃，主要的搜索机制利用了这一思想。群体中的多样性可以防止过早的趋同，这种多样性是通过个体状态和粒子表现出的利他行为水平来维持的。使用CEC'17和CEC'05测试套件在30和50个维度上对AHPSO的性能与11个元启发式和12个最先进的PSO变体进行了比较。AHPSO算法在30和50个维度的基准测试套件上都优于所有23种比较算法。

引用次数: 2

Information sharing in multi-agent search and task allocation problems 多智能体搜索与任务分配问题中的信息共享

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

Pub Date : 2021-12-05 DOI: 10.1109/SSCI50451.2021.9660121

Mathias Minos-Stensrud, H. Moen, Jan Dyre Bjerknes

The impact of information sharing in the generalized problem of combined search and task allocation has not been studied in detail in the context of multi-agent systems. Thus, a simple swarm intelligence mechanism called call-out and a basic game theoretic auction mechanism are compared and analyzed in terms of communication distance and information fault tolerance. Simulations show that the auction mechanism performs well under varying communication distances but has problems when the communication distance is low and when facing faulty information. The call-out mechanism, however, performs significantly better when communication distances are low and when information transfer between agents is uncertain. Furthermore, call-out performs almost equal to auction for intermittent communication distances but due to the inherent property of “over-coordination” for large communication distances agents become “over-committed” in solving tasks at the expense of searching for new tasks. This fundamental system behavior can only be studied in the combined search and task allocation problem,

在多智能体系统背景下，信息共享对广义组合搜索和任务分配问题的影响尚未得到详细的研究。在此基础上，从通信距离和信息容错性两个方面对一种简单的群体智能机制——呼出机制和一种基本的博弈论拍卖机制进行了比较和分析。仿真结果表明，该拍卖机制在不同通信距离下表现良好，但在通信距离较低和面对错误信息时存在问题。然而，呼出机制在通信距离较低和代理之间的信息传递不确定时表现得更好。此外，对于间歇性通信距离，呼出执行几乎等同于拍卖，但由于大通信距离的“过度协调”的固有属性，代理在解决任务时变得“过度承诺”，而牺牲了搜索新任务的代价。这种基本的系统行为只能在搜索和任务分配的组合问题中研究，

引用次数: 0

Investigating Neural Network Architectures, Techniques, and Datasets for Autonomous Navigation in Simulation 研究自主导航仿真中的神经网络架构、技术和数据集

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

Pub Date : 2021-12-05 DOI: 10.1109/SSCI50451.2021.9659907

Oliver Chang, Christiana Marchese, Jared Mejia, A. Clark

Neural networks (NNs) are becoming an increasingly important part of mobile robot control systems. Compared with traditional methods, NNs (and other data-driven techniques) produce comparable-if not better-results while requiring less engineering knowhow. Training NNs, however, still requires exploration of a significant number of architectural, optimization, and evaluation options. In this study, we build a simulation environment, generate three image datasets using distinct techniques, train 652 models (including replicates) using a variety of architectures and paradigms (e.g., classification, regression, etc.), and evaluate the navigation ability of the model in simulation. Our goal is to explore a large number of model possibilities so that we can select the most promising for future study with a physical device. Training datasets leading to the best performing models were those that included a significant amount of noise from seemingly inefficient actions. The most promising models explicitly incorporated “memory” wherein previous actions were included as an input in the next step. Such models performed as good or better than conventional convolutional NNs, recurrent NNs, and custom architectures including two camera frames. Although trained models perform well in an environment matching the distribution of the training dataset, they fail when the simulation environment is altered in a seemingly insignificant manner. In robotics research it is often taken for granted that a model with good validation characteristics will perform well on the underlying task, but the results presented here show that there can often be a loose relationship between validation metrics and performance.

神经网络在移动机器人控制系统中扮演着越来越重要的角色。与传统方法相比，神经网络(以及其他数据驱动技术)在需要更少的工程知识的情况下，即使不是更好，也能产生相当的结果。然而，训练神经网络仍然需要探索大量的架构、优化和评估选项。在本研究中，我们构建了一个仿真环境，使用不同的技术生成了三个图像数据集，使用各种架构和范式(如分类、回归等)训练了652个模型(包括复制)，并在仿真中评估了模型的导航能力。我们的目标是探索大量的模型可能性，以便我们可以选择最有希望的用于未来物理设备的研究。导致表现最好的模型的训练数据集是那些包含了大量来自看似低效的操作的噪声的数据集。最有前途的模型明确地结合了“记忆”，其中先前的动作被作为下一步的输入。这些模型的表现与传统的卷积神经网络、循环神经网络和包括两个相机帧的定制架构一样好，甚至更好。虽然训练后的模型在与训练数据集分布相匹配的环境中表现良好，但当模拟环境以看似微不足道的方式改变时，它们就会失败。在机器人研究中，通常理所当然地认为具有良好验证特征的模型将在底层任务上表现良好，但本文给出的结果表明，验证指标和性能之间通常存在松散的关系。

{"title":"Investigating Neural Network Architectures, Techniques, and Datasets for Autonomous Navigation in Simulation","authors":"Oliver Chang, Christiana Marchese, Jared Mejia, A. Clark","doi":"10.1109/SSCI50451.2021.9659907","DOIUrl":"https://doi.org/10.1109/SSCI50451.2021.9659907","url":null,"abstract":"Neural networks (NNs) are becoming an increasingly important part of mobile robot control systems. Compared with traditional methods, NNs (and other data-driven techniques) produce comparable-if not better-results while requiring less engineering knowhow. Training NNs, however, still requires exploration of a significant number of architectural, optimization, and evaluation options. In this study, we build a simulation environment, generate three image datasets using distinct techniques, train 652 models (including replicates) using a variety of architectures and paradigms (e.g., classification, regression, etc.), and evaluate the navigation ability of the model in simulation. Our goal is to explore a large number of model possibilities so that we can select the most promising for future study with a physical device. Training datasets leading to the best performing models were those that included a significant amount of noise from seemingly inefficient actions. The most promising models explicitly incorporated “memory” wherein previous actions were included as an input in the next step. Such models performed as good or better than conventional convolutional NNs, recurrent NNs, and custom architectures including two camera frames. Although trained models perform well in an environment matching the distribution of the training dataset, they fail when the simulation environment is altered in a seemingly insignificant manner. In robotics research it is often taken for granted that a model with good validation characteristics will perform well on the underlying task, but the results presented here show that there can often be a loose relationship between validation metrics and performance.","PeriodicalId":255763,"journal":{"name":"2021 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129520504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Online Microgrid Energy Management Based on Safe Deep Reinforcement Learning 基于安全深度强化学习的微电网在线能量管理

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

Pub Date : 2021-12-05 DOI: 10.1109/SSCI50451.2021.9659545

Hepeng Li, Zhenhua Wang, Lusi Li, Haibo He

Microgrids provide power systems with an effective manner to integrate distributed energy resources, increase power supply reliability, and reduce operational cost. However, intermittent renewable energy resources (RESs) makes it challenging to operate a microgrid safely and economically based on forecasting. To overcome this issue, we develop an online energy management approach for efficient microgrid operation using safe deep reinforcement learning (SDRL). By considering uncertainties and AC power flow, the proposed method formulates online microgrid energy management as a constrained Markov decision process (CMDP). The objective is to find a safety-guaranteed scheduling policy to minimize the total operational cost. To achieve this, we use a SDRL method to learn a neural network-based policy based on constrained policy optimization (CPO). Different from tradition DRL methods that allow an agent to freely explore any behavior during training, the proposed method limits the exploration to safe policies that satisfy AC power flow constraints during training. The proposed method is model-free and does not require predictive information or explicit model of the microgrid. The proposed method is trained and tested on a medium voltage distribution network with real-world power grid data from California Independent Operator (CAISO). Simulation results verify the effectiveness and superiority of proposed method over traditional DRL approaches.

微电网为电力系统整合分布式能源、提高供电可靠性、降低运行成本提供了有效途径。然而，间歇性可再生能源(RESs)使得基于预测的微电网安全、经济地运行具有挑战性。为了克服这一问题，我们开发了一种使用安全深度强化学习(SDRL)的在线能源管理方法，用于有效的微电网运行。该方法考虑了不确定性和交流潮流，将微网在线能量管理表述为约束马尔可夫决策过程(CMDP)。目标是找到一个安全保证的调度策略，以最小化总运营成本。为了实现这一点，我们使用SDRL方法来学习基于约束策略优化(CPO)的基于神经网络的策略。传统的DRL方法允许智能体在训练过程中自由探索任何行为，而该方法将探索限制在训练过程中满足交流潮流约束的安全策略上。该方法是无模型的，不需要预测信息或明确的微电网模型。利用加州独立运营商(CAISO)的实际电网数据对该方法进行了训练和测试。仿真结果验证了该方法相对于传统DRL方法的有效性和优越性。

{"title":"Online Microgrid Energy Management Based on Safe Deep Reinforcement Learning","authors":"Hepeng Li, Zhenhua Wang, Lusi Li, Haibo He","doi":"10.1109/SSCI50451.2021.9659545","DOIUrl":"https://doi.org/10.1109/SSCI50451.2021.9659545","url":null,"abstract":"Microgrids provide power systems with an effective manner to integrate distributed energy resources, increase power supply reliability, and reduce operational cost. However, intermittent renewable energy resources (RESs) makes it challenging to operate a microgrid safely and economically based on forecasting. To overcome this issue, we develop an online energy management approach for efficient microgrid operation using safe deep reinforcement learning (SDRL). By considering uncertainties and AC power flow, the proposed method formulates online microgrid energy management as a constrained Markov decision process (CMDP). The objective is to find a safety-guaranteed scheduling policy to minimize the total operational cost. To achieve this, we use a SDRL method to learn a neural network-based policy based on constrained policy optimization (CPO). Different from tradition DRL methods that allow an agent to freely explore any behavior during training, the proposed method limits the exploration to safe policies that satisfy AC power flow constraints during training. The proposed method is model-free and does not require predictive information or explicit model of the microgrid. The proposed method is trained and tested on a medium voltage distribution network with real-world power grid data from California Independent Operator (CAISO). Simulation results verify the effectiveness and superiority of proposed method over traditional DRL approaches.","PeriodicalId":255763,"journal":{"name":"2021 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128761988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2021 IEEE Symposium Series on Computational Intelligence (SSCI)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀