Pub Date: 2021-01-13
DOI: 10.1109/ICOIN50884.2021.9333991
JooYong Shim, Joongheon Kim, Jong-Kook Kim
The trade-off between accuracy and computation should be considered when applying generative adversarial network (GAN)-based image generation to real-world applications. This paper presents a simple yet efficient method based on Progressive Growing of GANs (PGGAN) to exploit the trade-off for image generation. The scheme is evaluated using the LSUN dataset.
Title: On the Tradeoff between Computation-Time and Learning-Accuracy in GAN-based Super-Resolution Deep Learning
Published in: 2021 International Conference on Information Networking (ICOIN), pp. 422-424
Pub Date: 2021-01-13
DOI: 10.1109/ICOIN50884.2021.9333979
Indraneel Ghosh, Subham Kumar, Ashutosh Bhatia, D. Vishwakarma
Command-and-Control (C&C) servers use Domain Generation Algorithms (DGAs) to communicate with bots for uploading malware and coordinating attacks. Manual detection methods and sinkholing fail against these algorithms, which can generate thousands of domain names within a short period, creating a need for an automated and intelligent system that can detect such malicious domains. LSTM (Long Short-Term Memory) is one of the most popular deep learning architectures for DGA detection, but it performs poorly against Dictionary Domain Generation Algorithms. This work explores the application of various machine learning techniques to this problem, including specialized approaches such as Auxiliary Loss Optimization for Hypothesis Augmentation (ALOHA), with a particular focus on their performance against Dictionary Domain Generation Algorithms. The ALOHA-LSTM model improves detection accuracy on Dictionary Domain Generation Algorithms compared to the state-of-the-art LSTM model, and improvements were observed for word-based DGAs as well. Addressing this problem is of paramount importance, as DGAs are used extensively in carrying out Distributed Denial-of-Service (DDoS) attacks, which are among the most significant and damaging cyber-attacks carried out in the past.
Title: Using Auxiliary Inputs in Deep Learning Models for Detecting DGA-based Domain Names
Published in: 2021 International Conference on Information Networking (ICOIN), pp. 391-396
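The ALOHA approach named above augments a classifier with auxiliary loss terms trained alongside the main DGA/benign objective. As a hedged illustration (not the paper's implementation: the shared linear representation and the `aloha_loss`/`bce` helpers are invented for this sketch, and the real model uses an LSTM), the combined objective can be written as a main binary cross-entropy plus weighted auxiliary binary cross-entropies over a shared representation:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def bce(y_true, y_pred, eps=1e-9):
    # Binary cross-entropy averaged over samples
    y_pred = np.clip(y_pred, eps, 1 - eps)
    return -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))

def aloha_loss(shared, w_main, w_aux, y_main, y_aux, aux_weight=0.1):
    """Main DGA/benign loss plus weighted auxiliary losses on a shared representation.

    shared  : (n, d) shared features (stand-in for an LSTM's output)
    w_main  : (d,) weights of the main binary head
    w_aux   : (k, d) weights of k auxiliary heads
    y_main  : (n,) binary labels (1 = DGA)
    y_aux   : (n, k) auxiliary binary labels (e.g. per-family indicators)
    """
    loss = bce(y_main, sigmoid(shared @ w_main))
    for j in range(w_aux.shape[0]):
        loss += aux_weight * bce(y_aux[:, j], sigmoid(shared @ w_aux[j]))
    return loss
```

Setting `aux_weight=0` recovers the plain single-task loss, which is a convenient sanity check when wiring up the heads.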
Pub Date: 2021-01-13
DOI: 10.1109/ICOIN50884.2021.9333854
Yeunwoong Kyung
Software-defined access networks (SDANs) have gained considerable attention due to their flexible and fine-granular mobile traffic management. Because of dynamic user mobility, an efficient flow rule management method is required in SDAN. To deal with the challenges of mobility and the limited rule space in forwarding nodes, a mobility-aware prioritized flow rule placement scheme for SDAN is proposed. The proposed scheme proactively places flow rules based on flow characteristics, with particular attention to delay sensitivity, since delay directly affects users' QoS experience.
Title: Mobility-Aware Prioritized Flow Rule Placement in Software-Defined Access Networks
Published in: 2021 International Conference on Information Networking (ICOIN), pp. 59-61
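One way to make "prioritized placement under limited rule space" concrete is a priority-based eviction policy: when a forwarding node's table is full, a newly arriving rule displaces the lowest-priority resident rule only if it outranks it. This is a hypothetical sketch of such a policy (the `place_rule` helper and the tuple encoding are invented here, not taken from the paper):

```python
import heapq

def place_rule(flow_table, capacity, rule):
    """Install a flow rule into a capacity-limited table.

    flow_table : list kept as a min-heap of (priority, flow_id) tuples,
                 so flow_table[0] is always the lowest-priority rule
    rule       : (priority, flow_id); delay-sensitive flows get high priority
    Returns the evicted rule, or None if nothing was evicted. If the table
    is full and the new rule does not outrank the resident minimum, the
    new rule is simply not installed.
    """
    evicted = None
    if len(flow_table) >= capacity and flow_table and flow_table[0][0] < rule[0]:
        evicted = heapq.heappop(flow_table)  # drop lowest-priority rule
    if len(flow_table) < capacity:
        heapq.heappush(flow_table, rule)
    return evicted
```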
Pub Date: 2021-01-13
DOI: 10.1109/ICOIN50884.2021.9333993
Jin-Oh Park, Dae-Young Lee, Young-Seok Choi
Epilepsy is a common neurological disorder, affecting around 50 million people worldwide, and the risk of premature death in epileptic patients is up to three times higher than in the general population. To improve patients' quality of life, non-invasive measurement of brain rhythms, i.e., the electroencephalogram (EEG), plays an important role in detecting epileptic seizures, the hallmark of epilepsy. By measuring the complexity of EEG signals, various entropy methods are used to detect different types of epileptic seizures. Conventional entropy methods such as Approximate Entropy (ApEn) and Sample Entropy (SampEn), however, depend on data length and predetermined parameters. Here, we present a multiscale extension of Distribution Entropy (DistEn), referred to as multiscale DistEn (MDE), that addresses these disadvantages. MDE combines a moving-average procedure with DistEn estimation to obtain reliable complexity estimates over multiple temporal scales for short EEG signals. We validate the performance of MDE using actual normal and epileptic EEG signals. The experimental results show that MDE outperforms other entropy measures in distinguishing ictal EEG recordings when only short recordings are available.
Title: Robust Epileptic Seizure Detection Using Multiscale Distribution Entropy Analysis for Short EEG Recordings
Published in: 2021 International Conference on Information Networking (ICOIN), pp. 473-476
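DistEn, as defined in the literature this abstract builds on, is the normalized Shannon entropy of the empirical distribution of pairwise Chebyshev distances between embedding vectors; the abstract's MDE additionally applies a moving-average procedure per scale. A sketch under those definitions (the parameter defaults `m=2`, `bins=64` and the scale set are illustrative, not the paper's settings):

```python
import numpy as np

def dist_en(x, m=2, bins=64):
    """Distribution Entropy of a 1-D signal.

    Embed x into m-dimensional vectors, take Chebyshev distances between
    all pairs, and return the Shannon entropy of their empirical
    distribution, normalized to [0, 1] by log2(bins).
    """
    x = np.asarray(x, dtype=float)
    n = len(x) - m + 1
    emb = np.stack([x[i:i + n] for i in range(m)], axis=1)  # (n, m) embedding
    # Pairwise Chebyshev (max-coordinate) distances, upper triangle only
    d = np.max(np.abs(emb[:, None, :] - emb[None, :, :]), axis=2)
    dists = d[np.triu_indices(n, k=1)]
    p, _ = np.histogram(dists, bins=bins)
    p = p / p.sum()
    p = p[p > 0]
    return -np.sum(p * np.log2(p)) / np.log2(bins)

def multiscale_dist_en(x, scales=(1, 2, 3), m=2, bins=64):
    """Moving-average multiscale extension (MDE) as the abstract describes:
    smooth the signal with a window of each scale, then apply DistEn."""
    out = []
    for s in scales:
        y = np.asarray(x, dtype=float) if s == 1 else np.convolve(x, np.ones(s) / s, mode='valid')
        out.append(dist_en(y, m=m, bins=bins))
    return out
```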
Pub Date: 2021-01-13
DOI: 10.1109/ICOIN50884.2021.9333970
Trong-Hop Do, Dang-Khoa Tran, Dinh-Quang Hoang, Dat Vuong, Trong-Minh Hoang, Nhu-Ngoc Dao, Chunghyun Lee, Sungrae Cho
Intelligent Transport Systems (ITS) have been considered the ultimate goal of traffic management in the 21st century, promising a more efficient transport system and a safer traffic experience. An ITS comprises many components, of which traffic data collection is one of the essential functionalities: the system relies on the collected data to respond to traffic conditions, and one of the most important quantities to collect is vehicle speed. With the rapid development of artificial intelligence, computer vision techniques are increasingly used for vehicle speed estimation; however, most focus on daytime environments. This paper proposes a novel algorithm for vehicle speed estimation. Transfer learning with YOLO is used as the backbone for detecting vehicle taillights, and based on the distance between the two taillights, a model combining camera geometry and a Kalman filter estimates the vehicle speed. The advantage of the proposed algorithm is that it can estimate speed quickly without the prior information about the vehicle that many existing algorithms require. Furthermore, processing is fast thanks to the backbone deep learning model, and owing to the Kalman filter the algorithm achieves a very high level of speed estimation accuracy. The performance of the proposed algorithm is verified through experimental results.
Title: A Novel Algorithm for Estimating Fast-Moving Vehicle Speed in Intelligent Transport Systems
Published in: 2021 International Conference on Information Networking (ICOIN), pp. 499-503
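The Kalman-filter component can be illustrated with a minimal scalar filter that smooths noisy per-frame speed measurements under a constant-speed model; the noise variances here are illustrative choices, and the paper's actual model additionally incorporates camera geometry:

```python
import numpy as np

def kalman_smooth_speed(measurements, q=0.5, r=4.0, v0=0.0, p0=10.0):
    """Scalar Kalman filter over noisy per-frame speed measurements.

    measurements : sequence of raw speed estimates (e.g. from camera geometry)
    q, r         : process and measurement noise variances (illustrative)
    Returns the filtered speed estimate at each frame.
    """
    v, p = v0, p0
    out = []
    for z in measurements:
        # Predict: constant-speed model, uncertainty grows by q
        p = p + q
        # Update: blend prediction with measurement via the Kalman gain
        k = p / (p + r)
        v = v + k * (z - v)
        p = (1 - k) * p
        out.append(v)
    return np.array(out)
```

The filtered sequence tracks the underlying speed while suppressing the frame-to-frame jitter of the raw geometric estimates.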
Pub Date: 2021-01-13
DOI: 10.1109/ICOIN50884.2021.9333853
Myungjae Shin, David A. Mohaisen, Joongheon Kim
Time series prediction plays a significant role in the Bitcoin market because of the market's volatile characteristics. Recently, deep neural networks combined with advanced techniques such as ensembling have shown successful performance in various fields. In this paper, an ensemble-enabled Long Short-Term Memory (LSTM) network built from models trained on various time intervals is proposed for predicting Bitcoin price. While the hourly and minute-level data sets exhibit moderate shifts, the daily data shows a relatively deterministic shift; the ensemble-enabled LSTM architecture therefore learns the individual characteristics of each data set and their impact on price prediction. Experimental results with real-world measurement data show that this architecture forecasts prices effectively, especially in risky periods such as sudden price falls.
Title: Bitcoin Price Forecasting via Ensemble-based LSTM Deep Learning Networks
Published in: 2021 International Conference on Information Networking (ICOIN), pp. 603-608
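The ensemble step, combining forecasts from models trained on different time intervals, can be sketched as a weighted average of per-model predictions; the `ensemble_forecast` helper and the idea of weighting by validation performance are illustrative assumptions, not the paper's exact scheme:

```python
import numpy as np

def ensemble_forecast(predictions, weights=None):
    """Combine forecasts from models trained on different time intervals
    (e.g. minute / hour / day) into a single price prediction.

    predictions : (n_models, horizon) array of per-model forecasts
    weights     : optional per-model weights (e.g. inverse validation error);
                  defaults to a plain average
    """
    predictions = np.asarray(predictions, dtype=float)
    if weights is None:
        weights = np.ones(predictions.shape[0])
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()  # normalize to a convex combination
    return weights @ predictions
```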
Pub Date: 2021-01-13
DOI: 10.1109/ICOIN50884.2021.9333882
Sheikh Salman Hassan, Umer Majeed, C. Hong
Maritime network traffic is increasing due to the ongoing need for trade and tourism, raising demand for convenient, reliable, energy-efficient, and high-speed network access at sea comparable to terrestrial networks. Therefore, to realize the concept of a connected world under the umbrella of sixth-generation (6G) networks, we propose a next-generation integrated space-oceanic network consisting of a set of low Earth orbit (LEO) satellites and marine user equipment (MUE). This paper investigates network profit maximization (NPM) by jointly optimizing MUE association and resource allocation in downlink communication. The formulated optimization problem is a mixed-integer nonlinear program (MINLP). To solve it, we propose an iterative algorithm based on Benders decomposition (BD). Numerical results demonstrate the convergence and effectiveness of the proposed algorithm.
Title: Reliable Integrated Space-Oceanic Network Profit Maximization by Bender Decomposition Approach
Published in: 2021 International Conference on Information Networking (ICOIN), pp. 565-570
Pub Date: 2021-01-13
DOI: 10.1109/ICOIN50884.2021.9333860
C. Bouras, A. Gkamas, V. Kokkinos, Nikolaos Papachristos
The Internet of Things (IoT) and wireless technologies like LoRa have brought more opportunities for application development in a plethora of fields, one of which is real-time location estimation of objects and people. In this study, we focus on monitoring a user's location through a wearable IoT device with LoRa connectivity. The paper presents the development and integration of an IoT ecosystem (hardware and software) that can be used in Search and Rescue (SAR) use cases. The proposed IoT ecosystem is evaluated and deployed in real scenarios with established gateways. We then compare existing location-estimation methods in terms of attenuation, cost, and operation to identify the most suitable solution for integration in LoRaWAN environments. Finally, conclusions and possible future improvements are described.
Title: Real-Time Geolocation Approach through LoRa on Internet of Things
Published in: 2021 International Conference on Information Networking (ICOIN), pp. 186-191
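A common baseline among the location-estimation methods such a comparison would cover is RSSI ranging with the log-distance path-loss model plus least-squares trilateration across gateways. This sketch is generic, not taken from the paper; the path-loss constants are illustrative and would be calibrated per gateway in a real deployment:

```python
import numpy as np

def rssi_to_distance(rssi_dbm, rssi_at_1m=-40.0, path_loss_exp=2.7):
    """Invert the log-distance path-loss model to estimate range from RSSI.

    rssi = rssi_at_1m - 10 * n * log10(d)  =>  d = 10^((rssi_at_1m - rssi) / (10 n))
    """
    return 10 ** ((rssi_at_1m - rssi_dbm) / (10.0 * path_loss_exp))

def trilaterate(gateways, distances):
    """Least-squares position fix from gateway coordinates and ranges.

    Linearizes the circle equations by subtracting the first one, then
    solves the resulting 2-D linear system with least squares.
    """
    (x0, y0), d0 = gateways[0], distances[0]
    A, b = [], []
    for (xi, yi), di in zip(gateways[1:], distances[1:]):
        A.append([2 * (xi - x0), 2 * (yi - y0)])
        b.append(d0**2 - di**2 + xi**2 - x0**2 + yi**2 - y0**2)
    sol, *_ = np.linalg.lstsq(np.array(A), np.array(b), rcond=None)
    return sol  # estimated (x, y)
```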
Pub Date: 2021-01-13
DOI: 10.1109/ICOIN50884.2021.9333940
Jeongmin Chae, Songnam Hong
Online multiple kernel learning (OMKL) has shown attractive performance in nonlinear function learning tasks. By leveraging a random feature (RF) approximation, the major drawback of OMKL, known as the curse of dimensionality, has recently been alleviated, making RF-based OMKL practical. In this paper we introduce a new research problem, named stream-based active multiple kernel learning (AMKL), in which a learner is allowed to request labels for selected data from an oracle according to a selection criterion. This is necessary in many real-world applications, since acquiring a true label is costly or time-consuming. We theoretically prove that the proposed AMKL achieves an optimal sublinear regret $\mathcal{O}(\sqrt{T})$, as in OMKL, with little labeled data, implying that the proposed selection criterion indeed avoids unnecessary label requests.
Title: Stream-Based Active Learning with Multiple Kernels
Published in: 2021 International Conference on Information Networking (ICOIN), pp. 718-722
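The random feature (RF) approximation the abstract credits with taming the curse of dimensionality is, in its standard Rahimi-Recht form, a randomized map whose inner products approximate an RBF kernel. A minimal sketch (the feature count and bandwidth are illustrative, and this shows only the approximation step, not the full AMKL algorithm):

```python
import numpy as np

def rbf_kernel(x, y, gamma=0.5):
    # Exact RBF kernel value for reference
    return np.exp(-gamma * np.sum((x - y) ** 2))

def random_fourier_features(X, n_features=500, gamma=0.5, seed=0):
    """Map inputs so that dot products approximate an RBF kernel.

    Draws frequencies w ~ N(0, 2*gamma*I) and phases b ~ U[0, 2*pi), then
    returns sqrt(2/D) * cos(X @ W + b); z(x) . z(y) ~= rbf_kernel(x, y).
    """
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    W = rng.normal(0.0, np.sqrt(2 * gamma), size=(d, n_features))
    b = rng.uniform(0.0, 2 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(X @ W + b)
```

Working in the feature space lets an online learner keep one weight vector per kernel instead of a growing kernel matrix, which is the scalability gain the abstract refers to.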
Pub Date: 2021-01-13
DOI: 10.1109/ICOIN50884.2021.9334003
Y. Kim, Eunwoo Kim
Incremental deep learning aims to learn a sequence of tasks without forgetting their knowledge. One naïve approach using a deep architecture is to increase the capacity of the architecture as the number of tasks increases; however, this incurs heavy memory consumption and makes the approach impractical. If we instead keep the capacity fixed, we encounter another challenging problem, catastrophic forgetting, which leads to a notable degradation of performance on previously learned tasks. To overcome these problems, we propose a clustering-guided incremental learning approach that mitigates catastrophic forgetting without increasing the capacity of the architecture. The proposed approach adopts a parameter-splitting strategy that assigns a subset of the architecture's parameters to each task to prevent forgetting, and uses clustering over a few stored samples per task to discover the relationships between tasks. When learning a new task, we utilize the knowledge of relevant tasks together with the current task to improve performance, maximizing the efficiency achievable within a single fixed architecture. Experimental results on a number of fine-grained datasets show that our method outperforms existing competitors.
Title: Clustering-Guided Incremental Learning of Tasks
Published in: 2021 International Conference on Information Networking (ICOIN), pp. 417-421
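The parameter-splitting strategy, assigning each task a disjoint subset of a fixed parameter budget so that updates for a new task cannot overwrite earlier tasks' parameters, can be sketched as index masks over a flat parameter vector. This is a simplified stand-in for the paper's per-layer scheme; the helper names and the random partitioning are invented for illustration:

```python
import numpy as np

def split_parameters(n_params, n_tasks, seed=0):
    """Randomly partition parameter indices into disjoint per-task subsets."""
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n_params)
    return np.array_split(perm, n_tasks)

def apply_task_update(params, grad, task_idx, masks, lr=0.1):
    """Gradient step that touches only the current task's parameter subset.

    Because each task owns a disjoint index set, earlier tasks' parameters
    are untouched by construction: no forgetting through overwriting.
    """
    new = params.copy()
    idx = masks[task_idx]
    new[idx] -= lr * grad[idx]
    return new
```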