首页 > 最新文献

2020 International Conference on Data Mining Workshops (ICDMW)最新文献

英文 中文
Electric Energy Demand Forecasting with Explainable Time-series Modeling 基于可解释时间序列模型的电力需求预测
Pub Date : 2020-11-01 DOI: 10.1109/ICDMW51313.2020.00101
Jin-Young Kim, Sung-Bae Cho
Recently, deep learning models are utilized to predict the energy consumption. However, to construct the smart grid systems, the conventional methods have limitation on explanatory power or require manual analysis. To overcome it, in this paper, we present a novel deep learning model that can infer the predicted results by calculating the correlation between the latent variables and output as well as forecast the future consumption in high performance. The proposed model is composed of 1) a main encoder that models the past energy demand, 2) a sub encoder that models electric information except global active power as the latent variable in two dimensions, 3) a predictor that maps the future demand from the concatenation of the latent variables extracted from each encoder, and 4) an explainer that provides the most significant electric information. Several experiments on a household electric energy demand dataset show that the proposed model not only has better performance than the conventional models, but also provides the ability to explain the results by analyzing the correlation of inputs, latent variables, and energy demand predicted in the form of time-series.
最近,深度学习模型被用来预测能源消耗。然而,为了构建智能电网系统,传统的方法存在解释力有限或需要人工分析的问题。为了克服这个问题,在本文中,我们提出了一种新的深度学习模型,该模型可以通过计算潜在变量与输出之间的相关性来推断预测结果,并预测高性能的未来消耗。所提出的模型由1)模拟过去能源需求的主编码器,2)模拟除全球有功功率作为两个维度潜在变量的电力信息的副编码器,3)从每个编码器提取的潜在变量的连接中映射未来需求的预测器,以及4)提供最重要电力信息的解释器组成。在一个家庭电力需求数据集上的实验表明,该模型不仅比传统模型具有更好的性能,而且能够通过分析输入、潜在变量和以时间序列形式预测的能源需求之间的相关性来解释结果。
{"title":"Electric Energy Demand Forecasting with Explainable Time-series Modeling","authors":"Jin-Young Kim, Sung-Bae Cho","doi":"10.1109/ICDMW51313.2020.00101","DOIUrl":"https://doi.org/10.1109/ICDMW51313.2020.00101","url":null,"abstract":"Recently, deep learning models are utilized to predict the energy consumption. However, to construct the smart grid systems, the conventional methods have limitation on explanatory power or require manual analysis. To overcome it, in this paper, we present a novel deep learning model that can infer the predicted results by calculating the correlation between the latent variables and output as well as forecast the future consumption in high performance. The proposed model is composed of 1) a main encoder that models the past energy demand, 2) a sub encoder that models electric information except global active power as the latent variable in two dimensions, 3) a predictor that maps the future demand from the concatenation of the latent variables extracted from each encoder, and 4) an explainer that provides the most significant electric information. Several experiments on a household electric energy demand dataset show that the proposed model not only has better performance than the conventional models, but also provides the ability to explain the results by analyzing the correlation of inputs, latent variables, and energy demand predicted in the form of time-series.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"349 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122041042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
AttentionFM: Incorporating Attention Mechanism and Factorization Machine for Credit Scoring 基于注意力机制和因子分解机的信用评分
Pub Date : 2020-11-01 DOI: 10.1109/ICDMW51313.2020.00056
Ying Liu, Wei Wang, Tianlin Zhang, Zhenyu Cui
Learning effective feature interactions behind user behavior is challenging in credit scoring. Existing machine learning methods seem to have a strong bias towards low-order or high-order interactions, or require expertise feature engineering. In this paper, we present a novel neural network approach AttentionFM, which incorporates Factorization Machines and Attention mechanism for credit scoring. The proposed model focuses more on critical features and emphasizes both low- and high-order feature interactions, with no need of manually feature engineering on raw data representation. Experimental results demonstrate that our proposed model significantly outperforms the baselines based on two public datasets.
在信用评分中,学习用户行为背后的有效功能交互是一个挑战。现有的机器学习方法似乎对低阶或高阶交互有强烈的偏见,或者需要专业的特征工程。在本文中,我们提出了一种新的神经网络方法AttentionFM,它结合了分解机器和注意机制来进行信用评分。该模型更关注关键特征,强调低阶和高阶特征交互,不需要对原始数据表示进行手动特征工程。实验结果表明,我们提出的模型明显优于基于两个公共数据集的基线。
{"title":"AttentionFM: Incorporating Attention Mechanism and Factorization Machine for Credit Scoring","authors":"Ying Liu, Wei Wang, Tianlin Zhang, Zhenyu Cui","doi":"10.1109/ICDMW51313.2020.00056","DOIUrl":"https://doi.org/10.1109/ICDMW51313.2020.00056","url":null,"abstract":"Learning effective feature interactions behind user behavior is challenging in credit scoring. Existing machine learning methods seem to have a strong bias towards low-order or high-order interactions, or require expertise feature engineering. In this paper, we present a novel neural network approach AttentionFM, which incorporates Factorization Machines and Attention mechanism for credit scoring. The proposed model focuses more on critical features and emphasizes both low- and high-order feature interactions, with no need of manually feature engineering on raw data representation. Experimental results demonstrate that our proposed model significantly outperforms the baselines based on two public datasets.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117273944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building knowledge graphs of homicide investigation chronologies 建立凶杀调查年表的知识图谱
Pub Date : 2020-11-01 DOI: 10.1109/ICDMW51313.2020.00115
Ritika Pandey, P. Brantingham, Craig D. Uchida, G. Mohler
Homicide investigations generate large and diverse data in the form of witness interview transcripts, physical evidence, photographs, DNA, etc. Homicide case chronologies are summaries of these data created by investigators that consist of short text-based entries documenting specific steps taken in the investigation. A chronology tracks the evolution of an investigation, including when and how persons involved and items of evidence became part of a case. In this article we discuss a framework for creating knowledge graphs of case chronologies that may aid investigators in analyzing homicide case data and also allow for post hoc analysis of the key features that determine whether a homicide is ultimately solved. Our method consists of 1) performing named entity recognition to determine witnesses, suspects, and detectives from chronology entries 2) using keyword expansion to identify documentary, physical, and forensic evidence in each entry and 3) linking entities and evidence to construct a homicide investigation knowledge graph. We compare the performance of several choices of methodologies for these sub-tasks using homicide investigation chronologies from Los Angeles, California. We then analyze the association between network statistics of the knowledge graphs and homicide solvability.
凶杀调查产生了大量多样的数据,包括证人采访记录、物证、照片、DNA等。杀人案年表是调查人员创建的这些数据的摘要,由记录调查中采取的具体步骤的简短文本条目组成。年表记录了一项调查的演变,包括涉及的人员和证据项目何时以及如何成为案件的一部分。在本文中,我们讨论了一个用于创建案件年表知识图谱的框架,它可以帮助调查人员分析杀人案数据,并允许对决定杀人案最终是否得到解决的关键特征进行事后分析。我们的方法包括:1)执行命名实体识别,从年表条目中确定证人、嫌疑人和侦探;2)使用关键字扩展来识别每个条目中的文件、物理和法医证据;3)将实体和证据联系起来,构建凶杀调查知识图谱。我们使用来自加利福尼亚州洛杉矶的凶杀调查年表来比较这些子任务的几种方法选择的性能。然后,我们分析了知识图的网络统计与凶杀可解性之间的关系。
{"title":"Building knowledge graphs of homicide investigation chronologies","authors":"Ritika Pandey, P. Brantingham, Craig D. Uchida, G. Mohler","doi":"10.1109/ICDMW51313.2020.00115","DOIUrl":"https://doi.org/10.1109/ICDMW51313.2020.00115","url":null,"abstract":"Homicide investigations generate large and diverse data in the form of witness interview transcripts, physical evidence, photographs, DNA, etc. Homicide case chronologies are summaries of these data created by investigators that consist of short text-based entries documenting specific steps taken in the investigation. A chronology tracks the evolution of an investigation, including when and how persons involved and items of evidence became part of a case. In this article we discuss a framework for creating knowledge graphs of case chronologies that may aid investigators in analyzing homicide case data and also allow for post hoc analysis of the key features that determine whether a homicide is ultimately solved. Our method consists of 1) performing named entity recognition to determine witnesses, suspects, and detectives from chronology entries 2) using keyword expansion to identify documentary, physical, and forensic evidence in each entry and 3) linking entities and evidence to construct a homicide investigation knowledge graph. We compare the performance of several choices of methodologies for these sub-tasks using homicide investigation chronologies from Los Angeles, California. We then analyze the association between network statistics of the knowledge graphs and homicide solvability.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123822929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
TrustedChain: A Blockchain-based Data Sharing Scheme for Supply Chain TrustedChain:基于区块链的供应链数据共享方案
Pub Date : 2020-11-01 DOI: 10.1109/ICDMW51313.2020.00128
Gejun Le, Qifeng Gu, Qingshan Jiang, Weiyi Lin
Supply chain involves mutual independent and distrusted stakeholders and large of sensitive order data. Sharing data among stakeholders is a essential project because that improves efficiency for various workflow among stakeholders. This paper proposes TrustedChain, a blockchain-based data sharing scheme for supply chain, which has two advantages: (a) trusted: we present a trusted environment, Trusted Environment (TE), based on blockchain to allow mutually distrusted stakeholders manage data collaboratively. (b) secure: we provide a secure design that first stores order forms in Distributed Database (DDB) and then records URI in Contract Account (CA) of TE. In addition, Supply-Business Contract Management (SCM) manages all CA and Node Communication (NC) allows communication over the network. The security analysis and evaluation prove the effectiveness of TrustedChain.
供应链涉及相互独立且互不信任的利益相关者和大量敏感的订单数据。在利益相关者之间共享数据是一个重要的项目,因为它可以提高利益相关者之间各种工作流程的效率。本文提出了一种基于区块链的供应链数据共享方案TrustedChain,该方案具有两个优点:(a)可信:我们提出了一个基于区块链的可信环境trusted environment (TE),允许互不信任的利益相关者协同管理数据。(b)安全:我们提供了一种安全的设计,首先将订单存储在分布式数据库(DDB)中,然后在TE的合同账户(CA)中记录URI。此外,供应业务合同管理(SCM)管理所有CA,节点通信(NC)允许通过网络进行通信。安全性分析和评估证明了TrustedChain的有效性。
{"title":"TrustedChain: A Blockchain-based Data Sharing Scheme for Supply Chain","authors":"Gejun Le, Qifeng Gu, Qingshan Jiang, Weiyi Lin","doi":"10.1109/ICDMW51313.2020.00128","DOIUrl":"https://doi.org/10.1109/ICDMW51313.2020.00128","url":null,"abstract":"Supply chain involves mutual independent and distrusted stakeholders and large of sensitive order data. Sharing data among stakeholders is a essential project because that improves efficiency for various workflow among stakeholders. This paper proposes TrustedChain, a blockchain-based data sharing scheme for supply chain, which has two advantages: (a) trusted: we present a trusted environment, Trusted Environment (TE), based on blockchain to allow mutually distrusted stakeholders manage data collaboratively. (b) secure: we provide a secure design that first stores order forms in Distributed Database (DDB) and then records URI in Contract Account (CA) of TE. In addition, Supply-Business Contract Management (SCM) manages all CA and Node Communication (NC) allows communication over the network. The security analysis and evaluation prove the effectiveness of TrustedChain.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"10 1-2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134470362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Classification of Dementia Associated Disorders Using EEG based Frequent Subgraph Technique 基于脑电图频繁子图技术的痴呆相关疾病分类
Pub Date : 2020-11-01 DOI: 10.1109/ICDMW51313.2020.00087
A. T. Adebisi, V. Gonuguntla, Ho-Won Lee, K. Veluvolu
Dementia associated disorders such as vascular dementia, frontotemporal dementia and Alzheimer dementia lead to cognitive impairment. Discrimination of dementia associated disorders has reamined a challenging task as they have overlapping underlying complex structures and display similar clinical features. In this work, we explore an EEG based frequent subgraph searching technique to characterize stages of brain functional networks of mild cognitive impairment (MCI), Alzheimer's disease (AD) and vascular dementia (VD) subjects in comparison with healthy control (HC) subjects. To identify the frequent subgraph related to dementia, we first formulated the brain functional network based on the phase information of EEG with mutual information as a measure. The whole network is then divided into sub-regions and frequent sub-graph search is performed. The identified frequent subgraphs were employed to discriminate the dementia associated disorders from the data recorded from 10 healthy and 32 dementia subjects in various stages. Results show that the proposed method has the potential to quantify the disease progression using brain functional connectivity and the identified networks can aid in the diagnosis of dementia associated disorders.
痴呆相关疾病,如血管性痴呆、额颞叶痴呆和阿尔茨海默氏痴呆,会导致认知障碍。痴呆症相关疾病的鉴别仍然是一项具有挑战性的任务,因为它们具有重叠的潜在复杂结构并表现出相似的临床特征。在这项工作中,我们探索了一种基于脑电图的频繁子图搜索技术,以表征轻度认知障碍(MCI)、阿尔茨海默病(AD)和血管性痴呆(VD)受试者与健康对照组(HC)受试者的脑功能网络阶段。为了识别与痴呆相关的频繁子图,我们首先以互信息为度量,建立了基于脑电相位信息的脑功能网络。然后将整个网络划分为子区域,并进行频繁的子图搜索。利用识别出的频繁子图从10名健康受试者和32名不同阶段的痴呆受试者的数据中区分痴呆相关疾病。结果表明,所提出的方法具有利用脑功能连接来量化疾病进展的潜力,并且确定的网络可以帮助诊断痴呆症相关疾病。
{"title":"Classification of Dementia Associated Disorders Using EEG based Frequent Subgraph Technique","authors":"A. T. Adebisi, V. Gonuguntla, Ho-Won Lee, K. Veluvolu","doi":"10.1109/ICDMW51313.2020.00087","DOIUrl":"https://doi.org/10.1109/ICDMW51313.2020.00087","url":null,"abstract":"Dementia associated disorders such as vascular dementia, frontotemporal dementia and Alzheimer dementia lead to cognitive impairment. Discrimination of dementia associated disorders has reamined a challenging task as they have overlapping underlying complex structures and display similar clinical features. In this work, we explore an EEG based frequent subgraph searching technique to characterize stages of brain functional networks of mild cognitive impairment (MCI), Alzheimer's disease (AD) and vascular dementia (VD) subjects in comparison with healthy control (HC) subjects. To identify the frequent subgraph related to dementia, we first formulated the brain functional network based on the phase information of EEG with mutual information as a measure. The whole network is then divided into sub-regions and frequent sub-graph search is performed. The identified frequent subgraphs were employed to discriminate the dementia associated disorders from the data recorded from 10 healthy and 32 dementia subjects in various stages. Results show that the proposed method has the potential to quantify the disease progression using brain functional connectivity and the identified networks can aid in the diagnosis of dementia associated disorders.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133765864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Learning Disentangled Representation of Residential Power Demand Peak via Convolutional-Recurrent Triplet Network 基于卷积-循环三重网络的住宅电力需求峰值解纠缠表征
Pub Date : 2020-11-01 DOI: 10.1109/ICDMW51313.2020.00110
Hyung-Jun Moon, Seok-Jun Bu, Sung-Bae Cho
In the time-series models for predicting residential energy consumption, the energy properties collected through multiple sensors usually include irregular and seasonal factors. The irregular pattern resulting from them is called peak demand, which is a major cause of performance degradation. In order to enhance the performance, we propose a convolutional-recurrent triplet network to learn and detect the demand peaks. The proposed model generates the latent space for demand peaks from data, which is transferred into convolutional neural network-long short-term memory (CNN-LSTM) to finally predict the future power demand. Experiments with the dataset of UCI household power consumption composed of a total of 2,075,259 time-series data show that the proposed model reduces the error by 23.63% and outperforms the state-of-the-art deep learning models including the CNN-LSTM. Especially, the proposed model improves the prediction performance by modeling the distribution of demand peaks in Euclidean space.
在住宅能耗预测的时间序列模型中,通过多个传感器采集的能源特性通常包含不规则和季节性因素。由此产生的不规则模式称为峰值需求,这是导致性能下降的主要原因。为了提高性能,我们提出了一种卷积-循环三重网络来学习和检测需求峰值。该模型从数据中生成需求峰值的潜在空间,并将其传递到卷积神经网络长短期记忆(CNN-LSTM)中,最终预测未来的电力需求。在包含2,075,259个时间序列数据的UCI家庭用电数据集上进行的实验表明,该模型的误差降低了23.63%,优于CNN-LSTM等最先进的深度学习模型。该模型通过对需求峰在欧氏空间中的分布进行建模,提高了预测性能。
{"title":"Learning Disentangled Representation of Residential Power Demand Peak via Convolutional-Recurrent Triplet Network","authors":"Hyung-Jun Moon, Seok-Jun Bu, Sung-Bae Cho","doi":"10.1109/ICDMW51313.2020.00110","DOIUrl":"https://doi.org/10.1109/ICDMW51313.2020.00110","url":null,"abstract":"In the time-series models for predicting residential energy consumption, the energy properties collected through multiple sensors usually include irregular and seasonal factors. The irregular pattern resulting from them is called peak demand, which is a major cause of performance degradation. In order to enhance the performance, we propose a convolutional-recurrent triplet network to learn and detect the demand peaks. The proposed model generates the latent space for demand peaks from data, which is transferred into convolutional neural network-long short-term memory (CNN-LSTM) to finally predict the future power demand. Experiments with the dataset of UCI household power consumption composed of a total of 2,075,259 time-series data show that the proposed model reduces the error by 23.63% and outperforms the state-of-the-art deep learning models including the CNN-LSTM. Especially, the proposed model improves the prediction performance by modeling the distribution of demand peaks in Euclidean space.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122762712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Design of Neural Network-based Boost Charging for Reducing the Charging Time of Li-ion Battery 缩短锂离子电池充电时间的神经网络升压充电设计
Pub Date : 2020-11-01 DOI: 10.1109/ICDMW51313.2020.00109
Sue Hyang Lim, S. Kim, Hyeong Min Lee, Sijun Kim, Y. Shin
Rapid charging of Li-ion batteries is vital for the commercialization of electric propulsion systems. But, during the fast-charging process, reduction in the battery capacity and temperature increases must be considered in real-time. Most Li-ion battery chargers follow the charging profile of an open-loop system, which has been determined based on prior knowledge. However, such a system does not reflect the temperature change of the battery and the degree of aging. Therefore, in this study, we propose a neural network-based charging profile model by applying a closed-loop system to reflect the various states of batteries; we also show two battery-state characteristics in addition to temperature. Consequently, we show battery characteristics other than those shown in the past, such as the battery voltage and temperature trends. In addition to the design of the charging current, an improvement of approximately 22 ∼ 50% based on the mean absolute error (MAE) is achieved. By considering the various characteristics, the long short-term memory performance is determined to be better when compared to the feed-forward neural network, and this performance is improved by 35% based on MAE.
锂离子电池的快速充电对于电力推进系统的商业化至关重要。但是,在快速充电过程中,必须实时考虑电池容量的减少和温度的升高。大多数锂离子电池充电器遵循开环系统的充电曲线,这是基于先验知识确定的。但是,这样的系统并不能反映电池的温度变化和老化程度。因此,在本研究中,我们提出了一种基于神经网络的充电剖面模型,该模型采用闭环系统来反映电池的各种状态;除了温度,我们还展示了两个电池状态特性。因此,我们展示了不同于以往的电池特性,例如电池电压和温度趋势。除了充电电流的设计之外,基于平均绝对误差(MAE)的改进约为22 ~ 50%。综合考虑各种特征,确定了与前馈神经网络相比长短期记忆性能更好,并且基于MAE的长短期记忆性能提高了35%。
{"title":"Design of Neural Network-based Boost Charging for Reducing the Charging Time of Li-ion Battery","authors":"Sue Hyang Lim, S. Kim, Hyeong Min Lee, Sijun Kim, Y. Shin","doi":"10.1109/ICDMW51313.2020.00109","DOIUrl":"https://doi.org/10.1109/ICDMW51313.2020.00109","url":null,"abstract":"Rapid charging of Li-ion batteries is vital for the commercialization of electric propulsion systems. But, during the fast-charging process, reduction in the battery capacity and temperature increases must be considered in real-time. Most Li-ion battery chargers follow the charging profile of an open-loop system, which has been determined based on prior knowledge. However, such a system does not reflect the temperature change of the battery and the degree of aging. Therefore, in this study, we propose a neural network-based charging profile model by applying a closed-loop system to reflect the various states of batteries; we also show two battery-state characteristics in addition to temperature. Consequently, we show battery characteristics other than those shown in the past, such as the battery voltage and temperature trends. In addition to the design of the charging current, an improvement of approximately 22 ∼ 50% based on the mean absolute error (MAE) is achieved. By considering the various characteristics, the long short-term memory performance is determined to be better when compared to the feed-forward neural network, and this performance is improved by 35% based on MAE.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123358265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Using Deep Generative Models to Boost Forecasting: A Phishing Prediction Case Study 使用深度生成模型促进预测:一个网络钓鱼预测案例研究
Pub Date : 2020-11-01 DOI: 10.1109/ICDMW51313.2020.00073
Syed Hasan Amin Mahmood, A. Abbasi
Time series predictions are important for various application domains. However, effective forecasting can be challenging in noisy contexts devoid of time series data encompassing stationarity, cyclicality, completeness, and non-sparseness. Cyber-security is a good example of such context. In organizational security settings, predicting time series related to emerging attacks could enhance cyber threat intelligence, resulting in timely and actionable insights at the operational, tactical, and strategic levels. In order to explore this gap, we propose a deep generative model-based framework for time series forecasting in noisy data environments. The proposed framework incorporates a novel ensembling strategy where generative adversarial networks and recurrent variational autoencoders are leveraged in unison with base predictors for enhanced regularization of time series predictive models. The framework is extensible, supporting different model combinations and analytical or iterative model fusion strategies. Using a test bed encompassing 10 years of weekly phishing attack volume data from 5 organizations in the technology, financial services, and social networking sectors, we show that the framework can boost predictive power for various standard time series models. Additional results reveal that the framework outperforms generative data augmentation approaches designed to enrich the input time series data matrices. Collectively, our findings suggest that utilizing generative models in more robust end-to-end setup can improve prediction in cyber threat intelligence contexts, as well as related problems involving challenging time series data.
时间序列预测对于各种应用领域都很重要。然而,在缺乏包含平稳性、周期性、完整性和非稀疏性的时间序列数据的嘈杂环境中,有效的预测是具有挑战性的。网络安全就是一个很好的例子。在组织安全设置中,预测与新出现的攻击相关的时间序列可以增强网络威胁情报,从而在操作、战术和战略层面上获得及时和可操作的见解。为了探索这一差距,我们提出了一个基于深度生成模型的框架,用于嘈杂数据环境下的时间序列预测。提出的框架结合了一种新的集成策略,其中生成对抗网络和循环变分自编码器与基本预测器一起用于增强时间序列预测模型的正则化。框架是可扩展的,支持不同的模型组合和分析或迭代模型融合策略。使用包含来自技术、金融服务和社交网络领域的5个组织10年来每周网络钓鱼攻击量数据的测试平台,我们表明该框架可以提高各种标准时间序列模型的预测能力。其他结果表明,该框架优于生成数据增强方法,旨在丰富输入时间序列数据矩阵。总的来说,我们的研究结果表明,在更强大的端到端设置中使用生成模型可以改善网络威胁情报背景下的预测,以及涉及具有挑战性的时间序列数据的相关问题。
{"title":"Using Deep Generative Models to Boost Forecasting: A Phishing Prediction Case Study","authors":"Syed Hasan Amin Mahmood, A. Abbasi","doi":"10.1109/ICDMW51313.2020.00073","DOIUrl":"https://doi.org/10.1109/ICDMW51313.2020.00073","url":null,"abstract":"Time series predictions are important for various application domains. However, effective forecasting can be challenging in noisy contexts devoid of time series data encompassing stationarity, cyclicality, completeness, and non-sparseness. Cyber-security is a good example of such context. In organizational security settings, predicting time series related to emerging attacks could enhance cyber threat intelligence, resulting in timely and actionable insights at the operational, tactical, and strategic levels. In order to explore this gap, we propose a deep generative model-based framework for time series forecasting in noisy data environments. The proposed framework incorporates a novel ensembling strategy where generative adversarial networks and recurrent variational autoencoders are leveraged in unison with base predictors for enhanced regularization of time series predictive models. The framework is extensible, supporting different model combinations and analytical or iterative model fusion strategies. Using a test bed encompassing 10 years of weekly phishing attack volume data from 5 organizations in the technology, financial services, and social networking sectors, we show that the framework can boost predictive power for various standard time series models. Additional results reveal that the framework outperforms generative data augmentation approaches designed to enrich the input time series data matrices. Collectively, our findings suggest that utilizing generative models in more robust end-to-end setup can improve prediction in cyber threat intelligence contexts, as well as related problems involving challenging time series data.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115744481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Restructuring of Hoeffding Trees for Trapezoidal Data Streams 梯形数据流Hoeffding树的重构
Pub Date : 2020-11-01 DOI: 10.1109/ICDMW51313.2020.00064
Christian Schreckenberger, Tim Glockner, H. Stuckenschmidt, Christian Bartelt
Trapezoidal Data Streams are an emerging topic, where not only the data volume increases, but also the data dimension, i.e. new features emerge. In this paper, we address the challenges that arise from this problem by providing a novel approach to restructure and prune Hoeffding trees. We evaluate our approach on synthetic datasets, where we can show that the approach significantly improves the performance compared to the baseline of an adjusted Hoeffding tree algorithm without restructuring and pruning.
梯形数据流是一个新兴的话题,它不仅增加了数据量,而且增加了数据维度,即出现了新的特征。在本文中,我们通过提供一种重组和修剪Hoeffding树的新方法来解决这个问题所带来的挑战。我们在合成数据集上评估了我们的方法,在那里我们可以证明,与调整后的Hoeffding树算法的基线相比,该方法在没有重组和修剪的情况下显着提高了性能。
{"title":"Restructuring of Hoeffding Trees for Trapezoidal Data Streams","authors":"Christian Schreckenberger, Tim Glockner, H. Stuckenschmidt, Christian Bartelt","doi":"10.1109/ICDMW51313.2020.00064","DOIUrl":"https://doi.org/10.1109/ICDMW51313.2020.00064","url":null,"abstract":"Trapezoidal Data Streams are an emerging topic, where not only the data volume increases, but also the data dimension, i.e. new features emerge. In this paper, we address the challenges that arise from this problem by providing a novel approach to restructure and prune Hoeffding trees. We evaluate our approach on synthetic datasets, where we can show that the approach significantly improves the performance compared to the baseline of an adjusted Hoeffding tree algorithm without restructuring and pruning.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"167 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114661349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Efficient Mining of Non-Dominated High Quantity-Utility Patterns 非支配型高数量效用模式的有效挖掘
Pub Date : 2020-11-01 DOI: 10.1109/ICDMW51313.2020.00097
J. Wu, Qian Teng, Gautam Srivastava, Matin Pirouz, Chun-Wei Lin
In this paper, we propose a new pattern called skyline quantity-utility pattern (SQUP) to provide better estimations in the decision-making process by considering quantity and utility together. Two algorithms respectively called SQUM-1 and SQUM-2 are presented to efficiently mine the set of SQUPs. Two new efficient utility-max structures are also mentioned for the reduction of the candidate itemsets respectively utilized in two developed algorithms. Our in-depth experimental results prove that our proposed algorithms achieve good performance in terms of runtime and memory usage.
本文提出了一种新的天际线数量-效用模式(SQUP),以便在决策过程中更好地综合考虑数量和效用。提出了SQUM-1和SQUM-2两种算法来有效地挖掘squp集。本文还提出了两种新的有效的效用最大化结构,分别用于两种开发的算法中候选项集的缩减。我们的深入实验结果证明,我们提出的算法在运行时间和内存使用方面取得了良好的性能。
{"title":"Efficient Mining of Non-Dominated High Quantity-Utility Patterns","authors":"J. Wu, Qian Teng, Gautam Srivastava, Matin Pirouz, Chun-Wei Lin","doi":"10.1109/ICDMW51313.2020.00097","DOIUrl":"https://doi.org/10.1109/ICDMW51313.2020.00097","url":null,"abstract":"In this paper, we propose a new pattern called skyline quantity-utility pattern (SQUP) to provide better estimations in the decision-making process by considering quantity and utility together. Two algorithms respectively called SQUM-1 and SQUM-2 are presented to efficiently mine the set of SQUPs. Two new efficient utility-max structures are also mentioned for the reduction of the candidate itemsets respectively utilized in two developed algorithms. Our in-depth experimental results prove that our proposed algorithms achieve good performance in terms of runtime and memory usage.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123810350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2020 International Conference on Data Mining Workshops (ICDMW)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1