首页 > 最新文献

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)最新文献

英文 中文
FlexAdapt: Flexible Cycle-Consistent Adversarial Domain Adaptation FlexAdapt:灵活的周期-一致的对抗领域适应
Akhil Mathur, Anton Isopoussu, F. Kawsar, N. Bianchi-Berthouze, N. Lane
Unsupervised domain adaptation is emerging as a powerful technique to improve the generalizability of deep learning models to new image domains without using any labeled data in the target domain. In the literature, solutions which perform cross-domain feature-matching (e.g., ADDA), pixel-matching (CycleGAN), and combination of the two (e.g., CyCADA) have been proposed for unsupervised domain adaptation. Many of these approaches make a strong assumption that the source and target label spaces are the same, however in the real-world, this assumption does not hold true. In this paper, we propose a novel solution, FlexAdapt, which extends the state-of-the-art unsupervised domain adaptation approach of CyCADA to scenarios where the label spaces in source and target domains are only partially overlapped. Our solution beats a number of state-of-the-art baseline approaches by as much as 29% in some scenarios, and represent a way forward for applying domain adaptation techniques in the real world.
无监督域自适应是一种强大的技术,它可以在不使用目标域中任何标记数据的情况下提高深度学习模型对新图像域的泛化能力。在文献中,已经提出了跨域特征匹配(例如ADDA),像素匹配(CycleGAN)以及两者结合(例如CyCADA)的解决方案,用于无监督域自适应。这些方法中的许多都假定源标签空间和目标标签空间是相同的,但是在现实世界中,这个假定并不成立。在本文中,我们提出了一种新颖的解决方案FlexAdapt,它将CyCADA最先进的无监督域自适应方法扩展到源域和目标域的标签空间仅部分重叠的场景。在某些情况下,我们的解决方案比许多最先进的基线方法高出29%,并且代表了在现实世界中应用领域自适应技术的前进方向。
{"title":"FlexAdapt: Flexible Cycle-Consistent Adversarial Domain Adaptation","authors":"Akhil Mathur, Anton Isopoussu, F. Kawsar, N. Bianchi-Berthouze, N. Lane","doi":"10.1109/ICMLA.2019.00155","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00155","url":null,"abstract":"Unsupervised domain adaptation is emerging as a powerful technique to improve the generalizability of deep learning models to new image domains without using any labeled data in the target domain. In the literature, solutions which perform cross-domain feature-matching (e.g., ADDA), pixel-matching (CycleGAN), and combination of the two (e.g., CyCADA) have been proposed for unsupervised domain adaptation. Many of these approaches make a strong assumption that the source and target label spaces are the same, however in the real-world, this assumption does not hold true. In this paper, we propose a novel solution, FlexAdapt, which extends the state-of-the-art unsupervised domain adaptation approach of CyCADA to scenarios where the label spaces in source and target domains are only partially overlapped. Our solution beats a number of state-of-the-art baseline approaches by as much as 29% in some scenarios, and represent a way forward for applying domain adaptation techniques in the real world.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131333586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Towards the Integration of a Post-Hoc Interpretation Step into the Machine Learning Workflow for IoT Botnet Detection 将事后解释步骤集成到物联网僵尸网络检测的机器学习工作流程中
S. Nõmm, Alejandro Guerra-Manzanares, Hayretdin Bahsi
The analysis of the interplay between the feature selection and the post-hoc local interpretation steps in a machine learning workflow followed for IoT botnet detection constitutes the research scope of the present paper. While the application of machine learning-based techniques has become a trend in cyber security, the main focus has been almost on detection accuracy. However, providing the relevant explanation for a detection decision is a vital requirement in a tiered incident handling processes of the contemporary security operations centers. Moreover, the design of intrusion detection systems in IoT networks has to take the limitations of the computational resources into consideration. Therefore, resource limitations in addition to human element of incident handling necessitate considering feature selection and interpretability at the same time in machine learning workflows. In this paper, first, we analyzed the selection of features and its implication on the data accuracy. Second, we investigated the impact of feature selection on the explanations generated at the post-hoc interpretation phase. We utilized a filter method, Fisher's Score and Local Interpretable Model-Agnostic Explanation (LIME) at feature selection and post-hoc interpretation phases, respectively. To evaluate the quality of explanations, we proposed a metric that reflects the need of the security analysts. It is demonstrated that the application of both steps for the particular case of IoT botnet detection may result in highly accurate and interpretable learning models induced by fewer features. Our metric enables us to evaluate the detection accuracy and interpretability in an integrated way.
分析物联网僵尸网络检测所遵循的机器学习工作流程中特征选择和事后局部解释步骤之间的相互作用构成了本文的研究范围。虽然基于机器学习技术的应用已成为网络安全领域的一种趋势,但主要焦点几乎集中在检测准确性上。然而,在当代安全操作中心的分层事件处理流程中,为检测决策提供相关解释是一个至关重要的需求。此外,物联网网络中入侵检测系统的设计必须考虑到计算资源的局限性。因此,除了事件处理的人为因素外,资源限制还需要在机器学习工作流中同时考虑特征选择和可解释性。本文首先分析了特征的选择及其对数据精度的影响。其次,我们研究了特征选择对事后解释阶段产生的解释的影响。我们分别在特征选择和事后解释阶段使用了过滤方法、Fisher’s Score和局部可解释模型不可知论解释(LIME)。为了评估解释的质量,我们提出了一个反映证券分析师需求的度量。研究表明,在物联网僵尸网络检测的特定情况下,这两个步骤的应用可能会导致由较少特征诱导的高度准确和可解释的学习模型。我们的度量使我们能够以一种综合的方式评估检测的准确性和可解释性。
{"title":"Towards the Integration of a Post-Hoc Interpretation Step into the Machine Learning Workflow for IoT Botnet Detection","authors":"S. Nõmm, Alejandro Guerra-Manzanares, Hayretdin Bahsi","doi":"10.1109/ICMLA.2019.00193","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00193","url":null,"abstract":"The analysis of the interplay between the feature selection and the post-hoc local interpretation steps in a machine learning workflow followed for IoT botnet detection constitutes the research scope of the present paper. While the application of machine learning-based techniques has become a trend in cyber security, the main focus has been almost on detection accuracy. However, providing the relevant explanation for a detection decision is a vital requirement in a tiered incident handling processes of the contemporary security operations centers. Moreover, the design of intrusion detection systems in IoT networks has to take the limitations of the computational resources into consideration. Therefore, resource limitations in addition to human element of incident handling necessitate considering feature selection and interpretability at the same time in machine learning workflows. In this paper, first, we analyzed the selection of features and its implication on the data accuracy. Second, we investigated the impact of feature selection on the explanations generated at the post-hoc interpretation phase. We utilized a filter method, Fisher's Score and Local Interpretable Model-Agnostic Explanation (LIME) at feature selection and post-hoc interpretation phases, respectively. To evaluate the quality of explanations, we proposed a metric that reflects the need of the security analysts. It is demonstrated that the application of both steps for the particular case of IoT botnet detection may result in highly accurate and interpretable learning models induced by fewer features. Our metric enables us to evaluate the detection accuracy and interpretability in an integrated way.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"59 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131922812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
An Unsupervised Framework for Anomaly Detection in a Water Treatment System 水处理系统异常检测的无监督框架
Mayra Alexandra Macas Carrasco, Chunming Wu
Current Cyber-Physical Systems (CPSs) are sophisticated, complex, and equipped with networked sensors and actuators. As such, they have become further exposed to cyber-attacks. Recent catastrophic events have demonstrated that standard, human-based management of anomaly detection in complex systems is not efficient enough and have underlined the significance of automated detection, intelligent and rapid response. Nevertheless, existing anomaly detection frameworks usually are not capable of dealing with the dynamic and complicated nature of the CPSs. In this study, we introduce an unsupervised framework for anomaly detection based on an Attention-based Spatio-Temporal Autoencoder. In particular, we first construct statistical correlation matrices to characterize the system status across different time steps. Next, a 2D convolutional encoder is employed to encode the patterns of the correlation matrices, whereas an Attention-based Convolutional LSTM Encoder-Decoder (ConvLSTM-ED) is used to capture the temporal dependencies. More precisely, we introduce an input attention mechanism to adaptively select the most significant input features at each time step. Finally, the 2D convolutional decoder reconstructs the correlation matrices. The differences between the reconstructed correlation matrices and the original ones are used as indicators of anomalies. Extensive experimental analysis on data collected from all six stages of Secure Water Treatment (SWaT) testbed, a scaled-down version of a real-world industrial water treatment plant, demonstrates that the proposed model outperforms the state-of-the-art baseline techniques.
当前的信息物理系统(cps)是复杂的、复杂的,并且配备了网络传感器和执行器。因此,它们进一步暴露在网络攻击之下。最近的灾难性事件表明,在复杂系统中,标准的、以人为基础的异常检测管理是不够有效的,并强调了自动检测、智能和快速响应的重要性。然而,现有的异常检测框架通常无法处理cps的动态性和复杂性。在本研究中,我们引入了一种基于注意力的时空自编码器的无监督异常检测框架。特别是,我们首先构建统计相关矩阵来表征系统在不同时间步长的状态。接下来,使用二维卷积编码器对相关矩阵的模式进行编码,而使用基于注意力的卷积LSTM编码器-解码器(ConvLSTM-ED)来捕获时间依赖性。更准确地说,我们引入了一种输入注意机制来自适应地选择每个时间步最重要的输入特征。最后,二维卷积解码器重建相关矩阵。将重建的相关矩阵与原始相关矩阵的差异作为异常指标。对从安全水处理(SWaT)测试平台(一个实际工业水处理厂的缩小版)的所有六个阶段收集的数据进行了广泛的实验分析,表明所提出的模型优于最先进的基线技术。
{"title":"An Unsupervised Framework for Anomaly Detection in a Water Treatment System","authors":"Mayra Alexandra Macas Carrasco, Chunming Wu","doi":"10.1109/ICMLA.2019.00212","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00212","url":null,"abstract":"Current Cyber-Physical Systems (CPSs) are sophisticated, complex, and equipped with networked sensors and actuators. As such, they have become further exposed to cyber-attacks. Recent catastrophic events have demonstrated that standard, human-based management of anomaly detection in complex systems is not efficient enough and have underlined the significance of automated detection, intelligent and rapid response. Nevertheless, existing anomaly detection frameworks usually are not capable of dealing with the dynamic and complicated nature of the CPSs. In this study, we introduce an unsupervised framework for anomaly detection based on an Attention-based Spatio-Temporal Autoencoder. In particular, we first construct statistical correlation matrices to characterize the system status across different time steps. Next, a 2D convolutional encoder is employed to encode the patterns of the correlation matrices, whereas an Attention-based Convolutional LSTM Encoder-Decoder (ConvLSTM-ED) is used to capture the temporal dependencies. More precisely, we introduce an input attention mechanism to adaptively select the most significant input features at each time step. Finally, the 2D convolutional decoder reconstructs the correlation matrices. The differences between the reconstructed correlation matrices and the original ones are used as indicators of anomalies. Extensive experimental analysis on data collected from all six stages of Secure Water Treatment (SWaT) testbed, a scaled-down version of a real-world industrial water treatment plant, demonstrates that the proposed model outperforms the state-of-the-art baseline techniques.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132605923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Rare-Event Time Series Prediction: A Case Study of Solar Flare Forecasting 罕见事件时间序列预测:以太阳耀斑预测为例
Azim Ahmadzadeh, Berkay Aydin, Dustin J. Kempton, Maxwell Hostetter, R. Angryk, M. Georgoulis, Sushant S. Mahajan
We present a case study for time series prediction models in extreme class-imbalance problems. We have extracted multiple properties from the Space Weather ANalytics for Solar Flares (SWAN-SF) benchmark dataset which comprises of magnetic features from over 4075 active regions over a period of 9 years to create the forecasting dataset used in this study. In the extracted dataset, the class-imbalance ratio is 1:60, where the minority class is formed by instances of strong solar flares (GOES M-and X-class). This ratio reaches to 1:800 if we only consider the strongest class of flares (GOES X-class). This case of extreme imbalance, along with the temporal coherence of the sliced time series, provides us with an interesting set of challenges in the forecasting of scarce real-life phenomena. We have explored remedies to tackle the class-imbalance issue such as undersampling, oversampling and misclassification weights. In the process, we elaborate on common mistakes and pitfalls caused by ignoring the side effects of these remedies, including how and why they weaken the robustness of the trained models while seemingly improving the performance.
我们提出了一个极端类别失衡问题的时间序列预测模型的案例研究。我们从太阳耀斑空间天气分析(SWAN-SF)基准数据集中提取了多个属性,该数据集包括9年来超过4075个活跃区域的磁特征,以创建本研究中使用的预测数据集。在提取的数据集中,类不平衡比为1:60,其中少数类是由强太阳耀斑(GOES m级和x级)的实例形成的。如果我们只考虑最强级别的耀斑(GOES x级),这个比例达到1:80。这种极端不平衡的情况,以及切片时间序列的时间一致性,为我们在预测稀缺的现实生活现象方面提供了一系列有趣的挑战。我们探索了解决类不平衡问题的补救措施,如采样不足、过采样和错误的分类权重。在此过程中,我们详细说明了由于忽视这些补救措施的副作用而导致的常见错误和陷阱,包括它们如何以及为什么在表面上提高性能的同时削弱了训练模型的鲁棒性。
{"title":"Rare-Event Time Series Prediction: A Case Study of Solar Flare Forecasting","authors":"Azim Ahmadzadeh, Berkay Aydin, Dustin J. Kempton, Maxwell Hostetter, R. Angryk, M. Georgoulis, Sushant S. Mahajan","doi":"10.1109/ICMLA.2019.00293","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00293","url":null,"abstract":"We present a case study for time series prediction models in extreme class-imbalance problems. We have extracted multiple properties from the Space Weather ANalytics for Solar Flares (SWAN-SF) benchmark dataset which comprises of magnetic features from over 4075 active regions over a period of 9 years to create the forecasting dataset used in this study. In the extracted dataset, the class-imbalance ratio is 1:60, where the minority class is formed by instances of strong solar flares (GOES M-and X-class). This ratio reaches to 1:800 if we only consider the strongest class of flares (GOES X-class). This case of extreme imbalance, along with the temporal coherence of the sliced time series, provides us with an interesting set of challenges in the forecasting of scarce real-life phenomena. We have explored remedies to tackle the class-imbalance issue such as undersampling, oversampling and misclassification weights. In the process, we elaborate on common mistakes and pitfalls caused by ignoring the side effects of these remedies, including how and why they weaken the robustness of the trained models while seemingly improving the performance.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130779122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Learning Curve Estimation with Large Imbalanced Datasets 大型不平衡数据集的学习曲线估计
Aaron N. Richter, T. Khoshgoftaar
Datasets for machine learning are constantly increasing in size, along with computational requirements for processing the data. A useful exercise for machine learning experiments is to approximate model performance as dataset size increases. This can inform application building and data collection efforts as well as improve computational efficiency by using subsets of the data. In this paper, we evaluate a learning curve estimation method on three large imbalanced datasets. Estimation is performed by fitting an inverse power law model to a learning curve created on a small amount of data. We then explore how well this estimated curve fits to the full learning curve of each dataset. The method has been previously evaluated for small datasets (hundreds or thousands of instances), and in this study we show that the method is indeed effective for larger datasets with millions of instances. This is beneficial because only a few thousand instances are required to accurately estimate the performance of models using millions of instances. To the best of our knowledge, this is the first study to systematically explore the use of an inverse power law curve fitting method for big data.
机器学习的数据集规模不断增加,处理数据的计算需求也在不断增加。机器学习实验的一个有用的练习是随着数据集大小的增加来近似模型的性能。这可以通知应用程序构建和数据收集工作,并通过使用数据子集提高计算效率。本文在三个大型不平衡数据集上评估了一种学习曲线估计方法。估计是通过在少量数据上建立的学习曲线上拟合一个逆幂律模型来完成的。然后我们探索这个估计曲线与每个数据集的完整学习曲线的拟合程度。该方法之前已经对小型数据集(数百或数千个实例)进行了评估,在本研究中,我们表明该方法对于具有数百万个实例的大型数据集确实有效。这是有益的,因为只需要几千个实例就可以准确地估计使用数百万个实例的模型的性能。据我们所知,这是第一个系统地探索在大数据中使用逆幂律曲线拟合方法的研究。
{"title":"Learning Curve Estimation with Large Imbalanced Datasets","authors":"Aaron N. Richter, T. Khoshgoftaar","doi":"10.1109/ICMLA.2019.00135","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00135","url":null,"abstract":"Datasets for machine learning are constantly increasing in size, along with computational requirements for processing the data. A useful exercise for machine learning experiments is to approximate model performance as dataset size increases. This can inform application building and data collection efforts as well as improve computational efficiency by using subsets of the data. In this paper, we evaluate a learning curve estimation method on three large imbalanced datasets. Estimation is performed by fitting an inverse power law model to a learning curve created on a small amount of data. We then explore how well this estimated curve fits to the full learning curve of each dataset. The method has been previously evaluated for small datasets (hundreds or thousands of instances), and in this study we show that the method is indeed effective for larger datasets with millions of instances. This is beneficial because only a few thousand instances are required to accurately estimate the performance of models using millions of instances. To the best of our knowledge, this is the first study to systematically explore the use of an inverse power law curve fitting method for big data.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131020403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Using Convolutional Neural Networks to Extract Keywords and Keyphrases: A Case Study for Foodborne Illnesses 使用卷积神经网络提取关键字和关键词:以食源性疾病为例
Jingjing Wang, Fei Song, Kavita Walia, Jeffery Farber, R. Dara
Keywords and keyphrases are important for Natural Language Processing (NLP) applications such as document classification, information retrieval, and topic identification. They are also useful for capturing different classes of entities from content related to healthcare, biology, food science, and journalism fields. There are different approaches to extract keywords and keyphrases. Deep learning approaches have achieved high-performance results in terms of keywords and keyphrase extraction. However, among deep learning approaches, Convolutional Neural Network (CNN) potentials have not been fully explored as a technique for extracting keywords and keyphrases. In this work, we performed a comparative study using a benchmark dataset, the IEEE Xplore collection to test the CNN generalization ability in selecting keywords and keyphrases. In addition, we further collected a corpus in the field of foodborne illness outbreaks. We utilize this corpus to develop a CNN-based identification approach of keywords and keyphrases related to foodborne illnesses. Results were compared with several supervised (KEA, GuidedLDA) and unsupervised (LDA) machine learning algorithms. CNN outperformed these algorithms in selecting relevant keywords and keyphrases for foodborne illnesses. The findings of this study have also confirmed superiority of CNN-based algorithm for keyphrase extraction to other machine learning approaches.
关键字和关键短语对于文档分类、信息检索和主题识别等自然语言处理(NLP)应用非常重要。它们对于从与医疗保健、生物学、食品科学和新闻领域相关的内容中捕获不同类别的实体也很有用。提取关键字和关键短语有不同的方法。深度学习方法在关键字和关键短语提取方面取得了高性能的结果。然而,在深度学习方法中,卷积神经网络(CNN)作为提取关键字和关键短语的技术尚未得到充分的探索。在这项工作中,我们使用基准数据集IEEE Xplore集合进行了比较研究,以测试CNN在选择关键字和关键短语方面的泛化能力。此外,我们进一步收集了食源性疾病暴发领域的语料库。我们利用这个语料库来开发一个基于cnn的与食源性疾病相关的关键字和关键短语识别方法。结果与几种有监督(KEA, GuidedLDA)和无监督(LDA)机器学习算法进行了比较。CNN在选择食源性疾病的相关关键词和关键词方面优于这些算法。本研究的发现也证实了基于cnn的关键词提取算法相对于其他机器学习方法的优越性。
{"title":"Using Convolutional Neural Networks to Extract Keywords and Keyphrases: A Case Study for Foodborne Illnesses","authors":"Jingjing Wang, Fei Song, Kavita Walia, Jeffery Farber, R. Dara","doi":"10.1109/ICMLA.2019.00228","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00228","url":null,"abstract":"Keywords and keyphrases are important for Natural Language Processing (NLP) applications such as document classification, information retrieval, and topic identification. They are also useful for capturing different classes of entities from content related to healthcare, biology, food science, and journalism fields. There are different approaches to extract keywords and keyphrases. Deep learning approaches have achieved high-performance results in terms of keywords and keyphrase extraction. However, among deep learning approaches, Convolutional Neural Network (CNN) potentials have not been fully explored as a technique for extracting keywords and keyphrases. In this work, we performed a comparative study using a benchmark dataset, the IEEE Xplore collection to test the CNN generalization ability in selecting keywords and keyphrases. In addition, we further collected a corpus in the field of foodborne illness outbreaks. We utilize this corpus to develop a CNN-based identification approach of keywords and keyphrases related to foodborne illnesses. Results were compared with several supervised (KEA, GuidedLDA) and unsupervised (LDA) machine learning algorithms. CNN outperformed these algorithms in selecting relevant keywords and keyphrases for foodborne illnesses. The findings of this study have also confirmed superiority of CNN-based algorithm for keyphrase extraction to other machine learning approaches.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127805578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Deep Learning for Flood Forecasting and Monitoring in Urban Environments 城市环境中洪水预报与监测的深度学习
Charalampos Karyotis, Tomasz Maniak, F. Doctor, R. Iqbal, V. Palade, Raymond Tang
This paper describes the core computational mechanisms used by an urban flood forecasting and monitoring platform developed as part of a UK Newton Fund project in Malaysia. FLUD-FLood monitoring and forecasting platform for Urban Deployment - is a novel system aiming to deliver an effective and low cost urban flood forecasting solution, which is able to accurately forecast flood risk at street level, and deliver optimized recommendations to the relevant authorities as well as an early warning alerts to members of the public. This platform is based on a hybrid Deep Learning and Fuzzy Logic based architecture. As demonstrated by the experimental results and the analysis presented in this paper, this architecture enables the proposed system to account for factors that are not included in other modern flood forecasting systems, and simultaneously process high volumes of data originating from diverse data sources, in order to deliver accurate predictions concerning urban flood events
本文描述了作为英国牛顿基金在马来西亚项目的一部分开发的城市洪水预报和监测平台所使用的核心计算机制。“城市部署的洪水监测和预报平台”是一个新颖的系统,旨在提供有效和低成本的城市洪水预报解决方案,能够准确预测街道层面的洪水风险,并向有关当局提供优化建议,并向公众发出预警。该平台基于深度学习和模糊逻辑的混合架构。正如本文的实验结果和分析所证明的那样,该架构使所提出的系统能够考虑其他现代洪水预报系统中未包含的因素,并同时处理来自不同数据源的大量数据,以便提供有关城市洪水事件的准确预测
{"title":"Deep Learning for Flood Forecasting and Monitoring in Urban Environments","authors":"Charalampos Karyotis, Tomasz Maniak, F. Doctor, R. Iqbal, V. Palade, Raymond Tang","doi":"10.1109/ICMLA.2019.00227","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00227","url":null,"abstract":"This paper describes the core computational mechanisms used by an urban flood forecasting and monitoring platform developed as part of a UK Newton Fund project in Malaysia. FLUD-FLood monitoring and forecasting platform for Urban Deployment - is a novel system aiming to deliver an effective and low cost urban flood forecasting solution, which is able to accurately forecast flood risk at street level, and deliver optimized recommendations to the relevant authorities as well as an early warning alerts to members of the public. This platform is based on a hybrid Deep Learning and Fuzzy Logic based architecture. As demonstrated by the experimental results and the analysis presented in this paper, this architecture enables the proposed system to account for factors that are not included in other modern flood forecasting systems, and simultaneously process high volumes of data originating from diverse data sources, in order to deliver accurate predictions concerning urban flood events","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134487447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Client-Side Monitoring of HTTP Clusters Using Machine Learning Techniques 使用机器学习技术的HTTP集群客户端监控
R. Filipe, Filipe Araújo
Large online web sites are supported in the back-end by a cluster of servers behind a load balancer. Ensuring proper operation of the cluster with minimal monitoring efforts from the load balancer is necessary to ensure performance. Previous monitoring efforts require extensive data from the system and fail to include the client perspective. We monitor the cluster using machine learning techniques that process data collected and uploaded by web clients, an approach that might complement system-side information. To experiment our solution, we trained the machine learning algorithms in a cluster of 10 machines with a load balancer and evaluated the results of these algorithms when one of the machines is overloaded. While a fine-grained view of the state of the machines, may require much effort to accomplish, given the compensation effect of the remaining healthy machines, the results show that we can achieve a coarse grained view of the entire system, to produce relevant insight about the cluster.
大型在线网站在后端由负载平衡器后面的服务器集群支持。以最小的负载平衡器监控确保集群的正常运行是确保性能的必要条件。以前的监测工作需要来自系统的大量数据,而没有包括客户的观点。我们使用机器学习技术监控集群,该技术处理由web客户端收集和上传的数据,这种方法可能会补充系统端信息。为了实验我们的解决方案,我们在一个由10台机器组成的集群中使用负载平衡器训练机器学习算法,并在其中一台机器过载时评估这些算法的结果。虽然机器状态的细粒度视图可能需要很多努力才能完成,但考虑到剩余健康机器的补偿效应,结果表明我们可以实现整个系统的粗粒度视图,以产生有关集群的相关见解。
{"title":"Client-Side Monitoring of HTTP Clusters Using Machine Learning Techniques","authors":"R. Filipe, Filipe Araújo","doi":"10.1109/ICMLA.2019.00053","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00053","url":null,"abstract":"Large online web sites are supported in the back-end by a cluster of servers behind a load balancer. Ensuring proper operation of the cluster with minimal monitoring efforts from the load balancer is necessary to ensure performance. Previous monitoring efforts require extensive data from the system and fail to include the client perspective. We monitor the cluster using machine learning techniques that process data collected and uploaded by web clients, an approach that might complement system-side information. To experiment our solution, we trained the machine learning algorithms in a cluster of 10 machines with a load balancer and evaluated the results of these algorithms when one of the machines is overloaded. While a fine-grained view of the state of the machines, may require much effort to accomplish, given the compensation effect of the remaining healthy machines, the results show that we can achieve a coarse grained view of the entire system, to produce relevant insight about the cluster.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"461 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115868550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Brown Planthopper Damage Detection using Remote Sensing and Machine Learning 基于遥感和机器学习的褐飞虱损伤检测
D. Lakmal, Kumaran Kugathasan, V. Nanayakkara, S. Jayasena, Amal Perera, Lasantha Fernando
Every year paddy cultivators lose a significant amount of crop yield due to diseases and pests. Brown Planthopper (BPH) is one of the most common diseases that affect paddy cultivation. Sri Lankan government is struggling to make appropriate estimations regarding Brown Planthopper prevalence due to the absence of accurate and timely data. To solve this issue, a machine learning approach is proposed based on optical and synthetic aperture radar remote sensing data in this study. However, there is no previous effort for detecting Brown Planthopper attacks using machine learning and satellite remote sensing data under field conditions. This study consists of two phases. A time series classification based on SAR imagery is implemented to identify cultivated paddy fields in the first phase. Ratio and standard difference indices derived from optical satellite images are used in the second phase to identify regions affected by BPH attacks in paddy fields. Convolution neural network that is used in the first phase reports an accuracy of 96.20% for identifying cultivated paddy regions. A Support Vector Machine is used to detect areas damaged by BPH attacks in the second phase. The Combined approach of the first and the second phases shows promising results with an accuracy of 96.31% for detecting Brown Planthopper attacks.
每年,由于病虫害,水稻种植者损失了大量的作物产量。褐飞虱(BPH)是水稻种植中最常见的病害之一。由于缺乏准确和及时的数据,斯里兰卡政府正在努力对褐飞虱的流行率做出适当的估计。为了解决这一问题,本研究提出了一种基于光学和合成孔径雷达遥感数据的机器学习方法。然而,以前没有在野外条件下使用机器学习和卫星遥感数据来检测布朗飞虱的攻击。本研究分为两个阶段。第一阶段采用基于SAR影像的时间序列分类方法对耕地进行识别。第二阶段使用光学卫星图像的比值和标准差分指数来识别水田中受BPH影响的区域。第一阶段使用的卷积神经网络对水稻种植区域的识别准确率为96.20%。第二阶段使用支持向量机检测被BPH攻击破坏的区域。第一阶段和第二阶段相结合的方法检测Brown planth飞虱攻击的准确率达到96.31%。
{"title":"Brown Planthopper Damage Detection using Remote Sensing and Machine Learning","authors":"D. Lakmal, Kumaran Kugathasan, V. Nanayakkara, S. Jayasena, Amal Perera, Lasantha Fernando","doi":"10.1109/ICMLA.2019.00024","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00024","url":null,"abstract":"Every year paddy cultivators lose a significant amount of crop yield due to diseases and pests. Brown Planthopper (BPH) is one of the most common diseases that affect paddy cultivation. Sri Lankan government is struggling to make appropriate estimations regarding Brown Planthopper prevalence due to the absence of accurate and timely data. To solve this issue, a machine learning approach is proposed based on optical and synthetic aperture radar remote sensing data in this study. However, there is no previous effort for detecting Brown Planthopper attacks using machine learning and satellite remote sensing data under field conditions. This study consists of two phases. A time series classification based on SAR imagery is implemented to identify cultivated paddy fields in the first phase. Ratio and standard difference indices derived from optical satellite images are used in the second phase to identify regions affected by BPH attacks in paddy fields. Convolution neural network that is used in the first phase reports an accuracy of 96.20% for identifying cultivated paddy regions. A Support Vector Machine is used to detect areas damaged by BPH attacks in the second phase. The Combined approach of the first and the second phases shows promising results with an accuracy of 96.31% for detecting Brown Planthopper attacks.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123429922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Maximizing Customer Lifetime Value using Stacked Neural Networks: An Insurance Industry Application 利用堆叠神经网络最大化客户生命周期价值:保险行业应用
Gaddiel Desirena, Armando Diaz, Jalil Desirena, Ismael Moreno, Daniel Garcia
This paper proposes a recommender system based on two-stage neural network architecture that maximizes Customer Lifetime Value (CLV). The Stage-I neural network uses a self-attention mechanism and a Collaborative Metric Learning (CML) to generate product recommendations. The Stage-II neural network uses a neural network-based survival analysis to infer insurance product recommendations that maximize customer lifetime. The proposed stacked neural network model can be used as a generative model to explore different cross-sell scenarios. The applicability of the proposed recommendation system is evaluated using transactional data from an Australian insurance company. We validated our results against a state of the art self-attention recommendation system, successfully extending its functionality to include lifetime value.
提出了一种基于两阶段神经网络架构的客户终身价值最大化的推荐系统。第一阶段神经网络使用自注意机制和协同度量学习(CML)来生成产品推荐。第二阶段神经网络使用基于神经网络的生存分析来推断最大化客户生命周期的保险产品建议。所提出的堆叠神经网络模型可以作为一个生成模型来探索不同的交叉销售场景。使用来自澳大利亚保险公司的交易数据来评估所提出的推荐系统的适用性。我们用最先进的自我关注推荐系统验证了我们的结果,成功地扩展了它的功能,包括终身价值。
{"title":"Maximizing Customer Lifetime Value using Stacked Neural Networks: An Insurance Industry Application","authors":"Gaddiel Desirena, Armando Diaz, Jalil Desirena, Ismael Moreno, Daniel Garcia","doi":"10.1109/ICMLA.2019.00101","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00101","url":null,"abstract":"This paper proposes a recommender system based on two-stage neural network architecture that maximizes Customer Lifetime Value (CLV). The Stage-I neural network uses a self-attention mechanism and a Collaborative Metric Learning (CML) to generate product recommendations. The Stage-II neural network uses a neural network-based survival analysis to infer insurance product recommendations that maximize customer lifetime. The proposed stacked neural network model can be used as a generative model to explore different cross-sell scenarios. The applicability of the proposed recommendation system is evaluated using transactional data from an Australian insurance company. We validated our results against a state of the art self-attention recommendation system, successfully extending its functionality to include lifetime value.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123653629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1