2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文中文

Machine Learning for Optimum CT-Prediction for qPCR qPCR最佳ct预测的机器学习

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2016-12-01 DOI: 10.1109/ICMLA.2016.0103

M. Günay, Evgin Göçeri, R. Balasubramaniyan

Introduction of fluorescence-based Real-Time PCR (RT-PCR) is increasingly used to detect multiple pathogens simultaneously and rapidly by gene expression analysis of PCR amplification data. PCR data is analyzed often by setting an arbitrary threshold that intersect the signal curve in its exponential phase if it exists. The point at which the curve crosses the threshold is called Threshold Cycle (CT) for positive samples. On the other, when such cross of threshold does not occur, the sample is identified as negative. This simple and arbitrary however not an elagant definition of CT value sometimes leads to conclusions that are either false positive or negative. Therefore, the purpose of this paper is to present a stable and consistent alternative approach that is based on machine learning for the definition and determination of CT values.

基于荧光的实时荧光聚合酶链式反应(RT-PCR)越来越多地应用于通过分析PCR扩增数据的基因表达来同时快速检测多种病原体。PCR数据通常通过设置任意阈值来分析，该阈值与信号曲线在指数阶段相交(如果存在)。对于阳性样本，曲线越过阈值的点称为阈值周期(CT)。另一方面，当这种阈值的交叉没有发生时，样本被识别为阴性。这种简单武断的CT值定义有时会导致假阳性或假阴性的结论。因此，本文的目的是提出一种基于机器学习的稳定和一致的替代方法来定义和确定CT值。

引用次数: 10

Efficient Content Replacement in Wireless Content Delivery Network with Cooperative Caching 基于协同缓存的无线内容分发网络中的高效内容替换

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2016-12-01 DOI: 10.1109/ICMLA.2016.0096

Jihoon Sung, Kyounghye Kim, Junhyuk Kim, J. Rhee

Wireless content delivery networks (WCDNs) have received attention as a promising solution to reduce the network congestion caused by rapidly growing demands for mobile content. The amount of reduced congestion is intuitively proportional to the hit ratio in a WCDN. Cooperation among cache servers is strongly required to maximize the hit ratio in a WCDN where each cache server is equipped with a small-size cache storage space. In this paper, we address a content replacement problem that deals with how to manage contents in a limited cache storage space in a reactive manner to cope with a dynamic content demand over time. As a new challenge, we apply reinforcement learning, which is Q-learning, to the content replacement problem in a WCDN with coooperative caching. We model the content replacement problem as a Markov Decision Process (MDP) and finally propose an efficient content replacement strategy to maximize the hit ratio based on a multi-agent Q-learning scheme. Simulation results exhibit that the proposed strategy contributes to achieving better content delivery performance in delay due to a higher hit ratio, compared to typical existing schemes of least recently used (LRU) and least frequently used (LFU).

无线内容分发网络(wcdn)作为一种很有前途的解决方案，已受到人们的关注，以减少因移动内容需求快速增长而引起的网络拥塞。减少拥塞的数量直观地与WCDN中的命中率成正比。在WCDN中，每个缓存服务器都配备了一个小的缓存存储空间，因此迫切需要缓存服务器之间的合作来最大化命中率。在本文中，我们解决了一个内容替换问题，该问题涉及如何以响应式方式管理有限缓存存储空间中的内容，以应对随时间变化的动态内容需求。作为一个新的挑战，我们将强化学习即q学习应用于WCDN中具有协同缓存的内容替换问题。我们将内容替换问题建模为马尔可夫决策过程(MDP)，最后提出了一种基于多智能体q -学习方案的高效内容替换策略，以最大化命中率。仿真结果表明，与现有典型的最近最少使用(LRU)和最不频繁使用(LFU)方案相比，由于更高的命中率，所提出的策略有助于实现更好的延迟内容交付性能。

{"title":"Efficient Content Replacement in Wireless Content Delivery Network with Cooperative Caching","authors":"Jihoon Sung, Kyounghye Kim, Junhyuk Kim, J. Rhee","doi":"10.1109/ICMLA.2016.0096","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0096","url":null,"abstract":"Wireless content delivery networks (WCDNs) have received attention as a promising solution to reduce the network congestion caused by rapidly growing demands for mobile content. The amount of reduced congestion is intuitively proportional to the hit ratio in a WCDN. Cooperation among cache servers is strongly required to maximize the hit ratio in a WCDN where each cache server is equipped with a small-size cache storage space. In this paper, we address a content replacement problem that deals with how to manage contents in a limited cache storage space in a reactive manner to cope with a dynamic content demand over time. As a new challenge, we apply reinforcement learning, which is Q-learning, to the content replacement problem in a WCDN with coooperative caching. We model the content replacement problem as a Markov Decision Process (MDP) and finally propose an efficient content replacement strategy to maximize the hit ratio based on a multi-agent Q-learning scheme. Simulation results exhibit that the proposed strategy contributes to achieving better content delivery performance in delay due to a higher hit ratio, compared to typical existing schemes of least recently used (LRU) and least frequently used (LFU).","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127862318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

Nonlinear Dimensionality Reduction by Unit Ball Embedding (UBE) and Its Application to Image Clustering 单位球嵌入非线性降维及其在图像聚类中的应用

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2016-12-01 DOI: 10.1109/ICMLA.2016.0177

Behrouz Haji Soleimani, S. Matwin

The paper presents an unsupervised nonlinear dimensionality reduction algorithm called Unit Ball Embedding (UBE). Many high-dimensional data, such as object or face images, lie on a union of low-dimensional subspaces which are often called manifolds. The proposed method is able to learn the structure of these manifolds by exploiting the local neighborhood arrangement around each point. It tries to preserve the local structure by minimizing a cost function that measures the discrepancy between similarities of points in the high-dimensional data and similarities of points in the low-dimensional embedding. The cost function is proposed in a way that it provides a hyper-spherical representation of points in the low-dimensional embedding. Visualizations of our method on different datasets show that it creates large gaps between the manifolds and maximizes the separability of them. As a result, it notably improves the quality of unsupervised machine learning tasks (e.g. clustering). UBE is successfully applied on image datasets such as faces, handwritten digits, and objects and the results of clustering on the low-dimensional embedding show significant improvement over existing dimensionality reduction methods.

提出了一种无监督非线性降维算法——单位球嵌入(UBE)。许多高维数据，如物体或人脸图像，位于低维子空间的并集上，这些子空间通常被称为流形。该方法通过利用每个点周围的局部邻域排列来学习流形的结构。它试图通过最小化代价函数来保持局部结构，该代价函数测量高维数据中点的相似性与低维嵌入中点的相似性之间的差异。成本函数的提出方式是在低维嵌入中提供点的超球面表示。我们的方法在不同数据集上的可视化显示，它在流形之间创建了很大的间隙，并最大化了它们的可分离性。因此，它显著提高了无监督机器学习任务(例如聚类)的质量。UBE成功应用于人脸、手写数字、物体等图像数据集，在低维嵌入上聚类的结果比现有降维方法有显著改善。

{"title":"Nonlinear Dimensionality Reduction by Unit Ball Embedding (UBE) and Its Application to Image Clustering","authors":"Behrouz Haji Soleimani, S. Matwin","doi":"10.1109/ICMLA.2016.0177","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0177","url":null,"abstract":"The paper presents an unsupervised nonlinear dimensionality reduction algorithm called Unit Ball Embedding (UBE). Many high-dimensional data, such as object or face images, lie on a union of low-dimensional subspaces which are often called manifolds. The proposed method is able to learn the structure of these manifolds by exploiting the local neighborhood arrangement around each point. It tries to preserve the local structure by minimizing a cost function that measures the discrepancy between similarities of points in the high-dimensional data and similarities of points in the low-dimensional embedding. The cost function is proposed in a way that it provides a hyper-spherical representation of points in the low-dimensional embedding. Visualizations of our method on different datasets show that it creates large gaps between the manifolds and maximizes the separability of them. As a result, it notably improves the quality of unsupervised machine learning tasks (e.g. clustering). UBE is successfully applied on image datasets such as faces, handwritten digits, and objects and the results of clustering on the low-dimensional embedding show significant improvement over existing dimensionality reduction methods.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128993519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Error Detection of Ocean Depth Series Data with Area Partitioning and Using Sliding Window 基于区域划分和滑动窗口的海洋深度序列数据误差检测

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2016-12-01 DOI: 10.1109/ICMLA.2016.0186

S. Hayashi, S. Ono, S. Hosoda, M. Numao, Ken-ichi Fukui

In the ocean around the world, depth series ocean data of temperature and salinity have being measured. However, it is difficult to discriminate the errors from the normal data since the variation of ocean areas are different. In this research, using hierarchical clustering, we partitioned the ocean into some areas so that the ocean data have the same variation in each area. Then, transforming the ocean data into sets of sliding windows in consideration of depth series, we applied some anomaly detection methodologies. Finally, we succeeded in assigning high anomaly scores on errors that seemed to be normal.

在世界各地的海洋中，已经测量了温度和盐度的深度序列海洋数据。然而，由于海洋区域的变化不同，因此很难将误差与正常数据区分开来。在本研究中，我们采用分层聚类的方法，将海洋划分为若干区域，使海洋数据在每个区域具有相同的变化。然后，将海洋数据转换为考虑深度序列的滑动窗口集，应用一些异常检测方法。最后，我们成功地为看似正常的错误分配了高异常分数。

引用次数: 2

Bee Colony Based Worker Reliability Estimation Algorithm in Microtask Crowdsourcing 微任务众包中基于蜂群的工蚁可靠性估计算法

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2016-12-01 DOI: 10.1109/ICMLA.2016.0127

Alireza Moayedikia, K. Ong, Yee Ling Boo, W. Yeoh

Estimation of worker reliability on microtask crowdsourcing platforms has gained attention from many researchers. On microtask platforms no worker is fully reliable for a task and it is likely that some workers are spammers, in the sense that they provide a random answer to collect the financial reward. Existence of spammers is harmful as they increase the cost of microtasking and will negatively affect the answer aggregation process. Hence, to discriminate spammers and non-spammers one needs to measure worker reliability to predict how likely that a worker put an effort in solving a task. In this paper we introduce a new reliability estimation algorithm works based on bee colony algorithm called REBECO. This algorithm relies on Gaussian process model to estimate reliability of workers dynamically. With bees that go in search of pollen, some are more successful than the others. This maps well to our problem, where some workers (i.e., bees) are more successful than other workers for a given task thus, giving rise to a reliability measure. Answer aggregation with respect to worker reliability rates has been considered as a suitable replacement for conventional majority voting. We compared REBECO with majority voting using two real world datasets. The results indicate that REBECO is able to outperform MV significantly.

微任务众包平台上工人可靠性的评估受到了许多研究者的关注。在微任务平台上，没有一个工作人员可以完全可靠地完成任务，而且有些工作人员很可能是垃圾邮件发送者，因为他们提供随机答案来收集经济奖励。垃圾邮件发送者的存在是有害的，因为他们增加了微任务的成本，并会对答案聚合过程产生负面影响。因此，要区分垃圾邮件发送者和非垃圾邮件发送者，需要衡量工作人员的可靠性，以预测工作人员努力解决任务的可能性。本文介绍了一种新的基于蜂群算法的可靠性估计算法——REBECO。该算法依靠高斯过程模型对工人的可靠性进行动态估计。对于寻找花粉的蜜蜂来说，有些蜜蜂比其他蜜蜂更成功。这很好地反映了我们的问题，其中一些工人(例如蜜蜂)在给定的任务中比其他工人更成功，因此产生了可靠性测量。关于工人可靠性的答案聚合被认为是传统多数投票的合适替代品。我们使用两个真实世界的数据集比较了REBECO和多数投票。结果表明，REBECO能够显著优于MV。

{"title":"Bee Colony Based Worker Reliability Estimation Algorithm in Microtask Crowdsourcing","authors":"Alireza Moayedikia, K. Ong, Yee Ling Boo, W. Yeoh","doi":"10.1109/ICMLA.2016.0127","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0127","url":null,"abstract":"Estimation of worker reliability on microtask crowdsourcing platforms has gained attention from many researchers. On microtask platforms no worker is fully reliable for a task and it is likely that some workers are spammers, in the sense that they provide a random answer to collect the financial reward. Existence of spammers is harmful as they increase the cost of microtasking and will negatively affect the answer aggregation process. Hence, to discriminate spammers and non-spammers one needs to measure worker reliability to predict how likely that a worker put an effort in solving a task. In this paper we introduce a new reliability estimation algorithm works based on bee colony algorithm called REBECO. This algorithm relies on Gaussian process model to estimate reliability of workers dynamically. With bees that go in search of pollen, some are more successful than the others. This maps well to our problem, where some workers (i.e., bees) are more successful than other workers for a given task thus, giving rise to a reliability measure. Answer aggregation with respect to worker reliability rates has been considered as a suitable replacement for conventional majority voting. We compared REBECO with majority voting using two real world datasets. The results indicate that REBECO is able to outperform MV significantly.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121186994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Classification of X-Ray Galaxy Clusters with Morphological Feature and Tree SVM 基于形态特征和树支持向量机的x射线星系团分类

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2016-12-01 DOI: 10.1109/ICMLA.2016.0124

Lei Wang, Zhixian Ma, Haiguang Xu, Jie Zhu

Since many sky-survey observations were performed, as well as appreciable amount of data were obtained, study on large-scale evolution of our Universe has become a field of interest. In this work, we concentrate on the X-ray astronomical samples from NASA's Chandra observatory, and propose an approach to classify galaxy clusters (GCs) based on their central gas profiles' morphological features. Firstly, the raw images are preprocessed, and the central gas profile are segmented. Then, the Fourier descriptors and wavelet moments are take advantaged to extract the morphological features. Finally, a tree structure classifier using support vector machine (SVM) is trained and aid us to categorize the X-ray astronomical observations. Experiments and applications of our classification method on the real X-ray astronomical samples were demonstrated, and comparison of our approach with the non-tree SVM classifier was also performed, which proved our approach is accurate and efficient.

由于进行了许多巡天观测，以及获得了相当数量的数据，对我们宇宙的大规模演化的研究已成为一个感兴趣的领域。在这项工作中，我们集中研究了来自美国宇航局钱德拉天文台的x射线天文样本，并提出了一种基于其中心气体剖面形态特征对星系团进行分类的方法。首先，对原始图像进行预处理，对中心瓦斯剖面进行分割;然后，利用傅里叶描述子和小波矩提取图像的形态特征。最后，利用支持向量机(SVM)训练树结构分类器，帮助我们对x射线天文观测进行分类。通过实验验证了该方法在实际x射线天文样本上的应用，并与非树支持向量机分类器进行了比较，验证了该方法的准确性和有效性。

引用次数: 1

An LED Based Indoor Localization System Using k-Means Clustering 使用 k-Means 聚类的基于 LED 的室内定位系统

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2016-12-01 DOI: 10.1109/ICMLA.2016.0048

M. Saadi, Touqeer Ahmad, Yan Zhao, L. Wuttisittikulkij

This paper introduces a novel visible light positioning (VLP) system using an un-supervised machine learning approach. Two transmitters consist of light emitting diodes (LEDs) which are modulated with 1 kHz and 2.5 kHz sinusoidal signals respectively. At the receiver end, the received signal strength (RSS) is calculated and a sparse grid/cube is constructed by measuring light intensity at different locations. A bilinear interpolation is then applied to create a dense grid of readings which is used for the training of a hierarchical k-means clustering system. For a given query LEDs reading, the trained clusters are used for position estimation by minimizing the distances between the readings and cluster centroids. Experimental results show that an average accuracy of 0.31m can be achieved for a room with the dimensions of 4.3 × 4 × 4 m3. We further compared the performance of two other clustering methods: k-medoids and fuzzy c-means however no significant improvement over the kmeans clustering is found.

本文介绍了一种采用无监督机器学习方法的新型可见光定位（VLP）系统。两个发射器由发光二极管（LED）组成，分别使用 1 kHz 和 2.5 kHz 正弦信号进行调制。在接收端，通过测量不同位置的光强度，计算接收信号强度（RSS）并构建稀疏网格/立方体。然后应用双线性插值法创建密集的读数网格，用于分层 k-means 聚类系统的训练。对于给定的 LED 读数查询，通过最小化读数与聚类中心点之间的距离，将训练好的聚类用于位置估计。实验结果表明，对于一个尺寸为 4.3 × 4 × 4 m3 的房间，平均准确度可达 0.31 米。我们进一步比较了其他两种聚类方法的性能：k-medoids 和模糊 c-means，但与 kmeans 聚类方法相比，没有发现明显的改进。

引用次数: 13

Time Series Classification Using Time Warping Invariant Echo State Networks 基于时间翘曲不变回声状态网络的时间序列分类

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2016-12-01 DOI: 10.1109/ICMLA.2016.0149

Pattreeya Tanisaro, G. Heidemann

For many years, neural networks have gained gigantic interest and their popularity is likely to continue because of the success stories of deep learning. Nonetheless, their applications are mostly limited to static and not temporal patterns. In this paper, we apply time warping invariant Echo State Networks (ESNs) to time-series classification tasks using datasets from various studies in the UCR archive. We also investigate the influence of ESN architecture and spectral radius of the network in view of general characteristics of data, such as dataset type, number of classes, and amount of training data. We evaluate our results comparing it to other state-of-the-art methods, using One Nearest Neighbor (1-NN) with Euclidean Distance (ED), Dynamic Time Warping (DTW) and best warping window DTW.

多年来，神经网络获得了巨大的兴趣，由于深度学习的成功故事，它们的受欢迎程度可能会继续下去。尽管如此，它们的应用程序大多局限于静态模式，而不是临时模式。在本文中，我们使用来自UCR档案中各种研究的数据集，将时间扭曲不变回声状态网络(esn)应用于时间序列分类任务。我们还根据数据的一般特征(如数据集类型、类数和训练数据量)研究了回声状态网络架构和网络谱半径的影响。我们将其与其他最先进的方法进行比较，评估我们的结果，使用具有欧几里得距离(ED)的一个最近邻(1-NN)，动态时间翘曲(DTW)和最佳翘曲窗口DTW。

引用次数: 64

Nonlinear Metric Learning for Semi-Supervised Learning via Coherent Point Drifting 基于相干点漂移的半监督学习非线性度量学习

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2016-12-01 DOI: 10.1109/ICMLA.2016.0058

P. Zhang, Bibo Shi, Charles D. Smith, Jundong Liu

In this paper, we propose a nonlinear metric learning framework to boost the performance of semi-supervised learning (SSL) algorithms. Constructed on top of Laplacian SVM (LapSVM), the proposed method learns a smooth nonlinear feature space transformation that makes the input data points more linearly separable. Coherent point drifting (CPD) is utilized as the geometric model with the consideration of its remarkable expressive power in generating sophisticated yet smooth deformations. Our framework has broad applicability, and it can be integrated with many other SSL classifiers than LapSVM. Experiments performed on synthetic and real world datasets show the effectiveness of our CPD-LapSVM over the state-of-the-art metric learning solutions in SSL.

在本文中，我们提出一个非线性度量学习框架来提高半监督学习(SSL)算法的性能。该方法建立在拉普拉斯支持向量机(LapSVM)的基础上，学习光滑的非线性特征空间变换，使输入数据点更加线性可分。采用相干点漂移(CPD)作为几何模型，考虑到它在生成复杂而光滑的变形方面具有出色的表现力。我们的框架具有广泛的适用性，它可以与LapSVM以外的许多其他SSL分类器集成。在合成数据集和真实世界数据集上进行的实验表明，我们的CPD-LapSVM优于SSL中最先进的度量学习解决方案。

引用次数: 4

Using Domain Knowledge Features for Wind Turbine Diagnostics 利用领域知识特征进行风力涡轮机诊断

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2016-12-01 DOI: 10.1109/ICMLA.2016.0056

R. Hu, K. Leahy, Ioannis C. Konstantakopoulos, D. Auslander, C. Spanos, A. Agogino

Maximising electricity production from wind requires improvement of wind turbine reliability. Component failures result in unscheduled or reactive maintenance on turbines which incurs significant downtime and, in turn, increases production cost, ultimately limiting the competitiveness of renewable energy. Thus, a critical task is the early detection of faults. To this end, we present a framework for fault detection using machine learning that uses Supervisory Control and Data Acquisition (SCADA) data from a large 3MW turbine, supplemented with features derived from this data that encapsulate expert knowledge about wind turbines. These new features are created using application domain knowledge that is general to large horizontal-axis wind turbines, including knowledge of the physical quantities measured by sensors, the approximate locations of the sensors, the time series behaviour of the system, and some statistics related to the interpretation of sensor measurements. We then use mRMR feature selection to select the most important of these features. The new feature set is used to train a support vector machine to detect faults. The classification performance using the new feature set is compared to performance using the original feature set. Use of the new feature set achieves an F1-score of 90%, an improvement of 27% compared to the original feature set.

最大限度地利用风能发电需要提高风力涡轮机的可靠性。组件故障导致涡轮机出现计划外或无功维护，从而导致大量停机，进而增加生产成本，最终限制了可再生能源的竞争力。因此，及早发现故障是一项关键任务。为此，我们提出了一个使用机器学习的故障检测框架，该框架使用来自大型3MW涡轮机的监控和数据采集(SCADA)数据，并补充了来自该数据的特征，这些特征封装了有关风力涡轮机的专家知识。这些新特性是使用大型水平轴风力涡轮机的通用应用领域知识创建的，包括传感器测量的物理量、传感器的大致位置、系统的时间序列行为以及与传感器测量解释相关的一些统计数据。然后我们使用mRMR特征选择来选择这些特征中最重要的。利用新特征集训练支持向量机进行故障检测。将使用新特征集的分类性能与使用原始特征集的性能进行比较。新特性集的使用获得了90%的f1分数，与原始特性集相比提高了27%。

{"title":"Using Domain Knowledge Features for Wind Turbine Diagnostics","authors":"R. Hu, K. Leahy, Ioannis C. Konstantakopoulos, D. Auslander, C. Spanos, A. Agogino","doi":"10.1109/ICMLA.2016.0056","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0056","url":null,"abstract":"Maximising electricity production from wind requires improvement of wind turbine reliability. Component failures result in unscheduled or reactive maintenance on turbines which incurs significant downtime and, in turn, increases production cost, ultimately limiting the competitiveness of renewable energy. Thus, a critical task is the early detection of faults. To this end, we present a framework for fault detection using machine learning that uses Supervisory Control and Data Acquisition (SCADA) data from a large 3MW turbine, supplemented with features derived from this data that encapsulate expert knowledge about wind turbines. These new features are created using application domain knowledge that is general to large horizontal-axis wind turbines, including knowledge of the physical quantities measured by sensors, the approximate locations of the sensors, the time series behaviour of the system, and some statistics related to the interpretation of sensor measurements. We then use mRMR feature selection to select the most important of these features. The new feature set is used to train a support vector machine to detect faults. The classification performance using the new feature set is compared to performance using the original feature set. Use of the new feature set achieves an F1-score of 90%, an improvement of 27% compared to the original feature set.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131892974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 20

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀