"Unsupervised Disaggregation for Non-intrusive Load Monitoring," by S. Pattem. 2012 11th International Conference on Machine Learning and Applications, December 12, 2012. DOI: 10.1109/ICMLA.2012.249
A method for unsupervised disaggregation of appliance signatures from smart meter data is presented. The primary feature used for unsupervised learning is abrupt transitions, or magnitude changes, in the power waveform. The method consists of a sequence of procedures for appliance signature identification, disaggregation using a hidden Markov model (HMM), and residual analysis. The key contributions are (a) a novel "segmented" application of the Viterbi algorithm for sequence decoding with the HMM, (b) details of establishing observation and state transition probabilities for the HMM, and (c) procedures for careful handling of low-power signatures. Results show that the method is effective for magnitude-based disaggregation and provide insights toward a more complete solution.
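Contribution (a) builds on standard Viterbi decoding for HMMs. The paper's "segmented" variant and its estimated probabilities are not reproduced here; the following is a minimal sketch of plain Viterbi decoding, with hypothetical parameters for a two-state (on/off) appliance:

```python
import math

def viterbi(obs, states, start_p, trans_p, emit_p):
    """Return the most likely state sequence for an observation sequence."""
    # V[t][s] = log-probability of the best path ending in state s at time t
    V = [{s: math.log(start_p[s]) + math.log(emit_p[s][obs[0]]) for s in states}]
    path = {s: [s] for s in states}
    for t in range(1, len(obs)):
        V.append({})
        new_path = {}
        for s in states:
            lp, prev = max(
                (V[t - 1][p] + math.log(trans_p[p][s]) + math.log(emit_p[s][obs[t]]), p)
                for p in states
            )
            V[t][s] = lp
            new_path[s] = path[prev] + [s]
        path = new_path
    best = max(states, key=lambda s: V[-1][s])
    return path[best]

# Hypothetical two-state appliance: 'off' tends to emit 'low' power, 'on' 'high'.
states = ('off', 'on')
start_p = {'off': 0.9, 'on': 0.1}
trans_p = {'off': {'off': 0.8, 'on': 0.2}, 'on': {'off': 0.2, 'on': 0.8}}
emit_p = {'off': {'low': 0.9, 'high': 0.1}, 'on': {'low': 0.1, 'high': 0.9}}
decoded = viterbi(['low', 'low', 'high', 'high', 'low'], states, start_p, trans_p, emit_p)
```

Log-probabilities are used to avoid underflow on long sequences; the quantized power levels and transition probabilities here are placeholders for the magnitude-change features the paper derives from the data.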
"Multi-label Collective Classification Using Adaptive Neighborhoods," by Tanwistha Saha, H. Rangwala, C. Domeniconi. 2012 11th International Conference on Machine Learning and Applications, December 12, 2012. DOI: 10.1109/ICMLA.2012.77
Multi-label learning in graph-based relational data has gained popularity in recent years due to the increasingly complex structures of real-world applications. Collective classification deals with the simultaneous classification of neighboring instances in relational data until a convergence criterion is reached. The rationale behind collective classification stems from the fact that an entity in a network (or in relational data) is most likely influenced by its neighboring entities and can be classified accordingly, based on the class assignments of its neighbors. Although extensive work has been done on collective classification of single-labeled data, the domain of multi-labeled relational data has not been sufficiently explored. In this paper, we propose a neighborhood ranking method for multi-label classification, which can be further used in the multi-label collective classification framework. We test our methods on real-world datasets and also discuss the relevance of our approach for other multi-labeled relational data. Our experimental results show that the use of ranking in neighborhood selection for collective classification improves the performance of the classifier.
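The abstract does not specify the ranking function, so the sketch below only illustrates the general pattern of collective classification over a ranked, truncated neighborhood; the common-neighbor similarity and the toy graph are assumptions, not the authors' method:

```python
from collections import Counter

def collective_classify(graph, seed_labels, sim, k=1, iters=10):
    """Iteratively label nodes from their top-k most similar labeled neighbors.

    graph: {node: [neighbors]}; seed_labels: known labels (kept fixed);
    sim(u, v): neighbor-ranking score, a stand-in for adaptive ranking.
    """
    labels = dict(seed_labels)
    for _ in range(iters):
        changed = False
        for node in graph:
            if node in seed_labels:
                continue
            # Rank the labeled part of the neighborhood, keep the k most similar.
            ranked = sorted((n for n in graph[node] if n in labels),
                            key=lambda n: sim(node, n), reverse=True)[:k]
            if not ranked:
                continue
            vote = Counter(labels[n] for n in ranked).most_common(1)[0][0]
            if labels.get(node) != vote:
                labels[node] = vote
                changed = True
        if not changed:   # convergence criterion: no label changed this pass
            break
    return labels

# Tiny example: two seed nodes; similarity = number of common neighbors.
graph = {'a': ['b', 'c'], 'b': ['a', 'c'], 'c': ['a', 'b', 'd'],
         'd': ['c', 'e', 'f'], 'e': ['d', 'f'], 'f': ['d', 'e']}
sim = lambda u, v: len(set(graph[u]) & set(graph[v]))
labels = collective_classify(graph, {'a': 'x', 'e': 'y'}, sim)
```

A full multi-label version would keep a label *set* per node and vote per label; this single-label skeleton only shows the ranked-neighborhood iteration.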
"A Novel Neural Network Based Control Method with Adaptive On-Line Training for DC-DC Converters," by H. Maruta, M. Motomura, F. Kurokawa. 2012 11th International Conference on Machine Learning and Applications, December 12, 2012. DOI: 10.1109/ICMLA.2012.152
This study presents a novel adaptive control method based on a neural network for DC-DC converters. The control method must adapt to changing conditions to obtain a high-performance DC-DC converter. In this study, neural network control is adopted to improve the transient response of DC-DC converters. It works in coordination with a conventional PID control to realize a highly adaptive method. The neural network is trained with data obtained on-line, so the control can adapt dynamically to changes in the input. The adaptation is realized by modifying the reference of the PID control. The effectiveness of the presented method is confirmed in simulations, whose results show that it contributes to realizing such adaptive control.
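The conventional PID half of such a scheme can be sketched as follows; the first-order plant and the gains are illustrative stand-ins (not a converter model), and the paper's on-line neural-network adjustment of the reference is not modeled:

```python
class PID:
    """Minimal discrete PID controller. In the paper's scheme, a neural
    network would adjust `reference` on-line; that part is omitted here."""
    def __init__(self, kp, ki, kd, reference):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.reference = reference
        self.integral = 0.0
        self.prev_error = None

    def update(self, measurement, dt):
        error = self.reference - measurement
        self.integral += error * dt
        derivative = 0.0 if self.prev_error is None else (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative

# Drive a crude first-order plant toward a 5.0 V reference.
pid = PID(kp=2.0, ki=5.0, kd=0.05, reference=5.0)
y, dt = 0.0, 0.01
for _ in range(500):
    u = pid.update(y, dt)
    y += 10.0 * (u - y) * dt   # plant: dy/dt = 10 * (u - y)
```

The integral term removes the steady-state error that a proportional-only controller would leave; the NN layer in the paper compensates instead by shifting the reference.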
"Multivariate Assessment of a Repair Program for a New York City Electrical Grid," by R. Passonneau, Ashish Tomar, Somnath Sarkar, Haimonti Dutta, Axinia Radeva. 2012 11th International Conference on Machine Learning and Applications, December 12, 2012. DOI: 10.1109/ICMLA.2012.208
We assess the impact of an inspection repair program administered to the secondary electrical grid in New York City. The question of interest is whether repairs reduce the incidence of future events that cause service disruptions ranging from minor to serious ones. A key challenge in defining treatment and control groups in the absence of a randomized experiment involved an inherent bias in the selection of electrical structures to be inspected in a given year. To compensate for the bias, we construct separate models for each year of the propensity for a structure to have an inspection repair. The propensity models account for differences across years in the structures that get inspected. To model the treatment outcome, we use a statistical approach based on the additive effects of many weak learners. Our results indicate that inspection repairs are more beneficial earlier in the five-year inspection cycle, which accords with the inherent bias to inspect structures in earlier years that are known to have problems.
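As a rough illustration of propensity modeling (not the authors' covariates or estimator), a logistic model of P(inspected | features) can be fit for one year and used to compare treated and control structures; the data below is synthetic:

```python
import math, random

def fit_propensity(X, treated, lr=0.1, epochs=2000):
    """Fit a logistic model P(treated | x) by batch gradient descent.
    The paper fits a separate model per year; one year is shown here."""
    w = [0.0] * len(X[0])
    b = 0.0
    n = len(X)
    for _ in range(epochs):
        gw = [0.0] * len(w)
        gb = 0.0
        for x, t in zip(X, treated):
            p = 1.0 / (1.0 + math.exp(-(sum(wi * xi for wi, xi in zip(w, x)) + b)))
            err = p - t   # gradient of log-loss w.r.t. the logit
            for i, xi in enumerate(x):
                gw[i] += err * xi
            gb += err
        w = [wi - lr * gi / n for wi, gi in zip(w, gw)]
        b -= lr * gb / n

    def propensity(x):
        return 1.0 / (1.0 + math.exp(-(sum(wi * xi for wi, xi in zip(w, x)) + b)))
    return propensity

# Synthetic illustration: structures with more past problems (higher score)
# are more likely to be selected for inspection, mimicking the selection bias.
random.seed(0)
X = [[random.random()] for _ in range(200)]
treated = [1 if x[0] + random.gauss(0, 0.3) > 0.5 else 0 for x in X]
p = fit_propensity(X, treated)
```

Once fitted, the propensity score lets one match or weight treated and untreated structures so that outcome comparisons are made between structures with similar inspection likelihood.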
"Real-Time Statistical Background Learning for Foreground Detection under Unstable Illuminations," by Dawei Li, Lihong Xu, E. Goodman. 2012 11th International Conference on Machine Learning and Applications, December 12, 2012. DOI: 10.1109/ICMLA.2012.85
This work proposes a fast background learning algorithm for foreground detection under changing illumination. The Gaussian mixture model (GMM) is an effective statistical model for background learning. We first focus on Titterington's online EM algorithm, which can be used for real-time unsupervised GMM learning, and then advocate a deterministic data assignment strategy to avoid Bayesian computation. The color of the foreground is apt to be influenced by environmental illumination, which usually produces undesirable effects for GMM updating; however, a collinear feature of pixel intensity under changing light is discovered in RGB color space. This feature is then used as a reliable clue to decide which part of the mixture to update under changing light. A foreground detection step proposed in an earlier version of this work is employed to extract foreground objects by comparing the estimated background model with the current video frame. Experiments show that the proposed method achieves satisfactory static background images and is superior to some mainstream methods in detection performance in both indoor and outdoor scenes.
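A single-Gaussian simplification of this kind of online background model (far simpler than the paper's GMM with collinearity-based update selection) might look like:

```python
class OnlineBackground:
    """Per-pixel running Gaussian background model: a single-Gaussian
    simplification of the GMM scheme described in the abstract."""
    def __init__(self, first_frame, alpha=0.05, k=2.5):
        self.mean = [float(v) for v in first_frame]
        self.var = [25.0] * len(first_frame)   # assumed initial variance
        self.alpha, self.k = alpha, k          # learning rate, match threshold

    def update(self, frame):
        """Return foreground pixel indices; fold background pixels into the model."""
        fg = []
        for i, v in enumerate(frame):
            d = v - self.mean[i]
            if d * d > (self.k ** 2) * self.var[i]:
                fg.append(i)   # foreground: do not pollute the background model
            else:
                self.mean[i] += self.alpha * d
                self.var[i] += self.alpha * (d * d - self.var[i])
        return fg

bg = OnlineBackground([100] * 5)        # 5-"pixel" toy frame
for _ in range(50):
    bg.update([100] * 5)                # static scene: the model tightens
foreground = bg.update([100, 100, 200, 100, 100])  # object appears at pixel 2
```

The paper's contribution lies precisely in what this sketch omits: maintaining several Gaussians per pixel and using the RGB collinearity cue to decide which component an illumination change should update.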
"First Order Statistics Based Feature Selection: A Diverse and Powerful Family of Feature Selection Techniques," by T. Khoshgoftaar, D. Dittman, Randall Wald, Alireza Fazelpour. 2012 11th International Conference on Machine Learning and Applications, December 12, 2012. DOI: 10.1109/ICMLA.2012.192
Dimensionality reduction techniques have become a required step when working with bioinformatics datasets. Techniques such as feature selection are known not only to improve computation time, but also to improve experimental results by removing redundant and irrelevant features or genes from consideration in subsequent analysis. Univariate feature selection techniques in particular are well suited to the high dimensionality inherent in bioinformatics datasets (for example, DNA microarray datasets) due to their intuitive output (a ranked list of features or genes) and their relatively small computational cost compared to other techniques. This paper presents seven univariate feature selection techniques and collects them into a single family entitled First Order Statistics (FOS) based feature selection. All seven share the trait of using first-order statistical measures such as mean and standard deviation, although this is the first work to relate them to one another and to compare their performance. To examine the properties of these seven techniques, we performed a series of similarity and classification experiments on eleven DNA microarray datasets. Our results show that, in general, each feature selection technique creates feature subsets that are diverse compared to those of the other members of the family. However, when we look at classification we find that, with one exception, the techniques produce good classification results and perform similarly to one another. Our recommendation is to use the rankers Signal-to-Noise and SAM for the best classification results and to avoid Fold Change Ratio, as it is consistently the worst performer of the seven rankers.
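Signal-to-Noise, as commonly defined for gene ranking (|mu1 - mu0| / (sigma1 + sigma0)), is straightforward to sketch; the toy data below is illustrative, not from the paper's microarray datasets:

```python
from statistics import mean, stdev

def signal_to_noise(values, labels):
    """Signal-to-Noise ratio of one feature for a binary class split:
    |mu1 - mu0| / (sigma1 + sigma0)."""
    a = [v for v, y in zip(values, labels) if y == 0]
    b = [v for v, y in zip(values, labels) if y == 1]
    return abs(mean(b) - mean(a)) / (stdev(a) + stdev(b))

def rank_features(X, labels):
    """Rank feature indices by descending S2N (columns of X are features)."""
    scores = [signal_to_noise([row[j] for row in X], labels)
              for j in range(len(X[0]))]
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)

# Feature 1 separates the two classes; feature 0 does not.
X = [[1.0, 10], [1.1, 11], [0.9, 12], [1.0, 30], [1.2, 31], [0.8, 29]]
labels = [0, 0, 0, 1, 1, 1]
rank = rank_features(X, labels)
```

Like the other FOS rankers, this scores each feature independently from class-conditional means and standard deviations, which is what keeps the family cheap on tens of thousands of genes.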
"A Machine Learning Pipeline for Three-Way Classification of Alzheimer Patients from Structural Magnetic Resonance Images of the Brain," by Sriraam Natarajan, Saket Joshi, B. Saha, A. Edwards, Tushar Khot, Elizabeth Moody, K. Kersting, C. Whitlow, J. Maldjian. 2012 11th International Conference on Machine Learning and Applications, December 12, 2012. DOI: 10.1109/ICMLA.2012.42
Magnetic resonance imaging (MRI) has emerged as an important tool for identifying intermediate biomarkers of Alzheimer's disease (AD) due to its ability to measure regional changes in the brain that are thought to reflect disease severity and progression. In this paper, we set out a novel pipeline that uses volumetric MRI data collected from different subjects as input and classifies them into one of three classes: AD, mild cognitive impairment (MCI), and cognitively normal (CN). Our pipeline consists of three stages: (1) a segmentation layer, where brain MRI data is divided into clinically relevant regions; (2) a classification layer, which uses relational learning algorithms to make pairwise predictions between the three classes; and (3) a combination layer, which combines the results for the different classes to obtain the final classification. One of the key features of our proposed approach is that it allows domain experts' knowledge to guide the learning in all the layers. We evaluate our pipeline on 397 patients from the Alzheimer's Disease Neuroimaging Initiative and demonstrate that it obtains state-of-the-art performance with minimal feature engineering.
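The pairwise-then-combine structure of stages (2) and (3) can be illustrated with a generic one-vs-one voting scheme; the threshold classifiers and the single "atrophy score" feature below are hypothetical stand-ins, not the paper's relational learners:

```python
from collections import Counter

def one_vs_one_combine(pairwise, classes, x):
    """Combine pairwise classifiers into one multi-way decision by voting:
    a generic stand-in for the paper's combination layer."""
    votes = Counter(f(x) for f in pairwise.values())
    return max(classes, key=lambda c: votes[c])   # ties broken by class order

# Hypothetical pairwise classifiers thresholding a single "atrophy score".
pairwise = {
    ('CN', 'MCI'): lambda x: 'MCI' if x > 0.3 else 'CN',
    ('MCI', 'AD'): lambda x: 'AD' if x > 0.7 else 'MCI',
    ('CN', 'AD'):  lambda x: 'AD' if x > 0.5 else 'CN',
}
classes = ['CN', 'MCI', 'AD']
```

Each binary model only has to separate two of the three diagnoses, which is often easier than a direct three-way decision; the votes then resolve the final label.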
"Incremental Mitosis: Discovering Clusters of Arbitrary Shapes and Densities in Dynamic Data," by Rania Ibrahim, N. Ahmed, N. A. Yousri, M. Ismail. 2012 11th International Conference on Machine Learning and Applications, December 12, 2012. DOI: 10.1109/ICMLA.2012.26
While finding natural clusters in high-dimensional data is in itself a challenge, the dynamic nature of data adds an even greater one. Many applications, such as data warehouses and the WWW, demand efficient incremental clustering algorithms to handle their dynamic data. Numerous useful incremental clustering algorithms have been developed for large datasets, such as incremental K-means, incremental DBSCAN, similarity histogram-based clustering (SHC), and mean shift. However, targeting clusters of different shapes and densities has yet to be efficiently tackled. In this work, an efficient incremental clustering algorithm, Incremental Mitosis, is proposed. It is based on the Mitosis clustering algorithm, which maximizes the relatedness of distances between patterns of the same cluster. The proposed algorithm is able to discover clusters of arbitrary shapes and densities in dynamic high-dimensional data. Experimental results show that the proposed algorithm clusters the data efficiently while maintaining the accuracy of the Mitosis algorithm.
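For contrast with Incremental Mitosis, the simplest possible incremental clusterer, a leader-style algorithm that cannot handle arbitrary shapes or varying densities, shows what "incremental" means in this setting:

```python
def incremental_cluster(stream, threshold):
    """Leader-style incremental clustering on a 1-D stream: assign each new
    point to the nearest centroid within `threshold`, else open a new cluster.
    A much simpler scheme than Incremental Mitosis, shown only to illustrate
    the incremental setting."""
    centroids, counts, assignments = [], [], []
    for p in stream:
        best = min(range(len(centroids)),
                   key=lambda j: abs(p - centroids[j]), default=None)
        if best is not None and abs(p - centroids[best]) <= threshold:
            counts[best] += 1
            centroids[best] += (p - centroids[best]) / counts[best]  # running mean
            assignments.append(best)
        else:
            centroids.append(float(p))
            counts.append(1)
            assignments.append(len(centroids) - 1)
    return centroids, assignments

centroids, assignments = incremental_cluster([1.0, 1.2, 5.0, 5.1, 0.9], threshold=1.0)
```

Each point is processed once, without reclustering the history; the fixed global threshold is exactly the limitation that distance-relatedness criteria like Mitosis's are designed to avoid.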
"A Normalized Criterion of Spatial Clustering in Model-Based Framework," by X. Wang, E. Grall-Maës, P. Beauseroy. 2012 11th International Conference on Machine Learning and Applications, December 12, 2012. DOI: 10.1109/ICMLA.2012.99
This paper presents a model-based criterion for assessing the clustering results of spatial data, where both geometrical constraints and observation attributes are taken into account. An extra parameter is often used with the aim of controlling the relative importance of each characteristic. Since the values of both terms vary according to different realizations of the data, it becomes essential to determine the value of this parameter, which has a large influence on the clustering criterion. Thus, an "upper-lower bound" technique is proposed to address the problem caused by the stochastic properties of both terms. In addition, we apply a normalization method to regularize the parameter value. The effectiveness of this approach is validated through experimental results using simulated reliability data.
"On the Use of SVMs to Detect Anomalies in a Stream of SIP Messages," by Raihana Ferdous, R. Cigno, A. Zorat. 2012 11th International Conference on Machine Learning and Applications, December 12, 2012. DOI: 10.1109/ICMLA.2012.109
Voice and multimedia communications are rapidly migrating from traditional networks to TCP/IP networks (the Internet), where services are provisioned via SIP (Session Initiation Protocol). This paper proposes an on-line filter that examines the stream of incoming SIP messages and classifies them as good or bad. The classification is carried out in two stages: first, a lexical analysis weeds out messages that do not belong to the language generated by the grammar defined by the SIP standard. A second filtering stage then identifies messages that somehow differ, in structure or content, from messages previously classified as good. While the first stage is straightforward, as the classification is crisp (either a message belongs to the language or it does not), the second stage requires more delicate handling, as whether a message is semantically meaningful is not a sharp decision. The approach we followed for this step uses past experience on previously classified messages, i.e., a "learn-by-example" approach, which led to a classifier based on support vector machines (SVMs) to perform the required analysis of each incoming SIP message. The paper describes the overall architecture of the two-stage filter and then explores several points of the SVM configuration space to determine a configuration that performs well when used to classify a large sample of SIP messages obtained from real traffic collected on a VoIP installation at our institution. Finally, the performance of the classification on additional messages collected from the same source is presented.
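The two-stage structure can be sketched as below; the lexical check is a toy approximation of the SIP grammar, and the statistics-based anomaly score is a stand-in for the paper's SVM, not its actual features or kernel:

```python
import re

GOOD_REQUEST = re.compile(r'^(INVITE|ACK|BYE|CANCEL|REGISTER|OPTIONS) sip:\S+ SIP/2\.0$')
REQUIRED_HEADERS = {'via', 'from', 'to', 'call-id', 'cseq'}

def lexical_ok(message):
    """Stage 1: crude lexical check of the request line and mandatory headers
    (a toy approximation of parsing against the SIP grammar)."""
    lines = message.strip().splitlines()
    if not lines or not GOOD_REQUEST.match(lines[0]):
        return False
    headers = {l.split(':', 1)[0].strip().lower() for l in lines[1:] if ':' in l}
    return REQUIRED_HEADERS <= headers

def anomaly_score(message, good_mean_len, good_mean_headers):
    """Stage 2: deviation from known-good message statistics. A distance-based
    stand-in for the SVM classifier trained on previously classified traffic."""
    n_headers = sum(1 for l in message.strip().splitlines()[1:] if ':' in l)
    return (abs(len(message) - good_mean_len) / good_mean_len
            + abs(n_headers - good_mean_headers) / good_mean_headers)

good = ("INVITE sip:bob@example.com SIP/2.0\n"
        "Via: SIP/2.0/UDP pc1.example.com\n"
        "From: alice\nTo: bob\nCall-ID: 42\nCSeq: 1 INVITE\n")
```

Only messages that pass the crisp stage-1 check reach stage 2, where a threshold on the score separates "good" from "bad"; in the paper this soft decision is made by the SVM rather than a hand-built distance.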