International Conference on Machine Learning and Computing最新文献

英文中文

A Distributed PCM Clustering Algorithm Based on Spark 基于Spark的分布式PCM聚类算法

International Conference on Machine Learning and Computing

Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318315

Yong Zhang, Hao Liu, Tianzhen Chen, Di Tang

With the large-scale growth of data, traditional single-machine data processing methods are difficult to deal with massive data, especially iterative clustering algorithms that require frequent reading and writing operations. On the basis of Spark framework, this paper proposes a distributed possibilistic c-means algorithm based on memory computing, called Spark-PCM. The proposed method improves the related processing of distributed matrix operation and is implemented on the Spark platform. Experimental results show that the proposed Spark-PCM algorithm runs in a linear relationship with the number of nodes and has a good scalability, which indicates that it has higher scalability and adaptability to large-scale data.

随着数据的大规模增长，传统的单机数据处理方法难以处理海量数据，尤其是需要频繁读写操作的迭代聚类算法。本文在Spark框架的基础上，提出了一种基于内存计算的分布式可能性c均值算法，称为Spark- pcm。该方法改进了分布式矩阵运算的相关处理，并在Spark平台上实现。实验结果表明，提出的Spark-PCM算法与节点数呈线性关系，具有良好的可扩展性，表明该算法具有较高的可扩展性和对大规模数据的适应性。

引用次数: 1

Application of Load Balancing Technology Based on Dynamic Migration in Wide Area Measurement Data Storage 基于动态迁移的负载均衡技术在广域测量数据存储中的应用

International Conference on Machine Learning and Computing

Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318361

Allam Maalla

With the increasing number of technology, Wide area measurement plays an important role in real-time analysis of low frequency oscillation, transient stability control, voltage phase angle and amplitude measurement in modern smart grid. Wide area measurement has huge amount of data and how to store data is an important issue in wide area measurement. This paper presents the storage method of wide-area measurement data, studies the load distribution, adopts the Magician dynamic migration framework, which is developed based on Xen, and proposes the hierarchical copy algorithm and compression algorithm to optimize the mass data storage in wide-area measurement. Through the research, the problem of excessive data in wide-area measurement can be better solved, the reasonable storage of wide-area measurement data can be realized, and the role of wide-area measurement in smart grid can be better played.

随着技术的不断增多，广域测量在现代智能电网的低频振荡实时分析、暂态稳定控制、电压相角和幅值测量等方面发挥着重要作用。广域测量数据量巨大，如何存储数据是广域测量中的一个重要问题。本文提出了广域测量数据的存储方法，研究了负载分布，采用基于Xen开发的魔法师动态迁移框架，提出了分层复制算法和压缩算法来优化广域测量中海量数据的存储。通过研究，可以更好地解决广域测量数据过多的问题，实现广域测量数据的合理存储，更好地发挥广域测量在智能电网中的作用。

引用次数: 0

A Review of Methods Used in Machine Learning and Data Analysis 机器学习和数据分析方法综述

International Conference on Machine Learning and Computing

Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318300

Qingyang Wu

This report provides an overview of machine learning and data analysis with explanation of the advantages and disadvantages of different methods. I also demonstrate a practical implementation of the described methods on a dataset of real estate prices.

本报告概述了机器学习和数据分析，并解释了不同方法的优缺点。我还在房地产价格数据集上演示了所描述方法的实际实现。

引用次数: 1

An Interpretable Classification Model Based on Characteristic Element Extraction 基于特征元素提取的可解释分类模型

International Conference on Machine Learning and Computing

Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318370

Mingwei Zhang, Xiuxiu He, Bin Zhang

The process of a classification application is usually dynamic and long. During the process of an application, better classification application effect can be acquired by enlarging and adjusting the training dataset continuously, for example, modifying the wrong labels of original instances. For this kind of dynamic classification applications, how to build an interpretable classifier which can help domain experts to understand each label's meanings reflected from the dataset, then to compare and discriminate them with their own mastered domain knowledge, and finally to adjust and optimize the training set to enhance the effect of classification applications, is a neglected but worth studying issue. Therefore, an interpretable classification model based on characteristic element extraction is proposed in this paper. The proposed classifier is constructed by extracting positive and negative characteristic elements for all class labels which can intuitively reflect their instinct characteristics. Thus, it has high interpretability obviously and can effectively help domain experts optimize classification effect. At the same time, experiment results show that our classifier also has higher accuracy compared with other kinds of classical classifiers. Consequently, the classification model proposed in this paper is effective and efficient, especially in practical applications.

分类应用程序的过程通常是动态的和漫长的。在应用过程中，通过不断扩大和调整训练数据集，例如修改原始实例的错误标签，可以获得更好的分类应用效果。对于这类动态分类应用，如何构建一个可解释的分类器，帮助领域专家理解从数据集中反映出来的每个标签的含义，然后与自己掌握的领域知识进行比较和区分，最后对训练集进行调整和优化，以提高分类应用的效果，是一个被忽视但值得研究的问题。为此，本文提出了一种基于特征元素提取的可解释分类模型。该分类器通过提取所有类标签的正负特征元素来构建，可以直观地反映类标签的本能特征。因此，该方法具有明显的可解释性，可以有效地帮助领域专家优化分类效果。与此同时，实验结果表明，与其他经典分类器相比，我们的分类器也具有更高的准确率。因此，本文提出的分类模型是有效和高效的，特别是在实际应用中。

{"title":"An Interpretable Classification Model Based on Characteristic Element Extraction","authors":"Mingwei Zhang, Xiuxiu He, Bin Zhang","doi":"10.1145/3318299.3318370","DOIUrl":"https://doi.org/10.1145/3318299.3318370","url":null,"abstract":"The process of a classification application is usually dynamic and long. During the process of an application, better classification application effect can be acquired by enlarging and adjusting the training dataset continuously, for example, modifying the wrong labels of original instances. For this kind of dynamic classification applications, how to build an interpretable classifier which can help domain experts to understand each label's meanings reflected from the dataset, then to compare and discriminate them with their own mastered domain knowledge, and finally to adjust and optimize the training set to enhance the effect of classification applications, is a neglected but worth studying issue. Therefore, an interpretable classification model based on characteristic element extraction is proposed in this paper. The proposed classifier is constructed by extracting positive and negative characteristic elements for all class labels which can intuitively reflect their instinct characteristics. Thus, it has high interpretability obviously and can effectively help domain experts optimize classification effect. At the same time, experiment results show that our classifier also has higher accuracy compared with other kinds of classical classifiers. Consequently, the classification model proposed in this paper is effective and efficient, especially in practical applications.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131515828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Particle Competition for Multilayer Network Community Detection 多层网络社区检测中的粒子竞争

International Conference on Machine Learning and Computing

Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318320

Xubo Gao, Qiusheng Zheng, F. Verri, Rafael D. Rodrigues, Liang Zhao

Multilayer complex networks are suitable models to represent high-dimensional heterogeneous systems with special importance in big data era. Community structures in a multilayer network can be drastically changed in comparison to the set of isolated monolayer networks composited by the same sets of nodes due to the existence of interlayer connections. For this reason, community detection in multilayer networks, as an unsupervised learning task, has turned out to be an interesting research topic in data mining and analysis in complex systems. In this paper, we propose a modified version of the particle competition model for multilayer network community detection. The original model was designed to community detection in monolayer unweighted and undirected networks. The modified version presented in this paper can be in turn applied to multilayer, weighted, and/or directed networks. Moreover, we also propose a localized measure to determine the optimal number of particles corresponding to the correct number of detected communities. Computer simulations shows the better performance of the proposed technique over the state of the art ones.

多层复杂网络是表征高维异构系统的合适模型，在大数据时代具有特殊的重要性。由于层间连接的存在，与由相同节点组成的孤立单层网络相比，多层网络中的社区结构可能会发生巨大变化。因此，多层网络中的社区检测作为一项无监督学习任务，已成为复杂系统数据挖掘与分析的一个有趣的研究课题。本文提出了一种改进的粒子竞争模型，用于多层网络社区检测。原始模型设计用于单层无权无向网络中的社区检测。本文提出的改进版本可以反过来应用于多层、加权和/或有向网络。此外，我们还提出了一种局部度量来确定与正确检测到的群落数相对应的最佳粒子数。计算机仿真结果表明，该方法具有较好的性能。

{"title":"Particle Competition for Multilayer Network Community Detection","authors":"Xubo Gao, Qiusheng Zheng, F. Verri, Rafael D. Rodrigues, Liang Zhao","doi":"10.1145/3318299.3318320","DOIUrl":"https://doi.org/10.1145/3318299.3318320","url":null,"abstract":"Multilayer complex networks are suitable models to represent high-dimensional heterogeneous systems with special importance in big data era. Community structures in a multilayer network can be drastically changed in comparison to the set of isolated monolayer networks composited by the same sets of nodes due to the existence of interlayer connections. For this reason, community detection in multilayer networks, as an unsupervised learning task, has turned out to be an interesting research topic in data mining and analysis in complex systems. In this paper, we propose a modified version of the particle competition model for multilayer network community detection. The original model was designed to community detection in monolayer unweighted and undirected networks. The modified version presented in this paper can be in turn applied to multilayer, weighted, and/or directed networks. Moreover, we also propose a localized measure to determine the optimal number of particles corresponding to the correct number of detected communities. Computer simulations shows the better performance of the proposed technique over the state of the art ones.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"33 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114039517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

An Improved Hybrid Quantum Particle Swarm Optimization Algorithm for FJSP 一种改进的FJSP混合量子粒子群优化算法

International Conference on Machine Learning and Computing

Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318359

Qiwen Zhang, Songqi Hu

Aiming at minimizing makespan (the end time of the final machine) in flexible job shop scheduling problems (FJSP), a hybrid quantum behaved particle swarm optimization algorithm based on Lévy flights is proposed in this paper. Firstly, the algorithm uses the quantum probability amplitude coding method to establish a relationship between the process sequence and the particle position to solve job process sequencing sub-problem. Then uses the global selection, local selection and probability random selection to select the machine for each process. Finally, the Lévy flights is used to improve variant mode and enhance the effect of variation, the elitist strategy combined with neighborhood search is used after each iteration to improve the quality of the results. Experiments in a classical case show that the algorithm is effective and feasible for solving flexible job shop scheduling problems.

针对柔性作业车间调度问题(FJSP)中最大完工时间的问题，提出了一种基于lsamvy飞行的混合量子粒子群优化算法。该算法首先采用量子概率幅度编码方法，建立工序序列与粒子位置之间的关系，求解作业工序排序子问题;然后采用全局选择、局部选择和概率随机选择对各工序进行机器选择。最后，利用lsamvy飞行改进变异模式，增强变异效果，每次迭代后采用精英策略结合邻域搜索提高结果质量。经典实例实验表明，该算法对于求解柔性作业车间调度问题是有效可行的。

引用次数: 1

The Classification of the Documents Based on Word Embedding and 2-layer Spherical Self Organizing Maps 基于词嵌入和两层球面自组织图的文档分类

International Conference on Machine Learning and Computing

Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318378

Koki Yoshioka, H. Dozono

Due to a popularization of SNS and increase of web pages, many documents can be obtained from the internet. However, it is difficult to process a huge set of document data manually. Therefore, various classification methods based on machine learning have been proposed. In this paper, a classification method which can visualize the relationship among the documents using Word2Vec and Spherical SOM is proposed, and the performance is examined in experiments of visualization and numerical evaluation of classification accuracy.

由于SNS的普及和网页的增加，很多文档都可以从网络上获得。然而，手工处理大量的文档数据是很困难的。因此，人们提出了各种基于机器学习的分类方法。本文提出了一种利用Word2Vec和Spherical SOM可视化文档之间关系的分类方法，并通过可视化实验和分类精度的数值评价对其性能进行了检验。

引用次数: 2

Attention Based Echo State Network: A Novel Approach for Fault Prognosis 基于注意力的回声状态网络:一种故障预测新方法

International Conference on Machine Learning and Computing

Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318325

Chongdang Liu, Rong Yao, Linxuan Zhang, Yuan Liao

Recurrent neural networks (RNNs) are widely studied in recent years, since RNNs are capable of modeling the significant nonlinear dynamical systems. Echo state network (ESN) is a novel type of RNN with an interconnected reservoir to model temporal dynamics of complex sequential information. In this paper, a novel ESN structure is developed and employed to conduct fault prognosis. Fault prognosis is vital in predictive maintenance, which is a prevalent research area that mainly concentrates on predicting the remaining useful life of a machine and reducing the machine's downtime. Attention model is integrated to a typical ESN and thus different importance levels of different input elements can be adaptively treated. To further enhance the generalization of the prediction model, genetic algorithm is applied to adaptively optimize the parameters of the attention-based ESN. The proposed prognostic approach is verified on the NASA's turbofan benchmark dataset. Experimental results show that the attention-based ESN can not only achieve superior prediction accuracy but also obtain substantial improvement on stability.

近年来，递归神经网络(RNNs)由于能够对重要的非线性动力系统进行建模而得到了广泛的研究。回声状态网络(ESN)是一种新型的RNN，具有相互连接的存储库，用于模拟复杂序列信息的时间动态。本文提出了一种新的回声状态网络结构，并将其用于故障预测。故障预测是预测性维修的重要内容，是预测机器剩余使用寿命和减少机器停机时间的研究热点。将注意力模型整合到一个典型的回声状态网络中，从而对不同输入元素的不同重要程度进行自适应处理。为了进一步增强预测模型的泛化能力，采用遗传算法对基于注意力的ESN参数进行自适应优化。所提出的预测方法在NASA的涡扇基准数据集上得到了验证。实验结果表明，基于注意力的回声状态网络不仅可以获得较好的预测精度，而且在稳定性上也有较大的提高。

引用次数: 12

An Ontology Embedding Approach Based on Multiple Neural Networks 基于多神经网络的本体嵌入方法

International Conference on Machine Learning and Computing

Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318365

Achref Benarab, Fahad Rafique, Jianguo Sun

In this paper, we present a low-dimensional vector representation method for the concepts and instances of an ontology. The main idea is to transform the ontological entities into digestible data for machine learning and deep learning algorithms that only use digital inputs. The generated vectors will represent the semantics contained in the source ontology. We use the semantic relationships connecting the concepts as a landmark to train expert neural networks using the noise contrastive estimation technique to project them into a vector space specific to this relationship with weightings dependent on their frequency. The resulting vectors are then combined and fed into an autoencoder to generate a denser representation. The generated representation vectors can be used to find the semantically similar ontology entities, allowing creating a semantic network automatically. Thus, semantically similar ontology entities will have relatively close corresponding vector representations in the projection space.

本文提出了一种本体概念和实例的低维向量表示方法。其主要思想是将本体实体转换为仅使用数字输入的机器学习和深度学习算法可消化的数据。生成的向量将表示源本体中包含的语义。我们使用连接概念的语义关系作为里程碑来训练专家神经网络，使用噪声对比估计技术将它们投影到特定于这种关系的向量空间中，权重取决于它们的频率。然后将结果向量组合并馈送到自动编码器中以生成更密集的表示。生成的表示向量可用于寻找语义相似的本体实体，从而自动创建语义网络。因此，语义相似的本体实体在投影空间中具有相对接近的对应向量表示。

引用次数: 1

Analysis Method of Travel Mode Choice of Urban Residents Based on Spatial-temporal Heterogeneity 基于时空异质性的城市居民出行方式选择分析方法

International Conference on Machine Learning and Computing

Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318333

K. Zhou, Xiao Peng, Zhong Guo

Green travel, low-carbon travel, harmonious and livable have become the main objectives of urban development. Public transport-oriented urban development mode can effectively alleviate traffic congestion, reduce energy consumption, reduce environmental pollution. Considering the influence of spatial-temporal heterogeneity on the choice of urban residents' travel modes, a cross-classification selection model is constructed based on hierarchical modeling theory to capture the spatial-temporal heterogeneity quantitatively. Bayesian estimation method is selected to estimate the model parameters, and then the influencing factors of urban residents' travel mode choice behavior are revealed. Combining with typical cases, this paper compares and analyzes the differences between the results of the model analysis under the two scenarios of neglecting spatial-temporal heterogeneity and considering spatial-temporal heterogeneity, so as to provide a scientific basis for public transport-oriented urban planning.

绿色出行、低碳出行、和谐宜居成为城市发展的主要目标。以公共交通为导向的城市发展模式可以有效缓解交通拥堵，降低能源消耗，减少环境污染。考虑到时空异质性对城市居民出行方式选择的影响，基于层次建模理论构建了城市居民出行方式选择的交叉分类模型，以定量捕捉城市居民出行方式的时空异质性。采用贝叶斯估计方法对模型参数进行估计，揭示城市居民出行方式选择行为的影响因素。结合典型案例，对比分析忽略时空异质性和考虑时空异质性两种情景下模型分析结果的差异，为公共交通导向的城市规划提供科学依据。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Conference on Machine Learning and Computing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀