首页 > 最新文献

2012 11th International Conference on Machine Learning and Applications最新文献

英文 中文
Taxonomic Dimensionality Reduction in Bayesian Text Classification 贝叶斯文本分类中的分类降维
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.93
Richard A. McAllister, John W. Sheppard
Lexical abstraction hierarchies can be leveraged to provide semantic information that characterizes features of text corpora as a whole. This information may be used to determine the classification utility of the dimensions that describe a dataset. This paper presents a new method for preparing a dataset for probabilistic classification by determining, a priori, the utility of a very small subset of taxonomically-related dimensions via a Discriminative Multinomial Naive Bayes process. We show that this method yields significant improvements over both Discriminative Multinomial Naive Bayes and Bayesian network classifiers alone.
可以利用词汇抽象层次结构来提供语义信息,这些信息将文本语料库的特征作为一个整体来描述。此信息可用于确定描述数据集的维度的分类效用。本文提出了一种通过判别多项式朴素贝叶斯过程先验地确定分类相关维度的极小子集的效用来准备用于概率分类的数据集的新方法。我们证明这种方法比单独的判别多项式朴素贝叶斯和贝叶斯网络分类器都有显著的改进。
{"title":"Taxonomic Dimensionality Reduction in Bayesian Text Classification","authors":"Richard A. McAllister, John W. Sheppard","doi":"10.1109/ICMLA.2012.93","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.93","url":null,"abstract":"Lexical abstraction hierarchies can be leveraged to provide semantic information that characterizes features of text corpora as a whole. This information may be used to determine the classification utility of the dimensions that describe a dataset. This paper presents a new method for preparing a dataset for probabilistic classification by determining, a priori, the utility of a very small subset of taxonomically-related dimensions via a Discriminative Multinomial Naive Bayes process. We show that this method yields significant improvements over both Discriminative Multinomial Naive Bayes and Bayesian network classifiers alone.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115478448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Face Recognition in the Virtual World: Recognizing Avatar Faces 虚拟世界中的人脸识别:识别化身的面孔
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.16
Roman V. Yampolskiy, Brendan Klare, Anil K. Jain
Criminal activity in virtual worlds is becoming a major problem for law enforcement agencies. Forensic investigators are becoming interested in being able to accurately and automatically track people in virtual communities. In this paper a set of algorithms capable of verification and recognition of avatar faces with high degree of accuracy are described. Results of experiments aimed at within-virtual-world avatar authentication and inter-reality-based scenarios of tracking a person between real and virtual worlds are reported. In the FERET-to-Avatar face dataset, where an avatar face was generated from every photo in the FERET database, a COTS FR algorithm achieved a near perfect 99.58% accuracy on 725 subjects. On a dataset of avatars from Second Life, the proposed avatar-to-avatar matching algorithm (which uses a fusion of local structural and appearance descriptors) achieved average true accept rates of (i) 96.33% using manual eye detection, and (ii) 86.5% in a fully automated mode at a false accept rate of 1.0%. A combination of the proposed face matcher and a state-of-the art commercial matcher (FaceVACS) resulted in further improvement on the inter-reality-based scenario.
虚拟世界中的犯罪活动正成为执法机构面临的一个主要问题。法医调查人员对能够准确、自动地追踪虚拟社区中的人越来越感兴趣。本文描述了一套能够对虚拟形象人脸进行高精度验证和识别的算法。本文报道了虚拟世界内的虚拟身份认证和基于虚拟世界和真实世界之间跟踪人的跨现实场景的实验结果。在FERET-to- avatar人脸数据集中,从FERET数据库中的每张照片生成头像,COTS FR算法在725个受试者上实现了近乎完美的99.58%的准确率。在《第二人生》的化身数据集上,提出的化身到化身匹配算法(使用局部结构和外观描述符的融合)在使用手动眼睛检测时实现了(i) 96.33%的平均真实接受率,(ii)在完全自动化模式下实现了86.5%的平均真实接受率,错误接受率为1.0%。将拟议的人脸匹配器与最先进的商用匹配器(FaceVACS)结合在一起,进一步改进了基于inter-reality的场景。
{"title":"Face Recognition in the Virtual World: Recognizing Avatar Faces","authors":"Roman V. Yampolskiy, Brendan Klare, Anil K. Jain","doi":"10.1109/ICMLA.2012.16","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.16","url":null,"abstract":"Criminal activity in virtual worlds is becoming a major problem for law enforcement agencies. Forensic investigators are becoming interested in being able to accurately and automatically track people in virtual communities. In this paper a set of algorithms capable of verification and recognition of avatar faces with high degree of accuracy are described. Results of experiments aimed at within-virtual-world avatar authentication and inter-reality-based scenarios of tracking a person between real and virtual worlds are reported. In the FERET-to-Avatar face dataset, where an avatar face was generated from every photo in the FERET database, a COTS FR algorithm achieved a near perfect 99.58% accuracy on 725 subjects. On a dataset of avatars from Second Life, the proposed avatar-to-avatar matching algorithm (which uses a fusion of local structural and appearance descriptors) achieved average true accept rates of (i) 96.33% using manual eye detection, and (ii) 86.5% in a fully automated mode at a false accept rate of 1.0%. A combination of the proposed face matcher and a state-of-the art commercial matcher (FaceVACS) resulted in further improvement on the inter-reality-based scenario.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115485413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
Risk Estimation in Spatial Disease Clusters: An RBF Network Approach 基于RBF网络的空间疾病集群风险评估
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.233
Fernanda C. Takahashi, Ricardo H. C. Takahashi
This paper proposes a method which is suitable for the estimation of the probability of occurrence of a syndrome, as a function of the geographical coordinates of the individuals under risk. The data describing the location of syndrome cases over the population suffers a moving-average filtering, and the resulting values are fitted by an RBF network performing a regression. Some contour curves of the RBF network are then employed in order to establish the boundaries between four kinds of regions: regions of high-incidence, regions of medium incidence, regions of slightly-abnormal incidence, and regions of normal prevalence. In each region, the risk is estimated with three indicators: a nominal risk, an upper bound risk and a lower bound risk. Those indicators are obtained by adjusting the probability employed for the Monte Carlo simulation of syndrome scenarios over the population. The nominal risk is the probability which produces Monte Carlo simulations for which the empirical number of syndrome cases corresponds to the median. The upper bound and the lower bound risks are the probabilities which produce Monte Carlo simulations for which the empirical values of syndrome cases correspond respectively to the 25% percentile and the 75% percentile. The proposed method constitutes an advance in relation to the currently known techniques of spatial cluster detection, which are dedicated to finding clusters of abnormal occurrence of a syndrome, without quantifying the probability associated to such an abnormality, and without performing a stratification of different sub-regions with different associated risks. The proposed method was applied on data which were studied formerly in a paper that was intended to find a cluster of dengue fever. The result determined here is compatible with the cluster that was found in that reference.
本文提出了一种适用于估计某一综合征发生概率的方法,该方法是危险个体地理坐标的函数。描述综合征病例在人群中的位置的数据经过移动平均滤波,结果值由执行回归的RBF网络拟合。然后利用RBF网络的一些轮廓曲线来建立四种区域之间的边界:高发病率区域、中等发病率区域、轻微异常发病率区域和正常患病率区域。在每个地区,用三个指标来估计风险:名义风险、上限风险和下限风险。这些指标是通过调整总体上综合症情景的蒙特卡罗模拟所采用的概率而获得的。名义风险是产生蒙特卡罗模拟的概率,其中综合症病例的经验数对应于中位数。上界和下界风险是产生蒙特卡罗模拟的概率,其中综合症病例的经验值分别对应于25%百分位和75%百分位。与目前已知的空间聚类检测技术相比,所提出的方法是一种进步,这些技术致力于发现综合征异常发生的聚类,而没有量化与这种异常相关的概率,也没有对具有不同相关风险的不同子区域进行分层。所提出的方法应用于以前在一篇旨在找到登革热群集的论文中研究的数据。这里确定的结果与在该引用中找到的集群兼容。
{"title":"Risk Estimation in Spatial Disease Clusters: An RBF Network Approach","authors":"Fernanda C. Takahashi, Ricardo H. C. Takahashi","doi":"10.1109/ICMLA.2012.233","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.233","url":null,"abstract":"This paper proposes a method which is suitable for the estimation of the probability of occurrence of a syndrome, as a function of the geographical coordinates of the individuals under risk. The data describing the location of syndrome cases over the population suffers a moving-average filtering, and the resulting values are fitted by an RBF network performing a regression. Some contour curves of the RBF network are then employed in order to establish the boundaries between four kinds of regions: regions of high-incidence, regions of medium incidence, regions of slightly-abnormal incidence, and regions of normal prevalence. In each region, the risk is estimated with three indicators: a nominal risk, an upper bound risk and a lower bound risk. Those indicators are obtained by adjusting the probability employed for the Monte Carlo simulation of syndrome scenarios over the population. The nominal risk is the probability which produces Monte Carlo simulations for which the empirical number of syndrome cases corresponds to the median. The upper bound and the lower bound risks are the probabilities which produce Monte Carlo simulations for which the empirical values of syndrome cases correspond respectively to the 25% percentile and the 75% percentile. The proposed method constitutes an advance in relation to the currently known techniques of spatial cluster detection, which are dedicated to finding clusters of abnormal occurrence of a syndrome, without quantifying the probability associated to such an abnormality, and without performing a stratification of different sub-regions with different associated risks. The proposed method was applied on data which were studied formerly in a paper that was intended to find a cluster of dengue fever. The result determined here is compatible with the cluster that was found in that reference.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122027312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multivariate Assessment of a Repair Program for a New York City Electrical Grid 纽约市电网维修计划的多元评估
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.208
R. Passonneau, Ashish Tomar, Somnath Sarkar, Haimonti Dutta, Axinia Radeva
We assess the impact of an inspection repair program administered to the secondary electrical grid in New York City. The question of interest is whether repairs reduce the incidence of future events that cause service disruptions ranging from minor to serious ones. A key challenge in defining treatment and control groups in the absence of a randomized experiment involved an inherent bias in selection of electrical structures to be inspected in a given year. To compensate for the bias, we construct separate models for each year of the propensity for a structure to have an inspection repair. The propensity models account for differences across years in the structures that get inspected. To model the treatment outcome, we use a statistical approach based on the additive effects of many weak learners. Our results indicate that inspection repairs are more beneficial earlier in the five-year inspection cycle, which accords with the inherent bias to inspect structures in earlier years that are known to have problems.
我们评估了对纽约市二级电网管理的检查维修计划的影响。人们感兴趣的问题是,维修是否会减少未来导致服务中断的事件的发生率,这些事件从轻微到严重不等。在没有随机实验的情况下,确定治疗组和对照组的一个关键挑战涉及在特定年份选择要检查的电结构的固有偏见。为了弥补偏差,我们为每年的结构进行检查修复的倾向构建了单独的模型。倾向模型解释了被检查的结构在不同年份之间的差异。为了对治疗结果进行建模,我们使用了基于许多弱学习器的加性效应的统计方法。我们的研究结果表明,在五年的检查周期中,检查维修更有利于早期,这符合固有的偏见,即在早期检查已知有问题的结构。
{"title":"Multivariate Assessment of a Repair Program for a New York City Electrical Grid","authors":"R. Passonneau, Ashish Tomar, Somnath Sarkar, Haimonti Dutta, Axinia Radeva","doi":"10.1109/ICMLA.2012.208","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.208","url":null,"abstract":"We assess the impact of an inspection repair program administered to the secondary electrical grid in New York City. The question of interest is whether repairs reduce the incidence of future events that cause service disruptions ranging from minor to serious ones. A key challenge in defining treatment and control groups in the absence of a randomized experiment involved an inherent bias in selection of electrical structures to be inspected in a given year. To compensate for the bias, we construct separate models for each year of the propensity for a structure to have an inspection repair. The propensity models account for differences across years in the structures that get inspected. To model the treatment outcome, we use a statistical approach based on the additive effects of many weak learners. Our results indicate that inspection repairs are more beneficial earlier in the five-year inspection cycle, which accords with the inherent bias to inspect structures in earlier years that are known to have problems.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125457799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Unsupervised Disaggregation for Non-intrusive Load Monitoring 非侵入式负荷监测的无监督分解
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.249
S. Pattem
A method for unsupervised disaggregation of appliance signatures from smart meter data is presented. The primary feature used for unsupervised learning relates to abrupt transitions or magnitude changes in the power waveform. The method consists of a sequence of procedures for appliance signature identification, and disaggregation using hidden Markov modeling (HMM), and residual analysis. The key contributions are (a) a novel 'segmented' application of the Viterbi algorithm for sequence decoding with the HMM, (b) details of establishing observation and state transition probabilities for the HMM, and (c) procedures for careful handling of low power signatures. Results show that the method is effective for magnitude-based disaggregation, and provide insights for a more complete solution.
提出了一种从智能电表数据中提取设备签名的无监督分解方法。用于无监督学习的主要特征与功率波形的突变或幅度变化有关。该方法包括一系列电器签名识别、隐马尔可夫模型分解和残差分析。主要贡献有:(a)将Viterbi算法用于HMM序列解码的新型“分段”应用,(b)建立HMM的观察和状态转移概率的细节,以及(c)仔细处理低功耗签名的程序。结果表明,该方法对基于震级的分解是有效的,并为更完整的解提供了见解。
{"title":"Unsupervised Disaggregation for Non-intrusive Load Monitoring","authors":"S. Pattem","doi":"10.1109/ICMLA.2012.249","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.249","url":null,"abstract":"A method for unsupervised disaggregation of appliance signatures from smart meter data is presented. The primary feature used for unsupervised learning relates to abrupt transitions or magnitude changes in the power waveform. The method consists of a sequence of procedures for appliance signature identification, and disaggregation using hidden Markov modeling (HMM), and residual analysis. The key contributions are (a) a novel 'segmented' application of the Viterbi algorithm for sequence decoding with the HMM, (b) details of establishing observation and state transition probabilities for the HMM, and (c) procedures for careful handling of low power signatures. Results show that the method is effective for magnitude-based disaggregation, and provide insights for a more complete solution.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"68 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120844123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 53
Feature Mapping and Fusion for Music Genre Classification 音乐类型分类的特征映射与融合
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.59
H. Balti, H. Frigui
We propose a feature level fusion that is based on mapping the original low-level audio features to histogram descriptors. Our mapping is based on possibilistic membership functions and has two main components. The first one consists of clustering each set of features and identifying a set of representative prototypes. The second component uses the learned prototypes within membership functions to transform the original features into histograms. The mapping transforms features of different dimensions to histograms of fixed dimensions. This makes the fusion of multiple features less biased by the dimensionality and distributions of the different features. Using a standard collection of songs, we show that the transformed features provide higher classification accuracy than the original features. We also show that mapping simple low-level features and using a K-NN classifier provides results comparable to the state-of-the art.
我们提出了一种基于将原始低级音频特征映射到直方图描述符的特征级融合。我们的映射是基于可能性隶属函数的,它有两个主要组成部分。第一种方法包括对每组特征进行聚类,并确定一组具有代表性的原型。第二个组件使用隶属函数中学习到的原型将原始特征转换成直方图。映射将不同维度的特征转换为固定维度的直方图。这使得多特征的融合较少受到不同特征的维数和分布的影响。使用标准的歌曲集,我们证明了转换后的特征比原始特征提供了更高的分类精度。我们还表明,映射简单的低级特征和使用K-NN分类器可以提供与最先进的结果相媲美的结果。
{"title":"Feature Mapping and Fusion for Music Genre Classification","authors":"H. Balti, H. Frigui","doi":"10.1109/ICMLA.2012.59","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.59","url":null,"abstract":"We propose a feature level fusion that is based on mapping the original low-level audio features to histogram descriptors. Our mapping is based on possibilistic membership functions and has two main components. The first one consists of clustering each set of features and identifying a set of representative prototypes. The second component uses the learned prototypes within membership functions to transform the original features into histograms. The mapping transforms features of different dimensions to histograms of fixed dimensions. This makes the fusion of multiple features less biased by the dimensionality and distributions of the different features. Using a standard collection of songs, we show that the transformed features provide higher classification accuracy than the original features. We also show that mapping simple low-level features and using a K-NN classifier provides results comparable to the state-of-the art.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123929704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Prognosis Based on Handling Drifts in Dynamical Environments: Application to a Wind Turbine Benchmark 动态环境中基于漂移处理的预测:在风力发电机基准测试中的应用
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.131
Antoine Chammas, E. Duviella, S. Lecoeuche
In this paper, we present a prognosis architecture that allows the computation of the Remaining Useful Life (RUL) of a failing process. A process subject to an incipient fault experiments slowly developing degradation. Sensor measurements and Condition Monitoring (CM) data extracted from the system allow to follow up the process drift. The prognosis architecture we propose makes use of a dynamical clustering algorithm to model the data in a feature space. This algorithm uses a sliding window scheme on which the model is iteratively updated. Metrics applied on the parameters of this model are used to compute a drift severity indicator, which is also an indicator of the health of the system. The architecture for prognosis is applied on a benchmark of wind turbine. The used benchmark has been constructed to serve as a realistic wind turbine model. It was used in the context of a global scale fault diagnosis and fault tolerant control competition. The benchmark also proposed a drifting fault scenario that we used to test our approach.
在本文中,我们提出了一个预测体系结构,允许计算失效过程的剩余使用寿命(RUL)。受初期故障影响的过程会逐渐退化。从系统中提取的传感器测量和状态监测(CM)数据允许跟踪过程漂移。我们提出的预测架构利用动态聚类算法在特征空间中对数据建模。该算法采用滑动窗口模式,迭代更新模型。应用于该模型参数的度量用于计算漂移严重性指标,该指标也是系统健康状况的指标。将该预测体系应用于某风力发电机组的基准试验。所使用的基准已被构建为一个现实的风力涡轮机模型。将其应用于全局范围的故障诊断和容错控制竞争。基准测试还提出了一个漂移故障场景,我们用它来测试我们的方法。
{"title":"Prognosis Based on Handling Drifts in Dynamical Environments: Application to a Wind Turbine Benchmark","authors":"Antoine Chammas, E. Duviella, S. Lecoeuche","doi":"10.1109/ICMLA.2012.131","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.131","url":null,"abstract":"In this paper, we present a prognosis architecture that allows the computation of the Remaining Useful Life (RUL) of a failing process. A process subject to an incipient fault experiments slowly developing degradation. Sensor measurements and Condition Monitoring (CM) data extracted from the system allow to follow up the process drift. The prognosis architecture we propose makes use of a dynamical clustering algorithm to model the data in a feature space. This algorithm uses a sliding window scheme on which the model is iteratively updated. Metrics applied on the parameters of this model are used to compute a drift severity indicator, which is also an indicator of the health of the system. The architecture for prognosis is applied on a benchmark of wind turbine. The used benchmark has been constructed to serve as a realistic wind turbine model. It was used in the context of a global scale fault diagnosis and fault tolerant control competition. The benchmark also proposed a drifting fault scenario that we used to test our approach.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128094781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Measuring the Spatial Error in Load Forecasting for Electrical Distribution Planning as a Problem of Transporting the Surplus to the In-Deficit Locations 基于余量输缺问题的配电规划负荷预测空间误差测量
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.203
D. Vieira, M. A. M. Cabral, T. V. Menezes, B. E. Silva, A. C. Lisboa
While there are many functions defined in the literature to measure the error magnitude (how much), the problem of dinning the spatial error (where) is not so well defined. For instance, in a given region it is expected a global growth in the electrical demand of 10MW. For the electrical system planning not only the amount but also the location must be considered. Predicting a growth of 10MW (how much) in the south (where) of a city would lead to complete different polices in terms of resources allocation (for instance a new substation) than predicting the same amount of 10MW in the north. Trying to cope with this difficulty, this paper proposes the concept of spatial error as the cost of transporting the surplus of one region to compensate another region deceit. This conceptual problem was written as an optimization transportation problem. This paper describes conceptually the difference between magnitude and spatial error measures and shows an algorithm to deal efficiently with the defined framework.
虽然在文献中定义了许多函数来测量误差大小(多少),但没有很好地定义空间误差(在哪里)的问题。例如,在某一特定地区,预计全球电力需求将增长10MW。在电力系统规划中,不仅要考虑电量,而且要考虑位置。预测一个城市的南部(在哪里)增长10MW(多少)会导致资源分配方面的完全不同的政策(例如,一个新的变电站),而在北部预测同样数量的10MW。为了解决这一问题,本文提出了空间误差的概念,将空间误差作为一个区域的剩余量运输到另一个区域的补偿成本。这个概念问题被写成一个优化运输问题。本文从概念上描述了幅度和空间误差度量之间的区别,并给出了一种有效处理定义框架的算法。
{"title":"Measuring the Spatial Error in Load Forecasting for Electrical Distribution Planning as a Problem of Transporting the Surplus to the In-Deficit Locations","authors":"D. Vieira, M. A. M. Cabral, T. V. Menezes, B. E. Silva, A. C. Lisboa","doi":"10.1109/ICMLA.2012.203","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.203","url":null,"abstract":"While there are many functions defined in the literature to measure the error magnitude (how much), the problem of dinning the spatial error (where) is not so well defined. For instance, in a given region it is expected a global growth in the electrical demand of 10MW. For the electrical system planning not only the amount but also the location must be considered. Predicting a growth of 10MW (how much) in the south (where) of a city would lead to complete different polices in terms of resources allocation (for instance a new substation) than predicting the same amount of 10MW in the north. Trying to cope with this difficulty, this paper proposes the concept of spatial error as the cost of transporting the surplus of one region to compensate another region deceit. This conceptual problem was written as an optimization transportation problem. This paper describes conceptually the difference between magnitude and spatial error measures and shows an algorithm to deal efficiently with the defined framework.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129667613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A Treeboost Model for Software Effort Estimation Based on Use Case Points 基于用例点的软件工作量评估Treeboost模型
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.155
A. B. Nassif, Luiz Fernando Capretz, D. Ho, Mohammad Azzeh
Software effort prediction is an important task in the software development life cycle. Many models including regression models, machine learning models, algorithmic models, expert judgment and estimation by analogy have been widely used to estimate software effort and cost. In this work, a Tree boost (Stochastic Gradient Boosting) model is put forward to predict software effort based on the Use Case Point method. The inputs of the model include software size in use case points, productivity and complexity. A multiple linear regression model was created and the Tree boost model was evaluated against the multiple linear regression model, as well as the use case point model by using four performance criteria: MMRE, PRED, MdMRE and MSE. Experiments show that the Tree boost model can be used with promising results to estimate software effort.
软件工作量预测是软件开发生命周期中的一项重要任务。回归模型、机器学习模型、算法模型、专家判断和类比估计等模型已被广泛用于估算软件的工作量和成本。在这项工作中,提出了一个基于用例点方法预测软件工作量的树增强(随机梯度增强)模型。模型的输入包括用例点中的软件大小、生产力和复杂性。创建了一个多元线性回归模型,并根据多元线性回归模型和用例点模型,通过使用四个性能标准:MMRE、PRED、MdMRE和MSE,对Tree boost模型进行评估。实验表明,Tree boost模型可以用于估算软件工作量,并取得了令人满意的结果。
{"title":"A Treeboost Model for Software Effort Estimation Based on Use Case Points","authors":"A. B. Nassif, Luiz Fernando Capretz, D. Ho, Mohammad Azzeh","doi":"10.1109/ICMLA.2012.155","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.155","url":null,"abstract":"Software effort prediction is an important task in the software development life cycle. Many models including regression models, machine learning models, algorithmic models, expert judgment and estimation by analogy have been widely used to estimate software effort and cost. In this work, a Tree boost (Stochastic Gradient Boosting) model is put forward to predict software effort based on the Use Case Point method. The inputs of the model include software size in use case points, productivity and complexity. A multiple linear regression model was created and the Tree boost model was evaluated against the multiple linear regression model, as well as the use case point model by using four performance criteria: MMRE, PRED, MdMRE and MSE. Experiments show that the Tree boost model can be used with promising results to estimate software effort.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130568040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 55
A Machine Learning Pipeline for Three-Way Classification of Alzheimer Patients from Structural Magnetic Resonance Images of the Brain 基于脑结构磁共振图像对阿尔茨海默病患者进行三向分类的机器学习管道
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.42
Sriraam Natarajan, Saket Joshi, B. Saha, A. Edwards, Tushar Khot, Elizabeth Moody, K. Kersting, C. Whitlow, J. Maldjian
Magnetic resonance imaging (MRI) has emerged as an important tool to identify intermediate biomarkers of Alzheimer's disease (AD) due to its ability to measure regional changes in the brain that are thought to reflect disease severity and progression. In this paper, we set out a novel pipeline that uses volumetric MRI data collected from different subjects as input and classifies them into one of three classes: AD, mild cognitive impairment (MCI) and cognitively normal (CN). Our pipeline consists of three stages - (1) a segmentation layer where brain MRI data is divided into clinically relevant regions, (2) a classification layer that uses relational learning algorithms to make pair wise predictions between the three classes, and (3) a combination layer that combines the results of the different classes to obtain the final classification. One of the key features of our proposed approach is that it allows for domain expert's knowledge to guide the learning in all the layers. We evaluate our pipeline on 397 patients acquired from the Alzheimer's Disease Neuroimaging Initiative and demonstrate that it obtains state-of the-art performance with minimal feature engineering.
磁共振成像(MRI)已成为识别阿尔茨海默病(AD)中间生物标志物的重要工具,因为它能够测量被认为反映疾病严重程度和进展的大脑区域变化。在本文中,我们提出了一种新的管道,使用从不同受试者收集的体积MRI数据作为输入,并将其分为三类:AD,轻度认知障碍(MCI)和认知正常(CN)。我们的流水线由三个阶段组成:(1)分割层,其中脑MRI数据被划分为临床相关区域;(2)分类层,使用关系学习算法在三个类别之间进行配对预测;(3)组合层,将不同类别的结果组合在一起以获得最终分类。我们提出的方法的一个关键特征是它允许领域专家的知识来指导所有层的学习。我们对从阿尔茨海默病神经成像计划获得的397名患者进行了评估,并证明它以最小的特征工程获得了最先进的性能。
{"title":"A Machine Learning Pipeline for Three-Way Classification of Alzheimer Patients from Structural Magnetic Resonance Images of the Brain","authors":"Sriraam Natarajan, Saket Joshi, B. Saha, A. Edwards, Tushar Khot, Elizabeth Moody, K. Kersting, C. Whitlow, J. Maldjian","doi":"10.1109/ICMLA.2012.42","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.42","url":null,"abstract":"Magnetic resonance imaging (MRI) has emerged as an important tool to identify intermediate biomarkers of Alzheimer's disease (AD) due to its ability to measure regional changes in the brain that are thought to reflect disease severity and progression. In this paper, we set out a novel pipeline that uses volumetric MRI data collected from different subjects as input and classifies them into one of three classes: AD, mild cognitive impairment (MCI) and cognitively normal (CN). Our pipeline consists of three stages - (1) a segmentation layer where brain MRI data is divided into clinically relevant regions, (2) a classification layer that uses relational learning algorithms to make pair wise predictions between the three classes, and (3) a combination layer that combines the results of the different classes to obtain the final classification. One of the key features of our proposed approach is that it allows for domain expert's knowledge to guide the learning in all the layers. We evaluate our pipeline on 397 patients acquired from the Alzheimer's Disease Neuroimaging Initiative and demonstrate that it obtains state-of the-art performance with minimal feature engineering.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"225 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130768548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2012 11th International Conference on Machine Learning and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1