{"title":"A Robust Prediction Method for Interval Symbolic Data","authors":"Roberta Fagundes, R. Souza, F. Cysneiros","doi":"10.1109/ISDA.2009.36","DOIUrl":null,"url":null,"abstract":"This paper introduces a robust prediction method for symbolic interval data based on the simple linear regression methodology. Each example of the data set is described by feature vector, for which each feature is an interval. Two classic robust regression models are fitted, respectively for range and mid-points of the interval values assumed by the variables in the data set. The prediction of the lower and upper bounds of the new intervals is performed from these fits. To validate this model, experiments with a synthetic interval data set and an application with a cardiology interval-valued data set are considered. The fit and prediction qualities are assessed by a pooled root mean square error measure calculated from learning and test data sets, respectively.","PeriodicalId":330324,"journal":{"name":"2009 Ninth International Conference on Intelligent Systems Design and Applications","volume":"96 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Ninth International Conference on Intelligent Systems Design and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISDA.2009.36","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This paper introduces a robust prediction method for symbolic interval data based on the simple linear regression methodology. Each example of the data set is described by feature vector, for which each feature is an interval. Two classic robust regression models are fitted, respectively for range and mid-points of the interval values assumed by the variables in the data set. The prediction of the lower and upper bounds of the new intervals is performed from these fits. To validate this model, experiments with a synthetic interval data set and an application with a cardiology interval-valued data set are considered. The fit and prediction qualities are assessed by a pooled root mean square error measure calculated from learning and test data sets, respectively.