{"title":"一种处理决策树缺失值的概率方法的复杂性","authors":"Lamis Hawarah, A. Simonet, M. Simonet","doi":"10.1109/SYNASC.2006.70","DOIUrl":null,"url":null,"abstract":"We describe the complexity of an approach to fill missing values in decision trees during classification. This approach is derived from the ordered attribute trees method which builds a decision tree for each attribute and uses these trees to fill the missing attribute values. Both our approach and theirs are based on the mutual information between the attributes and the class. Our method takes into account the dependence between attributes by using mutual information. The result of the classification process is a probability distribution instead of a single class. In this paper, we explain our classification algorithm. We then calculate the complexity of constructing the attribute trees and the complexity of classifying a new instance with missing values using our classification algorithm","PeriodicalId":309740,"journal":{"name":"2006 Eighth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2006-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"The Complexity of a Probabilistic Approach to Deal with Missing Values in a Decision Tree\",\"authors\":\"Lamis Hawarah, A. Simonet, M. Simonet\",\"doi\":\"10.1109/SYNASC.2006.70\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We describe the complexity of an approach to fill missing values in decision trees during classification. This approach is derived from the ordered attribute trees method which builds a decision tree for each attribute and uses these trees to fill the missing attribute values. Both our approach and theirs are based on the mutual information between the attributes and the class. Our method takes into account the dependence between attributes by using mutual information. The result of the classification process is a probability distribution instead of a single class. In this paper, we explain our classification algorithm. We then calculate the complexity of constructing the attribute trees and the complexity of classifying a new instance with missing values using our classification algorithm\",\"PeriodicalId\":309740,\"journal\":{\"name\":\"2006 Eighth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 Eighth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SYNASC.2006.70\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 Eighth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SYNASC.2006.70","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The Complexity of a Probabilistic Approach to Deal with Missing Values in a Decision Tree
We describe the complexity of an approach to fill missing values in decision trees during classification. This approach is derived from the ordered attribute trees method which builds a decision tree for each attribute and uses these trees to fill the missing attribute values. Both our approach and theirs are based on the mutual information between the attributes and the class. Our method takes into account the dependence between attributes by using mutual information. The result of the classification process is a probability distribution instead of a single class. In this paper, we explain our classification algorithm. We then calculate the complexity of constructing the attribute trees and the complexity of classifying a new instance with missing values using our classification algorithm