{"title":"一种基于分类的模糊规则代理模型,以帮助解决大容量数据集的全模型选择问题","authors":"Ángel Díaz-Pacheco, C. García","doi":"10.1080/0952813X.2021.1925972","DOIUrl":null,"url":null,"abstract":"ABSTRACT Improvement of accuracy in classifiers is a crucial topic in the machine learning field. The problem has been addressed, making new algorithms and selecting the fittest classifier for a given dataset. The latter approach combined with feature selection and pre-processing form up a new paradigm known as Full Model Selection. This paradigm is like a black box whose input is a dataset, and as an output, a precise classification model is obtained. Despite that, full model selection is not the first alternative with the larger datasets of nowadays. We propose the use of MapReduce to deal with huge datasets, a bio-inspired optimisation algorithm and the use of a novel algorithm based on fuzzy classification rules as a proxy model to guide the optimisation process. To the best of our knowledge, this work is the first to propose a classification algorithm based on fuzzy rules as a proxy model. Obtained results showed an accuracy improvement and a considerable reduction of the computing time in datasets of a wide range of sizes.","PeriodicalId":15677,"journal":{"name":"Journal of Experimental & Theoretical Artificial Intelligence","volume":"3 1","pages":"815 - 844"},"PeriodicalIF":1.7000,"publicationDate":"2021-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A classification-based fuzzy-rules proxy model to assist in the full model selection problem in high volume datasets\",\"authors\":\"Ángel Díaz-Pacheco, C. García\",\"doi\":\"10.1080/0952813X.2021.1925972\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT Improvement of accuracy in classifiers is a crucial topic in the machine learning field. The problem has been addressed, making new algorithms and selecting the fittest classifier for a given dataset. The latter approach combined with feature selection and pre-processing form up a new paradigm known as Full Model Selection. This paradigm is like a black box whose input is a dataset, and as an output, a precise classification model is obtained. Despite that, full model selection is not the first alternative with the larger datasets of nowadays. We propose the use of MapReduce to deal with huge datasets, a bio-inspired optimisation algorithm and the use of a novel algorithm based on fuzzy classification rules as a proxy model to guide the optimisation process. To the best of our knowledge, this work is the first to propose a classification algorithm based on fuzzy rules as a proxy model. Obtained results showed an accuracy improvement and a considerable reduction of the computing time in datasets of a wide range of sizes.\",\"PeriodicalId\":15677,\"journal\":{\"name\":\"Journal of Experimental & Theoretical Artificial Intelligence\",\"volume\":\"3 1\",\"pages\":\"815 - 844\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2021-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Experimental & Theoretical Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1080/0952813X.2021.1925972\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental & Theoretical Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1080/0952813X.2021.1925972","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
A classification-based fuzzy-rules proxy model to assist in the full model selection problem in high volume datasets
ABSTRACT Improvement of accuracy in classifiers is a crucial topic in the machine learning field. The problem has been addressed, making new algorithms and selecting the fittest classifier for a given dataset. The latter approach combined with feature selection and pre-processing form up a new paradigm known as Full Model Selection. This paradigm is like a black box whose input is a dataset, and as an output, a precise classification model is obtained. Despite that, full model selection is not the first alternative with the larger datasets of nowadays. We propose the use of MapReduce to deal with huge datasets, a bio-inspired optimisation algorithm and the use of a novel algorithm based on fuzzy classification rules as a proxy model to guide the optimisation process. To the best of our knowledge, this work is the first to propose a classification algorithm based on fuzzy rules as a proxy model. Obtained results showed an accuracy improvement and a considerable reduction of the computing time in datasets of a wide range of sizes.
期刊介绍:
Journal of Experimental & Theoretical Artificial Intelligence (JETAI) is a world leading journal dedicated to publishing high quality, rigorously reviewed, original papers in artificial intelligence (AI) research.
The journal features work in all subfields of AI research and accepts both theoretical and applied research. Topics covered include, but are not limited to, the following:
• cognitive science
• games
• learning
• knowledge representation
• memory and neural system modelling
• perception
• problem-solving