An automated machine learning model for diagnosing coronavirus disease 2019 (COVID-19) infection
Noor Maher, Suhad A. Yousif
IAES International Journal of Artificial Intelligence, published 2023-09-01
DOI: 10.11591/ijai.v12.i3.pp1360-1369 (https://doi.org/10.11591/ijai.v12.i3.pp1360-1369)
The coronavirus disease 2019 (COVID-19) epidemic still affects every facet of life and calls for fast, accurate diagnosis. An effective, rapid, and precise method is needed to reduce radiologists' workload in diagnosing suspected cases. This study used the tree-based pipeline optimization tool (TPOT) alongside several machine learning (ML) algorithms. TPOT is an open-source, genetic-programming-based AutoML system that optimizes a series of feature preprocessors and ML models to maximize classification accuracy on a supervised classification problem. A series of trials, compared against the results of the ML algorithms and of earlier studies, showed that AutoML outperformed traditional ML in accuracy in most cases. A blood test dataset with 111 variables and 5644 cases was used. In TPOT, 450 pipelines were evaluated; the best pipeline combined radial basis function (RBF) Sampler preprocessing with a gradient boosting classifier and reached 99% accuracy.
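
To make the shape of the reported best pipeline concrete, the following is a minimal sketch in Python. It assumes that the "RBF Sampler" and "Gradient boosting classifier" steps correspond to scikit-learn's `RBFSampler` and `GradientBoostingClassifier`, which are the operators TPOT typically draws on. The abstract does not report hyperparameters, so `gamma`, `n_components`, and `n_estimators` below are placeholder assumptions, and the data is a synthetic stand-in generated with `make_classification`, not the study's blood test dataset. The printed accuracy therefore says nothing about the 99% figure.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.kernel_approximation import RBFSampler
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline

# Synthetic binary-classification data (assumption: a stand-in only,
# NOT the 111-variable, 5644-case blood test dataset from the study).
X, y = make_classification(
    n_samples=600, n_features=20, n_informative=8, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Two-step pipeline mirroring the structure named in the abstract:
# an RBF kernel approximation followed by gradient boosting.
# All hyperparameter values here are illustrative assumptions.
pipeline = make_pipeline(
    RBFSampler(gamma=0.1, n_components=100, random_state=0),
    GradientBoostingClassifier(n_estimators=50, random_state=0),
)

pipeline.fit(X_train, y_train)
accuracy = pipeline.score(X_test, y_test)  # mean accuracy on held-out data
print(f"held-out accuracy on synthetic data: {accuracy:.3f}")
```

In an AutoML setting, TPOT would search over many such preprocessor and model combinations and their hyperparameters instead of fixing them by hand as done here.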