{"title":"利用机器学习方法学习哮喘与气象事件之间的关系","authors":"Alibek Zhakubayev, A. Yazıcı","doi":"10.1109/AICT47866.2019.8981778","DOIUrl":null,"url":null,"abstract":"In this article, a new methodology is proposed by using the relationships between meteorological events and asthma cases of asthma patients in a region compared to other regions in a country. We focus on the impact of weather conditions on asthma in order to estimate asthma cases using machine learning methods based on meteorological events only. In order to increase the success of the estimates, in addition to the 10 features identified by the National Environmental Information Centers, we create some new semi-synthetic features by using the multiplication and addition operations on the features given after the scaling. Then, we use machine learning methods and the R-square coefficient approach to learn the effective features using the features obtained from publicly available data sets for Russia. After determining the effective features, we use three different machine learning algorithms: random forest, linear regression, and kernel ridge regression algorithms. We use transfer learning to store effective features obtained from a dataset for Russia and then apply them to a dataset for Kazakhstan. Our hypothesis is that a combination of the selected semi-synthetic properties of the random forest algorithm has the best performance accuracy for this application. The model successfully identifies (predicts) very high, high, medium, low or very low numbers of people with asthma for the first time in the region.","PeriodicalId":329473,"journal":{"name":"2019 IEEE 13th International Conference on Application of Information and Communication Technologies (AICT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning the Relationship between Asthma and Meteorological Events by Using Machine Learning Methods\",\"authors\":\"Alibek Zhakubayev, A. Yazıcı\",\"doi\":\"10.1109/AICT47866.2019.8981778\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this article, a new methodology is proposed by using the relationships between meteorological events and asthma cases of asthma patients in a region compared to other regions in a country. We focus on the impact of weather conditions on asthma in order to estimate asthma cases using machine learning methods based on meteorological events only. In order to increase the success of the estimates, in addition to the 10 features identified by the National Environmental Information Centers, we create some new semi-synthetic features by using the multiplication and addition operations on the features given after the scaling. Then, we use machine learning methods and the R-square coefficient approach to learn the effective features using the features obtained from publicly available data sets for Russia. After determining the effective features, we use three different machine learning algorithms: random forest, linear regression, and kernel ridge regression algorithms. We use transfer learning to store effective features obtained from a dataset for Russia and then apply them to a dataset for Kazakhstan. Our hypothesis is that a combination of the selected semi-synthetic properties of the random forest algorithm has the best performance accuracy for this application. The model successfully identifies (predicts) very high, high, medium, low or very low numbers of people with asthma for the first time in the region.\",\"PeriodicalId\":329473,\"journal\":{\"name\":\"2019 IEEE 13th International Conference on Application of Information and Communication Technologies (AICT)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 13th International Conference on Application of Information and Communication Technologies (AICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AICT47866.2019.8981778\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 13th International Conference on Application of Information and Communication Technologies (AICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICT47866.2019.8981778","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Learning the Relationship between Asthma and Meteorological Events by Using Machine Learning Methods
In this article, a new methodology is proposed by using the relationships between meteorological events and asthma cases of asthma patients in a region compared to other regions in a country. We focus on the impact of weather conditions on asthma in order to estimate asthma cases using machine learning methods based on meteorological events only. In order to increase the success of the estimates, in addition to the 10 features identified by the National Environmental Information Centers, we create some new semi-synthetic features by using the multiplication and addition operations on the features given after the scaling. Then, we use machine learning methods and the R-square coefficient approach to learn the effective features using the features obtained from publicly available data sets for Russia. After determining the effective features, we use three different machine learning algorithms: random forest, linear regression, and kernel ridge regression algorithms. We use transfer learning to store effective features obtained from a dataset for Russia and then apply them to a dataset for Kazakhstan. Our hypothesis is that a combination of the selected semi-synthetic properties of the random forest algorithm has the best performance accuracy for this application. The model successfully identifies (predicts) very high, high, medium, low or very low numbers of people with asthma for the first time in the region.