{"title":"你的婚姻可靠吗?:用机器学习算法进行离婚分析","authors":"Jue Kong, Tianrui Chai","doi":"10.1145/3404555.3404559","DOIUrl":null,"url":null,"abstract":"In recent years, global divorce rate is still high. What kind of couple will divorce and what factors lead to divorce are important problems that worth studying. In this paper, we apply three machine learning algorithms (Support Vector Machine (SVM), Random forest (RF) and Natural Gradient Boosting (NGBoost)) on a divorce prediction dataset. The dataset consists of 170 samples, each of which contains 54 questions about the couple's emotional status. We regard the scores of 54 questions as the features of each sample to apply our machine learning algorithms. Compared with SVM and RF, NGBoost has superior performance as NGBoost can achieve 0.9833 accuracy, 0.9769 precision and 0.9828 F1 score. In addition, we also show the most important features in the model of RF and NGBoost to find the most important factors which lead to divorce.","PeriodicalId":220526,"journal":{"name":"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence","volume":"38 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Is Your Marriage Reliable?: Divorce Analysis with Machine Learning Algorithms\",\"authors\":\"Jue Kong, Tianrui Chai\",\"doi\":\"10.1145/3404555.3404559\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, global divorce rate is still high. What kind of couple will divorce and what factors lead to divorce are important problems that worth studying. In this paper, we apply three machine learning algorithms (Support Vector Machine (SVM), Random forest (RF) and Natural Gradient Boosting (NGBoost)) on a divorce prediction dataset. The dataset consists of 170 samples, each of which contains 54 questions about the couple's emotional status. We regard the scores of 54 questions as the features of each sample to apply our machine learning algorithms. Compared with SVM and RF, NGBoost has superior performance as NGBoost can achieve 0.9833 accuracy, 0.9769 precision and 0.9828 F1 score. In addition, we also show the most important features in the model of RF and NGBoost to find the most important factors which lead to divorce.\",\"PeriodicalId\":220526,\"journal\":{\"name\":\"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence\",\"volume\":\"38 6\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3404555.3404559\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3404555.3404559","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Is Your Marriage Reliable?: Divorce Analysis with Machine Learning Algorithms
In recent years, global divorce rate is still high. What kind of couple will divorce and what factors lead to divorce are important problems that worth studying. In this paper, we apply three machine learning algorithms (Support Vector Machine (SVM), Random forest (RF) and Natural Gradient Boosting (NGBoost)) on a divorce prediction dataset. The dataset consists of 170 samples, each of which contains 54 questions about the couple's emotional status. We regard the scores of 54 questions as the features of each sample to apply our machine learning algorithms. Compared with SVM and RF, NGBoost has superior performance as NGBoost can achieve 0.9833 accuracy, 0.9769 precision and 0.9828 F1 score. In addition, we also show the most important features in the model of RF and NGBoost to find the most important factors which lead to divorce.