机器学习和时间序列模型在预测女性劳动力参与方面的比较分析。

IF 3.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE PeerJ Computer Science Pub Date : 2024-11-11 eCollection Date: 2024-01-01 DOI:10.7717/peerj-cs.2430
Rasha Elstohy, Nevein Aneis, Eman Mounir Ali
{"title":"机器学习和时间序列模型在预测女性劳动力参与方面的比较分析。","authors":"Rasha Elstohy, Nevein Aneis, Eman Mounir Ali","doi":"10.7717/peerj-cs.2430","DOIUrl":null,"url":null,"abstract":"<p><p>Labor force participation of Egyptian women has been a chronic economic problem in Egypt. Despite the improvement in the human capital front, whether on the education or health indicators, female labor force participation remains persistently low. This study proposes a hybrid machine-learning model that integrates principal component analysis (PCA) for feature extraction with various machine learning and time-series models to predict women's employment in times of crisis. Various machine learning (ML) algorithms, such as support vector machine (SVM), neural network, K-nearest neighbor (KNN), linear regression, random forest, and AdaBoost, in addition to popular time series algorithms, including autoregressive integrated moving average (ARIMA) and vector autoregressive (VAR) models, have been applied to an actual dataset from the public sector. The manpower dataset considered gender from different regions, ages, and educational levels. The dataset was then trained, tested, and evaluated. For performance validation, forecasting accuracy metrics were constructed using mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), mean absolute percent error (MAPE), R-squared (R2), and cross-validated root mean squared error (CVRMSE). Another Dickey-Fuller test was performed to evaluate and compare the accuracy of the applied models, and the results showed that AdaBoost outperforms the other methods by an accuracy of 100%. Compared to alternative works, our findings demonstrate a comprehensive comparative analysis for predicting women's participation in different regions during an economic crisis.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2430"},"PeriodicalIF":3.5000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11622832/pdf/","citationCount":"0","resultStr":"{\"title\":\"A comparative analysis of variants of machine learning and time series models in predicting women's participation in the labor force.\",\"authors\":\"Rasha Elstohy, Nevein Aneis, Eman Mounir Ali\",\"doi\":\"10.7717/peerj-cs.2430\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Labor force participation of Egyptian women has been a chronic economic problem in Egypt. Despite the improvement in the human capital front, whether on the education or health indicators, female labor force participation remains persistently low. This study proposes a hybrid machine-learning model that integrates principal component analysis (PCA) for feature extraction with various machine learning and time-series models to predict women's employment in times of crisis. Various machine learning (ML) algorithms, such as support vector machine (SVM), neural network, K-nearest neighbor (KNN), linear regression, random forest, and AdaBoost, in addition to popular time series algorithms, including autoregressive integrated moving average (ARIMA) and vector autoregressive (VAR) models, have been applied to an actual dataset from the public sector. The manpower dataset considered gender from different regions, ages, and educational levels. The dataset was then trained, tested, and evaluated. For performance validation, forecasting accuracy metrics were constructed using mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), mean absolute percent error (MAPE), R-squared (R2), and cross-validated root mean squared error (CVRMSE). Another Dickey-Fuller test was performed to evaluate and compare the accuracy of the applied models, and the results showed that AdaBoost outperforms the other methods by an accuracy of 100%. Compared to alternative works, our findings demonstrate a comprehensive comparative analysis for predicting women's participation in different regions during an economic crisis.</p>\",\"PeriodicalId\":54224,\"journal\":{\"name\":\"PeerJ Computer Science\",\"volume\":\"10 \",\"pages\":\"e2430\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2024-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11622832/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PeerJ Computer Science\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.7717/peerj-cs.2430\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.7717/peerj-cs.2430","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

埃及妇女参加劳动一直是埃及的一个长期经济问题。尽管人力资本方面有所改善,无论是在教育还是健康指标方面,但女性劳动力参与率仍然很低。本研究提出了一种混合机器学习模型,该模型将用于特征提取的主成分分析(PCA)与各种机器学习和时间序列模型相结合,以预测女性在危机时期的就业情况。各种机器学习(ML)算法,如支持向量机(SVM)、神经网络、k近邻(KNN)、线性回归、随机森林和AdaBoost,以及流行的时间序列算法,包括自回归综合移动平均(ARIMA)和向量自回归(VAR)模型,已应用于公共部门的实际数据集。人力数据集考虑了不同地区、年龄和教育水平的性别。然后对数据集进行训练、测试和评估。为了进行性能验证,使用均方误差(MSE)、均方根误差(RMSE)、平均绝对误差(MAE)、平均绝对百分比误差(MAPE)、r平方(R2)和交叉验证的均方根误差(CVRMSE)构建预测精度指标。进行另一个Dickey-Fuller测试来评估和比较应用模型的准确性,结果表明AdaBoost优于其他方法,准确率为100%。与其他研究相比,我们的研究结果为预测经济危机期间不同地区的妇女参与情况提供了全面的比较分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A comparative analysis of variants of machine learning and time series models in predicting women's participation in the labor force.

Labor force participation of Egyptian women has been a chronic economic problem in Egypt. Despite the improvement in the human capital front, whether on the education or health indicators, female labor force participation remains persistently low. This study proposes a hybrid machine-learning model that integrates principal component analysis (PCA) for feature extraction with various machine learning and time-series models to predict women's employment in times of crisis. Various machine learning (ML) algorithms, such as support vector machine (SVM), neural network, K-nearest neighbor (KNN), linear regression, random forest, and AdaBoost, in addition to popular time series algorithms, including autoregressive integrated moving average (ARIMA) and vector autoregressive (VAR) models, have been applied to an actual dataset from the public sector. The manpower dataset considered gender from different regions, ages, and educational levels. The dataset was then trained, tested, and evaluated. For performance validation, forecasting accuracy metrics were constructed using mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), mean absolute percent error (MAPE), R-squared (R2), and cross-validated root mean squared error (CVRMSE). Another Dickey-Fuller test was performed to evaluate and compare the accuracy of the applied models, and the results showed that AdaBoost outperforms the other methods by an accuracy of 100%. Compared to alternative works, our findings demonstrate a comprehensive comparative analysis for predicting women's participation in different regions during an economic crisis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
PeerJ Computer Science
PeerJ Computer Science Computer Science-General Computer Science
CiteScore
6.10
自引率
5.30%
发文量
332
审稿时长
10 weeks
期刊介绍: PeerJ Computer Science is the new open access journal covering all subject areas in computer science, with the backing of a prestigious advisory board and more than 300 academic editors.
期刊最新文献
Applying auxiliary supervised depth-assisted transformer and cross modal attention fusion in monocular 3D object detection. Process mining applications in healthcare: a systematic literature review. Social media network public opinion emotion classification method based on multi-feature fusion and multi-scale hybrid neural network. Deep learning for enhanced risk management: a novel approach to analyzing financial reports. A comparative study of the performance of ten metaheuristic algorithms for parameter estimation of solar photovoltaic models.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1