Data Mining to Determine the Causes of Gender-Based Violence against Women in Ecuador

Oscar M. Cumbicus-Pineda, Tania E. Abad-Eras, Lisset A. Neyra-Romero
{"title":"Data Mining to Determine the Causes of Gender-Based Violence against Women in Ecuador","authors":"Oscar M. Cumbicus-Pineda, Tania E. Abad-Eras, Lisset A. Neyra-Romero","doi":"10.1109/ETCM53643.2021.9590664","DOIUrl":null,"url":null,"abstract":"In this paper, we applied data mining to determine the causes of gender-based violence against women in Ecuador. We divided the original database into 30 subsets, according to the scopes in which violence occurs. We previously classified these subsets using six algorithms, namely Decision Trees(J48), Exhaustive CHAID, Neural Networks, Nearest Neighbors (IBk), Decision Tables, and Random Forests. The results of this classification showed a bias towards the majority class; for this reason, we applied the SMOTE Synthetic Minority Oversampling Technique to balance the classes and obtain better results. For the predictions of the causes of violence, we used Exhaustive CHAID because our variables are mostly non-binary, and this algorithm allowed us to generate trees with more than two branches. IBk algorithm was the best at globally classifying the data, and Random Forests performed the best in classification precision. The predictions obtained from the 30 subsets of data show that the most common causes for a woman to suffer violence are: when her partner drinks or consumes alcohol or drugs; if her partner is in another relationship; when her husbands suffered violence in childhood; and when the woman was touched without her consent or denigrated for being a woman.","PeriodicalId":438567,"journal":{"name":"2021 IEEE Fifth Ecuador Technical Chapters Meeting (ETCM)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Fifth Ecuador Technical Chapters Meeting (ETCM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ETCM53643.2021.9590664","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In this paper, we applied data mining to determine the causes of gender-based violence against women in Ecuador. We divided the original database into 30 subsets, according to the scopes in which violence occurs. We previously classified these subsets using six algorithms, namely Decision Trees(J48), Exhaustive CHAID, Neural Networks, Nearest Neighbors (IBk), Decision Tables, and Random Forests. The results of this classification showed a bias towards the majority class; for this reason, we applied the SMOTE Synthetic Minority Oversampling Technique to balance the classes and obtain better results. For the predictions of the causes of violence, we used Exhaustive CHAID because our variables are mostly non-binary, and this algorithm allowed us to generate trees with more than two branches. IBk algorithm was the best at globally classifying the data, and Random Forests performed the best in classification precision. The predictions obtained from the 30 subsets of data show that the most common causes for a woman to suffer violence are: when her partner drinks or consumes alcohol or drugs; if her partner is in another relationship; when her husbands suffered violence in childhood; and when the woman was touched without her consent or denigrated for being a woman.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
数据挖掘确定厄瓜多尔基于性别的暴力侵害妇女行为的原因
在本文中,我们应用数据挖掘来确定基于性别的暴力侵害妇女在厄瓜多尔的原因。根据暴力发生的范围,我们将原始数据库分为30个子集。我们之前使用六种算法对这些子集进行分类,即决策树(J48)、穷举CHAID、神经网络、最近邻(IBk)、决策表和随机森林。这种分类的结果显示了对多数类的偏向;为此,我们采用了SMOTE合成少数派过采样技术来平衡类,获得了更好的结果。对于暴力原因的预测,我们使用了穷举式CHAID,因为我们的变量大多是非二进制的,并且这个算法允许我们生成具有两个以上分支的树。IBk算法对数据的全局分类效果最好,随机森林算法对数据的分类精度最好。从30个数据子集中获得的预测表明,妇女遭受暴力的最常见原因是:她的伴侣饮酒或消费酒精或毒品;如果她的伴侣有另一段关系;当她的丈夫在童年遭受暴力时;当女性未经同意就被触碰或因为身为女性而被诋毁时。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Relevant and Non-Redundant Feature Subset Selection Applied to the Detection of Malware in a Network Multi-objective Optimization of Active and Reactive Power to assess Bus Loadability Limit On the Monitoring of the Electromagnetic Fields Accompanying the Seismic and Volcanic Activity of the Chiles Volcano: Preliminary Results Text-based CAPTCHA Vulnerability Assessment using a Deep Learning-based Solver Secure Systems via Reconfigurable Intelligent Surfaces over Correlated Rayleigh Channels
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1