社交媒体(Twitter)在分析家庭暴力中的作用:一种机器学习方法

S. Adeeba, Kuhaneswaran Banujan, B. Kumara
{"title":"社交媒体(Twitter)在分析家庭暴力中的作用:一种机器学习方法","authors":"S. Adeeba, Kuhaneswaran Banujan, B. Kumara","doi":"10.1109/SCSE59836.2023.10215027","DOIUrl":null,"url":null,"abstract":"Home Violence (HV) has been a persistent issue across the globe, transcending economic status and cultural boundaries. The COVID-19 pandemic has further exacerbated this problem, bringing it to the forefront of public discourse. This study aims to analyse the impact of HV by utilising Twitter data and Machine Learning (ML) techniques, categorising tweets into three groups: (i) HV Incident Tweets, (ii) HV Awareness Tweets, and (iii) HV Shelter Tweets. This categorisation provides several advantages, such as uncovering new or hidden evidence, filling information gaps, and identifying potential suspects. Over 40,000 tweets were collected using the Twitter API between April 2019 and July 2021. Data pre-processing and word embedding were performed to prepare the data for analysis. Initially, tweets were categorised into HV Positive (containing relevant information) and HV Negative (noise or unrelated content) groups. Manually labelled tweets were used for training and testing purposes. Machine learning models, including SVM, NB, Logistic Regression, Decision Tree (DT), Artificial Neural Networks (ANN), and LSTM, were employed for this task. Subsequently, HV Positive tweets were classified into the three aforementioned categories. Manually labelled tweets were again used for training and testing. Models such as Tf-IDF+SVM, Tf-IDF+DT, Tf-IDF+NB, and GloVe+LSTM were utilised. Several evaluation metrics were used to assess the performance of the models. The study’s results provide important new understandings of the prevalence, patterns, and causes of HV as they are reported on social media and how the general population reacts to these problems. The research clarifies how social media may help spread knowledge, provide assistance, and link victims to resources. These insights can be instrumental in informing policymakers, non-profit organisations, and researchers as they work to develop targeted interventions and strategies to address HV during and beyond the COVID-19 pandemic.","PeriodicalId":429228,"journal":{"name":"2023 International Research Conference on Smart Computing and Systems Engineering (SCSE)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Role of Social Media (Twitter) in Analysing Home Violence: A Machine Learning Approach\",\"authors\":\"S. Adeeba, Kuhaneswaran Banujan, B. Kumara\",\"doi\":\"10.1109/SCSE59836.2023.10215027\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Home Violence (HV) has been a persistent issue across the globe, transcending economic status and cultural boundaries. The COVID-19 pandemic has further exacerbated this problem, bringing it to the forefront of public discourse. This study aims to analyse the impact of HV by utilising Twitter data and Machine Learning (ML) techniques, categorising tweets into three groups: (i) HV Incident Tweets, (ii) HV Awareness Tweets, and (iii) HV Shelter Tweets. This categorisation provides several advantages, such as uncovering new or hidden evidence, filling information gaps, and identifying potential suspects. Over 40,000 tweets were collected using the Twitter API between April 2019 and July 2021. Data pre-processing and word embedding were performed to prepare the data for analysis. Initially, tweets were categorised into HV Positive (containing relevant information) and HV Negative (noise or unrelated content) groups. Manually labelled tweets were used for training and testing purposes. Machine learning models, including SVM, NB, Logistic Regression, Decision Tree (DT), Artificial Neural Networks (ANN), and LSTM, were employed for this task. Subsequently, HV Positive tweets were classified into the three aforementioned categories. Manually labelled tweets were again used for training and testing. Models such as Tf-IDF+SVM, Tf-IDF+DT, Tf-IDF+NB, and GloVe+LSTM were utilised. Several evaluation metrics were used to assess the performance of the models. The study’s results provide important new understandings of the prevalence, patterns, and causes of HV as they are reported on social media and how the general population reacts to these problems. The research clarifies how social media may help spread knowledge, provide assistance, and link victims to resources. These insights can be instrumental in informing policymakers, non-profit organisations, and researchers as they work to develop targeted interventions and strategies to address HV during and beyond the COVID-19 pandemic.\",\"PeriodicalId\":429228,\"journal\":{\"name\":\"2023 International Research Conference on Smart Computing and Systems Engineering (SCSE)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Research Conference on Smart Computing and Systems Engineering (SCSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCSE59836.2023.10215027\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Research Conference on Smart Computing and Systems Engineering (SCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCSE59836.2023.10215027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

家庭暴力(HV)在全球范围内一直是一个持续存在的问题,超越了经济地位和文化界限。2019冠状病毒病大流行进一步加剧了这一问题,使其成为公众讨论的焦点。本研究旨在利用Twitter数据和机器学习(ML)技术分析HV的影响,将推文分为三组:(i) HV事件推文,(ii) HV意识推文,(iii) HV庇护所推文。这种分类提供了几个优点,例如发现新的或隐藏的证据,填补信息空白,以及识别潜在的嫌疑人。2019年4月至2021年7月期间,使用Twitter API收集了4万多条推文。进行数据预处理和词嵌入,为数据分析做准备。最初,推文被分为HV Positive(包含相关信息)和HV Negative(噪音或不相关内容)组。手动标记推文用于训练和测试目的。机器学习模型包括SVM、NB、Logistic回归、决策树(DT)、人工神经网络(ANN)和LSTM。随后,HV Positive推文被分为上述三类。人工标记的推文再次用于训练和测试。采用Tf-IDF+SVM、Tf-IDF+DT、Tf-IDF+NB、GloVe+LSTM等模型。几个评估指标被用来评估模型的性能。这项研究的结果为社交媒体上报道的艾滋病毒的流行、模式和原因以及普通人群对这些问题的反应提供了重要的新认识。该研究阐明了社交媒体如何帮助传播知识、提供援助以及将受害者与资源联系起来。这些见解有助于决策者、非营利组织和研究人员在COVID-19大流行期间和之后制定有针对性的干预措施和战略,以应对艾滋病毒。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
The Role of Social Media (Twitter) in Analysing Home Violence: A Machine Learning Approach
Home Violence (HV) has been a persistent issue across the globe, transcending economic status and cultural boundaries. The COVID-19 pandemic has further exacerbated this problem, bringing it to the forefront of public discourse. This study aims to analyse the impact of HV by utilising Twitter data and Machine Learning (ML) techniques, categorising tweets into three groups: (i) HV Incident Tweets, (ii) HV Awareness Tweets, and (iii) HV Shelter Tweets. This categorisation provides several advantages, such as uncovering new or hidden evidence, filling information gaps, and identifying potential suspects. Over 40,000 tweets were collected using the Twitter API between April 2019 and July 2021. Data pre-processing and word embedding were performed to prepare the data for analysis. Initially, tweets were categorised into HV Positive (containing relevant information) and HV Negative (noise or unrelated content) groups. Manually labelled tweets were used for training and testing purposes. Machine learning models, including SVM, NB, Logistic Regression, Decision Tree (DT), Artificial Neural Networks (ANN), and LSTM, were employed for this task. Subsequently, HV Positive tweets were classified into the three aforementioned categories. Manually labelled tweets were again used for training and testing. Models such as Tf-IDF+SVM, Tf-IDF+DT, Tf-IDF+NB, and GloVe+LSTM were utilised. Several evaluation metrics were used to assess the performance of the models. The study’s results provide important new understandings of the prevalence, patterns, and causes of HV as they are reported on social media and how the general population reacts to these problems. The research clarifies how social media may help spread knowledge, provide assistance, and link victims to resources. These insights can be instrumental in informing policymakers, non-profit organisations, and researchers as they work to develop targeted interventions and strategies to address HV during and beyond the COVID-19 pandemic.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Exploring Music Similarity through Siamese CNNs using Triplet Loss on Music Samples Impacts of Integrated Railway-Based Containerized Cargo Transport Network to Connect the Port of Colombo and Free Trade Zones in Sri Lanka Investigating Factors Influencing Behavioral Intention Toward Green Computing Practices Among Undergraduates In Sri Lankan Universities Preserving India’s Rich Dance Heritage: A Classification of Indian Dance Forms and Innovative Digital Management Solutions for Cultural Heritage Conservation An Automatic Density Cluster Generation Method to Identify the Amount of Tool Flank Wear via Tool Vibration
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1