不同机器学习分类器对自杀意念和自杀内容检测的准确率分析

Divya Dewangan, Smita Selot, Sreejit Panicker
{"title":"不同机器学习分类器对自杀意念和自杀内容检测的准确率分析","authors":"Divya Dewangan, Smita Selot, Sreejit Panicker","doi":"10.51983/ajes-2023.12.1.3694","DOIUrl":null,"url":null,"abstract":"Suicide is the matter of purposely causing one’s death and suicidal ideation refers to thoughts or preoccupations with ending one’s own life. Studies have explored verbal and written communications related to suicide, including analyzing suicide notes, online discussions, and social media posts to identify linguistic and content markers that may help in early detection and intervention. The primary purpose of this study is to detect signs of risk of suicide/self-harm in social media users by investigating several frequency-based featuring and prediction-based featuring methods along with different baseline machine learning classifiers. The algorithms applied for analysis are Decision Tree, K-Nearest Neighbors, Random Forest, Multinomial Naïve Bayes, and SVM. Our experimental results showed that the best performance is obtained by the FastText embedding with SVM model having the highest accuracy of 93.76% which outperforms other baselines. The aim of this work is to learn the significance of analysis and do a comparative study of algorithms to find the best suited algorithm.","PeriodicalId":365290,"journal":{"name":"Asian Journal of Electrical Sciences","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Accuracy Analysis of Different Machine Learning Classifiers for Detecting Suicidal Ideation and Content\",\"authors\":\"Divya Dewangan, Smita Selot, Sreejit Panicker\",\"doi\":\"10.51983/ajes-2023.12.1.3694\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Suicide is the matter of purposely causing one’s death and suicidal ideation refers to thoughts or preoccupations with ending one’s own life. Studies have explored verbal and written communications related to suicide, including analyzing suicide notes, online discussions, and social media posts to identify linguistic and content markers that may help in early detection and intervention. The primary purpose of this study is to detect signs of risk of suicide/self-harm in social media users by investigating several frequency-based featuring and prediction-based featuring methods along with different baseline machine learning classifiers. The algorithms applied for analysis are Decision Tree, K-Nearest Neighbors, Random Forest, Multinomial Naïve Bayes, and SVM. Our experimental results showed that the best performance is obtained by the FastText embedding with SVM model having the highest accuracy of 93.76% which outperforms other baselines. The aim of this work is to learn the significance of analysis and do a comparative study of algorithms to find the best suited algorithm.\",\"PeriodicalId\":365290,\"journal\":{\"name\":\"Asian Journal of Electrical Sciences\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asian Journal of Electrical Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.51983/ajes-2023.12.1.3694\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Electrical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.51983/ajes-2023.12.1.3694","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

自杀是故意造成死亡的行为,自杀意念是指结束自己生命的想法或念头。研究探索了与自杀有关的口头和书面交流,包括分析自杀遗书、在线讨论和社交媒体帖子,以确定可能有助于早期发现和干预的语言和内容标记。本研究的主要目的是通过研究几种基于频率的特征和基于预测的特征方法以及不同的基线机器学习分类器,来检测社交媒体用户自杀/自残风险的迹象。用于分析的算法有决策树、k近邻、随机森林、多项式Naïve贝叶斯和支持向量机。实验结果表明,基于SVM模型的FastText嵌入效果最好,准确率高达93.76%,优于其他基线。本工作的目的是了解分析的意义,并对算法进行比较研究,以找到最适合的算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
The Accuracy Analysis of Different Machine Learning Classifiers for Detecting Suicidal Ideation and Content
Suicide is the matter of purposely causing one’s death and suicidal ideation refers to thoughts or preoccupations with ending one’s own life. Studies have explored verbal and written communications related to suicide, including analyzing suicide notes, online discussions, and social media posts to identify linguistic and content markers that may help in early detection and intervention. The primary purpose of this study is to detect signs of risk of suicide/self-harm in social media users by investigating several frequency-based featuring and prediction-based featuring methods along with different baseline machine learning classifiers. The algorithms applied for analysis are Decision Tree, K-Nearest Neighbors, Random Forest, Multinomial Naïve Bayes, and SVM. Our experimental results showed that the best performance is obtained by the FastText embedding with SVM model having the highest accuracy of 93.76% which outperforms other baselines. The aim of this work is to learn the significance of analysis and do a comparative study of algorithms to find the best suited algorithm.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Power System Security Protection in Microgrids Based on Advanced Metering Infrastructure Parameter Optimization of Refrigeration Chiller by Machine Learning Analysing and Optimizing the Refrigeration System Using Machine Learning Algorithm The Accuracy Analysis of Different Machine Learning Classifiers for Detecting Suicidal Ideation and Content A Novel Shaped Four Port MIMO Antenna for Wireless Communication
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1