在社交媒体上发现阿片类药物俚语:利用 Reddit 数据的 Word2Vec 方法

E. Holbrook, B. Wiskur, Z. Nagykaldi
{"title":"在社交媒体上发现阿片类药物俚语:利用 Reddit 数据的 Word2Vec 方法","authors":"E. Holbrook,&nbsp;B. Wiskur,&nbsp;Z. Nagykaldi","doi":"10.1016/j.dadr.2024.100302","DOIUrl":null,"url":null,"abstract":"<div><div>The CDC reported that the overdose of prescription or illicit opioids was responsible for the deaths of over 80,000 Americans in 2021. Social media is a valuable source of insight into problematic patterns of substance misuse. The way people converse with illicit drugs in online forums is highly variable, and slang terms are frequently used. Manually identifying names of specific drugs can be difficult in both time and labor.</div></div><div><h3>Subjects and methods</h3><div>The study utilized the Gensim Python library and its Word2Vec neural network model to develop an auto-encoding neural network, enabling the innovative analysis of drug-related discourse downloaded from the Reddit website. The slang terms were then used to qualitatively analyze the topics and categories of drugs discussed on the forum.</div></div><div><h3>Results</h3><div>The inclusion of slang terms facilitated the introduction of 200,000 specific mentions of opioid drugs and that stimulant drugs share a substantial semantic similarity with opioids, a 200 % increase in the number of drug-related terms as compared to using existing datasets.</div></div><div><h3>Conclusions</h3><div>This study advances the academic field with an extended collection of drug-related terms, offering a useful methodology and resource for tackling the opioid crisis with innovative, reduced-time detection and surveillance methods.</div></div>","PeriodicalId":72841,"journal":{"name":"Drug and alcohol dependence reports","volume":"13 ","pages":"Article 100302"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discovering opioid slang on social media: A Word2Vec approach with reddit data\",\"authors\":\"E. Holbrook,&nbsp;B. Wiskur,&nbsp;Z. Nagykaldi\",\"doi\":\"10.1016/j.dadr.2024.100302\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The CDC reported that the overdose of prescription or illicit opioids was responsible for the deaths of over 80,000 Americans in 2021. Social media is a valuable source of insight into problematic patterns of substance misuse. The way people converse with illicit drugs in online forums is highly variable, and slang terms are frequently used. Manually identifying names of specific drugs can be difficult in both time and labor.</div></div><div><h3>Subjects and methods</h3><div>The study utilized the Gensim Python library and its Word2Vec neural network model to develop an auto-encoding neural network, enabling the innovative analysis of drug-related discourse downloaded from the Reddit website. The slang terms were then used to qualitatively analyze the topics and categories of drugs discussed on the forum.</div></div><div><h3>Results</h3><div>The inclusion of slang terms facilitated the introduction of 200,000 specific mentions of opioid drugs and that stimulant drugs share a substantial semantic similarity with opioids, a 200 % increase in the number of drug-related terms as compared to using existing datasets.</div></div><div><h3>Conclusions</h3><div>This study advances the academic field with an extended collection of drug-related terms, offering a useful methodology and resource for tackling the opioid crisis with innovative, reduced-time detection and surveillance methods.</div></div>\",\"PeriodicalId\":72841,\"journal\":{\"name\":\"Drug and alcohol dependence reports\",\"volume\":\"13 \",\"pages\":\"Article 100302\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Drug and alcohol dependence reports\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772724624000866\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Drug and alcohol dependence reports","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772724624000866","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

美国疾病预防控制中心报告称,2021 年,超过 80,000 名美国人死于处方或非法阿片类药物过量。社交媒体是了解药物滥用问题模式的宝贵来源。人们在网络论坛上谈论非法药物的方式千变万化,而且经常使用俚语。这项研究利用 Gensim Python 库及其 Word2Vec 神经网络模型开发了一个自动编码神经网络,从而能够对从 Reddit 网站下载的毒品相关言论进行创新分析。结果纳入俚语有助于引入 20 万个关于阿片类药物的具体提及,而且刺激类药物与阿片类药物在语义上有很大的相似性,与使用现有数据集相比,毒品相关术语的数量增加了 200%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Discovering opioid slang on social media: A Word2Vec approach with reddit data
The CDC reported that the overdose of prescription or illicit opioids was responsible for the deaths of over 80,000 Americans in 2021. Social media is a valuable source of insight into problematic patterns of substance misuse. The way people converse with illicit drugs in online forums is highly variable, and slang terms are frequently used. Manually identifying names of specific drugs can be difficult in both time and labor.

Subjects and methods

The study utilized the Gensim Python library and its Word2Vec neural network model to develop an auto-encoding neural network, enabling the innovative analysis of drug-related discourse downloaded from the Reddit website. The slang terms were then used to qualitatively analyze the topics and categories of drugs discussed on the forum.

Results

The inclusion of slang terms facilitated the introduction of 200,000 specific mentions of opioid drugs and that stimulant drugs share a substantial semantic similarity with opioids, a 200 % increase in the number of drug-related terms as compared to using existing datasets.

Conclusions

This study advances the academic field with an extended collection of drug-related terms, offering a useful methodology and resource for tackling the opioid crisis with innovative, reduced-time detection and surveillance methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Drug and alcohol dependence reports
Drug and alcohol dependence reports Psychiatry and Mental Health
自引率
0.00%
发文量
0
审稿时长
100 days
期刊最新文献
Cannabidiol prescribing in the United States: An analysis of real-world data Discovering opioid slang on social media: A Word2Vec approach with reddit data Large decrease in syringe distribution following the introduction of fentanyl in King County, Washington A systematic review of the reasons for quitting and/or reducing alcohol among those who have received alcohol use disorder treatment The power of lived experience in optimizing US policymakers’ engagement with substance use research: A series of rapid-cycle randomized controlled trials
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1