Kurdish social media sentiment corpus: Misyar marriage perspectives

IF 1 Q3 MULTIDISCIPLINARY SCIENCES Data in Brief Pub Date : 2024-10-03 DOI:10.1016/j.dib.2024.110989
Sarkhel H. Taher Karim
{"title":"Kurdish social media sentiment corpus: Misyar marriage perspectives","authors":"Sarkhel H. Taher Karim","doi":"10.1016/j.dib.2024.110989","DOIUrl":null,"url":null,"abstract":"<div><div>This article presents a thorough compilation of 5108 Central Kurdish comments taken from YouTube and Facebook. The purpose of compiling the dataset was to investigate public perceptions of Misyar marriage, a non-traditional form of marriage, in the Kurdistan region. The goal of the 135-day data collection period was to gather comments from specific public pages on these social media platforms. there are two columns in the dataset: sentiments and comments. The sentiments column classifies each comment into one of eight sentiment labels: Positive, Negative, Neutral, Sarcastic or Humorous, Suggestive, Dismissive, Skeptical, and Curious. The comments column contains the text of the comments in Central Kurdish. To improve the quality and uniformity of the data, a great deal of preprocessing was done to address problems like noise removal, character replacement, and space adjustments.</div><div>Researchers interested in sentiment analysis, social media studies, Islamic studies, and Kurdish cultural practices will find the dataset to be a useful resource. It can be used for sentiment analysis, trend analysis, linguistic studies, and other analyses. It provides insights into the public discourse surrounding Misyar marriage. The labeled data can aid in the creation of machine learning models and further our knowledge of societal perceptions of emerging religious trends<em>.</em></div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":null,"pages":null},"PeriodicalIF":1.0000,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S235234092400951X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

This article presents a thorough compilation of 5108 Central Kurdish comments taken from YouTube and Facebook. The purpose of compiling the dataset was to investigate public perceptions of Misyar marriage, a non-traditional form of marriage, in the Kurdistan region. The goal of the 135-day data collection period was to gather comments from specific public pages on these social media platforms. there are two columns in the dataset: sentiments and comments. The sentiments column classifies each comment into one of eight sentiment labels: Positive, Negative, Neutral, Sarcastic or Humorous, Suggestive, Dismissive, Skeptical, and Curious. The comments column contains the text of the comments in Central Kurdish. To improve the quality and uniformity of the data, a great deal of preprocessing was done to address problems like noise removal, character replacement, and space adjustments.
Researchers interested in sentiment analysis, social media studies, Islamic studies, and Kurdish cultural practices will find the dataset to be a useful resource. It can be used for sentiment analysis, trend analysis, linguistic studies, and other analyses. It provides insights into the public discourse surrounding Misyar marriage. The labeled data can aid in the creation of machine learning models and further our knowledge of societal perceptions of emerging religious trends.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
库尔德社交媒体情感语料库:Misyar 婚姻观点
本文全面汇编了 5108 条来自 YouTube 和 Facebook 的库尔德中部评论。汇编该数据集的目的是调查库尔德斯坦地区公众对 Misyar 婚姻(一种非传统形式的婚姻)的看法。135 天数据收集期的目标是从这些社交媒体平台的特定公共页面上收集评论。数据集中有两列:情感和评论。情绪列将每条评论分为八种情绪标签之一:积极、消极、中性、讽刺或幽默、暗示、轻蔑、怀疑和好奇。评论栏包含中库尔德语的评论文本。为了提高数据的质量和统一性,对数据进行了大量预处理,以解决噪音去除、字符替换和空间调整等问题。对情感分析、社交媒体研究、伊斯兰研究和库尔德文化习俗感兴趣的研究人员会发现该数据集是一个有用的资源。它可用于情感分析、趋势分析、语言研究和其他分析。它提供了对围绕 Misyar 婚姻的公共讨论的见解。标注的数据有助于创建机器学习模型,进一步了解社会对新兴宗教趋势的看法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Data in Brief
Data in Brief MULTIDISCIPLINARY SCIENCES-
CiteScore
3.10
自引率
0.00%
发文量
996
审稿时长
70 days
期刊介绍: Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.
期刊最新文献
Dataset of dendrometer and environmental parameter measurements of two different species of the group of genera known as eucalypts in South Africa and Portugal Bulk mRNA-sequencing data of the estrogen and androgen responses in the human prostate cancer cell line VCaP A refined spirometry dataset for comparing segmented (piecewise) linear models to that of GAMLSS Shotgun metagenomics sequencing data of root microbial community of Huanglongbing-infected Citrus nobilis BEEHIVE: A dataset of Apis mellifera images to empower honeybee monitoring research
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1