The media framing dataset: Analyzing news narratives in Mexico and Colombia.

IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Data in Brief Pub Date : 2025-01-09 eCollection Date: 2025-02-01 DOI:10.1016/j.dib.2025.111284
Juan Cuadrado, Elizabeth Martinez, Juan Carlos Martinez-Santos, Edwin Puertas
{"title":"The media framing dataset: Analyzing news narratives in Mexico and Colombia.","authors":"Juan Cuadrado, Elizabeth Martinez, Juan Carlos Martinez-Santos, Edwin Puertas","doi":"10.1016/j.dib.2025.111284","DOIUrl":null,"url":null,"abstract":"<p><p>This paper introduces \"The Media Framing Dataset,\" a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, public opinion, and crime. The data collection involved a meticulous keyword-based search strategy designed to identify articles that illustrate various news-framing dimensions, such as Economics, Policy, Morality, and more. To construct this dataset, we employed a combination of manual and automated annotation techniques. Articles were categorized based on specific framing dimensions using a structured framework, developed in collaboration with experts in computational linguistics. The annotation process, conducted by trained annotators from Mexico's Delfin program, guarantees both precision and depth. \"The Media Framing Dataset\" serves as a valuable resource for NLP research with high potential for reuse. It is particularly suitable for analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. Additionally, it provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop innovative tools for media analysis.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"58 ","pages":"111284"},"PeriodicalIF":1.4000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11787578/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.dib.2025.111284","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

This paper introduces "The Media Framing Dataset," a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, public opinion, and crime. The data collection involved a meticulous keyword-based search strategy designed to identify articles that illustrate various news-framing dimensions, such as Economics, Policy, Morality, and more. To construct this dataset, we employed a combination of manual and automated annotation techniques. Articles were categorized based on specific framing dimensions using a structured framework, developed in collaboration with experts in computational linguistics. The annotation process, conducted by trained annotators from Mexico's Delfin program, guarantees both precision and depth. "The Media Framing Dataset" serves as a valuable resource for NLP research with high potential for reuse. It is particularly suitable for analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. Additionally, it provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop innovative tools for media analysis.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
媒体框架数据集:分析墨西哥和哥伦比亚的新闻叙述。
本文介绍了“媒体框架数据集”,该数据集是通过深入研究墨西哥和哥伦比亚140家地方报纸的新闻文章而开发的,涵盖了2022年5月至2023年8月的事件。我们的数据集涵盖了广泛的主题,包括政治、移民、公众舆论和犯罪。数据收集包括一种细致的基于关键字的搜索策略,旨在识别能够说明各种新闻框架维度的文章,如经济、政策、道德等。为了构建这个数据集,我们结合使用了手动和自动注释技术。文章根据特定的框架维度进行分类,使用与计算语言学专家合作开发的结构化框架。注释过程由来自墨西哥Delfin项目的训练有素的注释人员进行,保证了准确性和深度。“媒体框架数据集”是NLP研究的宝贵资源,具有很高的重用潜力。它特别适合分析媒体框架中的文化和语言细微差别,评估框架对公众感知的影响,并支持自动检测框架技术的模型的发展。此外,它还为语言分析和机器学习项目提供了基础,使研究人员和从业者能够探索媒体框架动态并开发媒体分析的创新工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Data in Brief
Data in Brief MULTIDISCIPLINARY SCIENCES-
CiteScore
3.10
自引率
0.00%
发文量
996
审稿时长
70 days
期刊介绍: Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.
期刊最新文献
Shotgun sequencing data and SSR mining data of aibika (Abelmoschus manihot, Malvaceae) Dataset on multiregional variations of Bangla language (BD-Dialect) InclusiveHAR: A smartphone-based dataset for human activity recognition across diverse physical abilities UzbekPOS: A multi-domain dataset for Uzbek part-of-speech tagging AvianAction101: A Dataset for the dancing behavior of rose-ringed parakeets(Psittacula krameri)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1