Juan Cuadrado, Elizabeth Martinez, Juan Carlos Martinez-Santos, Edwin Puertas
The media framing dataset: Analyzing news narratives in Mexico and Colombia.
Data in Brief, vol. 58, article 111284. Published 2025-01-09. DOI: 10.1016/j.dib.2025.111284 (https://doi.org/10.1016/j.dib.2025.111284). PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11787578/pdf/
Citations: 0
Abstract
This paper introduces "The Media Framing Dataset," built from news articles published by 140 local newspapers in Mexico and Colombia and covering events from May 2022 to August 2023. The dataset spans a broad range of topics, including politics, immigration, public opinion, and crime. Articles were collected with a keyword-based search strategy designed to identify texts that illustrate news-framing dimensions such as Economics, Policy, and Morality, among others. The dataset was annotated with a combination of manual and automated techniques: articles were categorized by framing dimension using a structured framework developed in collaboration with experts in computational linguistics. The annotation was carried out by trained annotators from Mexico's Delfin program and is intended to ensure both precision and depth. "The Media Framing Dataset" is a resource for NLP research with high potential for reuse. It is particularly suited to analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. It also provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop tools for media analysis.
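The keyword-based search described in the abstract can be pictured with a minimal sketch. The dimension names (Economics, Policy, Morality) come from the abstract; the Spanish keywords, the helper names `compile_patterns` and `candidate_frames`, and the example sentence are hypothetical and are not the authors' actual lists or pipeline.

```python
import re

# Hypothetical keyword lists. The dimension names appear in the abstract,
# but these keywords are illustrative assumptions, not the authors' lists.
FRAME_KEYWORDS = {
    "Economics": ["economía", "inflación", "presupuesto"],
    "Policy": ["política pública", "reforma", "decreto"],
    "Morality": ["moral", "ética", "valores"],
}


def compile_patterns(frame_keywords):
    """Build one case-insensitive, whole-word regex per framing dimension."""
    patterns = {}
    for frame, words in frame_keywords.items():
        alternation = "|".join(re.escape(word) for word in words)
        patterns[frame] = re.compile(rf"\b(?:{alternation})\b", re.IGNORECASE)
    return patterns


def candidate_frames(text, patterns):
    """Return the sorted framing dimensions whose keywords occur in the text."""
    return sorted(frame for frame, pattern in patterns.items() if pattern.search(text))


PATTERNS = compile_patterns(FRAME_KEYWORDS)
example = "La reforma busca frenar la inflación."
print(candidate_frames(example, PATTERNS))
```

A filter like this only proposes candidate dimensions for an article; in the workflow the abstract describes, the final labels come from the manual and automated annotation steps.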
About the journal:
Data in Brief provides a way for researchers to easily share and reuse each other's datasets by publishing data articles that:
- Thoroughly describe your data, facilitating reproducibility.
- Make your data, which is often buried in supplementary material, easier to find.
- Increase traffic towards associated research articles and data, leading to more citations.
- Open up doors for new collaborations.
Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.