Four best practices for measuring news sentiment using ‘off-the-shelf’ dictionaries: a large-scale p-hacking experiment

Computational Communication Research. Pub Date: 2020-10-07. DOI: 10.31235/osf.io/np5wa

Chung-hong Chan, Joseph W. Bajjalieh, L. Auvil, Hartmut Wessler, Scott L. Althaus, Kasper Welbers, Wouter van Atteveldt, Marc Jungblut


Citations: 18

Abstract

We examined the validity of 37 sentiment scores based on dictionary-based methods using a large news corpus and demonstrated the risk of generating a spectrum of results with different levels of statistical significance by presenting an analysis of relationships between news sentiment and U.S. presidential approval. We summarize our findings into four best practices: 1) use a suitable sentiment dictionary; 2) do not assume that the validity and reliability of the dictionary is ‘built-in’; 3) check for the influence of content length; and 4) do not use multiple dictionaries to test the same statistical hypothesis.
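
The third practice, checking for the influence of content length, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy lexicon, the function names, and the sample sentences are hypothetical and are not taken from the paper or from any of the dictionaries it evaluated.

```python
import re

# Hypothetical toy lexicon for illustration only; it is NOT one of the
# dictionaries evaluated in the paper.
TOY_LEXICON = {"good": 1, "gain": 1, "strong": 1, "bad": -1, "loss": -1, "weak": -1}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def raw_score(text, lexicon=TOY_LEXICON):
    """Sum of lexicon weights over all tokens; grows with text length."""
    return sum(lexicon.get(token, 0) for token in tokenize(text))


def normalized_score(text, lexicon=TOY_LEXICON):
    """Raw score divided by the token count, so length alone does not move it."""
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    return raw_score(text, lexicon) / len(tokens)


short_text = "Strong gain for the economy"
# The same sentence repeated four times: identical tone, four times the length.
long_text = " ".join([short_text] * 4)

print(raw_score(short_text), raw_score(long_text))
print(normalized_score(short_text), normalized_score(long_text))
```

In this toy case the raw count differs between the two texts only because one is longer, while the per-token score is the same for both. Comparing the two kinds of score on real data is one simple way to see whether content length is driving a result.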

