首页 > 最新文献

International journal of digital humanities最新文献

英文 中文
From form to sound 自形至聲: visual and aural representations of premodern Chinese phonology and phonorhetoric with applications for phonetic scripts 从形到声:前现代汉语音韵和音韵的视觉和听觉表现及其对音标的应用
Pub Date : 2022-11-21 DOI: 10.1007/s42803-022-00053-8
Jeffrey R. Tharsen
{"title":"From form to sound 自形至聲: visual and aural representations of premodern Chinese phonology and phonorhetoric with applications for phonetic scripts","authors":"Jeffrey R. Tharsen","doi":"10.1007/s42803-022-00053-8","DOIUrl":"https://doi.org/10.1007/s42803-022-00053-8","url":null,"abstract":"","PeriodicalId":91018,"journal":{"name":"International journal of digital humanities","volume":"25 1","pages":"115-129"},"PeriodicalIF":0.0,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73021365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining sentiment analysis classifiers to explore multilingual news articles covering London 2012 and Rio 2016 Olympics. 结合情感分析分类器,探索报道 2012 年伦敦奥运会和 2016 年里约奥运会的多语言新闻文章。
Pub Date : 2022-11-16 DOI: 10.1007/s42803-022-00052-9
Caio Mello, Gullal S Cheema, Gaurish Thakkar

This study aims to present an approach for the challenges of working with Sentiment Analysis (SA) applied to news articles in a multilingual corpus. It looks at the use and combination of multiple algorithms to explore news articles published in English and Portuguese. It presents a methodology that starts by evaluating and combining four SA algorithms (SenticNet, SentiStrength, Vader and BERT, being BERT trained in two datasets) to improve the quality of outputs. A thorough review of the algorithms' limitations is conducted using SHAP, an explainable AI tool, resulting in a list of issues that researchers must consider before using SA to interpret texts. We propose a combination of the three best classifiers (Vader, Amazon BERT and Sent140 BERT) to identify contradictory results, improving the quality of the positive, neutral and negative labels assigned to the texts. Challenges with translation are addressed, indicating possible solutions for non-English corpora. As a case study, the method is applied to the study of the media coverage of London 2012 and Rio 2016 Olympic legacies. The combination of different classifiers has proved to be efficient, revealing the unbalance between the media coverage of London 2012, much more positive, and Rio 2016, more negative.

本研究旨在提出一种方法,以应对将情感分析(SA)应用于多语言语料库中的新闻文章所带来的挑战。它着眼于使用和组合多种算法来探索用英语和葡萄牙语发表的新闻文章。它提出了一种方法,首先评估和组合四种情感分析算法(SenticNet、SentiStrength、Vader 和 BERT,其中 BERT 在两个数据集中进行了训练),以提高输出的质量。我们使用 SHAP(一种可解释的人工智能工具)对算法的局限性进行了全面审查,得出了研究人员在使用 SA 解释文本之前必须考虑的一系列问题。我们建议将三种最佳分类器(Vader、Amazon BERT 和 Sent140 BERT)结合起来,以识别相互矛盾的结果,从而提高分配给文本的正面、中性和负面标签的质量。该方法解决了翻译方面的难题,并指出了非英语语料库的可能解决方案。作为案例研究,该方法被应用于 2012 年伦敦奥运会和 2016 年里约奥运会遗产的媒体报道研究。事实证明,不同分类器的组合是有效的,揭示了 2012 年伦敦奥运会媒体报道(正面报道较多)和 2016 年里约奥运会媒体报道(负面报道较多)之间的不平衡。
{"title":"Combining sentiment analysis classifiers to explore multilingual news articles covering London 2012 and Rio 2016 Olympics.","authors":"Caio Mello, Gullal S Cheema, Gaurish Thakkar","doi":"10.1007/s42803-022-00052-9","DOIUrl":"10.1007/s42803-022-00052-9","url":null,"abstract":"<p><p>This study aims to present an approach for the challenges of working with Sentiment Analysis (SA) applied to news articles in a multilingual corpus. It looks at the use and combination of multiple algorithms to explore news articles published in English and Portuguese. It presents a methodology that starts by evaluating and combining four SA algorithms (SenticNet, SentiStrength, Vader and BERT, being BERT trained in two datasets) to improve the quality of outputs. A thorough review of the algorithms' limitations is conducted using SHAP, an explainable AI tool, resulting in a list of issues that researchers must consider before using SA to interpret texts. We propose a combination of the three best classifiers (Vader, Amazon BERT and Sent140 BERT) to identify contradictory results, improving the quality of the positive, neutral and negative labels assigned to the texts. Challenges with translation are addressed, indicating possible solutions for non-English corpora. As a case study, the method is applied to the study of the media coverage of London 2012 and Rio 2016 Olympic legacies. The combination of different classifiers has proved to be efficient, revealing the unbalance between the media coverage of London 2012, much more positive, and Rio 2016, more negative.</p>","PeriodicalId":91018,"journal":{"name":"International journal of digital humanities","volume":" ","pages":"1-27"},"PeriodicalIF":0.0,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9667437/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40504197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Civil service examination records and political independence in the autonomous northeastern region during the second half of the Tang dynasty (755–907 C.E.) 唐下半叶(755-907年)东北自治区科举记录与政治独立
Pub Date : 2022-10-10 DOI: 10.1007/s42803-022-00054-7
Wenyi Shang, T. Underwood
{"title":"Civil service examination records and political independence in the autonomous northeastern region during the second half of the Tang dynasty (755–907 C.E.)","authors":"Wenyi Shang, T. Underwood","doi":"10.1007/s42803-022-00054-7","DOIUrl":"https://doi.org/10.1007/s42803-022-00054-7","url":null,"abstract":"","PeriodicalId":91018,"journal":{"name":"International journal of digital humanities","volume":"32 3 1","pages":"41-59"},"PeriodicalIF":0.0,"publicationDate":"2022-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77225769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CLG Authorship Analytics: a library for authorship verification CLG作者身份分析:一个用于作者身份验证的库
Pub Date : 2022-10-10 DOI: 10.1007/s42803-022-00051-w
Erwan Moreau, Carl Vogel
{"title":"CLG Authorship Analytics: a library for authorship verification","authors":"Erwan Moreau, Carl Vogel","doi":"10.1007/s42803-022-00051-w","DOIUrl":"https://doi.org/10.1007/s42803-022-00051-w","url":null,"abstract":"","PeriodicalId":91018,"journal":{"name":"International journal of digital humanities","volume":"44 1","pages":"5 - 27"},"PeriodicalIF":0.0,"publicationDate":"2022-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77354375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Developing a sentence level fairness metric using word embeddings. 利用词嵌入开发句子级公平度量。
Pub Date : 2022-10-10 DOI: 10.1007/s42803-022-00049-4
Ahmed Izzidien, Stephen Fitz, Peter Romero, Bao S Loe, David Stillwell

Fairness is a principal social value that is observable in civilisations around the world. Yet, a fairness metric for digital texts that describe even a simple social interaction, e.g., 'The boy hurt the girl' has not been developed. We address this by employing word embeddings that use factors found in a new social psychology literature review on the topic. We use these factors to build fairness vectors. These vectors are used as sentence level measures, whereby each dimension reflects a fairness component. The approach is employed to approximate human perceptions of fairness. The method leverages a pro-social bias within word embeddings, for which we obtain an F1 = 79.8 on a list of sentences using the Universal Sentence Encoder (USE). A second approach, using principal component analysis (PCA) and machine learning (ML), produces an F1 = 86.2. Repeating these tests using Sentence Bidirectional Encoder Representations from Transformers (SBERT) produces an F1 = 96.9 and F1 = 100 respectively. Improvements using subspace representations are further suggested. By proposing a first-principles approach, the paper contributes to the analysis of digital texts along an ethical dimension.

公平是一种主要的社会价值观,在世界各地的文明中都可以看到。然而,即使是描述简单社会互动(如 "男孩伤害了女孩")的数字文本,也尚未开发出公平度量标准。为了解决这个问题,我们采用了词嵌入技术,使用了在有关该主题的最新社会心理学文献综述中发现的因素。我们利用这些因素来构建公平性向量。这些向量被用作句子级别的衡量标准,其中每个维度都反映了公平性的组成部分。该方法可用于近似人类对公平性的感知。该方法利用了词嵌入中的亲社会偏差,我们在使用通用句子编码器(USE)的句子列表中获得了 F1 = 79.8 的结果。第二种方法使用主成分分析(PCA)和机器学习(ML),得出的 F1 = 86.2。使用来自变换器的句子双向编码器表示法(SBERT)重复这些测试,F1 = 96.9,F1 = 100。我们还提出了使用子空间表示法进行改进的建议。通过提出第一原理方法,本文为从伦理维度分析数字文本做出了贡献。
{"title":"Developing a sentence level fairness metric using word embeddings.","authors":"Ahmed Izzidien, Stephen Fitz, Peter Romero, Bao S Loe, David Stillwell","doi":"10.1007/s42803-022-00049-4","DOIUrl":"10.1007/s42803-022-00049-4","url":null,"abstract":"<p><p>Fairness is a principal social value that is observable in civilisations around the world. Yet, a fairness metric for digital texts that describe even a simple social interaction, e.g., 'The boy hurt the girl' has not been developed. We address this by employing word embeddings that use factors found in a new social psychology literature review on the topic. We use these factors to build fairness vectors. These vectors are used as sentence level measures, whereby each dimension reflects a fairness component. The approach is employed to approximate human perceptions of fairness. The method leverages a pro-social bias within word embeddings, for which we obtain an F1 = 79.8 on a list of sentences using the Universal Sentence Encoder (USE). A second approach, using principal component analysis (PCA) and machine learning (ML), produces an F1 = 86.2. Repeating these tests using Sentence Bidirectional Encoder Representations from Transformers (SBERT) produces an F1 = 96.9 and F1 = 100 respectively. Improvements using subspace representations are further suggested. By proposing a first-principles approach, the paper contributes to the analysis of digital texts along an ethical dimension.</p>","PeriodicalId":91018,"journal":{"name":"International journal of digital humanities","volume":" ","pages":"1-36"},"PeriodicalIF":0.0,"publicationDate":"2022-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9549858/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33544721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Treating a genre as a database: a digital research methodology for studying Chinese local gazetteers 以体裁为数据库:中文地方志研究的数字化研究方法
Pub Date : 2022-10-06 DOI: 10.1007/s42803-022-00048-5
Shih-Pei Chen, Calvin Yeh, Sean Wang, Qun Che
{"title":"Treating a genre as a database: a digital research methodology for studying Chinese local gazetteers","authors":"Shih-Pei Chen, Calvin Yeh, Sean Wang, Qun Che","doi":"10.1007/s42803-022-00048-5","DOIUrl":"https://doi.org/10.1007/s42803-022-00048-5","url":null,"abstract":"","PeriodicalId":91018,"journal":{"name":"International journal of digital humanities","volume":"40 1","pages":"171-193"},"PeriodicalIF":0.0,"publicationDate":"2022-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76216058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Song authorship attribution: a lyrics and rhyme based approach 歌曲作者归属:基于歌词和韵脚的方法
Pub Date : 2022-09-21 DOI: 10.1007/s42803-022-00050-x
Tunç Yılmaz, Tatjana Scheffler
{"title":"Song authorship attribution: a lyrics and rhyme based approach","authors":"Tunç Yılmaz, Tatjana Scheffler","doi":"10.1007/s42803-022-00050-x","DOIUrl":"https://doi.org/10.1007/s42803-022-00050-x","url":null,"abstract":"","PeriodicalId":91018,"journal":{"name":"International journal of digital humanities","volume":"83 1","pages":"29 - 44"},"PeriodicalIF":0.0,"publicationDate":"2022-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75831349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Computational authorship analysis of the homeric poems 荷马诗歌作者身份的计算分析
Pub Date : 2022-07-12 DOI: 10.1007/s42803-022-00046-7
John Pavlopoulos, M. Konstantinidou
{"title":"Computational authorship analysis of the homeric poems","authors":"John Pavlopoulos, M. Konstantinidou","doi":"10.1007/s42803-022-00046-7","DOIUrl":"https://doi.org/10.1007/s42803-022-00046-7","url":null,"abstract":"","PeriodicalId":91018,"journal":{"name":"International journal of digital humanities","volume":"25 1","pages":"45 - 64"},"PeriodicalIF":0.0,"publicationDate":"2022-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84986511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On uncertain ground: lost landscapes, digital mediation, and site-based research at early Qing Chengde 在不确定的基础上:清初承德的失落景观、数字中介与遗址研究
Pub Date : 2022-07-12 DOI: 10.1007/s42803-022-00047-6
Stephen H. Whiteman
{"title":"On uncertain ground: lost landscapes, digital mediation, and site-based research at early Qing Chengde","authors":"Stephen H. Whiteman","doi":"10.1007/s42803-022-00047-6","DOIUrl":"https://doi.org/10.1007/s42803-022-00047-6","url":null,"abstract":"","PeriodicalId":91018,"journal":{"name":"International journal of digital humanities","volume":"20 1","pages":"1-35"},"PeriodicalIF":0.0,"publicationDate":"2022-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90048099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Web-history: designing a course, shaping the discipline 网络历史:设计课程,塑造学科
Pub Date : 2022-05-30 DOI: 10.1007/s42803-022-00045-8
N. Povroznik
{"title":"Web-history: designing a course, shaping the discipline","authors":"N. Povroznik","doi":"10.1007/s42803-022-00045-8","DOIUrl":"https://doi.org/10.1007/s42803-022-00045-8","url":null,"abstract":"","PeriodicalId":91018,"journal":{"name":"International journal of digital humanities","volume":"39 1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79880714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International journal of digital humanities
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1