{"title":"Clowns in the Visual Artists:","authors":"Michael Mandiberg, Danara Sarıoğlu","doi":"10.1086/719999","DOIUrl":null,"url":null,"abstract":"This case study explores the challenges of defining a data set to analyze changes in Wikipedia’s gender gap for articles about visual art. Wikipedia and Wikidata each group content in data structures that produce overlapping and incomplete visual art datasets and include non-visual art data. To circumvent potentially biased editorial decisions about what to include or exclude, this case study describes the process of using a topic model algorithm that identifies a dataset by analyzing the words in each article and grouping the articles into topics.","PeriodicalId":43009,"journal":{"name":"Art Documentation","volume":"241 1","pages":"19 - 37"},"PeriodicalIF":0.2000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Art Documentation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1086/719999","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ART","Score":null,"Total":0}
引用次数: 0
Abstract
This case study explores the challenges of defining a data set to analyze changes in Wikipedia’s gender gap for articles about visual art. Wikipedia and Wikidata each group content in data structures that produce overlapping and incomplete visual art datasets and include non-visual art data. To circumvent potentially biased editorial decisions about what to include or exclude, this case study describes the process of using a topic model algorithm that identifies a dataset by analyzing the words in each article and grouping the articles into topics.