Anda Liang;Emerson Murphy-Hill;Westley Weimer;Yu Huang
{"title":"A Controlled Experiment in Age and Gender Bias When Reading Technical Articles in Software Engineering","authors":"Anda Liang;Emerson Murphy-Hill;Westley Weimer;Yu Huang","doi":"10.1109/TSE.2024.3437355","DOIUrl":null,"url":null,"abstract":"Online platforms and communities are a critical part of modern software engineering, yet are often affected by human biases. While previous studies investigated human biases and their potential harms against the efficiency and fairness of online communities, they have mainly focused on the open source and \n<italic>Q & A</i>\n platforms, such as \n<italic>GitHub</i>\n and \n<italic>Stack Overflow</i>\n, but overlooked the audience-focused online platforms for delivering programming and SE-related technical articles, where millions of software engineering practitioners share, seek for, and learn from high-quality software engineering articles (i.e., \n<italic>technical articles</i>\n for SE). Furthermore, most of the previous work has revealed gender and race bias, but we have little knowledge about the effect of age on software engineering practice. In this paper, we propose to investigate the effect of authors’ demographic information (gender and age) on the evaluation of technical articles on software engineering and potential behavioral differences among participants. We conducted a survey-based and controlled human study and collected responses from 540 participants to investigate developers’ evaluation of technical articles for software engineering. By controlling the gender and age of the author profiles of technical articles for SE, we found that raters tend to have more positive content depth evaluations for younger male authors when compared to older male authors and that male participants conduct technical article evaluations faster than female participants, consistent with prior study findings. Surprisingly, different from other software engineering evaluation activities (e.g., code review, pull request, etc.), we did not find a significant difference in the genders of authors on the evaluation outcome of technical articles in SE.","PeriodicalId":13324,"journal":{"name":"IEEE Transactions on Software Engineering","volume":"50 10","pages":"2498-2511"},"PeriodicalIF":6.5000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10623245","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Software Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10623245/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Online platforms and communities are a critical part of modern software engineering, yet are often affected by human biases. While previous studies investigated human biases and their potential harms against the efficiency and fairness of online communities, they have mainly focused on the open source and
Q & A
platforms, such as
GitHub
and
Stack Overflow
, but overlooked the audience-focused online platforms for delivering programming and SE-related technical articles, where millions of software engineering practitioners share, seek for, and learn from high-quality software engineering articles (i.e.,
technical articles
for SE). Furthermore, most of the previous work has revealed gender and race bias, but we have little knowledge about the effect of age on software engineering practice. In this paper, we propose to investigate the effect of authors’ demographic information (gender and age) on the evaluation of technical articles on software engineering and potential behavioral differences among participants. We conducted a survey-based and controlled human study and collected responses from 540 participants to investigate developers’ evaluation of technical articles for software engineering. By controlling the gender and age of the author profiles of technical articles for SE, we found that raters tend to have more positive content depth evaluations for younger male authors when compared to older male authors and that male participants conduct technical article evaluations faster than female participants, consistent with prior study findings. Surprisingly, different from other software engineering evaluation activities (e.g., code review, pull request, etc.), we did not find a significant difference in the genders of authors on the evaluation outcome of technical articles in SE.
在线平台和社区是现代软件工程的重要组成部分,但往往受到人为偏见的影响。虽然以往的研究调查了人为偏见及其对在线社区的效率和公平性的潜在危害,但这些研究主要集中在开源和 Q & A 平台,如 GitHub 和 Stack Overflow,却忽略了以受众为中心的在线平台,这些平台提供编程和 SE 相关的技术文章,数百万软件工程从业人员在这些平台上分享、寻求和学习高质量的软件工程文章(即 SE 技术文章)。此外,以前的工作大多揭示了性别和种族偏见,但我们对年龄对软件工程实践的影响知之甚少。在本文中,我们拟研究作者的人口统计学信息(性别和年龄)对软件工程技术文章评价的影响,以及参与者之间潜在的行为差异。我们开展了一项以调查为基础的对照人类研究,收集了 540 名参与者的回答,以调查开发人员对软件工程技术文章的评价。通过控制软件工程技术文章作者的性别和年龄,我们发现,与年长的男性作者相比,评分者倾向于对年轻男性作者的内容深度做出更积极的评价,而且男性参与者比女性参与者更快地进行技术文章评价,这与之前的研究结果一致。令人惊讶的是,与其他软件工程评估活动(如代码审查、拉取请求等)不同,我们没有发现作者性别在 SE 技术文章评估结果上的显著差异。
期刊介绍:
IEEE Transactions on Software Engineering seeks contributions comprising well-defined theoretical results and empirical studies with potential impacts on software construction, analysis, or management. The scope of this Transactions extends from fundamental mechanisms to the development of principles and their application in specific environments. Specific topic areas include:
a) Development and maintenance methods and models: Techniques and principles for specifying, designing, and implementing software systems, encompassing notations and process models.
b) Assessment methods: Software tests, validation, reliability models, test and diagnosis procedures, software redundancy, design for error control, and measurements and evaluation of process and product aspects.
c) Software project management: Productivity factors, cost models, schedule and organizational issues, and standards.
d) Tools and environments: Specific tools, integrated tool environments, associated architectures, databases, and parallel and distributed processing issues.
e) System issues: Hardware-software trade-offs.
f) State-of-the-art surveys: Syntheses and comprehensive reviews of the historical development within specific areas of interest.