Proceedings of the 27th International Conference on Evaluation and Assessment in Software Engineering: Latest Papers

Implementing AI Ethics: Making Sense of the Ethical Requirements
M. Agbese, Rahul Mohanani, A. Khan, P. Abrahamsson
Society’s increasing dependence on Artificial Intelligence (AI) and AI-enabled systems requires a more practical approach from software engineering (SE) executives in middle and higher-level management to improve their involvement in implementing AI ethics by making ethical requirements part of their management practices. However, research indicates that most work on implementing ethical requirements in SE management primarily focuses on technical development, with scarce findings for middle and higher-level management. We investigate this by interviewing ten Finnish SE executives in middle and higher-level management to examine how they consider and implement ethical requirements. We use ethical requirements from the European Union (EU) Ethics Guidelines for Trustworthy AI as our reference for ethical requirements and an Agile portfolio management framework to analyze implementation. Our findings reveal a general consideration of privacy and data governance ethical requirements as legal requirements, with no other consideration for ethical requirements identified. The findings also show practicable consideration of ethical requirements as technical robustness and safety for implementation as risk requirements and societal and environmental well-being for implementation as sustainability requirements. We examine a practical approach to implementing ethical requirements using the ethical risk requirements stack employing the Agile portfolio management framework.
DOI: https://doi.org/10.1145/3593434.3593453 | Published: 2023-06-11
Citations: 0
Analyzing Maintenance Activities of Software Libraries
Alexandros Tsakpinis
Industrial applications heavily integrate open-source software libraries nowadays. Beyond the benefits that libraries bring, they can also pose a real threat if a library is affected by a vulnerability but its community is not active in creating a fixing release. Therefore, I want to introduce an automatic monitoring approach for industrial applications to identify open-source dependencies that show negative signs regarding their current or future maintenance activities. Since most research in this field is limited by a lack of features, labels, and transitive links, and thus is not applicable in industry, my approach aims to close this gap by capturing the impact of direct and transitive dependencies in terms of their maintenance activities. Automatically monitoring the maintenance activities of dependencies reduces the manual effort of application maintainers and supports application security by continuously having well-maintained dependencies.
DOI: https://doi.org/10.1145/3593434.3593474 | Published: 2023-06-09
Citations: 0
A Rapid Review of Responsible AI frameworks: How to guide the development of ethical AI
Vita Santa Barletta, D. Caivano, Domenico Gigante, A. Ragone
In recent years, the rise of Artificial Intelligence (AI), and its pervasiveness in our lives, has sparked a flourishing debate about the ethical principles that should lead its implementation and use in society. Driven by these concerns, we conduct a rapid review of several frameworks providing principles, guidelines, and/or tools to help practitioners in the development and deployment of Responsible AI (RAI) applications. We map each framework to the different Software Development Life Cycle (SDLC) phases, discovering that most of these frameworks fall just in the Requirements Elicitation phase, leaving the other phases uncovered. Very few of these frameworks offer supporting tools for practitioners, and they are mainly provided by private companies. Our results reveal that there is not a "catch-all" framework supporting both technical and non-technical stakeholders in the implementation of real-world projects. Our findings highlight the lack of a comprehensive framework encompassing all RAI principles and all SDLC phases that could be navigated by users with different skill sets and with different goals.
DOI: https://doi.org/10.1145/3593434.3593478 | Published: 2023-06-08
Citations: 0
Improving the Reporting of Threats to Construct Validity
Dag I.K. Sjøberg, Gunnar R. Bergersen
Background: Construct validity concerns the use of indicators to measure a concept that is not directly measurable. Aim: This study intends to identify, categorize, assess and quantify discussions of threats to construct validity in empirical software engineering literature and use the findings to suggest ways to improve the reporting of construct validity issues. Method: We analyzed 83 articles that report human-centric experiments published in five top-tier software engineering journals from 2015 to 2019. The articles’ text concerning threats to construct validity was divided into segments (the unit of analysis) based on predefined categories. The segments were then evaluated regarding whether they clearly discussed a threat and a construct. Results: Three-fifths of the segments were associated with topics not related to construct validity. Two-thirds of the articles discussed construct validity without using the definition of construct validity given in the article. The threats were clearly described in more than four-fifths of the segments, but the construct in question was clearly described in only two-thirds of the segments. The construct was unclear when the discussion was not related to construct validity but to other types of validity. Conclusions: The results show potential for improving the understanding of construct validity in software engineering. Recommendations addressing the identified weaknesses are given to improve the awareness and reporting of construct validity.
DOI: https://doi.org/10.1145/3593434.3593449 | Published: 2023-06-08
Citations: 1
A Systematic Literature Review on Client Selection in Federated Learning
Carl Smestad, Jingyue Li
With rising concerns about privacy in machine learning, federated learning (FL) was invented in 2017, in which the clients, such as mobile devices, train a model and send the update to the centralized server. Choosing clients randomly for FL can harm learning performance for various reasons. Many studies have proposed approaches to address the challenges of client selection in FL. However, no systematic literature review (SLR) on this topic existed. This SLR investigates the state of the art of client selection in FL and identifies the challenges, the solutions, and the metrics used to evaluate the solutions. We systematically reviewed 47 primary studies. The main challenges found in client selection are heterogeneity, resource allocation, communication costs, and fairness. The client selection schemes aim to improve the original random selection algorithm by focusing on one or several of the aforementioned challenges. The most common metric used is testing accuracy versus communication rounds: testing accuracy measures the success of the learning, which should preferably be achieved in as few communication rounds as possible because rounds are very expensive. Although several possible improvements can be made with the current state of client selection, the most beneficial ones are evaluating the impact of unsuccessful clients and gaining a more theoretical understanding of the impact of fairness in FL.
DOI: https://doi.org/10.1145/3593434.3593438 | Published: 2023-06-08
Citations: 0
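As a rough illustration of the setting reviewed in the entry above, the sketch below shows the random client selection baseline followed by a sample-weighted aggregation step. It is a minimal sketch, not code from the reviewed studies: the function names `select_clients_random` and `weighted_average`, the `fraction` parameter, and the plain-list model updates are all assumptions made for illustration.

```python
import random
from typing import Dict, List, Sequence


def select_clients_random(
    client_ids: Sequence[str], fraction: float, rng: random.Random
) -> List[str]:
    """Baseline policy: sample a fixed fraction of clients uniformly at random."""
    k = max(1, int(len(client_ids) * fraction))
    return rng.sample(list(client_ids), k)


def weighted_average(
    updates: Dict[str, List[float]], num_samples: Dict[str, int]
) -> List[float]:
    """Aggregate client updates, weighting each by its local sample count."""
    total = sum(num_samples[c] for c in updates)
    dim = len(next(iter(updates.values())))
    return [
        sum(updates[c][i] * num_samples[c] for c in updates) / total
        for i in range(dim)
    ]


if __name__ == "__main__":
    rng = random.Random(0)
    clients = [f"client-{i}" for i in range(10)]
    print(select_clients_random(clients, 0.5, rng))
    print(weighted_average({"a": [1.0, 0.0], "b": [3.0, 2.0]}, {"a": 1, "b": 3}))
```

A selection scheme of the kind surveyed in the review would replace `select_clients_random` with a policy that takes heterogeneity, resources, communication cost, or fairness into account, while the aggregation step could stay the same.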
HornFuzz: Fuzzing CHC solvers
Anzhela Sukhanova, Valentyn Sobol
Many advanced program analysis and verification methods are based on solving systems of Constrained Horn Clauses (CHC). Testing CHC solvers is very important, as correctness of their work determines whether bugs in the analyzed programs are detected or missed. One of the well-established and efficient methods of automated software testing is fuzzing: analyzing the reactions of programs to random input data. Currently, there are no fuzzers for CHC solvers, and fuzzers for SMT solvers are not efficient in CHC solver testing, since they do not consider CHC specifics. In this paper, we present HornFuzz, a mutation-based gray-box fuzzing technique for detecting bugs in CHC solvers based on the idea of metamorphic testing. We evaluated our fuzzer on one of the highest performing CHC solvers, Spacer, and found a handful of bugs in Spacer. In particular, some discovered problems are so serious that they require fixes with significant changes to the solver.
DOI: https://doi.org/10.1145/3593434.3593455 | Published: 2023-06-07
Citations: 0
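The metamorphic idea behind the entry above can be sketched with a toy example: apply mutations that should not change a solver's verdict and flag any mutant whose verdict differs from the seed's. This is only an illustration under stated assumptions. `toy_solver` is a brute-force propositional checker standing in for a real CHC solver, and the two mutations (clause reordering and clause duplication) are hypothetical examples, not the mutations HornFuzz applies.

```python
import itertools
import random
from typing import Callable, List, Tuple

# A clause is a tuple of non-zero ints; a negative int is a negated variable.
Clause = Tuple[int, ...]


def toy_solver(clauses: List[Clause]) -> bool:
    """Brute-force satisfiability check (stand-in for a real solver)."""
    variables = sorted({abs(lit) for clause in clauses for lit in clause})
    for bits in itertools.product([False, True], repeat=len(variables)):
        assignment = dict(zip(variables, bits))
        if all(
            any(assignment[abs(lit)] == (lit > 0) for lit in clause)
            for clause in clauses
        ):
            return True
    return False


def mutate(clauses: List[Clause], rng: random.Random) -> List[Clause]:
    """Apply one verdict-preserving mutation (hypothetical examples)."""
    mutated = list(clauses)
    if rng.random() < 0.5:
        rng.shuffle(mutated)                 # reorder the clauses
    else:
        mutated.append(rng.choice(mutated))  # duplicate an existing clause
    return mutated


def metamorphic_check(
    solver: Callable[[List[Clause]], bool],
    seed: List[Clause],
    rounds: int,
    rng: random.Random,
) -> List[List[Clause]]:
    """Return every mutant whose verdict differs from the seed's verdict."""
    expected = solver(seed)
    failures = []
    current = seed
    for _ in range(rounds):
        current = mutate(current, rng)
        if solver(current) != expected:
            failures.append(current)
    return failures
```

Because every mutation preserves satisfiability, any entry in the returned list points at a bug in the solver under test rather than at the input, which is what makes the approach usable without a reference oracle.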
Measuring User Experience of Adaptive User Interfaces using EEG: A Replication Study
Daniel Gaspar-Figueiredo, S. Abrahão, E. Insfrán, J. Vanderdonckt
Background: Adaptive user interfaces have the advantage of being able to dynamically change their aspect and/or behaviour depending on the characteristics of the context of use, i.e. to improve user experience. User experience is an important quality factor that has been primarily evaluated with classical measures (e.g. effectiveness, efficiency, satisfaction), but to a lesser extent with physiological measures, such as emotion recognition, skin response, or brain activity. Aim: In a previous exploratory experiment involving users with different profiles and a wide range of ages, we analysed user experience in terms of cognitive load, engagement, attraction and memorisation when employing twenty graphical adaptive menus through the use of an Electroencephalogram (EEG) device. The results indicated that there were statistically significant differences for these four variables. However, we considered that it was necessary to confirm or reject these findings using a more homogeneous group of users. Method: We conducted an operational internal replication study with 40 participants. We also investigated the potential correlation between EEG signals and the participants’ user experience ratings, such as their preferences. Results: The results of this experiment confirm that there are statistically significant differences between the EEG variables when the participants interact with the different adaptive menus. Moreover, there is a high correlation among the participants’ user experience ratings and the EEG signals, and a trend regarding performance has emerged from our analysis. Conclusions: These findings suggest that EEG signals could be used to evaluate user experience. With regard to the menus studied, our results suggest that graphical menus with different structures and font types produce more differences in users’ brain responses, while menus which use colours produce more similarities in users’ brain responses. Several insights with which to improve users’ experience of graphical adaptive menus are outlined.
DOI: https://doi.org/10.1145/3593434.3593452 | Published: 2023-06-06
Citations: 0
Identifying Characteristics of the Agile Development Process That Impact User Satisfaction
Minshun Yang, Seiji Sato, H. Washizaki, Y. Fukazawa, Juichi Takahashi
The purpose of this study is to identify the characteristics of Agile development processes that impact user satisfaction. We used user reviews of OSS smartphone apps and various data from version control systems to examine the relationships, especially time-series correlations, between user satisfaction and development metrics that are expected to be related to user satisfaction. Although no metrics conclusively indicate improved user satisfaction, the motivation of the development team, the ability to set appropriate work units, the appropriateness of work rules, and the improvement of code maintainability should be considered, as they are correlated with improved user satisfaction. In contrast, changes in the release frequency and workload are not correlated.
DOI: https://doi.org/10.1145/3593434.3593470 | Published: 2023-06-06
Citations: 0
Barriers for Social Inclusion in Online Software Engineering Communities - A Study of Offensive Language Use in Gitter Projects
Bastin Tony Roy Savarimuthu, Zoofishan Zareen, Jithin Cheriyan, Muhammad Yasir, M. Galster
Social inclusion is a fundamental feature of thriving societies. This paper first investigates barriers to social inclusion in online Software Engineering (SE) communities by identifying a set of 11 attributes and organising them as a taxonomy. Second, by applying the taxonomy to the language used in comments posted by members of 189 Gitter projects (more than 3 million comments in total), it presents evidence of the social exclusion problem, using a keyword-based search approach for this purpose. Third, it presents a framework for improving social inclusion in SE communities.
DOI: https://doi.org/10.1145/3593434.3593463 · Published: 2023-05-02
Citations: 0
Does Code Smell Frequency Have a Relationship with Fault-proneness?
Md Masudur Rahman, T. Ahammed, Md. Mahbubul Alam Joarder, K. Sakib
Fault-proneness is an indication of programming errors that decrease software quality and maintainability. Code smells, in contrast, are symptoms of potential design problems that affect fault-proneness. The literature has investigated the negative impact of code smells on fault-proneness. However, it is still unclear how the frequency of each code smell type affects fault-proneness. To address this research gap, we present an empirical study to identify whether the frequency of individual code smell types is related to fault-proneness. The results show that the Anti Singleton, Blob and Class Data Should Be Private smell types have a strong relationship with fault-proneness even though their frequencies are not very high. On the other hand, comparatively frequent code smell types such as Complex Class, Large Class and Long Parameter List have a moderate relationship with fault-proneness. These findings will help developers prioritize and refactor code smells to improve software quality.
DOI: https://doi.org/10.1145/3593434.3593457 · Published: 2023-04-28
Citations: 0