首页 > 最新文献

Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops最新文献

英文 中文
Automatic library categorization 图书馆自动分类
Camilo Velázquez-Rodríguez, Coen De Roover
Software ecosystems contain several types of artefacts such as libraries, documentation and source code files. Recent studies show that the Maven software ecosystem alone already contains over 2.8 million artefacts and over 70, 000 libraries. Given the size of the ecosystem, selecting a library represents a challenge to its users. The MVNRepository website offers a category-based search functionality as a solution. However, not all of the libraries have been categorised, which leads to incomplete search results. This work proposes an approach to the automatic categorisation of libraries through machine learning classifiers trained on class and method names. Our preliminary results show that the approach is accurate, suggesting that large-scale applications may be feasible.
软件生态系统包含几种类型的工件,例如库、文档和源代码文件。最近的研究表明,仅Maven软件生态系统就已经包含了超过280万个工件和超过7万个库。考虑到生态系统的规模,选择一个图书馆对它的用户来说是一个挑战。MVNRepository网站提供了一个基于类别的搜索功能作为解决方案。然而,并不是所有的图书馆都被分类了,这导致了不完整的搜索结果。这项工作提出了一种通过对类和方法名称进行训练的机器学习分类器对库进行自动分类的方法。我们的初步结果表明,该方法是准确的,表明大规模应用是可行的。
{"title":"Automatic library categorization","authors":"Camilo Velázquez-Rodríguez, Coen De Roover","doi":"10.1145/3387940.3392186","DOIUrl":"https://doi.org/10.1145/3387940.3392186","url":null,"abstract":"Software ecosystems contain several types of artefacts such as libraries, documentation and source code files. Recent studies show that the Maven software ecosystem alone already contains over 2.8 million artefacts and over 70, 000 libraries. Given the size of the ecosystem, selecting a library represents a challenge to its users. The MVNRepository website offers a category-based search functionality as a solution. However, not all of the libraries have been categorised, which leads to incomplete search results. This work proposes an approach to the automatic categorisation of libraries through machine learning classifiers trained on class and method names. Our preliminary results show that the approach is accurate, suggesting that large-scale applications may be feasible.","PeriodicalId":309659,"journal":{"name":"Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130951541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An Exploratory Field Study of Programmer Assistance-Seeking during Software Development 软件开发中寻求程序员帮助的探索性实地研究
Paige Rodeghero
Developers often face a dilemma: to seek assistance from a colleague or to expend effort to answer a question herself. On one hand, seeking help is fast and reliable. But on the other, seeking assistance can distract colleagues and reduce their productivity. In this paper, we report our preliminary findings of assistance-seeking from an observational study at a medium-sized software company. We found that developers have varying levels of spoken communication when seeking help. We believe this is correlated with their different years of experience working as developers, among other factors. We also found that many employees would avoid asking for help several times due to various work-related and reported personal reasons. This has driven us to explore a new, exciting research area discovering the complexities of developers seeking help. This paper is our first analysis of this kind, and we hope to receive the community's feedback before continued work.
开发人员经常面临两难境地:是寻求同事的帮助,还是自己努力回答问题。一方面,寻求帮助是快速和可靠的。但另一方面,寻求帮助可能会分散同事的注意力,降低他们的工作效率。在本文中,我们报告了我们对一家中型软件公司寻求帮助的初步研究结果。我们发现,在寻求帮助时,开发人员的口语交流水平各不相同。我们认为这与他们作为开发者的不同工作年限以及其他因素有关。我们还发现,很多员工会因为各种与工作相关和报告的个人原因而避免多次寻求帮助。这促使我们探索一个新的、令人兴奋的研究领域,发现开发人员寻求帮助的复杂性。这篇论文是我们第一次对这种类型的分析,我们希望在继续工作之前收到社区的反馈。
{"title":"An Exploratory Field Study of Programmer Assistance-Seeking during Software Development","authors":"Paige Rodeghero","doi":"10.1145/3387940.3392237","DOIUrl":"https://doi.org/10.1145/3387940.3392237","url":null,"abstract":"Developers often face a dilemma: to seek assistance from a colleague or to expend effort to answer a question herself. On one hand, seeking help is fast and reliable. But on the other, seeking assistance can distract colleagues and reduce their productivity. In this paper, we report our preliminary findings of assistance-seeking from an observational study at a medium-sized software company. We found that developers have varying levels of spoken communication when seeking help. We believe this is correlated with their different years of experience working as developers, among other factors. We also found that many employees would avoid asking for help several times due to various work-related and reported personal reasons. This has driven us to explore a new, exciting research area discovering the complexities of developers seeking help. This paper is our first analysis of this kind, and we hope to receive the community's feedback before continued work.","PeriodicalId":309659,"journal":{"name":"Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131440479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flake It 'Till You Make It: Using Automated Repair to Induce and Fix Latent Test Flakiness 剥落直到你完成它:使用自动修复诱导和修复潜在的测试剥落
Owain Parry, G. M. Kapfhammer, Michael C Hilton, Phil McMinn
Since flaky tests pass or fail nondeterministically, without any code changes, they are an unreliable indicator of program quality. Developers may quarantine or delete flaky tests because it is often too time consuming to repair them. Yet, since decommissioning too many tests may ultimately degrade a test suite's effectiveness, developers may eventually want to fix them, a process that is challenging because the nondeterminism may have been introduced previously. We contend that the best time to discover and repair a flaky test is when a developer first creates and best understands it. We refer to tests that are not currently flaky, but that could become so, as having latent flakiness. We further argue that efforts to expose and repair latent flakiness are valuable in ensuring the future-reliability of the test suite, and that the testing cost is greater if latent flakiness is left to manifest itself later. Using concrete examples from a real-world program, this paper posits that automated program repair techniques will prove useful for surfacing latent flakiness.
由于不稳定的测试不确定地通过或失败,没有任何代码更改,因此它们是程序质量的不可靠指示器。开发人员可能会隔离或删除不稳定的测试,因为修复它们通常太耗时。然而,由于停用太多的测试可能最终会降低测试套件的有效性,开发人员可能最终想要修复它们,这是一个具有挑战性的过程,因为不确定性可能已经在前面引入了。我们认为,发现和修复不可靠的测试的最佳时机是开发人员第一次创建并最好地理解它的时候。我们指的是目前没有片状的测试,但可能会变得如此,因为有潜在的片状。我们进一步论证,暴露和修复潜在缺陷的努力在确保测试套件的未来可靠性方面是有价值的,并且如果让潜在缺陷在以后表现出来,测试成本会更大。通过实际程序中的具体例子,本文假设自动程序修复技术将证明对潜在的片状物表面处理是有用的。
{"title":"Flake It 'Till You Make It: Using Automated Repair to Induce and Fix Latent Test Flakiness","authors":"Owain Parry, G. M. Kapfhammer, Michael C Hilton, Phil McMinn","doi":"10.1145/3387940.3392177","DOIUrl":"https://doi.org/10.1145/3387940.3392177","url":null,"abstract":"Since flaky tests pass or fail nondeterministically, without any code changes, they are an unreliable indicator of program quality. Developers may quarantine or delete flaky tests because it is often too time consuming to repair them. Yet, since decommissioning too many tests may ultimately degrade a test suite's effectiveness, developers may eventually want to fix them, a process that is challenging because the nondeterminism may have been introduced previously. We contend that the best time to discover and repair a flaky test is when a developer first creates and best understands it. We refer to tests that are not currently flaky, but that could become so, as having latent flakiness. We further argue that efforts to expose and repair latent flakiness are valuable in ensuring the future-reliability of the test suite, and that the testing cost is greater if latent flakiness is left to manifest itself later. Using concrete examples from a real-world program, this paper posits that automated program repair techniques will prove useful for surfacing latent flakiness.","PeriodicalId":309659,"journal":{"name":"Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops","volume":"187 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124745814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Investigation of Mutual-Influence among Blockchain Development Communities and Cryptocurrency Price Changes 区块链开发社区与加密货币价格变化的相互影响研究
Nicola Uras, Stefano Vacca, Giuseppe Destefanis
This paper aims to identify and model relationships between cryptocurrencies market price changes and topic discussion occurrences on social media. The considered cryptocurrencies are the two highest in value at the moment, Bitcoin and Ethereum. At the same time, topics were realized through a classification of the comments gained from the Reddit social media platform, implementing a Hawkes model. The results highlight that it is possible to identify some interactions among the considered features, and it appears that some topics are indicative of certain types of price movements. Specifically, the discussions concerning issues about government, trading and Ethereum cryptocurrency as an exchange currency, appear to affect Bitcoin and Ethereum prices negatively. The discussions of investment appear to be indicative of price rises, while the discussions related to new decentralized realities and technological applications is indicative of price falls.
本文旨在识别和建模加密货币市场价格变化与社交媒体上的话题讨论事件之间的关系。被考虑的加密货币是目前价值最高的两种货币,比特币和以太坊。同时,通过对Reddit社交媒体平台上获得的评论进行分类来实现主题,实现Hawkes模型。结果突出表明,有可能确定所考虑的特征之间的一些相互作用,并且似乎有些主题表明某些类型的价格变动。具体来说,关于政府、交易和以太坊加密货币作为交换货币的讨论似乎对比特币和以太坊的价格产生了负面影响。关于投资的讨论似乎表明价格上涨,而关于新的分散现实和技术应用的讨论则表明价格下跌。
{"title":"Investigation of Mutual-Influence among Blockchain Development Communities and Cryptocurrency Price Changes","authors":"Nicola Uras, Stefano Vacca, Giuseppe Destefanis","doi":"10.1145/3387940.3392245","DOIUrl":"https://doi.org/10.1145/3387940.3392245","url":null,"abstract":"This paper aims to identify and model relationships between cryptocurrencies market price changes and topic discussion occurrences on social media. The considered cryptocurrencies are the two highest in value at the moment, Bitcoin and Ethereum. At the same time, topics were realized through a classification of the comments gained from the Reddit social media platform, implementing a Hawkes model. The results highlight that it is possible to identify some interactions among the considered features, and it appears that some topics are indicative of certain types of price movements. Specifically, the discussions concerning issues about government, trading and Ethereum cryptocurrency as an exchange currency, appear to affect Bitcoin and Ethereum prices negatively. The discussions of investment appear to be indicative of price rises, while the discussions related to new decentralized realities and technological applications is indicative of price falls.","PeriodicalId":309659,"journal":{"name":"Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127705588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Predicting Stack Overflow Question Tags: A Multi-Class, Multi-Label Classification 预测堆栈溢出问题标签:一个多类别,多标签分类
E. M. Kavuk, Ayse Tosun Misirli
This work proposes to predict the tags assigned for the posts on Stack Overflow platform. The raw data was obtained from the stackexchange.com including more than 50K posts and their associated tags given by the users. The posts' questions and titles are pre-processed, and the sentences in the posts are further transformed into features via Latent Dirichlet Allocation. The problem is a multi-class and multi-label classification and hence, we propose 1) one-against-all models for 15 most popularly used tags, and 2) a combined multi-tag classifier for finding the top K tags for a single post. Three algorithms are used to train the one-against-all classifiers to decide to what extent a post belongs to a tag. The probabilities of each post belonging to a tag are then combined to give the results of the multi-tag classifier with the best performing algorithm. The performance is compared with a baseline approach (kNN). Our multi-tag classifier achieves 55% recall and 39% F1-score.
本研究提出预测Stack Overflow平台上文章的标签分配。原始数据是从stackexchange.com获得的,包括用户给出的5万多篇帖子及其相关标签。对帖子的问题和标题进行预处理,并通过Latent Dirichlet Allocation将帖子中的句子进一步转化为特征。这个问题是一个多类和多标签的分类,因此,我们提出1)针对15个最常用标签的一个对所有模型,以及2)一个组合的多标签分类器,用于为单个帖子找到前K个标签。三种算法被用来训练单对全分类器来决定文章在多大程度上属于一个标签。然后将每个帖子属于一个标签的概率结合起来,给出具有最佳性能算法的多标签分类器的结果。将性能与基线方法(kNN)进行比较。我们的多标签分类器达到了55%的召回率和39%的f1得分。
{"title":"Predicting Stack Overflow Question Tags: A Multi-Class, Multi-Label Classification","authors":"E. M. Kavuk, Ayse Tosun Misirli","doi":"10.1145/3387940.3391491","DOIUrl":"https://doi.org/10.1145/3387940.3391491","url":null,"abstract":"This work proposes to predict the tags assigned for the posts on Stack Overflow platform. The raw data was obtained from the stackexchange.com including more than 50K posts and their associated tags given by the users. The posts' questions and titles are pre-processed, and the sentences in the posts are further transformed into features via Latent Dirichlet Allocation. The problem is a multi-class and multi-label classification and hence, we propose 1) one-against-all models for 15 most popularly used tags, and 2) a combined multi-tag classifier for finding the top K tags for a single post. Three algorithms are used to train the one-against-all classifiers to decide to what extent a post belongs to a tag. The probabilities of each post belonging to a tag are then combined to give the results of the multi-tag classifier with the best performing algorithm. The performance is compared with a baseline approach (kNN). Our multi-tag classifier achieves 55% recall and 39% F1-score.","PeriodicalId":309659,"journal":{"name":"Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116867403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Modelling Knowledge about Software Processes using Provenance Graphs and its Application to Git-based Version Control Systems 用出处图建模软件过程知识及其在基于git的版本控制系统中的应用
A. Schreiber, C. D. Boer
Using the W3C PROV data model, we present a general provenance model for software development processes and---as an example---specialized models for git services, for which we generate provenance graphs. Provenance graphs are knowledge graphs, since they have defined semantics, and can be analyzed with graph algorithms or semantic reasoning to get insights into processes.
使用W3C PROV数据模型,我们为软件开发过程提供了一个通用的来源模型,作为一个例子,我们为git服务提供了专门的模型,我们为git服务生成了来源图。出处图是知识图,因为它们具有定义的语义,并且可以用图算法或语义推理进行分析,以深入了解过程。
{"title":"Modelling Knowledge about Software Processes using Provenance Graphs and its Application to Git-based Version Control Systems","authors":"A. Schreiber, C. D. Boer","doi":"10.1145/3387940.3392220","DOIUrl":"https://doi.org/10.1145/3387940.3392220","url":null,"abstract":"Using the W3C PROV data model, we present a general provenance model for software development processes and---as an example---specialized models for git services, for which we generate provenance graphs. Provenance graphs are knowledge graphs, since they have defined semantics, and can be analyzed with graph algorithms or semantic reasoning to get insights into processes.","PeriodicalId":309659,"journal":{"name":"Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130903791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Automatic Improvement of Machine Translation Using Mutamorphic Relation: Invited Talk Paper 利用变形关系自动改进机器翻译:特邀报告
Jie M. Zhang
This paper introduces Mutamorphic Relation for Machine Learning Testing. Mutamorphic Relation combines data mutation and metamorphic relations as test oracles for machine learning systems. These oracles can help achieve fully automatic testing as well as automatic repair of the machine learning models. The paper takes TransRepair as an example to show the effectiveness of Mutamorphic Relation in automatically testing and improving machine translators, TransRepair detects inconsistency bugs without access to human oracles. It then adopts probability-reference or cross-reference to post-process the translations, in a grey-box or black-box manner, to repair the inconsistencies. Manual inspection indicates that the translations repaired by TransRepair improve consistency in 87% of cases (degrading it in 2%), and that the repairs of have better translation acceptability in 27% of the cases (worse in 8%).
介绍了机器学习测试中的变形关系。Mutamorphic Relation结合了数据突变和变形关系作为机器学习系统的测试预言。这些预言机可以帮助实现全自动测试以及机器学习模型的自动修复。本文以TransRepair为例,说明了变形关系在自动测试和改进机器翻译中的有效性,TransRepair可以在不需要人工指令的情况下检测不一致的错误。然后采用概率参考或交叉参考对翻译进行后处理,以灰盒或黑盒的方式修复不一致性。人工检查表明,通过TransRepair修复的译文在87%的情况下提高了一致性(2%的情况下降低了一致性),并且修复的译文在27%的情况下具有更好的翻译可接受性(8%的情况更差)。
{"title":"Automatic Improvement of Machine Translation Using Mutamorphic Relation: Invited Talk Paper","authors":"Jie M. Zhang","doi":"10.1145/3387940.3391541","DOIUrl":"https://doi.org/10.1145/3387940.3391541","url":null,"abstract":"This paper introduces Mutamorphic Relation for Machine Learning Testing. Mutamorphic Relation combines data mutation and metamorphic relations as test oracles for machine learning systems. These oracles can help achieve fully automatic testing as well as automatic repair of the machine learning models. The paper takes TransRepair as an example to show the effectiveness of Mutamorphic Relation in automatically testing and improving machine translators, TransRepair detects inconsistency bugs without access to human oracles. It then adopts probability-reference or cross-reference to post-process the translations, in a grey-box or black-box manner, to repair the inconsistencies. Manual inspection indicates that the translations repaired by TransRepair improve consistency in 87% of cases (degrading it in 2%), and that the repairs of have better translation acceptability in 27% of the cases (worse in 8%).","PeriodicalId":309659,"journal":{"name":"Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133152349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Educating Project Stakeholders: A Preliminary Report 教育项目利益相关者:初步报告
Alex Dekhtyar, Bruno Carreiro da Silva, Karson Slocum
In college coursework, we take care to educate future professional software engineers on how software development process works. Computer Science and Software Engineering students across the globe study software process models, gather requirements, design, implement and test their software, work on software maintenance, learn to submit bug reports, build project roadmaps, construct UML diagrams, and deploy software. Yet, ever since the emergence of consumer-facing software, software development often is a collaboration between professional software engineers and multiple stakeholders whose education, professional expertise, and general experience lie outside of computing. We teach future software engineers how to develop software. Why don't we do the same with other future stakeholders? This paper is a description of a pilot Software Engineering Without Programming course developed and taught at our university for the first time in 2020. In this early stage report (the course is ongoing as of the submisison deadline, but will have been completed by the time of the workshop) we outline the need for the course, its learning objectives, its organization, and the expected results.
在大学的课程中,我们注意教育未来的专业软件工程师软件开发过程是如何工作的。计算机科学和软件工程专业的学生在全球范围内学习软件过程模型,收集需求,设计,实现和测试他们的软件,从事软件维护工作,学习提交错误报告,构建项目路线图,构建UML图,以及部署软件。然而,自从面向消费者的软件出现以来,软件开发通常是专业软件工程师和多个利益相关者之间的合作,这些利益相关者的教育、专业知识和一般经验都在计算之外。我们教未来的软件工程师如何开发软件。为什么我们不对其他未来的利益相关者做同样的事情呢?本文描述了2020年我校首次开发和教授的“无编程软件工程”试点课程。在这个早期阶段的报告中(课程在提交截止日期前仍在进行中,但将在研讨会期间完成),我们概述了课程的需求、学习目标、组织和预期结果。
{"title":"Educating Project Stakeholders: A Preliminary Report","authors":"Alex Dekhtyar, Bruno Carreiro da Silva, Karson Slocum","doi":"10.1145/3387940.3392164","DOIUrl":"https://doi.org/10.1145/3387940.3392164","url":null,"abstract":"In college coursework, we take care to educate future professional software engineers on how software development process works. Computer Science and Software Engineering students across the globe study software process models, gather requirements, design, implement and test their software, work on software maintenance, learn to submit bug reports, build project roadmaps, construct UML diagrams, and deploy software. Yet, ever since the emergence of consumer-facing software, software development often is a collaboration between professional software engineers and multiple stakeholders whose education, professional expertise, and general experience lie outside of computing. We teach future software engineers how to develop software. Why don't we do the same with other future stakeholders? This paper is a description of a pilot Software Engineering Without Programming course developed and taught at our university for the first time in 2020. In this early stage report (the course is ongoing as of the submisison deadline, but will have been completed by the time of the workshop) we outline the need for the course, its learning objectives, its organization, and the expected results.","PeriodicalId":309659,"journal":{"name":"Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133458833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Does Neuron Coverage Matter for Deep Reinforcement Learning?: A Preliminary Study 神经元覆盖对深度强化学习有影响吗?:初步研究
Miller Trujillo, M. Linares-Vásquez, Camilo Escobar-Velásquez, Ivana Dusparic, Nicolás Cardozo
Deep Learning (DL) is powerful family of algorithms used for a wide variety of problems and systems, including safety critical systems. As a consequence, analyzing, understanding, and testing DL models is attracting more practitioners and researchers with the purpose of implementing DL systems that are robust, reliable, efficient, and accurate. First software testing approaches for DL systems have focused on black-box testing, white-box testing, and test cases generation, in particular for deep neural networks (CNNs and RNNs). However, Deep Reinforcement Learning (DRL), which is a branch of DL extending reinforcement learning, is still out of the scope of research providing testing techniques for DL systems. In this paper, we present a first step towards testing of DRL systems. In particular, we investigate whether neuron coverage (a widely used metric for white-box testing of DNNs) could be used also for DRL systems, by analyzing coverage evolutionary patterns, and the correlation with RL rewards.
深度学习(DL)是一个强大的算法家族,用于解决各种各样的问题和系统,包括安全关键系统。因此,分析、理解和测试深度学习模型吸引了更多的从业者和研究人员,他们的目的是实现强大、可靠、高效和准确的深度学习系统。DL系统的第一个软件测试方法集中在黑盒测试、白盒测试和测试用例生成上,特别是深度神经网络(cnn和rnn)。然而,深度强化学习(DRL)作为深度学习扩展强化学习的一个分支,仍然不在为深度学习系统提供测试技术的研究范围之内。在本文中,我们提出了对DRL系统进行测试的第一步。特别是,我们通过分析神经元覆盖的进化模式以及与强化学习奖励的相关性,研究了神经元覆盖(一种广泛用于dnn白盒测试的指标)是否也可以用于DRL系统。
{"title":"Does Neuron Coverage Matter for Deep Reinforcement Learning?: A Preliminary Study","authors":"Miller Trujillo, M. Linares-Vásquez, Camilo Escobar-Velásquez, Ivana Dusparic, Nicolás Cardozo","doi":"10.1145/3387940.3391462","DOIUrl":"https://doi.org/10.1145/3387940.3391462","url":null,"abstract":"Deep Learning (DL) is powerful family of algorithms used for a wide variety of problems and systems, including safety critical systems. As a consequence, analyzing, understanding, and testing DL models is attracting more practitioners and researchers with the purpose of implementing DL systems that are robust, reliable, efficient, and accurate. First software testing approaches for DL systems have focused on black-box testing, white-box testing, and test cases generation, in particular for deep neural networks (CNNs and RNNs). However, Deep Reinforcement Learning (DRL), which is a branch of DL extending reinforcement learning, is still out of the scope of research providing testing techniques for DL systems. In this paper, we present a first step towards testing of DRL systems. In particular, we investigate whether neuron coverage (a widely used metric for white-box testing of DNNs) could be used also for DRL systems, by analyzing coverage evolutionary patterns, and the correlation with RL rewards.","PeriodicalId":309659,"journal":{"name":"Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops","volume":"238 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131612657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Sorry to Bother You Again: Developer Recommendation Choice Architectures for Designing Effective Bots 很抱歉再次打扰你:开发者推荐设计有效机器人的选择架构
Chris Brown, Chris Parnin
Software robots, or bots, are useful for automating a wide variety of programming and software development tasks. Despite the advantages of using bots throughout the software engineering process, research shows that developers often face challenges interacting with these systems. To improve automated developer recommendations from bots, this work introduces developer recommendation choice architectures. Choice architecture is a behavioral science concept that suggests the presentation of options impacts the decisions humans make. To evaluate the impact of framing recommendations for software engineers, we examine the impact of one choice architecture, actionability, for improving the design of bot recommendations. We present the results of a preliminary study evaluating this choice architecture in a bot and provide implications for integrating choice architecture into the design of future software engineering bots.
软件机器人,或机器人,对于自动化各种各样的编程和软件开发任务非常有用。尽管在整个软件工程过程中使用机器人具有优势,但研究表明,开发人员经常面临与这些系统交互的挑战。为了改进机器人的自动开发人员推荐,这项工作引入了开发人员推荐选择架构。选择架构是一个行为科学概念,它表明选择的呈现会影响人们做出的决策。为了评估框架推荐对软件工程师的影响,我们研究了一种选择架构、可操作性对改进机器人推荐设计的影响。我们提出了一项初步研究的结果,评估了机器人中的这种选择架构,并为将选择架构集成到未来软件工程机器人的设计中提供了启示。
{"title":"Sorry to Bother You Again: Developer Recommendation Choice Architectures for Designing Effective Bots","authors":"Chris Brown, Chris Parnin","doi":"10.1145/3387940.3391506","DOIUrl":"https://doi.org/10.1145/3387940.3391506","url":null,"abstract":"Software robots, or bots, are useful for automating a wide variety of programming and software development tasks. Despite the advantages of using bots throughout the software engineering process, research shows that developers often face challenges interacting with these systems. To improve automated developer recommendations from bots, this work introduces developer recommendation choice architectures. Choice architecture is a behavioral science concept that suggests the presentation of options impacts the decisions humans make. To evaluate the impact of framing recommendations for software engineers, we examine the impact of one choice architecture, actionability, for improving the design of bot recommendations. We present the results of a preliminary study evaluating this choice architecture in a bot and provide implications for integrating choice architecture into the design of future software engineering bots.","PeriodicalId":309659,"journal":{"name":"Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131732763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1