Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering最新文献

英文中文

An empirical study on program comprehension with reactive programming 响应式编程对程序理解的实证研究

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

Pub Date : 2014-11-11 DOI: 10.1145/2635868.2635895

G. Salvaneschi, Sven Amann, Sebastian Proksch, M. Mezini

Starting from the first investigations with strictly functional languages, reactive programming has been proposed as THE programming paradigm for reactive applications. The advantages of designs based on this style over designs based on the Observer design pattern have been studied for a long time. Over the years, researchers have enriched reactive languages with more powerful abstractions, embedded these abstractions into mainstream languages – including object-oriented languages – and applied reactive programming to several domains, like GUIs, animations, Web applications, robotics, and sensor networks. However, an important assumption behind this line of research – that, beside other advantages, reactive programming makes a wide class of otherwise cumbersome applications more comprehensible – has never been evaluated. In this paper, we present the design and the results of the first empirical study that evaluates the effect of reactive programming on comprehensibility compared to the traditional object-oriented style with the Observer design pattern. Results confirm the conjecture that comprehensibility is enhanced by reactive programming. In the experiment, the reactive programming group significantly outperforms the other group.

从对严格函数式语言的第一次研究开始，响应式编程已经被提出作为响应式应用程序的编程范式。基于这种风格的设计相对于基于观察者设计模式的设计的优势已经被研究了很长时间。多年来，研究人员用更强大的抽象丰富了响应式语言，将这些抽象嵌入主流语言(包括面向对象语言)，并将响应式编程应用于几个领域，如gui、动画、Web应用程序、机器人和传感器网络。然而，这条研究路线背后的一个重要假设——除了其他优点之外，响应式编程使大量原本笨重的应用程序更容易理解——从未被评估过。在本文中，我们展示了设计和第一项实证研究的结果，该研究评估了响应式编程与传统面向对象风格和观察者设计模式相比对可理解性的影响。结果证实了反应性编程提高了可理解性的猜想。在实验中，反应性编程组的表现明显优于其他组。

{"title":"An empirical study on program comprehension with reactive programming","authors":"G. Salvaneschi, Sven Amann, Sebastian Proksch, M. Mezini","doi":"10.1145/2635868.2635895","DOIUrl":"https://doi.org/10.1145/2635868.2635895","url":null,"abstract":"Starting from the first investigations with strictly functional languages, reactive programming has been proposed as THE programming paradigm for reactive applications. The advantages of designs based on this style over designs based on the Observer design pattern have been studied for a long time. Over the years, researchers have enriched reactive languages with more powerful abstractions, embedded these abstractions into mainstream languages – including object-oriented languages – and applied reactive programming to several domains, like GUIs, animations, Web applications, robotics, and sensor networks. However, an important assumption behind this line of research – that, beside other advantages, reactive programming makes a wide class of otherwise cumbersome applications more comprehensible – has never been evaluated. In this paper, we present the design and the results of the first empirical study that evaluates the effect of reactive programming on comprehensibility compared to the traditional object-oriented style with the Observer design pattern. Results confirm the conjecture that comprehensibility is enhanced by reactive programming. In the experiment, the reactive programming group significantly outperforms the other group.","PeriodicalId":250543,"journal":{"name":"Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123959560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 44

Omen+: a precise dynamic deadlock detector for multithreaded Java libraries Omen+:用于多线程Java库的精确动态死锁检测器

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

Pub Date : 2014-11-11 DOI: 10.1145/2635868.2661670

Malavika Samak, M. Ramanathan

Designing thread-safe libraries without concurrency defects can be a challenging task. Detecting deadlocks while invoking methods in these libraries concurrently is hard due to the possible number of method invocation combinations, the object assignments to the parameters and the associated thread interleavings. In this paper, we describe the design and implementation of OMEN+ that takes a multithreaded library as the input and detects true deadlocks in a scalable manner. We achieve this by automatically synthesizing relevant multithreaded tests and analyze the associated execution traces using a precise deadlock detector. We validate the usefulness of OMEN+ by applying it on many multithreaded Java libraries and detect a number of deadlocks even in documented thread-safe libraries. The tool is available for free download at http://www.csa.iisc.ernet.in/~sss/tool/omenplus.html.

设计没有并发性缺陷的线程安全库可能是一项具有挑战性的任务。在并发调用这些库中的方法时检测死锁是很困难的，因为可能有很多方法调用组合、对象对参数的赋值以及相关的线程交错。在本文中，我们描述了OMEN+的设计和实现，它以多线程库作为输入，并以可扩展的方式检测真正的死锁。我们通过自动合成相关的多线程测试并使用精确的死锁检测器分析相关的执行跟踪来实现这一点。我们通过将OMEN+应用于许多多线程Java库来验证它的有用性，甚至在文档化的线程安全库中检测到许多死锁。该工具可在http://www.csa.iisc.ernet.in/~sss/tool/omenplus.html上免费下载。

引用次数: 9

On the efficiency of automated testing 关于自动化测试的效率

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

Pub Date : 2014-11-11 DOI: 10.1145/2635868.2635923

Marcel Böhme, Soumya Paul

The aim of automated program testing is to gain confidence about a program's correctness by sampling its input space. The sampling process can be either systematic or random. For every systematic testing technique the sampling is informed by the analysis of some program artefacts, like the specification, the source code (e.g., to achieve coverage), or even faulty versions of the program (e.g., mutation testing). This analysis incurs some cost. In contrast, random testing is unsystematic and does not sustain any analysis cost. In this paper, we investigate the theoretical efficiency of systematic versus random testing. First, we mathematically model the most effective systematic testing technique S_0 in which every sampled test input strictly increases the "degree of confidence" and is subject to the analysis cost c. Note that the efficiency of S_0 depends on c. Specifically, if we increase c, we also increase the time it takes S_0 to establish the same degree of confidence. So, there exists a maximum analysis cost beyond which R is generally more efficient than S_0. Given that we require the confidence that the program works correctly for x% of its input, we prove an upper bound on c of S_0, beyond which R is more efficient on the average. We also show that this bound depends asymptotically only on x. For instance, let R take 10ms time to sample one test input; to establish that the program works correctly for 90% of its input, S_0 must take less than 41ms to sample one test input. Otherwise, R is expected to establish the 90%-degree of confidence earlier. We prove similar bounds on the cost if the software tester is interested in revealing as many errors as possible in a given time span.

自动化程序测试的目的是通过采样程序的输入空间来获得对程序正确性的信心。抽样过程可以是系统的，也可以是随机的。对于每一种系统的测试技术，抽样都是通过对一些程序工件的分析得到的，比如规范、源代码(例如，实现覆盖)，或者甚至是程序的错误版本(例如，突变测试)。这种分析产生了一些成本。相比之下，随机测试是非系统的，不需要任何分析成本。在本文中，我们研究了系统与随机测试的理论效率。首先，我们对最有效的系统测试技术S_0进行数学建模，其中每个采样测试输入严格增加“置信度”，并受分析成本c的约束。请注意，S_0的效率取决于c。具体而言，如果我们增加c，我们也增加了S_0建立相同置信度所需的时间。因此，存在一个最大分析成本，超过这个成本R通常比S_0更有效。假设我们需要确信程序在其输入的x%下正确工作，我们证明了S_0的c的上界，超过这个上界R的平均效率更高。我们还证明了这个界只渐近地依赖于x。例如，让R花10ms的时间对一个测试输入进行采样;为了确定程序对90%的输入正确工作，S_0必须花费少于41ms的时间来采样一个测试输入。否则，预计R会更早地建立90%的置信度。如果软件测试人员对在给定的时间范围内发现尽可能多的错误感兴趣，我们证明了类似的成本界限。

{"title":"On the efficiency of automated testing","authors":"Marcel Böhme, Soumya Paul","doi":"10.1145/2635868.2635923","DOIUrl":"https://doi.org/10.1145/2635868.2635923","url":null,"abstract":"The aim of automated program testing is to gain confidence about a program's correctness by sampling its input space. The sampling process can be either systematic or random. For every systematic testing technique the sampling is informed by the analysis of some program artefacts, like the specification, the source code (e.g., to achieve coverage), or even faulty versions of the program (e.g., mutation testing). This analysis incurs some cost. In contrast, random testing is unsystematic and does not sustain any analysis cost. In this paper, we investigate the theoretical efficiency of systematic versus random testing. First, we mathematically model the most effective systematic testing technique S_0 in which every sampled test input strictly increases the \"degree of confidence\" and is subject to the analysis cost c. Note that the efficiency of S_0 depends on c. Specifically, if we increase c, we also increase the time it takes S_0 to establish the same degree of confidence. So, there exists a maximum analysis cost beyond which R is generally more efficient than S_0. Given that we require the confidence that the program works correctly for x% of its input, we prove an upper bound on c of S_0, beyond which R is more efficient on the average. We also show that this bound depends asymptotically only on x. For instance, let R take 10ms time to sample one test input; to establish that the program works correctly for 90% of its input, S_0 must take less than 41ms to sample one test input. Otherwise, R is expected to establish the 90%-degree of confidence earlier. We prove similar bounds on the cost if the software tester is interested in revealing as many errors as possible in a given time span.","PeriodicalId":250543,"journal":{"name":"Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126597379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 23

ORBS: language-independent program slicing ORBS:独立于语言的程序切片

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

Pub Date : 2014-11-11 DOI: 10.1145/2635868.2635893

D. Binkley, N. Gold, M. Harman, Syed S. Islam, J. Krinke, S. Yoo

Current slicing techniques cannot handle systems written in multiple programming languages. Observation-Based Slicing (ORBS) is a language-independent slicing technique capable of slicing multi-language systems, including systems which contain (third party) binary components. A potential slice obtained through repeated statement deletion is validated by observing the behaviour of the program: if the slice and original program behave the same under the slicing criterion, the deletion is accepted. The resulting slice is similar to a dynamic slice. We evaluate five variants of ORBS on ten programs of different sizes and languages showing that it is less expensive than similar existing techniques. We also evaluate it on bash and four other systems to demonstrate feasible large-scale operation in which a parallelised ORBS needs up to 82% less time when using four threads. The results show that an ORBS slicer is simple to construct, effective at slicing, and able to handle systems written in multiple languages without specialist analysis tools.

当前的切片技术不能处理用多种编程语言编写的系统。基于观察的切片(ORBS)是一种独立于语言的切片技术，能够切片多语言系统，包括包含(第三方)二进制组件的系统。通过观察程序的行为来验证通过重复语句删除获得的潜在切片:如果切片和原始程序在切片标准下的行为相同，则接受删除。生成的切片类似于动态切片。我们在10个不同规模和语言的程序中评估了五种ORBS变体，结果表明它比类似的现有技术更便宜。我们还在bash和其他四个系统上对其进行了评估，以演示可行的大规模操作，其中并行化的orb在使用四个线程时需要的时间最多减少82%。结果表明，ORBS切片器构造简单，切片效率高，无需专业分析工具即可处理多种语言编写的系统。

引用次数: 80

BugLocalizer: integrated tool support for bug localization BugLocalizer:支持bug本地化的集成工具

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

Pub Date : 2014-11-11 DOI: 10.1145/2635868.2661678

Ferdian Thung, Tien-Duy B. Le, Pavneet Singh Kochhar, D. Lo

To manage bugs that appear in a software, developers often make use of a bug tracking system such as Bugzilla. Users can report bugs that they encounter in such a system. Whenever a user reports a new bug report, developers need to read the summary and description of the bug report and manually locate the buggy files based on this information. This manual process is often time consuming and tedious. Thus, a number of past studies have proposed bug localization techniques to automatically recover potentially buggy files from bug reports. Unfortunately, none of these techniques are integrated to bug tracking systems and thus it hinders their adoption by practitioners. To help disseminate research in bug localization to practitioners, we develop a tool named BugLocalizer, which is implemented as a Bugzilla extension and builds upon a recently proposed bug localization technique. Our tool extracts texts from summary and description fields of a bug report and source code files. It then computes similarities of the bug report with source code files to find the buggy files. Developers can use our tool online from a Bugzilla web interface by providing a link to a git source code repository and specifying the version of the repository to be analyzed. We have released our tool publicly in GitHub, which is available at: https://github.com/smagsmu/buglocalizer. We have also provided a demo video, which can be accessed at: http://youtu.be/iWHaLNCUjBY.

为了管理软件中出现的bug，开发人员经常使用bug跟踪系统，如Bugzilla。用户可以报告他们在这样的系统中遇到的错误。每当用户报告新的错误报告时，开发人员需要阅读错误报告的摘要和描述，并根据这些信息手动定位有错误的文件。这种手工过程通常既耗时又乏味。因此，许多过去的研究已经提出了bug本地化技术，以便从bug报告中自动恢复潜在的bug文件。不幸的是，这些技术都没有集成到bug跟踪系统中，因此阻碍了从业者对它们的采用。为了帮助向从业者传播bug定位的研究，我们开发了一个名为BugLocalizer的工具，它作为Bugzilla的扩展实现，并建立在最近提出的bug定位技术的基础上。我们的工具从bug报告和源代码文件的摘要和描述字段中提取文本。然后，它计算bug报告与源代码文件的相似性，以查找有bug的文件。开发人员可以在Bugzilla的web界面上在线使用我们的工具，只需提供一个git源代码库的链接，并指定要分析的库的版本。我们已经在GitHub上公开发布了我们的工具，可以在:https://github.com/smagsmu/buglocalizer上获得。我们还提供了一个演示视频，可以在http://youtu.be/iWHaLNCUjBY上访问。

{"title":"BugLocalizer: integrated tool support for bug localization","authors":"Ferdian Thung, Tien-Duy B. Le, Pavneet Singh Kochhar, D. Lo","doi":"10.1145/2635868.2661678","DOIUrl":"https://doi.org/10.1145/2635868.2661678","url":null,"abstract":"To manage bugs that appear in a software, developers often make use of a bug tracking system such as Bugzilla. Users can report bugs that they encounter in such a system. Whenever a user reports a new bug report, developers need to read the summary and description of the bug report and manually locate the buggy files based on this information. This manual process is often time consuming and tedious. Thus, a number of past studies have proposed bug localization techniques to automatically recover potentially buggy files from bug reports. Unfortunately, none of these techniques are integrated to bug tracking systems and thus it hinders their adoption by practitioners. To help disseminate research in bug localization to practitioners, we develop a tool named BugLocalizer, which is implemented as a Bugzilla extension and builds upon a recently proposed bug localization technique. Our tool extracts texts from summary and description fields of a bug report and source code files. It then computes similarities of the bug report with source code files to find the buggy files. Developers can use our tool online from a Bugzilla web interface by providing a link to a git source code repository and specifying the version of the repository to be analyzed. We have released our tool publicly in GitHub, which is available at: https://github.com/smagsmu/buglocalizer. We have also provided a demo video, which can be accessed at: http://youtu.be/iWHaLNCUjBY.","PeriodicalId":250543,"journal":{"name":"Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering","volume":"26 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121042431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 22

FlowTwist: efficient context-sensitive inside-out taint analysis for large codebases FlowTwist:针对大型代码库的高效上下文敏感的由内到外的污染分析

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

Pub Date : 2014-11-11 DOI: 10.1145/2635868.2635878

Johannes Lerch, Ben Hermann, E. Bodden, M. Mezini

Over the past years, widely used platforms such as the Java Class Library have been under constant attack through vulnerabilities that involve a combination of two taint-analysis problems: an integrity problem allowing attackers to trigger sensitive operations within the platform, and a confidentiality problem allowing the attacker to retrieve sensitive information or pointers from the results of those operations. While existing static taint analyses are good at solving either of those problems, we show that they scale prohibitively badly when being applied to situations that require the exploitation of both an integrity and confidentiality problem in combination. The main problem is the huge attack surface of libraries such as the Java Class Library, which exposes thousands of methods potentially controllable by an attacker. In this work we thus present FlowTwist, a novel taint-analysis approach that works inside-out, i.e., tracks data flows from potentially vulnerable calls to the outer level of the API which the attacker might control. This inside-out analysis requires a careful, context-sensitive coordination of both a backward and a forward taint analysis. In this work, we expose a design of the analysis approach based on the IFDS algorithm, and explain several extensions to IFDS that enable not only this coordination but also a helpful reporting of error situations to security analysts. Experiments with the Java Class Library show that, while a simple forward taint-analysis approach does not scale even with much machine power, FlowTwist's algorithm is able to fully analyze the library within 10 minutes.

在过去的几年中，广泛使用的平台(如Java Class Library)一直受到漏洞的持续攻击，这些漏洞涉及两个污染分析问题的组合:完整性问题允许攻击者触发平台内的敏感操作，机密性问题允许攻击者从这些操作的结果中检索敏感信息或指针。虽然现有的静态污染分析很好地解决了这些问题中的任何一个，但我们表明，当应用于需要同时利用完整性和机密性问题的情况时，它们的可伸缩性非常糟糕。主要问题是库(如Java Class Library)的巨大攻击面，它暴露了攻击者可能控制的数千种方法。在这项工作中，我们因此提出了FlowTwist，这是一种新颖的由内而外的污染分析方法，即跟踪从潜在易受攻击的调用到攻击者可能控制的API外部级别的数据流。这种由内而外的分析需要仔细地、上下文敏感地协调向后和向前的污染分析。在这项工作中，我们揭示了基于IFDS算法的分析方法的设计，并解释了IFDS的几个扩展，这些扩展不仅可以实现这种协调，还可以帮助向安全分析师报告错误情况。Java类库的实验表明，虽然简单的前向污染分析方法即使在机器功率很大的情况下也无法扩展，但FlowTwist的算法能够在10分钟内完全分析该库。

{"title":"FlowTwist: efficient context-sensitive inside-out taint analysis for large codebases","authors":"Johannes Lerch, Ben Hermann, E. Bodden, M. Mezini","doi":"10.1145/2635868.2635878","DOIUrl":"https://doi.org/10.1145/2635868.2635878","url":null,"abstract":"Over the past years, widely used platforms such as the Java Class Library have been under constant attack through vulnerabilities that involve a combination of two taint-analysis problems: an integrity problem allowing attackers to trigger sensitive operations within the platform, and a confidentiality problem allowing the attacker to retrieve sensitive information or pointers from the results of those operations. While existing static taint analyses are good at solving either of those problems, we show that they scale prohibitively badly when being applied to situations that require the exploitation of both an integrity and confidentiality problem in combination. The main problem is the huge attack surface of libraries such as the Java Class Library, which exposes thousands of methods potentially controllable by an attacker. In this work we thus present FlowTwist, a novel taint-analysis approach that works inside-out, i.e., tracks data flows from potentially vulnerable calls to the outer level of the API which the attacker might control. This inside-out analysis requires a careful, context-sensitive coordination of both a backward and a forward taint analysis. In this work, we expose a design of the analysis approach based on the IFDS algorithm, and explain several extensions to IFDS that enable not only this coordination but also a helpful reporting of error situations to security analysts. Experiments with the Java Class Library show that, while a simple forward taint-analysis approach does not scale even with much machine power, FlowTwist's algorithm is able to fully analyze the library within 10 minutes.","PeriodicalId":250543,"journal":{"name":"Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121321054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 37

Learning to rank relevant files for bug reports using domain knowledge 学习使用领域知识对bug报告的相关文件进行排序

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

Pub Date : 2014-11-11 DOI: 10.1145/2635868.2635874

Xin Ye, Razvan C. Bunescu, Chang Liu

When a new bug report is received, developers usually need to reproduce the bug and perform code reviews to find the cause, a process that can be tedious and time consuming. A tool for ranking all the source files of a project with respect to how likely they are to contain the cause of the bug would enable developers to narrow down their search and potentially could lead to a substantial increase in productivity. This paper introduces an adaptive ranking approach that leverages domain knowledge through functional decompositions of source code files into methods, API descriptions of library components used in the code, the bug-fixing history, and the code change history. Given a bug report, the ranking score of each source file is computed as a weighted combination of an array of features encoding domain knowledge, where the weights are trained automatically on previously solved bug reports using a learning-to-rank technique. We evaluated our system on six large scale open source Java projects, using the before-fix version of the project for every bug report. The experimental results show that the newly introduced learning-to-rank approach significantly outperforms two recent state-of-the-art methods in recommending relevant files for bug reports. In particular, our method makes correct recommendations within the top 10 ranked source files for over 70% of the bug reports in the Eclipse Platform and Tomcat projects.

当收到新的错误报告时，开发人员通常需要重新生成错误并执行代码审查以找到原因，这是一个乏味且耗时的过程。如果有一个工具可以根据包含错误原因的可能性对项目的所有源文件进行排序，这将使开发人员能够缩小搜索范围，并可能大大提高生产力。本文介绍了一种自适应排序方法，该方法通过将源代码文件分解为方法、代码中使用的库组件的API描述、错误修复历史和代码更改历史来利用领域知识。给定一个错误报告，每个源文件的排名分数被计算为编码领域知识的特征数组的加权组合，其中权重是使用学习排序技术在先前解决的错误报告上自动训练的。我们在六个大型开源Java项目上评估了我们的系统，对每个bug报告使用修复前的项目版本。实验结果表明，新引入的学习排序方法在为bug报告推荐相关文件方面明显优于最近两种最先进的方法。特别是，我们的方法对Eclipse平台和Tomcat项目中超过70%的bug报告中排名前10位的源文件给出了正确的建议。

{"title":"Learning to rank relevant files for bug reports using domain knowledge","authors":"Xin Ye, Razvan C. Bunescu, Chang Liu","doi":"10.1145/2635868.2635874","DOIUrl":"https://doi.org/10.1145/2635868.2635874","url":null,"abstract":"When a new bug report is received, developers usually need to reproduce the bug and perform code reviews to find the cause, a process that can be tedious and time consuming. A tool for ranking all the source files of a project with respect to how likely they are to contain the cause of the bug would enable developers to narrow down their search and potentially could lead to a substantial increase in productivity. This paper introduces an adaptive ranking approach that leverages domain knowledge through functional decompositions of source code files into methods, API descriptions of library components used in the code, the bug-fixing history, and the code change history. Given a bug report, the ranking score of each source file is computed as a weighted combination of an array of features encoding domain knowledge, where the weights are trained automatically on previously solved bug reports using a learning-to-rank technique. We evaluated our system on six large scale open source Java projects, using the before-fix version of the project for every bug report. The experimental results show that the newly introduced learning-to-rank approach significantly outperforms two recent state-of-the-art methods in recommending relevant files for bug reports. In particular, our method makes correct recommendations within the top 10 ranked source files for over 70% of the bug reports in the Eclipse Platform and Tomcat projects.","PeriodicalId":250543,"journal":{"name":"Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129012322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 257

Architecture challenges for internal software ecosystems: a large-scale industry case study 内部软件生态系统的架构挑战:大型行业案例研究

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

Pub Date : 2014-11-11 DOI: 10.1145/2635868.2635876

K. Schultis, Christoph Elsner, D. Lohmann

The idea of software ecosystems encourages organizations to open software projects for external businesses, governing the cross-organizational development by architectural and other measures. Even within a single organization, this paradigm can be of high value for large-scale decentralized software projects that involve various internal, yet self-contained organizational units. However, this intra-organizational decentralization causes architecture challenges that must be understood to reason about suitable architectural measures. We present an in-depth case study on collaboration and architecture challenges in two of these large-scale software projects at Siemens. We performed a total of 46 hours of semi-structured interviews with 17 leading software architects from all involved organizational units. Our major findings are: (1) three collaboration models on a continuum that ranges from high to low coupling, (2) a classification of architecture challenges, together with (3) a qualitative and quantitative exposure of the identified recurring issues along each collaboration model. Our study results provide valuable insights for both industry and academia: Practitioners that find themselves in one of the collaboration models can use empirical evidence on challenges to make informed decisions about counteractive measures. Researchers can focus their attention on challenges faced by practitioners to make software engineering more effective.

软件生态系统的概念鼓励组织为外部业务开放软件项目，通过架构和其他措施管理跨组织的开发。即使在单个组织中，这种范例对于涉及各种内部但自包含的组织单元的大型分散软件项目也具有很高的价值。然而，这种组织内部的去中心化导致了架构挑战，必须理解这些挑战才能推断出合适的架构度量。我们对西门子两个大型软件项目中的协作和架构挑战进行了深入的案例研究。我们对来自所有相关组织单位的17位主要软件架构师进行了总共46小时的半结构化访谈。我们的主要发现是:(1)从高耦合到低耦合的连续体上的三个协作模型，(2)架构挑战的分类，以及(3)对每个协作模型中确定的重复问题的定性和定量暴露。我们的研究结果为工业界和学术界提供了有价值的见解:发现自己处于某个协作模型中的从业者可以使用挑战的经验证据来做出有关反措施的明智决策。研究人员可以将注意力集中在实践者所面临的挑战上，以使软件工程更有效。

{"title":"Architecture challenges for internal software ecosystems: a large-scale industry case study","authors":"K. Schultis, Christoph Elsner, D. Lohmann","doi":"10.1145/2635868.2635876","DOIUrl":"https://doi.org/10.1145/2635868.2635876","url":null,"abstract":"The idea of software ecosystems encourages organizations to open software projects for external businesses, governing the cross-organizational development by architectural and other measures. Even within a single organization, this paradigm can be of high value for large-scale decentralized software projects that involve various internal, yet self-contained organizational units. However, this intra-organizational decentralization causes architecture challenges that must be understood to reason about suitable architectural measures. We present an in-depth case study on collaboration and architecture challenges in two of these large-scale software projects at Siemens. We performed a total of 46 hours of semi-structured interviews with 17 leading software architects from all involved organizational units. Our major findings are: (1) three collaboration models on a continuum that ranges from high to low coupling, (2) a classification of architecture challenges, together with (3) a qualitative and quantitative exposure of the identified recurring issues along each collaboration model. Our study results provide valuable insights for both industry and academia: Practitioners that find themselves in one of the collaboration models can use empirical evidence on challenges to make informed decisions about counteractive measures. Researchers can focus their attention on challenges faced by practitioners to make software engineering more effective.","PeriodicalId":250543,"journal":{"name":"Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116337045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 23

Data hard with a vengeance (invited talk) 数据的硬与复仇(特邀演讲)

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

Pub Date : 2014-11-11 DOI: 10.1145/2635868.2684431

Thomas Zimmermann

Action flicks and the analysis of software data in industry have more in common than you think. Both action heroes and development teams are on tight deadlines to save the day. Getting wrong information can lead to disastrous outcomes. In this talk, I will share experiences from my six years of research in the Empirical Software Engineering Group working with engineers towards sound data-driven decision about software.

动作电影和工业软件数据分析的共同点比你想象的要多。动作英雄和开发团队都在紧迫的最后期限内拯救世界。获得错误的信息可能会导致灾难性的后果。在这次演讲中，我将分享我在经验软件工程小组的六年研究经验，这些研究与工程师们一起致力于关于软件的可靠数据驱动决策。

引用次数: 0

Improving the software testing skills of novices during onboarding through social transparency 通过社会透明度提高新手入职时的软件测试技能

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

Pub Date : 2014-11-11 DOI: 10.1145/2635868.2666604

Raphael Pham

Inexperienced software developers - for example, undergraduates entering the workforce - exhibit a lack of testing skills. They have trouble understanding and applying basic testing techniques. These inexperienced developers are hired by software companies, where this lack of testing skills has already been recognized. Companies allocate valuable resources and invest time and money in different onboarding strategies to introduce new hires to the organization’s testing practices. However, if the lack of testing skills is not addressed properly, the new hire is left to her own devices. This hinders her in becoming a high-quality engineer for the software company. This thesis proposes to improve the onboarding strategies with traits of social transparency in order to specifically address testing issues of inexperienced new hires. Social transparency has been shown to influence the testing behavior of development teams on a social coding site. An environment that is open for discussion helps newcomers to understand and adapt a team’s testing culture. Tailoring the onboarding process to better address testing skills of new hires makes it more effective and more efficient. This reduces the danger of carrying new hire’s testing deficits into commercial software development.

没有经验的软件开发人员——例如，刚进入职场的大学生——表现出缺乏测试技能。他们很难理解和应用基本的测试技术。这些没有经验的开发人员被软件公司雇用，在那里他们已经认识到缺乏测试技能。公司分配宝贵的资源，并在不同的入职策略上投入时间和金钱，以将新员工引入组织的测试实践。然而，如果缺乏测试技能的问题没有得到妥善解决，新员工就会自生自弃。这阻碍了她成为软件公司的高质量工程师。本文提出改进具有社会透明度特征的入职策略，以具体解决缺乏经验的新员工的测试问题。社会透明度已经被证明会影响开发团队在社会编码站点上的测试行为。开放讨论的环境有助于新手理解和适应团队的测试文化。调整入职流程以更好地解决新员工的测试技能，使其更有效、更高效。这减少了将新员工的测试缺陷带入商业软件开发的危险。

{"title":"Improving the software testing skills of novices during onboarding through social transparency","authors":"Raphael Pham","doi":"10.1145/2635868.2666604","DOIUrl":"https://doi.org/10.1145/2635868.2666604","url":null,"abstract":"Inexperienced software developers - for example, undergraduates entering the workforce - exhibit a lack of testing skills. They have trouble understanding and applying basic testing techniques. These inexperienced developers are hired by software companies, where this lack of testing skills has already been recognized. Companies allocate valuable resources and invest time and money in different onboarding strategies to introduce new hires to the organization’s testing practices. However, if the lack of testing skills is not addressed properly, the new hire is left to her own devices. This hinders her in becoming a high-quality engineer for the software company. This thesis proposes to improve the onboarding strategies with traits of social transparency in order to specifically address testing issues of inexperienced new hires. Social transparency has been shown to influence the testing behavior of development teams on a social coding site. An environment that is open for discussion helps newcomers to understand and adapt a team’s testing culture. Tailoring the onboarding process to better address testing skills of new hires makes it more effective and more efficient. This reduces the danger of carrying new hire’s testing deficits into commercial software development.","PeriodicalId":250543,"journal":{"name":"Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering","volume":"12 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114109225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀