Title: Suggesting meaningful variable names for decompiled code: a machine translation approach
Authors: A. Jaffe
DOI: 10.1145/3106237.3121274
Abstract: Decompiled code lacks meaningful variable names. We used statistical machine translation to suggest variable names that are natural given the context. This technique has previously been successfully applied to obfuscated JavaScript code, but decompiled C code poses unique challenges in constructing an aligned corpus and selecting the best translation from among several candidates.
Title: Record and replay for Android: are we there yet in industrial cases?
Authors: Wing Lam, Zhengkai Wu, Dengfeng Li, Wenyu Wang, Haibing Zheng, Hui Luo, Peng Yan, Yuetang Deng, Tao Xie
DOI: 10.1145/3106237.3117769
Abstract: Mobile applications, or apps for short, are gaining popularity. The input sources (e.g., touchscreen, sensors, transmitters) of the smart devices that host these apps enable the apps to offer a rich experience to the users, but these input sources pose testing complications to the developers (e.g., writing tests to accurately utilize multiple input sources together and be able to replay such tests at a later time). To alleviate these complications, researchers and practitioners in recent years have developed a variety of record-and-replay tools to support the testing expressiveness of smart devices. These tools allow developers to easily record and automate the replay of complicated usage scenarios of their app. Due to Android's large share of the smart-device market, numerous record-and-replay tools have been developed using a variety of techniques to test Android apps. To better understand the strengths and weaknesses of these tools, we present a comparison of popular record-and-replay tools from researchers and practitioners, by applying these tools to test three popular industrial apps downloaded from the Google Play store. Our comparison is based on three main metrics: (1) ability to reproduce common usage scenarios, (2) space overhead of traces created by the tools, and (3) robustness of traces created by the tools (when being replayed on devices with different resolutions). The results from our comparison show which record-and-replay tools may be the best for developers and identify future directions for improving these tools to better address testing complications of smart devices.
Title: Practical symbolic verification of regular properties
Authors: Hengbiao Yu
DOI: 10.1145/3106237.3121275
Abstract: It is challenging to verify regular properties of programs. This paper presents symbolic regular verification (SRV), a dynamic symbolic execution based technique for verifying regular properties. The key technique of SRV is a novel synergistic combination of property-oriented path slicing and guiding to mitigate the path explosion problem. Indeed, slicing can prune redundant paths, while guiding can boost the finding of counterexamples. We have implemented SRV for Java and evaluated it on 16 real-world open-source Java programs (totaling 270K lines of code). The experimental results demonstrate the effectiveness and efficiency of SRV.
Title: Reasons and drawbacks of using trivial npm packages: the developers' perspective
Authors: Rabe Abdalkareem
DOI: 10.1145/3106237.3121278
Abstract: Code reuse is traditionally seen as good practice. Recent trends have pushed the idea of code reuse to an extreme by using packages that implement simple and trivial tasks, which we call 'trivial packages'. A recent incident in which a trivial package led to the breakdown of some of the most popular web applications, such as Facebook and Netflix, put the spotlight on whether using trivial packages should be encouraged. Therefore, in this research, we mine more than 230,000 npm packages and 38,000 JavaScript projects to study the prevalence of trivial packages. We found that trivial packages are common, making up 16.8% of the studied npm packages. We surveyed 88 Node.js developers who use trivial packages to understand the reasons for and drawbacks of their use. We found that trivial packages are used because they are perceived to be well-implemented and well-tested pieces of code. However, developers are concerned about the maintenance overhead and the risk of breakage caused by the extra dependencies that trivial packages introduce.
Title: Understanding the impact of refactoring on smells: a longitudinal study of 23 software projects
Authors: Diego Cedrim, Alessandro F. Garcia, Melina Mongiovi, Rohit Gheyi, L. Sousa, R. Mello, B. Neto, Márcio Ribeiro, Alexander Chávez
DOI: 10.1145/3106237.3106259
Abstract: Code smells in a program are indications of structural quality problems that can be addressed by software refactoring. However, refactoring is applied with different goals in practice, and it may not reduce smelly structures; developers may neglect existing smells or end up creating new ones while refactoring. Unfortunately, little has been reported about the beneficial and harmful effects of refactoring on code smells. This paper reports a longitudinal study intended to address this gap. We analyze how often commonly used refactoring types affect the density of 13 types of code smells along the version histories of 23 projects. Our findings are based on the analysis of 16,566 refactorings distributed across 10 different types. Even though 79.4% of the refactorings touched smelly elements, 57% did not reduce the occurrences of smells. Surprisingly, only 9.7% of refactorings removed smells, while 33.3% induced the introduction of new ones. More than 95% of such refactoring-induced smells were not removed in subsequent commits, which suggests that refactorings tend to introduce long-living smells more often than they eliminate existing ones. We also characterized and quantified typical refactoring-smell patterns and observed that harmful patterns are frequent, including: (i) approximately 30% of Move Method and Pull Up Method refactorings induced the emergence of God Class, and (ii) the Extract Superclass refactoring created the Speculative Generality smell in 68% of the cases.
Title: The power of "why" and "why not": enriching scenario exploration with provenance
Authors: Tim Nelson, Natasha Danas, Daniel J. Dougherty, S. Krishnamurthi
DOI: 10.1145/3106237.3106272
Abstract: Scenario-finding tools like the Alloy Analyzer are widely used in numerous concrete domains such as security, network analysis, and UML analysis. They can help verify properties and, more generally, aid in exploring a system's behavior. While scenario finders are valuable for their ability to produce concrete examples, individual scenarios only give insight into what is possible, leaving the user to draw their own conclusions about what might be necessary. This paper enriches scenario finding by allowing users to ask "why?" and "why not?" questions about the examples they are given. We show how to distinguish parts of an example that cannot be consistently removed (or changed) from those that merely reflect underconstraint in the specification. In the former case, we show how to determine which elements of the specification and which other components of the example together explain the presence of such facts. This paper formalizes the act of computing provenance in scenario finding. We present Amalgam, an extension of the popular Alloy scenario finder, which implements these foundations and provides interactive exploration of examples. We also evaluate Amalgam's algorithmics on a variety of both textbook and real-world examples.
Title: Reproducing concurrency failures from crash stacks
Authors: F. A. Bianchi, M. Pezzè, Valerio Terragni
DOI: 10.1145/3106237.3106292
Abstract: Reproducing field failures is the first essential step in understanding, localizing, and removing faults. Reproducing concurrency field failures is hard because it requires synthesizing test code together with a thread interleaving that induces the failure, given only the limited information available from the field. Current techniques for reproducing concurrency failures focus on identifying failure-inducing interleavings, leaving largely open the problem of synthesizing the test code that manifests such interleavings. In this paper, we present ConCrash, a technique that automatically generates test code to reproduce concurrency failures that violate thread safety, starting from crash stacks, which commonly summarize the conditions of field failures. ConCrash efficiently explores the huge space of possible test codes to identify a failure-inducing one by using a suitable set of search-pruning strategies. Combined with existing techniques for exploring interleavings, ConCrash automatically reproduces a given concurrency failure that violates the thread safety of a class by identifying both a failure-inducing test code and a corresponding interleaving. In the paper, we define the ConCrash approach, present a prototype implementation, and discuss the experimental results that we obtained on a known set of ten field failures, which witness the effectiveness of the approach.
Title: Screening heuristics for project gating systems
Authors: Zahy Volf, Edi Shmueli
DOI: 10.1145/3106237.3117766
Abstract: Continuous Integration (CI) is a hot topic, yet little attention is paid to how CI systems work and what affects their behavior. In parallel, bug prediction in software is gaining considerable attention, but mostly in the context of software engineering; its relation to CI and the engineering of CI systems has not yet been established. In this paper we describe how Project Gating systems operate; these are a specific type of CI system used to keep the mainline of development always clean. We propose and evaluate three heuristics for improving Gating performance and demonstrate their trade-offs. The third heuristic, which leverages state-of-the-art bug prediction, achieves the best performance across the entire spectrum of workload conditions.
Title: Measuring neural efficiency of program comprehension
Authors: J. Siegmund, Norman Peitek, Chris Parnin, S. Apel, Johannes C. Hofmeister, Christian Kästner, Andrew Begel, A. Bethmann, A. Brechmann
DOI: 10.1145/3106237.3106268
Abstract: Most modern software programs cannot be understood in their entirety by a single programmer. Instead, programmers must rely on a set of cognitive processes that aid in seeking, filtering, and shaping relevant information for a given programming task. Several theories have been proposed to explain these processes, such as "beacons" for locating relevant code and "plans" for encoding cognitive models. However, these theories are decades old and lack validation with modern cognitive-neuroscience methods. In this paper, we report on a study using functional magnetic resonance imaging (fMRI) with 11 participants who performed program comprehension tasks. We manipulated experimental conditions related to beacons and layout to isolate specific cognitive processes related to bottom-up comprehension and comprehension based on semantic cues. We found evidence of semantic chunking during bottom-up comprehension and lower activation of brain areas during comprehension based on semantic cues, confirming that beacons ease comprehension.
Title: Static analysis for optimizing big data queries
Authors: D. Garbervetsky, Zvonimir Pavlinovic, Mike Barnett, Madan Musuvathi, Todd Mytkowicz, Edgardo Zoppi
DOI: 10.1145/3106237.3117774
Abstract: Query languages for big data analysis provide user extensibility through a mechanism of user-defined operators (UDOs). These operators allow programmers to write proprietary functionality on top of a relational query skeleton. However, achieving effective query optimization for such languages is extremely challenging, because the optimizer needs to understand the data dependencies induced by UDOs. SCOPE, the query language from Microsoft, allows hand-coded declarations of UDO data dependencies. Unfortunately, most programmers avoid using this facility, since writing and maintaining the declarations is tedious and error-prone. In this work, we designed and implemented two sound and robust static analyses for computing UDO data dependencies. The analyses detect which columns of an input table are never used or pass through a UDO unchanged. This information can be used to significantly improve the execution of SCOPE scripts. We evaluate our analyses on thousands of real-world queries and show that we can catch many unused and pass-through columns automatically, without relying on any manually provided declarations.