2012 9th IEEE Working Conference on Mining Software Repositories (MSR)最新文献

英文中文

Bug introducing changes: A case study with Android Bug引入变更:Android案例研究

2012 9th IEEE Working Conference on Mining Software Repositories (MSR)

Pub Date : 2012-06-02 DOI: 10.1109/MSR.2012.6224267

M. Asaduzzaman, Michael C. Bullock, C. Roy, Kevin A. Schneider

Changes, a rather inevitable part of software development can cause maintenance implications if they introduce bugs into the system. By isolating and characterizing these bug introducing changes it is possible to uncover potential risky source code entities or issues that produce bugs. In this paper, we mine the bug introducing changes in the Android platform by mapping bug reports to the changes that introduced the bugs. We then use the change information to look for both potential problematic parts and dynamics in development that can cause maintenance implications. We believe that the results of our study can help better manage Android software development.

更改是软件开发中不可避免的一部分，如果它们将错误引入系统，则可能导致维护问题。通过隔离和描述这些引入错误的更改，可以发现产生错误的潜在风险源代码实体或问题。在本文中，我们通过将bug报告映射到引入bug的变更来挖掘Android平台中引入bug的bug。然后，我们使用变更信息来查找开发中可能导致维护影响的潜在问题部件和动态。我们相信我们的研究结果可以帮助更好地管理Android软件开发。

引用次数: 46

Are faults localizable? 故障是否可定位?

2012 9th IEEE Working Conference on Mining Software Repositories (MSR)

Pub Date : 2012-06-02 DOI: 10.1109/MSR.2012.6224302

Lucia, Ferdian Thung, D. Lo, Lingxiao Jiang

Many fault localization techniques have been proposed to facilitate debugging activities. Most of them attempt to pinpoint the location of faults (i.e., localize faults) based on a set of failing and correct executions and expect debuggers to investigate a certain number of located program elements to find faults. These techniques thus assume that faults are localizable, i.e., only one or a few lines of code that are close to one another are responsible for each fault. However, in reality, are faults localizable? In this work, we investigate hundreds of real faults in several software systems, and find that many faults may not be localizable to a few lines of code and these include faults with high severity level.

为了方便调试活动，已经提出了许多故障定位技术。它们中的大多数都试图根据一组失败和正确的执行来精确定位错误的位置(即，定位错误)，并期望调试器调查一定数量的已定位的程序元素以发现错误。因此，这些技术假定错误是可定位的，也就是说，只有一行或几行彼此接近的代码对每个错误负责。然而，在现实中，故障是可定位的吗?在这项工作中，我们调查了几个软件系统中的数百个实际故障，发现许多故障可能无法定位到几行代码中，这些故障包括高严重级别的故障。

引用次数: 57

Inferring semantically related words from software context 从软件上下文中推断语义相关的单词

2012 9th IEEE Working Conference on Mining Software Repositories (MSR)

Pub Date : 2012-06-02 DOI: 10.1109/MSR.2012.6224276

Jinqiu Yang, Lin Tan

Code search is an integral part of software development and program comprehension. The difficulty of code search lies in the inability to guess the exact words used in the code. Therefore, it is crucial for keyword-based code search to expand queries with semantically related words, e.g., synonyms and abbreviations, to increase the search effectiveness. However, it is limited to rely on resources such as English dictionaries and WordNet to obtain semantically related words in software, because many words that are semantically related in software are not semantically related in English. This paper proposes a simple and general technique to automatically infer semantically related words in software by leveraging the context of words in comments and code. We achieve a reasonable accuracy in seven large and popular code bases written in C and Java. Our further evaluation against the state of art shows that our technique can achieve a higher precision and recall.

代码搜索是软件开发和程序理解的重要组成部分。代码搜索的困难在于无法猜测代码中使用的确切单词。因此，在基于关键字的代码搜索中，使用语义相关的词(如同义词和缩写)扩展查询，以提高搜索效率是至关重要的。但是，依靠英语词典、WordNet等资源来获取软件中语义相关的词是有局限性的，因为很多在软件中语义相关的词在英语中却没有语义相关。本文提出了一种简单而通用的技术，通过利用注释和代码中的单词上下文来自动推断软件中语义相关的单词。我们在用C和Java编写的七个大型和流行的代码库中实现了合理的准确性。我们对目前技术水平的进一步评估表明，我们的技术可以达到更高的精度和召回率。

引用次数: 92

The build dependency perspective of Android's concrete architecture Android具体架构的构建依赖关系视角

2012 9th IEEE Working Conference on Mining Software Repositories (MSR)

Pub Date : 2012-06-02 DOI: 10.1109/MSR.2012.6224270

Wei Hu, Dan Han, Abram Hindle, Kenny Wong

Android is an operating system designed specifically for mobile devices. It has a layered architecture. In this paper, we extract Android's concrete layered architecture by analyzing the build dependency relation between Android sub-projects and use it to validate the proposed conceptual architecture. Our experiment shows that Android's concrete architecture conforms to the conceptual architecture. Apart from that, we also show the extracted architecture can help developers and users better understand the Android system and further demonstrate its potential benefits in studying the impact of changes.

Android是一个专门为移动设备设计的操作系统。它有一个分层的架构。本文通过分析Android子项目之间的构建依赖关系，提取出Android的具体分层体系结构，并对提出的概念体系结构进行验证。我们的实验表明，Android的具体架构符合概念架构。除此之外，我们还展示了提取的架构可以帮助开发人员和用户更好地了解Android系统，并进一步展示其在研究变化影响方面的潜在好处。

引用次数: 10

MIC check: A correlation tactic for ESE data MIC检查:ESE数据的相关策略

2012 9th IEEE Working Conference on Mining Software Repositories (MSR)

Pub Date : 2012-06-01 DOI: 10.1109/MSR.2012.6224295

Daryl Posnett, Premkumar T. Devanbu, V. Filkov

Empirical software engineering researchers are concerned with understanding the relationships between outcomes of interest, e.g. defects, and process and product measures. The use of correlations to uncover strong relationships is a natural precursor to multivariate modeling. Unfortunately, correlation coefficients can be difficult and/or misleading to interpret. For example, a strong correlation occurs between variables that stand in a polynomial relationship; this may lead one mistakenly, and eventually misleadingly, to model a polynomially related variable in a linear regression. Likewise, a non-monotonic functional, or even non-functional relationship might be entirely missed by a correlation coefficient. Outliers can influence standard correlation measures, tied values can unduly influence even robust non-parametric rank correlation, measures, and smaller sample sizes can cause instability in correlation measures. A new bivariate measure of association, Maximal Information Coefficient (MIC) [1], promises to simultaneously discover if two variables have: a) any association, b) a functional relationship, and c) a nonlinear relationship. The MIC is a very useful complement to standard and rank correlation measures. It separately characterizes the existence of a relationship and its precise nature; thus, it enables more informed choices in modeling non-functional and nonlinear relationships, and a more nuanced indicator of potential problems with the values reported by standard and rank correlation measures. We illustrate the use of MIC using a variety of software engineering metrics. We study and explain the distributional properties of MIC and related measures in software engineering data, and illustrate the value of these measures for the empirical software engineering researcher.

经验软件工程研究人员关心的是理解感兴趣的结果(例如缺陷)与过程和产品度量之间的关系。使用相关性来揭示强关系是多变量建模的自然先驱。不幸的是，相关系数很难解释，而且/或容易引起误解。例如，多项式关系中的变量之间存在很强的相关性;这可能导致人们错误地，并最终误导，在线性回归中建立多项式相关变量的模型。同样，非单调泛函关系，甚至非泛函关系也可能被相关系数完全忽略。异常值会影响标准相关度量，关联值甚至会过度影响稳健的非参数秩相关度量，而较小的样本量会导致相关度量不稳定。一种新的二元关联度量，最大信息系数(MIC)[1]，承诺同时发现两个变量是否具有:A)任何关联，b)函数关系，c)非线性关系。MIC是对标准和等级相关度量的非常有用的补充。它分别规定了一种关系的存在及其确切的性质;因此，它可以在建模非功能和非线性关系时做出更明智的选择，并且可以更细致地指示标准和等级相关度量所报告的值的潜在问题。我们使用各种软件工程度量来说明MIC的使用。我们研究和解释了软件工程数据中MIC和相关度量的分布特性，并说明了这些度量对经验软件工程研究者的价值。

{"title":"MIC check: A correlation tactic for ESE data","authors":"Daryl Posnett, Premkumar T. Devanbu, V. Filkov","doi":"10.1109/MSR.2012.6224295","DOIUrl":"https://doi.org/10.1109/MSR.2012.6224295","url":null,"abstract":"Empirical software engineering researchers are concerned with understanding the relationships between outcomes of interest, e.g. defects, and process and product measures. The use of correlations to uncover strong relationships is a natural precursor to multivariate modeling. Unfortunately, correlation coefficients can be difficult and/or misleading to interpret. For example, a strong correlation occurs between variables that stand in a polynomial relationship; this may lead one mistakenly, and eventually misleadingly, to model a polynomially related variable in a linear regression. Likewise, a non-monotonic functional, or even non-functional relationship might be entirely missed by a correlation coefficient. Outliers can influence standard correlation measures, tied values can unduly influence even robust non-parametric rank correlation, measures, and smaller sample sizes can cause instability in correlation measures. A new bivariate measure of association, Maximal Information Coefficient (MIC) [1], promises to simultaneously discover if two variables have: a) any association, b) a functional relationship, and c) a nonlinear relationship. The MIC is a very useful complement to standard and rank correlation measures. It separately characterizes the existence of a relationship and its precise nature; thus, it enables more informed choices in modeling non-functional and nonlinear relationships, and a more nuanced indicator of potential problems with the values reported by standard and rank correlation measures. We illustrate the use of MIC using a variety of software engineering metrics. We study and explain the distributional properties of MIC and related measures in software engineering data, and illustrate the value of these measures for the empirical software engineering researcher.","PeriodicalId":383774,"journal":{"name":"2012 9th IEEE Working Conference on Mining Software Repositories (MSR)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115430151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Issue handling performance in proprietary software projects 在专有软件项目中处理问题的表现

2012 9th IEEE Working Conference on Mining Software Repositories (MSR)

Pub Date : 2012-06-01 DOI: 10.1109/MSR.2012.6224282

Aigerim Issabayeva, Ariadi Nugroho, Joost Visser

Software maintenance tasks are mainly related to fixing defects and implementing new features. Higher efficiency in performing such tasks is therefore going to reduce the costs of maintenance. A previous study involving open source systems has shown that higher software maintainability corresponds to faster speed in fixing defects [1]. In this paper we replicate the previous study by mining bug report data of three proprietary software projects. In one of the projects, a correlation between higher software maintainability and faster defect resolution is confirmed. The quality of issue handling process (e.g., issue registration accuracy and completeness, scope and complexity of issue workflow) should be considered in further research as it might explain the circumstances under which the correlation can be observed.

软件维护任务主要与修复缺陷和实现新特性有关。因此，执行这些任务的更高效率将降低维护成本。先前一项涉及开源系统的研究表明，软件的可维护性越高，修复缺陷的速度就越快。在本文中，我们通过挖掘三个专有软件项目的bug报告数据来复制先前的研究。在其中一个项目中，更高的软件可维护性和更快的缺陷解决之间的相关性得到了证实。问题处理过程的质量(例如，问题登记的准确性和完整性，问题工作流程的范围和复杂性)应该在进一步的研究中考虑，因为它可能解释在什么情况下可以观察到相关性。

引用次数: 1

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2012 9th IEEE Working Conference on Mining Software Repositories (MSR)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀