Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement最新文献

DevOps Practitioners’ Perceptions of the Low-code Trend 开发运维从业者对低代码趋势的看法

Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

Pub Date : 2022-09-19 DOI: 10.1145/3544902.3546635

S. Rafi, M. Akbar, Mary-Luz Sánchez-Gordón, Ricardo Colomo Palacios

Background: DevOps is currently one of the main trends in software development. Low-Code is also an emerging tendency that, combined with DevOps, may offer significant value to software businesses by improving the process. However, how DevOps practices and low-code are combined is little known. Aim: This study aims to understand the practitioner's perspectives on low-code trends. Method: Twelve interviews with IT professionals who deal with low-code in the context of DevOps were conducted. Then, a grounded theory approach was used to theme the interview quotes into emergent categories. Results: The main result of this exploratory study reveals that such an approach is the most common response to the skill shortages of software professionals. Conclusion: This study suggests the emergence of DevOps and low-code could significantly contribute to the development of quality products with low-cost and time.

背景:DevOps是当前软件开发的主要趋势之一。低代码也是一种新兴趋势，与DevOps相结合，可以通过改进流程为软件业务提供重要价值。然而，DevOps实践和低代码是如何结合在一起的却鲜为人知。目的:本研究旨在了解从业者对低码趋势的看法。方法:对DevOps背景下处理低代码的IT专业人员进行了12次访谈。然后，采用扎根理论的方法对访谈引语进行紧急分类。结果:这项探索性研究的主要结果表明，这种方法是对软件专业人员技能短缺的最常见的反应。结论:本研究表明，DevOps和low-code的出现可以显著促进低成本和时间的高质量产品的开发。

引用次数: 4

What Soft Skills Does the Software Industry *Really* Want? An Exploratory Study of Software Positions in New Zealand 软件行业真正需要的软技能是什么?新西兰软件行业职位的探索性研究

Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

Pub Date : 2022-09-19 DOI: 10.1145/3544902.3546247

M. Galster, A. Mitrovic, S. Malinen, Jay Holland

Background: Soft skills of software professionals (e.g., communication, interpersonal skills) significantly contribute to project and product success. Aims: We aim to understand (a) what are relevant soft skills in software engineering, (b) how soft skills relate to types of software engineering positions, and (c) how soft skills relate to characteristics of hiring organizations. We focus on organizations in New Zealand, a country with a relatively small but growing software sector characterized by a skills shortage and embedded in a bi-cultural context. Method: We used a qualitative research method and manually analyzed 530 job adverts from New Zealand’s largest job portal for technology-related positions. We identified soft skills following an inductive approach, i.e., without a pre-defined set of soft skills. Results: We found explicit references to soft skills in 82% of adverts. We identified 17 soft skills and proposed a contextualized software engineering description. Communication-related soft skills are most in demand, regardless of the type of position. Soft skills related to broader human or societal values (e.g., empathy or cultural awareness) or distributed development are not frequently requested. Soft skills do not depend on company size or core business. Conclusions: Employers explicitly ask for soft skills. Our findings support previous studies that highlight the importance of communication. Characteristics specific to New Zealand do not impact the demand for soft skills. Our findings benefit researchers in human aspects of software engineering and to those responsible for staff, curricula and professional development.

背景:软件专业人员的软技能(如沟通、人际交往能力)对项目和产品的成功有重要贡献。目的:我们的目标是理解(a)什么是软件工程中相关的软技能，(b)软技能与软件工程职位类型的关系，以及(c)软技能与招聘组织的特征的关系。我们关注的是新西兰的组织，这个国家的软件行业规模相对较小，但不断增长，其特点是技能短缺，并植根于双文化背景。方法:采用定性研究方法，对新西兰最大的招聘门户网站上的530份招聘广告进行人工分析。我们按照归纳的方法确定软技能，也就是说，没有预先定义的软技能集。结果:我们发现在82%的广告中明确提到了软技能。我们确定了17种软技能，并提出了一种情境化的软件工程描述。无论职位类型如何，与沟通相关的软技能都是最需要的。与更广泛的人类或社会价值(例如，移情或文化意识)或分布式开发相关的软技能并不经常被要求。软技能与公司规模或核心业务无关。结论:雇主明确要求应聘者具备软技能。我们的发现支持了之前强调沟通重要性的研究。新西兰特有的特点并不影响对软技能的需求。我们的发现有利于软件工程的人类方面的研究人员，以及那些负责员工、课程和专业发展的人。

{"title":"What Soft Skills Does the Software Industry *Really* Want? An Exploratory Study of Software Positions in New Zealand","authors":"M. Galster, A. Mitrovic, S. Malinen, Jay Holland","doi":"10.1145/3544902.3546247","DOIUrl":"https://doi.org/10.1145/3544902.3546247","url":null,"abstract":"Background: Soft skills of software professionals (e.g., communication, interpersonal skills) significantly contribute to project and product success. Aims: We aim to understand (a) what are relevant soft skills in software engineering, (b) how soft skills relate to types of software engineering positions, and (c) how soft skills relate to characteristics of hiring organizations. We focus on organizations in New Zealand, a country with a relatively small but growing software sector characterized by a skills shortage and embedded in a bi-cultural context. Method: We used a qualitative research method and manually analyzed 530 job adverts from New Zealand’s largest job portal for technology-related positions. We identified soft skills following an inductive approach, i.e., without a pre-defined set of soft skills. Results: We found explicit references to soft skills in 82% of adverts. We identified 17 soft skills and proposed a contextualized software engineering description. Communication-related soft skills are most in demand, regardless of the type of position. Soft skills related to broader human or societal values (e.g., empathy or cultural awareness) or distributed development are not frequently requested. Soft skills do not depend on company size or core business. Conclusions: Employers explicitly ask for soft skills. Our findings support previous studies that highlight the importance of communication. Characteristics specific to New Zealand do not impact the demand for soft skills. Our findings benefit researchers in human aspects of software engineering and to those responsible for staff, curricula and professional development.","PeriodicalId":220679,"journal":{"name":"Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127819088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Towards Demystifying the Impact of Dependency Structures on Bug Locations in Deep Learning Libraries 揭示深度学习库中依赖结构对Bug位置的影响

Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

Pub Date : 2022-09-19 DOI: 10.1145/3544902.3546246

Di Cui, Xingyu Li, Feiyang Liu, Siqi Wang, Jie Dai, Lu Wang, Qingshan Li

Background: Many safety-critical industrial applications have turned to deep learning systems as a fundamental component. Most of these systems rely on deep learning libraries, and bugs of such libraries can have irreparable consequences. Aims: Over the years, dependency structure has shown to be a practical indicator of software quality, widely used in numerous bug prediction techniques. The problem is that when analyzing bugs in deep learning libraries, researchers are unclear whether dependency structures still have a high correlation and which forms of dependency structures perform the best. Method: In this paper, we present a systematic investigation of the above question and implement a dependency structure-centric bug analysis tool: Depend4BL, capturing the interaction between dependency structures and bug locations in deep learning libraries. Results: We employ Depend4BL to analyze the top 5 open-source deep learning libraries on Github in terms of stars and forks, with 279,788 revision commits and 8,715 bug fixes. The results demonstrate the significant differences among syntactic, history, and semantic structures, and their vastly different impacts on bug locations. Their combinations have the potential to further improve bug prediction for deep learning libraries. Conclusions: In summary, our work provides a new perspective regarding to the correlation between dependency structures and bug locations in deep learning libraries. We release a large set of benchmarks and a prototype toolkit to automatically detect various forms of dependency structures for deep learning libraries. Our study also unveils useful findings based on quantitative and qualitative analysis that benefit bug prediction techniques for deep learning libraries.

背景:许多安全关键型工业应用已经转向深度学习系统作为基本组件。这些系统大多依赖于深度学习库，而这些库的错误可能会造成不可挽回的后果。目的:多年来，依赖关系结构已被证明是软件质量的实用指标，广泛用于许多错误预测技术中。问题在于，在分析深度学习库中的bug时，研究人员并不清楚依赖结构是否仍然具有高相关性，以及哪种形式的依赖结构表现最好。方法:在本文中，我们对上述问题进行了系统的研究，并实现了一个以依赖结构为中心的bug分析工具:Depend4BL，用于捕获深度学习库中依赖结构和bug位置之间的交互。结果:我们使用Depend4BL对Github上排名前5的开源深度学习库进行了星辰和分叉分析，共提交了279,788次修订，修复了8,715个bug。结果显示了语法、历史和语义结构之间的显著差异，以及它们对bug位置的巨大不同影响。它们的组合有可能进一步改善深度学习库的错误预测。结论:总之，我们的工作为深度学习库中依赖结构和bug位置之间的相关性提供了一个新的视角。我们发布了大量的基准测试和原型工具包，用于自动检测深度学习库的各种形式的依赖结构。我们的研究还揭示了基于定量和定性分析的有用发现，这些发现有利于深度学习库的bug预测技术。

{"title":"Towards Demystifying the Impact of Dependency Structures on Bug Locations in Deep Learning Libraries","authors":"Di Cui, Xingyu Li, Feiyang Liu, Siqi Wang, Jie Dai, Lu Wang, Qingshan Li","doi":"10.1145/3544902.3546246","DOIUrl":"https://doi.org/10.1145/3544902.3546246","url":null,"abstract":"Background: Many safety-critical industrial applications have turned to deep learning systems as a fundamental component. Most of these systems rely on deep learning libraries, and bugs of such libraries can have irreparable consequences. Aims: Over the years, dependency structure has shown to be a practical indicator of software quality, widely used in numerous bug prediction techniques. The problem is that when analyzing bugs in deep learning libraries, researchers are unclear whether dependency structures still have a high correlation and which forms of dependency structures perform the best. Method: In this paper, we present a systematic investigation of the above question and implement a dependency structure-centric bug analysis tool: Depend4BL, capturing the interaction between dependency structures and bug locations in deep learning libraries. Results: We employ Depend4BL to analyze the top 5 open-source deep learning libraries on Github in terms of stars and forks, with 279,788 revision commits and 8,715 bug fixes. The results demonstrate the significant differences among syntactic, history, and semantic structures, and their vastly different impacts on bug locations. Their combinations have the potential to further improve bug prediction for deep learning libraries. Conclusions: In summary, our work provides a new perspective regarding to the correlation between dependency structures and bug locations in deep learning libraries. We release a large set of benchmarks and a prototype toolkit to automatically detect various forms of dependency structures for deep learning libraries. Our study also unveils useful findings based on quantitative and qualitative analysis that benefit bug prediction techniques for deep learning libraries.","PeriodicalId":220679,"journal":{"name":"Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126398868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MMF3: Neural Code Summarization Based on Multi-Modal Fine-Grained Feature Fusion 基于多模态细粒度特征融合的神经代码摘要

Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

Pub Date : 2022-09-19 DOI: 10.1145/3544902.3546251

Zheng Ma, Yuexiu Gao, Lei Lyu, Chen Lyu

Background: Code summarization automatically generates the corresponding natural language descriptions according to the input code to characterize the function implemented by source code. Comprehensiveness of code representation is critical to code summarization task. However, most existing approaches typically use coarse-grained fusion methods to integrate multi-modal features. They generally represent different modalities of a piece of code, such as an Abstract Syntax Tree (AST) and a token sequence, as two embeddings and then fuse the two ones at the AST/code levels. Such a coarse integration makes it difficult to learn the correlations between fine-grained code elements across modalities effectively. Aims: This study intends to improve the model’s prediction performance for high-quality code summarization by accurately aligning and fully fusing semantic and syntactic structure information of source code at node/token levels. Method: This paper proposes a Multi-Modal Fine-grained Feature Fusion approach (MMF3) for neural code summarization. The method uses the Transformer architecture. In particular, we introduce a novel fine-grained fusion method, which allows fine-grained fusion of multiple code modalities at the token and node levels. Specifically, we use this method to fuse information from both token and AST modalities and apply the fused features to code summarization. Results: We conduct experiments on one Java and one Python datasets, and evaluate generated summaries using four metrics. The results show that: 1) the performance of our model outperforms the current state-of-the-art models, and 2) the ablation experiments show that our proposed fine-grained fusion method can effectively improve the accuracy of generated summaries. Conclusion: MMF3 can mine the relationships between cross-modal elements and perform accurate fine-grained element-level alignment fusion accordingly. As a result, more clues can be provided to improve the accuracy of the generated code summaries.

背景:代码摘要根据输入的代码自动生成相应的自然语言描述，对源代码实现的功能进行表征。代码表示的全面性是完成代码摘要任务的关键。然而，大多数现有方法通常使用粗粒度融合方法来集成多模态特征。它们通常表示一段代码的不同形式，例如抽象语法树(AST)和标记序列，作为两个嵌入，然后在AST/代码级别融合这两个嵌入。这种粗糙的集成使得很难有效地学习跨模式的细粒度代码元素之间的相关性。目的:本研究旨在通过在节点/令牌级别准确对齐和充分融合源代码的语义和句法结构信息，提高模型对高质量代码摘要的预测性能。方法:提出一种多模态细粒度特征融合方法(MMF3)用于神经编码摘要。该方法使用Transformer体系结构。特别是，我们引入了一种新的细粒度融合方法，该方法允许在令牌和节点级别对多个代码模式进行细粒度融合。具体来说，我们使用这种方法来融合来自令牌和AST模式的信息，并将融合的特征应用于代码摘要。结果:我们在一个Java和一个Python数据集上进行实验，并使用四个指标评估生成的摘要。结果表明:1)我们的模型性能优于目前最先进的模型;2)烧蚀实验表明，我们提出的细粒度融合方法可以有效提高生成摘要的准确性。结论:MMF3可以挖掘跨模态元素之间的关系，并相应地进行精确的细粒度元素级对齐融合。因此，可以提供更多的线索来提高生成的代码摘要的准确性。

{"title":"MMF3: Neural Code Summarization Based on Multi-Modal Fine-Grained Feature Fusion","authors":"Zheng Ma, Yuexiu Gao, Lei Lyu, Chen Lyu","doi":"10.1145/3544902.3546251","DOIUrl":"https://doi.org/10.1145/3544902.3546251","url":null,"abstract":"Background: Code summarization automatically generates the corresponding natural language descriptions according to the input code to characterize the function implemented by source code. Comprehensiveness of code representation is critical to code summarization task. However, most existing approaches typically use coarse-grained fusion methods to integrate multi-modal features. They generally represent different modalities of a piece of code, such as an Abstract Syntax Tree (AST) and a token sequence, as two embeddings and then fuse the two ones at the AST/code levels. Such a coarse integration makes it difficult to learn the correlations between fine-grained code elements across modalities effectively. Aims: This study intends to improve the model’s prediction performance for high-quality code summarization by accurately aligning and fully fusing semantic and syntactic structure information of source code at node/token levels. Method: This paper proposes a Multi-Modal Fine-grained Feature Fusion approach (MMF3) for neural code summarization. The method uses the Transformer architecture. In particular, we introduce a novel fine-grained fusion method, which allows fine-grained fusion of multiple code modalities at the token and node levels. Specifically, we use this method to fuse information from both token and AST modalities and apply the fused features to code summarization. Results: We conduct experiments on one Java and one Python datasets, and evaluate generated summaries using four metrics. The results show that: 1) the performance of our model outperforms the current state-of-the-art models, and 2) the ablation experiments show that our proposed fine-grained fusion method can effectively improve the accuracy of generated summaries. Conclusion: MMF3 can mine the relationships between cross-modal elements and perform accurate fine-grained element-level alignment fusion accordingly. As a result, more clues can be provided to improve the accuracy of the generated code summaries.","PeriodicalId":220679,"journal":{"name":"Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132926873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Asking about Technical Debt: Characteristics and Automatic Identification of Technical Debt Questions on Stack Overflow 询问技术债务:栈溢出技术债务问题的特征与自动识别

Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

Pub Date : 2022-09-19 DOI: 10.1145/3544902.3546245

Nicholas Kozanidis, R. Verdecchia, Emitzá Guzmán

Background: Q&A sites allow to study how users reference and request support on technical debt. To date only few studies, focusing on narrow aspects, investigate technical debt on Stack Overflow. Aims: We aim at gaining an in-depth understanding on the characteristics of technical debt questions on Stack Overflow. In addition, we assess if identification strategies based on machine learning can be used to automatically identify and classify technical debt questions. Method: We use automated and manual processes to identify technical debt questions on Stack Overflow. The final set of 415 questions is analyzed to study (i) technical debt types, (ii) question length, (iii) perceived urgency, (iv) sentiment, and (v) themes. Natural language processing and machine learning techniques are used to assess if questions can be identified and classified automatically. Results: Architecture debt is the most recurring debt type, followed by code and design debt. Most questions display mild urgency, with frequency of higher urgency steadily declining as urgency rises. Question length varies across debt types. Sentiment is mostly neutral. 29 recurrent themes emerge. Machine learning can be used to identify technical debt questions and binary urgency, but not debt types. Conclusions: Different patterns emerge from the analysis of technical debt questions on Stack Overflow. The results provide further insights on the phenomenon, and support the adoption of a more comprehensive strategy to identify technical debt questions.

背景:问答网站允许研究用户如何参考和请求技术债务支持。迄今为止，只有少数研究集中在狭窄的方面，调查了堆栈溢出的技术债务。目的:我们的目标是深入了解堆栈溢出的技术债务问题的特征。此外，我们评估了基于机器学习的识别策略是否可用于自动识别和分类技术债务问题。方法:我们使用自动化和手动流程来识别Stack Overflow上的技术债务问题。最后一组415个问题进行分析，以研究(i)技术债务类型，(ii)问题长度，(iii)感知紧迫性，(iv)情绪和(v)主题。使用自然语言处理和机器学习技术来评估问题是否可以自动识别和分类。结果:架构债是最常见的债类型，其次是代码和设计债。大多数问题表现出轻微的紧迫性，随着紧迫性的增加，较高紧迫性的出现频率稳步下降。问题的长度因债务类型而异。市场情绪基本中性。出现了29个反复出现的主题。机器学习可以用来识别技术债务问题和二元紧迫性，但不能识别债务类型。结论:通过对Stack Overflow的技术债务问题的分析，出现了不同的模式。结果提供了对该现象的进一步见解，并支持采用更全面的策略来识别技术债务问题。

{"title":"Asking about Technical Debt: Characteristics and Automatic Identification of Technical Debt Questions on Stack Overflow","authors":"Nicholas Kozanidis, R. Verdecchia, Emitzá Guzmán","doi":"10.1145/3544902.3546245","DOIUrl":"https://doi.org/10.1145/3544902.3546245","url":null,"abstract":"Background: Q&A sites allow to study how users reference and request support on technical debt. To date only few studies, focusing on narrow aspects, investigate technical debt on Stack Overflow. Aims: We aim at gaining an in-depth understanding on the characteristics of technical debt questions on Stack Overflow. In addition, we assess if identification strategies based on machine learning can be used to automatically identify and classify technical debt questions. Method: We use automated and manual processes to identify technical debt questions on Stack Overflow. The final set of 415 questions is analyzed to study (i) technical debt types, (ii) question length, (iii) perceived urgency, (iv) sentiment, and (v) themes. Natural language processing and machine learning techniques are used to assess if questions can be identified and classified automatically. Results: Architecture debt is the most recurring debt type, followed by code and design debt. Most questions display mild urgency, with frequency of higher urgency steadily declining as urgency rises. Question length varies across debt types. Sentiment is mostly neutral. 29 recurrent themes emerge. Machine learning can be used to identify technical debt questions and binary urgency, but not debt types. Conclusions: Different patterns emerge from the analysis of technical debt questions on Stack Overflow. The results provide further insights on the phenomenon, and support the adoption of a more comprehensive strategy to identify technical debt questions.","PeriodicalId":220679,"journal":{"name":"Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126273235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

PG-VulNet: Detect Supply Chain Vulnerabilities in IoT Devices using Pseudo-code and Graphs PG-VulNet:使用伪代码和图形检测物联网设备中的供应链漏洞

Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

Pub Date : 2022-09-19 DOI: 10.1145/3544902.3546240

Xin Liu, Yixiong Wu, Qingchen Yu, Shangru Song, Yue Liu, Qingguo Zhou, Jianwei Zhuge

Background: With the boosting development of IoT technology, the supply chains of IoT devices become more powerful and sophisticated, and the security issues introduced by code reuse are becoming more prominent. Therefore, the detection and management of vulnerabilities through code similarity detection technology is of great significance for protecting the security of IoT devices. Aim: We aim to propose a more accurate, parallel-friendly, and realistic software supply chain vulnerability detection solution for IoT devices. Method: This paper presents PG-VulNet, standing for Vulnerability-detection Network based on Pseudo-code Graphs. It is a ”multi-model” cross-architecture vulnerability detection solution based on pseudo-code and Graph Matching Network (GMN). PG-VulNet extracts both behavioral and structural features of pseudo-code to build customized feature graphs and then uses GMN to detect supply chain vulnerabilities based on these graphs. Results: The experiments show that PG-VulNet achieves an average detection accuracy of 99.14%, significantly higher than existing approaches like Gemini, VulSeeker, FIT, and Asteria. In addition to this, PG-VulNet also excels in detection overhead and false alarms. In the real-world evaluation, PG-VulNet detected 690 known vulnerabilities in 1,611 firmwares. Conclusions: PG-VulNet can effectively detect the vulnerabilities introduced by software supply chain in IoT firmwares and is well suited for large-scale detection. Compared with existing approaches, PG-VulNet has significant advantages.

背景:随着物联网技术的飞速发展，物联网设备的供应链变得越来越强大和复杂，代码重用带来的安全问题也越来越突出。因此，通过代码相似度检测技术对漏洞进行检测和管理，对于保护物联网设备的安全具有重要意义。目标:我们的目标是为物联网设备提供一个更准确、并行友好、更现实的软件供应链漏洞检测解决方案。方法:本文提出了基于伪码图的漏洞检测网络PG-VulNet。它是一种基于伪码和图形匹配网络(GMN)的“多模型”跨架构漏洞检测方案。PG-VulNet同时提取伪代码的行为特征和结构特征，构建自定义特征图，然后利用GMN基于这些特征图检测供应链漏洞。结果:实验表明，PG-VulNet平均检测准确率达到99.14%，显著高于Gemini、VulSeeker、FIT、Asteria等现有方法。除此之外，PG-VulNet在检测开销和假警报方面也很出色。在实际评估中，PG-VulNet在1,611个固件中检测到690个已知漏洞。结论:PG-VulNet能够有效检测物联网固件中软件供应链引入的漏洞，适合大规模检测。与现有的方法相比，PG-VulNet具有明显的优势。

{"title":"PG-VulNet: Detect Supply Chain Vulnerabilities in IoT Devices using Pseudo-code and Graphs","authors":"Xin Liu, Yixiong Wu, Qingchen Yu, Shangru Song, Yue Liu, Qingguo Zhou, Jianwei Zhuge","doi":"10.1145/3544902.3546240","DOIUrl":"https://doi.org/10.1145/3544902.3546240","url":null,"abstract":"Background: With the boosting development of IoT technology, the supply chains of IoT devices become more powerful and sophisticated, and the security issues introduced by code reuse are becoming more prominent. Therefore, the detection and management of vulnerabilities through code similarity detection technology is of great significance for protecting the security of IoT devices. Aim: We aim to propose a more accurate, parallel-friendly, and realistic software supply chain vulnerability detection solution for IoT devices. Method: This paper presents PG-VulNet, standing for Vulnerability-detection Network based on Pseudo-code Graphs. It is a ”multi-model” cross-architecture vulnerability detection solution based on pseudo-code and Graph Matching Network (GMN). PG-VulNet extracts both behavioral and structural features of pseudo-code to build customized feature graphs and then uses GMN to detect supply chain vulnerabilities based on these graphs. Results: The experiments show that PG-VulNet achieves an average detection accuracy of 99.14%, significantly higher than existing approaches like Gemini, VulSeeker, FIT, and Asteria. In addition to this, PG-VulNet also excels in detection overhead and false alarms. In the real-world evaluation, PG-VulNet detected 690 known vulnerabilities in 1,611 firmwares. Conclusions: PG-VulNet can effectively detect the vulnerabilities introduced by software supply chain in IoT firmwares and is well suited for large-scale detection. Compared with existing approaches, PG-VulNet has significant advantages.","PeriodicalId":220679,"journal":{"name":"Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116525652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

In the Zone: An Analysis of the Music Practices of Remote Software Developers 在区域:远程软件开发人员的音乐实践分析

Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

Pub Date : 2022-09-19 DOI: 10.1145/3544902.3546638

Makayla Moster, Aarav Chandra, Chris W. L. Chu, Weiyi Liu, Paige Rodeghero

Background: Listening to music is a common practice among software developers. Music listening after work can help release work-related stress; while listening to music at work can improve work efficiency and make tedious work more enjoyable. The working environment of developers in the past few years has changed dramatically due to the vast adoption of remote and hybrid work policies. Aims: We aim to understand how listening to music at work affects remote developers’ perceived productivity and creativity. Method: We investigated 130 software developers and collected their music listening habits during remote work in a questionnaire. We studied the impact of listening to music on developers’ creativity and productivity while working remotely during the COVID-19 pandemic. Results: Our survey data suggests that developers generally feel more productive and creative when listening to music during remote working conditions. Conclusion: We found that developers who listen to music feel more productive and creative while working remotely due to reducing environment distractions.

背景:听音乐是软件开发人员的一种常见做法。下班后听音乐可以帮助释放工作压力;而在工作中听音乐可以提高工作效率，让繁琐的工作变得更愉快。由于远程和混合工作策略的广泛采用，开发人员的工作环境在过去几年中发生了巨大的变化。目标:我们的目标是了解在工作中听音乐如何影响远程开发人员的生产力和创造力。方法:对130名软件开发人员进行问卷调查，收集其远程办公时的音乐收听习惯。我们研究了在COVID-19大流行期间远程工作时听音乐对开发人员创造力和生产力的影响。结果:我们的调查数据表明，开发人员在远程工作环境下听音乐时，通常会感到更有效率和更有创造力。结论:我们发现，听音乐的开发者在远程工作时效率更高、更有创造力，因为这样可以减少环境干扰。

{"title":"In the Zone: An Analysis of the Music Practices of Remote Software Developers","authors":"Makayla Moster, Aarav Chandra, Chris W. L. Chu, Weiyi Liu, Paige Rodeghero","doi":"10.1145/3544902.3546638","DOIUrl":"https://doi.org/10.1145/3544902.3546638","url":null,"abstract":"Background: Listening to music is a common practice among software developers. Music listening after work can help release work-related stress; while listening to music at work can improve work efficiency and make tedious work more enjoyable. The working environment of developers in the past few years has changed dramatically due to the vast adoption of remote and hybrid work policies. Aims: We aim to understand how listening to music at work affects remote developers’ perceived productivity and creativity. Method: We investigated 130 software developers and collected their music listening habits during remote work in a questionnaire. We studied the impact of listening to music on developers’ creativity and productivity while working remotely during the COVID-19 pandemic. Results: Our survey data suggests that developers generally feel more productive and creative when listening to music during remote working conditions. Conclusion: We found that developers who listen to music feel more productive and creative while working remotely due to reducing environment distractions.","PeriodicalId":220679,"journal":{"name":"Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129942911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

MEG: Multi-objective Ensemble Generation for Software Defect Prediction MEG:软件缺陷预测的多目标集成生成

Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

Pub Date : 2022-09-19 DOI: 10.1145/3544902.3546255

Rebecca Moussa, Giovani Guizzo, Federica Sarro

Background: Defect Prediction research aims at assisting software engineers in the early identification of software defect during the development process. A variety of automated approaches, ranging from traditional classification models to more sophisticated learning approaches, have been explored to this end. Among these, recent studies have proposed the use of ensemble prediction models (i.e., aggregation of multiple base classifiers) to build more robust defect prediction models. Aims: In this paper, we introduce a novel approach based on multi-objective evolutionary search to automatically generate defect prediction ensembles. Our proposal is not only novel with respect to the more general area of evolutionary generation of ensembles, but it also advances the state-of-the-art in the use of ensemble in defect prediction. Method: We assess the effectiveness of our approach, dubbed as Multi-objectiveEnsembleGeneration (MEG), by empirically benchmarking it with respect to the most related proposals we found in the literature on defect prediction ensembles and on multi-objective evolutionary ensembles (which, to the best of our knowledge, had never been previously applied to tackle defect prediction). Result: Our results show that MEG is able to generate ensembles which produce similar or more accurate predictions than those achieved by all the other approaches considered in 73% of the cases (with favourable large effect sizes in 80% of them). Conclusions: MEG is not only able to generate ensembles that yield more accurate defect predictions with respect to the benchmarks considered, but it also does it automatically, thus relieving the engineers from the burden of manual design and experimentation.

背景:缺陷预测研究的目的是帮助软件工程师在开发过程中早期识别软件缺陷。各种各样的自动化方法，从传统的分类模型到更复杂的学习方法，已经为此目的进行了探索。其中，最近的研究提出了使用集成预测模型(即多个基分类器的聚合)来构建更健壮的缺陷预测模型。目的:提出了一种基于多目标进化搜索的缺陷预测集成自动生成方法。我们的建议不仅在集成的进化生成的更一般的领域是新颖的，而且它也推进了集成在缺陷预测中使用的最新技术。方法:我们评估我们的方法的有效性，被称为多目标集成生成(MEG)，通过对我们在缺陷预测集成和多目标进化集成的文献中发现的最相关的建议进行经验基准测试(据我们所知，以前从未应用于处理缺陷预测)。结果:我们的结果表明，在73%的情况下，MEG能够生成与所有其他方法所获得的预测相似或更准确的集成(其中80%具有有利的大效应量)。结论:MEG不仅能够生成与所考虑的基准相关的更准确的缺陷预测的集合，而且它还能自动地完成，从而减轻工程师手工设计和实验的负担。

{"title":"MEG: Multi-objective Ensemble Generation for Software Defect Prediction","authors":"Rebecca Moussa, Giovani Guizzo, Federica Sarro","doi":"10.1145/3544902.3546255","DOIUrl":"https://doi.org/10.1145/3544902.3546255","url":null,"abstract":"Background: Defect Prediction research aims at assisting software engineers in the early identification of software defect during the development process. A variety of automated approaches, ranging from traditional classification models to more sophisticated learning approaches, have been explored to this end. Among these, recent studies have proposed the use of ensemble prediction models (i.e., aggregation of multiple base classifiers) to build more robust defect prediction models. Aims: In this paper, we introduce a novel approach based on multi-objective evolutionary search to automatically generate defect prediction ensembles. Our proposal is not only novel with respect to the more general area of evolutionary generation of ensembles, but it also advances the state-of-the-art in the use of ensemble in defect prediction. Method: We assess the effectiveness of our approach, dubbed as Multi-objectiveEnsembleGeneration (MEG), by empirically benchmarking it with respect to the most related proposals we found in the literature on defect prediction ensembles and on multi-objective evolutionary ensembles (which, to the best of our knowledge, had never been previously applied to tackle defect prediction). Result: Our results show that MEG is able to generate ensembles which produce similar or more accurate predictions than those achieved by all the other approaches considered in 73% of the cases (with favourable large effect sizes in 80% of them). Conclusions: MEG is not only able to generate ensembles that yield more accurate defect predictions with respect to the benchmarks considered, but it also does it automatically, thus relieving the engineers from the burden of manual design and experimentation.","PeriodicalId":220679,"journal":{"name":"Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126410706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

How to Choose a Task? Mismatches in Perspectives of Newcomers and Existing Contributors 如何选择任务?新来者和现有贡献者的观点不匹配

Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

Pub Date : 2022-09-19 DOI: 10.1145/3544902.3546236

F. Santos, Bianca Trinkenreich, J. F. Pimentel, I. Wiese, Igor Steinmacher, A. Sarma, M. Gerosa

[Background] Selecting an appropriate task is challenging for Open Source Software (OSS) project newcomers and a variety of strategies can help them in this process. [Aims] In this research, we compare the perspective of maintainers, newcomers, and existing contributors about the importance of strategies to support this process. Our goal is to identify possible gulfs of expectations between newcomers who are meant to be helped and contributors who have to put effort into these strategies, which can create friction and impede the usefulness of the strategies. [Method] We interviewed maintainers (n=17) and applied inductive qualitative analysis to derive a model of strategies meant to be adopted by newcomers and communities. Next, we sent a questionnaire (n=64) to maintainers, frequent contributors, and newcomers, asking them to rank these strategies based on their importance. We used the Schulze method to compare the different rankings from the different types of contributors. [Results] Maintainers and contributors diverged in their opinions about the relative importance of various strategies. The results suggest that newcomers want a better contribution process and more support to onboard, while maintainers expect to solve questions using the available communication channels. [Conclusions] The gaps in perspectives between newcomers and existing contributors create a gulf of expectation. OSS communities can leverage our results to prioritize the strategies considered the most important by newcomers.

【背景】对于开源软件(OSS)项目新人来说，选择一个合适的任务是一个挑战，在这个过程中有各种各样的策略可以帮助他们。[目的]在这项研究中，我们比较了维护者、新手和现有贡献者对支持该过程的策略的重要性的看法。我们的目标是找出在需要帮助的新来者和必须为这些策略付出努力的贡献者之间可能存在的期望鸿沟，这可能会产生摩擦并阻碍策略的有效性。[方法]我们采访了维护者(n=17)，并应用归纳定性分析来得出一个旨在为新手和社区采用的策略模型。接下来，我们向维护者、经常贡献者和新手发送了一份调查问卷(n=64)，要求他们根据重要性对这些策略进行排名。我们使用舒尔茨方法来比较不同类型贡献者的不同排名。[结果]维护者和贡献者对各种策略的相对重要性的看法存在分歧。结果表明，新手想要更好的贡献流程和更多的支持，而维护者则希望使用可用的沟通渠道来解决问题。【结论】新来者和现有贡献者之间的观点差距造成了期望的鸿沟。OSS社区可以利用我们的结果来优先考虑新来者认为最重要的策略。

{"title":"How to Choose a Task? Mismatches in Perspectives of Newcomers and Existing Contributors","authors":"F. Santos, Bianca Trinkenreich, J. F. Pimentel, I. Wiese, Igor Steinmacher, A. Sarma, M. Gerosa","doi":"10.1145/3544902.3546236","DOIUrl":"https://doi.org/10.1145/3544902.3546236","url":null,"abstract":"[Background] Selecting an appropriate task is challenging for Open Source Software (OSS) project newcomers and a variety of strategies can help them in this process. [Aims] In this research, we compare the perspective of maintainers, newcomers, and existing contributors about the importance of strategies to support this process. Our goal is to identify possible gulfs of expectations between newcomers who are meant to be helped and contributors who have to put effort into these strategies, which can create friction and impede the usefulness of the strategies. [Method] We interviewed maintainers (n=17) and applied inductive qualitative analysis to derive a model of strategies meant to be adopted by newcomers and communities. Next, we sent a questionnaire (n=64) to maintainers, frequent contributors, and newcomers, asking them to rank these strategies based on their importance. We used the Schulze method to compare the different rankings from the different types of contributors. [Results] Maintainers and contributors diverged in their opinions about the relative importance of various strategies. The results suggest that newcomers want a better contribution process and more support to onboard, while maintainers expect to solve questions using the available communication channels. [Conclusions] The gaps in perspectives between newcomers and existing contributors create a gulf of expectation. OSS communities can leverage our results to prioritize the strategies considered the most important by newcomers.","PeriodicalId":220679,"journal":{"name":"Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement","volume":"234 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131667354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

A Tale of Two Tasks: Automated Issue Priority Prediction with Deep Multi-task Learning 两个任务的故事:基于深度多任务学习的自动问题优先级预测

Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

Pub Date : 2022-09-19 DOI: 10.1145/3544902.3546257

Yingling Li, Xing Che, Yuekai Huang, Junjie Wang, Song Wang, Yawen Wang, Qing Wang

Background. Issues are prevalent, and identifying the correct priority of the reported issues is crucial to reduce the maintenance effort and ensure higher software quality. There are several approaches for the automatic priority prediction, yet they do not fully utilize the related information that might influence the priority assignment. Our observation reveals that there are noticeable correlations between an issue’s priority and its category, e.g., an issue of bug category tends to be assigned with higher priority than an issue of document category. This correlation motivates us to employ multi-task learning to share the knowledge about issue’s category prediction and facilitating priority prediction. Aims. This paper aims at providing an automatic approach for effective issue’s priority prediction, to reduce the burden of the project members and better manage the issues. Method. We propose issue priority prediction approach PRIMA with deep multi-task learning, which takes the issue category prediction as another task to facilitate the information sharing and learning. It consists of three main phases: 1) data preparation and augmentation phase, which allows data sharing beyond single task learning; 2) model construction phase, which designs shared layers to encode the semantics of textual descriptions, and task-specific layers to model two tasks in parallel; it also includes the indicative attributes to better capture an issue’s inherent meaning; 3) model training phase, which enables eavesdropping by shared loss function between two tasks. Results. Evaluations with four large-scale open-source projects show that PRIMA outperforms commonly-used and state-of-the-art baselines, with 32% -55% higher precision, and 28% - 56% higher recall. Compared with single task learning, the performance improvement reaches 18% in precision and 19% in recall. Results from our user study further prove its potential practical value. Conclusions. The proposed approach provides a novel and effective way for issue priority prediction, and sheds light on jointly exploring other issue-management tasks.

背景。问题是普遍存在的，确定报告问题的正确优先级对于减少维护工作和确保更高的软件质量至关重要。自动优先级预测有几种方法，但它们都没有充分利用可能影响优先级分配的相关信息。我们的观察显示，问题的优先级与其类别之间存在明显的相关性，例如，bug类别的问题往往比文档类别的问题具有更高的优先级。这种相关性促使我们采用多任务学习来共享问题的类别预测知识，促进优先级预测。目标本文旨在为有效的问题优先级预测提供一种自动化的方法，以减轻项目成员的负担，更好地管理问题。方法。我们提出了具有深度多任务学习的问题优先级预测方法PRIMA，该方法将问题类别预测作为另一项任务，便于信息共享和学习。它包括三个主要阶段:1)数据准备和增强阶段，允许数据共享超越单任务学习;2)模型构建阶段，设计共享层对文本描述的语义进行编码，设计任务特定层对两个任务并行建模;它还包括指示性属性，以更好地捕捉问题的内在含义;3)模型训练阶段，通过两个任务之间的共享损失函数实现窃听。结果。对四个大型开源项目的评估表明，PRIMA优于常用的和最先进的基线，精度提高32% -55%，召回率提高28% - 56%。与单任务学习相比，准确率提高了18%，召回率提高了19%。用户研究结果进一步证明了其潜在的实用价值。结论。该方法为问题优先级预测提供了一种新颖有效的方法，并为共同探索其他问题管理任务提供了思路。

{"title":"A Tale of Two Tasks: Automated Issue Priority Prediction with Deep Multi-task Learning","authors":"Yingling Li, Xing Che, Yuekai Huang, Junjie Wang, Song Wang, Yawen Wang, Qing Wang","doi":"10.1145/3544902.3546257","DOIUrl":"https://doi.org/10.1145/3544902.3546257","url":null,"abstract":"Background. Issues are prevalent, and identifying the correct priority of the reported issues is crucial to reduce the maintenance effort and ensure higher software quality. There are several approaches for the automatic priority prediction, yet they do not fully utilize the related information that might influence the priority assignment. Our observation reveals that there are noticeable correlations between an issue’s priority and its category, e.g., an issue of bug category tends to be assigned with higher priority than an issue of document category. This correlation motivates us to employ multi-task learning to share the knowledge about issue’s category prediction and facilitating priority prediction. Aims. This paper aims at providing an automatic approach for effective issue’s priority prediction, to reduce the burden of the project members and better manage the issues. Method. We propose issue priority prediction approach PRIMA with deep multi-task learning, which takes the issue category prediction as another task to facilitate the information sharing and learning. It consists of three main phases: 1) data preparation and augmentation phase, which allows data sharing beyond single task learning; 2) model construction phase, which designs shared layers to encode the semantics of textual descriptions, and task-specific layers to model two tasks in parallel; it also includes the indicative attributes to better capture an issue’s inherent meaning; 3) model training phase, which enables eavesdropping by shared loss function between two tasks. Results. Evaluations with four large-scale open-source projects show that PRIMA outperforms commonly-used and state-of-the-art baselines, with 32% -55% higher precision, and 28% - 56% higher recall. Compared with single task learning, the performance improvement reaches 18% in precision and 19% in recall. Results from our user study further prove its potential practical value. Conclusions. The proposed approach provides a novel and effective way for issue priority prediction, and sheds light on jointly exploring other issue-management tasks.","PeriodicalId":220679,"journal":{"name":"Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132146654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4