2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)最新文献

英文中文

On ML-Based Program Translation: Perils and Promises 基于机器学习的程序翻译:风险与希望

2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

Pub Date : 2023-02-21 DOI: 10.1109/ICSE-NIER58687.2023.00017

Aniketh Malyala, K. Zhou, Baishakhi Ray, Saikat Chakraborty

With the advent of new and advanced programming languages, it becomes imperative to migrate legacy software to new programming languages. Unsupervised Machine Learning-based Program Translation could play an essential role in such migration, even without a sufficiently sizeable reliable corpus of parallel source code. However, these translators are far from perfect due to their statistical nature. This work investigates unsupervised program translators and where and why they fail. With in-depth error analysis of such failures, we have identified that the cases where such translators fail follow a few particular patterns. With this insight, we develop a rule-based program mutation engine, which pre-processes the input code if the input follows specific patterns and post-process the output if the output follows certain patterns. We show that our code processing tool, in conjunction with the program translator, can form a hybrid program translator and significantly improve the state-of-the-art. In the future, we envision an end-to-end program translation tool where programming domain knowledge can be embedded into an ML-based translation pipeline using pre- and post-processing steps.

随着新的高级编程语言的出现，将遗留软件迁移到新的编程语言中变得势在必行。基于无监督机器学习的程序翻译可以在这种迁移中发挥重要作用，即使没有足够大的可靠的并行源代码语料库。然而，由于其统计性质，这些翻译还远远不够完美。这项工作调查了无人监督的程序翻译，以及它们失败的地方和原因。通过对此类错误的深入错误分析，我们发现此类翻译失败的情况遵循一些特定模式。有了这种见解，我们开发了一个基于规则的程序突变引擎，如果输入遵循特定模式，它就对输入代码进行预处理，如果输出遵循特定模式，它就对输出进行后处理。我们展示了我们的代码处理工具，与程序翻译器结合，可以形成一个混合程序翻译器，并显着提高了最先进的技术。在未来，我们设想一个端到端的程序翻译工具，其中编程领域的知识可以通过预处理和后处理步骤嵌入到基于ml的翻译管道中。

{"title":"On ML-Based Program Translation: Perils and Promises","authors":"Aniketh Malyala, K. Zhou, Baishakhi Ray, Saikat Chakraborty","doi":"10.1109/ICSE-NIER58687.2023.00017","DOIUrl":"https://doi.org/10.1109/ICSE-NIER58687.2023.00017","url":null,"abstract":"With the advent of new and advanced programming languages, it becomes imperative to migrate legacy software to new programming languages. Unsupervised Machine Learning-based Program Translation could play an essential role in such migration, even without a sufficiently sizeable reliable corpus of parallel source code. However, these translators are far from perfect due to their statistical nature. This work investigates unsupervised program translators and where and why they fail. With in-depth error analysis of such failures, we have identified that the cases where such translators fail follow a few particular patterns. With this insight, we develop a rule-based program mutation engine, which pre-processes the input code if the input follows specific patterns and post-process the output if the output follows certain patterns. We show that our code processing tool, in conjunction with the program translator, can form a hybrid program translator and significantly improve the state-of-the-art. In the future, we envision an end-to-end program translation tool where programming domain knowledge can be embedded into an ML-based translation pipeline using pre- and post-processing steps.","PeriodicalId":297025,"journal":{"name":"2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)","volume":"11 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120913260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Rapid Development of Compositional AI 快速发展的合成人工智能

2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

Pub Date : 2023-02-12 DOI: 10.1109/ICSE-NIER58687.2023.00020

Lee Martie, Jessie Rosenberg, Véronique Demers, Gaoyuan Zhang, Onkar Bhardwaj, John Henning, Aditya Prasad, Matt Stallone, Ja Young Lee, Lucy Yip, D. Adesina, Elahe Paikari, Oscar Resendiz, Sarah Shaw, David Cox

Compositional AI systems, which combine multiple artificial intelligence components together with other application components to solve a larger problem, have no known pattern of development and are often approached in a bespoke and ad hoc style. This makes development slower and harder to reuse for future applications. To support the full rapid development cycle of compositional AI applications, we have developed a novel framework called (Bee)* (written as a regular expression and pronounced as "beestar"). We illustrate how (Bee)* supports building integrated, scalable, and interactive compositional AI applications with a simplified developer experience.

组合AI系统，将多个人工智能组件与其他应用程序组件结合在一起以解决更大的问题，没有已知的开发模式，并且通常以定制和特别的风格进行处理。这使得开发速度变慢，并且难以在未来的应用程序中重用。为了支持合成AI应用程序的完整快速开发周期，我们开发了一个名为(Bee)*的新框架(以正则表达式形式编写，发音为“beestar”)。我们演示了(Bee)*如何支持构建集成的、可扩展的、交互式的合成人工智能应用程序，并简化了开发人员的体验。

引用次数: 0

Towards Human-Centred Crowd Computing: Software for Better Use of Computational Resources 面向以人为本的群体计算:更好地利用计算资源的软件

2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

Pub Date : 2023-02-11 DOI: 10.1109/ICSE-NIER58687.2023.00022

Niroshinie Fernando, Chetan Arora, S. Loke, L. Alam, S. L. Macchia, Helen Graesser

Internet-connected smart devices are increasing at an exponential rate. These powerful devices have created a yet-untapped pool of idle resources that can be utilised, among others, for processing data in resource-depleted environments. The idea of bringing together a pool of smart devices for "crowd computing" (CC) has been studied in the recent past from an infrastructural feasibility perspective. However, for the CC paradigm to be successful, numerous socio-technical and software engineering (SE), specifically the requirements engineering (RE)-related factors are at play and have not been investigated in the literature. In this paper, we motivate the SE-related aspects of CC and the ideas for implementing mobile apps required for CC scenarios. We present the results of a preliminary study on understanding the human aspects, incentives that motivate users, and CC app requirements, and present our future development plan in this relatively new field of research for SE applications.

联网智能设备正以指数级速度增长。这些功能强大的设备创建了一个尚未开发的空闲资源池，这些资源可以用于在资源耗尽的环境中处理数据。最近，从基础设施可行性的角度研究了将一堆智能设备聚集在一起进行“群体计算”(CC)的想法。然而，为了CC范式的成功，许多社会技术和软件工程(SE)，特别是需求工程(RE)相关的因素在起作用，并且没有在文献中进行调查。在本文中，我们激发了CC的se相关方面以及实现CC场景所需的移动应用程序的想法。我们介绍了一项关于理解人的方面、激励用户和CC应用程序需求的初步研究结果，并介绍了我们在这个相对较新的SE应用程序研究领域的未来发展计划。

引用次数: 0

Under the Bridge: Trolling and the Challenges of Recruiting Software Developers for Empirical Research Studies 桥下:为实证研究招聘软件开发人员的挑衅和挑战

2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

Pub Date : 2023-02-01 DOI: 10.1109/ICSE-NIER58687.2023.00016

E. Kokinda, Makayla Moster, Paige Rodeghero

Much of software engineering research focuses on tools, algorithms, and optimization of software. Recently, we, as a community, have come to acknowledge that there is a gap in meta-research and addressing the human-factors in software engineering research. Through meta research, we aim to deepen our understanding of online participant recruitment and human-subjects software engineering research. In this paper we motivate the need to consider the unique challenges that human studies pose in software engineering research. We present several challenges faced by our research team in several distinct research studies, how they affected research, and motivate how, as researchers, we can address these challenges. We present results from a pilot study and categorize issues faced into three broad categories including participant recruitment, community engagement, and data poisoning. We further discuss how we can address these challenges and outline the benefits a full-study could provide to the software engineering research community.

许多软件工程研究集中在工具、算法和软件优化上。最近，作为一个社区，我们已经认识到在元研究和处理软件工程研究中的人为因素方面存在差距。通过元研究，我们旨在加深我们对在线参与者招募和人类受试者软件工程研究的理解。在本文中，我们激发了考虑人类研究在软件工程研究中提出的独特挑战的需要。我们将介绍我们的研究团队在几个不同的研究中面临的几个挑战，它们如何影响研究，并激励我们作为研究人员如何应对这些挑战。我们介绍了一项试点研究的结果，并将面临的问题分为三大类，包括参与者招募、社区参与和数据中毒。我们进一步讨论了如何应对这些挑战，并概述了全面研究可以为软件工程研究社区提供的好处。

{"title":"Under the Bridge: Trolling and the Challenges of Recruiting Software Developers for Empirical Research Studies","authors":"E. Kokinda, Makayla Moster, Paige Rodeghero","doi":"10.1109/ICSE-NIER58687.2023.00016","DOIUrl":"https://doi.org/10.1109/ICSE-NIER58687.2023.00016","url":null,"abstract":"Much of software engineering research focuses on tools, algorithms, and optimization of software. Recently, we, as a community, have come to acknowledge that there is a gap in meta-research and addressing the human-factors in software engineering research. Through meta research, we aim to deepen our understanding of online participant recruitment and human-subjects software engineering research. In this paper we motivate the need to consider the unique challenges that human studies pose in software engineering research. We present several challenges faced by our research team in several distinct research studies, how they affected research, and motivate how, as researchers, we can address these challenges. We present results from a pilot study and categorize issues faced into three broad categories including participant recruitment, community engagement, and data poisoning. We further discuss how we can address these challenges and outline the benefits a full-study could provide to the software engineering research community.","PeriodicalId":297025,"journal":{"name":"2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121151798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

The Risk-Taking Software Engineer: A Framed Portrait 勇于冒险的软件工程师:一幅被裱起来的肖像

2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

Pub Date : 2023-01-21 DOI: 10.1109/ICSE-NIER58687.2023.00011

Lorenz Graf‐Vlachy

Background: Risk-taking is prevalent in a host of activities performed by software engineers on a daily basis, yet there is scant research on it. Aims and Method: We study if software engineers’ risk-taking is affected by framing effects and by software engineers’ personality. To this end, we perform a survey experiment with 124 software engineers. Results: We find that framing substantially affects their risk-taking. None of the "Big Five" personality traits are related to risk-taking in software engineers after correcting for multiple testing. Conclusions: Software engineers and their managers must be aware of framing effects and account for them properly.

背景:冒险在软件工程师的日常活动中是普遍存在的，然而关于它的研究却很少。目的与方法:研究软件工程师的冒险行为是否受到框架效应和软件工程师个性的影响。为此，我们对124名软件工程师进行了调查实验。结果:我们发现框架对他们的冒险行为有显著影响。在对多重测试进行校正后，“五大”人格特征中没有一个与软件工程师的冒险行为有关。结论:软件工程师和他们的管理者必须意识到框架效应，并正确地解释它们。

引用次数: 0

Towards using Few-Shot Prompt Learning for Automating Model Completion 用少镜头提示学习实现模型自动完成

2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

Pub Date : 2022-12-07 DOI: 10.1109/ICSE-NIER58687.2023.00008

Meriem Ben Chaaben, Lola Burgueño, H. Sahraoui

We propose a simple yet a novel approach to improve completion in domain modeling activities. Our approach exploits the power of large language models by using few-shot prompt learning without the need to train or fine-tune those models with large datasets that are scarce in this field. We implemented our approach and tested it on the completion of static and dynamic domain diagrams. Our initial evaluation shows that such an approach is effective and can be integrated in different ways during the modeling activities.

我们提出了一种简单而新颖的方法来提高领域建模活动的完成度。我们的方法利用了大型语言模型的强大功能，通过使用少量的提示学习，而不需要使用该领域稀缺的大型数据集来训练或微调这些模型。我们实现了我们的方法，并在静态和动态域图的完成上进行了测试。我们的初步评估表明，这种方法是有效的，并且可以在建模活动中以不同的方式集成。

引用次数: 4

CodeS: Towards Code Model Generalization Under Distribution Shift 代码:分布移位下的代码模型泛化

2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

Pub Date : 2022-06-11 DOI: 10.1109/ICSE-NIER58687.2023.00007

Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Lei Ma, Mike Papadakis, Yves Le Traon

Distribution shift has been a longstanding challenge for the reliable deployment of deep learning (DL) models due to unexpected accuracy degradation. Although DL has been becoming a driving force for large-scale source code analysis in the big code era, limited progress has been made on distribution shift analysis and benchmarking for source code tasks. To fill this gap, this paper initiates to propose CodeS, a distribution shift benchmark dataset, for source code learning. Specifically, CodeS supports two programming languages (Java and Python) and five shift types (task, programmer, time-stamp, token, and concrete syntax tree). Extensive experiments based on CodeS reveal that 1) out-of-distribution detectors from other domains (e.g., computer vision) do not generalize to source code, 2) all code classification models suffer from distribution shifts, 3) representation-based shifts have a higher impact on the model than others, and 4) pretrained bimodal models are relatively more resistant to distribution shifts.

由于意想不到的精度下降，分布转移一直是深度学习(DL)模型可靠部署的一个长期挑战。虽然深度学习已经成为大代码时代大规模源代码分析的推动力，但在源代码任务的分布转移分析和基准测试方面进展有限。为了填补这一空白，本文提出了用于源代码学习的分布位移基准数据集CodeS。具体来说，CodeS支持两种编程语言(Java和Python)和五种转换类型(任务、程序员、时间戳、令牌和具体语法树)。基于CodeS的大量实验表明，1)来自其他领域(如计算机视觉)的分布外检测器不能推广到源代码，2)所有代码分类模型都受到分布移位的影响，3)基于表示的移位对模型的影响比其他的更大，4)预训练的双峰模型相对更能抵抗分布移位。

引用次数: 5

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀