This paper addresses the following problem: Given a process model and an event log containing trace prefixes of ongoing cases of a process, map each case to its corresponding state (i.e., marking) in the model. This state computation operation is a building block of other process mining operations, such as log animation and short-term simulation. An approach to this state computation problem is to perform a token-based replay of each trace prefix against the model. However, when a trace prefix does not strictly follow the behavior of the process model, token replay may produce a state that is not reachable from the initial state of the process. An alternative approach is to first compute an alignment between the trace prefix of each ongoing case and the model, and then replay the aligned trace prefix. However, (prefix-)alignment is computationally expensive. This paper proposes a method that, given a trace prefix of an ongoing case, computes its state in constant time using an index that represents states as n-grams. An empirical evaluation shows that the proposed approach has an accuracy comparable to that of the prefix-alignment approach, while achieving a throughput of hundreds of thousands of traces per second.
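The core idea can be illustrated with a small sketch. The following is a hypothetical Python sketch, not the authors' implementation: it assumes the n-grams of activities observable in the model, together with the marking reached after each, have been enumerated offline, and it then resolves the state of an ongoing case from the last n events of its prefix in constant time.

```python
from collections import defaultdict

N = 3  # n-gram size (assumed)

def build_index(model_runs, n=N):
    """Map each n-gram of activities observed in runs of the model to the
    marking(s) reached right after it. `model_runs` is a list of
    (activities, markings) pairs, where markings[i] is the marking reached
    after executing activities[i]."""
    index = defaultdict(set)
    for activities, markings in model_runs:
        for i in range(len(activities)):
            gram = tuple(activities[max(0, i - n + 1): i + 1])
            index[gram].add(markings[i])
    return index

def state_of(prefix, index, n=N):
    """Constant-time lookup using the last n events of the ongoing case,
    falling back to shorter suffixes for non-conforming prefixes."""
    for k in range(min(n, len(prefix)), 0, -1):
        marking = index.get(tuple(prefix[-k:]))
        if marking:
            return marking
    return None  # no suffix of the prefix matches indexed model behavior

# Hypothetical runs of a small order-to-cash model (markings named m1..m5).
runs = [
    (["create", "approve", "ship", "invoice"], ["m1", "m2", "m3", "m4"]),
    (["create", "reject"], ["m1", "m5"]),
]
index = build_index(runs)
print(state_of(["create", "approve", "ship"], index))  # {'m3'}
```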
{"title":"Efficient Online Computation of Business Process State From Trace Prefixes via N-Gram Indexing","authors":"David Chapela-Campa, Marlon Dumas","doi":"arxiv-2409.05658","DOIUrl":"https://doi.org/arxiv-2409.05658","url":null,"abstract":"This paper addresses the following problem: Given a process model and an\u0000event log containing trace prefixes of ongoing cases of a process, map each\u0000case to its corresponding state (i.e., marking) in the model. This state\u0000computation operation is a building block of other process mining operations,\u0000such as log animation and short-term simulation. An approach to this state\u0000computation problem is to perform a token-based replay of each trace prefix\u0000against the model. However, when a trace prefix does not strictly follow the\u0000behavior of the process model, token replay may produce a state that is not\u0000reachable from the initial state of the process. An alternative approach is to\u0000first compute an alignment between the trace prefix of each ongoing case and\u0000the model, and then replay the aligned trace prefix. However,\u0000(prefix-)alignment is computationally expensive. This paper proposes a method\u0000that, given a trace prefix of an ongoing case, computes its state in constant\u0000time using an index that represents states as n-grams. An empirical evaluation\u0000shows that the proposed approach has an accuracy comparable to that of the\u0000prefix-alignment approach, while achieving a throughput of hundreds of\u0000thousands of traces per second.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"60 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Md Mustakim Billah, Palash Ranjan Roy, Zadia Codabux, Banani Roy
Competitive programming platforms such as LeetCode, Codeforces, and HackerRank evaluate programming skills and are often used by recruiters for screening. With the rise of advanced Large Language Models (LLMs) such as ChatGPT, Gemini, and Meta AI, their problem-solving ability on these platforms needs assessment. This study explores the ability of LLMs to tackle diverse programming challenges across platforms of varying difficulty, offering insights into their real-time and offline performance and comparing them with human programmers. We tested 98 problems from LeetCode and 126 from Codeforces, covering 15 categories. Nine online contests from Codeforces and LeetCode were conducted, along with two certification tests on HackerRank, to assess real-time performance. Prompts and feedback mechanisms were used to guide the LLMs, and correlations were explored across different scenarios. LLMs such as ChatGPT (71.43% success on LeetCode) excelled on LeetCode and in HackerRank certifications but struggled in virtual contests, particularly on Codeforces. They outperformed human users on LeetCode archive problems, excelling in time and memory efficiency, but underperformed in harder Codeforces contests. While not an immediate threat, the performance of LLMs on these platforms is concerning, and platforms will need to account for their continued improvement.
{"title":"Are Large Language Models a Threat to Programming Platforms? An Exploratory Study","authors":"Md Mustakim Billah, Palash Ranjan Roy, Zadia Codabux, Banani Roy","doi":"arxiv-2409.05824","DOIUrl":"https://doi.org/arxiv-2409.05824","url":null,"abstract":"Competitive programming platforms like LeetCode, Codeforces, and HackerRank\u0000evaluate programming skills, often used by recruiters for screening. With the\u0000rise of advanced Large Language Models (LLMs) such as ChatGPT, Gemini, and Meta\u0000AI, their problem-solving ability on these platforms needs assessment. This\u0000study explores LLMs' ability to tackle diverse programming challenges across\u0000platforms with varying difficulty, offering insights into their real-time and\u0000offline performance and comparing them with human programmers. We tested 98 problems from LeetCode, 126 from Codeforces, covering 15\u0000categories. Nine online contests from Codeforces and LeetCode were conducted,\u0000along with two certification tests on HackerRank, to assess real-time\u0000performance. Prompts and feedback mechanisms were used to guide LLMs, and\u0000correlations were explored across different scenarios. LLMs, like ChatGPT (71.43% success on LeetCode), excelled in LeetCode and\u0000HackerRank certifications but struggled in virtual contests, particularly on\u0000Codeforces. They performed better than users in LeetCode archives, excelling in\u0000time and memory efficiency but underperforming in harder Codeforces contests.\u0000While not immediately threatening, LLMs performance on these platforms is\u0000concerning, and future improvements will need addressing.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Software testing is a crucial phase in the software development lifecycle (SDLC), ensuring that products meet necessary functional, performance, and quality benchmarks before release. Despite advancements in automation, traditional methods of generating and validating test cases still face significant challenges, including prolonged timelines, human error, incomplete test coverage, and high costs of manual intervention. These limitations often lead to delayed product launches and undetected defects that compromise software quality and user satisfaction. The integration of artificial intelligence (AI) into software testing presents a promising solution to these persistent challenges. AI-driven testing methods automate the creation of comprehensive test cases, dynamically adapt to changes, and leverage machine learning to identify high-risk areas in the codebase. This approach enhances regression testing efficiency while expanding overall test coverage. Furthermore, AI-powered tools enable continuous testing and self-healing test cases, significantly reducing manual oversight and accelerating feedback loops, ultimately leading to faster and more reliable software releases. This paper explores the transformative potential of AI in improving test case generation and validation, focusing on its ability to enhance efficiency, accuracy, and scalability in testing processes. It also addresses key challenges associated with adapting AI for testing, including the need for high-quality training data, model transparency, and a balance between automation and human oversight. Through case studies and examples of real-world applications, this paper illustrates how AI can significantly enhance testing efficiency across both legacy and modern software systems.
{"title":"The Future of Software Testing: AI-Powered Test Case Generation and Validation","authors":"Mohammad Baqar, Rajat Khanda","doi":"arxiv-2409.05808","DOIUrl":"https://doi.org/arxiv-2409.05808","url":null,"abstract":"Software testing is a crucial phase in the software development lifecycle\u0000(SDLC), ensuring that products meet necessary functional, performance, and\u0000quality benchmarks before release. Despite advancements in automation,\u0000traditional methods of generating and validating test cases still face\u0000significant challenges, including prolonged timelines, human error, incomplete\u0000test coverage, and high costs of manual intervention. These limitations often\u0000lead to delayed product launches and undetected defects that compromise\u0000software quality and user satisfaction. The integration of artificial\u0000intelligence (AI) into software testing presents a promising solution to these\u0000persistent challenges. AI-driven testing methods automate the creation of\u0000comprehensive test cases, dynamically adapt to changes, and leverage machine\u0000learning to identify high-risk areas in the codebase. This approach enhances\u0000regression testing efficiency while expanding overall test coverage.\u0000Furthermore, AI-powered tools enable continuous testing and self-healing test\u0000cases, significantly reducing manual oversight and accelerating feedback loops,\u0000ultimately leading to faster and more reliable software releases. This paper\u0000explores the transformative potential of AI in improving test case generation\u0000and validation, focusing on its ability to enhance efficiency, accuracy, and\u0000scalability in testing processes. It also addresses key challenges associated\u0000with adapting AI for testing, including the need for high quality training\u0000data, ensuring model transparency, and maintaining a balance between automation\u0000and human oversight. Through case studies and examples of real-world\u0000applications, this paper illustrates how AI can significantly enhance testing\u0000efficiency across both legacy and modern software systems.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142227613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Large language models (LLMs) have achieved impressive performance on code generation. Although prior studies have enhanced LLMs with prompting techniques and code refinement, LLMs still struggle with complex programming problems due to rigid solution plans. In this paper, we draw on pair programming practices to propose PairCoder, a novel LLM-based framework for code generation. PairCoder incorporates two collaborative LLM agents, namely a Navigator agent for high-level planning and a Driver agent for specific implementation. The Navigator is responsible for proposing promising solution plans, selecting the current optimal plan, and directing the next iteration round based on execution feedback. The Driver follows the guidance of the Navigator to undertake initial code generation, code testing, and refinement. This interleaved and iterative workflow involves multi-plan exploration and feedback-based refinement, which mimics the collaboration of pair programmers. We evaluate PairCoder with both open-source and closed-source LLMs on various code generation benchmarks. Extensive experimental results demonstrate the superior accuracy of PairCoder, achieving relative pass@1 improvements of 12.00%-162.43% over prompting LLMs directly.
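As a rough illustration of the Navigator/Driver interplay described in the abstract, the sketch below outlines one possible control loop. The `llm` and `run_tests` helpers, the prompt wording, and the plan-switching rule are hypothetical stand-ins, not PairCoder's actual interface.

```python
def pair_coder(problem, llm, run_tests, max_rounds=5, n_plans=3):
    """Two-agent sketch: a Navigator proposes and selects solution plans,
    a Driver implements and refines code based on test feedback."""
    # Navigator: multi-plan exploration.
    plans = [llm(f"Propose solution plan #{i + 1} for:\n{problem}")
             for i in range(n_plans)]
    plan = llm("Pick the most promising plan:\n" + "\n---\n".join(plans))

    feedback, code = "", ""
    for _ in range(max_rounds):
        # Driver: initial generation, then feedback-driven refinement.
        code = llm(f"Implement this plan.\nPlan: {plan}\n"
                   f"Problem: {problem}\nPrevious feedback: {feedback}")
        passed, feedback = run_tests(code)
        if passed:
            return code
        # Navigator: keep refining the current plan or switch to another one.
        decision = llm(f"Tests failed: {feedback}\nCurrent plan: {plan}\n"
                       f"Other plans: {plans}\n"
                       "Reply 'refine' or the number of a better plan.")
        if decision.strip().isdigit():
            plan = plans[int(decision.strip()) - 1]
    return code  # best effort once the iteration budget is exhausted
```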
{"title":"A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement","authors":"Huan Zhang, Wei Cheng, Yuhan Wu, Wei Hu","doi":"arxiv-2409.05001","DOIUrl":"https://doi.org/arxiv-2409.05001","url":null,"abstract":"Large language models (LLMs) have achieved impressive performance on code\u0000generation. Although prior studies enhanced LLMs with prompting techniques and\u0000code refinement, they still struggle with complex programming problems due to\u0000rigid solution plans. In this paper, we draw on pair programming practices to\u0000propose PairCoder, a novel LLM-based framework for code generation. PairCoder\u0000incorporates two collaborative LLM agents, namely a Navigator agent for\u0000high-level planning and a Driver agent for specific implementation. The\u0000Navigator is responsible for proposing promising solution plans, selecting the\u0000current optimal plan, and directing the next iteration round based on execution\u0000feedback. The Driver follows the guidance of Navigator to undertake initial\u0000code generation, code testing, and refinement. This interleaved and iterative\u0000workflow involves multi-plan exploration and feedback-based refinement, which\u0000mimics the collaboration of pair programmers. We evaluate PairCoder with both\u0000open-source and closed-source LLMs on various code generation benchmarks.\u0000Extensive experimental results demonstrate the superior accuracy of PairCoder,\u0000achieving relative pass@1 improvements of 12.00%-162.43% compared to prompting\u0000LLMs directly.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In 2023, Sonatype reported a 200% increase in software supply chain attacks, including major build infrastructure attacks. To secure the software supply chain, practitioners can follow security framework guidance like the Supply-chain Levels for Software Artifacts (SLSA). However, recent surveys and industry summits have shown that despite growing interest, the adoption of SLSA is not widespread. To understand adoption challenges, the goal of this study is to aid framework authors and practitioners in improving the adoption and development of Supply-Chain Levels for Software Artifacts (SLSA) through a qualitative study of SLSA-related issues on GitHub. We analyzed 1,523 SLSA-related issues extracted from 233 GitHub repositories. We conducted a topic-guided thematic analysis, leveraging the Latent Dirichlet Allocation (LDA) unsupervised machine learning algorithm, to explore the challenges of adopting SLSA and the strategies for overcoming these challenges. We identified four significant challenges and five suggested adoption strategies. The two main challenges reported are complex implementation and unclear communication, highlighting the difficulties in implementing and understanding the SLSA process across diverse ecosystems. The suggested strategies include streamlining provenance generation processes, improving the SLSA verification process, and providing specific and detailed documentation. Our findings indicate that some strategies can help mitigate multiple challenges, and some challenges need future research and tool enhancement.
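For readers unfamiliar with topic-guided thematic analysis, the snippet below shows how an LDA pass over issue texts can surface candidate topics to seed manual coding. It uses scikit-learn rather than whatever tooling the authors used, and the corpus and parameters are illustrative only.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

# Illustrative stand-ins for the 1,523 SLSA-related issue titles/bodies.
issues = [
    "provenance generation fails in reusable workflow",
    "unclear how to reach SLSA level 3 with our builder",
    "verification of provenance signatures is slow and undocumented",
    "documentation does not explain the attestation format",
]

vectorizer = CountVectorizer(stop_words="english")
doc_term = vectorizer.fit_transform(issues)

lda = LatentDirichletAllocation(n_components=2, random_state=0)
lda.fit(doc_term)

# Top words per topic act as seeds for the (manual) thematic analysis.
terms = vectorizer.get_feature_names_out()
for t, weights in enumerate(lda.components_):
    top = [terms[i] for i in weights.argsort()[-5:][::-1]]
    print(f"topic {t}: {', '.join(top)}")
```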
{"title":"Unraveling Challenges with Supply-Chain Levels for Software Artifacts (SLSA) for Securing the Software Supply Chain","authors":"Mahzabin Tamanna, Sivana Hamer, Mindy Tran, Sascha Fahl, Yasemin Acar, Laurie Williams","doi":"arxiv-2409.05014","DOIUrl":"https://doi.org/arxiv-2409.05014","url":null,"abstract":"In 2023, Sonatype reported a 200% increase in software supply chain attacks,\u0000including major build infrastructure attacks. To secure the software supply\u0000chain, practitioners can follow security framework guidance like the\u0000Supply-chain Levels for Software Artifacts (SLSA). However, recent surveys and\u0000industry summits have shown that despite growing interest, the adoption of SLSA\u0000is not widespread. To understand adoption challenges, textit{the goal of this\u0000study is to aid framework authors and practitioners in improving the adoption\u0000and development of Supply-Chain Levels for Software Artifacts (SLSA) through a\u0000qualitative study of SLSA-related issues on GitHub}. We analyzed 1,523\u0000SLSA-related issues extracted from 233 GitHub repositories. We conducted a\u0000topic-guided thematic analysis, leveraging the Latent Dirichlet Allocation\u0000(LDA) unsupervised machine learning algorithm, to explore the challenges of\u0000adopting SLSA and the strategies for overcoming these challenges. We identified\u0000four significant challenges and five suggested adoption strategies. The two\u0000main challenges reported are complex implementation and unclear communication,\u0000highlighting the difficulties in implementing and understanding the SLSA\u0000process across diverse ecosystems. The suggested strategies include\u0000streamlining provenance generation processes, improving the SLSA verification\u0000process, and providing specific and detailed documentation. Our findings\u0000indicate that some strategies can help mitigate multiple challenges, and some\u0000challenges need future research and tool enhancement.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Understanding stakeholder needs is essential for project success, as stakeholder importance varies across projects. This study proposes a framework for early stakeholder identification and continuous engagement throughout the project lifecycle. The framework addresses common organizational failures in stakeholder communication that lead to project delays and cancellations. By classifying stakeholders by influence and interest, establishing clear communication channels, and implementing regular feedback loops, the framework ensures effective stakeholder involvement. This approach allows for necessary project adjustments and builds long-term relationships, validated by a survey of IT professionals. Engaging stakeholders strategically at all stages minimizes misunderstandings and project risks, contributing to better project management and lifecycle outcomes.
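A minimal sketch of the classification step mentioned above, mapping each stakeholder's influence and interest to an engagement strategy in the spirit of a power/interest grid; the thresholds, category names, and example stakeholders are assumptions for illustration, not taken from the paper.

```python
def engagement_strategy(influence: float, interest: float, threshold: float = 0.5) -> str:
    """Place a stakeholder on a simple influence/interest grid (scores in [0, 1])."""
    if influence >= threshold and interest >= threshold:
        return "manage closely"   # key players: involve in decisions, frequent two-way channels
    if influence >= threshold:
        return "keep satisfied"   # high influence, low interest: concise periodic updates
    if interest >= threshold:
        return "keep informed"    # low influence, high interest: regular feedback loops
    return "monitor"              # minimal effort, reassess as the project evolves

stakeholders = [("regulator", 0.9, 0.3), ("end users", 0.4, 0.9), ("vendor", 0.2, 0.2)]
for name, influence, interest in stakeholders:
    print(name, "->", engagement_strategy(influence, interest))
```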
{"title":"Unified External Stakeholder Engagement and Requirements Strategy","authors":"Ahmed Abdulaziz Alnhari, Rizwan Qureshi","doi":"arxiv-2409.05019","DOIUrl":"https://doi.org/arxiv-2409.05019","url":null,"abstract":"Understanding stakeholder needs is essential for project success, as\u0000stakeholder importance varies across projects. This study proposes a framework\u0000for early stakeholder identification and continuous engagement throughout the\u0000project lifecycle. The framework addresses common organizational failures in\u0000stakeholder communication that lead to project delays and cancellations. By\u0000classifying stakeholders by influence and interest, establishing clear\u0000communication channels, and implementing regular feedback loops, the framework\u0000ensures effective stakeholder involvement. This approach allows for necessary\u0000project adjustments and builds long-term relationships, validated by a survey\u0000of IT professionals. Engaging stakeholders strategically at all stages\u0000minimizes misunderstandings and project risks, contributing to better project\u0000management and lifecycle outcomes.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142227615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dan Lin, Jiajing Wu, Yuxin Su, Ziye Zheng, Yuhong Nan, Zibin Zheng
Decentralized bridge applications are important software that connects various blockchains and facilitates cross-chain asset transfers in the decentralized finance (DeFi) ecosystem, which currently operates in a multi-chain environment. Cross-chain transaction association identifies and matches the unique transactions executed by bridge DApps, which is an important research direction for enhancing the traceability of cross-chain bridge DApps. However, existing methods rely entirely on unobservable internal ledgers or APIs, violating the open and decentralized properties of blockchain. In this paper, we analyze the challenges of this problem and then present CONNECTOR, an automated cross-chain transaction association analysis method based on bridge smart contracts. Specifically, CONNECTOR first identifies deposit transactions by extracting distinctive and generic features from the transaction traces of bridge contracts. Using the identified deposit transactions, CONNECTOR then mines the execution logs of bridge contracts to match the corresponding withdrawal transactions. We conduct real-world experiments on different types of bridges to demonstrate the effectiveness of CONNECTOR. The experiments show that CONNECTOR successfully identifies 100% of deposit transactions, associates 95.81% of withdrawal transactions, and surpasses methods designed for CeFi bridges. Based on the association results, we obtain interesting findings about cross-chain transaction behaviors in DeFi bridges and analyze how the tracing abilities of CONNECTOR can assist DeFi bridge apps.
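To make the two-step association concrete, here is a simplified, hypothetical sketch: deposit transactions are recognized from bridge-contract traces by a feature predicate, and withdrawals on the destination chain are then matched via a correlation key. The field names, predicate, and matching key are assumptions for illustration; they are not CONNECTOR's actual features.

```python
def is_deposit(trace):
    """Hypothetical feature check on a bridge-contract transaction trace:
    value is sent to the bridge contract and a cross-chain transfer event is emitted."""
    return trace["to"] == trace["bridge_contract"] and trace["emits_transfer_event"]

def associate(deposits, withdrawal_logs, time_window=3600):
    """Match each deposit to a withdrawal on the destination chain using a
    correlation key (token, amount, recipient) within a time window (seconds)."""
    pairs = []
    for dep in deposits:
        key = (dep["token"], dep["amount"], dep["recipient"])
        for wd in withdrawal_logs:
            if (wd["token"], wd["amount"], wd["to"]) == key and \
                    0 <= wd["timestamp"] - dep["timestamp"] <= time_window:
                pairs.append((dep["tx_hash"], wd["tx_hash"]))
                break
    return pairs
```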
{"title":"CONNECTOR: Enhancing the Traceability of Decentralized Bridge Applications via Automatic Cross-chain Transaction Association","authors":"Dan Lin, Jiajing Wu, Yuxin Su, Ziye Zheng, Yuhong Nan, Zibin Zheng","doi":"arxiv-2409.04937","DOIUrl":"https://doi.org/arxiv-2409.04937","url":null,"abstract":"Decentralized bridge applications are important software that connects\u0000various blockchains and facilitates cross-chain asset transfer in the\u0000decentralized finance (DeFi) ecosystem which currently operates in a\u0000multi-chain environment. Cross-chain transaction association identifies and\u0000matches unique transactions executed by bridge DApps, which is important\u0000research to enhance the traceability of cross-chain bridge DApps. However,\u0000existing methods rely entirely on unobservable internal ledgers or APIs,\u0000violating the open and decentralized properties of blockchain. In this paper,\u0000we analyze the challenges of this issue and then present CONNECTOR, an\u0000automated cross-chain transaction association analysis method based on bridge\u0000smart contracts. Specifically, CONNECTOR first identifies deposit transactions\u0000by extracting distinctive and generic features from the transaction traces of\u0000bridge contracts. With the accurate deposit transactions, CONNECTOR mines the\u0000execution logs of bridge contracts to achieve withdrawal transaction matching.\u0000We conduct real-world experiments on different types of bridges to demonstrate\u0000the effectiveness of CONNECTOR. The experiment demonstrates that CONNECTOR\u0000successfully identifies 100% deposit transactions, associates 95.81% withdrawal\u0000transactions, and surpasses methods for CeFi bridges. Based on the association\u0000results, we obtain interesting findings about cross-chain transaction behaviors\u0000in DeFi bridges and analyze the tracing abilities of CONNECTOR to assist the\u0000DeFi bridge apps.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"59 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ehsan Firouzi, Ammar Mansuri, Mohammad Ghafari, Maziar Kaveh
Cryptography misuses are prevalent in the wild. Crypto APIs are hard to use for developers, and static analysis tools do not detect every misuse. We developed SafEncrypt, an API that streamlines encryption tasks for Java developers. It is built on top of the native Java Cryptography Architecture, and it shields developers from crypto complexities and erroneous low-level details. Experiments showed that SafEncrypt is suitable for developers with varying levels of experience.
{"title":"From Struggle to Simplicity with a Usable and Secure API for Encryption in Java","authors":"Ehsan Firouzi, Ammar Mansuri, Mohammad Ghafari, Maziar Kaveh","doi":"arxiv-2409.05128","DOIUrl":"https://doi.org/arxiv-2409.05128","url":null,"abstract":"Cryptography misuses are prevalent in the wild. Crypto APIs are hard to use\u0000for developers, and static analysis tools do not detect every misuse. We\u0000developed SafEncrypt, an API that streamlines encryption tasks for Java\u0000developers. It is built on top of the native Java Cryptography Architecture,\u0000and it shields developers from crypto complexities and erroneous low-level\u0000details. Experiments showed that SafEncrypt is suitable for developers with\u0000varying levels of experience.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"20 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yakun Zhang, Chen Liu, Xiaofei Xie, Yun Lin, Jin Song Dong, Dan Hao, Lu Zhang
GUI test migration aims to produce test cases with events and assertions to test specific functionalities of a target app. Existing migration approaches typically focus on the widget-mapping paradigm that maps widgets from source apps to target apps. However, since different apps may implement the same functionality in different ways, direct mapping may result in incomplete or buggy test cases, thus significantly impacting the effectiveness of testing the target functionality and the practical applicability of these approaches. In this paper, we propose a new migration paradigm (i.e., the abstraction-concretization paradigm) that first abstracts the test logic for the target functionality and then utilizes this logic to generate the concrete GUI test case. Furthermore, we introduce MACdroid, the first approach that migrates GUI test cases based on this paradigm. Specifically, we propose an abstraction technique that utilizes source test cases from source apps targeting the same functionality to extract a general test logic for that functionality. Then, we propose a concretization technique that utilizes the general test logic to guide an LLM in generating the corresponding GUI test case (including events and assertions) for the target app. We evaluate MACdroid on two widely used datasets (including 31 apps, 34 functionalities, and 123 test cases). On the FrUITeR dataset, the test cases generated by MACdroid successfully test 64% of the target functionalities, improving over the baselines by 191%. On the Lin dataset, MACdroid successfully tests 75% of the target functionalities, outperforming the baselines by 42%. These results underscore the effectiveness of MACdroid in GUI test migration.
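The abstraction-concretization idea can be sketched roughly as follows; the `llm` call, the prompt phrasing, and the shape of the inputs are hypothetical placeholders rather than MACdroid's implementation.

```python
def abstract_test_logic(source_tests, llm):
    """Abstraction: distill source test cases that target the same functionality
    into a general, app-independent test logic (ordered steps plus checks)."""
    joined = "\n\n".join(source_tests)
    return llm("Summarize these GUI test cases into app-independent steps and "
               "assertions for the shared functionality:\n" + joined)

def concretize(test_logic, target_app_gui, llm):
    """Concretization: guide the LLM to map the general logic onto the target
    app's actual widgets, producing executable events and assertions."""
    return llm(f"Target app GUI hierarchy:\n{target_app_gui}\n"
               f"General test logic:\n{test_logic}\n"
               "Generate a concrete GUI test case (events and assertions) for this app.")

# Usage: test_case = concretize(abstract_test_logic(source_tests, llm), gui_dump, llm)
```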
{"title":"LLM-based Abstraction and Concretization for GUI Test Migration","authors":"Yakun Zhang, Chen Liu, Xiaofei Xie, Yun Lin, Jin Song Dong, Dan Hao, Lu Zhang","doi":"arxiv-2409.05028","DOIUrl":"https://doi.org/arxiv-2409.05028","url":null,"abstract":"GUI test migration aims to produce test cases with events and assertions to\u0000test specific functionalities of a target app. Existing migration approaches\u0000typically focus on the widget-mapping paradigm that maps widgets from source\u0000apps to target apps. However, since different apps may implement the same\u0000functionality in different ways, direct mapping may result in incomplete or\u0000buggy test cases, thus significantly impacting the effectiveness of testing\u0000target functionality and the practical applicability. In this paper, we propose a new migration paradigm (i.e.,\u0000abstraction-concretization paradigm) that first abstracts the test logic for\u0000the target functionality and then utilizes this logic to generate the concrete\u0000GUI test case. Furthermore, we introduce MACdroid, the first approach that\u0000migrates GUI test cases based on this paradigm. Specifically, we propose an\u0000abstraction technique that utilizes source test cases from source apps\u0000targeting the same functionality to extract a general test logic for that\u0000functionality. Then, we propose a concretization technique that utilizes the\u0000general test logic to guide an LLM in generating the corresponding GUI test\u0000case (including events and assertions) for the target app. We evaluate MACdroid\u0000on two widely-used datasets (including 31 apps, 34 functionalities, and 123\u0000test cases). On the FrUITeR dataset, the test cases generated by MACdroid\u0000successfully test 64% of the target functionalities, improving the baselines by\u0000191%. On the Lin dataset, MACdroid successfully tests 75% of the target\u0000functionalities, outperforming the baselines by 42%. These results underscore\u0000the effectiveness of MACdroid in GUI test migration.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"10 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jadson Santos, Daniel Alencar da Costa, Shane McIntosh, Uirá Kulesza
Continuous Integration (CI) encompasses a set of widely adopted practices that enhance software development. However, there are indications that developers may not adequately monitor CI practices. Hence, this paper explores developers' perceptions regarding the monitoring of CI practices. To achieve this, we first perform a Document Analysis to assess developers' expressed need for practice monitoring in pull request comments generated during the development process. After that, we conduct a survey among developers from 121 open-source projects to understand their perception of the significance of monitoring seven CI practices in their projects. Finally, we triangulate the emergent themes from our survey by performing a second Document Analysis to understand the extent of monitoring features supported by existing CI services. Our key findings indicate that: 1) the most frequently mentioned CI practice during the development process is "Test Coverage" (> 80%), while "Build Health" and "Time to Fix a Broken Build" present notable opportunities for monitoring CI practices; 2) developers do not adequately monitor all CI practices and express interest in monitoring additional practices; and 3) the most popular CI services currently offer limited native support for monitoring CI practices, requiring the use of third-party tools. Our results lead us to conclude that monitoring CI practices is often overlooked by both CI services and developers. Using third-party tools in conjunction with CI services is challenging: these tools monitor some redundant practices and still fall short of fully supporting the monitoring of CI practices. Therefore, CI services should implement support for monitoring CI practices, which would facilitate and encourage developers to monitor them.
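As a small illustration of what monitoring two of the practices named above could look like, the snippet below computes "Build Health" and "Time to Fix a Broken Build" from a chronological list of build records. The record format and metric definitions are simplified assumptions, not those of the study or of any particular CI service.

```python
from datetime import datetime

def build_health(builds):
    """Fraction of builds that passed."""
    return sum(b["passed"] for b in builds) / len(builds)

def time_to_fix_broken_builds(builds):
    """Durations between each first failing build and the next passing one."""
    fixes, broken_since = [], None
    for b in builds:  # builds assumed sorted by time
        if not b["passed"] and broken_since is None:
            broken_since = b["time"]
        elif b["passed"] and broken_since is not None:
            fixes.append(b["time"] - broken_since)
            broken_since = None
    return fixes

builds = [
    {"time": datetime(2024, 9, 8, 9, 0), "passed": True},
    {"time": datetime(2024, 9, 8, 11, 0), "passed": False},
    {"time": datetime(2024, 9, 8, 15, 30), "passed": True},
]
print(round(build_health(builds), 2))          # 0.67
print(time_to_fix_broken_builds(builds)[0])    # 4:30:00
```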
{"title":"On the Need to Monitor Continuous Integration Practices -- An Empirical Study","authors":"Jadson Santos, Daniel Alencar da Costa, Shane McIntosh, Uirá Kulesza","doi":"arxiv-2409.05101","DOIUrl":"https://doi.org/arxiv-2409.05101","url":null,"abstract":"Continuous Integration (CI) encompasses a set of widely adopted practices\u0000that enhance software development. However, there are indications that\u0000developers may not adequately monitor CI practices. Hence, this paper explores\u0000developers' perceptions regarding the monitoring CI practices. To achieve this,\u0000we first perform a Document Analysis to assess developers' expressed need for\u0000practice monitoring in pull requests comments generated by developers during\u0000the development process. After that, we conduct a survey among developers from\u0000121 open-source projects to understand perception of the significance of\u0000monitoring seven CI practices in their projects. Finally, we triangulate the\u0000emergent themes from our survey by performing a second Document Analysis to\u0000understand the extent of monitoring features supported by existing CI services.\u0000Our key findings indicate that: 1) the most frequently mentioned CI practice\u0000during the development process is ``Test Coverage'' (> 80%), while ``Build\u0000Health'' and ``Time to Fix a Broken Build'' present notable opportunities for\u0000monitoring CI practices; 2) developers do not adequately monitor all CI\u0000practices and express interest in monitoring additional practices; and 3) the\u0000most popular CI services currently offer limited native support for monitoring\u0000CI practices, requiring the use of third-party tools. Our results lead us to\u0000conclude that monitoring CI practices is often overlooked by both CI services\u0000and developers. Using third-party tools in conjunction with CI services is\u0000challenging, they monitor some redundant practices and still falls short of\u0000fully supporting CI practices monitoring. Therefore, CI services should\u0000implement CI practices monitoring, which would facilitate and encourage\u0000developers to monitor them.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"31 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}