首页 > 最新文献

Information and Software Technology最新文献

英文 中文
Enhancing building environments: A digital twin approach for informed decision-making and better campus experiences 改善建筑环境:数字孪生方法,为明智的决策和更好的校园体验
IF 4.3 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-04 DOI: 10.1016/j.infsof.2026.108044
Gianni Tumedei, Chiara Ceccarini, Luca Giulianini, Giovanni Delnevo, Catia Prandi

Context:

Smart buildings are foundational elements of smart cities, integrating technologies such as Building Information Modeling (BIM), Building Management Systems (BMS), and Digital Twins (DTs) to support design, construction, and operations. However, their potential to directly enhance occupant experience and inform individual decision-making remains relatively underexplored, particularly for non-technical users.

Objective:

This study investigates the effects of providing accessible, room-level environmental information to all building occupants using data from BIM/BMS systems. The goal is to support building occupants in their daily lives, increasing awareness of environmental conditions and offering added value for their decision-making processes.

Methods:

We designed and developed a system that combines real-time sensor readings with short-term environmental forecasts and a comfort computation algorithm. These data are visualized through an interactive wayfinding interface deployed on public touch displays across a university campus. We evaluated the system using a mixed-methods approach, gathering qualitative and quantitative feedback on information quality, design quality, usability, and perceived value.

Results:

The evaluation revealed strong interest from non-experts in accessing environmental information. Participants found the system informative, easy to use, and valuable in supporting their everyday activities on campus. The combination of real-time and forecasted data, comfort indicators, and wayfinding contributed to a positive user experience.

Conclusion:

Our findings demonstrate the feasibility and value of integrating environmental data into public, user-friendly smart buildings interfaces, highlighting the importance of accessible data in improving sustainability, efficiency, and occupant decision-making in smart campus digital twin environments.
背景:智能建筑是智慧城市的基本要素,它集成了建筑信息模型(BIM)、建筑管理系统(BMS)和数字孪生(dt)等技术,以支持设计、施工和运营。然而,它们直接增强乘员体验和为个人决策提供信息的潜力仍然相对未得到充分开发,特别是对于非技术用户。目的:本研究调查了利用BIM/BMS系统的数据向所有建筑居住者提供可访问的房间级环境信息的效果。其目标是支持建筑居住者的日常生活,提高他们对环境状况的认识,并为他们的决策过程提供附加价值。方法:我们设计并开发了一个将实时传感器读数与短期环境预测和舒适度计算算法相结合的系统。这些数据通过部署在大学校园公共触摸显示器上的交互式寻路界面可视化。我们使用混合方法对系统进行评估,收集关于信息质量、设计质量、可用性和感知价值的定性和定量反馈。结果:评价显示了非专家对获取环境信息的强烈兴趣。参与者发现该系统信息丰富,易于使用,对支持他们的日常校园活动很有价值。实时和预测数据、舒适指标和寻路功能的结合为用户带来了积极的体验。结论:我们的研究结果证明了将环境数据集成到公共、用户友好的智能建筑界面中的可行性和价值,突出了可访问数据在智能校园数字孪生环境中提高可持续性、效率和居住者决策方面的重要性。
{"title":"Enhancing building environments: A digital twin approach for informed decision-making and better campus experiences","authors":"Gianni Tumedei,&nbsp;Chiara Ceccarini,&nbsp;Luca Giulianini,&nbsp;Giovanni Delnevo,&nbsp;Catia Prandi","doi":"10.1016/j.infsof.2026.108044","DOIUrl":"10.1016/j.infsof.2026.108044","url":null,"abstract":"<div><h3>Context:</h3><div>Smart buildings are foundational elements of smart cities, integrating technologies such as Building Information Modeling (BIM), Building Management Systems (BMS), and Digital Twins (DTs) to support design, construction, and operations. However, their potential to directly enhance occupant experience and inform individual decision-making remains relatively underexplored, particularly for non-technical users.</div></div><div><h3>Objective:</h3><div>This study investigates the effects of providing accessible, room-level environmental information to all building occupants using data from BIM/BMS systems. The goal is to support building occupants in their daily lives, increasing awareness of environmental conditions and offering added value for their decision-making processes.</div></div><div><h3>Methods:</h3><div>We designed and developed a system that combines real-time sensor readings with short-term environmental forecasts and a comfort computation algorithm. These data are visualized through an interactive wayfinding interface deployed on public touch displays across a university campus. We evaluated the system using a mixed-methods approach, gathering qualitative and quantitative feedback on information quality, design quality, usability, and perceived value.</div></div><div><h3>Results:</h3><div>The evaluation revealed strong interest from non-experts in accessing environmental information. Participants found the system informative, easy to use, and valuable in supporting their everyday activities on campus. The combination of real-time and forecasted data, comfort indicators, and wayfinding contributed to a positive user experience.</div></div><div><h3>Conclusion:</h3><div>Our findings demonstrate the feasibility and value of integrating environmental data into public, user-friendly smart buildings interfaces, highlighting the importance of accessible data in improving sustainability, efficiency, and occupant decision-making in smart campus digital twin environments.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"193 ","pages":"Article 108044"},"PeriodicalIF":4.3,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
COTVD: A function-level vulnerability detection framework using chain-of-thought reasoning with large language models COTVD:一个使用大型语言模型的思维链推理的功能级漏洞检测框架
IF 4.3 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-03 DOI: 10.1016/j.infsof.2026.108043
Yinan Chen , Xiangping Chen , Yuan Huang , Changlin Yang , Lei Yun

Context:

With the growth of open-source vulnerability databases, deep learning methods have been widely applied for vulnerability detection. However, most existing approaches focus on coarse-grained detection, predicting only whether a sample contains a vulnerability without providing detailed explanations.

Objective:

This study aims to propose a large language model (LLM)-based method for function-level vulnerability detection and analysis, providing detailed insights into detected vulnerabilities.

Methods:

We introduce CoTVD, which leverages the reasoning capabilities of LLMs through the Chain of Thought (CoT) approach. An instructive prompt strategy was designed to guide the model in analyzing data and control dependencies related to library function calls, enhancing detection flexibility and accuracy. CoTVD was evaluated on multiple LLMs, including GPT-4o-128K, GPT-3.5-Turbo, Gemini-1.5-Pro, Claude-3.5-Sonnet, Llama-3.1-405B, Qwen2-72B-Instruct-T, and DeepSeek-67B-T.

Results:

Experimental results show that CoTVD based on GPT-4o-128K outperforms other models, achieving a recall rate of 94.77%. CoTVD, however, generates a relatively high number of false positives due to its sensitivity to potential security risks, which include not only traditional vulnerabilities but also risks such as information leakage. The study further refines existing dataset labels into finer granularity. In a human evaluation with 10 experts on 200 samples, each participant identified, on average, 1.7 additional vulnerability samples and 1.1 lines of risky code when assisted by CoTVD, demonstrating its effectiveness in real-world scenarios.

Conclusion:

CoTVD enables function-level vulnerability detection with detailed analysis and proves effective in identifying vulnerabilities and code risks in practice, offering a novel approach for vulnerability detection and security analysis.
背景:随着开源漏洞数据库的增长,深度学习方法被广泛应用于漏洞检测。然而,大多数现有的方法侧重于粗粒度检测,仅预测样本是否包含漏洞,而不提供详细的解释。目的:本研究旨在提出一种基于大语言模型(large language model, LLM)的功能级漏洞检测与分析方法,为检测到的漏洞提供详细的洞察。方法:我们引入了CoTVD,它通过思维链(CoT)方法利用了法学硕士的推理能力。设计了一种指导性提示策略,用于指导模型分析与库函数调用相关的数据和控制依赖关系,从而提高检测的灵活性和准确性。在多个llm上进行CoTVD评估,包括gpt -40 - 128k, GPT-3.5-Turbo, Gemini-1.5-Pro, Claude-3.5-Sonnet, Llama-3.1-405B, Qwen2-72B-Instruct-T和DeepSeek-67B-T。结果:实验结果表明,基于gpt - 40 - 128k的CoTVD优于其他模型,召回率达到94.77%。然而,由于CoTVD对潜在安全风险的敏感性,它产生的误报率相对较高,这些安全风险不仅包括传统的漏洞,还包括信息泄露等风险。该研究进一步将现有数据集标签细化到更细的粒度。在由10位专家对200个样本进行的人类评估中,在CoTVD的帮助下,每个参与者平均识别出1.7个额外的漏洞样本和1.1行有风险的代码,证明了它在现实场景中的有效性。结论:CoTVD实现了功能级漏洞检测,并进行了详细的分析,实践证明,CoTVD在识别漏洞和代码风险方面是有效的,为漏洞检测和安全分析提供了一种新的方法。
{"title":"COTVD: A function-level vulnerability detection framework using chain-of-thought reasoning with large language models","authors":"Yinan Chen ,&nbsp;Xiangping Chen ,&nbsp;Yuan Huang ,&nbsp;Changlin Yang ,&nbsp;Lei Yun","doi":"10.1016/j.infsof.2026.108043","DOIUrl":"10.1016/j.infsof.2026.108043","url":null,"abstract":"<div><h3>Context:</h3><div>With the growth of open-source vulnerability databases, deep learning methods have been widely applied for vulnerability detection. However, most existing approaches focus on coarse-grained detection, predicting only whether a sample contains a vulnerability without providing detailed explanations.</div></div><div><h3>Objective:</h3><div>This study aims to propose a large language model (LLM)-based method for function-level vulnerability detection and analysis, providing detailed insights into detected vulnerabilities.</div></div><div><h3>Methods:</h3><div>We introduce CoTVD, which leverages the reasoning capabilities of LLMs through the Chain of Thought (CoT) approach. An instructive prompt strategy was designed to guide the model in analyzing data and control dependencies related to library function calls, enhancing detection flexibility and accuracy. CoTVD was evaluated on multiple LLMs, including GPT-4o-128K, GPT-3.5-Turbo, Gemini-1.5-Pro, Claude-3.5-Sonnet, Llama-3.1-405B, Qwen2-72B-Instruct-T, and DeepSeek-67B-T.</div></div><div><h3>Results:</h3><div>Experimental results show that CoTVD based on GPT-4o-128K outperforms other models, achieving a recall rate of 94.77%. CoTVD, however, generates a relatively high number of false positives due to its sensitivity to potential security risks, which include not only traditional vulnerabilities but also risks such as information leakage. The study further refines existing dataset labels into finer granularity. In a human evaluation with 10 experts on 200 samples, each participant identified, on average, 1.7 additional vulnerability samples and 1.1 lines of risky code when assisted by CoTVD, demonstrating its effectiveness in real-world scenarios.</div></div><div><h3>Conclusion:</h3><div>CoTVD enables function-level vulnerability detection with detailed analysis and proves effective in identifying vulnerabilities and code risks in practice, offering a novel approach for vulnerability detection and security analysis.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"193 ","pages":"Article 108043"},"PeriodicalIF":4.3,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transformer-aware sequence-to-sequence network for personalized tag recommendation in software information sites 面向软件信息站点个性化标签推荐的变压器感知序列到序列网络
IF 4.3 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-30 DOI: 10.1016/j.infsof.2026.108061
Shubhi Bansal , Jahnavi Sunchu , Shahid Shafi Dar, Nagendra Kumar

Context:

Automatic tag recommendation is crucial for content understanding and retrieval on software information sites. Existing approaches formulate tag recommendation as either multi-label classification problem or sentence matching. However, multi-label classification by treating tags as independent labels, neglects semantic relationships and dependencies among them, leading to inconsistent recommendations. Sentence matching techniques, which rely on lexical similarity, fail to capture contextual information and broader semantic meaning. Moreover, several works leverage limited content sources such as title, body, and code snippet of software objects, leading to data sparsity issues. Extant research provides generic tag suggestions, overlooking users’ expertise and interests.

Objective:

To address these limitations, we propose a novel trANsformer-based sequeNce-to-sequence framework for persOnlaised TAg recommendaTION, dubbed as ANNOTATION.

Methods:

This approach enables the model to learn dependencies between tags and generate contextually relevant recommendations. To enhance the representation of software objects and mitigate data sparsity, we incorporate valuable information from associated comments, such as clarifications, usage examples, and bug reports. This additional conversational context provides insights into the problem, alternative solutions, and related concepts discussed by the community, resulting in more informed tag recommendations. Furthermore, we personalize tag suggestions by incorporating user profile descriptions and badges, which reflect users’ expertise and interests within specific domains. This ensures that generated tags align with both the software object and the user’s specific knowledge domain, contributing to a more tailored user experience.

Results:

Extensive empirical and qualitative evaluations on datasets from Code Review and Stack Overflow demonstrate that our approach significantly outperforms state-of-the-art methods.

Conclusion:

Our findings highlight the importance of considering tag dependencies, contextual information, and user preferences for accurate and personalized tag recommendation in software information sites.
上下文:自动标签推荐对于软件信息站点的内容理解和检索至关重要。现有的方法将标签推荐描述为多标签分类问题或句子匹配问题。然而,多标签分类将标签视为独立标签,忽略了标签之间的语义关系和依赖关系,导致推荐结果不一致。句子匹配技术依赖于词汇相似性,无法捕捉上下文信息和更广泛的语义。此外,一些作品利用了有限的内容来源,如软件对象的标题、主体和代码片段,从而导致数据稀疏性问题。现有的研究提供了通用的标签建议,忽略了用户的专业知识和兴趣。目的:为了解决这些限制,我们提出了一种新的基于变压器的个性化标签推荐序列到序列框架,称为ANNOTATION。方法:这种方法使模型能够学习标签之间的依赖关系,并生成上下文相关的建议。为了增强软件对象的表示并减轻数据的稀疏性,我们从相关的注释中合并了有价值的信息,例如澄清、使用示例和错误报告。这个额外的对话上下文提供了对问题、替代解决方案和社区讨论的相关概念的见解,从而产生更明智的标签建议。此外,我们通过结合用户简介描述和徽章来个性化标签建议,这反映了用户在特定领域的专业知识和兴趣。这确保了生成的标签与软件对象和用户的特定知识领域保持一致,从而有助于更定制化的用户体验。结果:对来自Code Review和Stack Overflow的数据集进行广泛的实证和定性评估表明,我们的方法明显优于最先进的方法。结论:我们的研究结果强调了在软件信息站点中考虑标签依赖性、上下文信息和用户偏好对于准确和个性化标签推荐的重要性。
{"title":"Transformer-aware sequence-to-sequence network for personalized tag recommendation in software information sites","authors":"Shubhi Bansal ,&nbsp;Jahnavi Sunchu ,&nbsp;Shahid Shafi Dar,&nbsp;Nagendra Kumar","doi":"10.1016/j.infsof.2026.108061","DOIUrl":"10.1016/j.infsof.2026.108061","url":null,"abstract":"<div><h3>Context:</h3><div>Automatic tag recommendation is crucial for content understanding and retrieval on software information sites. Existing approaches formulate tag recommendation as either multi-label classification problem or sentence matching. However, multi-label classification by treating tags as independent labels, neglects semantic relationships and dependencies among them, leading to inconsistent recommendations. Sentence matching techniques, which rely on lexical similarity, fail to capture contextual information and broader semantic meaning. Moreover, several works leverage limited content sources such as title, body, and code snippet of software objects, leading to data sparsity issues. Extant research provides generic tag suggestions, overlooking users’ expertise and interests.</div></div><div><h3>Objective:</h3><div>To address these limitations, we propose a novel tr<u><strong>AN</strong></u>sformer-based seque<u><strong>N</strong></u>ce-to-sequence framework for pers<u><strong>O</strong></u>nlaised <u><strong>TA</strong></u>g recommenda<u><strong>TION</strong></u>, dubbed as ANNOTATION.</div></div><div><h3>Methods:</h3><div>This approach enables the model to learn dependencies between tags and generate contextually relevant recommendations. To enhance the representation of software objects and mitigate data sparsity, we incorporate valuable information from associated comments, such as clarifications, usage examples, and bug reports. This additional conversational context provides insights into the problem, alternative solutions, and related concepts discussed by the community, resulting in more informed tag recommendations. Furthermore, we personalize tag suggestions by incorporating user profile descriptions and badges, which reflect users’ expertise and interests within specific domains. This ensures that generated tags align with both the software object and the user’s specific knowledge domain, contributing to a more tailored user experience.</div></div><div><h3>Results:</h3><div>Extensive empirical and qualitative evaluations on datasets from Code Review and Stack Overflow demonstrate that our approach significantly outperforms state-of-the-art methods.</div></div><div><h3>Conclusion:</h3><div>Our findings highlight the importance of considering tag dependencies, contextual information, and user preferences for accurate and personalized tag recommendation in software information sites.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"193 ","pages":"Article 108061"},"PeriodicalIF":4.3,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Template-guided interpretable reasoning with execution feedback for LLM-based program repair 基于llm的程序修复中带有执行反馈的模板引导可解释推理
IF 4.3 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-28 DOI: 10.1016/j.infsof.2026.108058
Sichong Hao , Xianjun Shi , Hongwei Liu , Yuyang Yin , Xi Chen

Context:

Automated Program Repair (APR) seeks to fix software defects automatically. Large language models (LLMs) show promise in APR, especially when combined with traditional template-based methods. However, existing approaches suffer from low accuracy, poor interpretability, and incompatibility with mainstream closed-source LLMs, leaving the synergy between traditional methods and LLMs underexplored.

Objectives

: We propose TRACE, a template-guided reasoning framework that enhances LLMs’ bug-fixing performance across multiple programming languages, while improving repair interpretability to help developers understand and validate generated patches.

Methods:

Our approach is novel in integrating both the successful patterns from repair templates across multiple programming languages and the failure experiences from previous execution results into the LLM-driven repair reasoning process. TRACE comprises three components: (1) a multilingual repair template repository that covers 47 common error types and their corresponding repair actions; (2) a repair reasoning engine that, under template guidance, first analyzes root causes and then formulates concrete repair strategies; and (3) a feedback memory buffer that dynamically records execution data from failed attempts to guide subsequent repair iterations. Moreover, our approach enhances the interpretability of results by providing a reasoning trace that explains each patch.

Results

: Extensive experiments on the widely used Defects4J and recent multilingual DebugBench benchmarks demonstrate that TRACE achieves state-of-the-art repair performance. TRACE successfully repairs 204 bugs in Defects4J, outperforming the best baseline APR approach by 14.6%. On DebugBench, TRACE demonstrates its multilingual repair ability by achieving state-of-the-art results in Python, Java, and C++.

Conclusions:

Our approach bridges the gap between template-based methods and mainstream LLMs by effectively integrating different types of domain knowledge (i.e., common error types, corresponding repair templates, and execution feedback). This not only enhances repair performance across multiple programming languages but also improves result interpretability. Overall, our research highlights the promising prospects of integrating APR-specific domain knowledge with LLM reasoning in practice.
上下文:自动程序修复(APR)寻求自动修复软件缺陷。大型语言模型(llm)在APR中显示出前景,特别是与传统的基于模板的方法结合使用时。然而,现有方法存在准确率低、可解释性差、与主流闭源llm不兼容等问题,使得传统方法与llm之间的协同作用尚未得到充分探索。目标:我们提出TRACE,这是一个模板引导的推理框架,它可以增强llm跨多种编程语言的bug修复性能,同时提高修复的可解释性,以帮助开发人员理解和验证生成的补丁。方法:我们的方法在将跨多种编程语言的修复模板的成功模式和先前执行结果的失败经验集成到llm驱动的修复推理过程中是新颖的。TRACE由三个部分组成:(1)多语言修复模板库,涵盖47种常见错误类型及其相应的修复操作;(2)修复推理引擎,在模板指导下,首先分析根本原因,然后制定具体的修复策略;(3)反馈存储器缓冲区,动态记录失败尝试的执行数据,以指导后续的修复迭代。此外,我们的方法通过提供解释每个补丁的推理跟踪来增强结果的可解释性。结果:在广泛使用的Defects4J和最近的多语言DebugBench基准测试上进行的大量实验表明,TRACE实现了最先进的修复性能。TRACE成功地修复了缺陷4j中的204个bug,比最佳基准APR方法高出14.6%。在DebugBench上,TRACE通过在Python、Java和c++中实现最先进的结果来展示其多语言修复能力。结论:我们的方法通过有效地集成不同类型的领域知识(即常见错误类型、相应的修复模板和执行反馈),弥合了基于模板的方法和主流法学硕士之间的差距。这不仅提高了跨多种编程语言的修复性能,还提高了结果的可解释性。总的来说,我们的研究突出了在实践中将apr特定领域知识与LLM推理相结合的前景。
{"title":"Template-guided interpretable reasoning with execution feedback for LLM-based program repair","authors":"Sichong Hao ,&nbsp;Xianjun Shi ,&nbsp;Hongwei Liu ,&nbsp;Yuyang Yin ,&nbsp;Xi Chen","doi":"10.1016/j.infsof.2026.108058","DOIUrl":"10.1016/j.infsof.2026.108058","url":null,"abstract":"<div><h3>Context:</h3><div>Automated Program Repair (APR) seeks to fix software defects automatically. Large language models (LLMs) show promise in APR, especially when combined with traditional template-based methods. However, existing approaches suffer from low accuracy, poor interpretability, and incompatibility with mainstream closed-source LLMs, leaving the synergy between traditional methods and LLMs underexplored.</div></div><div><h3>Objectives</h3><div>: We propose TRACE, a template-guided reasoning framework that enhances LLMs’ bug-fixing performance across multiple programming languages, while improving repair interpretability to help developers understand and validate generated patches.</div></div><div><h3>Methods:</h3><div>Our approach is novel in integrating both the successful patterns from repair templates across multiple programming languages and the failure experiences from previous execution results into the LLM-driven repair reasoning process. TRACE comprises three components: (1) a multilingual repair template repository that covers 47 common error types and their corresponding repair actions; (2) a repair reasoning engine that, under template guidance, first analyzes root causes and then formulates concrete repair strategies; and (3) a feedback memory buffer that dynamically records execution data from failed attempts to guide subsequent repair iterations. Moreover, our approach enhances the interpretability of results by providing a reasoning trace that explains each patch.</div></div><div><h3>Results</h3><div>: Extensive experiments on the widely used Defects4J and recent multilingual DebugBench benchmarks demonstrate that TRACE achieves state-of-the-art repair performance. TRACE successfully repairs 204 bugs in Defects4J, outperforming the best baseline APR approach by 14.6%. On DebugBench, TRACE demonstrates its multilingual repair ability by achieving state-of-the-art results in Python, Java, and C++.</div></div><div><h3>Conclusions:</h3><div>Our approach bridges the gap between template-based methods and mainstream LLMs by effectively integrating different types of domain knowledge (i.e., common error types, corresponding repair templates, and execution feedback). This not only enhances repair performance across multiple programming languages but also improves result interpretability. Overall, our research highlights the promising prospects of integrating APR-specific domain knowledge with LLM reasoning in practice.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"193 ","pages":"Article 108058"},"PeriodicalIF":4.3,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CONKER: CONtrastive knowledge enhanced retrieval for app review bug classification CONKER:应用审查漏洞分类的对比知识增强检索
IF 4.3 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-27 DOI: 10.1016/j.infsof.2026.108056
Meysam Roostaee , Seyed Mostafa Fakhrahmad , Mohammad Hadi Sadreddini

Context

Accurate and timely identification of software defects from user-generated app reviews is crucial for efficient software maintenance. However, automated classifiers face challenges with the volume and heterogeneity of review text, often struggling with generalization, decision transparency, and learning from limited labeled examples per bug category.

Objective

This paper introduces CONKER (CONtrastive Knowledge Enhanced Retrieval), a novel framework to address these issues by improving accuracy and providing inherent interpretability for multi-class app review bug classification.

Methods

CONKER employs a two-stage process. First, a transformer-based sentence encoder is fine-tuned using supervised contrastive learning (Multiple Negatives Ranking Loss) on the MABLE benchmark to create a discriminative embedding space. Second, a classification-by-retrieval mechanism assigns labels by matching new reviews against curated class-specific exemplars. The framework is evaluated using 10-fold cross-validation, an ablation study, and a human-in-the-loop study with professional developers.

Results

On the MABLE dataset, CONKER achieves a new state-of-the-art weighted F1-score of 0.8133. Critically, it also attains a Macro F1-score over 13 percentage points higher than the strongest baseline, demonstrating superior performance on minority classes. A new zero-shot experiment provides quantitative evidence of its strong cross-platform generalization, where CONKER outperforms a RoBERTa baseline by 10.1 percentage points on a developer-annotated Apple App Store dataset. This is complemented by a human-in-the-loop study that validated the practical utility of CONKER's exemplar-based explanations. Ablation studies confirm that both the contrastive fine-tuning and the per-class retrieval architecture are fundamental to this multi-faceted success.

Conclusion

Contrastive learning-driven retrieval is a scalable, interpretable, and high-fidelity approach for nuanced text categorization in empirical software engineering. The CONKER framework demonstrates that by engineering a task-specific semantic space, it is possible to reconcile high predictive accuracy with the transparency required for practical software maintenance workflows.
从用户生成的应用程序评论中准确及时地识别软件缺陷对于有效的软件维护至关重要。然而,自动分类器面临着审查文本的数量和异质性的挑战,经常在泛化、决策透明度和从每个错误类别的有限标记示例中学习方面挣扎。目的介绍CONKER(对比知识增强检索)框架,该框架通过提高准确率和提供多类应用审查错误分类的固有可解释性来解决这些问题。方法conker采用两阶段流程。首先,在MABLE基准上使用监督对比学习(Multiple negative Ranking Loss)对基于变压器的句子编码器进行微调,以创建判别嵌入空间。其次,通过检索分类的机制通过匹配新的评论来分配标签。该框架通过10倍交叉验证、消融研究和由专业开发人员进行的人在循环研究来评估。结果在MABLE数据集上,CONKER获得了新的最先进加权f1得分0.8133。至关重要的是,它还获得了比最强基准高出13个百分点以上的宏观f1分数,显示出在少数族裔班级的优异表现。一项新的零概率实验为其强大的跨平台泛化提供了定量证据,在开发者注释的苹果应用商店数据集上,CONKER比RoBERTa基线高出10.1个百分点。这是由一项“人在循环”的研究补充的,该研究验证了CONKER基于范例的解释的实际效用。消融研究证实,对比微调和每类检索架构都是这种多方面成功的基础。结论对比学习驱动检索是一种可扩展、可解释和高保真的方法,适用于经验软件工程中精细的文本分类。CONKER框架表明,通过设计特定于任务的语义空间,可以将高预测准确性与实际软件维护工作流所需的透明度相协调。
{"title":"CONKER: CONtrastive knowledge enhanced retrieval for app review bug classification","authors":"Meysam Roostaee ,&nbsp;Seyed Mostafa Fakhrahmad ,&nbsp;Mohammad Hadi Sadreddini","doi":"10.1016/j.infsof.2026.108056","DOIUrl":"10.1016/j.infsof.2026.108056","url":null,"abstract":"<div><h3>Context</h3><div>Accurate and timely identification of software defects from user-generated app reviews is crucial for efficient software maintenance. However, automated classifiers face challenges with the volume and heterogeneity of review text, often struggling with generalization, decision transparency, and learning from limited labeled examples per bug category.</div></div><div><h3>Objective</h3><div>This paper introduces CONKER (CONtrastive Knowledge Enhanced Retrieval), a novel framework to address these issues by improving accuracy and providing inherent interpretability for multi-class app review bug classification.</div></div><div><h3>Methods</h3><div>CONKER employs a two-stage process. First, a transformer-based sentence encoder is fine-tuned using supervised contrastive learning (Multiple Negatives Ranking Loss) on the MABLE benchmark to create a discriminative embedding space. Second, a classification-by-retrieval mechanism assigns labels by matching new reviews against curated class-specific exemplars. The framework is evaluated using 10-fold cross-validation, an ablation study, and a human-in-the-loop study with professional developers.</div></div><div><h3>Results</h3><div>On the MABLE dataset, CONKER achieves a new state-of-the-art weighted F1-score of 0.8133. Critically, it also attains a Macro F1-score over 13 percentage points higher than the strongest baseline, demonstrating superior performance on minority classes. A new zero-shot experiment provides quantitative evidence of its strong cross-platform generalization, where CONKER outperforms a RoBERTa baseline by 10.1 percentage points on a developer-annotated Apple App Store dataset. This is complemented by a human-in-the-loop study that validated the practical utility of CONKER's exemplar-based explanations. Ablation studies confirm that both the contrastive fine-tuning and the per-class retrieval architecture are fundamental to this multi-faceted success.</div></div><div><h3>Conclusion</h3><div>Contrastive learning-driven retrieval is a scalable, interpretable, and high-fidelity approach for nuanced text categorization in empirical software engineering. The CONKER framework demonstrates that by engineering a task-specific semantic space, it is possible to reconcile high predictive accuracy with the transparency required for practical software maintenance workflows.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"193 ","pages":"Article 108056"},"PeriodicalIF":4.3,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A first and fast symbolic approach for data-aware business process compliance checking 用于数据感知业务流程遵从性检查的第一种快速符号方法
IF 4.3 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-27 DOI: 10.1016/j.infsof.2026.108047
Silvano Colombo Tosatto , Hannah Burke , Nick van Beest , Heerko Groefsema
State of the art design-time compliance checking techniques are generally limited to verifying control-flow relations. In this paper, we propose an algorithm for verifying design-time compliance of business process models including data. For a faithful resemblance to real-world cases, we include a degree of uncertainty in the outcomes of activities, such that a single execution is not guaranteed to lead to a unique data state. We represent the compliance requirements using conditional obligations. Our proposed approach uses a symbolic representation to compactly represent the possible data states across an execution, and evaluates them as a constraint satisfaction problem to first prune the state space of paths that require no further analysis and finally to evaluate if an execution violates one of the regulatory requirements. The performance of the approach is evaluated against existing algorithms implemented in NuXMV. The results show that our approach significantly outperforms the existing algorithms across the board.
目前的设计时遵从性检查技术通常仅限于验证控制流关系。在本文中,我们提出了一种验证包含数据的业务流程模型的设计时遵从性的算法。为了忠实地与现实世界的情况相似,我们在活动的结果中包含一定程度的不确定性,这样单个执行就不能保证导致唯一的数据状态。我们使用有条件义务来表示合规性要求。我们提出的方法使用符号表示来紧凑地表示执行过程中可能的数据状态,并将它们作为约束满足问题进行评估,首先修剪不需要进一步分析的路径的状态空间,最后评估执行是否违反了法规要求之一。根据NuXMV中实现的现有算法对该方法的性能进行了评估。结果表明,我们的方法明显优于现有的算法。
{"title":"A first and fast symbolic approach for data-aware business process compliance checking","authors":"Silvano Colombo Tosatto ,&nbsp;Hannah Burke ,&nbsp;Nick van Beest ,&nbsp;Heerko Groefsema","doi":"10.1016/j.infsof.2026.108047","DOIUrl":"10.1016/j.infsof.2026.108047","url":null,"abstract":"<div><div>State of the art design-time compliance checking techniques are generally limited to verifying control-flow relations. In this paper, we propose an algorithm for verifying design-time compliance of business process models including data. For a faithful resemblance to real-world cases, we include a degree of uncertainty in the outcomes of activities, such that a single execution is not guaranteed to lead to a unique data state. We represent the compliance requirements using conditional obligations. Our proposed approach uses a symbolic representation to compactly represent the possible data states across an execution, and evaluates them as a constraint satisfaction problem to first prune the state space of paths that require no further analysis and finally to evaluate if an execution violates one of the regulatory requirements. The performance of the approach is evaluated against existing algorithms implemented in NuXMV. The results show that our approach significantly outperforms the existing algorithms across the board.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"193 ","pages":"Article 108047"},"PeriodicalIF":4.3,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synergistic enhancement of requirement-to-code traceability: A framework combining large language model based data augmentation and an advanced encoder 需求到代码可追溯性的协同增强:结合基于数据增强的大型语言模型和高级编码器的框架
IF 4.3 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-27 DOI: 10.1016/j.infsof.2026.108045
Jianzhang Zhang , Jialong Zhou , Nan Niu , Jinping Hua , Chuang Liu

Context:

Automated requirement-to-code traceability link recovery, essential for industrial system quality and safety, is critically hindered by the scarcity of labeled data, especially for methods relying on supervised fine-tuning.

Objective:

To address this bottleneck, this paper proposes and validates a synergistic framework that integrates large language model (LLM)-driven data augmentation with an advanced encoder.

Methods:

We first optimize data augmentation through a systematic evaluation of bi-directional and zero/few-shot prompting strategies. Building on the augmented data, we enhance an established state-of-the-art pre-trained language model based method by incorporating an encoder distinguished by a broader pre-training corpus and an extended context window. We conduct experiments on four public datasets, including rigorous validity checks and cross-project evaluations, to quantify the distinct contributions of our framework’s components.

Results:

We demonstrate that the choice among leading LLMs is not a significant performance factor, whereas the prompting strategy is effective. Data augmentation on its own consistently improves the baseline method, providing substantial performance gains of up to 26.66% in average F2 score, while applying it to the advanced encoder unlocks a further maximum 10.00% lift in recall. This synergy culminates in a fully optimized framework that decisively outperforms ten established baselines with maximum improvements of 23.13% on F1 score and 21.37% on F2 score, and achieves a recall of 50.70% in strict cross-project evaluations, significantly surpassing the 21.82% recall of a state-of-the-art RAG approach.

Conclusion:

This work contributes a pragmatic and scalable methodology to overcome the data scarcity bottleneck, paving the way for broader industrial adoption of data-driven requirement-to-code traceability.
背景:对工业系统质量和安全至关重要的自动化需求到代码的可追溯性链接恢复,受到标记数据的稀缺性的严重阻碍,特别是依赖于监督微调的方法。目的:为了解决这个瓶颈,本文提出并验证了一个协同框架,该框架将大型语言模型(LLM)驱动的数据增强与高级编码器集成在一起。方法:我们首先通过系统评估双向和零/少针提示策略来优化数据增强。在增强数据的基础上,我们通过结合一个编码器,以更广泛的预训练语料库和扩展的上下文窗口为特征,增强了一个已建立的最先进的基于预训练语言模型的方法。我们在四个公共数据集上进行实验,包括严格的有效性检查和跨项目评估,以量化我们框架组件的不同贡献。结果:我们证明,在领先法学硕士之间的选择不是一个显著的绩效因素,而激励策略是有效的。数据增强本身可以持续改进基线方法,在平均F2分数上提供高达26.66%的显著性能提升,而将其应用于高级编码器可以进一步解锁最大10.00%的召回率提升。这种协同作用在一个完全优化的框架中达到顶峰,该框架在F1分数上的最大改进为23.13%,在F2分数上的最大改进为21.37%,在严格的跨项目评估中达到50.70%的召回率,大大超过了最先进的RAG方法的21.82%召回率。结论:这项工作提供了一种实用且可扩展的方法来克服数据稀缺性瓶颈,为更广泛的工业采用数据驱动的需求到代码的可追溯性铺平了道路。
{"title":"Synergistic enhancement of requirement-to-code traceability: A framework combining large language model based data augmentation and an advanced encoder","authors":"Jianzhang Zhang ,&nbsp;Jialong Zhou ,&nbsp;Nan Niu ,&nbsp;Jinping Hua ,&nbsp;Chuang Liu","doi":"10.1016/j.infsof.2026.108045","DOIUrl":"10.1016/j.infsof.2026.108045","url":null,"abstract":"<div><h3>Context:</h3><div>Automated requirement-to-code traceability link recovery, essential for industrial system quality and safety, is critically hindered by the scarcity of labeled data, especially for methods relying on supervised fine-tuning.</div></div><div><h3>Objective:</h3><div>To address this bottleneck, this paper proposes and validates a synergistic framework that integrates large language model (LLM)-driven data augmentation with an advanced encoder.</div></div><div><h3>Methods:</h3><div>We first optimize data augmentation through a systematic evaluation of bi-directional and zero/few-shot prompting strategies. Building on the augmented data, we enhance an established state-of-the-art pre-trained language model based method by incorporating an encoder distinguished by a broader pre-training corpus and an extended context window. We conduct experiments on four public datasets, including rigorous validity checks and cross-project evaluations, to quantify the distinct contributions of our framework’s components.</div></div><div><h3>Results:</h3><div>We demonstrate that the choice among leading LLMs is not a significant performance factor, whereas the prompting strategy is effective. Data augmentation on its own consistently improves the baseline method, providing substantial performance gains of up to 26.66% in average <span><math><msub><mrow><mi>F</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span> score, while applying it to the advanced encoder unlocks a further maximum 10.00% lift in recall. This synergy culminates in a fully optimized framework that decisively outperforms ten established baselines with maximum improvements of 23.13% on <span><math><msub><mrow><mi>F</mi></mrow><mrow><mn>1</mn></mrow></msub></math></span> score and 21.37% on <span><math><msub><mrow><mi>F</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span> score, and achieves a recall of 50.70% in strict cross-project evaluations, significantly surpassing the 21.82% recall of a state-of-the-art RAG approach.</div></div><div><h3>Conclusion:</h3><div>This work contributes a pragmatic and scalable methodology to overcome the data scarcity bottleneck, paving the way for broader industrial adoption of data-driven requirement-to-code traceability.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"193 ","pages":"Article 108045"},"PeriodicalIF":4.3,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A dynamic software start-up competence model 动态软件创业能力模型
IF 4.3 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-27 DOI: 10.1016/j.infsof.2026.108057
Nana Assyne , Alfred Nyadroh , Emmanuel Adabor , Emmanuel Antwi-Boasiako , Isaac Wiafe

Context

Software startups are important drivers of innovation, job creation, and digital transformation. However, more than 60% fail within the first five years, often due to gaps in competencies that change as the organization evolves.

Objective

This study proposes and validates a dynamic competence model for software startups, aligned with the Crowne lifecycle framework comprising Startup, Stabilization, Growth, and Maturity stages.

Method

A mixed methods approach was applied, combining qualitative interviews with quantitative surveys. The study identifies evolving competencies across three domains: business, architecture, and innovation.

Results

The findings reveal that competencies shift from being “desired” to “required” as startups progress through lifecycle stages. In the Startup stage, business competencies such as corporate structuring are essential, while project management and customer support become critical during Growth and Maturity. Technical competencies like programming and algorithms are vital early on, with advanced skills such as application security gaining importance during scaling. Innovation competencies, particularly problem solving and persistence, remain consistently critical, while creative and abstract thinking change in emphasis as startups mature.

Conclusion

The dynamic competence model addresses gaps in static frameworks by offering actionable insights for practitioners to prioritize skill development and resource allocation. It also provides a foundation for researchers to study the evolution of competencies in entrepreneurial contexts. The study recommends extending the model to other sectors and conducting longitudinal studies to enhance its applicability, ultimately contributing to more sustainable software engineering practices and improved startup success.
软件创业公司是创新、创造就业和数字化转型的重要驱动力。然而,超过60%的人在前五年内失败,这通常是由于随着组织的发展而变化的能力差距。本研究提出并验证了软件初创公司的动态能力模型,该模型与crown生命周期框架相一致,包括启动、稳定、成长和成熟阶段。方法采用定性访谈与定量调查相结合的混合方法。该研究确定了跨越三个领域的不断发展的能力:业务、架构和创新。研究结果表明,随着创业公司在生命周期阶段的发展,能力从“期望”转变为“需要”。在启动阶段,企业结构等业务能力是必不可少的,而项目管理和客户支持在成长和成熟阶段变得至关重要。像编程和算法这样的技术能力在早期是至关重要的,而像应用程序安全这样的高级技能在扩展期间变得越来越重要。创新能力,尤其是解决问题的能力和坚持不懈的能力,始终是至关重要的,而随着创业公司的成熟,创造性和抽象思维的重要性也在改变。结论动态能力模型通过为从业者提供可操作的见解来优先考虑技能发展和资源分配,从而解决了静态框架中的空白。这也为研究创业环境下胜任力的演化提供了基础。该研究建议将该模型扩展到其他部门,并进行纵向研究以增强其适用性,最终有助于更可持续的软件工程实践和提高创业成功率。
{"title":"A dynamic software start-up competence model","authors":"Nana Assyne ,&nbsp;Alfred Nyadroh ,&nbsp;Emmanuel Adabor ,&nbsp;Emmanuel Antwi-Boasiako ,&nbsp;Isaac Wiafe","doi":"10.1016/j.infsof.2026.108057","DOIUrl":"10.1016/j.infsof.2026.108057","url":null,"abstract":"<div><h3>Context</h3><div>Software startups are important drivers of innovation, job creation, and digital transformation. However, more than 60% fail within the first five years, often due to gaps in competencies that change as the organization evolves.</div></div><div><h3>Objective</h3><div>This study proposes and validates a dynamic competence model for software startups, aligned with the Crowne lifecycle framework comprising Startup, Stabilization, Growth, and Maturity stages.</div></div><div><h3>Method</h3><div>A mixed methods approach was applied, combining qualitative interviews with quantitative surveys. The study identifies evolving competencies across three domains: business, architecture, and innovation.</div></div><div><h3>Results</h3><div>The findings reveal that competencies shift from being “desired” to “required” as startups progress through lifecycle stages. In the Startup stage, business competencies such as corporate structuring are essential, while project management and customer support become critical during Growth and Maturity. Technical competencies like programming and algorithms are vital early on, with advanced skills such as application security gaining importance during scaling. Innovation competencies, particularly problem solving and persistence, remain consistently critical, while creative and abstract thinking change in emphasis as startups mature.</div></div><div><h3>Conclusion</h3><div>The dynamic competence model addresses gaps in static frameworks by offering actionable insights for practitioners to prioritize skill development and resource allocation. It also provides a foundation for researchers to study the evolution of competencies in entrepreneurial contexts. The study recommends extending the model to other sectors and conducting longitudinal studies to enhance its applicability, ultimately contributing to more sustainable software engineering practices and improved startup success.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"193 ","pages":"Article 108057"},"PeriodicalIF":4.3,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Broadening software value from technical to management perspectives: a stakeholder-driven exploration using a modified-Delphi approach 将软件价值从技术角度扩展到管理角度:使用改进的德尔菲方法进行利益相关者驱动的探索
IF 4.3 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-24 DOI: 10.1016/j.infsof.2026.108046
Chalani Oruthotaarachchi, Janaka Wijayanayake

Context

Software value is increasingly recognized as a multi-dimensional construct, encompassing technical performance, business alignment, and user-centric factors. While previous research has explored individual aspects of software value such as functionality, cost, and usability, an integrated and holistic discussion remains lacking.

Objectives

This research is conducted to emphasize a holistic view of software value through investigating value perspectives of diverse stakeholder categories, shared determinants enabling value alignment, and the external forces affecting such value perceptions.

Methods

A hybrid consensus-building approach that combines Delphi approach with Group Discussions was followed. Software process experts, technical experts, and enterprise system users expressed their opinions.

Results

Findings reveal distinct stakeholder priorities, where process experts emphasize strategic alignment, enterprise users focus on efficiency, and technical experts prioritize functional integrity. These priorities are influenced by system scope, technological constraints, and business domain. Synthesizing these insights, the study presents a structured software value model, offering practical guidance for practitioners and organizations to align development priorities with stakeholder expectations.

Conclusion

Software value is not a static description but differs depending on the role of stakeholders, their workplace contexts, personal priorities, and external circumstances. Further research is needed to investigate how stakeholders’ value perceptions change over time during the software development life cycle.
ContextSoftware的价值越来越被认为是一个多维结构,包括技术性能、业务一致性和以用户为中心的因素。虽然以前的研究已经探索了软件价值的各个方面,如功能、成本和可用性,但仍然缺乏集成和整体的讨论。本研究旨在通过调查不同利益相关者类别的价值视角、实现价值一致性的共同决定因素以及影响这种价值感知的外部力量,来强调软件价值的整体观点。方法采用德尔菲法和小组讨论相结合的混合共识建立方法。软件过程专家、技术专家和企业系统用户发表了他们的意见。结果发现揭示了不同的涉众优先级,其中流程专家强调战略一致性,企业用户关注效率,技术专家优先考虑功能完整性。这些优先级受到系统范围、技术约束和业务领域的影响。综合这些见解,该研究提出了一个结构化的软件价值模型,为从业者和组织提供了实际的指导,以使开发优先级与涉众期望保持一致。软件价值不是一个静态的描述,而是根据利益相关者的角色、他们的工作环境、个人优先级和外部环境而有所不同。需要进一步的研究来调查涉众的价值观念在软件开发生命周期中是如何随时间变化的。
{"title":"Broadening software value from technical to management perspectives: a stakeholder-driven exploration using a modified-Delphi approach","authors":"Chalani Oruthotaarachchi,&nbsp;Janaka Wijayanayake","doi":"10.1016/j.infsof.2026.108046","DOIUrl":"10.1016/j.infsof.2026.108046","url":null,"abstract":"<div><h3>Context</h3><div>Software value is increasingly recognized as a multi-dimensional construct, encompassing technical performance, business alignment, and user-centric factors. While previous research has explored individual aspects of software value such as functionality, cost, and usability, an integrated and holistic discussion remains lacking.</div></div><div><h3>Objectives</h3><div>This research is conducted to emphasize a holistic view of software value through investigating value perspectives of diverse stakeholder categories, shared determinants enabling value alignment, and the external forces affecting such value perceptions.</div></div><div><h3>Methods</h3><div>A hybrid consensus-building approach that combines Delphi approach with Group Discussions was followed. Software process experts, technical experts, and enterprise system users expressed their opinions.</div></div><div><h3>Results</h3><div>Findings reveal distinct stakeholder priorities, where process experts emphasize strategic alignment, enterprise users focus on efficiency, and technical experts prioritize functional integrity. These priorities are influenced by system scope, technological constraints, and business domain. Synthesizing these insights, the study presents a structured software value model, offering practical guidance for practitioners and organizations to align development priorities with stakeholder expectations.</div></div><div><h3>Conclusion</h3><div>Software value is not a static description but differs depending on the role of stakeholders, their workplace contexts, personal priorities, and external circumstances. Further research is needed to investigate how stakeholders’ value perceptions change over time during the software development life cycle.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"193 ","pages":"Article 108046"},"PeriodicalIF":4.3,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An architectural perspective on MLOps: Structures, processes, tools, and stakeholders mlop的体系结构视角:结构、过程、工具和涉众
IF 4.3 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-24 DOI: 10.1016/j.infsof.2026.108029
Faezeh Amou Najafabadi, Justus Bogner, Ilias Gerostathopoulos, Patricia Lago

Context:

Despite the increasing adoption of Machine Learning Operations (MLOps), teams still encounter challenges in effectively applying this paradigm to their projects. While numerous MLOps tools exist, consolidated knowledge to inform architecture design is still lacking.

Objective:

In response, our goal is to provide a comprehensive overview of MLOps architectures from a structural and process perspective, the tools mentioned supporting the implementation of architecture components, and the stakeholders responsible for the MLOps process.

Methods:

We conduct a systematic mapping study of 93 primary studies to collect and analyze the state of the art knowledge on MLOps systems using automatic, manual, and snowballing-based search strategies. Subsequently, we use card sorting to synthesize the results.

Results:

We contribute: (i) a categorization of 39 MLOps architecture components and a description of several MLOps architecture variants; (ii) a systematic map between the components and the existing MLOps tools; (iii) a description of 56 process steps for MLOps systems creation, deployment, and maintenance; and (iv) a description of MLOps stakeholders, their responsibilities, and a systematic map between the process steps and the responsible stakeholders.

Conclusion:

Our results serve as an overview of the state of the art in MLOps architectures from a structural and a process perspective to support researchers and practitioners in the architecture design of their MLOps systems.
背景:尽管越来越多的人采用机器学习操作(MLOps),但团队在有效地将这种模式应用到他们的项目中仍然遇到挑战。虽然存在许多MLOps工具,但是仍然缺乏为体系结构设计提供信息的统一知识。目标:作为回应,我们的目标是从结构和过程的角度提供对MLOps体系结构的全面概述,提到的支持体系结构组件实现的工具,以及负责MLOps过程的涉众。方法:我们对93项主要研究进行了系统的制图研究,以收集和分析关于MLOps系统的最新知识,使用自动、手动和基于滚雪球的搜索策略。随后,我们使用卡片排序来综合结果。结果:我们贡献了:(i)对39个MLOps架构组件进行了分类,并描述了几个MLOps架构变体;(ii)组件与现有MLOps工具之间的系统映射;(iii)对MLOps系统创建、部署和维护的56个流程步骤的描述;(iv)对MLOps利益相关者及其职责的描述,以及流程步骤与负责的利益相关者之间的系统映射。结论:我们的结果从结构和过程的角度概述了MLOps体系结构的现状,以支持研究人员和实践者进行MLOps系统的体系结构设计。
{"title":"An architectural perspective on MLOps: Structures, processes, tools, and stakeholders","authors":"Faezeh Amou Najafabadi,&nbsp;Justus Bogner,&nbsp;Ilias Gerostathopoulos,&nbsp;Patricia Lago","doi":"10.1016/j.infsof.2026.108029","DOIUrl":"10.1016/j.infsof.2026.108029","url":null,"abstract":"<div><h3>Context:</h3><div>Despite the increasing adoption of Machine Learning Operations (MLOps), teams still encounter challenges in effectively applying this paradigm to their projects. While numerous MLOps tools exist, consolidated knowledge to inform architecture design is still lacking.</div></div><div><h3>Objective:</h3><div>In response, our goal is to provide a comprehensive overview of MLOps architectures from a structural and process perspective, the tools mentioned supporting the implementation of architecture components, and the stakeholders responsible for the MLOps process.</div></div><div><h3>Methods:</h3><div>We conduct a systematic mapping study of 93 primary studies to collect and analyze the state of the art knowledge on MLOps systems using automatic, manual, and snowballing-based search strategies. Subsequently, we use card sorting to synthesize the results.</div></div><div><h3>Results:</h3><div>We contribute: (i) a categorization of 39 MLOps architecture components and a description of several MLOps architecture variants; (ii) a systematic map between the components and the existing MLOps tools; (iii) a description of 56 process steps for MLOps systems creation, deployment, and maintenance; and (iv) a description of MLOps stakeholders, their responsibilities, and a systematic map between the process steps and the responsible stakeholders.</div></div><div><h3>Conclusion:</h3><div>Our results serve as an overview of the state of the art in MLOps architectures from a structural and a process perspective to support researchers and practitioners in the architecture design of their MLOps systems.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"193 ","pages":"Article 108029"},"PeriodicalIF":4.3,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information and Software Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1