首页 > 最新文献

IET Software最新文献

英文 中文
Sentiment Analysis of Twitter Data on Quantum Computing: An Exploratory Silver-Label Baseline Study 量子计算上Twitter数据的情感分析:一项探索性的银标基线研究
IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-12-11 DOI: 10.1049/sfw2/1648095
Faisal Mehmood, Abeer Abdulaziz Alsanad, Muhammad Azeem Akbar, Víctor Leiva, Cecilia Castro

Quantum software engineering is advancing rapidly in parallel with equally ambitious hardware roadmaps. However, systematic evidence on how online audiences perceive these advances remains scarce. We present an exploratory baseline of Twitter sentiment toward quantum computing, using automated (silver-standard) labels for benchmarking. Six months of English-language tweets containing the hashtag #Quantum (December 1, 2022 and May 31, 2023) were processed, with #Quantum treated as a proxy for online discourse on quantum computing. We then applied a transparent natural language processing (NLP) methodology combining two zero-shot lexicon-based tools (TextBlob and the Valence Aware Dictionary and sEntiment Reasoner [VADER]) with three lightweight supervised classifiers (multinomial naïve Bayes, Rocchio, and perceptron). Following standard preprocessing and a stratified 70/30 train–test split, we do not aim to measure definitive public opinion; rather, our primary contribution is to establish a transparent and reproducible baseline for future benchmarking. In this context, the multinomial naïve Bayes classifier attained a macro F1-score of 0.88 on the 30% hold-out set when benchmarked against the TextBlob silver labels. This score captures internal agreement rather than accuracy against human annotation. All five methods converged on a largely—though not universally—positive sentiment orientation (≈78%–81% of nonneutral tweets, depending on the tool). Grounded in the technology acceptance model (TAM) and the unified theory of acceptance and use of technology (UTAUT), we interpret our results as indicating the constructs of curiosity and perceived usefulness, rather than unequivocal adoption readiness. These constructs were not operationalized and serve only as interpretative lenses. By documenting every preprocessing step and model configuration, and making tweet identifiers and code available upon request, the study delivers a reproducible benchmark against which future work can (i) extend the query vocabulary, (ii) incorporate neutral and fine-grained emotions, (iii) apply cross-validation protocols, and (iv) evaluate advanced transformer models on manually annotated data. Addressing these four points is essential before making any definitive claims about public opinion.

量子软件工程正在与同样雄心勃勃的硬件路线图同步快速发展。然而,关于在线受众如何看待这些进步的系统证据仍然很少。我们提出了Twitter对量子计算情绪的探索性基线,使用自动(银标准)标签进行基准测试。研究人员对包含#Quantum标签(2022年12月1日至2023年5月31日)的六个月英文推文进行了处理,#Quantum被视为量子计算在线讨论的代表。然后,我们应用了一种透明的自然语言处理(NLP)方法,该方法结合了两个基于零概率词典的工具(TextBlob和Valence Aware Dictionary and sEntiment Reasoner [VADER])以及三个轻量级监督分类器(多项式naïve贝叶斯、罗基奥和感知器)。遵循标准的预处理和分层的70/30火车测试分割,我们的目的不是衡量明确的公众意见;相反,我们的主要贡献是为未来的基准测试建立一个透明和可重复的基线。在这种情况下,当对TextBlob银色标签进行基准测试时,多项naïve贝叶斯分类器在30%的保留集上获得了0.88的宏观f1分数。这个分数捕获的是内部一致性,而不是相对于人类注释的准确性。所有五种方法都集中在一个很大程度上——尽管不是普遍的——积极的情绪取向上(≈78%-81%的非中立推文,取决于工具)。在技术接受模型(TAM)和技术接受与使用统一理论(UTAUT)的基础上,我们将我们的结果解释为表明好奇心和感知有用性的结构,而不是明确的采用准备。这些构念没有被操作化,只是作为解释透镜。通过记录每个预处理步骤和模型配置,并根据要求提供tweet标识符和代码,该研究提供了一个可重复的基准,未来的工作可以(i)扩展查询词汇表,(ii)纳入中性和细粒度的情感,(iii)应用交叉验证协议,以及(iv)评估手动注释数据上的高级转换器模型。在对公众舆论做出任何明确的断言之前,解决这四点是至关重要的。
{"title":"Sentiment Analysis of Twitter Data on Quantum Computing: An Exploratory Silver-Label Baseline Study","authors":"Faisal Mehmood,&nbsp;Abeer Abdulaziz Alsanad,&nbsp;Muhammad Azeem Akbar,&nbsp;Víctor Leiva,&nbsp;Cecilia Castro","doi":"10.1049/sfw2/1648095","DOIUrl":"https://doi.org/10.1049/sfw2/1648095","url":null,"abstract":"<p>Quantum software engineering is advancing rapidly in parallel with equally ambitious hardware roadmaps. However, systematic evidence on how online audiences perceive these advances remains scarce. We present an exploratory baseline of Twitter sentiment toward quantum computing, using automated (silver-standard) labels for benchmarking. Six months of English-language tweets containing the hashtag #Quantum (December 1, 2022 and May 31, 2023) were processed, with #Quantum treated as a proxy for online discourse on quantum computing. We then applied a transparent natural language processing (NLP) methodology combining two zero-shot lexicon-based tools (TextBlob and the Valence Aware Dictionary and sEntiment Reasoner [VADER]) with three lightweight supervised classifiers (multinomial naïve Bayes, Rocchio, and perceptron). Following standard preprocessing and a stratified 70/30 train–test split, we do not aim to measure definitive public opinion; rather, our primary contribution is to establish a transparent and reproducible baseline for future benchmarking. In this context, the multinomial naïve Bayes classifier attained a macro F1-score of 0.88 on the 30% hold-out set when benchmarked against the TextBlob silver labels. This score captures internal agreement rather than accuracy against human annotation. All five methods converged on a largely—though not universally—positive sentiment orientation (≈78%–81% of nonneutral tweets, depending on the tool). Grounded in the technology acceptance model (TAM) and the unified theory of acceptance and use of technology (UTAUT), we interpret our results as indicating the constructs of curiosity and perceived usefulness, rather than unequivocal adoption readiness. These constructs were not operationalized and serve only as interpretative lenses. By documenting every preprocessing step and model configuration, and making tweet identifiers and code available upon request, the study delivers a reproducible benchmark against which future work can (i) extend the query vocabulary, (ii) incorporate neutral and fine-grained emotions, (iii) apply cross-validation protocols, and (iv) evaluate advanced transformer models on manually annotated data. Addressing these four points is essential before making any definitive claims about public opinion.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/1648095","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145739461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Multi-Lens Phase Modulation for Scale-Aware Privacy-Preserving Human Pose Recognition 自适应多镜头相位调制用于尺度感知保护隐私的人体姿势识别
IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-12-05 DOI: 10.1049/sfw2/7879383
Weilong Peng, Quanwei Deng, Mingjie Li, Yangtao Wang, Yan Wang, Lisheng Fan, Zhaoyang Yu, Meie Fang

Recently, optical privacy protection has emerged as a promising approach for safeguarding visual privacy at the physical acquisition stage. However, existing methods often face a trade-off between privacy strength and human pose recognition accuracy, particularly in long-range and multi-scale scenarios. To address this challenge, we propose a novel adaptive optical privacy-preserving framework that integrates a learnable optical modulation system with a human pose recognition network. The core of our method lies in a sparse-weighted multi-lens model, where a lightweight multilayer perceptron (MLP) predicts a sparse set of coefficients to linearly combine predefined lens phase profiles based on facial region geometry. This enables dynamic control over the point spread function (PSF), adapting the degree of image degradation to subject scale in real time. Additionally, we introduce a privacy-aware loss function that selectively reduces facial localization accuracy while preserving body pose information. Extensive experiments on MSCOCO and FLIC datasets demonstrate that the proposed method achieves a favorable balance between privacy protection and pose estimation, outperforming previous optical- and software-based baselines.

近年来,光学隐私保护已成为在物理采集阶段保护视觉隐私的一种很有前途的方法。然而,现有的方法往往面临隐私强度和人体姿态识别精度之间的权衡,特别是在远距离和多尺度场景下。为了解决这一挑战,我们提出了一种新的自适应光学隐私保护框架,该框架将可学习的光学调制系统与人体姿态识别网络相结合。该方法的核心在于稀疏加权多透镜模型,其中轻量级多层感知器(MLP)预测一组稀疏系数,以基于面部区域几何形状线性组合预定义的透镜相位轮廓。这使得对点扩散函数(PSF)的动态控制,实时适应图像退化的程度。此外,我们引入了一个隐私感知损失函数,该函数在保留身体姿势信息的同时选择性地降低了面部定位的准确性。在MSCOCO和FLIC数据集上的大量实验表明,该方法在隐私保护和姿态估计之间取得了良好的平衡,优于以前基于光学和软件的基线。
{"title":"Adaptive Multi-Lens Phase Modulation for Scale-Aware Privacy-Preserving Human Pose Recognition","authors":"Weilong Peng,&nbsp;Quanwei Deng,&nbsp;Mingjie Li,&nbsp;Yangtao Wang,&nbsp;Yan Wang,&nbsp;Lisheng Fan,&nbsp;Zhaoyang Yu,&nbsp;Meie Fang","doi":"10.1049/sfw2/7879383","DOIUrl":"10.1049/sfw2/7879383","url":null,"abstract":"<p>Recently, optical privacy protection has emerged as a promising approach for safeguarding visual privacy at the physical acquisition stage. However, existing methods often face a trade-off between privacy strength and human pose recognition accuracy, particularly in long-range and multi-scale scenarios. To address this challenge, we propose a novel adaptive optical privacy-preserving framework that integrates a learnable optical modulation system with a human pose recognition network. The core of our method lies in a sparse-weighted multi-lens model, where a lightweight multilayer perceptron (MLP) predicts a sparse set of coefficients to linearly combine predefined lens phase profiles based on facial region geometry. This enables dynamic control over the point spread function (PSF), adapting the degree of image degradation to subject scale in real time. Additionally, we introduce a privacy-aware loss function that selectively reduces facial localization accuracy while preserving body pose information. Extensive experiments on MSCOCO and FLIC datasets demonstrate that the proposed method achieves a favorable balance between privacy protection and pose estimation, outperforming previous optical- and software-based baselines.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/7879383","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145686378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Co-Occurrences of Code Smells and Design Patterns on Internal Code Quality Attributes 代码气味和设计模式共同出现对内部代码质量属性的影响
IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-12-01 DOI: 10.1049/sfw2/5579438
Sania Imran, Irum Inayat, Maya Daneva

The structural features of a code section that may indicate a more serious issue with the design of a system or code are known as code smells. Design patterns, on the other hand, are meant to describe the best reusable solution for creating object-oriented software systems. Even though design patterns and code smells are very different, they may co-occur. In fact, there may be a significant connection among the two, which requires further research. This study aims to (i) identify design patterns and code smells in web gaming code, (ii) investigate the co-occurrence of the two, and (iii) analyze the effects of these co-occurrences on internal quality aspects of code. An experiment is carried out on JavaScript (JS) web games utilizing machine learning classifiers to investigate the influence of co-occurrence on potential code smells and design patterns to evaluate games from a quality perspective. Moreover, statistical testing is performed to identify the impact of co-occurrences of code smells and design patterns on internal quality attributes. After examining the data, we determined that random forest is the most effective classifier, achieving an accuracy of 99.126% and 98.99% for both experimental situations, respectively. Moreover, on applying the Wilcoxon signed rank test, we found that co-occurrence has no impact on the coupling and complexity of web games codes, whereas there is a significant impact of co-occurrence on cohesion, size, and inheritance. Our results may guide developers in writing efficient games code to add to this swiftly growing market.

代码段的结构特征可能表明系统或代码设计中存在更严重的问题,这些特征被称为代码气味。另一方面,设计模式旨在描述创建面向对象软件系统的最佳可重用解决方案。尽管设计模式和代码气味非常不同,但它们可能同时出现。事实上,这两者之间可能存在着重要的联系,这需要进一步的研究。本研究旨在(i)识别网页游戏代码中的设计模式和代码气味,(ii)调查两者的共生关系,(iii)分析这些共生关系对代码内部质量方面的影响。我们利用机器学习分类器在JavaScript网页游戏上进行了一项实验,以调查共现对潜在代码气味和设计模式的影响,从而从质量角度评估游戏。此外,执行统计测试以确定代码气味和设计模式共同出现对内部质量属性的影响。在检查数据后,我们确定随机森林是最有效的分类器,在两种实验情况下分别达到99.126%和98.99%的准确率。此外,应用Wilcoxon符号秩检验,我们发现共现对网页游戏代码的耦合性和复杂性没有影响,而共现对网页游戏代码的内聚性、大小和继承性有显著影响。我们的研究结果可能会指导开发者编写有效的游戏代码,以加入这个快速增长的市场。
{"title":"Impact of Co-Occurrences of Code Smells and Design Patterns on Internal Code Quality Attributes","authors":"Sania Imran,&nbsp;Irum Inayat,&nbsp;Maya Daneva","doi":"10.1049/sfw2/5579438","DOIUrl":"10.1049/sfw2/5579438","url":null,"abstract":"<p>The structural features of a code section that may indicate a more serious issue with the design of a system or code are known as code smells. Design patterns, on the other hand, are meant to describe the best reusable solution for creating object-oriented software systems. Even though design patterns and code smells are very different, they may co-occur. In fact, there may be a significant connection among the two, which requires further research. This study aims to (i) identify design patterns and code smells in web gaming code, (ii) investigate the co-occurrence of the two, and (iii) analyze the effects of these co-occurrences on internal quality aspects of code. An experiment is carried out on JavaScript (JS) web games utilizing machine learning classifiers to investigate the influence of co-occurrence on potential code smells and design patterns to evaluate games from a quality perspective. Moreover, statistical testing is performed to identify the impact of co-occurrences of code smells and design patterns on internal quality attributes. After examining the data, we determined that random forest is the most effective classifier, achieving an accuracy of 99.126% and 98.99% for both experimental situations, respectively. Moreover, on applying the Wilcoxon signed rank test, we found that co-occurrence has no impact on the coupling and complexity of web games codes, whereas there is a significant impact of co-occurrence on cohesion, size, and inheritance. Our results may guide developers in writing efficient games code to add to this swiftly growing market.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/5579438","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145686301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Blockchain-Audited Federated Learning: Securing Data and Model Updates With On-Chain Provenance 区块链审计联邦学习:利用链上来源保护数据和模型更新
IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-11-25 DOI: 10.1049/sfw2/6670439
Seid Mehammed, Girma Bewuketu, Demeke Getaneh, Md Nasre Alam, Shakir Khan, Fatimah Alhayan

We present a permissioned blockchain–audited federated learning (FL) framework that strengthens data provenance and model-update integrity. Our contribution is primarily engineering and architectural: a modular two-channel design (provenance vs. update-audit), lightweight on-chain validation with off-chain analytics, and a practical mapping to the 1 + 5 architectural views. In a TensorFlow Federated + Hyperledger Fabric prototype with 10 clients, we observe ≈18% faster anomaly detection under attack and a + 0.4 pp accuracy delta versus a baseline FL setup, with ~6% communication and ~8% energy overhead. We also provide a proof-of-concept zero-knowledge succinct noninteractive argument of knowledge (zk-SNARK) flow to validate per-client summary properties off-chain while anchoring results on-chain. These contributions collectively advance the practical deployment of secure, auditable FL systems.

我们提出了一个许可的区块链审计联邦学习(FL)框架,该框架加强了数据来源和模型更新的完整性。我们的贡献主要是工程和架构:模块化的双通道设计(来源vs.更新审计),轻量级链上验证与链下分析,以及到1 + 5架构视图的实际映射。在一个有10个客户端的TensorFlow Federated + Hyperledger Fabric原型中,我们观察到攻击下的异常检测速度提高了约18%,与基线FL设置相比,精确度提高了0.4 pp,通信开销约为6%,能量开销约为8%。我们还提供了一个概念验证零知识简洁的非交互式知识参数(zk-SNARK)流,以验证链下的每个客户端摘要属性,同时将结果锚定在链上。这些贡献共同推进了安全的、可审计的FL系统的实际部署。
{"title":"Blockchain-Audited Federated Learning: Securing Data and Model Updates With On-Chain Provenance","authors":"Seid Mehammed,&nbsp;Girma Bewuketu,&nbsp;Demeke Getaneh,&nbsp;Md Nasre Alam,&nbsp;Shakir Khan,&nbsp;Fatimah Alhayan","doi":"10.1049/sfw2/6670439","DOIUrl":"https://doi.org/10.1049/sfw2/6670439","url":null,"abstract":"<p>We present a permissioned blockchain–audited federated learning (FL) framework that strengthens data provenance and model-update integrity. Our contribution is primarily engineering and architectural: a modular two-channel design (provenance vs. update-audit), lightweight on-chain validation with off-chain analytics, and a practical mapping to the 1 + 5 architectural views. In a TensorFlow Federated + Hyperledger Fabric prototype with 10 clients, we observe ≈18% faster anomaly detection under attack and a + 0.4 pp accuracy delta versus a baseline FL setup, with ~6% communication and ~8% energy overhead. We also provide a proof-of-concept zero-knowledge succinct noninteractive argument of knowledge (zk-SNARK) flow to validate per-client summary properties off-chain while anchoring results on-chain. These contributions collectively advance the practical deployment of secure, auditable FL systems.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/6670439","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145626002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on the Application of Artificial Intelligence Technology in Color Matching in Product Appearance and Function Design 人工智能技术在产品外观与功能设计配色中的应用研究
IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-11-19 DOI: 10.1049/sfw2/4103554
Chi Zhang, Wanli Gu, Yan Gu, Changyuan Geng, Duk-hwan Kim

In the context of intensified market competition, consumer demand for aesthetically pleasing and functionally designed products has grown significantly. Color matching in product appearance plays a critical role in influencing consumer choice. However, designers often face challenges related to low color recognition accuracy, which hampers efficiency and design quality. This study explores the application of artificial intelligence (AI) technology to assist in the color matching process of product appearance and functional design. Experimental evaluation across various product types demonstrates that AI integration improves color accuracy by 2.09% and enhances the stability of image representation by 3.47%. Additionally, it reduces design analysis time, increases designer productivity, and boosts satisfaction scores by 5.4%. The findings confirm that AI technology effectively supports designers in achieving more accurate, efficient, and satisfactory color matching outcomes.

在市场竞争加剧的背景下,消费者对美观和功能设计产品的需求显著增长。产品外观的配色对消费者的选择起着至关重要的作用。然而,设计师经常面临与颜色识别精度低相关的挑战,这阻碍了效率和设计质量。本研究探讨人工智能(AI)技术在产品外观与功能设计配色过程中的辅助应用。不同产品类型的实验评估表明,人工智能集成后的颜色精度提高了2.09%,图像表示的稳定性提高了3.47%。此外,它减少了设计分析时间,提高了设计师的工作效率,并将满意度分数提高了5.4%。研究结果证实,人工智能技术有效地支持设计师实现更准确、高效和令人满意的配色结果。
{"title":"Research on the Application of Artificial Intelligence Technology in Color Matching in Product Appearance and Function Design","authors":"Chi Zhang,&nbsp;Wanli Gu,&nbsp;Yan Gu,&nbsp;Changyuan Geng,&nbsp;Duk-hwan Kim","doi":"10.1049/sfw2/4103554","DOIUrl":"https://doi.org/10.1049/sfw2/4103554","url":null,"abstract":"<p>In the context of intensified market competition, consumer demand for aesthetically pleasing and functionally designed products has grown significantly. Color matching in product appearance plays a critical role in influencing consumer choice. However, designers often face challenges related to low color recognition accuracy, which hampers efficiency and design quality. This study explores the application of artificial intelligence (AI) technology to assist in the color matching process of product appearance and functional design. Experimental evaluation across various product types demonstrates that AI integration improves color accuracy by 2.09% and enhances the stability of image representation by 3.47%. Additionally, it reduces design analysis time, increases designer productivity, and boosts satisfaction scores by 5.4%. The findings confirm that AI technology effectively supports designers in achieving more accurate, efficient, and satisfactory color matching outcomes.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/4103554","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145581007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigation and Research on Several Key Issues of Software Defect Prediction 软件缺陷预测若干关键问题的调查与研究
IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-11-11 DOI: 10.1049/sfw2/6615496
Ya Zhang, Ningzhong Liu

With the increasing size and complexity of software code, hidden defects can pose serious problems to systems, making zero-defect software an urgent need for current industrial software applications. Software defect prediction (SDP) serves to identify defective modules or classes, with prediction models trained using historical defect data from various projects. This enables defect prediction in test projects, aiding in the rational allocation of test resources and the enhancement of software quality. The efficacy of SDP closely hinges on the quality of the defect dataset, the selected metric index, the trained model, and the algorithm design. This article reviews recent literature on SDP, summarizing existing research from three key perspectives: the dataset and metric elements employed in SDP, dataset optimization processing techniques, and defect prediction model techniques. It primarily focuses on introducing commonly used datasets and two types of defect metrics for SDP. Regarding dataset optimization processing technology, it discusses methods for handling abnormal data, high-dimensional data, class imbalance data, and data disparity issues. Furthermore, it analyzes the construction of prediction models across four dimensions: supervised learning, semi-supervised learning, unsupervised learning, and deep learning (DL). Key observations include: (i) Researchers utilize datasets of varying quality, performance evaluation metrics, and SDP models. The efficacy of software product metrics and development process metrics varies across different application scenarios, necessitating flexible metric selection based on actual requirements. (ii) Commonly used datasets like Promise and NASA exhibit varying data quality. Appropriate data preprocessing methods and dataset creation are crucial before training SDP models. (iii) In scenarios with limited labeled data, cross-project transfer learning, semi-supervised, or unsupervised learning methods tend to better utilize a broader range of training data. Given that each step in the SDP process corresponds to different unresolved issues, each requiring varying levels of response measures, we suggest that researchers comprehensively consider research objectives such as dataset quality, SDP model, performance evaluation indicators, and the need for model interpretability when conducting SDP-related research. It’s important to note that no universal dataset or model can perform optimally across different application scenarios.

随着软件代码的日益庞大和复杂,隐藏缺陷会给系统带来严重的问题,使得零缺陷软件成为当前工业软件应用的迫切需求。软件缺陷预测(SDP)用于识别有缺陷的模块或类,使用来自不同项目的历史缺陷数据训练预测模型。这使得测试项目中的缺陷预测成为可能,有助于测试资源的合理分配和软件质量的提高。SDP的有效性与缺陷数据集的质量、选择的度量指标、训练的模型和算法设计密切相关。本文综述了近年来关于SDP的文献,从SDP中使用的数据集和度量元素、数据集优化处理技术和缺陷预测模型技术三个关键角度对现有研究进行了总结。它主要集中于介绍SDP常用的数据集和两种类型的缺陷度量。在数据集优化处理技术方面,讨论了异常数据、高维数据、类不平衡数据和数据差异问题的处理方法。此外,它还分析了四个维度的预测模型的构建:监督学习、半监督学习、无监督学习和深度学习(DL)。主要观察结果包括:(i)研究人员利用不同质量、绩效评估指标和SDP模型的数据集。软件产品度量和开发过程度量的有效性在不同的应用场景中是不同的,因此需要根据实际需求灵活地选择度量。(ii)常用的数据集如Promise和NASA表现出不同的数据质量。在训练SDP模型之前,适当的数据预处理方法和数据集创建是至关重要的。(iii)在标记数据有限的情况下,跨项目迁移学习、半监督或无监督学习方法往往能更好地利用更广泛的训练数据。鉴于SDP过程的每个步骤对应不同的未解决问题,每个问题都需要不同程度的响应措施,我们建议研究者在进行SDP相关研究时综合考虑数据集质量、SDP模型、绩效评估指标以及模型可解释性需求等研究目标。需要注意的是,没有通用的数据集或模型可以在不同的应用程序场景中表现最佳。
{"title":"Investigation and Research on Several Key Issues of Software Defect Prediction","authors":"Ya Zhang,&nbsp;Ningzhong Liu","doi":"10.1049/sfw2/6615496","DOIUrl":"https://doi.org/10.1049/sfw2/6615496","url":null,"abstract":"<p>With the increasing size and complexity of software code, hidden defects can pose serious problems to systems, making zero-defect software an urgent need for current industrial software applications. Software defect prediction (SDP) serves to identify defective modules or classes, with prediction models trained using historical defect data from various projects. This enables defect prediction in test projects, aiding in the rational allocation of test resources and the enhancement of software quality. The efficacy of SDP closely hinges on the quality of the defect dataset, the selected metric index, the trained model, and the algorithm design. This article reviews recent literature on SDP, summarizing existing research from three key perspectives: the dataset and metric elements employed in SDP, dataset optimization processing techniques, and defect prediction model techniques. It primarily focuses on introducing commonly used datasets and two types of defect metrics for SDP. Regarding dataset optimization processing technology, it discusses methods for handling abnormal data, high-dimensional data, class imbalance data, and data disparity issues. Furthermore, it analyzes the construction of prediction models across four dimensions: supervised learning, semi-supervised learning, unsupervised learning, and deep learning (DL). Key observations include: (i) Researchers utilize datasets of varying quality, performance evaluation metrics, and SDP models. The efficacy of software product metrics and development process metrics varies across different application scenarios, necessitating flexible metric selection based on actual requirements. (ii) Commonly used datasets like Promise and NASA exhibit varying data quality. Appropriate data preprocessing methods and dataset creation are crucial before training SDP models. (iii) In scenarios with limited labeled data, cross-project transfer learning, semi-supervised, or unsupervised learning methods tend to better utilize a broader range of training data. Given that each step in the SDP process corresponds to different unresolved issues, each requiring varying levels of response measures, we suggest that researchers comprehensively consider research objectives such as dataset quality, SDP model, performance evaluation indicators, and the need for model interpretability when conducting SDP-related research. It’s important to note that no universal dataset or model can perform optimally across different application scenarios.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/6615496","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Facial Expression Recognition Approach Combining Canny Edge Detection and Convolutional Neural Networks 一种结合精细边缘检测和卷积神经网络的面部表情识别新方法
IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-11-08 DOI: 10.1049/sfw2/4943761
Jiao Ding, Li Yang, Tianfei Zhang, Songlin Zhang, Meiyu Liang

Facial expression recognition (FER) remains challenging under pose, illumination, and occlusion. This work presents CCFER, a dual-stream framework that couples explicit edge maps with appearance features. Grayscale faces undergo morphological closing followed by opening (5 × 5), then Canny with locally adaptive thresholds to produce clean edges for an edge branch; both streams use Dual-Direction Attention Mixed Feature Networks (DDAMFN). Multilevel fusion employs adaptively spatial feature fusion (ASFF), followed by Efficient Local Attention (ELA) and multihead attention (MHAtt) before classification. CCFER attains 92.19% on RAF-DB, 91.24% on FERPlus, and 67.32% on AffectNet-7, matching or approaching the recent state of the art with balanced cross-dataset performance. Controlled ablations (parameter-matched single-stream, random-noise edges) confirm gains stem from semantic contours, and efficiency measurements show modest overhead in parameters, GFLOPs, and latency, supporting practical deployment.

面部表情识别(FER)在姿势、光照和遮挡下仍然具有挑战性。这项工作提出了CCFER,一个双流框架,将显式边缘映射与外观特征耦合在一起。灰度面先进行形态学闭合,然后再进行5 × 5的开放,然后利用局部自适应阈值对边缘分支生成干净的边缘;这两种流都使用双向注意混合特征网络(DDAMFN)。多级融合首先采用自适应空间特征融合(adaptive spatial feature fusion, ASFF),然后采用高效局部注意(Efficient Local Attention, ELA)和多头注意(multihead Attention, MHAtt)进行分类。CCFER在RAF-DB上达到92.19%,在FERPlus上达到91.24%,在AffectNet-7上达到67.32%,在跨数据集性能平衡方面达到或接近最新水平。可控损耗(参数匹配的单流、随机噪声边缘)确认了来自语义轮廓的增益,效率测量显示参数、GFLOPs和延迟方面的开销适中,支持实际部署。
{"title":"A Novel Facial Expression Recognition Approach Combining Canny Edge Detection and Convolutional Neural Networks","authors":"Jiao Ding,&nbsp;Li Yang,&nbsp;Tianfei Zhang,&nbsp;Songlin Zhang,&nbsp;Meiyu Liang","doi":"10.1049/sfw2/4943761","DOIUrl":"10.1049/sfw2/4943761","url":null,"abstract":"<p>Facial expression recognition (FER) remains challenging under pose, illumination, and occlusion. This work presents CCFER, a dual-stream framework that couples explicit edge maps with appearance features. Grayscale faces undergo morphological closing followed by opening (5 × 5), then Canny with locally adaptive thresholds to produce clean edges for an edge branch; both streams use Dual-Direction Attention Mixed Feature Networks (DDAMFN). Multilevel fusion employs adaptively spatial feature fusion (ASFF), followed by Efficient Local Attention (ELA) and multihead attention (MHAtt) before classification. CCFER attains 92.19% on RAF-DB, 91.24% on FERPlus, and 67.32% on AffectNet-7, matching or approaching the recent state of the art with balanced cross-dataset performance. Controlled ablations (parameter-matched single-stream, random-noise edges) confirm gains stem from semantic contours, and efficiency measurements show modest overhead in parameters, GFLOPs, and latency, supporting practical deployment.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/4943761","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145470162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ReinSeed: Reinforcement Fuzz Testing With Multiphase Seed Optimization for Autonomous Driving Systems ReinSeed:基于多相种子优化的自动驾驶系统强化模糊测试
IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-11-07 DOI: 10.1049/sfw2/8657455
Qi Jin, Tingting Wu, Yunwei Dong, Zuohua Ding, Yongkui Xu

Ensuring the safety of autonomous driving systems (ADSs) is essential, which requires effective testing methods to enhance system robustness. Fuzz testing (FT) is a widely used technique for uncovering software faults by generating test cases that trigger unexpected system behaviors. However, traditional FT in ADS suffers from significant limitations, including inefficient seed selection, low test case relevance, and inadequate exploration of diverse failure-inducing driving scenarios. Random fuzzing often yields redundant or ineffective cases, limiting the detection of safety-critical issues. To address these challenges, we propose ReinSeed, a reinforcement FT (RFT) framework that integrates three key phases: prefuzzing seed optimization, reinforcement learning (RL)–based scenario generation, and postfuzzing seed prioritization. We introduce a scenario complexity index to prioritize initial seeds before fuzzing. During fuzzing, we model the process as a Markov decision process (MDP) and apply Q-learning to generate scenarios with effective fuzzing action variations guided by driving behaviors, including undesired behaviors and trajectory coverage. To further improve testing effectiveness, we present a postfuzzing prioritization strategy that ranks fuzzed scenarios based on risk energy by incorporating control constraint violation analysis, safety-critical events, and risk-driven trajectory. Experimental results demonstrate that the unified framework—ReinSeed—significantly improves the detection of undesired behaviors, outperforming baseline methods across maps of varying complexity. Furthermore, the multiphase seed optimization showcases distinct contributions of scenario complexity, behavior-guided fuzzing, and risk energy in enhancing both the efficiency and effectiveness of discovering critical behaviors in ADS.

确保自动驾驶系统(ads)的安全性至关重要,这需要有效的测试方法来增强系统的鲁棒性。模糊测试(FT)是一种广泛使用的技术,通过生成触发意外系统行为的测试用例来发现软件故障。然而,ADS中的传统FT存在明显的局限性,包括低效率的种子选择,低测试用例相关性,以及对各种故障诱导驱动场景的探索不足。随机模糊通常会产生冗余或无效的情况,限制了对安全关键问题的检测。为了解决这些挑战,我们提出了ReinSeed,这是一个强化FT (RFT)框架,它集成了三个关键阶段:预模糊化种子优化、基于强化学习(RL)的场景生成和后模糊化种子优先级。在模糊化之前,我们引入了一个场景复杂性指数来确定初始种子的优先级。在模糊过程中,我们将过程建模为马尔可夫决策过程(MDP),并应用q -学习来生成由驾驶行为(包括不期望的行为和轨迹覆盖)指导的有效模糊行动变化的场景。为了进一步提高测试效率,我们提出了一种模糊后优先级策略,该策略通过结合控制约束违规分析、安全关键事件和风险驱动轨迹,根据风险能量对模糊场景进行排序。实验结果表明,统一框架(reinseed)显著提高了对不良行为的检测,在不同复杂性的映射中优于基线方法。此外,多相种子优化显示了场景复杂性、行为导向模糊和风险能量在提高ADS发现关键行为的效率和有效性方面的独特贡献。
{"title":"ReinSeed: Reinforcement Fuzz Testing With Multiphase Seed Optimization for Autonomous Driving Systems","authors":"Qi Jin,&nbsp;Tingting Wu,&nbsp;Yunwei Dong,&nbsp;Zuohua Ding,&nbsp;Yongkui Xu","doi":"10.1049/sfw2/8657455","DOIUrl":"https://doi.org/10.1049/sfw2/8657455","url":null,"abstract":"<p>Ensuring the safety of autonomous driving systems (ADSs) is essential, which requires effective testing methods to enhance system robustness. Fuzz testing (FT) is a widely used technique for uncovering software faults by generating test cases that trigger unexpected system behaviors. However, traditional FT in ADS suffers from significant limitations, including inefficient seed selection, low test case relevance, and inadequate exploration of diverse failure-inducing driving scenarios. Random fuzzing often yields redundant or ineffective cases, limiting the detection of safety-critical issues. To address these challenges, we propose ReinSeed, a reinforcement FT (RFT) framework that integrates three key phases: prefuzzing seed optimization, reinforcement learning (RL)–based scenario generation, and postfuzzing seed prioritization. We introduce a scenario complexity index to prioritize initial seeds before fuzzing. During fuzzing, we model the process as a Markov decision process (MDP) and apply <i>Q</i>-learning to generate scenarios with effective fuzzing action variations guided by driving behaviors, including undesired behaviors and trajectory coverage. To further improve testing effectiveness, we present a postfuzzing prioritization strategy that ranks fuzzed scenarios based on risk energy by incorporating control constraint violation analysis, safety-critical events, and risk-driven trajectory. Experimental results demonstrate that the unified framework—ReinSeed—significantly improves the detection of undesired behaviors, outperforming baseline methods across maps of varying complexity. Furthermore, the multiphase seed optimization showcases distinct contributions of scenario complexity, behavior-guided fuzzing, and risk energy in enhancing both the efficiency and effectiveness of discovering critical behaviors in ADS.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/8657455","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145469814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Software Engineering With AI: Innovations, Challenges, and Future Directions 用人工智能增强软件工程:创新、挑战和未来方向
IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-10-28 DOI: 10.1049/sfw2/5691460
Tahir Abbas, Shujaat Ali Rathore, Amira Turki, Sunawar Khan, Omar Alghushairy, Ali Daud

Software engineering, along with the incorporation of Artificial Intelligence (AI), has emerged as a new technological vantage point that has permanently changed classical development practices and processes for any phase and aspect of the software lifecycle. In particular, this systematic literature review, which includes 135 peer-reviewed papers extracted from the years 2010 to 2025, follows PRISMA guidelines. It examines modern instances of AI-based requirements analysis, automated code transformation, predictive system modeling, proactive fault monitoring and detection, and advanced project guidance systems. Technologies can be powerful tools for increasing productivity and effectiveness and strengthening the quality of software development while making technology more complex—technologically, organizationally, and ethically. The generalization, explainability, privacy and algorithmic bias challenges of the model are discussed in detail. This paper shows how AI is helping companies to predict defects, automatically identify errors and optimize the software development. It also highlights the significant adoption barriers to these technologies for organizations. The review combines new industry research with existing practice to offer practical guidance on how these implementation challenges can be overcome and the ethical use of AI can be promoted. In contrast to existing reviews concentrating on isolated stages, the study offers an integrated review through life phases, distinctive ethical frameworks and a roadmap for adoption. Takeaway: Sustainable AI deployment in SE needs interdisciplinary collaboration, ethical oversight, and a mixture of guidelines to balance technology efficiency with responsibility. The paper highlights that interdisciplinary cooperation and ethical framings are requirements to integrate AI into software engineering in a sustainable, straightforward way. This review can be utilized as a guide for authors, scientists/practitioners, and policymakers in articulating the intellectual-practical gap.

软件工程,以及人工智能(AI)的结合,已经作为一种新的技术优势出现,它已经永久地改变了软件生命周期的任何阶段和方面的经典开发实践和过程。特别是,这一系统的文献综述,包括从2010年到2025年提取的135篇同行评议论文,遵循PRISMA的指导方针。它研究了基于人工智能的需求分析、自动代码转换、预测系统建模、主动故障监视和检测以及先进的项目指导系统的现代实例。技术可以成为提高生产力和效率的强大工具,并加强软件开发的质量,同时使技术更加复杂——技术上的、组织上的和道德上的。详细讨论了该模型的泛化、可解释性、隐私性和算法偏差挑战。本文展示了人工智能如何帮助公司预测缺陷,自动识别错误并优化软件开发。它还强调了组织采用这些技术的重大障碍。该报告将新的行业研究与现有实践相结合,为如何克服这些实施挑战和促进人工智能的道德使用提供实用指导。与现有的侧重于孤立阶段的审查不同,该研究提供了贯穿生命阶段的综合审查、独特的伦理框架和采用路线图。结论:可持续的人工智能部署需要跨学科合作、道德监督,以及平衡技术效率和责任的指导方针。该论文强调,跨学科合作和伦理框架是将人工智能以可持续、直接的方式整合到软件工程中的必要条件。这篇综述可以作为作者、科学家/从业者和政策制定者阐明智力与实践差距的指南。
{"title":"Enhancing Software Engineering With AI: Innovations, Challenges, and Future Directions","authors":"Tahir Abbas,&nbsp;Shujaat Ali Rathore,&nbsp;Amira Turki,&nbsp;Sunawar Khan,&nbsp;Omar Alghushairy,&nbsp;Ali Daud","doi":"10.1049/sfw2/5691460","DOIUrl":"https://doi.org/10.1049/sfw2/5691460","url":null,"abstract":"<p>Software engineering, along with the incorporation of Artificial Intelligence (AI), has emerged as a new technological vantage point that has permanently changed classical development practices and processes for any phase and aspect of the software lifecycle. In particular, this systematic literature review, which includes 135 peer-reviewed papers extracted from the years 2010 to 2025, follows PRISMA guidelines. It examines modern instances of AI-based requirements analysis, automated code transformation, predictive system modeling, proactive fault monitoring and detection, and advanced project guidance systems. Technologies can be powerful tools for increasing productivity and effectiveness and strengthening the quality of software development while making technology more complex—technologically, organizationally, and ethically. The generalization, explainability, privacy and algorithmic bias challenges of the model are discussed in detail. This paper shows how AI is helping companies to predict defects, automatically identify errors and optimize the software development. It also highlights the significant adoption barriers to these technologies for organizations. The review combines new industry research with existing practice to offer practical guidance on how these implementation challenges can be overcome and the ethical use of AI can be promoted. In contrast to existing reviews concentrating on isolated stages, the study offers an integrated review through life phases, distinctive ethical frameworks and a roadmap for adoption. Takeaway: Sustainable AI deployment in SE needs interdisciplinary collaboration, ethical oversight, and a mixture of guidelines to balance technology efficiency with responsibility. The paper highlights that interdisciplinary cooperation and ethical framings are requirements to integrate AI into software engineering in a sustainable, straightforward way. This review can be utilized as a guide for authors, scientists/practitioners, and policymakers in articulating the intellectual-practical gap.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/5691460","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vehicle Object Detection Algorithm Based on Region of Interest–Convolutional Neural Network 基于兴趣区域卷积神经网络的车辆目标检测算法
IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-10-23 DOI: 10.1049/sfw2/7289732
Zhaosheng Xu, Zhongming Liao, Jianbang Liu, Xiaoyong Xiao, Zhongqi Xiang, Xiuhong Xu

Traditional vehicle object detection faces problems such as low detection precision, high computational complexity, and poor performance in handling complex backgrounds. To address these challenges, this article adopts the simple linear iterative clustering (SLIC) algorithm for superpixel segmentation, generates candidate regions through selective search (SS), and uses the VGG16 deep convolutional neural network (CNN) for feature extraction, combined with a Softmax classifier for classification. Finally, the accuracy of vehicle detection boxes is improved by precisely adjusting the detection results through regional regression networks. In the training and testing of the model on large-scale datasets, the combination of transfer learning and data augmentation techniques improves the model’s robustness and generalization capabilities. The experimental results show that the F1-score of the model exceeds 0.95 in most vehicle categories, and the precision of the motorcycle detection reaches 0.978. The real-time performance test shows that with high-end graphics cards and optimization strategies, the model frame rate can reach 125 frames per second (FPS) and exhibits good robustness under complex lighting and weather conditions. Compared with the existing region of interest (ROI)–CNN-based method, the SLIC superpixel + SS candidate region generation strategy proposed in this paper significantly reduces the missed detection of small vehicles and improves the quality of candidate frames by maintaining target boundary information at the superpixel level and performing multilevel merging, thereby improving the recall rate of small targets. At the same time, the VGG16 combined with dilated convolution feature extraction scheme effectively retains the contextual information in occluded scenes by expanding the receptive field without reducing the resolution of the feature map, thereby enhancing the recognition stability of partially occluded vehicles. This proves that the model based on the ROI–CNN is effective in improving detection accuracy and real-time performance, showing its potential application value in applications such as intelligent transportation and autonomous driving.

传统的车辆目标检测存在检测精度低、计算量大、处理复杂背景性能差等问题。针对这些挑战,本文采用简单线性迭代聚类(SLIC)算法进行超像素分割,通过选择性搜索(SS)生成候选区域,并使用VGG16深度卷积神经网络(CNN)进行特征提取,结合Softmax分类器进行分类。最后,通过区域回归网络对检测结果进行精确调整,提高了车辆检测箱的精度。在大规模数据集上的模型训练和测试中,迁移学习和数据增强技术的结合提高了模型的鲁棒性和泛化能力。实验结果表明,该模型在大多数车辆类别中的f1得分超过0.95,摩托车的检测精度达到0.978。实时性能测试表明,在高端显卡和优化策略的支持下,模型的帧率可以达到125帧/秒(FPS),并且在复杂光照和天气条件下具有良好的鲁棒性。与现有的基于感兴趣区域(ROI) - cnn的方法相比,本文提出的SLIC超像素+ SS候选区域生成策略通过在超像素级保持目标边界信息并进行多级合并,显著降低了小型车辆的漏检率,提高了候选帧的质量,从而提高了小目标的召回率。同时,VGG16结合扩展卷积特征提取方案,在不降低特征图分辨率的前提下,通过扩大接受野,有效地保留了遮挡场景中的上下文信息,从而增强了部分遮挡车辆的识别稳定性。这证明了基于ROI-CNN的模型在提高检测精度和实时性方面是有效的,在智能交通、自动驾驶等应用中具有潜在的应用价值。
{"title":"Vehicle Object Detection Algorithm Based on Region of Interest–Convolutional Neural Network","authors":"Zhaosheng Xu,&nbsp;Zhongming Liao,&nbsp;Jianbang Liu,&nbsp;Xiaoyong Xiao,&nbsp;Zhongqi Xiang,&nbsp;Xiuhong Xu","doi":"10.1049/sfw2/7289732","DOIUrl":"https://doi.org/10.1049/sfw2/7289732","url":null,"abstract":"<p>Traditional vehicle object detection faces problems such as low detection precision, high computational complexity, and poor performance in handling complex backgrounds. To address these challenges, this article adopts the simple linear iterative clustering (SLIC) algorithm for superpixel segmentation, generates candidate regions through selective search (SS), and uses the VGG16 deep convolutional neural network (CNN) for feature extraction, combined with a Softmax classifier for classification. Finally, the accuracy of vehicle detection boxes is improved by precisely adjusting the detection results through regional regression networks. In the training and testing of the model on large-scale datasets, the combination of transfer learning and data augmentation techniques improves the model’s robustness and generalization capabilities. The experimental results show that the F1-score of the model exceeds 0.95 in most vehicle categories, and the precision of the motorcycle detection reaches 0.978. The real-time performance test shows that with high-end graphics cards and optimization strategies, the model frame rate can reach 125 frames per second (FPS) and exhibits good robustness under complex lighting and weather conditions. Compared with the existing region of interest (ROI)–CNN-based method, the SLIC superpixel + SS candidate region generation strategy proposed in this paper significantly reduces the missed detection of small vehicles and improves the quality of candidate frames by maintaining target boundary information at the superpixel level and performing multilevel merging, thereby improving the recall rate of small targets. At the same time, the VGG16 combined with dilated convolution feature extraction scheme effectively retains the contextual information in occluded scenes by expanding the receptive field without reducing the resolution of the feature map, thereby enhancing the recognition stability of partially occluded vehicles. This proves that the model based on the ROI–CNN is effective in improving detection accuracy and real-time performance, showing its potential application value in applications such as intelligent transportation and autonomous driving.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/7289732","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145366677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IET Software
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1