首页 > 最新文献

CAAI Transactions on Intelligence Technology最新文献

英文 中文
Random Strip Peeling: A novel lightweight image encryption for IoT devices based on colour planes permutation 随机条带剥离:一种基于彩色平面排列的物联网设备的新型轻量级图像加密
IF 7.3 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-05 DOI: 10.1049/cit2.12401
Kenan İnce, Cemile İnce, Davut Hanbay

This paper introduces a novel lightweight colour image encryption algorithm, specifically designed for resource-constrained environments such as Internet of Things (IoT) devices. As IoT systems become increasingly prevalent, secure and efficient data transmission becomes crucial. The proposed algorithm addresses this need by offering a robust yet resource-efficient solution for image encryption. Traditional image encryption relies on confusion and diffusion steps. These stages are generally implemented linearly, but this work introduces a new RSP (Random Strip Peeling) algorithm for the confusion step, which disrupts linearity in the lightweight category by using two different sequences generated by the 1D Tent Map with varying initial conditions. The diffusion stage then employs an XOR matrix generated by the Logistic Map. Different evaluation metrics, such as entropy analysis, key sensitivity, statistical and differential attacks resistance, and robustness analysis demonstrate the proposed algorithm's lightweight, robust, and efficient. The proposed encryption scheme achieved average metric values of 99.6056 for NPCR, 33.4397 for UACI, and 7.9914 for information entropy in the SIPI image dataset. It also exhibits a time complexity of O(2×M×N) $O(2times Mtimes N)$ for an image of size M×N $Mtimes N$.

本文介绍了一种新的轻量级彩色图像加密算法,专门为资源受限环境(如物联网(IoT)设备)设计。随着物联网系统的日益普及,安全和高效的数据传输变得至关重要。该算法通过提供鲁棒且资源高效的图像加密解决方案来解决这一需求。传统的图像加密依赖于混淆和扩散步骤。这些阶段通常是线性实现的,但这项工作为混淆步骤引入了一种新的RSP (Random Strip Peeling)算法,该算法通过使用具有不同初始条件的1D Tent Map生成的两个不同序列来破坏轻量级类别的线性。扩散阶段然后使用逻辑映射生成的异或矩阵。不同的评估指标,如熵分析、密钥敏感性、统计和差分攻击抵抗以及鲁棒性分析,证明了该算法的轻量级、鲁棒性和高效性。该加密方案在SIPI图像数据集中实现了NPCR、UACI和信息熵的平均度量值分别为99.6056、33.4397和7.9914。对于大小相同的图像,它的时间复杂度为O(2 × M × N)$ O(2乘以M乘以N)$M × N$ M乘以N$。
{"title":"Random Strip Peeling: A novel lightweight image encryption for IoT devices based on colour planes permutation","authors":"Kenan İnce,&nbsp;Cemile İnce,&nbsp;Davut Hanbay","doi":"10.1049/cit2.12401","DOIUrl":"10.1049/cit2.12401","url":null,"abstract":"<p>This paper introduces a novel lightweight colour image encryption algorithm, specifically designed for resource-constrained environments such as Internet of Things (IoT) devices. As IoT systems become increasingly prevalent, secure and efficient data transmission becomes crucial. The proposed algorithm addresses this need by offering a robust yet resource-efficient solution for image encryption. Traditional image encryption relies on confusion and diffusion steps. These stages are generally implemented linearly, but this work introduces a new RSP (Random Strip Peeling) algorithm for the confusion step, which disrupts linearity in the lightweight category by using two different sequences generated by the 1D Tent Map with varying initial conditions. The diffusion stage then employs an XOR matrix generated by the Logistic Map. Different evaluation metrics, such as entropy analysis, key sensitivity, statistical and differential attacks resistance, and robustness analysis demonstrate the proposed algorithm's lightweight, robust, and efficient. The proposed encryption scheme achieved average metric values of 99.6056 for NPCR, 33.4397 for UACI, and 7.9914 for information entropy in the SIPI image dataset. It also exhibits a time complexity of <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>O</mi>\u0000 <mrow>\u0000 <mo>(</mo>\u0000 <mrow>\u0000 <mn>2</mn>\u0000 <mo>×</mo>\u0000 <mi>M</mi>\u0000 <mo>×</mo>\u0000 <mi>N</mi>\u0000 </mrow>\u0000 <mo>)</mo>\u0000 </mrow>\u0000 </mrow>\u0000 <annotation> $O(2times Mtimes N)$</annotation>\u0000 </semantics></math> for an image of size <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>M</mi>\u0000 <mo>×</mo>\u0000 <mi>N</mi>\u0000 </mrow>\u0000 <annotation> $Mtimes N$</annotation>\u0000 </semantics></math>.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 2","pages":"529-544"},"PeriodicalIF":7.3,"publicationDate":"2025-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12401","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143856788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Which is more faithful, seeing or saying? Multimodal sarcasm detection exploiting contrasting sentiment knowledge 看和说哪个更忠实?利用对比情感知识进行多模态讽刺检测
IF 7.3 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-27 DOI: 10.1049/cit2.12400
Yutao Chen, Shumin Shi, Heyan Huang

Using sarcasm on social media platforms to express negative opinions towards a person or object has become increasingly common. However, detecting sarcasm in various forms of communication can be difficult due to conflicting sentiments. In this paper, we introduce a contrasting sentiment-based model for multimodal sarcasm detection (CS4MSD), which identifies inconsistent emotions by leveraging the CLIP knowledge module to produce sentiment features in both text and image. Then, five external sentiments are introduced to prompt the model learning sentimental preferences among modalities. Furthermore, we highlight the importance of verbal descriptions embedded in illustrations and incorporate additional knowledge-sharing modules to fuse such image-like features. Experimental results demonstrate that our model achieves state-of-the-art performance on the public multimodal sarcasm dataset.

在社交媒体平台上使用讽刺来表达对某人或某物的负面看法已经变得越来越普遍。然而,由于情绪冲突,在各种形式的交流中发现讽刺是很困难的。在本文中,我们引入了一种基于对比情绪的多模态讽刺检测模型(CS4MSD),该模型通过利用CLIP知识模块在文本和图像中生成情感特征来识别不一致的情绪。然后,引入五种外部情绪来促进模型学习模式间的情感偏好。此外,我们强调了插图中嵌入的语言描述的重要性,并结合了额外的知识共享模块来融合这些图像样的特征。实验结果表明,我们的模型在公共多模态讽刺数据集上达到了最先进的性能。
{"title":"Which is more faithful, seeing or saying? Multimodal sarcasm detection exploiting contrasting sentiment knowledge","authors":"Yutao Chen,&nbsp;Shumin Shi,&nbsp;Heyan Huang","doi":"10.1049/cit2.12400","DOIUrl":"10.1049/cit2.12400","url":null,"abstract":"<p>Using sarcasm on social media platforms to express negative opinions towards a person or object has become increasingly common. However, detecting sarcasm in various forms of communication can be difficult due to conflicting sentiments. In this paper, we introduce a contrasting sentiment-based model for multimodal sarcasm detection (CS4MSD), which identifies inconsistent emotions by leveraging the CLIP knowledge module to produce sentiment features in both text and image. Then, five external sentiments are introduced to prompt the model learning sentimental preferences among modalities. Furthermore, we highlight the importance of verbal descriptions embedded in illustrations and incorporate additional knowledge-sharing modules to fuse such image-like features. Experimental results demonstrate that our model achieves state-of-the-art performance on the public multimodal sarcasm dataset.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 2","pages":"375-386"},"PeriodicalIF":7.3,"publicationDate":"2024-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12400","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143857074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Laplacian attention: A plug-and-play algorithm without increasing model complexity for vision tasks 拉普拉斯注意:一种即插即用的算法,不会增加视觉任务的模型复杂度
IF 7.3 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-26 DOI: 10.1049/cit2.12402
Xiaolei Chen, Yubing Lu, Runyu Wen

Most prevailing attention mechanism modules in contemporary research are convolution-based modules, and while these modules contribute to enhancing the accuracy of deep learning networks in visual tasks, they concurrently augment the overall model complexity. To address the problem, this paper proposes a plug-and-play algorithm that does not increase the complexity of the model, Laplacian attention (LA). The LA algorithm first calculates the similarity distance between feature points in the feature space and feature channel and constructs the residual Laplacian matrix between feature points through the similarity distance and Gaussian kernel. This construction serves to segregate non-similar feature points while aggregating those with similarities. Ultimately, the LA algorithm allocates the outputs of the feature channel and the feature space adaptively to derive the final LA outputs. Crucially, the LA algorithm is confined to the forward computation process and does not involve backpropagation or any parameter learning. The LA algorithm undergoes comprehensive experimentation on three distinct datasets—namely Cifar-10, miniImageNet, and Pascal VOC 2012. The experimental results demonstrate that, compared with the advanced attention mechanism modules in recent years, such as SENet, CBAM, ECANet, coordinate attention, and triplet attention, the LA algorithm exhibits superior performance across image classification, object detection and semantic segmentation tasks.

当代研究中流行的注意力机制模块大多是基于卷积的模块,虽然这些模块有助于提高深度学习网络在视觉任务中的准确性,但同时也增加了整体模型的复杂性。为了解决这个问题,本文提出了一种不会增加模型复杂度的即插即用算法--拉普拉斯注意(Laplacian attention,LA)。拉普拉斯注意算法首先计算特征空间和特征通道中特征点之间的相似性距离,然后通过相似性距离和高斯核构建特征点之间的残差拉普拉斯矩阵。这种构造的作用是隔离非相似特征点,同时聚合具有相似性的特征点。最后,LA 算法会自适应地分配特征通道和特征空间的输出,从而得出最终的 LA 输出。最重要的是,LA 算法仅限于前向计算过程,不涉及反向传播或任何参数学习。LA 算法在三个不同的数据集(即 Cifar-10、miniImageNet 和 Pascal VOC 2012)上进行了全面的实验。实验结果表明,与近年来先进的注意力机制模块(如 SENet、CBAM、ECANet、坐标注意力和三重注意力)相比,LA 算法在图像分类、物体检测和语义分割任务中表现出更优越的性能。
{"title":"Laplacian attention: A plug-and-play algorithm without increasing model complexity for vision tasks","authors":"Xiaolei Chen,&nbsp;Yubing Lu,&nbsp;Runyu Wen","doi":"10.1049/cit2.12402","DOIUrl":"10.1049/cit2.12402","url":null,"abstract":"<p>Most prevailing attention mechanism modules in contemporary research are convolution-based modules, and while these modules contribute to enhancing the accuracy of deep learning networks in visual tasks, they concurrently augment the overall model complexity. To address the problem, this paper proposes a plug-and-play algorithm that does not increase the complexity of the model, Laplacian attention (LA). The LA algorithm first calculates the similarity distance between feature points in the feature space and feature channel and constructs the residual Laplacian matrix between feature points through the similarity distance and Gaussian kernel. This construction serves to segregate non-similar feature points while aggregating those with similarities. Ultimately, the LA algorithm allocates the outputs of the feature channel and the feature space adaptively to derive the final LA outputs. Crucially, the LA algorithm is confined to the forward computation process and does not involve backpropagation or any parameter learning. The LA algorithm undergoes comprehensive experimentation on three distinct datasets—namely Cifar-10, miniImageNet, and Pascal VOC 2012. The experimental results demonstrate that, compared with the advanced attention mechanism modules in recent years, such as SENet, CBAM, ECANet, coordinate attention, and triplet attention, the LA algorithm exhibits superior performance across image classification, object detection and semantic segmentation tasks.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 2","pages":"545-556"},"PeriodicalIF":7.3,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12402","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143857005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KitWaSor: Pioneering pre-trained model for kitchen waste sorting with an innovative million-level benchmark dataset KitWaSor:开创性的厨房垃圾分类预训练模型,具有创新的百万级基准数据集
IF 7.3 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-29 DOI: 10.1049/cit2.12399
Leyuan Fang, Shuaiyu Ding, Hao Feng, Junwu Yu, Lin Tang, Pedram Ghamisi

Intelligent sorting is an important prerequisite for the full quantitative consumption and harmless disposal of kitchen waste. The existing object detection method based on an ImageNet pre-trained model is an effective way of sorting. Owing to significant domain gaps between natural images and kitchen waste images, it is difficult to reflect the characteristics of diverse scales and dense distribution in kitchen waste based on an ImageNet pre-trained model, leading to poor generalisation. In this article, the authors propose the first pre-trained model for kitchen waste sorting called KitWaSor, which combines both contrastive learning (CL) and masked image modelling (MIM) through self-supervised learning (SSL). First, to address the issue of diverse scales, the authors propose a mixed masking strategy by introducing an incomplete masking branch based on the original random masking branch. It prevents the complete loss of small-scale objects while avoiding excessive leakage of large-scale object pixels. Second, to address the issue of dense distribution, the authors introduce semantic consistency constraints on the basis of the mixed masking strategy. That is, object semantic reasoning is performed through semantic consistency constraints to compensate for the lack of contextual information. To train KitWaSor, the authors construct the first million-level kitchen waste dataset across seasonal and regional distributions, named KWD-Million. Extensive experiments show that KitWaSor achieves state-of-the-art (SOTA) performance on the two most relevant downstream tasks for kitchen waste sorting (i.e. image classification and object detection), demonstrating the effectiveness of the proposed KitWaSor.

智能分拣是餐厨垃圾充分定量消纳、无害化处理的重要前提。现有的基于ImageNet预训练模型的目标检测方法是一种有效的分类方法。由于自然图像与餐厨垃圾图像之间存在明显的域差,基于ImageNet预训练模型难以反映餐厨垃圾尺度多样、分布密集的特征,导致泛化效果较差。在本文中,作者提出了第一个用于厨房垃圾分类的预训练模型KitWaSor,它通过自监督学习(SSL)结合了对比学习(CL)和掩膜图像建模(MIM)。首先,为了解决不同尺度的问题,作者提出了一种混合掩蔽策略,在原始随机掩蔽分支的基础上引入不完全掩蔽分支。它可以防止小尺度对象的完全丢失,同时避免大尺度对象像素的过度泄漏。其次,为了解决密集分布的问题,作者在混合掩蔽策略的基础上引入了语义一致性约束。即通过语义一致性约束来进行对象语义推理,以弥补上下文信息的缺失。为了训练KitWaSor,作者构建了第一个跨季节和地区分布的百万级厨房垃圾数据集,命名为KWD-Million。大量的实验表明,KitWaSor在厨房垃圾分类的两个最相关的下游任务(即图像分类和目标检测)上达到了最先进的(SOTA)性能,证明了所提出的KitWaSor的有效性。
{"title":"KitWaSor: Pioneering pre-trained model for kitchen waste sorting with an innovative million-level benchmark dataset","authors":"Leyuan Fang,&nbsp;Shuaiyu Ding,&nbsp;Hao Feng,&nbsp;Junwu Yu,&nbsp;Lin Tang,&nbsp;Pedram Ghamisi","doi":"10.1049/cit2.12399","DOIUrl":"10.1049/cit2.12399","url":null,"abstract":"<p>Intelligent sorting is an important prerequisite for the full quantitative consumption and harmless disposal of kitchen waste. The existing object detection method based on an ImageNet pre-trained model is an effective way of sorting. Owing to significant domain gaps between natural images and kitchen waste images, it is difficult to reflect the characteristics of diverse scales and dense distribution in kitchen waste based on an ImageNet pre-trained model, leading to poor generalisation. In this article, the authors propose the first pre-trained model for kitchen waste sorting called KitWaSor, which combines both contrastive learning (CL) and masked image modelling (MIM) through self-supervised learning (SSL). First, to address the issue of diverse scales, the authors propose a mixed masking strategy by introducing an incomplete masking branch based on the original random masking branch. It prevents the complete loss of small-scale objects while avoiding excessive leakage of large-scale object pixels. Second, to address the issue of dense distribution, the authors introduce semantic consistency constraints on the basis of the mixed masking strategy. That is, object semantic reasoning is performed through semantic consistency constraints to compensate for the lack of contextual information. To train KitWaSor, the authors construct the first million-level kitchen waste dataset across seasonal and regional distributions, named KWD-Million. Extensive experiments show that KitWaSor achieves state-of-the-art (SOTA) performance on the two most relevant downstream tasks for kitchen waste sorting (i.e. image classification and object detection), demonstrating the effectiveness of the proposed KitWaSor.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 1","pages":"94-114"},"PeriodicalIF":7.3,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12399","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143536116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature pyramid attention network for audio-visual scene classification 用于视听场景分类的特征金字塔关注网络
IF 7.3 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-26 DOI: 10.1049/cit2.12375
Liguang Zhou, Yuhongze Zhou, Xiaonan Qi, Junjie Hu, Tin Lun Lam, Yangsheng Xu

Audio-visual scene classification (AVSC) poses a formidable challenge owing to the intricate spatial-temporal relationships exhibited by audio-visual signals, coupled with the complex spatial patterns of objects and textures found in visual images. The focus of recent studies has predominantly revolved around extracting features from diverse neural network structures, inadvertently neglecting the acquisition of semantically meaningful regions and crucial components within audio-visual data. The authors present a feature pyramid attention network (FPANet) for audio-visual scene understanding, which extracts semantically significant characteristics from audio-visual data. The authors’ approach builds multi-scale hierarchical features of sound spectrograms and visual images using a feature pyramid representation and localises the semantically relevant regions with a feature pyramid attention module (FPAM). A dimension alignment (DA) strategy is employed to align feature maps from multiple layers, a pyramid spatial attention (PSA) to spatially locate essential regions, and a pyramid channel attention (PCA) to pinpoint significant temporal frames. Experiments on visual scene classification (VSC), audio scene classification (ASC), and AVSC tasks demonstrate that FPANet achieves performance on par with state-of-the-art (SOTA) approaches, with a 95.9 F1-score on the ADVANCE dataset and a relative improvement of 28.8%. Visualisation results show that FPANet can prioritise semantically meaningful areas in audio-visual signals.

视听场景分类(AVSC)是一项艰巨的挑战,因为视听信号具有错综复杂的时空关系,而视觉图像中的物体和纹理又具有复杂的空间模式。近期研究的重点主要围绕从不同的神经网络结构中提取特征,却无意中忽略了获取视听数据中具有语义意义的区域和关键组成部分。作者提出了一种用于视听场景理解的特征金字塔注意网络(FPANet),它能从视听数据中提取具有语义意义的特征。作者的方法使用特征金字塔表示法构建声音频谱图和视觉图像的多尺度分层特征,并使用特征金字塔注意模块(FPAM)定位语义相关区域。采用维度对齐(DA)策略对齐多层特征图,采用金字塔空间注意力(PSA)在空间上定位重要区域,采用金字塔通道注意力(PCA)精确定位重要的时间帧。在视觉场景分类(VSC)、音频场景分类(ASC)和AVSC任务上的实验表明,FPANet的性能与最先进的(SOTA)方法相当,在ADVANCE数据集上的F1分数为95.9,相对提高了28.8%。可视化结果表明,FPANet 可以对视听信号中具有语义意义的区域进行优先排序。
{"title":"Feature pyramid attention network for audio-visual scene classification","authors":"Liguang Zhou,&nbsp;Yuhongze Zhou,&nbsp;Xiaonan Qi,&nbsp;Junjie Hu,&nbsp;Tin Lun Lam,&nbsp;Yangsheng Xu","doi":"10.1049/cit2.12375","DOIUrl":"10.1049/cit2.12375","url":null,"abstract":"<p>Audio-visual scene classification (AVSC) poses a formidable challenge owing to the intricate spatial-temporal relationships exhibited by audio-visual signals, coupled with the complex spatial patterns of objects and textures found in visual images. The focus of recent studies has predominantly revolved around extracting features from diverse neural network structures, inadvertently neglecting the acquisition of semantically meaningful regions and crucial components within audio-visual data. The authors present a feature pyramid attention network (FPANet) for audio-visual scene understanding, which extracts semantically significant characteristics from audio-visual data. The authors’ approach builds multi-scale hierarchical features of sound spectrograms and visual images using a feature pyramid representation and localises the semantically relevant regions with a feature pyramid attention module (FPAM). A dimension alignment (DA) strategy is employed to align feature maps from multiple layers, a pyramid spatial attention (PSA) to spatially locate essential regions, and a pyramid channel attention (PCA) to pinpoint significant temporal frames. Experiments on visual scene classification (VSC), audio scene classification (ASC), and AVSC tasks demonstrate that FPANet achieves performance on par with state-of-the-art (SOTA) approaches, with a 95.9 F1-score on the ADVANCE dataset and a relative improvement of 28.8%. Visualisation results show that FPANet can prioritise semantically meaningful areas in audio-visual signals.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 2","pages":"359-374"},"PeriodicalIF":7.3,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12375","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143857073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to ‘Trustworthy semi-supervised anomaly detection for online-to-offline logistics business in merchant identification’ 修正“商户识别中线上到线下物流业务的可信半监督异常检测”
IF 7.3 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-16 DOI: 10.1049/cit2.12392

Yong Li, Shuhang Wang, Shijie Xu, and Jiao Yin. 2024. Trustworthy semi-supervised anomaly detection for online-to-offline logistics business in merchant identification. CAAI Transactions on Intelligence Technology 9, 3 (June 2024), 544–556. https://doi.org/10.1049/cit2.12301.

In the section discussing the spatial distribution of fraud and normal merchants' shipping addresses, the following text needs correction:

Please replace Figure 1 and 2 with the following text ‘According to the data analysis results, the spatial distribution of fraud merchants' shipping addresses is characterised by sparsity (because fraud merchants ship on behalf of others, resulting in a large number of shipping addresses with few shipments per address), while the distribution of normal merchants' shipping addresses is characterised by density (as normal merchants typically ship from centralised warehouses, resulting in a small number of shipping addresses with a large number of shipments per address). These differences in shipping behaviour can provide significant assistance in detecting fraud merchants.’

We apologise for this error.

Please note that due to the deletion of two images, the order of subsequent images has been adjusted accordingly.

李勇、王书航、徐世杰、尹娇。2024.商户识别中线上到线下物流业务的可信半监督异常检测。CAAI Transactions on Intelligence Technology 9, 3 (June 2024), 544-556. https://doi.org/10.1049/cit2.12301.在讨论欺诈和正常商家收货地址的空间分布一节中,以下文字需要更正:请将图 1 和图 2 替换为以下文字'根据数据分析结果,欺诈商户收货地址的空间分布特点是稀疏(因为欺诈商户代他人发货,导致收货地址数量多而每个地址发货量少),而正常商户收货地址的分布特点是密集(因为正常商户通常从中央仓库发货,导致收货地址数量少而每个地址发货量大)。这些发货行为上的差异可以为检测欺诈商户提供重要帮助。"我们对这一错误表示歉意。请注意,由于删除了两张图片,后续图片的顺序已作相应调整。
{"title":"Correction to ‘Trustworthy semi-supervised anomaly detection for online-to-offline logistics business in merchant identification’","authors":"","doi":"10.1049/cit2.12392","DOIUrl":"10.1049/cit2.12392","url":null,"abstract":"<p>Yong Li, Shuhang Wang, Shijie Xu, and Jiao Yin. 2024. Trustworthy semi-supervised anomaly detection for online-to-offline logistics business in merchant identification. CAAI Transactions on Intelligence Technology 9, 3 (June 2024), 544–556. https://doi.org/10.1049/cit2.12301.</p><p>In the section discussing the spatial distribution of fraud and normal merchants' shipping addresses, the following text needs correction:</p><p>Please replace Figure 1 and 2 with the following text ‘According to the data analysis results, the spatial distribution of fraud merchants' shipping addresses is characterised by sparsity (because fraud merchants ship on behalf of others, resulting in a large number of shipping addresses with few shipments per address), while the distribution of normal merchants' shipping addresses is characterised by density (as normal merchants typically ship from centralised warehouses, resulting in a small number of shipping addresses with a large number of shipments per address). These differences in shipping behaviour can provide significant assistance in detecting fraud merchants.’</p><p>We apologise for this error.</p><p>Please note that due to the deletion of two images, the order of subsequent images has been adjusted accordingly.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 2","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12392","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143857097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph neural link predictor based on cycle structure 基于循环结构的图神经链预测器
IF 7.3 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-12 DOI: 10.1049/cit2.12396
Yanlin Yang, Zhonglin Ye, Lei Meng, Mingyuan Li, Haixing Zhao

Currently, the link prediction algorithms primarily focus on studying the interaction between nodes based on chain structure and star structure, which predominantly rely on low-order structural information and do not explore the multivariate interactions between nodes from the perspective of higher-order structural information present in the network. The cycle structure is a higher-order structure that lies between the star and clique structures, where all nodes within the same cycle can interact with each other, even in the absence of direct edges. If a node is encompassed by multiple cycles, it indicates that the node interacts and associates with a greater number of nodes in the network, and it means the node is more important in the network to some extent. Furthermore, if two nodes are included in multiple cycles, it signifies the two nodes are more likely to be connected. Therefore, firstly, a multi-information fusion node importance algorithm based on the cycle structure information is proposed, which integrates both high-order and low-order structural information. Secondly, the obtained integrated structure information and node feature information is regarded as the input features, a two-channel graph neural network model is designed to learn the cycle structure information. Then, the cycle structure information is utilised for the task of link prediction, and a graph neural link predictor with multi-information interactions based on the cycle structure is developed. Finally, extensive experimental validation and analysis show that the node ranking result of the proposed node importance index is more consistent with the actual situation, the proposed graph neural network model can effectively learn the cycle structure information, and using higher-order structural information—cycle information proves to significantly enhance the overall link prediction performance.

目前的链路预测算法主要是基于链状结构和星形结构研究节点间的相互作用,主要依赖于低阶结构信息,没有从网络中存在的高阶结构信息的角度探索节点间的多元相互作用。环结构是位于星形结构和团状结构之间的高阶结构,在同一环内的所有节点可以相互作用,即使没有直接边。如果一个节点被多个循环所包围,则表明该节点与网络中更多的节点交互和关联,在某种程度上意味着该节点在网络中更重要。此外,如果两个节点包含在多个循环中,则表明这两个节点更有可能连接。为此,首先提出了一种基于循环结构信息的多信息融合节点重要性算法,将高阶和低阶结构信息融合在一起;其次,将得到的综合结构信息和节点特征信息作为输入特征,设计双通道图神经网络模型学习循环结构信息;然后,将循环结构信息用于链路预测任务,开发了基于循环结构的多信息交互的图神经链路预测器。最后,大量的实验验证和分析表明,本文提出的节点重要性指标的节点排序结果更符合实际情况,所提出的图神经网络模型可以有效地学习循环结构信息,并且使用高阶结构信息循环信息被证明可以显著提高整体链路预测性能。
{"title":"Graph neural link predictor based on cycle structure","authors":"Yanlin Yang,&nbsp;Zhonglin Ye,&nbsp;Lei Meng,&nbsp;Mingyuan Li,&nbsp;Haixing Zhao","doi":"10.1049/cit2.12396","DOIUrl":"10.1049/cit2.12396","url":null,"abstract":"<p>Currently, the link prediction algorithms primarily focus on studying the interaction between nodes based on chain structure and star structure, which predominantly rely on low-order structural information and do not explore the multivariate interactions between nodes from the perspective of higher-order structural information present in the network. The cycle structure is a higher-order structure that lies between the star and clique structures, where all nodes within the same cycle can interact with each other, even in the absence of direct edges. If a node is encompassed by multiple cycles, it indicates that the node interacts and associates with a greater number of nodes in the network, and it means the node is more important in the network to some extent. Furthermore, if two nodes are included in multiple cycles, it signifies the two nodes are more likely to be connected. Therefore, firstly, a multi-information fusion node importance algorithm based on the cycle structure information is proposed, which integrates both high-order and low-order structural information. Secondly, the obtained integrated structure information and node feature information is regarded as the input features, a two-channel graph neural network model is designed to learn the cycle structure information. Then, the cycle structure information is utilised for the task of link prediction, and a graph neural link predictor with multi-information interactions based on the cycle structure is developed. Finally, extensive experimental validation and analysis show that the node ranking result of the proposed node importance index is more consistent with the actual situation, the proposed graph neural network model can effectively learn the cycle structure information, and using higher-order structural information—cycle information proves to significantly enhance the overall link prediction performance.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 2","pages":"615-632"},"PeriodicalIF":7.3,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12396","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143856978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Domain-independent adaptive histogram-based features for pomegranate fruit and leaf diseases classification 基于自适应直方图特征的石榴果实和叶片病害分类与领域无关
IF 7.3 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-12 DOI: 10.1049/cit2.12390
Mohanmuralidhar Prajwala, Prabhuswamy Prajwal Kumar, Shanubhog Maheshwarappa Gopinath, Shivakumara Palaiahnakote, Mahadevappa Basavanna, Daniel P. Lopresti

Disease identification for fruits and leaves in the field of agriculture is important for estimating production, crop yield, and earnings for farmers. In the specific case of pomegranates, this is challenging because of the wide range of possible diseases and their effects on the plant and the crop. This study presents an adaptive histogram-based method for solving this problem. Our method describe is domain independent in the sense that it can be easily and efficiently adapted to other similar smart agriculture tasks. The approach explores colour spaces, namely, Red, Green, and Blue along with Grey. The histograms of colour spaces and grey space are analysed based on the notion that as the disease changes, the colour also changes. The proximity between the histograms of grey images with individual colour spaces is estimated to find the closeness of images. Since the grey image is the average of colour spaces (R, G, and B), it can be considered a reference image. For estimating the distance between grey and colour spaces, the proposed approach uses a Chi-Square distance measure. Further, the method uses an Artificial Neural Network for classification. The effectiveness of our approach is demonstrated by testing on a dataset of fruit and leaf images affected by different diseases. The results show that the method outperforms existing techniques in terms of average classification rate.

农业领域的果实和叶片病害鉴定对于估算产量、作物产量和农民收入非常重要。就石榴的具体情况而言,由于可能发生的病害及其对植物和作物的影响范围很广,因此这项工作具有挑战性。本研究提出了一种基于直方图的自适应方法来解决这一问题。我们所描述的方法不受领域限制,可以轻松高效地适用于其他类似的智能农业任务。该方法探索了色彩空间,即红色、绿色、蓝色和灰色。颜色空间和灰色空间的直方图是根据疾病变化时颜色也随之变化这一概念进行分析的。通过估算灰度图像直方图与各个色彩空间之间的接近程度,可以发现图像的接近程度。由于灰度图像是色彩空间(R、G 和 B)的平均值,因此可将其视为参考图像。为了估算灰度空间和彩色空间之间的距离,建议的方法使用了 Chi-Square 距离测量法。此外,该方法还使用人工神经网络进行分类。通过对受不同疾病影响的水果和树叶图像数据集进行测试,证明了我们方法的有效性。结果表明,就平均分类率而言,该方法优于现有技术。
{"title":"Domain-independent adaptive histogram-based features for pomegranate fruit and leaf diseases classification","authors":"Mohanmuralidhar Prajwala,&nbsp;Prabhuswamy Prajwal Kumar,&nbsp;Shanubhog Maheshwarappa Gopinath,&nbsp;Shivakumara Palaiahnakote,&nbsp;Mahadevappa Basavanna,&nbsp;Daniel P. Lopresti","doi":"10.1049/cit2.12390","DOIUrl":"10.1049/cit2.12390","url":null,"abstract":"<div>\u0000 \u0000 \u0000 <section>\u0000 \u0000 <p>Disease identification for fruits and leaves in the field of agriculture is important for estimating production, crop yield, and earnings for farmers. In the specific case of pomegranates, this is challenging because of the wide range of possible diseases and their effects on the plant and the crop. This study presents an adaptive histogram-based method for solving this problem. Our method describe is domain independent in the sense that it can be easily and efficiently adapted to other similar smart agriculture tasks. The approach explores colour spaces, namely, Red, Green, and Blue along with Grey. The histograms of colour spaces and grey space are analysed based on the notion that as the disease changes, the colour also changes. The proximity between the histograms of grey images with individual colour spaces is estimated to find the closeness of images. Since the grey image is the average of colour spaces (R, G, and B), it can be considered a reference image. For estimating the distance between grey and colour spaces, the proposed approach uses a Chi-Square distance measure. Further, the method uses an Artificial Neural Network for classification. The effectiveness of our approach is demonstrated by testing on a dataset of fruit and leaf images affected by different diseases. The results show that the method outperforms existing techniques in terms of average classification rate.</p>\u0000 </section>\u0000 </div>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 2","pages":"317-336"},"PeriodicalIF":7.3,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12390","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143857011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-station multi-robot task assignment method based on deep reinforcement learning 基于深度强化学习的多工位多机器人任务分配方法
IF 7.3 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-07 DOI: 10.1049/cit2.12394
Junnan Zhang, Ke Wang, Chaoxu Mu

This paper focuses on the problem of multi-station multi-robot spot welding task assignment, and proposes a deep reinforcement learning (DRL) framework, which is made up of a public graph attention network and independent policy networks. The graph of welding spots distribution is encoded using the graph attention network. Independent policy networks with attention mechanism as a decoder can handle the encoded graph and decide to assign robots to different tasks. The policy network is used to convert the large scale welding spots allocation problem to multiple small scale single-robot welding path planning problems, and the path planning problem is quickly solved through existing methods. Then, the model is trained through reinforcement learning. In addition, the task balancing method is used to allocate tasks to multiple stations. The proposed algorithm is compared with classical algorithms, and the results show that the algorithm based on DRL can produce higher quality solutions.

针对多工位多机器人点焊任务分配问题,提出了一种由公共图关注网络和独立策略网络组成的深度强化学习框架。利用图注意网络对焊点分布图进行编码。以注意力机制作为解码器的独立策略网络可以处理编码后的图,并决定给机器人分配不同的任务。利用策略网络将大规模焊点分配问题转化为多个小型单机器人焊接路径规划问题,并通过现有方法快速求解路径规划问题。然后通过强化学习对模型进行训练。此外,还采用任务均衡的方法将任务分配到多个站点。将该算法与经典算法进行了比较,结果表明基于DRL的算法可以得到更高质量的解。
{"title":"Multi-station multi-robot task assignment method based on deep reinforcement learning","authors":"Junnan Zhang,&nbsp;Ke Wang,&nbsp;Chaoxu Mu","doi":"10.1049/cit2.12394","DOIUrl":"10.1049/cit2.12394","url":null,"abstract":"<p>This paper focuses on the problem of multi-station multi-robot spot welding task assignment, and proposes a deep reinforcement learning (DRL) framework, which is made up of a public graph attention network and independent policy networks. The graph of welding spots distribution is encoded using the graph attention network. Independent policy networks with attention mechanism as a decoder can handle the encoded graph and decide to assign robots to different tasks. The policy network is used to convert the large scale welding spots allocation problem to multiple small scale single-robot welding path planning problems, and the path planning problem is quickly solved through existing methods. Then, the model is trained through reinforcement learning. In addition, the task balancing method is used to allocate tasks to multiple stations. The proposed algorithm is compared with classical algorithms, and the results show that the algorithm based on DRL can produce higher quality solutions.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 1","pages":"134-146"},"PeriodicalIF":7.3,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12394","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143533414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal performance design of bat algorithm: An adaptive multi-stage structure 蝙蝠算法的最优性能设计:一种自适应多级结构
IF 7.3 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-04 DOI: 10.1049/cit2.12377
Helong Yu, Jiuman Song, Chengcheng Chen, Ali Asghar Heidari, Yuntao Ma, Huiling Chen, Yudong Zhang

The bat algorithm (BA) is a metaheuristic algorithm for global optimisation that simulates the echolocation behaviour of bats with varying pulse rates of emission and loudness, which can be used to find the globally optimal solutions for various optimisation problems. Knowing the recent criticises of the originality of equations, the principle of BA is concise and easy to implement, and its mathematical structure can be seen as a hybrid particle swarm with simulated annealing. In this research, the authors focus on the performance optimisation of BA as a solver rather than discussing its originality issues. In terms of operation effect, BA has an acceptable convergence speed. However, due to the low proportion of time used to explore the search space, it is easy to converge prematurely and fall into the local optima. The authors propose an adaptive multi-stage bat algorithm (AMSBA). By tuning the algorithm's focus at three different stages of the search process, AMSBA can achieve a better balance between exploration and exploitation and improve its exploration ability by enhancing its performance in escaping local optima as well as maintaining a certain convergence speed. Therefore, AMSBA can achieve solutions with better quality. A convergence analysis was conducted to demonstrate the global convergence of AMSBA. The authors also perform simulation experiments on 30 benchmark functions from IEEE CEC 2017 as the objective functions and compare AMSBA with some original and improved swarm-based algorithms. The results verify the effectiveness and superiority of AMSBA. AMSBA is also compared with eight representative optimisation algorithms on 10 benchmark functions derived from IEEE CEC 2020, while this experiment is carried out on five different dimensions of the objective functions respectively. A balance and diversity analysis was performed on AMSBA to demonstrate its improvement over the original BA in terms of balance. AMSBA was also applied to the multi-threshold image segmentation of Citrus Macular disease, which is a bacterial infection that causes lesions on citrus trees. The segmentation results were analysed by comparing each comparative algorithm's peak signal-to-noise ratio, structural similarity index and feature similarity index. The results show that the proposed BA-based algorithm has apparent advantages, and it can effectively segment the disease spots from citrus leaves when the segmentation threshold is at a low level. Based on a comprehensive study, the authors think the proposed optimiser has mitigated the main drawbacks of the BA, and it can be utilised as an effective optimisation tool.

蝙蝠算法(BA)是一种全局优化的元启发式算法,它模拟了蝙蝠在不同脉冲发射率和响度下的回声定位行为,可用于寻找各种优化问题的全局最优解。考虑到最近对方程原创性的批评,BA原理简洁,易于实现,其数学结构可以看作是模拟退火的混合粒子群。在本研究中,作者将重点放在BA作为求解器的性能优化上,而不是讨论其独创性问题。在操作效果方面,BA具有可接受的收敛速度。然而,由于用于探索搜索空间的时间比例较低,容易过早收敛,陷入局部最优。提出了一种自适应多阶段蝙蝠算法(AMSBA)。通过在搜索过程的三个不同阶段调整算法的重点,AMSBA可以更好地平衡探索和开发,在提高逃避局部最优性能的同时保持一定的收敛速度,从而提高算法的探索能力。因此,AMSBA可以获得质量更好的解决方案。通过收敛性分析证明了AMSBA的全局收敛性。作者还以IEEE CEC 2017中的30个基准函数作为目标函数进行了仿真实验,并将AMSBA与一些原始的和改进的基于群的算法进行了比较。结果验证了AMSBA的有效性和优越性。在IEEE CEC 2020衍生的10个基准函数上,AMSBA与8种具有代表性的优化算法进行了比较,并分别在目标函数的5个不同维度上进行了实验。对AMSBA进行了平衡和多样性分析,以证明其在平衡方面优于原始BA。AMSBA还应用于柑橘黄斑病的多阈值图像分割,柑橘黄斑病是一种引起柑橘树病变的细菌感染。通过比较各算法的峰值信噪比、结构相似度和特征相似度对分割结果进行分析。结果表明,本文提出的基于ba的算法具有明显的优势,在较低的分割阈值下,能够有效地分割柑橘叶片上的病斑。基于一项全面的研究,作者认为所提出的优化器已经减轻了BA的主要缺点,并且它可以被用作有效的优化工具。
{"title":"Optimal performance design of bat algorithm: An adaptive multi-stage structure","authors":"Helong Yu,&nbsp;Jiuman Song,&nbsp;Chengcheng Chen,&nbsp;Ali Asghar Heidari,&nbsp;Yuntao Ma,&nbsp;Huiling Chen,&nbsp;Yudong Zhang","doi":"10.1049/cit2.12377","DOIUrl":"10.1049/cit2.12377","url":null,"abstract":"<p>The bat algorithm (BA) is a metaheuristic algorithm for global optimisation that simulates the echolocation behaviour of bats with varying pulse rates of emission and loudness, which can be used to find the globally optimal solutions for various optimisation problems. Knowing the recent criticises of the originality of equations, the principle of BA is concise and easy to implement, and its mathematical structure can be seen as a hybrid particle swarm with simulated annealing. In this research, the authors focus on the performance optimisation of BA as a solver rather than discussing its originality issues. In terms of operation effect, BA has an acceptable convergence speed. However, due to the low proportion of time used to explore the search space, it is easy to converge prematurely and fall into the local optima. The authors propose an adaptive multi-stage bat algorithm (AMSBA). By tuning the algorithm's focus at three different stages of the search process, AMSBA can achieve a better balance between exploration and exploitation and improve its exploration ability by enhancing its performance in escaping local optima as well as maintaining a certain convergence speed. Therefore, AMSBA can achieve solutions with better quality. A convergence analysis was conducted to demonstrate the global convergence of AMSBA. The authors also perform simulation experiments on 30 benchmark functions from IEEE CEC 2017 as the objective functions and compare AMSBA with some original and improved swarm-based algorithms. The results verify the effectiveness and superiority of AMSBA. AMSBA is also compared with eight representative optimisation algorithms on 10 benchmark functions derived from IEEE CEC 2020, while this experiment is carried out on five different dimensions of the objective functions respectively. A balance and diversity analysis was performed on AMSBA to demonstrate its improvement over the original BA in terms of balance. AMSBA was also applied to the multi-threshold image segmentation of Citrus Macular disease, which is a bacterial infection that causes lesions on citrus trees. The segmentation results were analysed by comparing each comparative algorithm's peak signal-to-noise ratio, structural similarity index and feature similarity index. The results show that the proposed BA-based algorithm has apparent advantages, and it can effectively segment the disease spots from citrus leaves when the segmentation threshold is at a low level. Based on a comprehensive study, the authors think the proposed optimiser has mitigated the main drawbacks of the BA, and it can be utilised as an effective optimisation tool.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 3","pages":"755-814"},"PeriodicalIF":7.3,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12377","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144502961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
CAAI Transactions on Intelligence Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1