CAAI Transactions on Intelligence Technology最新文献_第6页

A topic-controllable keywords-to-text generator with knowledge base network 带知识库网络的主题可控关键词到文本生成器

IF 5.1 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology

Pub Date : 2024-01-13 DOI: 10.1049/cit2.12280

Li He, Kaize Shi, Dingxian Wang, Xianzhi Wang, Guandong Xu

With the introduction of more recent deep learning models such as encoder-decoder, text generation frameworks have gained a lot of popularity. In Natural Language Generation (NLG), controlling the information and style of the output produced is a crucial and challenging task. The purpose of this paper is to develop informative and controllable text using social media language by incorporating topic knowledge into a keyword-to-text framework. A novel Topic-Controllable Key-to-Text (TC-K2T) generator that focuses on the issues of ignoring unordered keywords and utilising subject-controlled information from previous research is presented. TC-K2T is built on the framework of conditional language encoders. In order to guide the model to produce an informative and controllable language, the generator first inputs unordered keywords and uses subjects to simulate prior human knowledge. Using an additional probability term, the model increases the likelihood of topic words appearing in the generated text to bias the overall distribution. The proposed TC-K2T can produce more informative and controllable senescence, outperforming state-of-the-art models, according to empirical research on automatic evaluation metrics and human annotations.

随着编码器-解码器等最新深度学习模型的引入，文本生成框架受到了广泛欢迎。在自然语言生成（NLG）中，控制输出的信息和风格是一项关键而又具有挑战性的任务。本文的目的是通过将主题知识纳入关键词到文本框架，利用社交媒体语言开发信息丰富且可控的文本。本文介绍了一种新颖的 "主题可控关键字到文本"（TC-K2T）生成器，该生成器重点解决了忽略无序关键字和利用以往研究中的主题控制信息的问题。TC-K2T 建立在条件语言编码器的框架之上。为了引导模型生成信息丰富且可控的语言，生成器首先输入无序关键词，并利用受试者模拟人类的先验知识。利用附加的概率项，模型增加了主题词出现在生成文本中的可能性，从而使整体分布出现偏差。根据对自动评估指标和人类注释的实证研究，所提出的 TC-K2T 可以生成信息量更大、更可控的衰老语，其性能优于最先进的模型。

{"title":"A topic-controllable keywords-to-text generator with knowledge base network","authors":"Li He, Kaize Shi, Dingxian Wang, Xianzhi Wang, Guandong Xu","doi":"10.1049/cit2.12280","DOIUrl":"10.1049/cit2.12280","url":null,"abstract":"With the introduction of more recent deep learning models such as encoder-decoder, text generation frameworks have gained a lot of popularity. In Natural Language Generation (NLG), controlling the information and style of the output produced is a crucial and challenging task. The purpose of this paper is to develop informative and controllable text using social media language by incorporating topic knowledge into a keyword-to-text framework. A novel Topic-Controllable Key-to-Text (TC-K2T) generator that focuses on the issues of ignoring unordered keywords and utilising subject-controlled information from previous research is presented. TC-K2T is built on the framework of conditional language encoders. In order to guide the model to produce an informative and controllable language, the generator first inputs unordered keywords and uses subjects to simulate prior human knowledge. Using an additional probability term, the model increases the likelihood of topic words appearing in the generated text to bias the overall distribution. The proposed TC-K2T can produce more informative and controllable senescence, outperforming state-of-the-art models, according to empirical research on automatic evaluation metrics and human annotations.","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 3","pages":"585-594"},"PeriodicalIF":5.1,"publicationDate":"2024-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12280","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139530560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Extraction of intersecting palm-vein and palmprint features for cancellable identity verification 提取手掌静脉和手掌指纹的交集特征，用于可注销身份验证

IF 5.1 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology

Pub Date : 2024-01-11 DOI: 10.1049/cit2.12277

Jaekwon Lee, Beom-Seok Oh, Kar-Ann Toh

A novel method based on the cross-modality intersecting features of the palm-vein and the palmprint is proposed for identity verification. Capitalising on the unique geometrical relationship between the two biometric modalities, the cross-modality intersecting points provides a stable set of features for identity verification. To facilitate flexibility in template changes, a template transformation is proposed. While maintaining non-invertibility, the template transformation allows transformation sizes beyond that offered by the conventional means. Extensive experiments using three public palm databases are conducted to verify the effectiveness the proposed system for identity recognition.

提出了一种基于掌静脉和掌纹跨模态交点特征的身份验证新方法。跨模态交叉点利用两种生物识别模式之间独特的几何关系，为身份验证提供了一组稳定的特征。为了便于灵活更换模板，提出了一种模板变换方法。在保持不可逆性的同时，模板变换允许的变换大小超出了传统方法所能提供的范围。我们利用三个公共手掌数据库进行了广泛的实验，以验证所提出的系统在身份识别方面的有效性。

引用次数: 0

A deep learning fusion model for accurate classification of brain tumours in Magnetic Resonance images 用于对磁共振图像中的脑肿瘤进行准确分类的深度学习融合模型

IF 8.4 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology

Pub Date : 2024-01-04 DOI: 10.1049/cit2.12276

Nechirvan Asaad Zebari, Chira Nadheef Mohammed, Dilovan Asaad Zebari, Mazin Abed Mohammed, Diyar Qader Zeebaree, Haydar Abdulameer Marhoon, Karrar Hameed Abdulkareem, Seifedine Kadry, Wattana Viriyasitavat, Jan Nedoma, Radek Martinek

Detecting brain tumours is complex due to the natural variation in their location, shape, and intensity in images. While having accurate detection and segmentation of brain tumours would be beneficial, current methods still need to solve this problem despite the numerous available approaches. Precise analysis of Magnetic Resonance Imaging (MRI) is crucial for detecting, segmenting, and classifying brain tumours in medical diagnostics. Magnetic Resonance Imaging is a vital component in medical diagnosis, and it requires precise, efficient, careful, efficient, and reliable image analysis techniques. The authors developed a Deep Learning (DL) fusion model to classify brain tumours reliably. Deep Learning models require large amounts of training data to achieve good results, so the researchers utilised data augmentation techniques to increase the dataset size for training models. VGG16, ResNet50, and convolutional deep belief networks networks extracted deep features from MRI images. Softmax was used as the classifier, and the training set was supplemented with intentionally created MRI images of brain tumours in addition to the genuine ones. The features of two DL models were combined in the proposed model to generate a fusion model, which significantly increased classification accuracy. An openly accessible dataset from the internet was used to test the model's performance, and the experimental results showed that the proposed fusion model achieved a classification accuracy of 98.98%. Finally, the results were compared with existing methods, and the proposed model outperformed them significantly.

由于脑肿瘤在图像中的位置、形状和强度存在自然变化，因此检测脑肿瘤非常复杂。对脑肿瘤进行精确检测和分割固然有好处，但尽管有许多可用的方法，目前的方法仍需解决这一问题。在医疗诊断中，精确分析磁共振成像（MRI）对于检测、分割和分类脑肿瘤至关重要。磁共振成像是医学诊断的重要组成部分，需要精确、高效、细致、高效和可靠的图像分析技术。作者开发了一种深度学习（DL）融合模型，用于对脑肿瘤进行可靠的分类。深度学习模型需要大量的训练数据才能取得良好的效果，因此研究人员利用数据增强技术来增加训练模型的数据集规模。VGG16、ResNet50和卷积深度信念网络从核磁共振成像图像中提取深度特征。使用 Softmax 作为分类器，并在训练集中添加了除真实图像外有意创建的脑肿瘤 MRI 图像。在所提出的模型中，两个 DL 模型的特征被结合在一起，生成一个融合模型，从而显著提高了分类准确率。实验结果表明，融合模型的分类准确率达到了 98.98%。最后，将实验结果与现有方法进行了比较，发现所提出的模型明显优于现有方法。

{"title":"A deep learning fusion model for accurate classification of brain tumours in Magnetic Resonance images","authors":"Nechirvan Asaad Zebari, Chira Nadheef Mohammed, Dilovan Asaad Zebari, Mazin Abed Mohammed, Diyar Qader Zeebaree, Haydar Abdulameer Marhoon, Karrar Hameed Abdulkareem, Seifedine Kadry, Wattana Viriyasitavat, Jan Nedoma, Radek Martinek","doi":"10.1049/cit2.12276","DOIUrl":"10.1049/cit2.12276","url":null,"abstract":"Detecting brain tumours is complex due to the natural variation in their location, shape, and intensity in images. While having accurate detection and segmentation of brain tumours would be beneficial, current methods still need to solve this problem despite the numerous available approaches. Precise analysis of Magnetic Resonance Imaging (MRI) is crucial for detecting, segmenting, and classifying brain tumours in medical diagnostics. Magnetic Resonance Imaging is a vital component in medical diagnosis, and it requires precise, efficient, careful, efficient, and reliable image analysis techniques. The authors developed a Deep Learning (DL) fusion model to classify brain tumours reliably. Deep Learning models require large amounts of training data to achieve good results, so the researchers utilised data augmentation techniques to increase the dataset size for training models. VGG16, ResNet50, and convolutional deep belief networks networks extracted deep features from MRI images. Softmax was used as the classifier, and the training set was supplemented with intentionally created MRI images of brain tumours in addition to the genuine ones. The features of two DL models were combined in the proposed model to generate a fusion model, which significantly increased classification accuracy. An openly accessible dataset from the internet was used to test the model's performance, and the experimental results showed that the proposed fusion model achieved a classification accuracy of 98.98%. Finally, the results were compared with existing methods, and the proposed model outperformed them significantly.","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 4","pages":"790-804"},"PeriodicalIF":8.4,"publicationDate":"2024-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12276","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139536199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Guest Editorial: Special issue on intelligence technology for remote sensing image 特邀编辑：遥感图像智能技术特刊

IF 5.1 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology

Pub Date : 2023-12-07 DOI: 10.1049/cit2.12275

Xiangtao Zheng, Benoit Vozel, Danfeng Hong

With the development of artificial intelligence, remote sensing scene interpretation task has attracted extensive attention, which mainly includes scene classification, target detection, hyperspectral classification, and multi-modal analysis. The remote sensing scene interpretation has effectively promoted the development of the Earth observation field. It was the intention for this Special Issue to serve as a platform for the publication of the most recent research concepts from remote sensing image.To recognise remote sensing scenes, several methods have been proposed to represent the scene image. The first paper (Zhang et al.) proposes a lightweight privacy-preserving recognition framework which diffuses the error between the encryption block and the original block to adjacent blocks which makes the transmission of high-resolution images more secure and efficient. The second paper (Ning et al.) introduces a knowledge distillation network for aerial scene recognition, which produces consistent predictions by distilling the predictive distribution between different scales. With the development of scene recognition task, its branch scene retrieval task also emerges. In this regard, the third paper (Yuan et al.) shows how to efficiently optimise the average accuracy to improve remote sensing image retrieval. This approach enables a more flexible optimisation strategy by involving positive post-samples, which provides a new way to improve the retrieval performance.To detect targets, a series of advanced methods have been developed to improve detection accuracy and efficiency. The fourth paper (Zhang et al.) proposes an intelligent anchor learning strategy for arbitrary orientation target detection. The fifth paper (Ma et al.) focuses on infrared image detection of small and weak targets and proposes an efficient deep learning method. The sixth paper (Zhou et al.) proposes a convolutional transformer method based on spectral-spatial sequence features for hyperspectral image change detection. With the maturity of target detection techniques, researchers have begun to focus on more complex challenges, namely anomaly detection. In this subfield, the seventh paper (Wang et al.) provides a new solution for semi-supervised hyperspectral anomaly detection. It maps the raw spectrum into the fractional Fourier domain, thereby enhancing the distinguishability between background and anomaly. Meanwhile, the eighth paper (Zhao et al.) utilises a memory-enhanced self-encoder to improve the separation of anomaly samples from background in hyperspectral images. These studies demonstrate the rapid development in the target detection field, such as change detection and anomaly detection.To classify hyperspectral images, the ninth paper (Liao et al.) shows how to integrate the features of convolutional neural networks and transformers to enhance the performance of hyperspectral image classification. This approach fully utilises the respective a

随着人工智能的发展，遥感场景解译任务受到了广泛的关注，主要包括场景分类、目标检测、高光谱分类和多模态分析。遥感场景解译有效地促进了对地观测领域的发展。本期特刊的目的是作为一个平台，发表遥感影像的最新研究概念。为了识别遥感场景，提出了几种场景图像表示方法。第一篇论文(Zhang et al.)提出了一种轻量级的隐私保护识别框架，该框架将加密块与原始块之间的错误扩散到相邻块，从而使高分辨率图像的传输更加安全高效。第二篇论文(Ning et al.)介绍了一种用于航景识别的知识蒸馏网络，该网络通过提取不同尺度之间的预测分布来产生一致的预测。随着场景识别任务的发展，其分支场景检索任务也应运而生。在这方面，第三篇论文(Yuan et al.)展示了如何有效地优化平均精度以提高遥感图像检索。该方法通过引入阳性后样本实现了更灵活的优化策略，为提高检索性能提供了一种新的方法。为了检测目标，人们开发了一系列先进的方法来提高检测精度和效率。第四篇论文(Zhang等人)提出了一种用于任意方向目标检测的智能锚点学习策略。第五篇论文(Ma et al.)专注于红外图像弱小目标的检测，提出了一种高效的深度学习方法。第六篇论文(Zhou et al.)提出了一种基于光谱空间序列特征的卷积变换方法，用于高光谱图像变化检测。随着目标检测技术的成熟，研究人员开始关注更复杂的挑战，即异常检测。在该子领域，第七篇论文(Wang et al.)为半监督高光谱异常检测提供了一种新的解决方案。它将原始光谱映射到分数傅里叶域，从而增强背景和异常之间的可区分性。同时，第八篇论文(Zhao et al.)利用记忆增强的自编码器来提高高光谱图像中异常样本与背景的分离。这些研究表明了变化检测和异常检测等目标检测领域的快速发展。为了对高光谱图像进行分类，第九篇论文(Liao et al.)展示了如何结合卷积神经网络和变压器的特点来提高高光谱图像的分类性能。该方法充分利用了卷积网络和变压器各自的优势，为高光谱图像的特征提取提供了全面的解决方案。此外，第十篇论文(Xie et al.)采用融合语义、空间和频谱特征的变压器网络，展示了多种信息类型的组合如何提高分类的准确性和鲁棒性。同时，第11篇论文(Ran et al.)将深度变压器建模与小样本学习相结合，解决了高光谱图像分类中的挑战，特别是在样本数量有限的情况下。该方法充分利用了少量样本中的信息，有效地提高了分类模型的泛化能力。多模态分析增加了观测地球表面和处理具有挑战性问题的能力。第12篇论文(Hong et al.)利用多光谱遥感数据和地理加权回归实验揭示了绿色基础设施布局对缓解城市热岛效应的重要性。第13篇论文(Zhang et al.)介绍了降水预测的多任务框架。将雷达回波图像与其他辅助任务相结合，提高了降水预报的精度和效率。最后，第十四篇论文(Zhang et al.)通过融合视觉和音频数据来改进机器人在动态环境中的自定位和环境感知。该融合方法在多机器人协同场景下具有良好的稳定性和重构性能。这些研究证明了多源数据分析在改善环境监测、预测和机器人导航等领域的潜力。我们感谢所有作者的投稿和所有审稿人的宝贵评论和意见。我们希望这期特刊能够激发研究界在遥感场景解译方面的新成果。国家自然科学基金项目，资助/奖励号:62271484。

{"title":"Guest Editorial: Special issue on intelligence technology for remote sensing image","authors":"Xiangtao Zheng, Benoit Vozel, Danfeng Hong","doi":"10.1049/cit2.12275","DOIUrl":"https://doi.org/10.1049/cit2.12275","url":null,"abstract":"With the development of artificial intelligence, remote sensing scene interpretation task has attracted extensive attention, which mainly includes scene classification, target detection, hyperspectral classification, and multi-modal analysis. The remote sensing scene interpretation has effectively promoted the development of the Earth observation field. It was the intention for this Special Issue to serve as a platform for the publication of the most recent research concepts from remote sensing image.To recognise remote sensing scenes, several methods have been proposed to represent the scene image. The first paper (Zhang et al.) proposes a lightweight privacy-preserving recognition framework which diffuses the error between the encryption block and the original block to adjacent blocks which makes the transmission of high-resolution images more secure and efficient. The second paper (Ning et al.) introduces a knowledge distillation network for aerial scene recognition, which produces consistent predictions by distilling the predictive distribution between different scales. With the development of scene recognition task, its branch scene retrieval task also emerges. In this regard, the third paper (Yuan et al.) shows how to efficiently optimise the average accuracy to improve remote sensing image retrieval. This approach enables a more flexible optimisation strategy by involving positive post-samples, which provides a new way to improve the retrieval performance.To detect targets, a series of advanced methods have been developed to improve detection accuracy and efficiency. The fourth paper (Zhang et al.) proposes an intelligent anchor learning strategy for arbitrary orientation target detection. The fifth paper (Ma et al.) focuses on infrared image detection of small and weak targets and proposes an efficient deep learning method. The sixth paper (Zhou et al.) proposes a convolutional transformer method based on spectral-spatial sequence features for hyperspectral image change detection. With the maturity of target detection techniques, researchers have begun to focus on more complex challenges, namely anomaly detection. In this subfield, the seventh paper (Wang et al.) provides a new solution for semi-supervised hyperspectral anomaly detection. It maps the raw spectrum into the fractional Fourier domain, thereby enhancing the distinguishability between background and anomaly. Meanwhile, the eighth paper (Zhao et al.) utilises a memory-enhanced self-encoder to improve the separation of anomaly samples from background in hyperspectral images. These studies demonstrate the rapid development in the target detection field, such as change detection and anomaly detection.To classify hyperspectral images, the ninth paper (Liao et al.) shows how to integrate the features of convolutional neural networks and transformers to enhance the performance of hyperspectral image classification. This approach fully utilises the respective a","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"8 4","pages":"1164-1165"},"PeriodicalIF":5.1,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12275","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138678873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Learning feature alignment and dual correlation for few-shot image classification 学习特征对齐和双相关性，实现少镜头图像分类

IF 5.1 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology

Pub Date : 2023-12-05 DOI: 10.1049/cit2.12273

Xilang Huang, Seon Han Choi

Few-shot image classification is the task of classifying novel classes using extremely limited labelled samples. To perform classification using the limited samples, one solution is to learn the feature alignment (FA) information between the labelled and unlabelled sample features. Most FA methods use the feature mean as the class prototype and calculate the correlation between prototype and unlabelled features to learn an alignment strategy. However, mean prototypes tend to degenerate informative features because spatial features at the same position may not be equally important for the final classification, leading to inaccurate correlation calculations. Therefore, the authors propose an effective intraclass FA strategy that aggregates semantically similar spatial features from an adaptive reference prototype in low-dimensional feature space to obtain an informative prototype feature map for precise correlation computation. Moreover, a dual correlation module to learn the hard and soft correlations was developed by the authors. This module combines the correlation information between the prototype and unlabelled features in both the original and learnable feature spaces, aiming to produce a comprehensive cross-correlation between the prototypes and unlabelled features. Using both FA and cross-attention modules, our model can maintain informative class features and capture important shared features for classification. Experimental results on three few-shot classification benchmarks show that the proposed method outperformed related methods and resulted in a 3% performance boost in the 1-shot setting by inserting the proposed module into the related methods.

少射图像分类是使用极其有限的标记样本对新类别进行分类的任务。为了使用有限的样本进行分类，一种解决方案是学习标记和未标记样本特征之间的特征对齐(FA)信息。大多数遗传算法使用特征均值作为类原型，通过计算原型与未标记特征之间的相关性来学习对齐策略。然而，由于同一位置的空间特征对最终分类的重要性可能不相同，平均原型往往会使信息特征退化，从而导致相关性计算不准确。因此，作者提出了一种有效的类内特征分析策略，该策略将语义相似的空间特征从自适应参考原型中聚集到低维特征空间中，以获得信息丰富的原型特征图，用于精确的相关性计算。此外，作者还开发了一个双相关模块来学习硬相关和软相关。该模块结合了原始特征空间和可学习特征空间中原型和未标记特征之间的相关信息，旨在产生原型和未标记特征之间的全面交叉相关。使用FA和交叉注意模块，我们的模型可以维护信息丰富的类特征，并捕获重要的共享特征进行分类。实验结果表明，通过将所提出的模块插入到相关方法中，所提出的方法优于相关方法，并且在1 - shot设置中性能提升了3%。

{"title":"Learning feature alignment and dual correlation for few-shot image classification","authors":"Xilang Huang, Seon Han Choi","doi":"10.1049/cit2.12273","DOIUrl":"10.1049/cit2.12273","url":null,"abstract":"Few-shot image classification is the task of classifying novel classes using extremely limited labelled samples. To perform classification using the limited samples, one solution is to learn the feature alignment (FA) information between the labelled and unlabelled sample features. Most FA methods use the feature mean as the class prototype and calculate the correlation between prototype and unlabelled features to learn an alignment strategy. However, mean prototypes tend to degenerate informative features because spatial features at the same position may not be equally important for the final classification, leading to inaccurate correlation calculations. Therefore, the authors propose an effective intraclass FA strategy that aggregates semantically similar spatial features from an adaptive reference prototype in low-dimensional feature space to obtain an informative prototype feature map for precise correlation computation. Moreover, a dual correlation module to learn the hard and soft correlations was developed by the authors. This module combines the correlation information between the prototype and unlabelled features in both the original and learnable feature spaces, aiming to produce a comprehensive cross-correlation between the prototypes and unlabelled features. Using both FA and cross-attention modules, our model can maintain informative class features and capture important shared features for classification. Experimental results on three few-shot classification benchmarks show that the proposed method outperformed related methods and resulted in a 3% performance boost in the 1-shot setting by inserting the proposed module into the related methods.","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 2","pages":"303-318"},"PeriodicalIF":5.1,"publicationDate":"2023-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12273","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138598869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Exploring the spatiotemporal relationship between green infrastructure and urban heat island under multi-source remote sensing imagery: A case study of Fuzhou City 探索多源遥感影像下绿色基础设施与城市热岛的时空关系：福州市案例研究

IF 5.1 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology

Pub Date : 2023-12-04 DOI: 10.1049/cit2.12272

Tingting Hong, Xiaohui Huang, Guangjian Chen, Yiwei Yang, Lijia Chen

Green Infrastructure (GI) has garnered increasing attention from various regions due to its potential to mitigate urban heat island (UHI), which has been exacerbated by global climate change. This study focuses on the central area of Fuzhou city, one of the “furnace” cities, and aims to explore the correlation between the GI pattern and land surface temperature (LST) in the spring and autumn seasons. The research adopts a multiscale approach, starting from the urban scale and using urban geographic spatial characteristics, multispectral remote sensing data, and morphological spatial pattern analysis (MSPA). Significant MSPA elements were tested and combined with LST to conduct a geographic weighted regression (GWR) experiment. The findings reveal that the UHI in the central area of Fuzhou city has a spatial characteristic of “high temperature in the middle and low temperature around,” which is coupled with a “central scattered and peripheral concentrated” distribution of GI. This suggests that remote sensing data can effectively be utilised for UHI inversion. Additionally, the study finds that the complexity of GI, whether from the perspective of the overall GI pattern or the classification study based on the proportion of the core area, has an impact on the alleviation of UHI in both seasons. In conclusion, this study underscores the importance of a reasonable layout of urban green infrastructure for mitigating UHI.

绿色基础设施(GI)因其缓解城市热岛(UHI)的潜力而受到越来越多地区的关注，而城市热岛(UHI)因全球气候变化而加剧。本文以中国“火炉”城市之一的福州市为研究对象，探讨了春秋季GI格局与地表温度的相关性。本研究采用多尺度方法，从城市尺度出发，利用城市地理空间特征、多光谱遥感数据和形态空间格局分析(MSPA)。对显著MSPA要素进行检验，并结合LST进行地理加权回归(GWR)试验。研究结果表明:福州市中心地区的城市热岛指数具有“中高温、周边低温”的空间特征，并伴有“中心分散、外围集中”的地理指数分布。这表明遥感数据可以有效地用于UHI反演。此外，研究发现，无论是从总体地理标志格局来看，还是基于核心区比例的分类研究来看，地理标志的复杂性对两个季节的热岛指数缓解都有影响。总之，本研究强调了合理布局城市绿色基础设施对于缓解城市热岛的重要性。

{"title":"Exploring the spatiotemporal relationship between green infrastructure and urban heat island under multi-source remote sensing imagery: A case study of Fuzhou City","authors":"Tingting Hong, Xiaohui Huang, Guangjian Chen, Yiwei Yang, Lijia Chen","doi":"10.1049/cit2.12272","DOIUrl":"10.1049/cit2.12272","url":null,"abstract":"Green Infrastructure (GI) has garnered increasing attention from various regions due to its potential to mitigate urban heat island (UHI), which has been exacerbated by global climate change. This study focuses on the central area of Fuzhou city, one of the “furnace” cities, and aims to explore the correlation between the GI pattern and land surface temperature (LST) in the spring and autumn seasons. The research adopts a multiscale approach, starting from the urban scale and using urban geographic spatial characteristics, multispectral remote sensing data, and morphological spatial pattern analysis (MSPA). Significant MSPA elements were tested and combined with LST to conduct a geographic weighted regression (GWR) experiment. The findings reveal that the UHI in the central area of Fuzhou city has a spatial characteristic of “high temperature in the middle and low temperature around,” which is coupled with a “central scattered and peripheral concentrated” distribution of GI. This suggests that remote sensing data can effectively be utilised for UHI inversion. Additionally, the study finds that the complexity of GI, whether from the perspective of the overall GI pattern or the classification study based on the proportion of the core area, has an impact on the alleviation of UHI in both seasons. In conclusion, this study underscores the importance of a reasonable layout of urban green infrastructure for mitigating UHI.","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"8 4","pages":"1337-1349"},"PeriodicalIF":5.1,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12272","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138604706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Guest Editorial: Special issue on explainable AI empowered for indoor positioning and indoor navigation 特邀编辑：为室内定位和室内导航赋能的可解释人工智能特刊

IF 5.1 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology

Pub Date : 2023-12-03 DOI: 10.1049/cit2.12274

Ki-Il Kim, Aswani Kumar Cherukuri, Xue Jun Li, Tanveer Ahmad, Muhammad Rafiq, Shehzad Ashraf Chaudhry

The convergence of Internet of Things (IoT), vehicularad hoc network (VANET), and mobile ad hoc network relies on sensor networks to gather data from nodes or objects. These networks involve nodes, gateways, and anchors, operating on limited battery power, mainly used in broadcasting. IoT applications, like healthcare, smart cities, and transportation, often need position data and face challenges in delay sensitivity. Localisation is important in ITS and VANETs, influencing autonomous vehicles, collision warning systems, and road information dissemination. A robust localisation system, often combining GPS with techniques like Dead Reckoning and Image/Video Localisation, is essential for accuracy and security. Artificial intelligence (AI) integration, particularly in machine learning, enhances indoor wireless localisation effectiveness. Advancements in wireless communication (WSN, IoT, and massive MIMO) transform dense environments into programmable entities, but pose challenges in aligning self-learning AI with sensor tech for accuracy and budget considerations. We seek original research on sensor localisation, fusion, protocols, and positioning algorithms, inviting contributions from industry and academia to address these evolving challenges.This special issue titled ‘Sensing, Communication, and Localization in WSN, IoT & VANET’ appears in the CAAI Transactions on Intelligence Technology. We encourage contributions addressing localisation accuracy, network coverage, upper and lower bounding, lane and vehicle detection, and related topics.In the first paper, (Hamil et al.) explore how smartphone sensors and IoT devices aid in rescuing individuals during emergencies like fires in tall buildings. It introduces a pioneering Sensor Management and Data Fusion-Wireless Data Exchange fusion scheme, leveraging an evolutionary algorithm within complex multi-storey buildings. This scheme aims to diversify particle sets effectively, capturing the user's real-time state using wearable device sensors. The authors further explore how smartphones sensors utilise data for object movement alongside Bluetooth Low Energy beacon based localisation with the help of Sensor Management security and Data Fusion-Wireless Data Exchange scheme. The effectiveness of this scheme and its impact on a smartphone user's real-time state within indoor settings were assessed through various experiments in controlled environments.In the second paper, (Khan J et al.) proposed a novel method to fine-tune alpha-beta filter parameters using a feed-forward backpropagation neural network. This model, comprising the alpha-beta filter as the core predictor and a feedforward artificial neural network as the learning element, uses temperature and humidity sensor data for precise predictions from noisy readings. By integrating the feed-forward backpropagation neural network significantly boosts prediction accuracy, slashing both roots mean square error (RMSE) and mea

物联网(IoT)、车载自组织网络(VANET)和移动自组织网络的融合依赖于传感器网络从节点或对象收集数据。这些网络包括节点、网关和锚，在有限的电池电量下运行，主要用于广播。物联网应用，如医疗保健、智慧城市和交通，通常需要位置数据，并面临延迟敏感性的挑战。定位在ITS和VANETs中非常重要，影响着自动驾驶汽车、碰撞预警系统和道路信息发布。一个强大的定位系统，通常结合GPS与诸如航位推算和图像/视频定位等技术，对于准确性和安全性至关重要。人工智能(AI)集成，特别是在机器学习方面，提高了室内无线定位的有效性。无线通信(WSN、物联网和大规模MIMO)的进步将密集环境转变为可编程实体，但在将自学习AI与传感器技术结合起来以实现准确性和预算考虑方面提出了挑战。我们寻求在传感器定位、融合、协议和定位算法方面的原创研究，邀请工业界和学术界的贡献来解决这些不断变化的挑战。本期特刊题为“WSN、IoT中的传感、通信和定位”。VANET出现在CAAI智能技术汇刊上。我们鼓励就定位精度、网络覆盖、上下边界、车道和车辆检测以及相关主题发表意见。在第一篇论文中，(Hamil et al.)探讨了智能手机传感器和物联网设备如何在高层建筑火灾等紧急情况下帮助救援人员。它引入了一种开创性的传感器管理和数据融合-无线数据交换融合方案，利用复杂多层建筑中的进化算法。该方案旨在有效地多样化粒子集，利用可穿戴设备传感器捕捉用户的实时状态。作者进一步探讨了智能手机传感器如何在传感器管理安全和数据融合无线数据交换方案的帮助下，利用基于蓝牙低功耗信标的本地化数据进行物体运动。通过在受控环境中的各种实验，评估了该方案的有效性及其对室内智能手机用户实时状态的影响。在第二篇论文中，(Khan J et al.)提出了一种使用前馈反向传播神经网络微调α - β滤波器参数的新方法。该模型由alpha-beta滤波器作为核心预测器和前馈人工神经网络作为学习元素组成，使用温度和湿度传感器数据从噪声读数中进行精确预测。通过整合前馈反向传播神经网络，显著提高了预测精度，降低了均方根误差(RMSE)和平均绝对误差(MAE)。在与传统方法(如α - β和卡尔曼滤波器)的实验中，所提出的模型表现优于传统方法，MAE提高了35.1%，RMSE提高了38.2%。在第三篇论文中，(Imtiaz等人)提出了一种存在翻转歧义的工业物联网本地化方案。为了减少IIoT网络中的定位误差估计，作者提出了一种新的贪婪锚点选择策略GSAP。本文提出了利用多维尺度进行初始位置估计的总体思路，减少了算法的收敛时间。推导了所提算法的Cramer - Rao下界表达式，以检验其最优性，并将结果与目前的技术水平进行比较。在第四篇论文中(Ismail等人)推导了单个EH中继下的NOMA窄带物联网网络。然而，窄带物联网设备的增长也导致了同信道干扰的增加，从而影响了NOMA的性能增强。在上行EH中继NOMA窄带物联网网络中，作者的目标是优化窄带物联网设备数据速率，同时满足其最低要求。考虑到设备能量、EH中继能量和数据缓存约束，该模型创建了一个复杂的室内定位框架，涉及功率、数据和时隙调度。这个模型提出了一个非凸优化挑战，没有一个直接的分析解决方案。通过仿真验证了该方法的有效性。这些改进使网络的能源效率提高了44.1%，数据速率比例公平提高了11.9%，频谱效率提高了55.4%。我们感谢所有作者的投稿和审稿人的宝贵反馈。我们希望这期特刊能在循环动态神经网络领域为研究界带来新的成果。

{"title":"Guest Editorial: Special issue on explainable AI empowered for indoor positioning and indoor navigation","authors":"Ki-Il Kim, Aswani Kumar Cherukuri, Xue Jun Li, Tanveer Ahmad, Muhammad Rafiq, Shehzad Ashraf Chaudhry","doi":"10.1049/cit2.12274","DOIUrl":"https://doi.org/10.1049/cit2.12274","url":null,"abstract":"The convergence of Internet of Things (IoT), vehicularad hoc network (VANET), and mobile ad hoc network relies on sensor networks to gather data from nodes or objects. These networks involve nodes, gateways, and anchors, operating on limited battery power, mainly used in broadcasting. IoT applications, like healthcare, smart cities, and transportation, often need position data and face challenges in delay sensitivity. Localisation is important in ITS and VANETs, influencing autonomous vehicles, collision warning systems, and road information dissemination. A robust localisation system, often combining GPS with techniques like Dead Reckoning and Image/Video Localisation, is essential for accuracy and security. Artificial intelligence (AI) integration, particularly in machine learning, enhances indoor wireless localisation effectiveness. Advancements in wireless communication (WSN, IoT, and massive MIMO) transform dense environments into programmable entities, but pose challenges in aligning self-learning AI with sensor tech for accuracy and budget considerations. We seek original research on sensor localisation, fusion, protocols, and positioning algorithms, inviting contributions from industry and academia to address these evolving challenges.This special issue titled ‘Sensing, Communication, and Localization in WSN, IoT & VANET’ appears in the CAAI Transactions on Intelligence Technology. We encourage contributions addressing localisation accuracy, network coverage, upper and lower bounding, lane and vehicle detection, and related topics.In the first paper, (Hamil et al.) explore how smartphone sensors and IoT devices aid in rescuing individuals during emergencies like fires in tall buildings. It introduces a pioneering Sensor Management and Data Fusion-Wireless Data Exchange fusion scheme, leveraging an evolutionary algorithm within complex multi-storey buildings. This scheme aims to diversify particle sets effectively, capturing the user's real-time state using wearable device sensors. The authors further explore how smartphones sensors utilise data for object movement alongside Bluetooth Low Energy beacon based localisation with the help of Sensor Management security and Data Fusion-Wireless Data Exchange scheme. The effectiveness of this scheme and its impact on a smartphone user's real-time state within indoor settings were assessed through various experiments in controlled environments.In the second paper, (Khan J et al.) proposed a novel method to fine-tune alpha-beta filter parameters using a feed-forward backpropagation neural network. This model, comprising the alpha-beta filter as the core predictor and a feedforward artificial neural network as the learning element, uses temperature and humidity sensor data for precise predictions from noisy readings. By integrating the feed-forward backpropagation neural network significantly boosts prediction accuracy, slashing both roots mean square error (RMSE) and mea","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"8 4","pages":"1101-1103"},"PeriodicalIF":5.1,"publicationDate":"2023-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12274","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138678834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A verifiable essential secret image sharing scheme based on HLRs (VESIS-(t, s, k, n)) 基于 HLR 的可验证基本秘密图像共享方案 (VESIS-(t, s, k, n))

IF 5.1 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology

Pub Date : 2023-10-20 DOI: 10.1049/cit2.12271

Massoud Hadian Dehkordi, Seyed Taghi Farahi, Samaneh Mashhadi

In traditional secret image sharing schemes, a secret image is shared among shareholders who have the same position. But if the shareholders have two different positions, essential and non-essential, it is necessary to use essential secret image sharing schemes. In this article, a verifiable essential secret image sharing scheme based on HLRs is proposed. Shareholder's share consists of two parts. The first part is produced by the shareholders, which prevents the fraud of dealers. The second part is a shadow image that is produced by using HLRs and the first part of share. The verification of the first part of the shares is done for the first time by using multilinear and bilinear maps. Also, for verifying shadow images, Bloom Filters are used for the first time. The proposed scheme is more efficient than similar schemes, and for the first part of the shares, has formal security.

在传统的秘密图像共享方案中，秘密图像在具有相同职位的股东之间共享。但如果股东有两个不同的职位，即重要职位和非重要职位，则有必要使用重要秘密图像共享方案。本文提出了一种基于 HLR 的可验证基本秘密图像共享方案。股东份额由两部分组成。第一部分由股东制作，可防止经销商欺诈。第二部分是利用 HLR 和第一部分份额生成的阴影图像。第一部分股票的验证首次使用了多线性和双线性映射。此外，为了验证阴影图像，还首次使用了布鲁姆过滤器。所提出的方案比同类方案更有效，而且对于第一部分共享具有正式的安全性。

引用次数: 0

A safe reinforcement learning approach for autonomous navigation of mobile robots in dynamic environments 动态环境下移动机器人自主导航的安全强化学习方法

2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology

Pub Date : 2023-10-09 DOI: 10.1049/cit2.12269

Zhiqian Zhou, Junkai Ren, Zhiwen Zeng, Junhao Xiao, Xinglong Zhang, Xian Guo, Zongtan Zhou, Huimin Lu

Abstract When deploying mobile robots in real‐world scenarios, such as airports, train stations, hospitals, and schools, collisions with pedestrians are intolerable and catastrophic. Motion safety becomes one of the most fundamental requirements for mobile robots. However, until now, efficient and safe robot navigation in such dynamic environments is still an open problem. The critical reason is that the inconsistency between navigation efficiency and motion safety is greatly intensified by the high dynamics and uncertainties of pedestrians. To face the challenge, this paper proposes a safe deep reinforcement learning algorithm named Conflict‐Averse Safe Reinforcement Learning (CASRL) for autonomous robot navigation in dynamic environments. Specifically, it first separates the collision avoidance sub‐task from the overall navigation task and maintains a safety critic to evaluate the safety/risk of actions. Later, it constructs two task‐specific but model‐agnostic policy gradients for goal‐reaching and collision avoidance sub‐tasks to eliminate their mutual interference. Then, it further performs a conflict‐averse gradient manipulation to address the inconsistency between two sub‐tasks. Finally, extensive experiments are performed to evaluate the superiority of CASRL. Simulation results show an average 8.2% performance improvement over the vanilla baseline in eight groups of dynamic environments, which is further extended to 13.4% in the most challenging group. Besides, forty real‐world experiments fully illustrated that the CASRL could be successfully deployed on a real robot.

当在机场、火车站、医院和学校等现实场景中部署移动机器人时，与行人的碰撞是不可容忍的，而且是灾难性的。运动安全成为移动机器人最基本的要求之一。然而，到目前为止，机器人在这种动态环境下的高效安全导航仍然是一个悬而未决的问题。其关键原因在于行人的高动态性和不确定性极大地加剧了导航效率与运动安全之间的不一致性。为了应对这一挑战，本文提出了一种安全的深度强化学习算法，称为冲突厌恶安全强化学习(CASRL)，用于动态环境下的自主机器人导航。具体来说，它首先将避碰子任务从整体导航任务中分离出来，并维护一个安全评论家来评估行动的安全性/风险。然后，构建了两个任务特定但模型不可知的策略梯度，用于目标到达和避免碰撞子任务，以消除它们的相互干扰。然后，它进一步执行冲突规避梯度操作来解决两个子任务之间的不一致性。最后，进行了大量的实验来评价CASRL的优越性。仿真结果表明，在八组动态环境中，平均性能比普通基准提高8.2%，在最具挑战性的一组中进一步扩展到13.4%。此外，40个真实世界的实验充分说明了CASRL可以成功地部署在真实的机器人上。

{"title":"A safe reinforcement learning approach for autonomous navigation of mobile robots in dynamic environments","authors":"Zhiqian Zhou, Junkai Ren, Zhiwen Zeng, Junhao Xiao, Xinglong Zhang, Xian Guo, Zongtan Zhou, Huimin Lu","doi":"10.1049/cit2.12269","DOIUrl":"https://doi.org/10.1049/cit2.12269","url":null,"abstract":"Abstract When deploying mobile robots in real‐world scenarios, such as airports, train stations, hospitals, and schools, collisions with pedestrians are intolerable and catastrophic. Motion safety becomes one of the most fundamental requirements for mobile robots. However, until now, efficient and safe robot navigation in such dynamic environments is still an open problem. The critical reason is that the inconsistency between navigation efficiency and motion safety is greatly intensified by the high dynamics and uncertainties of pedestrians. To face the challenge, this paper proposes a safe deep reinforcement learning algorithm named Conflict‐Averse Safe Reinforcement Learning (CASRL) for autonomous robot navigation in dynamic environments. Specifically, it first separates the collision avoidance sub‐task from the overall navigation task and maintains a safety critic to evaluate the safety/risk of actions. Later, it constructs two task‐specific but model‐agnostic policy gradients for goal‐reaching and collision avoidance sub‐tasks to eliminate their mutual interference. Then, it further performs a conflict‐averse gradient manipulation to address the inconsistency between two sub‐tasks. Finally, extensive experiments are performed to evaluate the superiority of CASRL. Simulation results show an average 8.2% performance improvement over the vanilla baseline in eight groups of dynamic environments, which is further extended to 13.4% in the most challenging group. Besides, forty real‐world experiments fully illustrated that the CASRL could be successfully deployed on a real robot.","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135146697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Transfer learning from T1-weighted to T2-weighted Magnetic resonance sequences for brain image segmentation 从 T1 加权磁共振序列到 T2 加权磁共振序列的转移学习用于脑图像分割

IF 5.1 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology

Pub Date : 2023-10-04 DOI: 10.1049/cit2.12270

Imene Mecheter, Maysam Abbod, Habib Zaidi, Abbes Amira

Magnetic resonance (MR) imaging is a widely employed medical imaging technique that produces detailed anatomical images of the human body. The segmentation of MR images plays a crucial role in medical image analysis, as it enables accurate diagnosis, treatment planning, and monitoring of various diseases and conditions. Due to the lack of sufficient medical images, it is challenging to achieve an accurate segmentation, especially with the application of deep learning networks. The aim of this work is to study transfer learning from T1-weighted (T1-w) to T2-weighted (T2-w) MR sequences to enhance bone segmentation with minimal required computation resources. With the use of an excitation-based convolutional neural networks, four transfer learning mechanisms are proposed: transfer learning without fine tuning, open fine tuning, conservative fine tuning, and hybrid transfer learning. Moreover, a multi-parametric segmentation model is proposed using T2-w MR as an intensity-based augmentation technique. The novelty of this work emerges in the hybrid transfer learning approach that overcomes the overfitting issue and preserves the features of both modalities with minimal computation time and resources. The segmentation results are evaluated using 14 clinical 3D brain MR and CT images. The results reveal that hybrid transfer learning is superior for bone segmentation in terms of performance and computation time with DSCs of 0.5393 ± 0.0007. Although T2-w-based augmentation has no significant impact on the performance of T1-w MR segmentation, it helps in improving T2-w MR segmentation and developing a multi-sequences segmentation model.

磁共振（MR）成像是一种广泛应用的医学成像技术，可生成人体的详细解剖图像。磁共振图像的分割在医学图像分析中起着至关重要的作用，因为它能对各种疾病和病症进行准确诊断、治疗规划和监测。由于缺乏足够的医学图像，要实现准确的分割具有挑战性，尤其是在应用深度学习网络的情况下。这项工作的目的是研究从 T1 加权（T1-w）到 T2 加权（T2-w）磁共振序列的转移学习，以最小的所需计算资源增强骨骼分割。利用基于激励的卷积神经网络，提出了四种转移学习机制：无微调转移学习、开放微调、保守微调和混合转移学习。此外，利用 T2-w MR 作为基于强度的增强技术，提出了一种多参数分割模型。这项工作的新颖之处在于混合转移学习方法，它克服了过拟合问题，并以最少的计算时间和资源保留了两种模式的特征。利用 14 幅临床三维脑部 MR 和 CT 图像对分割结果进行了评估。结果显示，混合迁移学习在性能和计算时间方面都优于骨骼分割，DSCs 为 0.5393 ± 0.0007。虽然基于 T2-w 的增强对 T1-w 磁共振分割的性能没有显著影响，但它有助于改善 T2-w 磁共振分割和开发多序列分割模型。

{"title":"Transfer learning from T1-weighted to T2-weighted Magnetic resonance sequences for brain image segmentation","authors":"Imene Mecheter, Maysam Abbod, Habib Zaidi, Abbes Amira","doi":"10.1049/cit2.12270","DOIUrl":"10.1049/cit2.12270","url":null,"abstract":"Magnetic resonance (MR) imaging is a widely employed medical imaging technique that produces detailed anatomical images of the human body. The segmentation of MR images plays a crucial role in medical image analysis, as it enables accurate diagnosis, treatment planning, and monitoring of various diseases and conditions. Due to the lack of sufficient medical images, it is challenging to achieve an accurate segmentation, especially with the application of deep learning networks. The aim of this work is to study transfer learning from T1-weighted (T1-w) to T2-weighted (T2-w) MR sequences to enhance bone segmentation with minimal required computation resources. With the use of an excitation-based convolutional neural networks, four transfer learning mechanisms are proposed: transfer learning without fine tuning, open fine tuning, conservative fine tuning, and hybrid transfer learning. Moreover, a multi-parametric segmentation model is proposed using T2-w MR as an intensity-based augmentation technique. The novelty of this work emerges in the hybrid transfer learning approach that overcomes the overfitting issue and preserves the features of both modalities with minimal computation time and resources. The segmentation results are evaluated using 14 clinical 3D brain MR and CT images. The results reveal that hybrid transfer learning is superior for bone segmentation in terms of performance and computation time with DSCs of 0.5393 ± 0.0007. Although T2-w-based augmentation has no significant impact on the performance of T1-w MR segmentation, it helps in improving T2-w MR segmentation and developing a multi-sequences segmentation model.","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 1","pages":"26-39"},"PeriodicalIF":5.1,"publicationDate":"2023-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12270","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135645979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0