Offline reinforcement learning (RL) aims to learn policies entirely from passively collected datasets, making it a data-driven decision method. One of the main challenges in offline RL is the distribution shift problem, which causes the algorithm to visit out-of-distribution (OOD) samples. The distribution shift can be mitigated by constraining the divergence between the target policy and the behaviour policy. However, this method can overly constrain the target policy and impair the algorithm's performance, as it does not directly distinguish between in-distribution and OOD samples. In addition, it is difficult to learn and represent multi-modal behaviour policy when the datasets are collected by several different behaviour policies. To overcome these drawbacks, the authors address the distribution shift problem by implicit policy constraints with energy-based models (EBMs) rather than explicitly modelling the behaviour policy. The EBM is powerful for representing complex multi-modal distributions as well as the ability to distinguish in-distribution samples and OODs. Experimental results show that their method significantly outperforms the explicit policy constraint method and other baselines. In addition, the learnt energy model can be used to indicate OOD visits and alert the possible failure.
{"title":"Implicit policy constraint for offline reinforcement learning","authors":"Zhiyong Peng, Yadong Liu, Changlin Han, Zongtan Zhou","doi":"10.1049/cit2.12304","DOIUrl":"10.1049/cit2.12304","url":null,"abstract":"<p>Offline reinforcement learning (RL) aims to learn policies entirely from passively collected datasets, making it a data-driven decision method. One of the main challenges in offline RL is the distribution shift problem, which causes the algorithm to visit out-of-distribution (OOD) samples. The distribution shift can be mitigated by constraining the divergence between the target policy and the behaviour policy. However, this method can overly constrain the target policy and impair the algorithm's performance, as it does not directly distinguish between in-distribution and OOD samples. In addition, it is difficult to learn and represent multi-modal behaviour policy when the datasets are collected by several different behaviour policies. To overcome these drawbacks, the authors address the distribution shift problem by implicit policy constraints with energy-based models (EBMs) rather than explicitly modelling the behaviour policy. The EBM is powerful for representing complex multi-modal distributions as well as the ability to distinguish in-distribution samples and OODs. Experimental results show that their method significantly outperforms the explicit policy constraint method and other baselines. In addition, the learnt energy model can be used to indicate OOD visits and alert the possible failure.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":null,"pages":null},"PeriodicalIF":8.4,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12304","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140237574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hengame Ahmadi Golilarz, Alireza Azadbar, Roohallah Alizadehsani, Juan Manuel Gorriz
Myocarditis is a significant public health concern because of its potential to cause heart failure and sudden death. The standard invasive diagnostic method, endomyocardial biopsy, is typically reserved for cases with severe complications, limiting its widespread use. Conversely, non-invasive cardiac magnetic resonance (CMR) imaging presents a promising alternative for detecting and monitoring myocarditis, because of its high signal contrast that reveals myocardial involvement. To assist medical professionals via artificial intelligence, the authors introduce generative adversarial networks - multi discriminator (GAN-MD), a deep learning model that uses binary classification to diagnose myocarditis from CMR images. Their approach employs a series of convolutional neural networks (CNNs) that extract and combine feature vectors for accurate diagnosis. The authors suggest a novel technique for improving the classification precision of CNNs. Using generative adversarial networks (GANs) to create synthetic images for data augmentation, the authors address challenges such as mode collapse and unstable training. Incorporating a reconstruction loss into the GAN loss function requires the generator to produce images reflecting the discriminator features, thus enhancing the generated images' quality to more accurately replicate authentic data patterns. Moreover, combining this loss function with other regularisation methods, such as gradient penalty, has proven to further improve the performance of diverse GAN models. A significant challenge in myocarditis diagnosis is the imbalance of classification, where one class dominates over the other. To mitigate this, the authors introduce a focal loss-based training method that effectively trains the model on the minority class samples. The GAN-MD approach, evaluated on the Z-Alizadeh Sani myocarditis dataset, achieves superior results (F-measure 86.2%; geometric mean 91.0%) compared with other deep learning models and traditional machine learning methods.
心肌炎可能导致心力衰竭和猝死,因此是一个重大的公共卫生问题。标准的侵入性诊断方法--心内膜心肌活检通常只用于有严重并发症的病例,因此限制了其广泛应用。相反,无创心脏磁共振成像(CMR)因其高信号对比度可显示心肌受累情况,为检测和监测心肌炎提供了一种很有前途的替代方法。为了通过人工智能帮助医疗专业人员,作者引入了生成对抗网络--多判别器(GAN-MD),这是一种深度学习模型,使用二元分类法从 CMR 图像中诊断心肌炎。他们的方法采用了一系列卷积神经网络(CNN),通过提取和组合特征向量来进行准确诊断。作者提出了一种提高 CNN 分类精度的新技术。作者利用生成对抗网络(GANs)创建合成图像用于数据增强,从而解决了模式崩溃和训练不稳定等难题。在 GAN 损失函数中加入重建损失,要求生成器生成反映判别特征的图像,从而提高生成图像的质量,更准确地复制真实数据模式。此外,事实证明,将该损失函数与梯度惩罚等其他正则化方法相结合,可进一步提高各种 GAN 模型的性能。心肌炎诊断中的一个重大挑战是分类的不平衡,即一个类别主导另一个类别。为了缓解这一问题,作者引入了一种基于焦点损失的训练方法,该方法能有效地在少数类别样本上训练模型。GAN-MD 方法在 Z-Alizadeh Sani 心肌炎数据集上进行了评估,与其他深度学习模型和传统机器学习方法相比,取得了优异的成绩(F-measure 86.2%;geometric mean 91.0%)。
{"title":"GAN-MD: A myocarditis detection using multi-channel convolutional neural networks and generative adversarial network-based data augmentation","authors":"Hengame Ahmadi Golilarz, Alireza Azadbar, Roohallah Alizadehsani, Juan Manuel Gorriz","doi":"10.1049/cit2.12307","DOIUrl":"10.1049/cit2.12307","url":null,"abstract":"<p>Myocarditis is a significant public health concern because of its potential to cause heart failure and sudden death. The standard invasive diagnostic method, endomyocardial biopsy, is typically reserved for cases with severe complications, limiting its widespread use. Conversely, non-invasive cardiac magnetic resonance (CMR) imaging presents a promising alternative for detecting and monitoring myocarditis, because of its high signal contrast that reveals myocardial involvement. To assist medical professionals via artificial intelligence, the authors introduce generative adversarial networks - multi discriminator (GAN-MD), a deep learning model that uses binary classification to diagnose myocarditis from CMR images. Their approach employs a series of convolutional neural networks (CNNs) that extract and combine feature vectors for accurate diagnosis. The authors suggest a novel technique for improving the classification precision of CNNs. Using generative adversarial networks (GANs) to create synthetic images for data augmentation, the authors address challenges such as mode collapse and unstable training. Incorporating a reconstruction loss into the GAN loss function requires the generator to produce images reflecting the discriminator features, thus enhancing the generated images' quality to more accurately replicate authentic data patterns. Moreover, combining this loss function with other regularisation methods, such as gradient penalty, has proven to further improve the performance of diverse GAN models. A significant challenge in myocarditis diagnosis is the imbalance of classification, where one class dominates over the other. To mitigate this, the authors introduce a focal loss-based training method that effectively trains the model on the minority class samples. The GAN-MD approach, evaluated on the Z-Alizadeh Sani myocarditis dataset, achieves superior results (F-measure 86.2%; geometric mean 91.0%) compared with other deep learning models and traditional machine learning methods.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":null,"pages":null},"PeriodicalIF":8.4,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12307","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140245001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tajinder Kumar, Purushottam Sharma, Jaswinder Tanwar, Hisham Alsghier, Shashi Bhushan, Hesham Alhumyani, Vivek Sharma, Ahmed I. Alutaibi
Cloud computing has drastically changed the delivery and consumption of live streaming content. The designs, challenges, and possible uses of cloud computing for live streaming are studied. A comprehensive overview of the technical and business issues surrounding cloud-based live streaming is provided, including the benefits of cloud computing, the various live streaming architectures, and the challenges that live streaming service providers face in delivering high-quality, real-time services. The different techniques used to improve the performance of video streaming, such as adaptive bit-rate streaming, multicast distribution, and edge computing are discussed and the necessity of low-latency and high-quality video transmission in cloud-based live streaming is underlined. Issues such as improving user experience and live streaming service performance using cutting-edge technology, like artificial intelligence and machine learning are discussed. In addition, the legal and regulatory implications of cloud-based live streaming, including issues with network neutrality, data privacy, and content moderation are addressed. The future of cloud computing for live streaming is examined in the section that follows, and it looks at the most likely new developments in terms of trends and technology. For technology vendors, live streaming service providers, and regulators, the findings have major policy-relevant implications. Suggestions on how stakeholders should address these concerns and take advantage of the potential presented by this rapidly evolving sector, as well as insights into the key challenges and opportunities associated with cloud-based live streaming are provided.
{"title":"Cloud-based video streaming services: Trends, challenges, and opportunities","authors":"Tajinder Kumar, Purushottam Sharma, Jaswinder Tanwar, Hisham Alsghier, Shashi Bhushan, Hesham Alhumyani, Vivek Sharma, Ahmed I. Alutaibi","doi":"10.1049/cit2.12299","DOIUrl":"10.1049/cit2.12299","url":null,"abstract":"<p>Cloud computing has drastically changed the delivery and consumption of live streaming content. The designs, challenges, and possible uses of cloud computing for live streaming are studied. A comprehensive overview of the technical and business issues surrounding cloud-based live streaming is provided, including the benefits of cloud computing, the various live streaming architectures, and the challenges that live streaming service providers face in delivering high-quality, real-time services. The different techniques used to improve the performance of video streaming, such as adaptive bit-rate streaming, multicast distribution, and edge computing are discussed and the necessity of low-latency and high-quality video transmission in cloud-based live streaming is underlined. Issues such as improving user experience and live streaming service performance using cutting-edge technology, like artificial intelligence and machine learning are discussed. In addition, the legal and regulatory implications of cloud-based live streaming, including issues with network neutrality, data privacy, and content moderation are addressed. The future of cloud computing for live streaming is examined in the section that follows, and it looks at the most likely new developments in terms of trends and technology. For technology vendors, live streaming service providers, and regulators, the findings have major policy-relevant implications. Suggestions on how stakeholders should address these concerns and take advantage of the potential presented by this rapidly evolving sector, as well as insights into the key challenges and opportunities associated with cloud-based live streaming are provided.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":null,"pages":null},"PeriodicalIF":5.1,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12299","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140242982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yang Fang, Bailian Xie, Uswah Khairuddin, Zijian Min, Bingbing Jiang, Weisheng Li
Transformer tracking always takes paired template and search images as encoder input and conduct feature extraction and target-search feature correlation by self and/or cross attention operations, thus the model complexity will grow quadratically with the number of input images. To alleviate the burden of this tracking paradigm and facilitate practical deployment of Transformer-based trackers, we propose a dual pooling transformer tracking framework, dubbed as DPT, which consists of three components: a simple yet efficient spatiotemporal attention model (SAM), a mutual correlation pooling Transformer (MCPT) and a multiscale aggregation pooling Transformer (MAPT). SAM is designed to gracefully aggregates temporal dynamics and spatial appearance information of multi-frame templates along space-time dimensions. MCPT aims to capture multi-scale pooled and correlated contextual features, which is followed by MAPT that aggregates multi-scale features into a unified feature representation for tracking prediction. DPT tracker achieves AUC score of 69.5 on LaSOT and precision score of 82.8 on TrackingNet while maintaining a shorter sequence length of attention tokens, fewer parameters and FLOPs compared to existing state-of-the-art (SOTA) Transformer tracking methods. Extensive experiments demonstrate that DPT tracker yields a strong real-time tracking baseline with a good trade-off between tracking performance and inference efficiency.
{"title":"DPT-tracker: Dual pooling transformer for efficient visual tracking","authors":"Yang Fang, Bailian Xie, Uswah Khairuddin, Zijian Min, Bingbing Jiang, Weisheng Li","doi":"10.1049/cit2.12296","DOIUrl":"10.1049/cit2.12296","url":null,"abstract":"<p>Transformer tracking always takes paired template and search images as encoder input and conduct feature extraction and target-search feature correlation by self and/or cross attention operations, thus the model complexity will grow quadratically with the number of input images. To alleviate the burden of this tracking paradigm and facilitate practical deployment of Transformer-based trackers, we propose a dual pooling transformer tracking framework, dubbed as DPT, which consists of three components: a simple yet efficient spatiotemporal attention model (SAM), a mutual correlation pooling Transformer (MCPT) and a multiscale aggregation pooling Transformer (MAPT). SAM is designed to gracefully aggregates temporal dynamics and spatial appearance information of multi-frame templates along space-time dimensions. MCPT aims to capture multi-scale pooled and correlated contextual features, which is followed by MAPT that aggregates multi-scale features into a unified feature representation for tracking prediction. DPT tracker achieves AUC score of 69.5 on LaSOT and precision score of 82.8 on TrackingNet while maintaining a shorter sequence length of attention tokens, fewer parameters and FLOPs compared to existing state-of-the-art (SOTA) Transformer tracking methods. Extensive experiments demonstrate that DPT tracker yields a strong real-time tracking baseline with a good trade-off between tracking performance and inference efficiency.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":null,"pages":null},"PeriodicalIF":8.4,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12296","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140244948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Li Ying, Duoqian Miao, Zhifei Zhang, Hongyun Zhang, Witold Pedrycz
Due to the characteristics of high resolution and rich texture information, visible light images are widely used for maritime ship detection. However, these images are susceptible to sea fog and ships of different sizes, which can result in missed detections and false alarms, ultimately resulting in lower detection accuracy. To address these issues, a novel multi-granularity feature enhancement network, MFENet, which includes a three-way dehazing module (3WDM) and a multi-granularity feature enhancement module (MFEM) is proposed. The 3WDM eliminates sea fog interference by using an image clarity automatic classification algorithm based on three-way decisions and FFA-Net to obtain clear image samples. Additionally, the MFEM improves the accuracy of detecting ships of different sizes by utilising an improved super-resolution reconstruction convolutional neural network to enhance the resolution and semantic representation capability of the feature maps from YOLOv7. Experimental results demonstrate that MFENet surpasses the other 15 competing models in terms of the mean Average Precision metric on two benchmark datasets, achieving 96.28% on the McShips dataset and 97.71% on the SeaShips dataset.
{"title":"Multi-granularity feature enhancement network for maritime ship detection","authors":"Li Ying, Duoqian Miao, Zhifei Zhang, Hongyun Zhang, Witold Pedrycz","doi":"10.1049/cit2.12310","DOIUrl":"10.1049/cit2.12310","url":null,"abstract":"<p>Due to the characteristics of high resolution and rich texture information, visible light images are widely used for maritime ship detection. However, these images are susceptible to sea fog and ships of different sizes, which can result in missed detections and false alarms, ultimately resulting in lower detection accuracy. To address these issues, a novel multi-granularity feature enhancement network, MFENet, which includes a three-way dehazing module (3WDM) and a multi-granularity feature enhancement module (MFEM) is proposed. The 3WDM eliminates sea fog interference by using an image clarity automatic classification algorithm based on three-way decisions and FFA-Net to obtain clear image samples. Additionally, the MFEM improves the accuracy of detecting ships of different sizes by utilising an improved super-resolution reconstruction convolutional neural network to enhance the resolution and semantic representation capability of the feature maps from YOLOv7. Experimental results demonstrate that MFENet surpasses the other 15 competing models in terms of the mean Average Precision metric on two benchmark datasets, achieving 96.28% on the McShips dataset and 97.71% on the SeaShips dataset.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":null,"pages":null},"PeriodicalIF":5.1,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12310","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140249217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xiaoying Wang, Chunmei Li, Yilei Wang, Lin Yin, Qilin Zhou, Rui Zheng, Qingwu Wu, Yuqi Zhou, Min Dai
The epidemic characters of Omicron (e.g. large-scale transmission) are significantly different from the initial variants of COVID-19. The data generated by large-scale transmission is important to predict the trend of epidemic characters. However, the results of current prediction models are inaccurate since they are not closely combined with the actual situation of Omicron transmission. In consequence, these inaccurate results have negative impacts on the process of the manufacturing and the service industry, for example, the production of masks and the recovery of the tourism industry. The authors have studied the epidemic characters in two ways, that is, investigation and prediction. First, a large amount of data is collected by utilising the Baidu index and conduct questionnaire survey concerning epidemic characters. Second, the β-SEIDR model is established, where the population is classified as Susceptible, Exposed, Infected, Dead and β-Recovered persons, to intelligently predict the epidemic characters of COVID-19. Note that β-Recovered persons denote that the Recovered persons may become Susceptible persons with probability β. The simulation results show that the model can accurately predict the epidemic characters.
{"title":"An intelligent prediction model of epidemic characters based on multi-feature","authors":"Xiaoying Wang, Chunmei Li, Yilei Wang, Lin Yin, Qilin Zhou, Rui Zheng, Qingwu Wu, Yuqi Zhou, Min Dai","doi":"10.1049/cit2.12294","DOIUrl":"10.1049/cit2.12294","url":null,"abstract":"<p>The epidemic characters of Omicron (<i>e</i>.<i>g</i>. large-scale transmission) are significantly different from the initial variants of COVID-19. The data generated by large-scale transmission is important to predict the trend of epidemic characters. However, the results of current prediction models are inaccurate since they are not closely combined with the actual situation of Omicron transmission. In consequence, these inaccurate results have negative impacts on the process of the manufacturing and the service industry, for example, the production of masks and the recovery of the tourism industry. The authors have studied the epidemic characters in two ways, that is, investigation and prediction. First, a large amount of data is collected by utilising the Baidu index and conduct questionnaire survey concerning epidemic characters. Second, the <i>β</i>-SEIDR model is established, where the population is classified as Susceptible, Exposed, Infected, Dead and <i>β</i>-Recovered persons, to intelligently predict the epidemic characters of COVID-19. Note that <i>β</i>-Recovered persons denote that the Recovered persons may become Susceptible persons with probability <i>β</i>. The simulation results show that the model can accurately predict the epidemic characters.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":null,"pages":null},"PeriodicalIF":5.1,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12294","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140263286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jing Yang, Touseef Sadiq, Jiale Xiong, Muhammad Awais, Uzair Aslam Bhatti, R. Alizadehsani, J. M. Górriz
Myocarditis is a serious cardiovascular ailment that can lead to severe consequences if not promptly treated. It is triggered by viral infections and presents symptoms such as chest pain and heart dysfunction. Early detection is crucial for successful treatment, and cardiac magnetic resonance imaging (CMR) is a valuable tool for identifying this condition. However, the detection of myocarditis using CMR images can be challenging due to low contrast, variable noise, and the presence of multiple high CMR slices per patient. To overcome these challenges, the approach proposed incorporates advanced techniques such as convolutional neural networks (CNNs), an improved differential evolution (DE) algorithm for pre‐training, and a reinforcement learning (RL)‐based model for training. Developing this method presented a significant challenge due to the imbalanced classification of the Z‐Alizadeh Sani myocarditis dataset from Omid Hospital in Tehran. To address this, the training process is framed as a sequential decision‐making process, where the agent receives higher rewards/penalties for correctly/incorrectly classifying the minority/majority class. Additionally, the authors suggest an enhanced DE algorithm to initiate the backpropagation (BP) process, overcoming the initialisation sensitivity issue of gradient‐based methods like back‐propagation during the training phase. The effectiveness of the proposed model in diagnosing myocarditis is demonstrated through experimental results based on standard performance metrics. Overall, this method shows promise in expediting the triage of CMR images for automatic screening, facilitating early detection and successful treatment of myocarditis.
心肌炎是一种严重的心血管疾病,如不及时治疗,可导致严重后果。它由病毒感染引发,表现出胸痛和心脏功能障碍等症状。早期发现是成功治疗的关键,而心脏磁共振成像(CMR)是识别这种疾病的重要工具。然而,由于对比度低、噪音多变以及每个患者存在多个高CMR切片,使用CMR图像检测心肌炎具有挑战性。为了克服这些挑战,我们提出的方法采用了卷积神经网络(CNN)、用于预训练的改进型差分进化(DE)算法和基于强化学习(RL)的训练模型等先进技术。由于德黑兰奥米德医院的 Z-Alizadeh Sani 心肌炎数据集的分类不平衡,开发这种方法面临着巨大的挑战。为了解决这个问题,作者将训练过程设计成一个连续的决策过程,在这个过程中,代理在正确/不正确地对少数/多数类别进行分类时会得到更高的奖励/惩罚。此外,作者还提出了一种增强的 DE 算法来启动反向传播(BP)过程,从而克服了基于梯度的方法(如反向传播)在训练阶段的初始化敏感性问题。基于标准性能指标的实验结果证明了所提出的模型在诊断心肌炎方面的有效性。总之,这种方法有望加快用于自动筛查的 CMR 图像的分流,促进心肌炎的早期检测和成功治疗。
{"title":"A novel myocarditis detection combining deep reinforcement learning and an improved differential evolution algorithm","authors":"Jing Yang, Touseef Sadiq, Jiale Xiong, Muhammad Awais, Uzair Aslam Bhatti, R. Alizadehsani, J. M. Górriz","doi":"10.1049/cit2.12289","DOIUrl":"https://doi.org/10.1049/cit2.12289","url":null,"abstract":"Myocarditis is a serious cardiovascular ailment that can lead to severe consequences if not promptly treated. It is triggered by viral infections and presents symptoms such as chest pain and heart dysfunction. Early detection is crucial for successful treatment, and cardiac magnetic resonance imaging (CMR) is a valuable tool for identifying this condition. However, the detection of myocarditis using CMR images can be challenging due to low contrast, variable noise, and the presence of multiple high CMR slices per patient. To overcome these challenges, the approach proposed incorporates advanced techniques such as convolutional neural networks (CNNs), an improved differential evolution (DE) algorithm for pre‐training, and a reinforcement learning (RL)‐based model for training. Developing this method presented a significant challenge due to the imbalanced classification of the Z‐Alizadeh Sani myocarditis dataset from Omid Hospital in Tehran. To address this, the training process is framed as a sequential decision‐making process, where the agent receives higher rewards/penalties for correctly/incorrectly classifying the minority/majority class. Additionally, the authors suggest an enhanced DE algorithm to initiate the backpropagation (BP) process, overcoming the initialisation sensitivity issue of gradient‐based methods like back‐propagation during the training phase. The effectiveness of the proposed model in diagnosing myocarditis is demonstrated through experimental results based on standard performance metrics. Overall, this method shows promise in expediting the triage of CMR images for automatic screening, facilitating early detection and successful treatment of myocarditis.","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":null,"pages":null},"PeriodicalIF":5.1,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139963010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mujeeb Ur Rehman, Arslan Shafique, Muhammad Shahbaz Khan, Maha Driss, Wadii Boulila, Yazeed Yasin Ghadi, Suresh Babu Changalasetty, Majed Alhaisoni, Jawad Ahmad
The Internet of Multimedia Things (IoMT) refers to a network of interconnected multimedia devices that communicate with each other over the Internet. Recently, smart healthcare has emerged as a significant application of the IoMT, particularly in the context of knowledge-based learning systems. Smart healthcare systems leverage knowledge-based learning to become more context-aware, adaptable, and auditable while maintaining the ability to learn from historical data. In smart healthcare systems, devices capture images, such as X-rays, Magnetic Resonance Imaging. The security and integrity of these images are crucial for the databases used in knowledge-based learning systems to foster structured decision-making and enhance the learning abilities of AI. Moreover, in knowledge-driven systems, the storage and transmission of HD medical images exert a burden on the limited bandwidth of the communication channel, leading to data transmission delays. To address the security and latency concerns, this paper presents a lightweight medical image encryption scheme utilising bit-plane decomposition and chaos theory. The results of the experiment yield entropy, energy, and correlation values of 7.999, 0.0156, and 0.0001, respectively. This validates the effectiveness of the encryption system proposed in this paper, which offers high-quality encryption, a large key space, key sensitivity, and resistance to statistical attacks.
多媒体物联网(IoMT)是指通过互联网相互连接的多媒体设备网络。最近,智能医疗已成为 IoMT 的一项重要应用,尤其是在基于知识的学习系统方面。智能医疗系统利用基于知识的学习来提高对环境的感知能力、适应能力和审计能力,同时保持从历史数据中学习的能力。在智能医疗系统中,设备会捕捉图像,如 X 光、磁共振成像。这些图像的安全性和完整性对基于知识的学习系统中使用的数据库至关重要,可促进结构化决策并增强人工智能的学习能力。此外,在知识驱动型系统中,高清医学图像的存储和传输对有限的通信信道带宽造成了负担,导致数据传输延迟。为了解决安全性和延迟问题,本文提出了一种利用位平面分解和混沌理论的轻量级医学图像加密方案。实验结果表明,该方案的熵值为 7.999,能量为 0.0156,相关性为 0.0001。这验证了本文提出的加密系统的有效性,它具有高质量加密、大密钥空间、密钥灵敏度和抗统计攻击等特点。
{"title":"A novel medical image data protection scheme for smart healthcare system","authors":"Mujeeb Ur Rehman, Arslan Shafique, Muhammad Shahbaz Khan, Maha Driss, Wadii Boulila, Yazeed Yasin Ghadi, Suresh Babu Changalasetty, Majed Alhaisoni, Jawad Ahmad","doi":"10.1049/cit2.12292","DOIUrl":"10.1049/cit2.12292","url":null,"abstract":"<p>The Internet of Multimedia Things (IoMT) refers to a network of interconnected multimedia devices that communicate with each other over the Internet. Recently, smart healthcare has emerged as a significant application of the IoMT, particularly in the context of knowledge-based learning systems. Smart healthcare systems leverage knowledge-based learning to become more context-aware, adaptable, and auditable while maintaining the ability to learn from historical data. In smart healthcare systems, devices capture images, such as X-rays, Magnetic Resonance Imaging. The security and integrity of these images are crucial for the databases used in knowledge-based learning systems to foster structured decision-making and enhance the learning abilities of AI. Moreover, in knowledge-driven systems, the storage and transmission of HD medical images exert a burden on the limited bandwidth of the communication channel, leading to data transmission delays. To address the security and latency concerns, this paper presents a lightweight medical image encryption scheme utilising bit-plane decomposition and chaos theory. The results of the experiment yield entropy, energy, and correlation values of 7.999, 0.0156, and 0.0001, respectively. This validates the effectiveness of the encryption system proposed in this paper, which offers high-quality encryption, a large key space, key sensitivity, and resistance to statistical attacks.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":null,"pages":null},"PeriodicalIF":8.4,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12292","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139841677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mujeeb Ur Rehman, Arslan Shafique, Muhammad Shahbaz Khan, Maha Driss, W. Boulila, Y. Ghadi, Suresh Babu Changalasetty, Majed Alhaisoni, Jawad Ahmad
The Internet of Multimedia Things (IoMT) refers to a network of interconnected multimedia devices that communicate with each other over the Internet. Recently, smart healthcare has emerged as a significant application of the IoMT, particularly in the context of knowledge‐based learning systems. Smart healthcare systems leverage knowledge‐based learning to become more context‐aware, adaptable, and auditable while maintaining the ability to learn from historical data. In smart healthcare systems, devices capture images, such as X‐rays, Magnetic Resonance Imaging. The security and integrity of these images are crucial for the databases used in knowledge‐based learning systems to foster structured decision‐making and enhance the learning abilities of AI. Moreover, in knowledge‐driven systems, the storage and transmission of HD medical images exert a burden on the limited bandwidth of the communication channel, leading to data transmission delays. To address the security and latency concerns, this paper presents a lightweight medical image encryption scheme utilising bit‐plane decomposition and chaos theory. The results of the experiment yield entropy, energy, and correlation values of 7.999, 0.0156, and 0.0001, respectively. This validates the effectiveness of the encryption system proposed in this paper, which offers high‐quality encryption, a large key space, key sensitivity, and resistance to statistical attacks.
多媒体物联网(IoMT)是指通过互联网相互连接的多媒体设备网络。最近,智能医疗已成为 IoMT 的一项重要应用,尤其是在基于知识的学习系统方面。智能医疗系统利用基于知识的学习来提高对环境的感知能力、适应能力和审计能力,同时保持从历史数据中学习的能力。在智能医疗系统中,设备会捕捉图像,如 X 光、磁共振成像。这些图像的安全性和完整性对基于知识的学习系统中使用的数据库至关重要,可促进结构化决策并增强人工智能的学习能力。此外,在知识驱动型系统中,高清医学图像的存储和传输对有限的通信信道带宽造成了负担,导致数据传输延迟。为了解决安全性和延迟问题,本文提出了一种利用位平面分解和混沌理论的轻量级医学图像加密方案。实验结果表明,该方案的熵值为 7.999,能量为 0.0156,相关性为 0.0001。这验证了本文提出的加密系统的有效性,它具有高质量加密、大密钥空间、密钥灵敏度和抗统计攻击等特点。
{"title":"A novel medical image data protection scheme for smart healthcare system","authors":"Mujeeb Ur Rehman, Arslan Shafique, Muhammad Shahbaz Khan, Maha Driss, W. Boulila, Y. Ghadi, Suresh Babu Changalasetty, Majed Alhaisoni, Jawad Ahmad","doi":"10.1049/cit2.12292","DOIUrl":"https://doi.org/10.1049/cit2.12292","url":null,"abstract":"The Internet of Multimedia Things (IoMT) refers to a network of interconnected multimedia devices that communicate with each other over the Internet. Recently, smart healthcare has emerged as a significant application of the IoMT, particularly in the context of knowledge‐based learning systems. Smart healthcare systems leverage knowledge‐based learning to become more context‐aware, adaptable, and auditable while maintaining the ability to learn from historical data. In smart healthcare systems, devices capture images, such as X‐rays, Magnetic Resonance Imaging. The security and integrity of these images are crucial for the databases used in knowledge‐based learning systems to foster structured decision‐making and enhance the learning abilities of AI. Moreover, in knowledge‐driven systems, the storage and transmission of HD medical images exert a burden on the limited bandwidth of the communication channel, leading to data transmission delays. To address the security and latency concerns, this paper presents a lightweight medical image encryption scheme utilising bit‐plane decomposition and chaos theory. The results of the experiment yield entropy, energy, and correlation values of 7.999, 0.0156, and 0.0001, respectively. This validates the effectiveness of the encryption system proposed in this paper, which offers high‐quality encryption, a large key space, key sensitivity, and resistance to statistical attacks.","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":null,"pages":null},"PeriodicalIF":5.1,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139781485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hasnain Ali Shah, Faisal Saeed, Muhammad Diyan, N. Almujally, Jae-Mo Kang
Abnormalities in the heart's rhythm, known as arrhythmias, pose a significant threat to global health, often leading to severe cardiac conditions and sudden cardiac deaths. Therefore, early and accurate detection of arrhythmias is crucial for timely intervention and potentially life‐saving treatment. Artificial Intelligence, particularly deep learning, has revolutionised the detection and diagnosis of various health conditions, including arrhythmias. A unique hybrid architecture, ECG‐TransCovNet, that combines Convolutional Neural Networks and Transformer models for enhanced arrhythmia detection in Electrocardiogram signals is introduced. The authors’ approach leverages the superior temporal pattern recognition capabilities of Transformers and the spatial feature extraction strengths of convolutional neural networks, providing a robust and accurate solution for arrhythmia detection. The performance and generalisability of the authors’ proposed model are validated through tests on the MIT‐BIH arrhythmia and PhysioNet databases. The authors conducted experimental trials using these two benchmark datasets. The authors’ results demonstrate that the proposed ECG‐TransCovNet model achieves state‐of‐the‐art (SOTA) performance in terms of detection accuracy, reaching 98.6%. Additionally, the authors conducted several experiments and compared the results to the most recent techniques utilising their assessment measures. The experimental results demonstrate that the authors’ model can generally produce better results.
{"title":"ECG‐TransCovNet: A hybrid transformer model for accurate arrhythmia detection using Electrocardiogram signals","authors":"Hasnain Ali Shah, Faisal Saeed, Muhammad Diyan, N. Almujally, Jae-Mo Kang","doi":"10.1049/cit2.12293","DOIUrl":"https://doi.org/10.1049/cit2.12293","url":null,"abstract":"Abnormalities in the heart's rhythm, known as arrhythmias, pose a significant threat to global health, often leading to severe cardiac conditions and sudden cardiac deaths. Therefore, early and accurate detection of arrhythmias is crucial for timely intervention and potentially life‐saving treatment. Artificial Intelligence, particularly deep learning, has revolutionised the detection and diagnosis of various health conditions, including arrhythmias. A unique hybrid architecture, ECG‐TransCovNet, that combines Convolutional Neural Networks and Transformer models for enhanced arrhythmia detection in Electrocardiogram signals is introduced. The authors’ approach leverages the superior temporal pattern recognition capabilities of Transformers and the spatial feature extraction strengths of convolutional neural networks, providing a robust and accurate solution for arrhythmia detection. The performance and generalisability of the authors’ proposed model are validated through tests on the MIT‐BIH arrhythmia and PhysioNet databases. The authors conducted experimental trials using these two benchmark datasets. The authors’ results demonstrate that the proposed ECG‐TransCovNet model achieves state‐of‐the‐art (SOTA) performance in terms of detection accuracy, reaching 98.6%. Additionally, the authors conducted several experiments and compared the results to the most recent techniques utilising their assessment measures. The experimental results demonstrate that the authors’ model can generally produce better results.","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":null,"pages":null},"PeriodicalIF":5.1,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139845056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}