首页 > 最新文献

IEEE transactions on artificial intelligence最新文献

英文 中文
Detection of Unknown-Unknowns in Human-in-Loop Human-in-Plant Safety Critical Systems 人在环人在厂安全关键系统中未知因素的检测
Pub Date : 2025-03-17 DOI: 10.1109/TAI.2025.3550913
Aranyak Maity;Ayan Banerjee;Sandeep K. S. Gupta
Errors in artificial intelligence (AI)-enabled autonomous systems (AASs) where both the cause and effect are unknown to the human operator at the time they occur are referred to as “unknown-unknown” errors. This article introduces a methodology for preemptively identifying “unknown-unknown” errors in AAS that arise due to unpredictable human interactions and complex real-world usage scenarios, potentially leading to critical safety incidents through unsafe shifts in operational data distributions. We posit that AAS functioning in human-in-the-loop and human-in-the-plant modes must adhere to established physical laws, even when unknown-unknown errors occur. Our approach employs constructing physics-guided models from operational data, coupled with conformal inference for assessing structural breaks in the underlying model caused by violations of physical laws, thereby facilitating early detection of such errors before unsafe shifts in operational data distribution occur. Validation across diverse contexts—zero-day vulnerabilities in autonomous vehicles, hardware failures in artificial pancreas systems, and design deficiencies in aircraft in maneuvering characteristics augmentation systems (MCASs)—demonstrates our framework's efficacy in preempting unsafe data distribution shifts due to unknown-unknowns. This methodology not only advances unknown-unknown error detection in AAS but also sets a new benchmark for integrating physics-guided models and machine learning to ensure system safety.
在人工智能(AI)支持的自主系统(AASs)中,如果发生的原因和结果对人类操作员来说都是未知的,则这些错误被称为“未知”错误。本文介绍了一种方法,用于先发制人地识别由于不可预测的人类交互和复杂的实际使用场景而产生的AAS中的“未知-未知”错误,这些错误可能会通过操作数据分布中的不安全转移导致严重的安全事件。我们假设在人在环和人在厂模式下运行的AAS必须遵守既定的物理定律,即使发生未知的错误。我们的方法采用从运行数据构建物理指导模型,并结合保形推理来评估因违反物理定律而导致的底层模型中的结构性断裂,从而促进在运行数据分布发生不安全变化之前早期发现此类错误。在不同的环境下进行验证——自动驾驶汽车的零日漏洞、人工胰腺系统的硬件故障,以及飞机在机动特性增强系统(MCASs)中的设计缺陷——证明了我们的框架在预防未知因素导致的不安全数据分布转移方面的有效性。该方法不仅推进了AAS中的未知错误检测,而且为整合物理指导模型和机器学习以确保系统安全设定了新的基准。
{"title":"Detection of Unknown-Unknowns in Human-in-Loop Human-in-Plant Safety Critical Systems","authors":"Aranyak Maity;Ayan Banerjee;Sandeep K. S. Gupta","doi":"10.1109/TAI.2025.3550913","DOIUrl":"https://doi.org/10.1109/TAI.2025.3550913","url":null,"abstract":"Errors in artificial intelligence (AI)-enabled autonomous systems (AASs) where both the cause and effect are unknown to the human operator at the time they occur are referred to as “unknown-unknown” errors. This article introduces a methodology for preemptively identifying “unknown-unknown” errors in AAS that arise due to unpredictable human interactions and complex real-world usage scenarios, potentially leading to critical safety incidents through unsafe shifts in operational data distributions. We posit that AAS functioning in human-in-the-loop and human-in-the-plant modes must adhere to established physical laws, even when unknown-unknown errors occur. Our approach employs constructing physics-guided models from operational data, coupled with conformal inference for assessing structural breaks in the underlying model caused by violations of physical laws, thereby facilitating early detection of such errors before unsafe shifts in operational data distribution occur. Validation across diverse contexts—zero-day vulnerabilities in autonomous vehicles, hardware failures in artificial pancreas systems, and design deficiencies in aircraft in maneuvering characteristics augmentation systems (MCASs)—demonstrates our framework's efficacy in preempting unsafe data distribution shifts due to unknown-unknowns. This methodology not only advances unknown-unknown error detection in AAS but also sets a new benchmark for integrating physics-guided models and machine learning to ensure system safety.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 9","pages":"2526-2541"},"PeriodicalIF":0.0,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144926900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep3BPP: Identification of Blood–Brain Barrier Penetrating Peptides Using Word Embedding Feature Extraction Method and CNN-LSTM Deep3BPP:基于词嵌入特征提取和CNN-LSTM的血脑屏障穿透肽识别
Pub Date : 2025-03-15 DOI: 10.1109/TAI.2025.3567434
Md. Ashikur Rahman;Md. Mamun Ali;Kawsar Ahmed;Imran Mahmud;Francis M. Bui;Li Chen;Mohammad Ali Moni
To prevent different chemicals from entering the brain, the blood–brain barrier penetrating peptide (3BPP) acts as a vital barrier between the bloodstream and the central nervous system (CNS). This barrier significantly hinders the treatment of neurological and CNS disorders. 3BPP can get beyond this barrier, making it easier to enter the brain and essential for treating CNS and neurological diseases and disorders. Computational techniques are being explored because traditional laboratory tests for 3BPP identification are costly and time-consuming. In this work, we introduced a novel technique for 3BPP prediction with a hybrid deep learning model. Our proposed model, Deep3BPP, leverages the LSA, a word embedding method for peptide sequence extraction, and integrates CNN with LSTM (CNN-LSTM) for the final prediction model. Deep3BPP performance metrics show a remarkable accuracy of 97.42%, a Kappa value of 0.9257, and an MCC of 0.9362. These findings indicate a more efficient and cost-effective method of identifying 3BPP, which has important implications for researchers in the pharmaceutical and medical industries. Thus, this work offers insightful information that can advance both scientific research and the well-being of people overall.
为了防止不同的化学物质进入大脑,血脑屏障穿透肽(3BPP)作为血液和中枢神经系统(CNS)之间的重要屏障。这一屏障严重阻碍了神经和中枢神经系统疾病的治疗。3BPP可以越过这一屏障,使其更容易进入大脑,对治疗中枢神经系统和神经系统疾病和紊乱至关重要。正在探索计算技术,因为用于3BPP识别的传统实验室测试既昂贵又耗时。在这项工作中,我们引入了一种使用混合深度学习模型进行3BPP预测的新技术。我们提出的模型Deep3BPP利用LSA(一种用于肽序列提取的词嵌入方法),并将CNN与LSTM (CNN-LSTM)集成为最终的预测模型。Deep3BPP性能指标的准确率为97.42%,Kappa值为0.9257,MCC为0.9362。这些发现表明了一种更有效和更具成本效益的识别3BPP的方法,这对制药和医疗行业的研究人员具有重要意义。因此,这项工作提供了深刻的信息,可以促进科学研究和人们的整体福祉。
{"title":"Deep3BPP: Identification of Blood–Brain Barrier Penetrating Peptides Using Word Embedding Feature Extraction Method and CNN-LSTM","authors":"Md. Ashikur Rahman;Md. Mamun Ali;Kawsar Ahmed;Imran Mahmud;Francis M. Bui;Li Chen;Mohammad Ali Moni","doi":"10.1109/TAI.2025.3567434","DOIUrl":"https://doi.org/10.1109/TAI.2025.3567434","url":null,"abstract":"To prevent different chemicals from entering the brain, the blood–brain barrier penetrating peptide (3BPP) acts as a vital barrier between the bloodstream and the central nervous system (CNS). This barrier significantly hinders the treatment of neurological and CNS disorders. 3BPP can get beyond this barrier, making it easier to enter the brain and essential for treating CNS and neurological diseases and disorders. Computational techniques are being explored because traditional laboratory tests for 3BPP identification are costly and time-consuming. In this work, we introduced a novel technique for 3BPP prediction with a hybrid deep learning model. Our proposed model, Deep3BPP, leverages the LSA, a word embedding method for peptide sequence extraction, and integrates CNN with LSTM (CNN-LSTM) for the final prediction model. Deep3BPP performance metrics show a remarkable accuracy of 97.42%, a Kappa value of 0.9257, and an MCC of 0.9362. These findings indicate a more efficient and cost-effective method of identifying 3BPP, which has important implications for researchers in the pharmaceutical and medical industries. Thus, this work offers insightful information that can advance both scientific research and the well-being of people overall.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"7 1","pages":"562-570"},"PeriodicalIF":0.0,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reduction of Class Activation Uncertainty With Background Information 利用背景信息减少类激活的不确定性
Pub Date : 2025-03-15 DOI: 10.1109/TAI.2025.3570282
H. M. Dipu Kabir
Multitask learning is a popular approach to training high-performing neural networks with improved generalization. In this article, we propose a background class to achieve improved generalization at a lower computation compared to multitask learning to help researchers and organizations with limited computation power. We also present a methodology for selecting background images and discuss potential future improvements. We apply our approach to several datasets and achieve improved generalization with much lower computation. Through the class activation mappings (CAMs) of the trained models, we observed the tendency toward looking at a bigger picture with the proposed model training methodology. Applying the vision transformer with the proposed background class, we receive state-of-the-art (SOTA) performance on CIFAR-10C, Caltech-101, and CINIC-10 datasets.
多任务学习是训练高性能神经网络的一种流行方法。在本文中,我们提出了一个背景类,与多任务学习相比,以更低的计算量实现改进的泛化,以帮助计算能力有限的研究人员和组织。我们还提出了一种选择背景图像的方法,并讨论了未来可能的改进。我们将该方法应用于多个数据集,并以更低的计算量实现了改进的泛化。通过训练模型的类激活映射(CAMs),我们观察到使用提议的模型训练方法观察更大图景的趋势。将视觉转换器与所提出的背景类一起应用,我们在CIFAR-10C、Caltech-101和CINIC-10数据集上获得了最先进的(SOTA)性能。
{"title":"Reduction of Class Activation Uncertainty With Background Information","authors":"H. M. Dipu Kabir","doi":"10.1109/TAI.2025.3570282","DOIUrl":"https://doi.org/10.1109/TAI.2025.3570282","url":null,"abstract":"Multitask learning is a popular approach to training high-performing neural networks with improved generalization. In this article, we propose a background class to achieve improved generalization at a lower computation compared to multitask learning to help researchers and organizations with limited computation power. We also present a methodology for selecting background images and discuss potential future improvements. We apply our approach to several datasets and achieve improved generalization with much lower computation. Through the class activation mappings (CAMs) of the trained models, we observed the tendency toward looking at a bigger picture with the proposed model training methodology. Applying the vision transformer with the proposed background class, we receive state-of-the-art (SOTA) performance on CIFAR-10C, Caltech-101, and CINIC-10 datasets.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"7 1","pages":"571-585"},"PeriodicalIF":0.0,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comprehensive Survey on Diagnostic Microscopic Imaging Modalities, Challenges, Taxonomy, and Future Directions for Cervical Abnormality Detection and Grading 宫颈异常检测和分级的显微诊断成像方式、挑战、分类和未来方向的综合研究
Pub Date : 2025-03-13 DOI: 10.1109/TAI.2025.3551669
Anindita Mohanta;Sourav Dey Roy;Niharika Nath;Abhijit Datta;Mrinal Kanti Bhowmik
Cancer is one of the most severe diseases, affecting the lives of many people in the modern world. Among the various types of cancer, cervical cancer is one of the most frequently occurring cancers in the female population. In most cases, doctors and practitioners can typically only identify cervical cancer in its latter stages. Planning cancer therapy and increasing patient survival rates become very difficult as the disease progresses. As a result, diagnosing cervical cancer in its initial stages has become imperative to arrange proper therapy and surgery. In this article, we present a survey of automatic computerized methods for diagnosing cervical abnormalities based on microscopic imaging modalities. The present survey was conducted by defining a novel taxonomy of the surveyed techniques based on the approaches they used. We also discuss the challenges and subchallenges associated with an automatic cervical cancer diagnosis based on microscopic imaging modalities. Additionally, surveys on various public and private datasets used by the research community for developing new methods are presented. In this article, the performances of published papers are compared. The article concludes by suggesting possible research directions in these fields.
癌症是最严重的疾病之一,影响着现代世界许多人的生活。在各类癌症中,子宫颈癌是女性人群中最常见的癌症之一。在大多数情况下,医生和从业员通常只能在宫颈癌的后期阶段识别宫颈癌。随着病情的发展,规划癌症治疗和提高患者存活率变得非常困难。因此,及早诊断子宫颈癌,以安排适当的治疗和手术已成为当务之急。在这篇文章中,我们提出了一项基于显微成像模式的诊断宫颈异常的自动计算机方法的调查。目前的调查是通过根据他们使用的方法定义一种新的调查技术分类来进行的。我们还讨论了与基于显微成像方式的宫颈癌自动诊断相关的挑战和亚挑战。此外,还介绍了研究界用于开发新方法的各种公共和私人数据集的调查。本文对已发表论文的性能进行了比较。文章最后提出了今后可能的研究方向。
{"title":"A Comprehensive Survey on Diagnostic Microscopic Imaging Modalities, Challenges, Taxonomy, and Future Directions for Cervical Abnormality Detection and Grading","authors":"Anindita Mohanta;Sourav Dey Roy;Niharika Nath;Abhijit Datta;Mrinal Kanti Bhowmik","doi":"10.1109/TAI.2025.3551669","DOIUrl":"https://doi.org/10.1109/TAI.2025.3551669","url":null,"abstract":"Cancer is one of the most severe diseases, affecting the lives of many people in the modern world. Among the various types of cancer, cervical cancer is one of the most frequently occurring cancers in the female population. In most cases, doctors and practitioners can typically only identify cervical cancer in its latter stages. Planning cancer therapy and increasing patient survival rates become very difficult as the disease progresses. As a result, diagnosing cervical cancer in its initial stages has become imperative to arrange proper therapy and surgery. In this article, we present a survey of automatic computerized methods for diagnosing cervical abnormalities based on microscopic imaging modalities. The present survey was conducted by defining a novel taxonomy of the surveyed techniques based on the approaches they used. We also discuss the challenges and subchallenges associated with an automatic cervical cancer diagnosis based on microscopic imaging modalities. Additionally, surveys on various public and private datasets used by the research community for developing new methods are presented. In this article, the performances of published papers are compared. The article concludes by suggesting possible research directions in these fields.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 9","pages":"2354-2383"},"PeriodicalIF":0.0,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144926904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unimodal Distributions for Ordinal Regression 有序回归的单峰分布
Pub Date : 2025-03-10 DOI: 10.1109/TAI.2025.3549740
Jaime S. Cardoso;Ricardo P. M. Cruz;Tomé Albuquerque
In many real-world prediction tasks, the class labels contain information about the relative order between the labels that are not captured by commonly used loss functions such as multicategory cross-entropy. In ordinal regression, many works have incorporated ordinality into models and loss functions by promoting unimodality of the probability output. However, current approaches are based on heuristics, particularly nonparametric ones, which are still insufficiently explored in the literature. We analyze the set of unimodal distributions in the probability simplex, establishing fundamental properties and giving new perspectives to understand the ordinal regression problem. Two contributions are then proposed to incorporate the preference for unimodal distributions into the predictive model: 1) UnimodalNet, a new architecture that by construction ensures the output is a unimodal distribution, and 2) Wasserstein regularization, a new loss term that relies on the notion of projection in a set to promote unimodality. Experiments show that the new architecture achieves top performance, while the proposed new loss term is very competitive while maintaining high unimodality.
在许多现实世界的预测任务中,类标签包含关于标签之间的相对顺序的信息,这些信息不能被常用的损失函数(如多类别交叉熵)捕获。在序数回归中,许多研究通过提高概率输出的单模性,将序数纳入模型和损失函数中。然而,目前的方法是基于启发式的,特别是非参数的,这在文献中还没有得到充分的探讨。我们分析了概率单纯形中的单峰分布集,建立了基本性质,并为理解有序回归问题提供了新的视角。然后提出了两项贡献,将单峰分布的偏好纳入预测模型:1)单峰网络,一种新的架构,通过构造确保输出是单峰分布,以及2)Wasserstein正则化,一种新的损失项,依赖于集合中的投影概念来促进单峰分布。实验表明,新结构在保持高单峰性的同时,具有很强的竞争力。
{"title":"Unimodal Distributions for Ordinal Regression","authors":"Jaime S. Cardoso;Ricardo P. M. Cruz;Tomé Albuquerque","doi":"10.1109/TAI.2025.3549740","DOIUrl":"https://doi.org/10.1109/TAI.2025.3549740","url":null,"abstract":"In many real-world prediction tasks, the class labels contain information about the relative order between the labels that are not captured by commonly used loss functions such as multicategory cross-entropy. In ordinal regression, many works have incorporated ordinality into models and loss functions by promoting unimodality of the probability output. However, current approaches are based on heuristics, particularly nonparametric ones, which are still insufficiently explored in the literature. We analyze the set of unimodal distributions in the probability simplex, establishing fundamental properties and giving new perspectives to understand the ordinal regression problem. Two contributions are then proposed to incorporate the preference for unimodal distributions into the predictive model: 1) UnimodalNet, a new architecture that by construction ensures the output is a unimodal distribution, and 2) Wasserstein regularization, a new loss term that relies on the notion of projection in a set to promote unimodality. Experiments show that the new architecture achieves top performance, while the proposed new loss term is very competitive while maintaining high unimodality.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 9","pages":"2498-2509"},"PeriodicalIF":0.0,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10918699","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144926903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Artificial Intelligence Publication Information IEEE人工智能学报
Pub Date : 2025-03-10 DOI: 10.1109/TAI.2025.3546710
{"title":"IEEE Transactions on Artificial Intelligence Publication Information","authors":"","doi":"10.1109/TAI.2025.3546710","DOIUrl":"https://doi.org/10.1109/TAI.2025.3546710","url":null,"abstract":"","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 3","pages":"C2-C2"},"PeriodicalIF":0.0,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10918892","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143583131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Video-Based Human-Posture Monitoring From RGB-D Cameras 基于视频的RGB-D摄像机人体姿势监测
Pub Date : 2025-03-08 DOI: 10.1109/TAI.2025.3566926
Pavana Pradeep Kumar;Krishna Kant;Francesco Di Rienzo;Carlo Vallati
Correct pose/posture is crucial in most human activities, and increasingly in using computer screens of many form factors. In this article, we build a spatiotemporal reasoning infrastructure on top of standard computer vision (CV) algorithms to provide an alternate, much more accurate, faster method for tracking correct posture than pure deep learning (DL) methods. We use CV to determine poses of the 2-D human stick models from RGB images, which are further enhanced using depth information (from RGB-D camera) to determine relevant angles and compare them against the standards. By applying our method to two very different posture applications (knowledge worker and taekwondo), we show that it outperforms all others, including machine learning, deep learning, and time series-based prediction. Furthermore, superior performance is seen not only in the estimation accuracy but also in the estimation speed.
在大多数人类活动中,正确的姿势是至关重要的,在使用多种形式的电脑屏幕时也越来越重要。在本文中,我们在标准计算机视觉(CV)算法之上构建了一个时空推理基础设施,以提供一种比纯深度学习(DL)方法更准确、更快的替代方法来跟踪正确的姿势。我们使用CV从RGB图像中确定二维人体棒模型的姿态,并使用深度信息(来自RGB- d相机)进一步增强以确定相关角度并将其与标准进行比较。通过将我们的方法应用于两种非常不同的姿势应用(知识工作者和跆拳道),我们表明它优于所有其他方法,包括机器学习、深度学习和基于时间序列的预测。不仅在估计精度上,而且在估计速度上都有较好的表现。
{"title":"Video-Based Human-Posture Monitoring From RGB-D Cameras","authors":"Pavana Pradeep Kumar;Krishna Kant;Francesco Di Rienzo;Carlo Vallati","doi":"10.1109/TAI.2025.3566926","DOIUrl":"https://doi.org/10.1109/TAI.2025.3566926","url":null,"abstract":"Correct pose/posture is crucial in most human activities, and increasingly in using computer screens of many form factors. In this article, we build a spatiotemporal reasoning infrastructure on top of standard computer vision (CV) algorithms to provide an alternate, much more accurate, faster method for tracking correct posture than pure deep learning (DL) methods. We use CV to determine poses of the 2-D human stick models from RGB images, which are further enhanced using depth information (from RGB-D camera) to determine relevant angles and compare them against the standards. By applying our method to two very different posture applications (knowledge worker and taekwondo), we show that it outperforms all others, including machine learning, deep learning, and time series-based prediction. Furthermore, superior performance is seen not only in the estimation accuracy but also in the estimation speed.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 12","pages":"3406-3416"},"PeriodicalIF":0.0,"publicationDate":"2025-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145612209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Non-uniform Illumination Attack for Fooling Convolutional Neural Networks 欺骗卷积神经网络的非均匀光照攻击
Pub Date : 2025-03-07 DOI: 10.1109/TAI.2025.3549396
Akshay Jain;Shiv Ram Dubey;Satish Kumar Singh;KC Santosh;Bidyut Baran Chaudhuri
Convolutional neural networks (CNNs) have made remarkable strides; however, they remain susceptible to vulnerabilities, particularly to image perturbations that humans can easily recognize. This weakness, often termed as “attacks,” underscores the limited robustness of CNNs and the need for research into fortifying their resistance against such manipulations. This study introduces a novel nonuniform illumination (NUI) attack technique, where images are subtly altered using varying NUI masks. Extensive experiments are conducted on widely accepted datasets including CIFAR10, TinyImageNet, CalTech256, and NWPU-RESISC45 focusing on image classification with 12 different NUI masks. The resilience of VGG, ResNet, MobilenetV3-small, InceptionV3, and EfficientNet_b0 models against NUI attacks are evaluated. Our results show a substantial decline in the CNN models’ classification accuracy when subjected to NUI attacks, due to changes in the image pixel value distribution, indicating their vulnerability under NUI. To mitigate this, a defense strategy is proposed, including NUI-attacked images, generated through the new NUI transformation, into the training set. The results demonstrate a significant enhancement in CNN model performance when confronted with perturbed images affected by NUI attacks. This strategy seeks to bolster CNN models’ resilience against NUI attacks. A comparative study with other attack techniques shows the effectiveness of the NUI attack and defense technique.1

1The code is available at https://github.com/Akshayjain97/Non-Uniform_Illumination

卷积神经网络(cnn)已经取得了显著的进步;然而,它们仍然容易受到脆弱性的影响,特别是人类很容易识别的图像扰动。这个弱点,通常被称为“攻击”,强调了cnn有限的稳健性,以及加强其抵抗此类操纵的研究的必要性。本研究介绍了一种新的非均匀照明(NUI)攻击技术,其中使用不同的NUI掩模巧妙地改变图像。在CIFAR10、TinyImageNet、CalTech256和NWPU-RESISC45等被广泛接受的数据集上进行了大量的实验,重点研究了12种不同NUI掩码的图像分类。评估了VGG、ResNet、MobilenetV3-small、InceptionV3和EfficientNet_b0模型对NUI攻击的弹性。我们的研究结果表明,由于图像像素值分布的变化,CNN模型在受到NUI攻击时的分类精度出现了大幅下降,这说明了CNN模型在NUI攻击下的脆弱性。为了缓解这种情况,提出了一种防御策略,将通过新的NUI变换生成的NUI攻击图像纳入训练集。结果表明,当面对受NUI攻击影响的扰动图像时,CNN模型的性能有显著提高。这一策略旨在增强CNN模型抵御NUI攻击的能力。通过与其他攻击技术的对比研究,证明了NUI攻防技术的有效性。代码可在https://github.com/Akshayjain97/Non-Uniform_Illumination上获得
{"title":"Non-uniform Illumination Attack for Fooling Convolutional Neural Networks","authors":"Akshay Jain;Shiv Ram Dubey;Satish Kumar Singh;KC Santosh;Bidyut Baran Chaudhuri","doi":"10.1109/TAI.2025.3549396","DOIUrl":"https://doi.org/10.1109/TAI.2025.3549396","url":null,"abstract":"Convolutional neural networks (CNNs) have made remarkable strides; however, they remain susceptible to vulnerabilities, particularly to image perturbations that humans can easily recognize. This weakness, often termed as “attacks,” underscores the limited robustness of CNNs and the need for research into fortifying their resistance against such manipulations. This study introduces a novel nonuniform illumination (NUI) attack technique, where images are subtly altered using varying NUI masks. Extensive experiments are conducted on widely accepted datasets including CIFAR10, TinyImageNet, CalTech256, and NWPU-RESISC45 focusing on image classification with 12 different NUI masks. The resilience of VGG, ResNet, MobilenetV3-small, InceptionV3, and EfficientNet_b0 models against NUI attacks are evaluated. Our results show a substantial decline in the CNN models’ classification accuracy when subjected to NUI attacks, due to changes in the image pixel value distribution, indicating their vulnerability under NUI. To mitigate this, a defense strategy is proposed, including NUI-attacked images, generated through the new NUI transformation, into the training set. The results demonstrate a significant enhancement in CNN model performance when confronted with perturbed images affected by NUI attacks. This strategy seeks to bolster CNN models’ resilience against NUI attacks. A comparative study with other attack techniques shows the effectiveness of the NUI attack and defense technique.<xref><sup>1</sup></xref><fn><p><sup>1</sup>The code is available at <uri>https://github.com/Akshayjain97/Non-Uniform_Illumination</uri></p></fn>","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 9","pages":"2476-2485"},"PeriodicalIF":0.0,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144926890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MalaNet: A Small World Inspired Neural Network for Automated Malaria Diagnosis MalaNet:用于疟疾自动诊断的小世界启发神经网络
Pub Date : 2025-03-07 DOI: 10.1109/TAI.2025.3549406
Shubham Dwivedi;Kartikeya Pandey;Kumar Shubham;Om Jee Pandey;Achyut Mani Tripathi;Tushar Sandhan;Rajesh M. Hegde
In this work, a novel neural network architecture called MalaNet is proposed for the detection and diagnosis of malaria, an infectious disease that poses a major global health challenge. The proposed neural network architecture is inspired by small-world network principles, which generally involve the introduction of new links. A small-world neural network is realized by establishing new connections, thereby reducing the average path length and increasing clustering coefficient. These characteristics are known to enhance interconnectivity and improve feature propagation within the network. In the context of malaria diagnosis, these characteristics of MalaNet can enhance detection accuracy and enable better generalization in scenarios with limited data availability. Broadly, two variants of MalaNet are proposed in this work. First, a small-world-inspired feed-forward neural network (FNN) is developed for symptom and categorical feature-based diagnosis, providing an accessible solution when blood smear images are unavailable. Subsequently, a small-world-inspired convolutional neural network (CNN) is developed for precise and automated diagnosis when blood smear images are available. Both variants of MalaNet are rigorously validated using the National Institute of Health Malaria dataset, a clinical dataset from Federal Polytechnic Ilaro Medical Centre, Nigeria, and the APTOS dataset. Comparative results against several state-of-the-art neural network models in the literature demonstrate MalaNet’s superior performance, generalization capability, and computational efficiency. The small-world neural network architecture proposed in this work enhances feature learning, diagnostic accuracy, and adaptability in limited-data and resource-constrained settings, motivating its application in disease diagnosis where timely and accurate results are critical.
在这项工作中,一种名为MalaNet的新型神经网络架构被提出用于检测和诊断疟疾,这是一种对全球健康构成重大挑战的传染病。所提出的神经网络架构受到小世界网络原理的启发,通常涉及引入新链路。通过建立新的连接来实现小世界神经网络,从而减少平均路径长度,提高聚类系数。已知这些特征可以增强网络的互联性并改善特征在网络中的传播。在疟疾诊断的背景下,MalaNet的这些特征可以提高检测精度,并在数据可用性有限的情况下实现更好的泛化。从广义上讲,本文提出了MalaNet的两种变体。首先,开发了一种小世界启发的前馈神经网络(FNN),用于基于症状和分类特征的诊断,在无法获得血液涂片图像时提供可访问的解决方案。随后,当血液涂片图像可用时,开发了一个小世界启发的卷积神经网络(CNN),用于精确和自动诊断。MalaNet的两种变体都使用国家卫生研究所疟疾数据集、尼日利亚联邦理工学院Ilaro医学中心的临床数据集和APTOS数据集进行了严格验证。与文献中几种最先进的神经网络模型的比较结果表明,MalaNet具有优越的性能、泛化能力和计算效率。本研究提出的小世界神经网络架构增强了在有限数据和资源约束环境下的特征学习、诊断准确性和适应性,促进了其在疾病诊断中的应用,而及时和准确的结果是至关重要的。
{"title":"MalaNet: A Small World Inspired Neural Network for Automated Malaria Diagnosis","authors":"Shubham Dwivedi;Kartikeya Pandey;Kumar Shubham;Om Jee Pandey;Achyut Mani Tripathi;Tushar Sandhan;Rajesh M. Hegde","doi":"10.1109/TAI.2025.3549406","DOIUrl":"https://doi.org/10.1109/TAI.2025.3549406","url":null,"abstract":"In this work, a novel neural network architecture called MalaNet is proposed for the detection and diagnosis of malaria, an infectious disease that poses a major global health challenge. The proposed neural network architecture is inspired by small-world network principles, which generally involve the introduction of new links. A small-world neural network is realized by establishing new connections, thereby reducing the average path length and increasing clustering coefficient. These characteristics are known to enhance interconnectivity and improve feature propagation within the network. In the context of malaria diagnosis, these characteristics of MalaNet can enhance detection accuracy and enable better generalization in scenarios with limited data availability. Broadly, two variants of MalaNet are proposed in this work. First, a small-world-inspired feed-forward neural network (FNN) is developed for symptom and categorical feature-based diagnosis, providing an accessible solution when blood smear images are unavailable. Subsequently, a small-world-inspired convolutional neural network (CNN) is developed for precise and automated diagnosis when blood smear images are available. Both variants of MalaNet are rigorously validated using the National Institute of Health Malaria dataset, a clinical dataset from Federal Polytechnic Ilaro Medical Centre, Nigeria, and the APTOS dataset. Comparative results against several state-of-the-art neural network models in the literature demonstrate MalaNet’s superior performance, generalization capability, and computational efficiency. The small-world neural network architecture proposed in this work enhances feature learning, diagnostic accuracy, and adaptability in limited-data and resource-constrained settings, motivating its application in disease diagnosis where timely and accurate results are critical.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 9","pages":"2486-2497"},"PeriodicalIF":0.0,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144926892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Efficient Multi-UAV Air Combat: An Intention Inference and Sparse Transmission Based Multiagent Reinforcement Learning Algorithm 实现高效多无人机空战:基于意图推理和稀疏传输的多智能体强化学习算法
Pub Date : 2025-03-07 DOI: 10.1109/TAI.2025.3567431
Jinchao Han;Yan Yan;Baoxian Zhang
With the increasing utilization of unmanned aerial vehicles (UAVs) in military operations, multi-UAV air combat has been emerging as one of the most important modes for future warfare. How to achieve intelligent cooperative maneuver policies subject to limited information sharing caused by the communication constraints among UAVs is crucial for winning air combat. In this article, we formulate the communication-constrained multi-UAV air combat problem as a Markov game and propose a novel sparse inferred intention sharing multiagent reinforcement learning (SIIS-MARL) algorithm for improving the winning rate of multi-UAV air combat. Our proposed algorithm contains the following designs: An intention inference module that enables each UAV to infer the intentions of teammates through the theory of mind (ToM) network for improved cooperation among teammates, and an attention-based sparse transmission mechanism which utilizes the inferred intentions and encoded embeddings to learn communication weights of teammates for enabling efficient sparsity in communication without causing performance penalty. Simulation results validate the effectiveness of our proposed algorithm as compared with existing work.
随着无人机在军事行动中的应用日益广泛,多无人机空战已成为未来战争的重要模式之一。如何在通信约束下实现无人机间有限信息共享的智能协同机动策略,是赢得空战胜利的关键。本文将通信受限的多无人机空战问题形式化为马尔可夫博弈,提出了一种新的稀疏推断意图共享多智能体强化学习(SIIS-MARL)算法,以提高多无人机空战的胜率。我们提出的算法包含以下设计:意图推理模块,使每架无人机能够通过心理理论(ToM)网络推断队友的意图,以改善队友之间的合作;基于注意力的稀疏传输机制,利用推断的意图和编码嵌入来学习队友的通信权重,从而在不造成性能损失的情况下实现高效的稀疏通信。仿真结果验证了该算法的有效性。
{"title":"Towards Efficient Multi-UAV Air Combat: An Intention Inference and Sparse Transmission Based Multiagent Reinforcement Learning Algorithm","authors":"Jinchao Han;Yan Yan;Baoxian Zhang","doi":"10.1109/TAI.2025.3567431","DOIUrl":"https://doi.org/10.1109/TAI.2025.3567431","url":null,"abstract":"With the increasing utilization of unmanned aerial vehicles (UAVs) in military operations, multi-UAV air combat has been emerging as one of the most important modes for future warfare. How to achieve intelligent cooperative maneuver policies subject to limited information sharing caused by the communication constraints among UAVs is crucial for winning air combat. In this article, we formulate the communication-constrained multi-UAV air combat problem as a Markov game and propose a novel sparse inferred intention sharing multiagent reinforcement learning (SIIS-MARL) algorithm for improving the winning rate of multi-UAV air combat. Our proposed algorithm contains the following designs: An intention inference module that enables each UAV to infer the intentions of teammates through the theory of mind (ToM) network for improved cooperation among teammates, and an attention-based sparse transmission mechanism which utilizes the inferred intentions and encoded embeddings to learn communication weights of teammates for enabling efficient sparsity in communication without causing performance penalty. Simulation results validate the effectiveness of our proposed algorithm as compared with existing work.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 12","pages":"3441-3452"},"PeriodicalIF":0.0,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145612197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE transactions on artificial intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1