首页 > 最新文献

IEEE transactions on artificial intelligence最新文献

英文 中文
An Improved Continuous-Encoding-Based Multiobjective Evolutionary Algorithm for Community Detection in Complex Networks 用于复杂网络中社群检测的基于连续编码的改进型多目标进化算法
Pub Date : 2024-08-13 DOI: 10.1109/TAI.2024.3442153
Jun Fu;Yan Wang
Community detection is a fundamental and widely studied field in network science. To perform community detection, various competitive multiobjective evolutionary algorithms (MOEAs) have been proposed. It is worth noting that the latest continuous encoding (CE) method transforms the original discrete problem into a continuous one, which can achieve better community partitioning. However, the original CE ignored important structural features of nodes, such as the clustering coefficient (CC), resulting in poor initial solutions and reduced the performance of community detection. Therefore, we propose a simple scheme to effectively utilize node structure feature vectors to enhance community detection. Specifically, a CE and CC-based (CE-CC) MOEA called CECC-Net is proposed. In CECC-Net, the CC vector performs the Hadamard product with a continuous vector (i.e., a concatenation of the continuous variables $mathbf{x}$ associated with the edges), resulting in an improved initial individual. Then, applying the nonlinear transformation to the continuous-valued individual yields a discrete-valued community grouping solution. Furthermore, a corresponding adaptive operator is designed as an essential part of this scheme to mitigate the negative effects of feature vectors on population diversity. The effectiveness of the proposed scheme was validated through ablation and comparative experiments. Experimental results on synthetic and real-world networks demonstrate that the proposed algorithm has competitive performance in comparison with several state-of-the-art EA-based community detection algorithms.
社群检测是网络科学中的一个基础领域,也是一个被广泛研究的领域。为了进行社群检测,人们提出了各种有竞争力的多目标进化算法(MOEAs)。值得注意的是,最新的连续编码(CE)方法将原来的离散问题转化为连续问题,可以实现更好的社区划分。但是,原有的 CE 忽略了节点的重要结构特征,如聚类系数(CC),导致初始解不理想,降低了社区检测的性能。因此,我们提出了一种简单的方案,有效利用节点结构特征向量来增强社群检测。具体来说,我们提出了一种基于 CE 和 CC(CE-CC)的 MOEA,称为 CECC-Net。在 CECC-Net 中,CC 向量与连续向量(即与边缘相关的连续变量 $/mathbf{x}$)进行哈达玛乘积,从而得到一个改进的初始个体。然后,将非线性变换应用于连续值个体,就能得到离散值群体分组解决方案。此外,还设计了一个相应的自适应算子,作为该方案的重要组成部分,以减轻特征向量对群体多样性的负面影响。通过消融和对比实验,验证了所提方案的有效性。在合成网络和真实世界网络上的实验结果表明,与几种最先进的基于 EA 的群落检测算法相比,所提出的算法具有很强的竞争力。
{"title":"An Improved Continuous-Encoding-Based Multiobjective Evolutionary Algorithm for Community Detection in Complex Networks","authors":"Jun Fu;Yan Wang","doi":"10.1109/TAI.2024.3442153","DOIUrl":"https://doi.org/10.1109/TAI.2024.3442153","url":null,"abstract":"Community detection is a fundamental and widely studied field in network science. To perform community detection, various competitive multiobjective evolutionary algorithms (MOEAs) have been proposed. It is worth noting that the latest continuous encoding (CE) method transforms the original discrete problem into a continuous one, which can achieve better community partitioning. However, the original CE ignored important structural features of nodes, such as the clustering coefficient (CC), resulting in poor initial solutions and reduced the performance of community detection. Therefore, we propose a simple scheme to effectively utilize node structure feature vectors to enhance community detection. Specifically, a CE and CC-based (CE-CC) MOEA called CECC-Net is proposed. In CECC-Net, the CC vector performs the Hadamard product with a continuous vector (i.e., a concatenation of the continuous variables \u0000<inline-formula><tex-math>$mathbf{x}$</tex-math></inline-formula>\u0000 associated with the edges), resulting in an improved initial individual. Then, applying the nonlinear transformation to the continuous-valued individual yields a discrete-valued community grouping solution. Furthermore, a corresponding adaptive operator is designed as an essential part of this scheme to mitigate the negative effects of feature vectors on population diversity. The effectiveness of the proposed scheme was validated through ablation and comparative experiments. Experimental results on synthetic and real-world networks demonstrate that the proposed algorithm has competitive performance in comparison with several state-of-the-art EA-based community detection algorithms.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5815-5827"},"PeriodicalIF":0.0,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Artificial Intelligence Publication Information IEEE Transactions on Artificial Intelligence 出版信息
Pub Date : 2024-08-13 DOI: 10.1109/TAI.2024.3436231
{"title":"IEEE Transactions on Artificial Intelligence Publication Information","authors":"","doi":"10.1109/TAI.2024.3436231","DOIUrl":"https://doi.org/10.1109/TAI.2024.3436231","url":null,"abstract":"","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 8","pages":"C2-C2"},"PeriodicalIF":0.0,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10635097","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141980065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent Multigrade Brain Tumor Identification in MRI: A Metaheuristic-Based Uncertain Set Framework 磁共振成像中的智能多级脑肿瘤识别:基于元搜索的不确定集合框架
Pub Date : 2024-08-12 DOI: 10.1109/TAI.2024.3441520
Saravanan Alagarsamy;Vishnuvarthanan Govindaraj;A. Shahina;D. Nagarajan
This research intends to address the critical need for precise brain tumor prediction through the development of an automated method that entwines the Firefly (FF) algorithm and the interval type-II fuzzy (IT2FLS) technique. The proposed method improves tumor delineation in complex brain tissue by using the FF algorithm to find possible cluster positions and the IT2FLS system for final clustering. This algorithm demonstrates its versatility by processing diverse image sequences from BRATS challenge datasets (2017, 2018, and 2020), which encompass varying levels of complexity. Through comprehensive evaluation metrics such as sensitivity, specificity, and dice-overlap index (DOI), the proposed algorithm consistently yields improved segmentation results. Ultimately, this research aims to augment oncologists' perceptual acumen, facilitating enhanced intuition and comprehension of patients' conditions, thereby advancing decision-making capabilities in medical research.
这项研究旨在通过开发一种结合了萤火虫(FF)算法和区间II型模糊(IT2FLS)技术的自动化方法,满足精确预测脑肿瘤的迫切需要。所提出的方法利用萤火虫算法寻找可能的聚类位置,并利用 IT2FLS 系统进行最终聚类,从而改进了复杂脑组织中的肿瘤划分。该算法通过处理来自 BRATS 挑战数据集(2017 年、2018 年和 2020 年)的各种图像序列,展示了其多功能性,这些数据集包含不同程度的复杂性。通过灵敏度、特异性和骰子重叠指数(DOI)等综合评估指标,所提出的算法始终能产生更好的分割结果。最终,这项研究旨在增强肿瘤学家的感知敏锐度,促进对患者病情的直觉和理解,从而提高医学研究的决策能力。
{"title":"Intelligent Multigrade Brain Tumor Identification in MRI: A Metaheuristic-Based Uncertain Set Framework","authors":"Saravanan Alagarsamy;Vishnuvarthanan Govindaraj;A. Shahina;D. Nagarajan","doi":"10.1109/TAI.2024.3441520","DOIUrl":"https://doi.org/10.1109/TAI.2024.3441520","url":null,"abstract":"This research intends to address the critical need for precise brain tumor prediction through the development of an automated method that entwines the Firefly (FF) algorithm and the interval type-II fuzzy (IT2FLS) technique. The proposed method improves tumor delineation in complex brain tissue by using the FF algorithm to find possible cluster positions and the IT2FLS system for final clustering. This algorithm demonstrates its versatility by processing diverse image sequences from BRATS challenge datasets (2017, 2018, and 2020), which encompass varying levels of complexity. Through comprehensive evaluation metrics such as sensitivity, specificity, and dice-overlap index (DOI), the proposed algorithm consistently yields improved segmentation results. Ultimately, this research aims to augment oncologists' perceptual acumen, facilitating enhanced intuition and comprehension of patients' conditions, thereby advancing decision-making capabilities in medical research.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5381-5391"},"PeriodicalIF":0.0,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DeepHGCN: Toward Deeper Hyperbolic Graph Convolutional Networks deep phgcn:迈向更深的双曲图卷积网络
Pub Date : 2024-08-08 DOI: 10.1109/TAI.2024.3440223
Jiaxu Liu;Xinping Yi;Xiaowei Huang
Hyperbolic graph convolutional networks (HGCNs) have demonstrated significant potential in extracting information from hierarchical graphs. However, existing HGCNs are limited to shallow architectures due to the computational expense of hyperbolic operations and the issue of oversmoothing as depth increases. Although treatments have been applied to alleviate oversmoothing in graph convolutional networks (GCNs), developing a hyperbolic solution presents distinct challenges since operations must be carefully designed to fit the hyperbolic nature. Addressing these challenges, we propose DeepHGCN, the first deep multilayer HGCN architecture with dramatically improved computational efficiency and substantially reduced oversmoothing. DeepHGCN features two key innovations: 1) a novel hyperbolic feature transformation layer that enables fast and accurate linear mappings; and 2) techniques such as hyperbolic residual connections and regularization for both weights and features, facilitated by an efficient hyperbolic midpoint method. Extensive experiments demonstrate that DeepHGCN achieves significant improvements in link prediction (LP) and node classification (NC) tasks compared to both Euclidean and shallow hyperbolic GCN variants.
双曲图卷积网络(HGCNs)在从层次图中提取信息方面显示出巨大的潜力。然而,由于双曲运算的计算成本和随着深度增加的过平滑问题,现有的hgcn仅限于浅层架构。尽管已经应用了一些处理方法来缓解图卷积网络(GCNs)中的过度平滑,但由于必须仔细设计操作以适应双曲性质,因此开发双曲解决方案提出了明显的挑战。为了解决这些挑战,我们提出了DeepHGCN,这是第一个深度多层HGCN架构,大大提高了计算效率,并大大减少了过平滑。DeepHGCN具有两个关键创新:1)一种新的双曲特征转换层,可以实现快速准确的线性映射;2)利用有效的双曲中点方法对权值和特征进行双曲残差连接和正则化等技术。大量实验表明,与欧几里得和浅双曲GCN变体相比,DeepHGCN在链路预测(LP)和节点分类(NC)任务方面取得了显着改进。
{"title":"DeepHGCN: Toward Deeper Hyperbolic Graph Convolutional Networks","authors":"Jiaxu Liu;Xinping Yi;Xiaowei Huang","doi":"10.1109/TAI.2024.3440223","DOIUrl":"https://doi.org/10.1109/TAI.2024.3440223","url":null,"abstract":"Hyperbolic graph convolutional networks (HGCNs) have demonstrated significant potential in extracting information from hierarchical graphs. However, existing HGCNs are limited to shallow architectures due to the computational expense of hyperbolic operations and the issue of oversmoothing as depth increases. Although treatments have been applied to alleviate oversmoothing in graph convolutional networks (GCNs), developing a hyperbolic solution presents distinct challenges since operations must be carefully designed to fit the hyperbolic nature. Addressing these challenges, we propose DeepHGCN, the first deep multilayer HGCN architecture with dramatically improved computational efficiency and substantially reduced oversmoothing. DeepHGCN features two key innovations: 1) a novel hyperbolic feature transformation layer that enables fast and accurate linear mappings; and 2) techniques such as hyperbolic residual connections and regularization for both weights and features, facilitated by an efficient hyperbolic midpoint method. Extensive experiments demonstrate that DeepHGCN achieves significant improvements in link prediction (LP) and node classification (NC) tasks compared to both Euclidean and shallow hyperbolic GCN variants.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6172-6185"},"PeriodicalIF":0.0,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142810282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Silver Lining in the Fake News Cloud: Can Large Language Models Help Detect Misinformation? 假新闻云中的一线希望:大型语言模型能帮助检测错误信息吗?
Pub Date : 2024-08-08 DOI: 10.1109/TAI.2024.3440248
Raghvendra Kumar;Bhargav Goddu;Sriparna Saha;Adam Jatowt
In the times of advanced generative artificial intelligence, distinguishing truth from fallacy and deception has become a critical societal challenge. This research attempts to analyze the capabilities of large language models (LLMs) for detecting misinformation. Our study employs a versatile approach, covering multiple LLMs with few- and zero-shot prompting. These models are rigorously evaluated across various fake news and rumor detection datasets. Introducing a novel dimension, we additionally incorporate sentiment and emotion annotations to understand the emotional influence on misinformation detection using LLMs. Moreover, to extend our inquiry, we employ ChatGPT to intentionally distort authentic news as well as human-written fake news, utilizing zero-shot and iterative prompts. This deliberate corruption allows for a detailed examination of various parameters such as abstractness, concreteness, and named entity density, providing insights into differentiating between unaltered news, human-written fake news, and its LLM-corrupted counterpart. Our findings aspire to furnish a refined framework for discerning authentic news, human-generated misinformation, and LLM-induced distortions. This multifaceted approach, utilizing various prompt techniques, contributes to a comprehensive understanding of the subtle variations shaping misinformation sources.
在先进的生成式人工智能时代,区分真理与谬误和欺骗已成为一项关键的社会挑战。本研究试图分析大型语言模型(llm)检测错误信息的能力。我们的研究采用了一种通用的方法,涵盖了多个llm与很少和零射击提示。这些模型经过各种假新闻和谣言检测数据集的严格评估。引入一个新的维度,我们还结合了情感和情感注释来理解情感对llm错误信息检测的影响。此外,为了扩大我们的调查范围,我们使用ChatGPT故意歪曲真实新闻和人工编写的假新闻,利用零射击和迭代提示。这种故意的破坏允许对各种参数进行详细检查,例如抽象性、具体性和命名实体密度,从而提供了区分未经修改的新闻、人工编写的假新闻和其llm破坏的对应内容的见解。我们的研究结果旨在为辨别真实新闻、人为错误信息和法学硕士引起的扭曲提供一个精细的框架。这种多方面的方法,利用各种提示技术,有助于全面了解塑造错误信息源的微妙变化。
{"title":"Silver Lining in the Fake News Cloud: Can Large Language Models Help Detect Misinformation?","authors":"Raghvendra Kumar;Bhargav Goddu;Sriparna Saha;Adam Jatowt","doi":"10.1109/TAI.2024.3440248","DOIUrl":"https://doi.org/10.1109/TAI.2024.3440248","url":null,"abstract":"In the times of advanced generative artificial intelligence, distinguishing truth from fallacy and deception has become a critical societal challenge. This research attempts to analyze the capabilities of large language models (LLMs) for detecting misinformation. Our study employs a versatile approach, covering multiple LLMs with few- and zero-shot prompting. These models are rigorously evaluated across various fake news and rumor detection datasets. Introducing a novel dimension, we additionally incorporate sentiment and emotion annotations to understand the emotional influence on misinformation detection using LLMs. Moreover, to extend our inquiry, we employ ChatGPT to intentionally distort authentic news as well as human-written fake news, utilizing zero-shot and iterative prompts. This deliberate corruption allows for a detailed examination of various parameters such as abstractness, concreteness, and named entity density, providing insights into differentiating between unaltered news, human-written fake news, and its LLM-corrupted counterpart. Our findings aspire to furnish a refined framework for discerning authentic news, human-generated misinformation, and LLM-induced distortions. This multifaceted approach, utilizing various prompt techniques, contributes to a comprehensive understanding of the subtle variations shaping misinformation sources.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 1","pages":"14-24"},"PeriodicalIF":0.0,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142975893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SSpose: Self-Supervised Spatial-Aware Model for Human Pose Estimation SSpose:用于人体姿态估计的自监督空间感知模型
Pub Date : 2024-08-08 DOI: 10.1109/TAI.2024.3440220
Linfang Yu;Zhen Qin;Liqun Xu;Zhiguang Qin;Kim-Kwang Raymond Choo
Human pose estimation (HPE) relies on the anatomical relationships among different body parts to locate keypoints. Despite the significant progress achieved by convolutional neural networks (CNN)-based models in HPE, they typically fail to explicitly learn the global dependencies among various body parts. To overcome this limitation, we propose a spatial-aware HPE model called SSpose that explicitly captures the spatial dependencies between specific key points and different locations in an image. The proposed SSpose model adopts a hybrid CNN-Transformer encoder to simultaneously capture local features and global dependencies. To better preserve image details, a multiscale fusion module is introduced to integrate coarse- and fine-grained image information. By establishing a connection with the activation maximization (AM) principle, the final attention layer of the Transformer aggregates contributions (i.e., attention scores) from all image positions and forms the maximum position in the heatmap, thereby achieving keypoint localization in the head structure. Additionally, to address the issue of visible information leakage in convolutional reconstruction, we have devised a self-supervised training framework for the SSpose model. This framework incorporates mask autoencoder (MAE) technology into SSpose models by utilizing masked convolution and hierarchical masking strategy, thereby facilitating efficient self-supervised learning. Extensive experiments demonstrate that SSpose performs exceptionally well in the pose estimation task. On the COCO val set, it achieves an AP and AR of 77.3% and 82.1%, respectively, while on the COCO test-dev set, the AP and AR are 76.4% and 81.5%. Moreover, the model exhibits strong generalization capabilities on MPII.
人体姿态估计(HPE)依赖于不同身体部位之间的解剖关系来定位关键点。尽管基于卷积神经网络(CNN)的模型在 HPE 方面取得了重大进展,但它们通常无法明确学习不同身体部位之间的全局依赖关系。为了克服这一局限,我们提出了一种名为 SSpose 的空间感知 HPE 模型,它能明确捕捉图像中特定关键点与不同位置之间的空间依赖关系。所提出的 SSpose 模型采用混合 CNN 变换器编码器,可同时捕捉局部特征和全局依赖关系。为了更好地保留图像细节,还引入了多尺度融合模块来整合粗粒度和细粒度图像信息。通过与激活最大化(AM)原理建立联系,变换器的最终注意力层汇总了来自所有图像位置的贡献(即注意力分数),并形成热图中的最大位置,从而实现头部结构中的关键点定位。此外,为了解决卷积重建中的可见信息泄漏问题,我们还为 SSpose 模型设计了一个自监督训练框架。该框架利用掩码卷积和分层掩码策略,将掩码自动编码器(MAE)技术融入 SSpose 模型,从而促进了高效的自我监督学习。大量实验证明,SSpose 在姿态估计任务中表现优异。在 COCO val 集上,它的 AP 和 AR 分别达到 77.3% 和 82.1%,而在 COCO test-dev 集上,AP 和 AR 分别为 76.4% 和 81.5%。此外,该模型在 MPII 上也表现出很强的泛化能力。
{"title":"SSpose: Self-Supervised Spatial-Aware Model for Human Pose Estimation","authors":"Linfang Yu;Zhen Qin;Liqun Xu;Zhiguang Qin;Kim-Kwang Raymond Choo","doi":"10.1109/TAI.2024.3440220","DOIUrl":"https://doi.org/10.1109/TAI.2024.3440220","url":null,"abstract":"Human pose estimation (HPE) relies on the anatomical relationships among different body parts to locate keypoints. Despite the significant progress achieved by convolutional neural networks (CNN)-based models in HPE, they typically fail to explicitly learn the global dependencies among various body parts. To overcome this limitation, we propose a spatial-aware HPE model called SSpose that explicitly captures the spatial dependencies between specific key points and different locations in an image. The proposed SSpose model adopts a hybrid CNN-Transformer encoder to simultaneously capture local features and global dependencies. To better preserve image details, a multiscale fusion module is introduced to integrate coarse- and fine-grained image information. By establishing a connection with the activation maximization (AM) principle, the final attention layer of the Transformer aggregates contributions (i.e., attention scores) from all image positions and forms the maximum position in the heatmap, thereby achieving keypoint localization in the head structure. Additionally, to address the issue of visible information leakage in convolutional reconstruction, we have devised a self-supervised training framework for the SSpose model. This framework incorporates mask autoencoder (MAE) technology into SSpose models by utilizing masked convolution and hierarchical masking strategy, thereby facilitating efficient self-supervised learning. Extensive experiments demonstrate that SSpose performs exceptionally well in the pose estimation task. On the COCO val set, it achieves an AP and AR of 77.3% and 82.1%, respectively, while on the COCO test-dev set, the AP and AR are 76.4% and 81.5%. Moreover, the model exhibits strong generalization capabilities on MPII.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5403-5417"},"PeriodicalIF":0.0,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comprehensive Radiogenomic Feature Characterization of 19/20 Co-gain in Glioblastoma 胶质母细胞瘤19/20共增益的综合放射基因组学特征表征
Pub Date : 2024-08-08 DOI: 10.1109/TAI.2024.3440219
Padmaja Jonnalagedda;Brent Weinberg;Taejin L. Min;Shiv Bhanu;Bir Bhanu
The prognosis and treatment planning of glioblastoma multiforme (GBM) involves a holistic analysis of imaging, clinical, and molecular data. The correlation of imaging and molecular features has garnered much interest due to its potential to reduce the number of invasive procedures on a patient and resource utilization of the overall prognostic and treatment planning process. This article detects and characterizes the impact of tumor biomarkers (such as shape, texture, location, and the tissue surrounding the tumor) in detecting a prognostic mutation – the concurrent gain of 19 and 20 chromosomes, and proposes two novel ideas for this analysis. First, to address the challenges associated with the limited, diverse, and complex nature of medical data, this article proposes a novel generative model – the realistic radiogenomic design using disentanglement in generative adversarial networks (R2D2-GAN), designed to recreate highly subtle, unapparent manifestations of mutations in magnetic resonance imaging. It generates high-resolution, diverse data that captures the discriminatory visual features of the molecular markers while tackling the high diversity, unbalanced, and limited GBM data with rare mutations correlating with patient survival such as 19/20 co-gain. Second, this study proposes a quantitative metric called the synthetic image fidelity (SIF) score to evaluate the performance of GANs in learning visually unapparent prognostic features through the use of gradient-based model explanations. Results are compared with current methods.
多形性胶质母细胞瘤(GBM)的预后和治疗计划涉及影像学、临床和分子数据的整体分析。影像学和分子特征的相关性已经引起了人们的极大兴趣,因为它有可能减少对患者进行侵入性手术的次数,并利用整体预后和治疗计划过程的资源。本文检测并描述了肿瘤生物标志物(如形状、质地、位置和肿瘤周围组织)在检测预后突变(19和20染色体的并发增益)中的影响,并提出了两种新的分析思路。首先,为了解决与医疗数据的有限性、多样性和复杂性相关的挑战,本文提出了一种新的生成模型——在生成对抗网络(R2D2-GAN)中使用解纠缠的现实放射基因组设计,旨在重建磁共振成像中高度微妙、不明显的突变表现。它生成高分辨率、多样化的数据,捕获分子标记的歧视性视觉特征,同时处理具有与患者生存相关的罕见突变(如19/20共增益)的高多样性、不平衡和有限的GBM数据。其次,本研究提出了一种称为合成图像保真度(SIF)评分的定量指标,通过使用基于梯度的模型解释来评估gan在学习视觉上不明显的预后特征方面的性能。结果与现有方法进行了比较。
{"title":"A Comprehensive Radiogenomic Feature Characterization of 19/20 Co-gain in Glioblastoma","authors":"Padmaja Jonnalagedda;Brent Weinberg;Taejin L. Min;Shiv Bhanu;Bir Bhanu","doi":"10.1109/TAI.2024.3440219","DOIUrl":"https://doi.org/10.1109/TAI.2024.3440219","url":null,"abstract":"The prognosis and treatment planning of glioblastoma multiforme (GBM) involves a holistic analysis of imaging, clinical, and molecular data. The correlation of imaging and molecular features has garnered much interest due to its potential to reduce the number of invasive procedures on a patient and resource utilization of the overall prognostic and treatment planning process. This article detects and characterizes the impact of tumor biomarkers (such as shape, texture, location, and the tissue surrounding the tumor) in detecting a prognostic mutation – the concurrent gain of 19 and 20 chromosomes, and proposes two novel ideas for this analysis. First, to address the challenges associated with the limited, diverse, and complex nature of medical data, this article proposes a novel generative model – the realistic radiogenomic design using disentanglement in generative adversarial networks (R2D2-GAN), designed to recreate highly subtle, unapparent manifestations of mutations in magnetic resonance imaging. It generates high-resolution, diverse data that captures the discriminatory visual features of the molecular markers while tackling the high diversity, unbalanced, and limited GBM data with rare mutations correlating with patient survival such as 19/20 co-gain. Second, this study proposes a quantitative metric called the synthetic image fidelity (SIF) score to evaluate the performance of GANs in learning visually unapparent prognostic features through the use of gradient-based model explanations. Results are compared with current methods.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6442-6456"},"PeriodicalIF":0.0,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142810724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial Intelligence Driven Predictive Analysis of Acoustic and Linguistic Behaviors for ASD Identification 人工智能驱动的声学和语言行为预测分析用于 ASD 识别
Pub Date : 2024-08-08 DOI: 10.1109/TAI.2024.3439288
Ashwini B.;Deeptanshu;Sheffali Gulati;Jainendra Shukla
The identification of autism spectrum disorder (ASD) faces challenges due to the lack of reliable biomarkers and the subjectivity in diagnostic procedures, necessitating improved tools for objectivity and efficiency. Being a key characteristic of autism, language impairments are regarded as potential markers for identifying ASD. However, current research predominantly focuses on analyzing language characteristics in English, overlooking linguistic and contextual specificities in other resource-constrained languages. Motivated by these, we developed an artificial intelligence (AI)-based system to detect ASD, utilizing a range of acoustic and linguistic features extracted from dyadic conversations between a child and their communication partner. Validating our model on 76 English-speaking children [35 ASD and 41 typically developing (TD)] and 33 Hindi-speaking children (15 ASD and 18 TD), our extensive analysis of a diverse and comprehensive set of acoustic and linguistic speech attributes, including lexical, syntactic, semantic, and pragmatic elements revealed reliable speech attributes as predictors of ASD. This comprehensive analysis achieved a remarkable macro F1-score of approximately $boldsymbol{sim}$91.30%. We further addressed the influence of linguistic diversity on speech-based ASD assessment by examining speech behaviors in both English and the low-resource language, Hindi. Specific features such as adverbs and distinct roots contributed significantly to ASD classification in English, while the proportion of unintelligible utterances and adposition use held greater importance in Hindi. This study underscores the reliability of speech-based biomarkers in ASD assessment, emphasizing their effectiveness across diverse linguistic backgrounds and highlighting the need for language-specific research in this domain.
由于缺乏可靠的生物标志物和诊断程序的主观性,自闭症谱系障碍(ASD)的识别面临挑战,因此需要改进工具以提高客观性和效率。作为自闭症的一个主要特征,语言障碍被认为是识别自闭症谱系障碍的潜在标志物。然而,目前的研究主要集中于分析英语的语言特点,忽略了其他资源有限语言的语言和语境特异性。受此启发,我们开发了一种基于人工智能(AI)的系统,利用从儿童与其交流伙伴的双人对话中提取的一系列声学和语言特征来检测 ASD。在对 76 名英语儿童(35 名 ASD 儿童和 41 名典型发育(TD)儿童)和 33 名印地语儿童(15 名 ASD 儿童和 18 名典型发育(TD)儿童)的模型进行验证后,我们对包括词法、句法、语义和语用元素在内的各种语音和语言属性进行了广泛的分析,发现了作为 ASD 预测因子的可靠语音属性。这项综合分析的宏观 F1 分数高达 91.30%。通过研究英语和低资源语言印地语的语音行为,我们进一步探讨了语言多样性对基于语音的 ASD 评估的影响。在英语中,副词和独特的词根等特定特征对 ASD 的分类有很大帮助,而在印地语中,无法理解的语句比例和副词的使用则更为重要。这项研究强调了基于语音的生物标记在 ASD 评估中的可靠性,强调了它们在不同语言背景下的有效性,并突出了在这一领域开展特定语言研究的必要性。
{"title":"Artificial Intelligence Driven Predictive Analysis of Acoustic and Linguistic Behaviors for ASD Identification","authors":"Ashwini B.;Deeptanshu;Sheffali Gulati;Jainendra Shukla","doi":"10.1109/TAI.2024.3439288","DOIUrl":"https://doi.org/10.1109/TAI.2024.3439288","url":null,"abstract":"The identification of autism spectrum disorder (ASD) faces challenges due to the lack of reliable biomarkers and the subjectivity in diagnostic procedures, necessitating improved tools for objectivity and efficiency. Being a key characteristic of autism, language impairments are regarded as potential markers for identifying ASD. However, current research predominantly focuses on analyzing language characteristics in English, overlooking linguistic and contextual specificities in other resource-constrained languages. Motivated by these, we developed an artificial intelligence (AI)-based system to detect ASD, utilizing a range of acoustic and linguistic features extracted from dyadic conversations between a child and their communication partner. Validating our model on 76 English-speaking children [35 ASD and 41 typically developing (TD)] and 33 Hindi-speaking children (15 ASD and 18 TD), our extensive analysis of a diverse and comprehensive set of acoustic and linguistic speech attributes, including lexical, syntactic, semantic, and pragmatic elements revealed reliable speech attributes as predictors of ASD. This comprehensive analysis achieved a remarkable macro F1-score of approximately \u0000<inline-formula><tex-math>$boldsymbol{sim}$</tex-math></inline-formula>\u000091.30%. We further addressed the influence of linguistic diversity on speech-based ASD assessment by examining speech behaviors in both English and the low-resource language, Hindi. Specific features such as adverbs and distinct roots contributed significantly to ASD classification in English, while the proportion of unintelligible utterances and adposition use held greater importance in Hindi. This study underscores the reliability of speech-based biomarkers in ASD assessment, emphasizing their effectiveness across diverse linguistic backgrounds and highlighting the need for language-specific research in this domain.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5709-5719"},"PeriodicalIF":0.0,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142645491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tailor-Made Reinforcement Learning Approach With Advanced Noise Optimization for Soft Continuum Robots 针对软连续机器人的定制强化学习方法与高级噪声优化
Pub Date : 2024-08-08 DOI: 10.1109/TAI.2024.3440225
Jino Jayan;Lal Priya P.S.;Hari Kumar R.
Advancements in the fusion of reinforcement learning (RL) and soft robotics are presented in this study, with a focus on refining training methodologies for soft planar continuum robots (SPCRs). The proposed modifications to the twin-delayed deep deterministic (TD3) policy gradient algorithm introduce the innovative dynamic harmonic noise (DHN) to enhance exploration adaptability. Additionally, a tailored adaptive task achievement reward (ATAR) is introduced to balance goal achievement, time efficiency, and trajectory smoothness, thereby improving precision in SPCR navigation. Evaluation metrics, including mean squared distance (MSD), mean error (ME), and mean episodic reward (MER), demonstrate robust generalization capabilities. Significant improvements in average reward, success rate, and convergence speed for the proposed modified TD3 algorithm over traditional TD3 are highlighted in the comparative analysis. Specifically, a 45.17% increase in success rate and a 4.92% increase in convergence speed over TD3 are demonstrated by the proposed TD3. Beyond insights into RL and soft robotics, potential applicability of RL in diverse scenarios is underscored, laying the foundation for future breakthroughs in real-world applications.
本研究介绍了强化学习(RL)与软机器人技术融合的进展,重点是改进软平面连续机器人(SPCR)的训练方法。对双延迟深度确定性(TD3)策略梯度算法的修改引入了创新的动态谐波噪声(DHN),以增强探索的适应性。此外,还引入了量身定制的自适应任务成就奖励(ATAR),以平衡目标实现、时间效率和轨迹平滑性,从而提高 SPCR 导航的精度。包括平均平方距离(MSD)、平均误差(ME)和平均偶发奖励(MER)在内的评估指标都证明了强大的泛化能力。与传统的 TD3 相比,改进后的 TD3 算法在平均奖励、成功率和收敛速度方面都有显著提高,这一点在比较分析中得到了强调。具体来说,与 TD3 相比,拟议的 TD3 算法的成功率提高了 45.17%,收敛速度提高了 4.92%。除了对 RL 和软机器人学的深入了解,RL 在不同场景中的潜在适用性也得到了强调,为未来在现实世界的应用中取得突破奠定了基础。
{"title":"Tailor-Made Reinforcement Learning Approach With Advanced Noise Optimization for Soft Continuum Robots","authors":"Jino Jayan;Lal Priya P.S.;Hari Kumar R.","doi":"10.1109/TAI.2024.3440225","DOIUrl":"https://doi.org/10.1109/TAI.2024.3440225","url":null,"abstract":"Advancements in the fusion of reinforcement learning (RL) and soft robotics are presented in this study, with a focus on refining training methodologies for soft planar continuum robots (SPCRs). The proposed modifications to the twin-delayed deep deterministic (TD3) policy gradient algorithm introduce the innovative dynamic harmonic noise (DHN) to enhance exploration adaptability. Additionally, a tailored adaptive task achievement reward (ATAR) is introduced to balance goal achievement, time efficiency, and trajectory smoothness, thereby improving precision in SPCR navigation. Evaluation metrics, including mean squared distance (MSD), mean error (ME), and mean episodic reward (MER), demonstrate robust generalization capabilities. Significant improvements in average reward, success rate, and convergence speed for the proposed modified TD3 algorithm over traditional TD3 are highlighted in the comparative analysis. Specifically, a 45.17% increase in success rate and a 4.92% increase in convergence speed over TD3 are demonstrated by the proposed TD3. Beyond insights into RL and soft robotics, potential applicability of RL in diverse scenarios is underscored, laying the foundation for future breakthroughs in real-world applications.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5509-5518"},"PeriodicalIF":0.0,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simultaneous Learning and Planning Within Sensing Range: An Approach for Local Path Planning 感知范围内的同步学习与规划:一种局部路径规划方法
Pub Date : 2024-08-05 DOI: 10.1109/TAI.2024.3438094
Lokesh Kumar;Arup Kumar Sadhu;Ranjan Dasgupta
This article proposes an approach for local path planning. Unlike traditional approaches, the proposed local path planner simultaneously learns and plans within the sensing range (SLPA-SR) during local path planning. SLPA-SR is the synergy between the local path planner, the dynamic window approach (DWA), the obstacle avoidance by velocity obstacle (VO) approach, and the proposed next-best reward learning (NBR) algorithms. In the proposed SLPA-SR, the DWA acts as an actuator and helps to balance exploration and exploitation in the proposed NBR. In the proposed NBR, dimensions of state and action do not need to be defined a priori; rather, dimensions of state and action change dynamically. The proposed SLPA-SR is simulated and experimentally validated on the TurtleBot3 Waffle Pi. The performance of the proposed SLPA-SR is tested in several typical environments, both in simulation and hardware experiments. The proposed SLPA-SR outperforms the contender algorithms (i.e., DWA, DWA-RL, improved time elastic band, predictive artificial potential field, and artificial potential field) by a significant margin in terms of run-time, linear velocity, angular velocity, success rate, average trajectory length, and average velocity. The efficacy of the proposed NBR is established by analyzing the percentage of exploitation, average reward, and state-action pair count.
本文提出了一种局部路径规划方法。与传统方法不同,本文提出的局部路径规划器在局部路径规划过程中同时学习和规划感知范围内的路径(SLPA-SR)。SLPA-SR是局部路径规划、动态窗口法(DWA)、速度障碍避障法(VO)和次优奖励学习(NBR)算法之间的协同。在拟议的SLPA-SR中,DWA充当执行器,帮助平衡拟议NBR中的勘探和开发。在拟议的NBR中,不需要先验地定义状态和行为的维度;相反,状态和动作的维度是动态变化的。所提出的SLPA-SR在TurtleBot3华夫派上进行了仿真和实验验证。在仿真和硬件实验两种典型环境中对所提出的SLPA-SR的性能进行了测试。在运行时间、线速度、角速度、成功率、平均轨迹长度和平均速度方面,SLPA-SR算法明显优于现有的竞争算法(即DWA、DWA- rl、改进时间弹性带、预测人工势场和人工势场)。所提出的NBR的有效性是通过分析利用的百分比、平均奖励和状态-行动对计数来确定的。
{"title":"Simultaneous Learning and Planning Within Sensing Range: An Approach for Local Path Planning","authors":"Lokesh Kumar;Arup Kumar Sadhu;Ranjan Dasgupta","doi":"10.1109/TAI.2024.3438094","DOIUrl":"https://doi.org/10.1109/TAI.2024.3438094","url":null,"abstract":"This article proposes an approach for local path planning. Unlike traditional approaches, the proposed local path planner simultaneously learns and plans within the sensing range (SLPA-SR) during local path planning. SLPA-SR is the synergy between the local path planner, the dynamic window approach (DWA), the obstacle avoidance by velocity obstacle (VO) approach, and the proposed next-best reward learning (NBR) algorithms. In the proposed SLPA-SR, the DWA acts as an actuator and helps to balance exploration and exploitation in the proposed NBR. In the proposed NBR, dimensions of state and action do not need to be defined a \u0000<italic>priori</i>\u0000; rather, dimensions of state and action change dynamically. The proposed SLPA-SR is simulated and experimentally validated on the TurtleBot3 Waffle Pi. The performance of the proposed SLPA-SR is tested in several typical environments, both in simulation and hardware experiments. The proposed SLPA-SR outperforms the contender algorithms (i.e., DWA, DWA-RL, improved time elastic band, predictive artificial potential field, and artificial potential field) by a significant margin in terms of run-time, linear velocity, angular velocity, success rate, average trajectory length, and average velocity. The efficacy of the proposed NBR is established by analyzing the percentage of exploitation, average reward, and state-action pair count.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6399-6411"},"PeriodicalIF":0.0,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142810194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE transactions on artificial intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1