首页 > 最新文献

IEEE transactions on artificial intelligence最新文献

英文 中文
Modeling and Clustering of Parabolic Granular Data 抛物线颗粒数据的建模与聚类
Pub Date : 2024-03-18 DOI: 10.1109/TAI.2024.3377172
Yiming Tang;Jianwei Gao;Witold Pedrycz;Xianghui Hu;Lei Xi;Fuji Ren;Min Hu
At present, there exist some problems in granular clustering methods, such as lack of nonlinear membership description and global optimization of granular data boundaries. To address these issues, in this study, revolving around the parabolic granular data, we propose an overall architecture for parabolic granular modeling and clustering. To begin with, novel coverage and specificity functions are established, and then a parabolic granular data structure is proposed. The fuzzy c-means (FCM) algorithm is used to obtain the numeric prototypes, and then particle swarm optimization (PSO) is introduced to construct the parabolic granular data from the global perspective under the guidance of principle of justifiable granularity (PJG). Combining the advantages of FCM and PSO, we propose the parabolic granular modeling and optimization (PGMO) method. Moreover, we put forward attribute weights and sample weights as well as a distance measure induced by the Gaussian kernel similarity, and then come up with the algorithm of weighted kernel fuzzy clustering for parabolic granularity (WKFC-PG). In addition, the assessment mechanism of parabolic granular clustering is discussed. In summary, we set up an overall architecture including parabolic granular modeling, clustering, and assessment. Finally, comparative experiments on artificial, UCI, and high-dimensional datasets validate that our overall architecture delivers a good improvement over previous strategies. The parameter analysis and time complexity are also given for WKFC-PG. In contrast with related granular clustering algorithms, it is observed that WKFC-PG performs better than other granular clustering algorithms and has superior stability in handling outliers, especially on high-dimensional datasets.
目前,颗粒聚类方法存在一些问题,如缺乏非线性成员描述和颗粒数据边界的全局优化。针对这些问题,本研究围绕抛物线颗粒数据,提出了抛物线颗粒建模和聚类的整体架构。首先,我们建立了新颖的覆盖率和特异性函数,然后提出了抛物线粒度数据结构。利用模糊 c-means 算法(FCM)获得数值原型,然后引入粒子群优化算法(PSO),在合理粒度原则(PJG)的指导下,从全局角度构建抛物线粒度数据。结合 FCM 和 PSO 的优点,我们提出了抛物面颗粒建模与优化(PGMO)方法。此外,我们还提出了属性权重和样本权重以及由高斯核相似性诱导的距离度量,进而提出了抛物线粒度的加权核模糊聚类算法(WKFC-PG)。此外,还讨论了抛物线粒度聚类的评估机制。总之,我们建立了一个包括抛物线粒度建模、聚类和评估的整体架构。最后,在人工数据集、UCI 数据集和高维数据集上进行的对比实验验证了我们的整体架构比以前的策略有了很好的改进。此外,还给出了 WKFC-PG 的参数分析和时间复杂度。与相关的粒度聚类算法相比,WKFC-PG 的表现优于其他粒度聚类算法,而且在处理异常值时具有更高的稳定性,尤其是在高维数据集上。
{"title":"Modeling and Clustering of Parabolic Granular Data","authors":"Yiming Tang;Jianwei Gao;Witold Pedrycz;Xianghui Hu;Lei Xi;Fuji Ren;Min Hu","doi":"10.1109/TAI.2024.3377172","DOIUrl":"https://doi.org/10.1109/TAI.2024.3377172","url":null,"abstract":"At present, there exist some problems in granular clustering methods, such as lack of nonlinear membership description and global optimization of granular data boundaries. To address these issues, in this study, revolving around the parabolic granular data, we propose an overall architecture for parabolic granular modeling and clustering. To begin with, novel coverage and specificity functions are established, and then a parabolic granular data structure is proposed. The fuzzy c-means (FCM) algorithm is used to obtain the numeric prototypes, and then particle swarm optimization (PSO) is introduced to construct the parabolic granular data from the global perspective under the guidance of principle of justifiable granularity (PJG). Combining the advantages of FCM and PSO, we propose the parabolic granular modeling and optimization (PGMO) method. Moreover, we put forward attribute weights and sample weights as well as a distance measure induced by the Gaussian kernel similarity, and then come up with the algorithm of weighted kernel fuzzy clustering for parabolic granularity (WKFC-PG). In addition, the assessment mechanism of parabolic granular clustering is discussed. In summary, we set up an overall architecture including parabolic granular modeling, clustering, and assessment. Finally, comparative experiments on artificial, UCI, and high-dimensional datasets validate that our overall architecture delivers a good improvement over previous strategies. The parameter analysis and time complexity are also given for WKFC-PG. In contrast with related granular clustering algorithms, it is observed that WKFC-PG performs better than other granular clustering algorithms and has superior stability in handling outliers, especially on high-dimensional datasets.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Reinforcement Learning in Periodic MDP 周期性 MDP 中的在线强化学习
Pub Date : 2024-03-18 DOI: 10.1109/TAI.2024.3375258
Ayush Aniket;Arpan Chattopadhyay
We study learning in periodic Markov decision process (MDP), a special type of nonstationary MDP where both the state transition probabilities and reward functions vary periodically, under the average reward maximization setting. We formulate the problem as a stationary MDP by augmenting the state space with the period index and propose a periodic upper confidence bound reinforcement learning-2 (PUCRL2) algorithm. We show that the regret of PUCRL2 varies linearly with the period $N$ and as $mathcal{O}(sqrt{T text{log} T})$ with the horizon length $T$. Utilizing the information about the sparsity of transition matrix of augmented MDP, we propose another algorithm [periodic upper confidence reinforcement learning with Bernstein bounds (PUCRLB) which enhances upon PUCRL2, both in terms of regret ($O(sqrt{N})$ dependency on period] and empirical performance. Finally, we propose two other algorithms U-PUCRL2 and U-PUCRLB for extended uncertainty in the environment in which the period is unknown but a set of candidate periods are known. Numerical results demonstrate the efficacy of all the algorithms.
我们研究了周期马尔可夫决策过程(MDP)中的学习,这是一种特殊的非稳态 MDP,在平均报酬最大化设置下,状态转换概率和报酬函数都会周期性变化。通过用周期指数增强状态空间,我们将该问题表述为静态 MDP,并提出了周期性置信上限强化学习-2(PUCRL2)算法。我们证明,PUCRL2 的遗憾随周期 $N$ 线性变化,随水平长度 $T$ 变化为 $mathcal{O}(sqrt{T text{log} T})$。利用增强 MDP 过渡矩阵的稀疏性信息,我们提出了另一种算法[具有伯恩斯坦边界的周期性上置信强化学习(PUCRLB)],它在遗憾值($O(sqrt{N})$ 对周期的依赖性)和经验性能方面都增强了 PUCRL2。最后,我们提出了另外两种算法 U-PUCRL2 和 U-PUCRLB,它们适用于周期未知但候选周期已知的扩展不确定性环境。数值结果证明了所有算法的有效性。
{"title":"Online Reinforcement Learning in Periodic MDP","authors":"Ayush Aniket;Arpan Chattopadhyay","doi":"10.1109/TAI.2024.3375258","DOIUrl":"https://doi.org/10.1109/TAI.2024.3375258","url":null,"abstract":"We study learning in periodic Markov decision process (MDP), a special type of nonstationary MDP where both the state transition probabilities and reward functions vary periodically, under the average reward maximization setting. We formulate the problem as a stationary MDP by augmenting the state space with the period index and propose a periodic upper confidence bound reinforcement learning-2 (PUCRL2) algorithm. We show that the regret of PUCRL2 varies linearly with the period \u0000<inline-formula><tex-math>$N$</tex-math></inline-formula>\u0000 and as \u0000<inline-formula><tex-math>$mathcal{O}(sqrt{T text{log} T})$</tex-math></inline-formula>\u0000 with the horizon length \u0000<inline-formula><tex-math>$T$</tex-math></inline-formula>\u0000. Utilizing the information about the sparsity of transition matrix of augmented MDP, we propose another algorithm [periodic upper confidence reinforcement learning with Bernstein bounds (PUCRLB) which enhances upon PUCRL2, both in terms of regret (\u0000<inline-formula><tex-math>$O(sqrt{N})$</tex-math></inline-formula>\u0000 dependency on period] and empirical performance. Finally, we propose two other algorithms U-PUCRL2 and U-PUCRLB for extended uncertainty in the environment in which the period is unknown but a set of candidate periods are known. Numerical results demonstrate the efficacy of all the algorithms.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Developing a Reliable Shallow Supervised Learning for Thermal Comfort Using Multiple ASHRAE Databases 利用多个 ASHRAE 数据库为热舒适度开发可靠的浅层监督学习方法
Pub Date : 2024-03-18 DOI: 10.1109/TAI.2024.3376319
Kanisius Karyono;Badr M. Abdullah;Alison J. Cotgrave;Ana Bras;Jeff Cullen
The artificial intelligence (AI) system faces the challenge of insufficient training datasets and the risk of an uncomfortable user experience during the data gathering and learning process. The unreliable training data leads to overfitting and poor system performance which will result in wasting operational energy. This work introduces a reliable data set for training the AI subsystem for thermal comfort. The most reliable current training data sets for thermal comfort are ASHRAE RP-884 and ASHRAE Global Thermal Comfort Database II, but the direct use of these data for learning will give a poor learning result of less than 60% accuracy. This article presents the algorithm for data filtering and semantic data augmentation for the multiple ASHRAE databases for the supervised learning process. The result was verified with the visual psychrometric chart method that can check for overfitting and verified by developing the Internet of Things (IoT) control system for residential usage based on shallow supervised learning. The AI system was a wide artificial neural network (ANN) which is simple enough to be implemented in a local node. The filtering and semantic augmentation method can increase the accuracy to 96.1%. The control algorithm that was developed based on the comfort zone identification can increase the comfort acknowledgement by 6.06% leading to energy saving for comfort. This work can contribute to 717.2 thousand tonnes of CO2 equivalent per year which is beneficial for a more sustainable thermal comfort system and the development of a reinforced learning system for thermal comfort.
人工智能(AI)系统面临着训练数据集不足的挑战,以及在数据收集和学习过程中用户体验不舒适的风险。不可靠的训练数据会导致过度拟合和系统性能低下,从而浪费运行能源。这项工作引入了一个可靠的数据集,用于训练热舒适度人工智能子系统。目前最可靠的热舒适度训练数据集是 ASHRAE RP-884 和 ASHRAE 全球热舒适度数据库 II,但直接使用这些数据进行学习的学习效果很差,准确率不到 60%。本文介绍了用于监督学习过程的多个 ASHRAE 数据库的数据过滤和语义数据增强算法。结果通过可视化心理测量图方法进行了验证,该方法可检查是否存在过拟合,并通过开发基于浅层监督学习的住宅物联网(IoT)控制系统进行了验证。人工智能系统是一个宽泛的人工神经网络(ANN),其简单程度足以在本地节点中实现。过滤和语义增强方法可将准确率提高到 96.1%。在舒适区识别基础上开发的控制算法可将舒适度确认提高 6.06%,从而实现舒适度节能。这项工作每年可减少 71.72 万吨二氧化碳当量,有利于建立更可持续的热舒适系统和开发热舒适强化学习系统。
{"title":"Developing a Reliable Shallow Supervised Learning for Thermal Comfort Using Multiple ASHRAE Databases","authors":"Kanisius Karyono;Badr M. Abdullah;Alison J. Cotgrave;Ana Bras;Jeff Cullen","doi":"10.1109/TAI.2024.3376319","DOIUrl":"https://doi.org/10.1109/TAI.2024.3376319","url":null,"abstract":"The artificial intelligence (AI) system faces the challenge of insufficient training datasets and the risk of an uncomfortable user experience during the data gathering and learning process. The unreliable training data leads to overfitting and poor system performance which will result in wasting operational energy. This work introduces a reliable data set for training the AI subsystem for thermal comfort. The most reliable current training data sets for thermal comfort are ASHRAE RP-884 and ASHRAE Global Thermal Comfort Database II, but the direct use of these data for learning will give a poor learning result of less than 60% accuracy. This article presents the algorithm for data filtering and semantic data augmentation for the multiple ASHRAE databases for the supervised learning process. The result was verified with the visual psychrometric chart method that can check for overfitting and verified by developing the Internet of Things (IoT) control system for residential usage based on shallow supervised learning. The AI system was a wide artificial neural network (ANN) which is simple enough to be implemented in a local node. The filtering and semantic augmentation method can increase the accuracy to 96.1%. The control algorithm that was developed based on the comfort zone identification can increase the comfort acknowledgement by 6.06% leading to energy saving for comfort. This work can contribute to 717.2 thousand tonnes of CO\u0000<sub>2</sub>\u0000 equivalent per year which is beneficial for a more sustainable thermal comfort system and the development of a reinforced learning system for thermal comfort.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Defending Against Poisoning Attacks in Federated Learning With Blockchain 利用区块链防御联盟学习中的中毒攻击
Pub Date : 2024-03-18 DOI: 10.1109/TAI.2024.3376651
Nanqing Dong;Zhipeng Wang;Jiahao Sun;Michael Kampffmeyer;William Knottenbelt;Eric Xing
In the era of deep learning, federated learning (FL) presents a promising approach that allows multiinstitutional data owners, or clients, to collaboratively train machine learning models without compromising data privacy. However, most existing FL approaches rely on a centralized server for global model aggregation, leading to a single point of failure. This makes the system vulnerable to malicious attacks when dealing with dishonest clients. In this work, we address this problem by proposing a secure and reliable FL system based on blockchain and distributed ledger technology. Our system incorporates a peer-to-peer voting mechanism and a reward-and-slash mechanism, which are powered by on-chain smart contracts, to detect and deter malicious behaviors. Both theoretical and empirical analyses are presented to demonstrate the effectiveness of the proposed approach, showing that our framework is robust against malicious client-side behaviors.
在深度学习时代,联合学习(FL)是一种前景广阔的方法,它允许多机构数据所有者或客户在不损害数据隐私的情况下协作训练机器学习模型。然而,大多数现有的联合学习方法都依赖于一个集中式服务器进行全局模型聚合,从而导致单点故障。这使得系统在面对不诚实的客户时容易受到恶意攻击。在这项工作中,我们提出了一种基于区块链和分布式账本技术的安全可靠的 FL 系统,以解决这一问题。我们的系统结合了点对点投票机制和奖惩机制,并由链上智能合约提供支持,以检测和阻止恶意行为。理论和实证分析都证明了所提方法的有效性,表明我们的框架对客户端恶意行为具有很强的抵御能力。
{"title":"Defending Against Poisoning Attacks in Federated Learning With Blockchain","authors":"Nanqing Dong;Zhipeng Wang;Jiahao Sun;Michael Kampffmeyer;William Knottenbelt;Eric Xing","doi":"10.1109/TAI.2024.3376651","DOIUrl":"https://doi.org/10.1109/TAI.2024.3376651","url":null,"abstract":"In the era of deep learning, federated learning (FL) presents a promising approach that allows multiinstitutional data owners, or clients, to collaboratively train machine learning models without compromising data privacy. However, most existing FL approaches rely on a centralized server for global model aggregation, leading to a single point of failure. This makes the system vulnerable to malicious attacks when dealing with dishonest clients. In this work, we address this problem by proposing a secure and reliable FL system based on blockchain and distributed ledger technology. Our system incorporates a peer-to-peer voting mechanism and a reward-and-slash mechanism, which are powered by on-chain smart contracts, to detect and deter malicious behaviors. Both theoretical and empirical analyses are presented to demonstrate the effectiveness of the proposed approach, showing that our framework is robust against malicious client-side behaviors.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10471193","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141631034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Heterogeneous Graph Contrastive Learning With Augmentation Graph 带增强图的异构图对比学习
Pub Date : 2024-03-17 DOI: 10.1109/TAI.2024.3400751
Zijuan Zhao;Zequn Zhu;Yuan Liu;Jinli Guo;Kai Yang
Heterogeneous graph neural networks (HGNNs) have demonstrated promising capabilities in addressing various problems defined on heterogeneous graphs containing multiple types of nodes or edges. However, traditional HGNN models depend on label information and capture the local structural information of the original graph. In this article, we propose a novel heterogeneous graph contrastive learning method with augmentation graph (AHGCL). Specifically, we construct an augmentation graph by calculating the feature similarity of nodes to capture latent structural information. For the original graph and the augmentation graph, we employ a shared graph neural network (GNN) encoder to extract the semantic features of nodes with different meta-paths. The feature information is aggregated through a semantic-level attention mechanism to generate final node embeddings, which capture latent high-order semantic structural information. Considering the problems of label information for the real-world datasets, we adopt contrastive learning to train the GNN encoder for maximizing the common information between similar nodes from the original graph and the augmentation graph views. We conduct node classification experiments on four real-world datasets, AMiner, Freebase, digital bibliography & library project (DBLP), and association for computing machinery (ACM), to evaluate the performance of AHGCL. The results show that the proposed AHGCL demonstrates excellent stability and capability compared to existing graph representation learning methods.
异构图神经网络(HGNN)在解决定义在包含多种类型节点或边缘的异构图上的各种问题方面表现出了良好的能力。然而,传统的 HGNN 模型依赖于标签信息,无法捕捉到原始图的局部结构信息。在本文中,我们提出了一种带有增强图(AHGCL)的新型异构图对比学习方法。具体来说,我们通过计算节点的特征相似性来构建增强图,从而捕捉潜在的结构信息。对于原始图和增强图,我们采用共享图神经网络(GNN)编码器来提取具有不同元路径的节点的语义特征。这些特征信息通过语义级关注机制进行聚合,生成最终的节点嵌入,从而捕捉潜在的高阶语义结构信息。考虑到真实世界数据集的标签信息问题,我们采用对比学习来训练 GNN 编码器,以最大化原始图和增强图视图中相似节点之间的共同信息。我们在 AMiner、Freebase、digital bibliography & library project (DBLP) 和 association for computing machinery (ACM) 四个真实数据集上进行了节点分类实验,以评估 AHGCL 的性能。结果表明,与现有的图表示学习方法相比,所提出的 AHGCL 具有出色的稳定性和能力。
{"title":"Heterogeneous Graph Contrastive Learning With Augmentation Graph","authors":"Zijuan Zhao;Zequn Zhu;Yuan Liu;Jinli Guo;Kai Yang","doi":"10.1109/TAI.2024.3400751","DOIUrl":"https://doi.org/10.1109/TAI.2024.3400751","url":null,"abstract":"Heterogeneous graph neural networks (HGNNs) have demonstrated promising capabilities in addressing various problems defined on heterogeneous graphs containing multiple types of nodes or edges. However, traditional HGNN models depend on label information and capture the local structural information of the original graph. In this article, we propose a novel heterogeneous graph contrastive learning method with augmentation graph (AHGCL). Specifically, we construct an augmentation graph by calculating the feature similarity of nodes to capture latent structural information. For the original graph and the augmentation graph, we employ a shared graph neural network (GNN) encoder to extract the semantic features of nodes with different meta-paths. The feature information is aggregated through a semantic-level attention mechanism to generate final node embeddings, which capture latent high-order semantic structural information. Considering the problems of label information for the real-world datasets, we adopt contrastive learning to train the GNN encoder for maximizing the common information between similar nodes from the original graph and the augmentation graph views. We conduct node classification experiments on four real-world datasets, AMiner, Freebase, digital bibliography & library project (DBLP), and association for computing machinery (ACM), to evaluate the performance of AHGCL. The results show that the proposed AHGCL demonstrates excellent stability and capability compared to existing graph representation learning methods.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142442995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Artificial Intelligence Publication Information IEEE Transactions on Artificial Intelligence 出版信息
Pub Date : 2024-03-16 DOI: 10.1109/TAI.2024.3396531
{"title":"IEEE Transactions on Artificial Intelligence Publication Information","authors":"","doi":"10.1109/TAI.2024.3396531","DOIUrl":"https://doi.org/10.1109/TAI.2024.3396531","url":null,"abstract":"","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10532213","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141068951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable Learning for Multiagent Route Planning: Adapting to Diverse Task Scales 多代理路线规划的可扩展学习:适应不同的任务规模
Pub Date : 2024-03-16 DOI: 10.1109/TAI.2024.3402193
Site Qu;Guoqiang Hu
When utilizing end-to-end learn-to-construct methods to solve routing problems for multiagent systems, the model is usually trained individually for different problem scales (i.e., the number of customers to be concurrently served within a map) to make the model adaptive to the corresponding scale, ensuring good solution quality. Otherwise, the model trained for one specific scale can lead to poor performance when applied to another different scale, and this situation can get worse when the scale discrepancy increases. Such a separate training strategy is inefficient and time-intensive. In this article, we propose a mix-scale learning framework that requires only a single training session, enabling the model to effectively plan high-quality routes for various problem scales. Based on the capacitated vehicle routing problem (CVRP), the test results reveal that: for problem scales which are no matter seen or unseen during training, our once-trained model can produce solution routes with performance comparable or even superior to those of individually trained models, and offer the highest average solution quality with improvement ratio ranging from 2.28% to 8.07%, which effectively spares the separate training session for each specific scale. Additionally, the extended comparison analysis with individually trained models on real-world benchmark dataset from CVRPLib further highlights our once-trained model's generalization performance across various problem scales and diverse node distributions.
在利用端到端 "学习到构建 "方法解决多代理系统的路由问题时,通常会针对不同的问题规模(即地图上同时服务的客户数量)对模型进行单独训练,以使模型适应相应的规模,从而确保良好的解决方案质量。否则,针对一个特定规模训练的模型在应用于另一个不同规模时可能会导致性能不佳,而当规模差异增大时,这种情况可能会变得更糟。这种单独的训练策略效率低且耗时。在本文中,我们提出了一种混合尺度学习框架,它只需要一次训练,就能使模型有效地为各种问题尺度规划高质量路线。测试结果表明:对于在训练过程中无论见过或没见过的问题规模,我们的一次训练模型都能生成与单独训练模型性能相当甚至更优的求解路线,并提供最高的平均求解质量,改进率从 2.28% 到 8.07%,从而有效地避免了针对每个特定规模的单独训练。此外,在 CVRPLib 的真实基准数据集上与单独训练的模型进行的扩展对比分析进一步突出了我们的一次性训练模型在不同问题规模和不同节点分布下的泛化性能。
{"title":"Scalable Learning for Multiagent Route Planning: Adapting to Diverse Task Scales","authors":"Site Qu;Guoqiang Hu","doi":"10.1109/TAI.2024.3402193","DOIUrl":"https://doi.org/10.1109/TAI.2024.3402193","url":null,"abstract":"When utilizing end-to-end learn-to-construct methods to solve routing problems for multiagent systems, the model is usually trained individually for different problem scales (i.e., the number of customers to be concurrently served within a map) to make the model adaptive to the corresponding scale, ensuring good solution quality. Otherwise, the model trained for one specific scale can lead to poor performance when applied to another different scale, and this situation can get worse when the scale discrepancy increases. Such a separate training strategy is inefficient and time-intensive. In this article, we propose a mix-scale learning framework that requires only a single training session, enabling the model to effectively plan high-quality routes for various problem scales. Based on the capacitated vehicle routing problem (CVRP), the test results reveal that: for problem scales which are no matter seen or unseen during training, our once-trained model can produce solution routes with performance comparable or even superior to those of individually trained models, and offer the highest average solution quality with improvement ratio ranging from 2.28% to 8.07%, which effectively spares the separate training session for each specific scale. Additionally, the extended comparison analysis with individually trained models on real-world benchmark dataset from CVRPLib further highlights our once-trained model's generalization performance across various problem scales and diverse node distributions.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MTPret: Improving X-Ray Image Analytics With Multitask Pretraining MTPret:利用多任务预训练提高 X 射线图像分析能力
Pub Date : 2024-03-15 DOI: 10.1109/TAI.2024.3400750
Weibin Liao;Qingzhong Wang;Xuhong Li;Yi Liu;Zeyu Chen;Siyu Huang;Dejing Dou;Yanwu Xu;Haoyi Xiong
While deep neural networks (DNNs) have been widely used in various X-ray image analytics tasks such as classification, segmentation, detection, etc., there frequently needs to collect and annotate a huge amount of training data to train a model for every single task. In this work, we proposed a multitask self-supervised pretraining strategy MTPret to improve the performance of DNNs in various X-ray analytics tasks. MTPret first trains the backbone to learn visual representations from multiple datasets of different tasks through contrastive learning, then MTPret leverages a multitask continual learning to learn discriminative features from various downstream tasks. To evaluate the performance of MTPret, we collected eleven X-ray image datasets from different body parts, such as heads, chest, lungs, bones, and etc., for various tasks to pretrain backbones, and fine-tuned the networks on seven of the tasks. The evaluation results on top of the seven tasks showed MTPret outperformed a large number of baseline methods, including other initialization strategies, pretrained models, and task-specific algorithms in recent studies. In addition, we also performed experiments based on two external tasks, where the datasets of external tasks have not been used in pretraining. The excellent performance of MTPret further confirmed the generalizability and superiority of the proposed multitask self-supervised pretraining.
虽然深度神经网络(DNN)已被广泛应用于分类、分割、检测等各种 X 射线图像分析任务中,但要为每个任务训练一个模型,往往需要收集和注释大量训练数据。在这项工作中,我们提出了一种多任务自监督预训练策略 MTPret,以提高 DNNs 在各种 X 射线分析任务中的性能。MTPret 首先通过对比学习训练骨干网络从不同任务的多个数据集中学习视觉表征,然后利用多任务持续学习从各种下游任务中学习判别特征。为了评估 MTPret 的性能,我们收集了 11 个不同身体部位(如头部、胸部、肺部、骨骼等)的 X 射线图像数据集,针对不同任务对骨干进行预训练,并在其中 7 个任务上对网络进行了微调。对七项任务的评估结果显示,MTPret优于大量基线方法,包括其他初始化策略、预训练模型和近期研究中的特定任务算法。此外,我们还基于两个外部任务进行了实验,其中外部任务的数据集未用于预训练。MTPret 的出色表现进一步证实了所提出的多任务自监督预训练的普适性和优越性。
{"title":"MTPret: Improving X-Ray Image Analytics With Multitask Pretraining","authors":"Weibin Liao;Qingzhong Wang;Xuhong Li;Yi Liu;Zeyu Chen;Siyu Huang;Dejing Dou;Yanwu Xu;Haoyi Xiong","doi":"10.1109/TAI.2024.3400750","DOIUrl":"https://doi.org/10.1109/TAI.2024.3400750","url":null,"abstract":"While deep neural networks (DNNs) have been widely used in various X-ray image analytics tasks such as classification, segmentation, detection, etc., there frequently needs to collect and annotate a huge amount of training data to train a model for every single task. In this work, we proposed a multitask self-supervised pretraining strategy \u0000<italic>MTPret</i>\u0000 to improve the performance of DNNs in various X-ray analytics tasks. \u0000<italic>MTPret</i>\u0000 first trains the backbone to learn visual representations from multiple datasets of different tasks through contrastive learning, then \u0000<italic>MTPret</i>\u0000 leverages a multitask continual learning to learn discriminative features from various downstream tasks. To evaluate the performance of \u0000<italic>MTPret</i>\u0000, we collected eleven X-ray image datasets from different body parts, such as heads, chest, lungs, bones, and etc., for various tasks to pretrain backbones, and fine-tuned the networks on seven of the tasks. The evaluation results on top of the seven tasks showed \u0000<italic>MTPret</i>\u0000 outperformed a large number of baseline methods, including other initialization strategies, pretrained models, and task-specific algorithms in recent studies. In addition, we also performed experiments based on two external tasks, where the datasets of external tasks have not been used in pretraining. The excellent performance of \u0000<italic>MTPret</i>\u0000 further confirmed the generalizability and superiority of the proposed multitask self-supervised pretraining.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142169724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inferring Electrocardiography From Optical Sensing Using Lightweight Neural Network 利用轻量级神经网络从光学传感推断心电图
Pub Date : 2024-03-15 DOI: 10.1109/TAI.2024.3400749
Yuenan Li;Xin Tian;Qiang Zhu;Min Wu
This article presents a computational solution that enables continuous cardiac monitoring through cross-modality inference of electrocardiogram (ECG). While some smartwatches now allow users to obtain a 30-s ECG test by tapping a built-in bio-sensor, these short-term ECG tests often miss intermittent and asymptomatic abnormalities of cardiac functions. It is also infeasible to expect persistently active user participation for long-term continuous cardiac monitoring in order to capture these and other types of cardiac abnormalities. To alleviate the need for continuous user attention and active participation, we design a lightweight neural network that infers ECG from the photoplethysmogram (PPG) signal sensed at the skin surface by a wearable optical sensor. We also develop a diagnosis-oriented training strategy to enable the neural network to capture the pathological features of ECG, aiming to increase the utility of reconstructed ECG signals for screening cardiovascular diseases (CVDs). We also leverage model interpretation to obtain insights from data-driven models, for example, to reveal some associations between CVDs and ECG/PPG and to demonstrate how the neural network copes with motion artifacts in the ambulatory application. The experimental results on three datasets demonstrate the feasibility of inferring ECG from PPG, achieving a high fidelity of ECG reconstruction with only about 40 000 parameters.
本文提出了一种计算解决方案,通过心电图(ECG)的跨模态推断实现连续心脏监测。虽然现在有些智能手表允许用户通过轻触内置生物传感器获得 30 秒的心电图测试,但这些短期心电图测试往往会错过间歇性和无症状的心脏功能异常。此外,期望用户持续积极参与长期连续的心脏监测以捕捉这些和其他类型的心脏异常也是不可行的。为了减轻对用户持续关注和积极参与的需求,我们设计了一种轻量级神经网络,可从可穿戴光学传感器在皮肤表面感应到的光电血流图(PPG)信号推断心电图。我们还开发了一种以诊断为导向的训练策略,使神经网络能够捕捉心电图的病理特征,从而提高重建心电信号在心血管疾病(CVD)筛查中的实用性。我们还利用模型解释从数据驱动模型中获取见解,例如,揭示心血管疾病与心电图/PPG 之间的一些关联,并展示神经网络如何在非卧床应用中应对运动伪影。三个数据集的实验结果证明了从 PPG 推断心电图的可行性,只需约 40,000 个参数就能实现高保真心电图重建。
{"title":"Inferring Electrocardiography From Optical Sensing Using Lightweight Neural Network","authors":"Yuenan Li;Xin Tian;Qiang Zhu;Min Wu","doi":"10.1109/TAI.2024.3400749","DOIUrl":"https://doi.org/10.1109/TAI.2024.3400749","url":null,"abstract":"This article presents a computational solution that enables continuous cardiac monitoring through cross-modality inference of electrocardiogram (ECG). While some smartwatches now allow users to obtain a 30-s ECG test by tapping a built-in bio-sensor, these short-term ECG tests often miss intermittent and asymptomatic abnormalities of cardiac functions. It is also infeasible to expect persistently active user participation for long-term continuous cardiac monitoring in order to capture these and other types of cardiac abnormalities. To alleviate the need for continuous user attention and active participation, we design a lightweight neural network that infers ECG from the photoplethysmogram (PPG) signal sensed at the skin surface by a wearable optical sensor. We also develop a diagnosis-oriented training strategy to enable the neural network to capture the pathological features of ECG, aiming to increase the utility of reconstructed ECG signals for screening cardiovascular diseases (CVDs). We also leverage model interpretation to obtain insights from data-driven models, for example, to reveal some associations between CVDs and ECG/PPG and to demonstrate how the neural network copes with motion artifacts in the ambulatory application. The experimental results on three datasets demonstrate the feasibility of inferring ECG from PPG, achieving a high fidelity of ECG reconstruction with only about 40 000 parameters.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141725567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multichannel Long-Term External Attention Network for Aeroengine Remaining Useful Life Prediction 用于航空发动机剩余使用寿命预测的多通道长期外部关注网络
Pub Date : 2024-03-15 DOI: 10.1109/TAI.2024.3400929
Xuezhen Liu;Yongyi Chen;Dan Zhang;Ruqiang Yan;Hongjie Ni
Accurately estimating the remaining useful life (RUL) of aircraft engines can effectively prevent aircraft crashes and human casualties. In some RUL prediction methods, particularly for aircraft engines running under complex conditions, they are difficult to comprehensively characterize the engine degradation process, resulting in poor predicted RUL. To address the above challenge, a multichannel long-term external attention network (MLEAN) is proposed for the RUL prediction of turbofan engines. First, the preprocessed samples are transformed to enable MLEAN to focus on learning inter-sensor correlations within the same degradation stage. To improve the feature representation capability of the network, multichannel time attention network (MTANet) is then designed to realize multiscale and multifrequency feature learning, which effectively achieves multiperspective analysis of long-term dependencies in different channels. Then, external attention block (EAB) is introduced to memorize important degraded features from different samples, which can improve the ability of global feature extraction and generalization ability of the network. The performance of MLEAN is examined on the C-MAPSS public dataset. The evaluation metrics RMSE and score values are 13.71 and 680, respectively. In comparison experiments, the proposed MLEAN performs better than the listed state-of-the-art RUL prediction methods.
准确估算飞机发动机的剩余使用寿命(RUL)可以有效防止飞机失事和人员伤亡。在一些 RUL 预测方法中,尤其是针对复杂工况下运行的飞机发动机,很难全面描述发动机的退化过程,导致 RUL 预测结果不佳。为解决上述难题,本文提出了一种用于涡扇发动机 RUL 预测的多通道长期外部注意力网络(MLEAN)。首先,对预处理样本进行转换,使 MLEAN 能够专注于学习同一退化阶段中传感器间的相关性。然后,为了提高网络的特征表示能力,设计了多通道时间注意网络(MTANet)来实现多尺度和多频率特征学习,从而有效地实现了对不同通道中长期依赖关系的多视角分析。然后,引入外部注意块(EAB)来记忆不同样本的重要退化特征,从而提高网络的全局特征提取能力和泛化能力。在 C-MAPSS 公开数据集上检验了 MLEAN 的性能。评估指标 RMSE 和得分值分别为 13.71 和 680。在对比实验中,所提出的 MLEAN 比所列出的最先进的 RUL 预测方法表现更好。
{"title":"A Multichannel Long-Term External Attention Network for Aeroengine Remaining Useful Life Prediction","authors":"Xuezhen Liu;Yongyi Chen;Dan Zhang;Ruqiang Yan;Hongjie Ni","doi":"10.1109/TAI.2024.3400929","DOIUrl":"https://doi.org/10.1109/TAI.2024.3400929","url":null,"abstract":"Accurately estimating the remaining useful life (RUL) of aircraft engines can effectively prevent aircraft crashes and human casualties. In some RUL prediction methods, particularly for aircraft engines running under complex conditions, they are difficult to comprehensively characterize the engine degradation process, resulting in poor predicted RUL. To address the above challenge, a multichannel long-term external attention network (MLEAN) is proposed for the RUL prediction of turbofan engines. First, the preprocessed samples are transformed to enable MLEAN to focus on learning inter-sensor correlations within the same degradation stage. To improve the feature representation capability of the network, multichannel time attention network (MTANet) is then designed to realize multiscale and multifrequency feature learning, which effectively achieves multiperspective analysis of long-term dependencies in different channels. Then, external attention block (EAB) is introduced to memorize important degraded features from different samples, which can improve the ability of global feature extraction and generalization ability of the network. The performance of MLEAN is examined on the C-MAPSS public dataset. The evaluation metrics RMSE and score values are 13.71 and 680, respectively. In comparison experiments, the proposed MLEAN performs better than the listed state-of-the-art RUL prediction methods.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE transactions on artificial intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1