首页 > 最新文献

IEEE transactions on artificial intelligence最新文献

英文 中文
LSTM-Based Model Compression for CAN Security in Intelligent Vehicles
Pub Date : 2024-08-05 DOI: 10.1109/TAI.2024.3438110
Yuan Feng;Yingxu Lai;Ye Chen;Zhaoyi Zhang;Jingwen Wei
The rapid deployment and low-cost inference of controller area network (CAN) bus anomaly detection models on intelligent vehicles can drive the development of the Green Internet of Vehicles. Anomaly detection on intelligent vehicles often utilizes recurrent neural network models, but computational resources for these models are limited on small platforms. Model compression is essential to ensure CAN bus security with restricted computing resources while improving model computation efficiency. However, the existence of shared cyclic units significantly constrains the compression of recurrent neural networks. In this study, we propose a structured pruning method for long short-term memory (LSTM) based on the contribution values of shared vectors. By analyzing the contribution value of each dimension of shared vectors, the weight matrix of the model is structurally pruned, and the output value of the LSTM layer is supplemented to maintain the information integrity between adjacent network layers. We further propose an approximate matrix multiplication calculation module that runs in the whole process of model calculation and is deployed in parallel with the pruning module. Evaluated on a realistic public CAN bus dataset, our method effectively achieves highly structured pruning, improves model computing efficiency, and maintains performance stability compared to other compression methods.
{"title":"LSTM-Based Model Compression for CAN Security in Intelligent Vehicles","authors":"Yuan Feng;Yingxu Lai;Ye Chen;Zhaoyi Zhang;Jingwen Wei","doi":"10.1109/TAI.2024.3438110","DOIUrl":"https://doi.org/10.1109/TAI.2024.3438110","url":null,"abstract":"The rapid deployment and low-cost inference of controller area network (CAN) bus anomaly detection models on intelligent vehicles can drive the development of the Green Internet of Vehicles. Anomaly detection on intelligent vehicles often utilizes recurrent neural network models, but computational resources for these models are limited on small platforms. Model compression is essential to ensure CAN bus security with restricted computing resources while improving model computation efficiency. However, the existence of shared cyclic units significantly constrains the compression of recurrent neural networks. In this study, we propose a structured pruning method for long short-term memory (LSTM) based on the contribution values of shared vectors. By analyzing the contribution value of each dimension of shared vectors, the weight matrix of the model is structurally pruned, and the output value of the LSTM layer is supplemented to maintain the information integrity between adjacent network layers. We further propose an approximate matrix multiplication calculation module that runs in the whole process of model calculation and is deployed in parallel with the pruning module. Evaluated on a realistic public CAN bus dataset, our method effectively achieves highly structured pruning, improves model computing efficiency, and maintains performance stability compared to other compression methods.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6457-6471"},"PeriodicalIF":0.0,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142810725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation RADiff:射电天文地图生成的可控扩散模型
Pub Date : 2024-08-01 DOI: 10.1109/TAI.2024.3436538
Renato Sortino;Thomas Cecconello;Andrea De Marco;Giuseppe Fiameni;Andrea Pilzer;Daniel Magro;Andrew M. Hopkins;Simone Riggi;Eva Sciacca;Adriano Ingallinera;Cristobal Bordiu;Filomena Bufano;Concetto Spampinato
Along with the nearing completion of the square kilometer array (SKA), comes an increasing demand for accurate and reliable automated solutions to extract valuable information from the vast amount of data it will allow acquiring. Automated source finding is a particularly important task in this context, as it enables the detection and classification of astronomical objects. Deep-learning-based object detection and semantic segmentation models have proven to be suitable for this purpose. However, training such deep networks requires a high volume of labeled data, which is not trivial to obtain in the context of radio astronomy. Since data needs to be manually labeled by experts, this process is not scalable to large dataset sizes, limiting the possibilities of leveraging deep networks to address several tasks. In this work, we propose RADiff, a generative approach based on conditional diffusion models trained over an annotated radio dataset to generate synthetic images, containing radio sources of different morphologies, to augment existing datasets and reduce the problems caused by class imbalances. We also show that it is possible to generate fully synthetic image-annotation pairs to automatically augment any annotated dataset. We evaluate the effectiveness of this approach by training a semantic segmentation model on a real dataset augmented in two ways: 1) using synthetic images obtained from real masks; and 2) generating images from synthetic semantic masks. Finally, we also show how the model can be applied to populate background noise maps for simulating radio maps for data challenges.
{"title":"RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation","authors":"Renato Sortino;Thomas Cecconello;Andrea De Marco;Giuseppe Fiameni;Andrea Pilzer;Daniel Magro;Andrew M. Hopkins;Simone Riggi;Eva Sciacca;Adriano Ingallinera;Cristobal Bordiu;Filomena Bufano;Concetto Spampinato","doi":"10.1109/TAI.2024.3436538","DOIUrl":"https://doi.org/10.1109/TAI.2024.3436538","url":null,"abstract":"Along with the nearing completion of the square kilometer array (SKA), comes an increasing demand for accurate and reliable automated solutions to extract valuable information from the vast amount of data it will allow acquiring. Automated source finding is a particularly important task in this context, as it enables the detection and classification of astronomical objects. Deep-learning-based object detection and semantic segmentation models have proven to be suitable for this purpose. However, training such deep networks requires a high volume of labeled data, which is not trivial to obtain in the context of radio astronomy. Since data needs to be manually labeled by experts, this process is not scalable to large dataset sizes, limiting the possibilities of leveraging deep networks to address several tasks. In this work, we propose RADiff, a generative approach based on conditional diffusion models trained over an annotated radio dataset to generate synthetic images, containing radio sources of different morphologies, to augment existing datasets and reduce the problems caused by class imbalances. We also show that it is possible to generate fully synthetic image-annotation pairs to automatically augment any annotated dataset. We evaluate the effectiveness of this approach by training a semantic segmentation model on a real dataset augmented in two ways: 1) using synthetic images obtained from real masks; and 2) generating images from synthetic semantic masks. Finally, we also show how the model can be applied to populate background noise maps for simulating radio maps for data challenges.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6524-6535"},"PeriodicalIF":0.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142825894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cost-Efficient Feature Selection for Horizontal Federated Learning
Pub Date : 2024-08-01 DOI: 10.1109/TAI.2024.3436664
Sourasekhar Banerjee;Devvjiit Bhuyan;Erik Elmroth;Monowar Bhuyan
Horizontal federated learning (HFL) exhibits substantial similarities in feature space across distinct clients. However, not all features contribute significantly to the training of the global model. Moreover, the curse of dimensionality delays the training. Therefore, reducing irrelevant and redundant features from the feature space makes training faster and inexpensive. This work aims to identify the common feature subset from the clients in federated settings. We introduce a hybrid approach called Fed-MOFS,1

This manuscript is an extension of Banerjee et al. [1].

utilizing mutual information (MI) and clustering for local FS at each client. Unlike the Fed-FiS, which uses a scoring function for global feature ranking, Fed-MOFS employs multiobjective optimization to prioritize features based on their higher relevance and lower redundancy. This article compares the performance of Fed-MOFS2

We share our code, data, and supplementary copy through https://github.com/DevBhuyan/Horz-FL/blob/main/README.md.

with conventional and federated FS methods. Moreover, we tested the scalability, stability, and efficacy of both Fed-FiS and Fed-MOFS across diverse datasets. We also assessed how FS influenced model convergence and explored its impact in scenarios with data heterogeneity. Our results show that Fed-MOFS enhances global model performance with a 50% reduction in feature space and is at least twice as fast as the FSHFL method. The computational complexity for both approaches is O($d^{2}$), which is lower than the state of the art.
{"title":"Cost-Efficient Feature Selection for Horizontal Federated Learning","authors":"Sourasekhar Banerjee;Devvjiit Bhuyan;Erik Elmroth;Monowar Bhuyan","doi":"10.1109/TAI.2024.3436664","DOIUrl":"https://doi.org/10.1109/TAI.2024.3436664","url":null,"abstract":"Horizontal federated learning (HFL) exhibits substantial similarities in feature space across distinct clients. However, not all features contribute significantly to the training of the global model. Moreover, the curse of dimensionality delays the training. Therefore, reducing irrelevant and redundant features from the feature space makes training faster and inexpensive. This work aims to identify the common feature subset from the clients in federated settings. We introduce a hybrid approach called Fed-MOFS,\u0000<xref><sup>1</sup></xref>\u0000<fn><label><sup>1</sup></label><p>This manuscript is an extension of Banerjee et al. <xref>[1]</xref>.</p></fn>\u0000 utilizing mutual information (MI) and clustering for local FS at each client. Unlike the Fed-FiS, which uses a scoring function for global feature ranking, Fed-MOFS employs multiobjective optimization to prioritize features based on their higher relevance and lower redundancy. This article compares the performance of Fed-MOFS\u0000<xref><sup>2</sup></xref>\u0000<fn><label><sup>2</sup></label><p>We share our code, data, and supplementary copy through <uri>https://github.com/DevBhuyan/Horz-FL/blob/main/README.md</uri>.</p></fn>\u0000 with conventional and federated FS methods. Moreover, we tested the scalability, stability, and efficacy of both Fed-FiS and Fed-MOFS across diverse datasets. We also assessed how FS influenced model convergence and explored its impact in scenarios with data heterogeneity. Our results show that Fed-MOFS enhances global model performance with a 50% reduction in feature space and is at least twice as fast as the FSHFL method. The computational complexity for both approaches is O(\u0000<inline-formula><tex-math>$d^{2}$</tex-math></inline-formula>\u0000), which is lower than the state of the art.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6551-6565"},"PeriodicalIF":0.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142825816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Higher-Order Directed Community Detection by A Multiobjective Evolutionary Framework 多目标进化框架下的高阶定向群落检测
Pub Date : 2024-08-01 DOI: 10.1109/TAI.2024.3436659
Jing Xiao;Jing Cao;Xiao-Ke Xu
Higher-order community detection in real-life networks has recently gained significant attention, because motif-based communities reflect not only higher-order mesoscale structures but also functional characteristics. However, motif-based communities detected by existing methods for directed networks often disregard edge directionality (nonreciprocal directional arcs), so they typically fail to comprehensively reveal intrinsic characteristics of higher-order topology and information flow. To address this issue, first, we model higher-order directed community detection as a biobjective optimization problem, aiming to provide high-quality and diverse compromise partitions that capture both characteristics. Second, we introduce a multiobjective genetic algorithm based on motif density and information flow (MOGA-MI) to approximate the Pareto optimal higher-order directed community partitions. On the one hand, an arc-and-motif neighbor-based genetic generator (AMN-GA) is developed to generate high-quality and diverse offspring individuals; on the other hand, a higher-order directed neighbor community modification (HD-NCM) operation is designed to further improve generated partitions by modifying easily confused nodes into more appropriate motif-neighbor communities. Finally, experimental results demonstrate that the proposed MOGA-MI outperforms state-of-the-art algorithms in terms of higher-order topology and information flow indicators while providing more diverse community information.
{"title":"Higher-Order Directed Community Detection by A Multiobjective Evolutionary Framework","authors":"Jing Xiao;Jing Cao;Xiao-Ke Xu","doi":"10.1109/TAI.2024.3436659","DOIUrl":"https://doi.org/10.1109/TAI.2024.3436659","url":null,"abstract":"Higher-order community detection in real-life networks has recently gained significant attention, because motif-based communities reflect not only higher-order mesoscale structures but also functional characteristics. However, motif-based communities detected by existing methods for directed networks often disregard edge directionality (nonreciprocal directional arcs), so they typically fail to comprehensively reveal intrinsic characteristics of higher-order topology and information flow. To address this issue, first, we model higher-order directed community detection as a biobjective optimization problem, aiming to provide high-quality and diverse compromise partitions that capture both characteristics. Second, we introduce a multiobjective genetic algorithm based on motif density and information flow (MOGA-MI) to approximate the Pareto optimal higher-order directed community partitions. On the one hand, an arc-and-motif neighbor-based genetic generator (AMN-GA) is developed to generate high-quality and diverse offspring individuals; on the other hand, a higher-order directed neighbor community modification (HD-NCM) operation is designed to further improve generated partitions by modifying easily confused nodes into more appropriate motif-neighbor communities. Finally, experimental results demonstrate that the proposed MOGA-MI outperforms state-of-the-art algorithms in terms of higher-order topology and information flow indicators while providing more diverse community information.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6536-6550"},"PeriodicalIF":0.0,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142825895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CrackLens: Automated Sidewalk Crack Detection and Segmentation CrackLens:人行道裂缝自动检测与分割
Pub Date : 2024-07-31 DOI: 10.1109/TAI.2024.3435608
Chan Young Koh;Mohamed Ali;Abdeltawab Hendawi
Automatic sidewalk crack detection is necessary for urban infrastructure maintenance to ensure pedestrian safety. Such a task becomes complex on overgrown sidewalks, where crack detection usually misjudges vegetation as cracks. A lack of automated crack detection targets overgrown sidewalk problems; most crack detection focuses on vehicular roadway cracks that are recognizable even at the aerial photography level. Hence, this article introduces CrackLens, an automated sidewalk crack detection framework capable of detecting cracks even on overgrown sidewalks. We include several contributions as follows. First, we designed an automatic data parser using a red, green, and blue (RGB)-depth fusion sidewalk dataset we collected. The RGB and depth information are combined to create depth-embedded matrices, which are used to prelabel and separate the collected dataset into two categories (with and without crack). Second, we created an automatic annotation process using image processing methods and tailored the tool only to annotate cracks on overgrown sidewalks. This process is followed by a binary classification for verification, allowing the tool to target overgrown problems on sidewalks. Lastly, we explored the robustness of our framework by experimenting with it using 8,000 real sidewalk images with some overgrown problems. The evaluation leveraged several transformer-based neural network models. Our framework achieves substantial crack detection and segmentation in overgrown sidewalks by addressing the challenges of limited data and subjective manual annotations.
自动人行道裂缝检测是城市基础设施维护所必需的,以确保行人安全。在杂草丛生的人行道上,这项任务变得非常复杂,因为裂缝检测通常会将植被误判为裂缝。针对杂草丛生的人行道问题缺乏自动裂缝检测;大多数裂缝检测都集中在车行道裂缝上,即使在航拍水平上也能识别。因此,本文介绍了 CrackLens,这是一个人行道裂缝自动检测框架,即使在杂草丛生的人行道上也能检测到裂缝。我们的贡献包括以下几个方面。首先,我们利用收集到的红绿蓝(RGB)深度融合人行道数据集设计了一个自动数据解析器。将 RGB 和深度信息结合起来创建深度嵌入矩阵,用于预先标记并将收集到的数据集分为两类(有裂缝和无裂缝)。其次,我们使用图像处理方法创建了一个自动标注流程,并对该工具进行了定制,使其仅用于标注杂草丛生的人行道上的裂缝。在这一过程之后,我们进行了二元分类验证,从而使该工具能够锁定人行道上的杂草丛生问题。最后,我们使用 8000 张带有一些杂草丛生问题的真实人行道图像进行了实验,从而探索了我们框架的鲁棒性。评估利用了几个基于变压器的神经网络模型。我们的框架通过解决有限数据和主观人工标注的难题,实现了对杂草丛生的人行道的大量裂缝检测和分割。
{"title":"CrackLens: Automated Sidewalk Crack Detection and Segmentation","authors":"Chan Young Koh;Mohamed Ali;Abdeltawab Hendawi","doi":"10.1109/TAI.2024.3435608","DOIUrl":"https://doi.org/10.1109/TAI.2024.3435608","url":null,"abstract":"Automatic sidewalk crack detection is necessary for urban infrastructure maintenance to ensure pedestrian safety. Such a task becomes complex on overgrown sidewalks, where crack detection usually misjudges vegetation as cracks. A lack of automated crack detection targets overgrown sidewalk problems; most crack detection focuses on vehicular roadway cracks that are recognizable even at the aerial photography level. Hence, this article introduces CrackLens, an automated sidewalk crack detection framework capable of detecting cracks even on overgrown sidewalks. We include several contributions as follows. First, we designed an automatic data parser using a red, green, and blue (RGB)-depth fusion sidewalk dataset we collected. The RGB and depth information are combined to create depth-embedded matrices, which are used to prelabel and separate the collected dataset into two categories (with and without crack). Second, we created an automatic annotation process using image processing methods and tailored the tool only to annotate cracks on overgrown sidewalks. This process is followed by a binary classification for verification, allowing the tool to target overgrown problems on sidewalks. Lastly, we explored the robustness of our framework by experimenting with it using 8,000 real sidewalk images with some overgrown problems. The evaluation leveraged several transformer-based neural network models. Our framework achieves substantial crack detection and segmentation in overgrown sidewalks by addressing the challenges of limited data and subjective manual annotations.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5418-5430"},"PeriodicalIF":0.0,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal Fusion Induced Attention Network for Industrial VOCs Detection
Pub Date : 2024-07-30 DOI: 10.1109/TAI.2024.3436037
Yu Kang;Kehao Shi;Jifang Tan;Yang Cao;Lijun Zhao;Zhenyi Xu
Industrial volatile organic compounds (VOCs) emissions and leakage have caused serious problems to the environment and public safety. Traditional VOCs monitoring systems require professionals to carry gas sensors into the emission area to collect VOCs, which might cause secondary hazards. VOCs infrared (IR) imaging visual inspection technology is a convenient and low-cost method. However, current visual detection methods with VOCs IR imaging are limited due to blurred imaging and indeterminate gas shapes. Moreover, major works pay attention to only IR modality for VOCs emissions detection, which would neglect semantic expressions of VOCs. To this end, we propose a dual-stream fusion detection framework to deal with visible and IR features of VOCs. Additionally, a multimodal fusion induced attention (MFIA) module is designed to realize feature fusion across modalities. Specifically, MFIA uses the spatial attention fusion module (SAFM) to mine association among modalities in terms of spatial location and generates fused features by spatial location weighting. Then, the modality adapter (MA) and induced attention module (IAM) are proposed to weight latent VOCs regions in IR features, which alleviates the problem of noise interference and degradation of VOCs characterization caused by fusion. Finally, comprehensive experiments are carried out on the challenging VOCs dataset, and the mAP@0.5 and F1-score of the proposed model are 0.527 and 0.601, which outperforms the state-of-the-art methods by 3.3% and 3.4%, respectively.
{"title":"Multimodal Fusion Induced Attention Network for Industrial VOCs Detection","authors":"Yu Kang;Kehao Shi;Jifang Tan;Yang Cao;Lijun Zhao;Zhenyi Xu","doi":"10.1109/TAI.2024.3436037","DOIUrl":"https://doi.org/10.1109/TAI.2024.3436037","url":null,"abstract":"Industrial volatile organic compounds (VOCs) emissions and leakage have caused serious problems to the environment and public safety. Traditional VOCs monitoring systems require professionals to carry gas sensors into the emission area to collect VOCs, which might cause secondary hazards. VOCs infrared (IR) imaging visual inspection technology is a convenient and low-cost method. However, current visual detection methods with VOCs IR imaging are limited due to blurred imaging and indeterminate gas shapes. Moreover, major works pay attention to only IR modality for VOCs emissions detection, which would neglect semantic expressions of VOCs. To this end, we propose a dual-stream fusion detection framework to deal with visible and IR features of VOCs. Additionally, a multimodal fusion induced attention (MFIA) module is designed to realize feature fusion across modalities. Specifically, MFIA uses the spatial attention fusion module (SAFM) to mine association among modalities in terms of spatial location and generates fused features by spatial location weighting. Then, the modality adapter (MA) and induced attention module (IAM) are proposed to weight latent VOCs regions in IR features, which alleviates the problem of noise interference and degradation of VOCs characterization caused by fusion. Finally, comprehensive experiments are carried out on the challenging VOCs dataset, and the mAP@0.5 and F1-score of the proposed model are 0.527 and 0.601, which outperforms the state-of-the-art methods by 3.3% and 3.4%, respectively.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6385-6398"},"PeriodicalIF":0.0,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142810192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Negative Sampling Approaches for Neural Topic Models 评估神经主题模型的负抽样方法
Pub Date : 2024-07-29 DOI: 10.1109/TAI.2024.3432857
Suman Adhya;Avishek Lahiri;Debarshi Kumar Sanyal;Partha Pratim Das
Negative sampling has emerged as an effective technique that enables deep learning models to learn better representations by introducing the paradigm of “learn-to-compare.” The goal of this approach is to add robustness to deep learning models to learn better representation by comparing the positive samples against the negative ones. Despite its numerous demonstrations in various areas of computer vision and natural language processing, a comprehensive study of the effect of negative sampling in an unsupervised domain such as topic modeling has not been well explored. In this article, we present a comprehensive analysis of the impact of different negative sampling strategies on neural topic models. We compare the performance of several popular neural topic models by incorporating a negative sampling technique in the decoder of variational autoencoder-based neural topic models. Experiments on four publicly available datasets demonstrate that integrating negative sampling into topic models results in significant enhancements across multiple aspects, including improved topic coherence, richer topic diversity, and more accurate document classification. Manual evaluations also indicate that the inclusion of negative sampling into neural topic models enhances the quality of the generated topics. These findings highlight the potential of negative sampling as a valuable tool for advancing the effectiveness of neural topic models.
负采样已成为一种有效的技术,通过引入 "学习-比较 "范式,深度学习模型可以学习到更好的表征。这种方法的目标是增加深度学习模型的鲁棒性,通过比较正样本和负样本来学习更好的表征。尽管这种方法在计算机视觉和自然语言处理等多个领域得到了广泛应用,但在主题建模等无监督领域,对负向采样效果的综合研究还没有得到很好的探讨。在本文中,我们全面分析了不同负采样策略对神经主题模型的影响。通过在基于变异自动编码器的神经主题模型的解码器中加入负采样技术,我们比较了几种流行的神经主题模型的性能。在四个公开可用的数据集上进行的实验表明,将负采样整合到主题模型中能显著提高多个方面的性能,包括改善主题一致性、丰富主题多样性和更准确的文档分类。人工评估也表明,将负采样纳入神经主题模型可提高生成主题的质量。这些发现凸显了负抽样作为一种有价值的工具在提高神经主题模型有效性方面的潜力。
{"title":"Evaluating Negative Sampling Approaches for Neural Topic Models","authors":"Suman Adhya;Avishek Lahiri;Debarshi Kumar Sanyal;Partha Pratim Das","doi":"10.1109/TAI.2024.3432857","DOIUrl":"https://doi.org/10.1109/TAI.2024.3432857","url":null,"abstract":"Negative sampling has emerged as an effective technique that enables deep learning models to learn better representations by introducing the paradigm of “learn-to-compare.” The goal of this approach is to add robustness to deep learning models to learn better representation by comparing the positive samples against the negative ones. Despite its numerous demonstrations in various areas of computer vision and natural language processing, a comprehensive study of the effect of negative sampling in an unsupervised domain such as topic modeling has not been well explored. In this article, we present a comprehensive analysis of the impact of different negative sampling strategies on neural topic models. We compare the performance of several popular neural topic models by incorporating a negative sampling technique in the decoder of variational autoencoder-based neural topic models. Experiments on four publicly available datasets demonstrate that integrating negative sampling into topic models results in significant enhancements across multiple aspects, including improved topic coherence, richer topic diversity, and more accurate document classification. Manual evaluations also indicate that the inclusion of negative sampling into neural topic models enhances the quality of the generated topics. These findings highlight the potential of negative sampling as a valuable tool for advancing the effectiveness of neural topic models.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5630-5642"},"PeriodicalIF":0.0,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Communication-Efficient Federated Learning for Decision Trees 决策树的通信效率联合学习
Pub Date : 2024-07-25 DOI: 10.1109/TAI.2024.3433419
Shuo Zhao;Zikun Zhu;Xin Li;Ying-Chi Chen
The increasing concerns about data privacy and security have driven the emergence of federated learning, which preserves privacy by collaborative learning across multiple clients without sharing their raw data. In this article, we propose a communication-efficient federated learning algorithm for decision trees (DTs), referred to as FL-DT. The key idea is to exchange the statistics of a small number of features among the server and all clients, enabling identification of the optimal feature to split each DT node without compromising privacy. To efficiently find the splitting feature based on the partially available information at each DT node, a novel formulation is derived to estimate the lower and upper bounds of Gini indexes of all features by solving a sequence of mixed-integer convex programming problems. Our experimental results based on various public datasets demonstrate that FL-DT can reduce the communication overhead substantially without surrendering any classification accuracy, compared to other conventional methods.
对数据隐私和安全的日益关注推动了联合学习的出现,联合学习通过多个客户端之间的协作学习来保护隐私,而无需共享原始数据。在本文中,我们为决策树(DT)提出了一种通信效率高的联合学习算法,称为 FL-DT。其主要思想是在服务器和所有客户端之间交换少量特征的统计信息,从而在不损害隐私的情况下识别出分割每个 DT 节点的最佳特征。为了根据每个 DT 节点的部分可用信息高效地找到分割特征,我们推导出了一种新颖的公式,通过求解一系列混合整数凸编程问题来估计所有特征的基尼指数下限和上限。我们基于各种公共数据集的实验结果表明,与其他传统方法相比,FL-DT 可以在不降低任何分类准确性的情况下大幅减少通信开销。
{"title":"Communication-Efficient Federated Learning for Decision Trees","authors":"Shuo Zhao;Zikun Zhu;Xin Li;Ying-Chi Chen","doi":"10.1109/TAI.2024.3433419","DOIUrl":"https://doi.org/10.1109/TAI.2024.3433419","url":null,"abstract":"The increasing concerns about data privacy and security have driven the emergence of federated learning, which preserves privacy by collaborative learning across multiple clients without sharing their raw data. In this article, we propose a communication-efficient federated learning algorithm for decision trees (DTs), referred to as FL-DT. The key idea is to exchange the statistics of a small number of features among the server and all clients, enabling identification of the optimal feature to split each DT node without compromising privacy. To efficiently find the splitting feature based on the partially available information at each DT node, a novel formulation is derived to estimate the lower and upper bounds of Gini indexes of all features by solving a sequence of mixed-integer convex programming problems. Our experimental results based on various public datasets demonstrate that FL-DT can reduce the communication overhead substantially without surrendering any classification accuracy, compared to other conventional methods.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5478-5492"},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sector-Based Pairs Trading Strategy With Novel Pair Selection Technique
Pub Date : 2024-07-25 DOI: 10.1109/TAI.2024.3433469
Pranjala G. Kolapwar;Uday V. Kulkarni;Jaishri M. Waghmare
A pair trading strategy (PTS) is a balanced approach that involves simultaneous trading of two highly correlated stocks. This article introduces the PTS-return-based pair selection (PTS-R) strategy which is the modification of the traditional PTS. The PTS-R follows a similar framework to the traditional PTS, differing only in the criteria it employs for selecting stock pairs. Moreover, this article proposes a novel trading strategy called sector-based pairs trading strategy (SBPTS) along with its two variants, namely SBPTS-correlation-based pair selection (SBPTS-C) and SBPTS-return-based pair selection (SBPTS-R). The SBPTS focuses on the pairs of stocks within the same sector. It consists of three innovative phases: the classification of input stocks into the respective sectors, the identification of the best-performing sector, and the selection of stock pairs based on their returns. The goal is to identify the pairs with a strong historical correlation and the highest returns within the best-performing sector. These chosen pairs are then used for trading. The strategies are designed to enhance the efficacy of the pairs trading and are validated through experimentation on real-world stock data over a ten-year historical period from 2013 to 2023. The results demonstrate their effectiveness compared to the existing techniques for pair selection and trading strategy.
{"title":"Sector-Based Pairs Trading Strategy With Novel Pair Selection Technique","authors":"Pranjala G. Kolapwar;Uday V. Kulkarni;Jaishri M. Waghmare","doi":"10.1109/TAI.2024.3433469","DOIUrl":"https://doi.org/10.1109/TAI.2024.3433469","url":null,"abstract":"A pair trading strategy (PTS) is a balanced approach that involves simultaneous trading of two highly correlated stocks. This article introduces the PTS-return-based pair selection (PTS-R) strategy which is the modification of the traditional PTS. The PTS-R follows a similar framework to the traditional PTS, differing only in the criteria it employs for selecting stock pairs. Moreover, this article proposes a novel trading strategy called sector-based pairs trading strategy (SBPTS) along with its two variants, namely SBPTS-correlation-based pair selection (SBPTS-C) and SBPTS-return-based pair selection (SBPTS-R). The SBPTS focuses on the pairs of stocks within the same sector. It consists of three innovative phases: the classification of input stocks into the respective sectors, the identification of the best-performing sector, and the selection of stock pairs based on their returns. The goal is to identify the pairs with a strong historical correlation and the highest returns within the best-performing sector. These chosen pairs are then used for trading. The strategies are designed to enhance the efficacy of the pairs trading and are validated through experimentation on real-world stock data over a ten-year historical period from 2013 to 2023. The results demonstrate their effectiveness compared to the existing techniques for pair selection and trading strategy.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 1","pages":"3-13"},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142976032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weighted Concept Factorization Based Incomplete Multi-view Clustering 基于加权概念因式分解的不完整多视角聚类
Pub Date : 2024-07-25 DOI: 10.1109/TAI.2024.3433379
Ghufran Ahmad Khan;Jalaluddin Khan;Taushif Anwar;Zubair Ashraf;Mohammad Hafeez Javed;Bassoma Diallo
The primary objective of classical multiview clustering (MVC) is to categorize data into separate clusters under the assumption that all perspectives are completely available. However, in practical situations, it is common to encounter cases where not all viewpoints of the data are accessible. This limitation can impede the effectiveness of traditional MVC methods. The incompleteness of the clustering of multiview data has witnessed substantial progress in recent years due to its promising applications. In response to the aforementioned issue, we have tackled it by introducing an inventive MVC algorithm that is tailored to handle incomplete data from various views. Additionally, we have proposed a distinct objective function that leverages a weighted concept factorization technique to address the absence of data instances within each incomplete perspective. To address inconsistencies between different views, we introduced a coregularization factor, which operates in conjunction with a shared consensus matrix. It is important to highlight that the proposed objective function is intrinsically nonconvex, presenting challenges in terms of optimization. To secure the optimal solution for this objective function, we have implemented an iterative optimization approach to reach the local minima for our method. To underscore the efficacy and validation of our approach, we experimented with real-world datasets and used state-of-the-art methods to perform comparative assessments.
经典多视角聚类(MVC)的主要目的是在假设所有视角都完全可用的情况下,将数据归类到不同的聚类中。然而,在实际情况中,经常会遇到并非数据的所有视角都可访问的情况。这种限制会妨碍传统 MVC 方法的有效性。近年来,多视角数据聚类的不完整性因其广阔的应用前景而取得了长足的进步。针对上述问题,我们引入了一种创造性的 MVC 算法,专门用于处理来自不同视图的不完整数据。此外,我们还提出了一个独特的目标函数,利用加权概念因式分解技术来解决每个不完整视角中缺乏数据实例的问题。为了解决不同观点之间的不一致性,我们引入了一个核心模块化因子,该因子与共享共识矩阵共同发挥作用。需要强调的是,所提出的目标函数本质上是非凸的,这给优化带来了挑战。为了确保该目标函数的最优解,我们采用了迭代优化方法,以达到我们方法的局部最小值。为了强调我们方法的有效性和验证,我们使用真实世界的数据集进行了实验,并使用最先进的方法进行了比较评估。
{"title":"Weighted Concept Factorization Based Incomplete Multi-view Clustering","authors":"Ghufran Ahmad Khan;Jalaluddin Khan;Taushif Anwar;Zubair Ashraf;Mohammad Hafeez Javed;Bassoma Diallo","doi":"10.1109/TAI.2024.3433379","DOIUrl":"https://doi.org/10.1109/TAI.2024.3433379","url":null,"abstract":"The primary objective of classical multiview clustering (MVC) is to categorize data into separate clusters under the assumption that all perspectives are completely available. However, in practical situations, it is common to encounter cases where not all viewpoints of the data are accessible. This limitation can impede the effectiveness of traditional MVC methods. The incompleteness of the clustering of multiview data has witnessed substantial progress in recent years due to its promising applications. In response to the aforementioned issue, we have tackled it by introducing an inventive MVC algorithm that is tailored to handle incomplete data from various views. Additionally, we have proposed a distinct objective function that leverages a weighted concept factorization technique to address the absence of data instances within each incomplete perspective. To address inconsistencies between different views, we introduced a coregularization factor, which operates in conjunction with a shared consensus matrix. It is important to highlight that the proposed objective function is intrinsically nonconvex, presenting challenges in terms of optimization. To secure the optimal solution for this objective function, we have implemented an iterative optimization approach to reach the local minima for our method. To underscore the efficacy and validation of our approach, we experimented with real-world datasets and used state-of-the-art methods to perform comparative assessments.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5699-5708"},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE transactions on artificial intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1