首页 > 最新文献

Information Sciences最新文献

英文 中文
Combining Permutation Mass Functions based on distance and entropy of Random Permutation Set 基于随机置换集的距离和熵的置换质量函数组合
IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-17 DOI: 10.1016/j.ins.2024.121657
Linshan Li , Puhantong Rong , Meizhu Li
A novel set type, termed Random Permutation Set (RPS), has recently been introduced to account for permutations of sets. Serving as an extension of evidence theory, the concern about whether it might yield counterintuitive outcomes when confronted with high conflict akin to evidence theory arises as a pertinent issue in practical engineering applications. In this paper, we initially explore the outcomes of direct fusion amidst varying levels of extreme conflict. Following this, we innovatively proposed a fusion method based on RPS distance and entropy metrics. This method utilizes the distances between RPS for weighting and determines the final RPS subset used for weighting through the entropy of the RPS. Through the presentation of several examples and specific experiments, we demonstrate its efficacy in handling extreme conflict scenarios and enhancing the quality of fusion outcomes.
最近引入了一种新的集合类型,称为随机排列集合(RPS),用于解释集合的排列。作为证据理论的扩展,当面临类似证据理论的高度冲突时,它是否会产生反直觉的结果成为实际工程应用中的一个相关问题。在本文中,我们首先探讨了在不同程度的极端冲突中直接融合的结果。随后,我们创新性地提出了一种基于 RPS 距离和熵度量的融合方法。该方法利用 RPS 之间的距离进行加权,并通过 RPS 的熵值确定最终用于加权的 RPS 子集。通过几个例子和具体实验,我们证明了该方法在处理极端冲突场景和提高融合结果质量方面的功效。
{"title":"Combining Permutation Mass Functions based on distance and entropy of Random Permutation Set","authors":"Linshan Li ,&nbsp;Puhantong Rong ,&nbsp;Meizhu Li","doi":"10.1016/j.ins.2024.121657","DOIUrl":"10.1016/j.ins.2024.121657","url":null,"abstract":"<div><div>A novel set type, termed Random Permutation Set (RPS), has recently been introduced to account for permutations of sets. Serving as an extension of evidence theory, the concern about whether it might yield counterintuitive outcomes when confronted with high conflict akin to evidence theory arises as a pertinent issue in practical engineering applications. In this paper, we initially explore the outcomes of direct fusion amidst varying levels of extreme conflict. Following this, we innovatively proposed a fusion method based on RPS distance and entropy metrics. This method utilizes the distances between RPS for weighting and determines the final RPS subset used for weighting through the entropy of the RPS. Through the presentation of several examples and specific experiments, we demonstrate its efficacy in handling extreme conflict scenarios and enhancing the quality of fusion outcomes.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"692 ","pages":"Article 121657"},"PeriodicalIF":8.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142704070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge-constrained interest-aware multi-behavior recommendation with behavior pattern identification 具有行为模式识别功能的知识约束兴趣感知多行为推荐
IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-17 DOI: 10.1016/j.ins.2024.121652
Gayeon Park , Hyeongjun Yang , Kyuhwan Yeom , Myeongheon Jeon , Yunjeong Ko , Byungkook Oh , Kyong-Ho Lee
Recommender systems aim to accurately capture user preferences based on interacted items. Conventional recommender systems mainly rely on the singular-type behavior of users, which may limit their ability to handle practical scenarios (e.g., E-commerce). In contrast, multi-type behavior recommendation (MBR) exploits auxiliary types of behaviors (e.g., view, cart), as well as the target behavior (e.g., buy), and has proven to be an effective way to identify user preferences from various perspectives. Existing MBR methods assume that all auxiliary behaviors of a user have a positive relevance with the target behavior. However, users may not interact with items using all available behaviors, but the degree of relatedness is not explicitly taken into account. To address the issue, we propose a Knowledge-constrained Interest-aware Framework with Behavior Pattern Identification (KIPI). The proposed model identifies user-specific behavior patterns by introducing pair-wise dependency modeling to explicitly reflect the fine-grained relatedness between behavior pairs. Additionally, we enhance item representations by leveraging both instance-view knowledge graph (KG) and ontology-view KG, which provides broader concept information of items. Moreover, we design a concept-constrained Bayesian Personalized Ranking loss to reflect a user's general interest. Extensive studies on the real-world datasets demonstrate that our model outperforms state-of-the-art baselines.
推荐系统旨在根据互动项目准确捕捉用户偏好。传统的推荐系统主要依赖于用户的单一类型行为,这可能会限制其处理实际场景(如电子商务)的能力。相比之下,多类型行为推荐(MBR)利用了辅助类型行为(如查看、购物车)和目标行为(如购买),已被证明是一种从不同角度识别用户偏好的有效方法。现有的 MBR 方法假设用户的所有辅助行为都与目标行为正相关。然而,用户可能不会使用所有可用行为与物品进行交互,但相关程度却没有明确考虑在内。为了解决这个问题,我们提出了一个具有行为模式识别功能的知识约束兴趣感知框架(KIPI)。所提出的模型通过引入成对依赖建模来明确反映行为对之间的细粒度相关性,从而识别用户特定的行为模式。此外,我们还利用实例视图知识图谱(KG)和本体视图知识图谱(KG)增强了项目表征,从而提供了更广泛的项目概念信息。此外,我们还设计了一种概念约束贝叶斯个性化排名损失,以反映用户的一般兴趣。对现实世界数据集的广泛研究表明,我们的模型优于最先进的基线模型。
{"title":"Knowledge-constrained interest-aware multi-behavior recommendation with behavior pattern identification","authors":"Gayeon Park ,&nbsp;Hyeongjun Yang ,&nbsp;Kyuhwan Yeom ,&nbsp;Myeongheon Jeon ,&nbsp;Yunjeong Ko ,&nbsp;Byungkook Oh ,&nbsp;Kyong-Ho Lee","doi":"10.1016/j.ins.2024.121652","DOIUrl":"10.1016/j.ins.2024.121652","url":null,"abstract":"<div><div>Recommender systems aim to accurately capture user preferences based on interacted items. Conventional recommender systems mainly rely on the singular-type behavior of users, which may limit their ability to handle practical scenarios (e.g., E-commerce). In contrast, multi-type behavior recommendation (MBR) exploits auxiliary types of behaviors (e.g., view, cart), as well as the target behavior (e.g., buy), and has proven to be an effective way to identify user preferences from various perspectives. Existing MBR methods assume that all auxiliary behaviors of a user have a positive relevance with the target behavior. However, users may not interact with items using all available behaviors, but the degree of relatedness is not explicitly taken into account. To address the issue, we propose a <strong>K</strong>nowledge-constrained <strong>I</strong>nterest-aware Framework with Behavior <strong>P</strong>attern <strong>I</strong>dentification (KIPI). The proposed model identifies user-specific behavior patterns by introducing pair-wise dependency modeling to explicitly reflect the fine-grained relatedness between behavior pairs. Additionally, we enhance item representations by leveraging both instance-view knowledge graph (KG) and ontology-view KG, which provides broader concept information of items. Moreover, we design a concept-constrained Bayesian Personalized Ranking loss to reflect a user's general interest. Extensive studies on the real-world datasets demonstrate that our model outperforms state-of-the-art baselines.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"692 ","pages":"Article 121652"},"PeriodicalIF":8.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142703997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MEEF criterion-based spline adaptive filtering algorithm and its application 基于 MEEF 准则的样条自适应滤波算法及其应用
IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-17 DOI: 10.1016/j.ins.2024.121662
Haiquan Zhao , Yuan Gao
This paper presents an innovative the minimum error entropy with fiducial points (MEEF)-based spline adaptive filtering (S-AF) algorithm, called SAF-MEEF algorithm, which outperforms the conventional SAF algorithms that use the mean square error (MSE) criterion in reducing non-Gaussian interference. To overcome the limitation of the fixed step-size, a variable step-size strategy is also developed, resulting in the SAF-VMEEF algorithm, which improves the convergence speed and steady-state error performance. Furthermore, the computational complexity and convergence analysis of the SAF-MEEF are discussed. Nonlinear system identification simulations test the performance of the presented algorithms. Furthermore, this article accomplishes the application of nonlinear active noise control (ANC). Their effectiveness and robustness against non-Gaussian noise are demonstrated in different experimental scenarios, including α-stable noise, real-world functional magnetic resonance imaging (fMRI) noise, and real-life server room (SR) noise.
本文提出了一种创新的基于靶点(MEEF)的最小误差熵样条自适应滤波(S-AF)算法,称为 SAF-MEEF 算法,它在减少非高斯干扰方面优于使用均方误差(MSE)准则的传统 SAF 算法。为了克服固定步长的限制,还开发了一种可变步长策略,形成了 SAF-VMEEF 算法,该算法提高了收敛速度和稳态误差性能。此外,还讨论了 SAF-MEEF 算法的计算复杂性和收敛性分析。非线性系统辨识仿真检验了所提出算法的性能。此外,本文还完成了非线性主动噪声控制(ANC)的应用。在不同的实验场景中,包括α稳定噪声、真实世界的功能磁共振成像(fMRI)噪声和现实生活中的服务器机房(SR)噪声,都证明了它们对非高斯噪声的有效性和鲁棒性。
{"title":"MEEF criterion-based spline adaptive filtering algorithm and its application","authors":"Haiquan Zhao ,&nbsp;Yuan Gao","doi":"10.1016/j.ins.2024.121662","DOIUrl":"10.1016/j.ins.2024.121662","url":null,"abstract":"<div><div>This paper presents an innovative the minimum error entropy with fiducial points (MEEF)-based spline adaptive filtering (S-AF) algorithm, called SAF-MEEF algorithm, which outperforms the conventional SAF algorithms that use the mean square error (MSE) criterion in reducing non-Gaussian interference. To overcome the limitation of the fixed step-size, a variable step-size strategy is also developed, resulting in the SAF-VMEEF algorithm, which improves the convergence speed and steady-state error performance. Furthermore, the computational complexity and convergence analysis of the SAF-MEEF are discussed. Nonlinear system identification simulations test the performance of the presented algorithms. Furthermore, this article accomplishes the application of nonlinear active noise control (ANC). Their effectiveness and robustness against non-Gaussian noise are demonstrated in different experimental scenarios, including α-stable noise, real-world functional magnetic resonance imaging (fMRI) noise, and real-life server room (SR) noise.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"692 ","pages":"Article 121662"},"PeriodicalIF":8.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142704078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Driver’s facial expression recognition by using deep local and global features 利用局部和全局深度特征识别驾驶员面部表情
IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-17 DOI: 10.1016/j.ins.2024.121658
Mozhgan Rezaie Manavand , Mohammad Hosien Salarifar , Mohammad Ghavami , Mehran Taghipour-Gorjikolaie
Understanding drivers’ emotions is crucial for safety and comfort in autonomous vehicles. While Facial Expression Recognition (FER) systems perform well in controlled environments, struggle in real driving situations. To address this challenge, an Interlaced Local Attention Block within a Convolutional Neural Network (ILAB-CNN) model has been proposed to analyze drivers’ emotions. In real-world scenarios, not all facial regions contribute equally to expressing emotions; specific areas or combinations are key. Inspired by the attention mechanism, an ILAB and a Modified Squeeze-and-Excitation (MSE) block has been proposed to learn more discriminative features. The MSE block applies a self-attention mechanism on the channels, effectively identifying key features by incorporating global information and discarding irrelevant features. ILAB employs the MSE and encoder-decoder structures for region-channel specific attention in one branch and combines it with the obtained feature map of the MSE from the other branch. The proposed approach successfully captures essential information from facial expressions while utilizing a reduced number of parameters, leading to significantly improved recognition accuracy and recognition time for real-time applications. Evaluated on diverse datasets, our method shows 75.3 % recognition rate on FER-2013, 85.06 % on RAF-DB, and 98.8 % on KMU-FED, demonstrating its potential to advance FER technology.
了解驾驶员的情绪对于自动驾驶汽车的安全性和舒适性至关重要。虽然面部表情识别(FER)系统在受控环境中表现良好,但在真实驾驶环境中却举步维艰。为了应对这一挑战,我们提出了一种卷积神经网络(ILAB-CNN)中的交错局部注意力块模型来分析驾驶员的情绪。在真实世界的场景中,并非所有面部区域都能对情绪表达做出同样的贡献;特定区域或组合才是关键。受注意力机制的启发,我们提出了一个 ILAB 和一个修正的挤压-激发(MSE)区块来学习更多的分辨特征。MSE 模块在信道上应用自我注意机制,通过整合全局信息和剔除无关特征,有效识别关键特征。ILAB 在一个分支中采用 MSE 和编码器-解码器结构进行区域-信道特定关注,并将其与另一个分支中获得的 MSE 特征图相结合。所提出的方法成功地捕捉到了面部表情的基本信息,同时减少了参数数量,从而显著提高了识别准确率,缩短了实时应用的识别时间。我们的方法在不同的数据集上进行了评估,在 FER-2013 上的识别率为 75.3%,在 RAF-DB 上的识别率为 85.06%,在 KMU-FED 上的识别率为 98.8%,证明了它在推进 FER 技术方面的潜力。
{"title":"Driver’s facial expression recognition by using deep local and global features","authors":"Mozhgan Rezaie Manavand ,&nbsp;Mohammad Hosien Salarifar ,&nbsp;Mohammad Ghavami ,&nbsp;Mehran Taghipour-Gorjikolaie","doi":"10.1016/j.ins.2024.121658","DOIUrl":"10.1016/j.ins.2024.121658","url":null,"abstract":"<div><div>Understanding drivers’ emotions is crucial for safety and comfort in autonomous vehicles. While Facial Expression Recognition (FER) systems perform well in controlled environments, struggle in real driving situations. To address this challenge, an Interlaced Local Attention Block within a Convolutional Neural Network (ILAB-CNN) model has been proposed to analyze drivers’ emotions. In real-world scenarios, not all facial regions contribute equally to expressing emotions; specific areas or combinations are key. Inspired by the attention mechanism, an ILAB and a Modified Squeeze-and-Excitation (MSE) block has been proposed to learn more discriminative features. The MSE block applies a self-attention mechanism on the channels, effectively identifying key features by incorporating global information and discarding irrelevant features. ILAB employs the MSE and encoder-decoder structures for region-channel specific attention in one branch and combines it with the obtained feature map of the MSE from the other branch. The proposed approach successfully captures essential information from facial expressions while utilizing a reduced number of parameters, leading to significantly improved recognition accuracy and recognition time for real-time applications. Evaluated on diverse datasets, our method shows 75.3 % recognition rate on FER-2013, 85.06 % on RAF-DB, and 98.8 % on KMU-FED, demonstrating its potential to advance FER technology.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"692 ","pages":"Article 121658"},"PeriodicalIF":8.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142704164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Delay-induced consensus of fractional-order complex networks via intermittent sampled position control 通过间歇采样位置控制实现分数阶复杂网络的延迟诱导共识
IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-17 DOI: 10.1016/j.ins.2024.121667
Yanyan Ye , Lei Ouyang , Zhixia Ding , Yao Zhao
This paper investigates the consensus problem of fractional-order complex networks via intermittent sampled position control. A delay-induced consensus protocol with intermittent sampled position for the fractional-order system is proposed, where only the current and delayed sampled positions are used. Meanwhile, the controllers operate only for a period of time in each sampling interval, which decrease the working time and update rates of controllers. Successively, by discussing four situations involving delay, sampling period, and communication width, necessary and sufficient consensus criteria are presented. It is interesting to find the delay has a positive impact on consensus of fractional-order complex networks, since consensus cannot be achieved without delay under the proposed protocol. Finally, simulation examples are presented to illustrate the theoretical analysis.
本文通过间歇采样位置控制研究了分数阶复杂网络的共识问题。针对分数阶系统提出了一种间歇采样位置的延迟诱导共识协议,其中只使用当前和延迟采样位置。同时,控制器在每个采样间隔内只运行一段时间,从而减少了控制器的工作时间和更新率。随后,通过讨论涉及延迟、采样周期和通信宽度的四种情况,提出了必要和充分的共识标准。值得注意的是,延迟对分数阶复杂网络的共识有积极影响,因为在所提出的协议下,没有延迟就无法达成共识。最后,还列举了一些仿真实例来说明理论分析。
{"title":"Delay-induced consensus of fractional-order complex networks via intermittent sampled position control","authors":"Yanyan Ye ,&nbsp;Lei Ouyang ,&nbsp;Zhixia Ding ,&nbsp;Yao Zhao","doi":"10.1016/j.ins.2024.121667","DOIUrl":"10.1016/j.ins.2024.121667","url":null,"abstract":"<div><div>This paper investigates the consensus problem of fractional-order complex networks via intermittent sampled position control. A delay-induced consensus protocol with intermittent sampled position for the fractional-order system is proposed, where only the current and delayed sampled positions are used. Meanwhile, the controllers operate only for a period of time in each sampling interval, which decrease the working time and update rates of controllers. Successively, by discussing four situations involving delay, sampling period, and communication width, necessary and sufficient consensus criteria are presented. It is interesting to find the delay has a positive impact on consensus of fractional-order complex networks, since consensus cannot be achieved without delay under the proposed protocol. Finally, simulation examples are presented to illustrate the theoretical analysis.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"692 ","pages":"Article 121667"},"PeriodicalIF":8.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142704059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boosting one-class transfer learning for multiple view uncertain data 针对多视图不确定数据的助推单类迁移学习
IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-17 DOI: 10.1016/j.ins.2024.121653
Bo Liu , Fan Cao , Shilei Zhao , Yanshan Xiao
Transfer learning can leverage knowledge from source tasks to improve learning on a target task, even when training samples are limited. However, most previous transfer learning approaches focus on a single view of the data and assume no uncertainty in the training samples. To address these limitations, we propose a novel method called boosting one-class transfer learning for multi-view uncertain data (UMTO-SVMs), which handles one-class classification in multi-view data with uncertain information. Our method transfers knowledge containing uncertainty from multiple source tasks to the target task and constrains complementary information across different views to improve consistency. By combining basic classifiers using the Adaboost algorithm, we build a robust classifier. We also design an iterative framework to optimize the method and prove the convergence of the algorithm. Experimental results on three benchmark datasets show that UMTO-SVMs outperform previous one-class classification methods.
迁移学习可以利用源任务的知识来提高目标任务的学习效果,即使在训练样本有限的情况下也是如此。然而,以前的迁移学习方法大多只关注数据的单一视图,并假设训练样本中没有不确定性。为了解决这些局限性,我们提出了一种名为 "多视角不确定数据的单类迁移学习(UMTO-SVMs)"的新方法,它可以处理具有不确定信息的多视角数据中的单类分类。我们的方法将包含不确定性的知识从多个源任务转移到目标任务,并约束不同视图之间的互补信息,以提高一致性。通过使用 Adaboost 算法组合基本分类器,我们建立了一个鲁棒分类器。我们还设计了一个迭代框架来优化该方法,并证明了算法的收敛性。在三个基准数据集上的实验结果表明,UMTO-SVM 优于之前的单类分类方法。
{"title":"Boosting one-class transfer learning for multiple view uncertain data","authors":"Bo Liu ,&nbsp;Fan Cao ,&nbsp;Shilei Zhao ,&nbsp;Yanshan Xiao","doi":"10.1016/j.ins.2024.121653","DOIUrl":"10.1016/j.ins.2024.121653","url":null,"abstract":"<div><div>Transfer learning can leverage knowledge from source tasks to improve learning on a target task, even when training samples are limited. However, most previous transfer learning approaches focus on a single view of the data and assume no uncertainty in the training samples. To address these limitations, we propose a novel method called boosting one-class transfer learning for multi-view uncertain data (UMTO-SVMs), which handles one-class classification in multi-view data with uncertain information. Our method transfers knowledge containing uncertainty from multiple source tasks to the target task and constrains complementary information across different views to improve consistency. By combining basic classifiers using the Adaboost algorithm, we build a robust classifier. We also design an iterative framework to optimize the method and prove the convergence of the algorithm. Experimental results on three benchmark datasets show that UMTO-SVMs outperform previous one-class classification methods.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"692 ","pages":"Article 121653"},"PeriodicalIF":8.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142704069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse loss-aware ternarization for neural networks 神经网络的稀疏损失感知三元化
IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-17 DOI: 10.1016/j.ins.2024.121668
Ruizhi Zhou , Lingfeng Niu , Dachuan Xu
Deep neural networks (DNNs) have shown great success in machine learning tasks and widely used in many fields. However, the substantial computational and storage requirements inherent to DNNs are usually high, which poses challenges for deploying deep learning models on resource-limited devices and hindering further applications. To address this issue, the lightweight nature of neural networks has garnered significant attention, and quantization has become one of the most popular approaches to compress DNNs. In this paper, we introduce a sparse loss-aware ternarization (SLT) model for training ternary neural networks, which encodes the floating-point parameters into {1,0,1}. Specifically, we abstract the ternarization process as an optimization problem with discrete constraints, and then modify it by applying sparse regularization to identify insignificant weights. To deal with the challenges brought by the discreteness of the model, we decouple discrete constraints from the objective function and design a new algorithm based on the Alternating Direction Method of Multipliers (ADMM). Extensive experiments are conducted on public datasets with popular network architectures. Comparisons with several state-of-the-art baselines demonstrate that SLT always attains comparable accuracy while having better compression performance.
深度神经网络(DNN)在机器学习任务中取得了巨大成功,并广泛应用于许多领域。然而,DNN 固有的大量计算和存储要求通常很高,这给在资源有限的设备上部署深度学习模型带来了挑战,并阻碍了进一步的应用。为解决这一问题,神经网络的轻量级特性受到了广泛关注,量化已成为压缩 DNN 的最常用方法之一。本文介绍了一种用于训练三元神经网络的稀疏损失感知三元化(SLT)模型,它将浮点参数编码为 {-1,0,1}。具体来说,我们将三元化过程抽象为一个具有离散约束的优化问题,然后通过应用稀疏正则化来识别不重要的权重,从而对其进行修改。为了应对模型离散性带来的挑战,我们将离散约束与目标函数解耦,并设计了一种基于交替方向乘法(ADMM)的新算法。我们在采用流行网络架构的公共数据集上进行了广泛的实验。与几种最先进的基线算法进行比较后发现,SLT 算法总能达到相当的精度,同时具有更好的压缩性能。
{"title":"Sparse loss-aware ternarization for neural networks","authors":"Ruizhi Zhou ,&nbsp;Lingfeng Niu ,&nbsp;Dachuan Xu","doi":"10.1016/j.ins.2024.121668","DOIUrl":"10.1016/j.ins.2024.121668","url":null,"abstract":"<div><div>Deep neural networks (DNNs) have shown great success in machine learning tasks and widely used in many fields. However, the substantial computational and storage requirements inherent to DNNs are usually high, which poses challenges for deploying deep learning models on resource-limited devices and hindering further applications. To address this issue, the lightweight nature of neural networks has garnered significant attention, and quantization has become one of the most popular approaches to compress DNNs. In this paper, we introduce a sparse loss-aware ternarization (SLT) model for training ternary neural networks, which encodes the floating-point parameters into <span><math><mo>{</mo><mo>−</mo><mn>1</mn><mo>,</mo><mn>0</mn><mo>,</mo><mn>1</mn><mo>}</mo></math></span>. Specifically, we abstract the ternarization process as an optimization problem with discrete constraints, and then modify it by applying sparse regularization to identify insignificant weights. To deal with the challenges brought by the discreteness of the model, we decouple discrete constraints from the objective function and design a new algorithm based on the Alternating Direction Method of Multipliers (ADMM). Extensive experiments are conducted on public datasets with popular network architectures. Comparisons with several state-of-the-art baselines demonstrate that SLT always attains comparable accuracy while having better compression performance.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"693 ","pages":"Article 121668"},"PeriodicalIF":8.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142706188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
THATSN: Temporal hierarchical aggregation tree structure network for long-term time-series forecasting THATSN:用于长期时间序列预测的时空分层聚合树结构网络
IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-17 DOI: 10.1016/j.ins.2024.121659
Fan Zhang , Min Wang , Wenchang Zhang , Hua Wang
The performance of time-series forecasting, which is crucial for predicting future values from historical data, improves with an accurate representation of time-series data. An accurate representation aids in adopting effective strategies to address future uncertainties and reduce the risks associated with planning and decision-making. However, most studies in long-term time-series forecasting integrate features across all time steps, complicating the representation of relationships among cyclical patterns in a series. This paper introduces the Temporal Hierarchical Aggregation Tree Structure Network (THATSN), which is a model that focuses on dynamic modeling over time. We transform time-series data into sequences with multiple cyclical patterns and model the interrelationships among these features to generate a hierarchical tree structure. We design a tree-structured long short-term memory network that functions as a gated adaptive aggregator to process the features of learned child nodes. This aggregator, which can train parameters, adaptively selects child node information benefiting the parent node. This method enhances information usage within the tree and captures cyclical information dynamically. Experiments validate the effectiveness of THATSN, demonstrating its capacity to express cyclical relationships in time-series data. The model exhibits state-of-the-art forecasting performance across several datasets, achieving an overall 15% improvement in MSE, thereby establishing its robustness in long-term forecasting.
时间序列预测对于从历史数据中预测未来值至关重要,而时间序列数据的准确表示可以改善时间序列预测的性能。准确的表示有助于采取有效的策略来应对未来的不确定性,降低与规划和决策相关的风险。然而,长期时间序列预测方面的大多数研究都整合了所有时间步长的特征,从而使序列中周期模式之间关系的表示变得复杂。本文介绍了时序分层聚合树结构网络(THATSN),这是一个侧重于随时间动态建模的模型。我们将时间序列数据转化为具有多种循环模式的序列,并对这些特征之间的相互关系进行建模,从而生成分层树状结构。我们设计了一个树状结构的长短期记忆网络,作为一个门控自适应聚合器来处理所学子节点的特征。这个聚合器可以训练参数,自适应地选择有利于父节点的子节点信息。这种方法提高了树内信息的使用率,并能动态捕捉循环信息。实验验证了 THATSN 的有效性,证明它有能力表达时间序列数据中的周期关系。该模型在多个数据集上表现出最先进的预测性能,MSE 总体提高了 15%,从而确立了其在长期预测中的稳健性。
{"title":"THATSN: Temporal hierarchical aggregation tree structure network for long-term time-series forecasting","authors":"Fan Zhang ,&nbsp;Min Wang ,&nbsp;Wenchang Zhang ,&nbsp;Hua Wang","doi":"10.1016/j.ins.2024.121659","DOIUrl":"10.1016/j.ins.2024.121659","url":null,"abstract":"<div><div>The performance of time-series forecasting, which is crucial for predicting future values from historical data, improves with an accurate representation of time-series data. An accurate representation aids in adopting effective strategies to address future uncertainties and reduce the risks associated with planning and decision-making. However, most studies in long-term time-series forecasting integrate features across all time steps, complicating the representation of relationships among cyclical patterns in a series. This paper introduces the Temporal Hierarchical Aggregation Tree Structure Network (THATSN), which is a model that focuses on dynamic modeling over time. We transform time-series data into sequences with multiple cyclical patterns and model the interrelationships among these features to generate a hierarchical tree structure. We design a tree-structured long short-term memory network that functions as a gated adaptive aggregator to process the features of learned child nodes. This aggregator, which can train parameters, adaptively selects child node information benefiting the parent node. This method enhances information usage within the tree and captures cyclical information dynamically. Experiments validate the effectiveness of THATSN, demonstrating its capacity to express cyclical relationships in time-series data. The model exhibits state-of-the-art forecasting performance across several datasets, achieving an overall 15% improvement in MSE, thereby establishing its robustness in long-term forecasting.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"692 ","pages":"Article 121659"},"PeriodicalIF":8.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142704067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep reinforcement learning-guided coevolutionary algorithm for constrained multiobjective optimization 受约束多目标优化的深度强化学习引导的协同进化算法
IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-17 DOI: 10.1016/j.ins.2024.121648
Wenguan Luo , Xiaobing Yu , Gary G. Yen , Yifan Wei
Effectively managing convergence, diversity, and feasibility constitutes a fundamental trinity of tasks in optimizing constrained multiobjective optimization problems (CMOPs). Nevertheless, contemporary constrained multiobjective evolutionary algorithms (CMOEAs) frequently encounter challenges in reconciling these imperatives simultaneously. Drawing inspiration from overwhelming success in artificial intelligence, we propose a deep reinforcement learning-guided coevolutionary algorithm (DRLCEA) to tackle this predicament. DRLCEA employs two populations to optimize the original and unconstrained versions of the CMOP, respectively and then fosters cooperation between them according to the guidance of DRL. The established DRL employs two evaluation metrics to appraise population convergence, diversity, and feasibility, thus remarkably proficient in reflecting and steering the coevolution. Therefore, the proposed DRLCEA could effectively locate the feasible regions and approximate the constrained Pareto front. We assess the proposed algorithm on 32 benchmark CMOPs and one real-world UAV emergency track planning (UETP) application. Experimental results undoubtedly demonstrate the superiority and robustness of the proposed DRLCEA.
有效管理收敛性、多样性和可行性是优化受限多目标优化问题(CMOPs)的三位一体的基本任务。然而,当代的约束多目标进化算法(CMOEAs)在同时协调这些必要条件时经常遇到挑战。从人工智能领域的巨大成功中汲取灵感,我们提出了一种深度强化学习引导的协同进化算法(DRLCEA)来解决这一难题。DRLCEA 采用两个种群分别优化 CMOP 的原始版本和无约束版本,然后根据 DRL 的指导促进它们之间的合作。已建立的 DRL 采用了两个评价指标来评估种群的收敛性、多样性和可行性,因此能够很好地反映和引导协同进化。因此,所提出的 DRLCEA 可以有效地定位可行区域并逼近受约束的帕累托前沿。我们在 32 个基准 CMOP 和一个真实世界的无人机紧急轨迹规划(UETP)应用中评估了所提出的算法。实验结果无疑证明了所提出的 DRLCEA 的优越性和鲁棒性。
{"title":"Deep reinforcement learning-guided coevolutionary algorithm for constrained multiobjective optimization","authors":"Wenguan Luo ,&nbsp;Xiaobing Yu ,&nbsp;Gary G. Yen ,&nbsp;Yifan Wei","doi":"10.1016/j.ins.2024.121648","DOIUrl":"10.1016/j.ins.2024.121648","url":null,"abstract":"<div><div>Effectively managing convergence, diversity, and feasibility constitutes a fundamental trinity of tasks in optimizing constrained multiobjective optimization problems (CMOPs). Nevertheless, contemporary constrained multiobjective evolutionary algorithms (CMOEAs) frequently encounter challenges in reconciling these imperatives simultaneously. Drawing inspiration from overwhelming success in artificial intelligence, we propose a deep reinforcement learning-guided coevolutionary algorithm (DRLCEA) to tackle this predicament. DRLCEA employs two populations to optimize the original and unconstrained versions of the CMOP, respectively and then fosters cooperation between them according to the guidance of DRL. The established DRL employs two evaluation metrics to appraise population convergence, diversity, and feasibility, thus remarkably proficient in reflecting and steering the coevolution. Therefore, the proposed DRLCEA could effectively locate the feasible regions and approximate the constrained Pareto front. We assess the proposed algorithm on 32 benchmark CMOPs and one real-world UAV emergency track planning (UETP) application. Experimental results undoubtedly demonstrate the superiority and robustness of the proposed DRLCEA.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"692 ","pages":"Article 121648"},"PeriodicalIF":8.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142703999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An interval-valued carbon price prediction model based on improved multi-scale feature selection and optimal multi-kernel support vector regression 基于改进的多尺度特征选择和最优多核支持向量回归的区间值碳价格预测模型
IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-16 DOI: 10.1016/j.ins.2024.121651
Yuxuan Lu, Jujie Wang, Qian Li
Precise carbon price prediction is crucial for informing climate policies, maintaining carbon markets, and driving global green transformation. Currently, decomposition integration methods are extensively employed for carbon price forecasting. However, most studies focus on predicting single values, neglecting the inherent volatility and uncertainty of interval-valued data. To address this gap, this study introduces an advanced interval-valued decomposition integration model that incorporates data preprocessing, improved multi-scale feature selection, and data-driven prediction technique. Initially, data preprocessing transforms the maximum and minimum carbon prices into central and radius sequences, capturing greater volatility information while eliminating noise and managing outliers. Subsequently, improved variational mode decomposition is utilized to optimally decompose and reconstruct the center and radius series, which enables a deeper exploration of the features of interval-valued data. A tailored data-driven prediction method is then employed to analyze sub-sequences with distinct characteristics separately, significantly reducing prediction errors. To assess the reliability and stability of the proposed model, a comprehensive comparative experiment is conducted, with results providing strong evidence supporting its effectiveness.
精确的碳价格预测对于为气候政策提供信息、维护碳市场以及推动全球绿色转型至关重要。目前,分解整合方法被广泛应用于碳价格预测。然而,大多数研究都侧重于预测单一数值,忽略了区间值数据固有的波动性和不确定性。针对这一缺陷,本研究引入了一种先进的区间值分解整合模型,该模型融合了数据预处理、改进的多尺度特征选择和数据驱动预测技术。首先,数据预处理将最大和最小碳价格转换为中心序列和半径序列,从而捕捉到更多的波动信息,同时消除噪声和管理异常值。随后,利用改进的变模分解技术对中心序列和半径序列进行优化分解和重构,从而更深入地挖掘区间值数据的特征。然后,采用量身定制的数据驱动预测方法,分别分析具有不同特征的子序列,从而显著减少预测误差。为了评估所提出模型的可靠性和稳定性,我们进行了全面的对比实验,结果有力地证明了其有效性。
{"title":"An interval-valued carbon price prediction model based on improved multi-scale feature selection and optimal multi-kernel support vector regression","authors":"Yuxuan Lu,&nbsp;Jujie Wang,&nbsp;Qian Li","doi":"10.1016/j.ins.2024.121651","DOIUrl":"10.1016/j.ins.2024.121651","url":null,"abstract":"<div><div>Precise carbon price prediction is crucial for informing climate policies, maintaining carbon markets, and driving global green transformation. Currently, decomposition integration methods are extensively employed for carbon price forecasting. However, most studies focus on predicting single values, neglecting the inherent volatility and uncertainty of interval-valued data. To address this gap, this study introduces an advanced interval-valued decomposition integration model that incorporates data preprocessing, improved multi-scale feature selection, and data-driven prediction technique. Initially, data preprocessing transforms the maximum and minimum carbon prices into central and radius sequences, capturing greater volatility information while eliminating noise and managing outliers. Subsequently, improved variational mode decomposition is utilized to optimally decompose and reconstruct the center and radius series, which enables a deeper exploration of the features of interval-valued data. A tailored data-driven prediction method is then employed to analyze sub-sequences with distinct characteristics separately, significantly reducing prediction errors. To assess the reliability and stability of the proposed model, a comprehensive comparative experiment is conducted, with results providing strong evidence supporting its effectiveness.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"692 ","pages":"Article 121651"},"PeriodicalIF":8.1,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142703995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1