Pub Date : 2024-08-21DOI: 10.1016/j.ins.2024.121363
The process of training feedforward neural networks (FFNNs) can benefit from an automated process where the best heuristic to train the network is sought out automatically by means of a high-level probabilistic-based heuristic. This research introduces a novel population-based Bayesian hyper-heuristic (BHH) that is used to train feedforward neural networks (FFNNs). The performance of the BHH is compared to that of ten popular low-level heuristics, each with different search behaviours. The chosen heuristic pool consists of classic gradient-based heuristics as well as meta-heuristics (MHs). The empirical process is executed on fourteen datasets consisting of classification and regression problems with varying characteristics. The BHH is shown to be able to train FFNNs well and provide an automated method for finding the best heuristic to train the FFNNs at various stages of the training process.
{"title":"Training feedforward neural networks with Bayesian hyper-heuristics","authors":"","doi":"10.1016/j.ins.2024.121363","DOIUrl":"10.1016/j.ins.2024.121363","url":null,"abstract":"<div><p>The process of training <em>feedforward neural networks</em> (FFNNs) can benefit from an automated process where the best heuristic to train the network is sought out automatically by means of a high-level probabilistic-based heuristic. This research introduces a novel population-based <em>Bayesian hyper-heuristic</em> (BHH) that is used to train <em>feedforward neural networks</em> (FFNNs). The performance of the BHH is compared to that of ten popular low-level heuristics, each with different search behaviours. The chosen heuristic pool consists of classic gradient-based heuristics as well as <em>meta-heuristics</em> (MHs). The empirical process is executed on fourteen datasets consisting of classification and regression problems with varying characteristics. The BHH is shown to be able to train FFNNs well and provide an automated method for finding the best heuristic to train the FFNNs at various stages of the training process.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0020025524012775/pdfft?md5=1d463e50138859a6bcaad1358d8cf44d&pid=1-s2.0-S0020025524012775-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-20DOI: 10.1016/j.ins.2024.121343
The study demonstrates the application of OWA operators to binary and multiclass classification problems and seeks a way to improve classification accuracy using smoothing methods. OWA operators are used to aggregate class membership probabilities obtained from individual classifiers. Smoothing methods inspired by Newton-Cotes quadratures are applied before the aggregation step to improve the quality of the final results. Moreover, several sets of weights are used for OWA operators, including sets of weights based on the accuracy of individual classifiers. The experiments are conducted on 20 datasets, from which 7 are designed for binary classification and the rest are for multiclass classification. A comparison of the average accuracy for different sets of weights is shown. On the basis of experimental results, smoothing methods that significantly improve classification accuracy are identified.
{"title":"Smooth Ordered Weighted Averaging operators","authors":"","doi":"10.1016/j.ins.2024.121343","DOIUrl":"10.1016/j.ins.2024.121343","url":null,"abstract":"<div><p>The study demonstrates the application of OWA operators to binary and multiclass classification problems and seeks a way to improve classification accuracy using smoothing methods. OWA operators are used to aggregate class membership probabilities obtained from individual classifiers. Smoothing methods inspired by Newton-Cotes quadratures are applied before the aggregation step to improve the quality of the final results. Moreover, several sets of weights are used for OWA operators, including sets of weights based on the accuracy of individual classifiers. The experiments are conducted on 20 datasets, from which 7 are designed for binary classification and the rest are for multiclass classification. A comparison of the average accuracy for different sets of weights is shown. On the basis of experimental results, smoothing methods that significantly improve classification accuracy are identified.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142012038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-20DOI: 10.1016/j.ins.2024.121344
As heavy-tailed distributions are ubiquitous in many real applications, robust regression has been extensively applied in machine learning and exhibits the superiority in deal with heavy-tailed distribution. Current existing robust methods for regression are based on independence assumption of features and ignore interaction between them, which may lead to poor generalization due to most datasets unsatisfying the assumption. Actually, interaction features formed by the composition of two or more features are particularly helpful to obtain the high-order information in many applications. In this paper, we propose a novel adaptive robust feature interaction selection model for regression with heavy-tailed distributions, termed Adaptive Robust Feature Interaction Selection (ARFIS). Firstly, we consider pairwise feature interaction by augmenting a feature vector with product of features for regression with heavy tailed distribution. Secondly, we propose feature interaction selection models based on quantile loss with different regularizers to learn parameters. The consistency of the proposed model ARFIS is theoretically proven, and an efficient algorithm is presented for solving proposed model. Finally, experimental results on simulation data, UCI datasets and a real-world dataset validate good accuracy, interpretability and robustness of our proposed models.
{"title":"ARFIS: An adaptive robust model for regression with heavy-tailed distribution","authors":"","doi":"10.1016/j.ins.2024.121344","DOIUrl":"10.1016/j.ins.2024.121344","url":null,"abstract":"<div><p>As heavy-tailed distributions are ubiquitous in many real applications, robust regression has been extensively applied in machine learning and exhibits the superiority in deal with heavy-tailed distribution. Current existing robust methods for regression are based on independence assumption of features and ignore interaction between them, which may lead to poor generalization due to most datasets unsatisfying the assumption. Actually, interaction features formed by the composition of two or more features are particularly helpful to obtain the high-order information in many applications. In this paper, we propose a novel adaptive robust feature interaction selection model for regression with heavy-tailed distributions, termed Adaptive Robust Feature Interaction Selection (ARFIS). Firstly, we consider pairwise feature interaction by augmenting a feature vector with product of features for regression with heavy tailed distribution. Secondly, we propose feature interaction selection models based on quantile loss with different regularizers to learn parameters. The consistency of the proposed model ARFIS is theoretically proven, and an efficient algorithm is presented for solving proposed model. Finally, experimental results on simulation data, UCI datasets and a real-world dataset validate good accuracy, interpretability and robustness of our proposed models.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142076827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-19DOI: 10.1016/j.ins.2024.121374
Developing an early warning model for company financial crises holds critical significance in robust risk management and ensuring the enduring stability of the capital market. Although the existing research has achieved rich results, the disadvantages of insufficient text information mining and poor model performance still exist. To alleviate the problem of insufficient text information mining, we collect related financial and annual report data from 820 listed companies in mainland China from 2018 to 2023 by using sophisticated web crawlers and advanced text sentiment analysis technologies and using missing value interpolation, standardization, and data balancing to build multi-source datasets of companies. Ranking the feature importance of multi-source data promotes understanding the formation of financial crises for companies. In the meantime, a novel Attention-based Fuzzy Neural Network (AFNN) was proposed to parse multi-source data to forecast financial crises among listed companies. Experimental results indicate that AFNN exhibits significantly improved performance compared to other advanced methods.
{"title":"Attention-based fuzzy neural networks designed for early warning of financial crises of listed companies","authors":"","doi":"10.1016/j.ins.2024.121374","DOIUrl":"10.1016/j.ins.2024.121374","url":null,"abstract":"<div><p>Developing an early warning model for company financial crises holds critical significance in robust risk management and ensuring the enduring stability of the capital market. Although the existing research has achieved rich results, the disadvantages of insufficient text information mining and poor model performance still exist. To alleviate the problem of insufficient text information mining, we collect related financial and annual report data from 820 listed companies in mainland China from 2018 to 2023 by using sophisticated web crawlers and advanced text sentiment analysis technologies and using missing value interpolation, standardization, and data balancing to build multi-source datasets of companies. Ranking the feature importance of multi-source data promotes understanding the formation of financial crises for companies. In the meantime, a novel Attention-based Fuzzy Neural Network (AFNN) was proposed to parse multi-source data to forecast financial crises among listed companies. Experimental results indicate that AFNN exhibits significantly improved performance compared to other advanced methods.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142048911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-19DOI: 10.1016/j.ins.2024.121376
Brain age gap can be estimated from brain images, serving as a valuable biomarker for aging-associated diseases, using deep neural networks. Traditional brain age prediction methods tend to rely on unimodal data. Multimodal data can provide more comprehensive information and improve prediction accuracy. However, existing multimodal fusion methods often fall short in fully leveraging the correlations and complementarities between different modalities. This paper introduces a novel multimodal fuzzy feature fusion collaborative prediction algorithm for brain age estimation (MFCA). The proposed approach integrates multiple imaging modalities using a fuzzy fusion module and a multimodal collaborative convolutional module to effectively leverage inter-modal correlations and complementary information. Specifically, a convolutional neural network is used to extract feature from multimodal brain images, which are then combined into a global feature tensor via radial joins. The fuzzy fusion module employs fuzzy theory to fuse the correlation features of different modalities, while the multimodal collaborative convolutional module enhances complementary information through modality-specific convolutional layers. Age prediction is then performed by an age prediction module containing three linear regression modules. Additionally, an optimized sorting contrast loss is incorporated to improve the accuracy of age prediction. The proposed method was evaluated on the SRPBS multi-disorder MRI dataset, and the experimental results demonstrate that MFCA achieves a mean absolute error of 5.661 and a Pearson correlation coefficient of 0.947, outperforming several state-of-the-art methods.
{"title":"MFCA: Collaborative prediction algorithm of brain age based on multimodal fuzzy feature fusion","authors":"","doi":"10.1016/j.ins.2024.121376","DOIUrl":"10.1016/j.ins.2024.121376","url":null,"abstract":"<div><p>Brain age gap can be estimated from brain images, serving as a valuable biomarker for aging-associated diseases, using deep neural networks. Traditional brain age prediction methods tend to rely on unimodal data. Multimodal data can provide more comprehensive information and improve prediction accuracy. However, existing multimodal fusion methods often fall short in fully leveraging the correlations and complementarities between different modalities. This paper introduces a novel multimodal fuzzy feature fusion collaborative prediction algorithm for brain age estimation (MFCA). The proposed approach integrates multiple imaging modalities using a fuzzy fusion module and a multimodal collaborative convolutional module to effectively leverage inter-modal correlations and complementary information. Specifically, a convolutional neural network is used to extract feature from multimodal brain images, which are then combined into a global feature tensor via radial joins. The fuzzy fusion module employs fuzzy theory to fuse the correlation features of different modalities, while the multimodal collaborative convolutional module enhances complementary information through modality-specific convolutional layers. Age prediction is then performed by an age prediction module containing three linear regression modules. Additionally, an optimized sorting contrast loss is incorporated to improve the accuracy of age prediction. The proposed method was evaluated on the SRPBS multi-disorder MRI dataset, and the experimental results demonstrate that MFCA achieves a mean absolute error of 5.661 and a Pearson correlation coefficient of 0.947, outperforming several state-of-the-art methods.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-18DOI: 10.1016/j.ins.2024.121352
A multivariable adaptive decoupling control scheme is proposed based on stochastic configuration networks with serial-parallel switching distribution (SPSCN). Firstly, a linear controller is designed by combining a PID controller, feedback decoupling, and one-step optimal control. Secondly, a nonlinear controller is presented to deal with higher-order nonlinear terms and unknown external perturbations. SPSCN is used to improve the prediction accuracy of unmodeled dynamics. It combines uniform and normal search strategies in a serial-parallel fashion, aiming at improving the node quality and reducing the model complexity. The approximation performance of the SPSCN algorithm is demonstrated by performing approximation experiments with two functions and four benchmark datasets. Compared with the generalized minimum variance control (GMVC) algorithm in controlling the process of cement raw material decomposition, our proposed control scheme outperforms.
{"title":"Multivariable adaptive decoupling control based on stochastic configuration networks with serial-parallel switching distribution","authors":"","doi":"10.1016/j.ins.2024.121352","DOIUrl":"10.1016/j.ins.2024.121352","url":null,"abstract":"<div><p>A multivariable adaptive decoupling control scheme is proposed based on stochastic configuration networks with serial-parallel switching distribution (SPSCN). Firstly, a linear controller is designed by combining a PID controller, feedback decoupling, and one-step optimal control. Secondly, a nonlinear controller is presented to deal with higher-order nonlinear terms and unknown external perturbations. SPSCN is used to improve the prediction accuracy of unmodeled dynamics. It combines uniform and normal search strategies in a serial-parallel fashion, aiming at improving the node quality and reducing the model complexity. The approximation performance of the SPSCN algorithm is demonstrated by performing approximation experiments with two functions and four benchmark datasets. Compared with the generalized minimum variance control (GMVC) algorithm in controlling the process of cement raw material decomposition, our proposed control scheme outperforms.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-16DOI: 10.1016/j.ins.2024.121366
Autonomous execution of tasks by unmanned aerial vehicles (UAVs) relies heavily on object detection. However, object detection in most images presents challenges such as complex backgrounds, small targets, and obstructions. Additionally, the limited computing speed and memory of the UAV processor affects the accuracy of conventional object detection algorithms. This paper proposes LUD-You Only Look Once (YOLO), a small and lightweight object detection algorithm for UAVs based on YOLOv8. The proposed algorithm introduces a new multiscale feature fusion mode that solves the degradation in feature propagation and interaction through the introduction of upsampling in the feature pyramid network and the progressive feature pyramid network. The application of the dynamic sparse attention mechanism in the Cf2 module achieves flexible computing allocation and content awareness. Furthermore, the proposed model is optimized to be sparse and lightweight, making it possible to deploy on UAV edge devices. Finally, the effectiveness and superiority of LUD-YOLO were verified on the VisDrone2019 and UAVDT datasets. The results of ablation and comparison experiments show that compared with the original algorithm, LUDY-N and LUDY-S have shown excellent performance in various evaluation indexes, indicating that the proposed improvement strategies make the model have better robustness and generalization. Moreover, compared with multiple other popular competitors, the proposed improvement strategies enable LUD-YOLO to have the best overall performance, providing an effective solution for UAVs object detection while balancing model size and detection accuracy.
无人飞行器(UAV)自主执行任务在很大程度上依赖于目标检测。然而,大多数图像中的物体检测都面临着复杂背景、小目标和障碍物等挑战。此外,无人飞行器处理器有限的计算速度和内存也影响了传统物体检测算法的准确性。本文提出了一种基于 YOLOv8 的小型轻量级无人机目标检测算法 LUD--You Only Look Once (YOLO)。该算法引入了新的多尺度特征融合模式,通过在特征金字塔网络和渐进式特征金字塔网络中引入上采样,解决了特征传播和交互中的退化问题。Cf2 模块中动态稀疏关注机制的应用实现了灵活的计算分配和内容感知。此外,所提出的模型经过优化,具有稀疏性和轻量级的特点,可以部署在无人机边缘设备上。最后,在 VisDrone2019 和 UAVDT 数据集上验证了 LUD-YOLO 的有效性和优越性。消融和对比实验结果表明,与原始算法相比,LUDY-N 和 LUDY-S 在各项评价指标上均表现优异,表明所提出的改进策略使模型具有更好的鲁棒性和泛化能力。此外,与其他多个流行的竞争对手相比,所提出的改进策略使 LUD-YOLO 的整体性能最佳,为无人机物体检测提供了有效的解决方案,同时兼顾了模型大小和检测精度。
{"title":"LUD-YOLO: A novel lightweight object detection network for unmanned aerial vehicle","authors":"","doi":"10.1016/j.ins.2024.121366","DOIUrl":"10.1016/j.ins.2024.121366","url":null,"abstract":"<div><p>Autonomous execution of tasks by unmanned aerial vehicles (UAVs) relies heavily on object detection. However, object detection in most images presents challenges such as complex backgrounds, small targets, and obstructions. Additionally, the limited computing speed and memory of the UAV processor affects the accuracy of conventional object detection algorithms. This paper proposes LUD-You Only Look Once (YOLO), a small and lightweight object detection algorithm for UAVs based on YOLOv8. The proposed algorithm introduces a new multiscale feature fusion mode that solves the degradation in feature propagation and interaction through the introduction of upsampling in the feature pyramid network and the progressive feature pyramid network. The application of the dynamic sparse attention mechanism in the Cf2 module achieves flexible computing allocation and content awareness. Furthermore, the proposed model is optimized to be sparse and lightweight, making it possible to deploy on UAV edge devices. Finally, the effectiveness and superiority of LUD-YOLO were verified on the VisDrone2019 and UAVDT datasets. The results of ablation and comparison experiments show that compared with the original algorithm, LUDY-N and LUDY-S have shown excellent performance in various evaluation indexes, indicating that the proposed improvement strategies make the model have better robustness and generalization. Moreover, compared with multiple other popular competitors, the proposed improvement strategies enable LUD-YOLO to have the best overall performance, providing an effective solution for UAVs object detection while balancing model size and detection accuracy.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0020025524012805/pdfft?md5=d596a6997eac3036bbb68e709c00a107&pid=1-s2.0-S0020025524012805-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-15DOI: 10.1016/j.ins.2024.121348
The preference information of decision-makers (DMs) often cannot be explicitly expressed in complex decision-making environments. Therefore, to address decision-making problems with uncertain preference information, this paper proposes a method based on the prospect SMAA-2 model and extended cross-entropy in interval number environments. We first construct the prospect SMAA-2 model to simulate preference information. This model incorporates central risk-averse and risk-seeking factors, significantly enhancing the ability to identify alternatives. When DMs’ preference information is unknown or partially known, these factors can determine the appropriate level of risk-averse or risk-seeking for alternatives. Next, we devise an extended cross-entropy algorithm based on the continuous ordered weighted harmonic (C-OWH) averaging operator to handle interval numbers. Subsequently, a comprehensive algorithm is designed to derive the weights of DMs, taking into account the relationships among individuals as well as between individuals and the group. Furthermore, we construct the framework for the proposed method. Finally, the applicability of the developed method can be validated by an illustrative example. Comparative analysis is used to verify the rationality and superiority of this method.
{"title":"An interval number group decision-making method based on the prospect SMAA-2 model and extended cross-entropy","authors":"","doi":"10.1016/j.ins.2024.121348","DOIUrl":"10.1016/j.ins.2024.121348","url":null,"abstract":"<div><p>The preference information of decision-makers (DMs) often cannot be explicitly expressed in complex decision-making environments. Therefore, to address decision-making problems with uncertain preference information, this paper proposes a method based on the prospect SMAA-2 model and extended cross-entropy in interval number environments. We first construct the prospect SMAA-2 model to simulate preference information. This model incorporates central risk-averse and risk-seeking factors, significantly enhancing the ability to identify alternatives. When DMs’ preference information is unknown or partially known, these factors can determine the appropriate level of risk-averse or risk-seeking for alternatives. Next, we devise an extended cross-entropy algorithm based on the continuous ordered weighted harmonic (C-OWH) averaging operator to handle interval numbers. Subsequently, a comprehensive algorithm is designed to derive the weights of DMs, taking into account the relationships among individuals as well as between individuals and the group. Furthermore, we construct the framework for the proposed method. Finally, the applicability of the developed method can be validated by an illustrative example. Comparative analysis is used to verify the rationality and superiority of this method.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142012170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-15DOI: 10.1016/j.ins.2024.121355
In existing trust-aware collaborative filtering algorithms, each trust relationship between two users is usually represented by a real number, but such a number is neither sufficient to reflect the quantity of the trust relationship existing in the user’s mind nor easy to be given. This leads to the inaccuracy of the trust relationship and poor final recommendations. To solve this problem, we propose an approach to deduce interval-valued trust relationships from the given real-valued trust relationships, which enables the new trust relationships to optimally reflect the true trust relationships existing in users’ minds. The coming problem we face is how to fuse the interval-valued trust relationships and the real-valued ratings. Though most existing trust-aware collaborative filtering algorithms use matrix factorization to fuse the real-valued data, they are not capable of fusing interval-valued trust relationships and real-valued ratings. The reason is that the arithmetic operations on intervals and arithmetic operations on real numbers are different. Therefore, we proposed a novel interval-valued matrix factorization approach. After that, an interval-valued matrix factorization based trust-aware collaborative filtering (IMF_TCF) algorithm is designed. The experiments carried out with open datasets indicate that IMF_TCF achieves the best recommendation performance compared with the state-of-the-art algorithms.
{"title":"An interval-valued matrix factorization based trust-aware collaborative filtering algorithm for recommendation systems","authors":"","doi":"10.1016/j.ins.2024.121355","DOIUrl":"10.1016/j.ins.2024.121355","url":null,"abstract":"<div><p>In existing trust-aware collaborative filtering algorithms, each trust relationship between two users is usually represented by a real number, but such a number is neither sufficient to reflect the quantity of the trust relationship existing in the user’s mind nor easy to be given. This leads to the inaccuracy of the trust relationship and poor final recommendations. To solve this problem, we propose an approach to deduce interval-valued trust relationships from the given real-valued trust relationships, which enables the new trust relationships to optimally reflect the true trust relationships existing in users’ minds. The coming problem we face is how to fuse the interval-valued trust relationships and the real-valued ratings. Though most existing trust-aware collaborative filtering algorithms use matrix factorization to fuse the real-valued data, they are not capable of fusing interval-valued trust relationships and real-valued ratings. The reason is that the arithmetic operations on intervals and arithmetic operations on real numbers are different. Therefore, we proposed a novel interval-valued matrix factorization approach. After that, an interval-valued matrix factorization based trust-aware collaborative filtering (IMF_TCF) algorithm is designed. The experiments carried out with open datasets indicate that IMF_TCF achieves the best recommendation performance compared with the state-of-the-art algorithms.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-15DOI: 10.1016/j.ins.2024.121351
To benefit from the modeling capacity of deep models in system identification without worrying about inference time, this study presents a novel training strategy that uses deep models only during the training stage. For this purpose, two separate models with different structures and goals are employed. The first one is a deep generative model aiming at modeling the distribution of system output(s), called the teacher model, and the second one is a shallow basis function model, named the student model, fed by system input(s) to predict the system output(s). That means these isolated paths must reach the same ultimate target. As deep models show a great performance in modeling highly nonlinear systems, aligning the representation space learned by these two models makes the student model inherit the teacher model’s approximation power. The proposed objective function consists of the objective of each student and teacher model, adding up with a distance penalty between the learned latent representations. The simulation results on three nonlinear benchmarks show a comparative performance with examined deep architectures applied on the same benchmarks. Algorithmic transparency and structure efficiency are also achieved as byproducts.
{"title":"Exploiting the capacity of deep networks only at the training stage for nonlinear black-box system identification","authors":"","doi":"10.1016/j.ins.2024.121351","DOIUrl":"10.1016/j.ins.2024.121351","url":null,"abstract":"<div><p>To benefit from the modeling capacity of deep models in system identification without worrying about inference time, this study presents a novel training strategy that uses deep models only during the training stage. For this purpose, two separate models with different structures and goals are employed. The first one is a deep generative model aiming at modeling the distribution of system output(s), called the teacher model, and the second one is a shallow basis function model, named the student model, fed by system input(s) to predict the system output(s). That means these isolated paths must reach the same ultimate target. As deep models show a great performance in modeling highly nonlinear systems, aligning the representation space learned by these two models makes the student model inherit the teacher model’s approximation power. The proposed objective function consists of the objective of each student and teacher model, adding up with a distance penalty between the learned latent representations. The simulation results on three nonlinear benchmarks show a comparative performance with examined deep architectures applied on the same benchmarks. Algorithmic transparency and structure efficiency are also achieved as byproducts.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142006895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}