首页 > 最新文献

Information Sciences最新文献

英文 中文
Training feedforward neural networks with Bayesian hyper-heuristics 用贝叶斯超启发法训练前馈神经网络
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-21 DOI: 10.1016/j.ins.2024.121363

The process of training feedforward neural networks (FFNNs) can benefit from an automated process where the best heuristic to train the network is sought out automatically by means of a high-level probabilistic-based heuristic. This research introduces a novel population-based Bayesian hyper-heuristic (BHH) that is used to train feedforward neural networks (FFNNs). The performance of the BHH is compared to that of ten popular low-level heuristics, each with different search behaviours. The chosen heuristic pool consists of classic gradient-based heuristics as well as meta-heuristics (MHs). The empirical process is executed on fourteen datasets consisting of classification and regression problems with varying characteristics. The BHH is shown to be able to train FFNNs well and provide an automated method for finding the best heuristic to train the FFNNs at various stages of the training process.

训练前馈神经网络(FFNN)的过程可以受益于自动化过程,即通过基于高级概率的启发式自动寻找训练网络的最佳启发式。这项研究引入了一种新颖的基于群体的贝叶斯超启发式(BHH),用于训练前馈神经网络(FFNN)。BHH 的性能与十种流行的低级启发式进行了比较,每种启发式都有不同的搜索行为。所选启发式库包括基于梯度的经典启发式和元启发式(MH)。实证过程在 14 个数据集上执行,这些数据集包括具有不同特征的分类和回归问题。结果表明,BHH 能够很好地训练 FFNN,并提供了一种自动方法,用于在训练过程的不同阶段找到训练 FFNN 的最佳启发式。
{"title":"Training feedforward neural networks with Bayesian hyper-heuristics","authors":"","doi":"10.1016/j.ins.2024.121363","DOIUrl":"10.1016/j.ins.2024.121363","url":null,"abstract":"<div><p>The process of training <em>feedforward neural networks</em> (FFNNs) can benefit from an automated process where the best heuristic to train the network is sought out automatically by means of a high-level probabilistic-based heuristic. This research introduces a novel population-based <em>Bayesian hyper-heuristic</em> (BHH) that is used to train <em>feedforward neural networks</em> (FFNNs). The performance of the BHH is compared to that of ten popular low-level heuristics, each with different search behaviours. The chosen heuristic pool consists of classic gradient-based heuristics as well as <em>meta-heuristics</em> (MHs). The empirical process is executed on fourteen datasets consisting of classification and regression problems with varying characteristics. The BHH is shown to be able to train FFNNs well and provide an automated method for finding the best heuristic to train the FFNNs at various stages of the training process.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0020025524012775/pdfft?md5=1d463e50138859a6bcaad1358d8cf44d&pid=1-s2.0-S0020025524012775-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Smooth Ordered Weighted Averaging operators 平滑有序加权平均算子
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-20 DOI: 10.1016/j.ins.2024.121343

The study demonstrates the application of OWA operators to binary and multiclass classification problems and seeks a way to improve classification accuracy using smoothing methods. OWA operators are used to aggregate class membership probabilities obtained from individual classifiers. Smoothing methods inspired by Newton-Cotes quadratures are applied before the aggregation step to improve the quality of the final results. Moreover, several sets of weights are used for OWA operators, including sets of weights based on the accuracy of individual classifiers. The experiments are conducted on 20 datasets, from which 7 are designed for binary classification and the rest are for multiclass classification. A comparison of the average accuracy for different sets of weights is shown. On the basis of experimental results, smoothing methods that significantly improve classification accuracy are identified.

本研究展示了 OWA 运算符在二元和多类分类问题中的应用,并寻求一种利用平滑方法提高分类准确性的方法。OWA 算子用于汇总从单个分类器获得的类别成员概率。在聚合步骤之前,会应用受牛顿-科茨四次方法启发的平滑方法,以提高最终结果的质量。此外,OWA 运算符还使用了几组权重,包括基于单个分类器准确性的权重。实验在 20 个数据集上进行,其中 7 个数据集用于二元分类,其余数据集用于多分类。实验显示了不同权重集的平均准确率对比。根据实验结果,确定了能显著提高分类准确率的平滑方法。
{"title":"Smooth Ordered Weighted Averaging operators","authors":"","doi":"10.1016/j.ins.2024.121343","DOIUrl":"10.1016/j.ins.2024.121343","url":null,"abstract":"<div><p>The study demonstrates the application of OWA operators to binary and multiclass classification problems and seeks a way to improve classification accuracy using smoothing methods. OWA operators are used to aggregate class membership probabilities obtained from individual classifiers. Smoothing methods inspired by Newton-Cotes quadratures are applied before the aggregation step to improve the quality of the final results. Moreover, several sets of weights are used for OWA operators, including sets of weights based on the accuracy of individual classifiers. The experiments are conducted on 20 datasets, from which 7 are designed for binary classification and the rest are for multiclass classification. A comparison of the average accuracy for different sets of weights is shown. On the basis of experimental results, smoothing methods that significantly improve classification accuracy are identified.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142012038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ARFIS: An adaptive robust model for regression with heavy-tailed distribution ARFIS:用于重尾分布回归的自适应稳健模型
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-20 DOI: 10.1016/j.ins.2024.121344

As heavy-tailed distributions are ubiquitous in many real applications, robust regression has been extensively applied in machine learning and exhibits the superiority in deal with heavy-tailed distribution. Current existing robust methods for regression are based on independence assumption of features and ignore interaction between them, which may lead to poor generalization due to most datasets unsatisfying the assumption. Actually, interaction features formed by the composition of two or more features are particularly helpful to obtain the high-order information in many applications. In this paper, we propose a novel adaptive robust feature interaction selection model for regression with heavy-tailed distributions, termed Adaptive Robust Feature Interaction Selection (ARFIS). Firstly, we consider pairwise feature interaction by augmenting a feature vector with product of features for regression with heavy tailed distribution. Secondly, we propose feature interaction selection models based on quantile loss with different regularizers to learn parameters. The consistency of the proposed model ARFIS is theoretically proven, and an efficient algorithm is presented for solving proposed model. Finally, experimental results on simulation data, UCI datasets and a real-world dataset validate good accuracy, interpretability and robustness of our proposed models.

由于重尾分布在许多实际应用中无处不在,稳健回归已被广泛应用于机器学习中,并在处理重尾分布时表现出优越性。目前现有的鲁棒回归方法都是基于特征的独立性假设,忽略了特征之间的交互性,这可能会导致大多数数据集无法满足该假设,从而导致泛化效果不佳。实际上,由两个或多个特征组成的交互特征在很多应用中特别有助于获取高阶信息。本文提出了一种用于重尾分布回归的新型自适应鲁棒特征交互选择模型,称为自适应鲁棒特征交互选择(ARFIS)。首先,我们考虑了成对特征交互,即在重尾分布回归中用特征乘积增强特征向量。其次,我们提出了基于量化损失的特征交互选择模型,并使用不同的正则化器来学习参数。从理论上证明了所提模型 ARFIS 的一致性,并提出了求解所提模型的高效算法。最后,在仿真数据、UCI 数据集和现实世界数据集上的实验结果验证了我们提出的模型具有良好的准确性、可解释性和鲁棒性。
{"title":"ARFIS: An adaptive robust model for regression with heavy-tailed distribution","authors":"","doi":"10.1016/j.ins.2024.121344","DOIUrl":"10.1016/j.ins.2024.121344","url":null,"abstract":"<div><p>As heavy-tailed distributions are ubiquitous in many real applications, robust regression has been extensively applied in machine learning and exhibits the superiority in deal with heavy-tailed distribution. Current existing robust methods for regression are based on independence assumption of features and ignore interaction between them, which may lead to poor generalization due to most datasets unsatisfying the assumption. Actually, interaction features formed by the composition of two or more features are particularly helpful to obtain the high-order information in many applications. In this paper, we propose a novel adaptive robust feature interaction selection model for regression with heavy-tailed distributions, termed Adaptive Robust Feature Interaction Selection (ARFIS). Firstly, we consider pairwise feature interaction by augmenting a feature vector with product of features for regression with heavy tailed distribution. Secondly, we propose feature interaction selection models based on quantile loss with different regularizers to learn parameters. The consistency of the proposed model ARFIS is theoretically proven, and an efficient algorithm is presented for solving proposed model. Finally, experimental results on simulation data, UCI datasets and a real-world dataset validate good accuracy, interpretability and robustness of our proposed models.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142076827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attention-based fuzzy neural networks designed for early warning of financial crises of listed companies 基于注意力的模糊神经网络设计用于上市公司财务预警
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-19 DOI: 10.1016/j.ins.2024.121374

Developing an early warning model for company financial crises holds critical significance in robust risk management and ensuring the enduring stability of the capital market. Although the existing research has achieved rich results, the disadvantages of insufficient text information mining and poor model performance still exist. To alleviate the problem of insufficient text information mining, we collect related financial and annual report data from 820 listed companies in mainland China from 2018 to 2023 by using sophisticated web crawlers and advanced text sentiment analysis technologies and using missing value interpolation, standardization, and data balancing to build multi-source datasets of companies. Ranking the feature importance of multi-source data promotes understanding the formation of financial crises for companies. In the meantime, a novel Attention-based Fuzzy Neural Network (AFNN) was proposed to parse multi-source data to forecast financial crises among listed companies. Experimental results indicate that AFNN exhibits significantly improved performance compared to other advanced methods.

开发公司财务危机预警模型对于稳健的风险管理和确保资本市场的持久稳定具有重要意义。现有研究虽然取得了丰富的成果,但仍存在文本信息挖掘不足、模型性能较差等弊端。为缓解文本信息挖掘不足的问题,我们利用成熟的网络爬虫和先进的文本情感分析技术,收集了中国大陆820家上市公司2018年至2023年的相关财务数据和年报数据,并利用缺失值插值、标准化和数据均衡等方法建立了公司多源数据集。对多源数据的特征重要性进行排序,有助于理解企业财务危机的形成。同时,还提出了一种新颖的基于注意力的模糊神经网络(AFNN)来解析多源数据,以预测上市公司的财务危机。实验结果表明,与其他先进方法相比,AFNN 的性能显著提高。
{"title":"Attention-based fuzzy neural networks designed for early warning of financial crises of listed companies","authors":"","doi":"10.1016/j.ins.2024.121374","DOIUrl":"10.1016/j.ins.2024.121374","url":null,"abstract":"<div><p>Developing an early warning model for company financial crises holds critical significance in robust risk management and ensuring the enduring stability of the capital market. Although the existing research has achieved rich results, the disadvantages of insufficient text information mining and poor model performance still exist. To alleviate the problem of insufficient text information mining, we collect related financial and annual report data from 820 listed companies in mainland China from 2018 to 2023 by using sophisticated web crawlers and advanced text sentiment analysis technologies and using missing value interpolation, standardization, and data balancing to build multi-source datasets of companies. Ranking the feature importance of multi-source data promotes understanding the formation of financial crises for companies. In the meantime, a novel Attention-based Fuzzy Neural Network (AFNN) was proposed to parse multi-source data to forecast financial crises among listed companies. Experimental results indicate that AFNN exhibits significantly improved performance compared to other advanced methods.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142048911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MFCA: Collaborative prediction algorithm of brain age based on multimodal fuzzy feature fusion MFCA:基于多模态模糊特征融合的脑年龄协同预测算法
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-19 DOI: 10.1016/j.ins.2024.121376

Brain age gap can be estimated from brain images, serving as a valuable biomarker for aging-associated diseases, using deep neural networks. Traditional brain age prediction methods tend to rely on unimodal data. Multimodal data can provide more comprehensive information and improve prediction accuracy. However, existing multimodal fusion methods often fall short in fully leveraging the correlations and complementarities between different modalities. This paper introduces a novel multimodal fuzzy feature fusion collaborative prediction algorithm for brain age estimation (MFCA). The proposed approach integrates multiple imaging modalities using a fuzzy fusion module and a multimodal collaborative convolutional module to effectively leverage inter-modal correlations and complementary information. Specifically, a convolutional neural network is used to extract feature from multimodal brain images, which are then combined into a global feature tensor via radial joins. The fuzzy fusion module employs fuzzy theory to fuse the correlation features of different modalities, while the multimodal collaborative convolutional module enhances complementary information through modality-specific convolutional layers. Age prediction is then performed by an age prediction module containing three linear regression modules. Additionally, an optimized sorting contrast loss is incorporated to improve the accuracy of age prediction. The proposed method was evaluated on the SRPBS multi-disorder MRI dataset, and the experimental results demonstrate that MFCA achieves a mean absolute error of 5.661 and a Pearson correlation coefficient of 0.947, outperforming several state-of-the-art methods.

利用深度神经网络,可以从大脑图像中估算出脑年龄差距,从而作为衰老相关疾病的重要生物标志物。传统的脑年龄预测方法往往依赖于单模态数据。多模态数据可以提供更全面的信息,提高预测准确性。然而,现有的多模态融合方法往往不能充分利用不同模态之间的相关性和互补性。本文介绍了一种用于脑年龄估计的新型多模态模糊特征融合协同预测算法(MFCA)。所提出的方法利用模糊融合模块和多模态协作卷积模块整合了多种成像模态,以有效利用模态间的相关性和互补性信息。具体来说,卷积神经网络用于从多模态脑图像中提取特征,然后通过径向连接将其组合成全局特征张量。模糊融合模块采用模糊理论融合不同模态的相关特征,而多模态协同卷积模块则通过特定模态卷积层增强互补信息。年龄预测由包含三个线性回归模块的年龄预测模块完成。此外,还加入了优化的排序对比度损失,以提高年龄预测的准确性。实验结果表明,MFCA 的平均绝对误差为 5.661,皮尔逊相关系数为 0.947,优于几种最先进的方法。
{"title":"MFCA: Collaborative prediction algorithm of brain age based on multimodal fuzzy feature fusion","authors":"","doi":"10.1016/j.ins.2024.121376","DOIUrl":"10.1016/j.ins.2024.121376","url":null,"abstract":"<div><p>Brain age gap can be estimated from brain images, serving as a valuable biomarker for aging-associated diseases, using deep neural networks. Traditional brain age prediction methods tend to rely on unimodal data. Multimodal data can provide more comprehensive information and improve prediction accuracy. However, existing multimodal fusion methods often fall short in fully leveraging the correlations and complementarities between different modalities. This paper introduces a novel multimodal fuzzy feature fusion collaborative prediction algorithm for brain age estimation (MFCA). The proposed approach integrates multiple imaging modalities using a fuzzy fusion module and a multimodal collaborative convolutional module to effectively leverage inter-modal correlations and complementary information. Specifically, a convolutional neural network is used to extract feature from multimodal brain images, which are then combined into a global feature tensor via radial joins. The fuzzy fusion module employs fuzzy theory to fuse the correlation features of different modalities, while the multimodal collaborative convolutional module enhances complementary information through modality-specific convolutional layers. Age prediction is then performed by an age prediction module containing three linear regression modules. Additionally, an optimized sorting contrast loss is incorporated to improve the accuracy of age prediction. The proposed method was evaluated on the SRPBS multi-disorder MRI dataset, and the experimental results demonstrate that MFCA achieves a mean absolute error of 5.661 and a Pearson correlation coefficient of 0.947, outperforming several state-of-the-art methods.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multivariable adaptive decoupling control based on stochastic configuration networks with serial-parallel switching distribution 基于具有串行-并行开关分布的随机配置网络的多变量自适应解耦控制
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-18 DOI: 10.1016/j.ins.2024.121352

A multivariable adaptive decoupling control scheme is proposed based on stochastic configuration networks with serial-parallel switching distribution (SPSCN). Firstly, a linear controller is designed by combining a PID controller, feedback decoupling, and one-step optimal control. Secondly, a nonlinear controller is presented to deal with higher-order nonlinear terms and unknown external perturbations. SPSCN is used to improve the prediction accuracy of unmodeled dynamics. It combines uniform and normal search strategies in a serial-parallel fashion, aiming at improving the node quality and reducing the model complexity. The approximation performance of the SPSCN algorithm is demonstrated by performing approximation experiments with two functions and four benchmark datasets. Compared with the generalized minimum variance control (GMVC) algorithm in controlling the process of cement raw material decomposition, our proposed control scheme outperforms.

本文提出了一种基于串并联开关分布随机配置网络(SPSCN)的多变量自适应解耦控制方案。首先,结合 PID 控制器、反馈解耦和一步最优控制,设计了一个线性控制器。其次,提出了一种非线性控制器,以处理高阶非线性项和未知外部扰动。SPSCN 用于提高未建模动力学的预测精度。它以串行并行方式结合了均匀搜索和正常搜索策略,旨在提高节点质量并降低模型复杂度。通过对两个函数和四个基准数据集进行逼近实验,证明了 SPSCN 算法的逼近性能。与控制水泥生料分解过程的广义最小方差控制(GMVC)算法相比,我们提出的控制方案性能更优。
{"title":"Multivariable adaptive decoupling control based on stochastic configuration networks with serial-parallel switching distribution","authors":"","doi":"10.1016/j.ins.2024.121352","DOIUrl":"10.1016/j.ins.2024.121352","url":null,"abstract":"<div><p>A multivariable adaptive decoupling control scheme is proposed based on stochastic configuration networks with serial-parallel switching distribution (SPSCN). Firstly, a linear controller is designed by combining a PID controller, feedback decoupling, and one-step optimal control. Secondly, a nonlinear controller is presented to deal with higher-order nonlinear terms and unknown external perturbations. SPSCN is used to improve the prediction accuracy of unmodeled dynamics. It combines uniform and normal search strategies in a serial-parallel fashion, aiming at improving the node quality and reducing the model complexity. The approximation performance of the SPSCN algorithm is demonstrated by performing approximation experiments with two functions and four benchmark datasets. Compared with the generalized minimum variance control (GMVC) algorithm in controlling the process of cement raw material decomposition, our proposed control scheme outperforms.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LUD-YOLO: A novel lightweight object detection network for unmanned aerial vehicle LUD-YOLO:用于无人飞行器的新型轻量级物体探测网络
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-16 DOI: 10.1016/j.ins.2024.121366

Autonomous execution of tasks by unmanned aerial vehicles (UAVs) relies heavily on object detection. However, object detection in most images presents challenges such as complex backgrounds, small targets, and obstructions. Additionally, the limited computing speed and memory of the UAV processor affects the accuracy of conventional object detection algorithms. This paper proposes LUD-You Only Look Once (YOLO), a small and lightweight object detection algorithm for UAVs based on YOLOv8. The proposed algorithm introduces a new multiscale feature fusion mode that solves the degradation in feature propagation and interaction through the introduction of upsampling in the feature pyramid network and the progressive feature pyramid network. The application of the dynamic sparse attention mechanism in the Cf2 module achieves flexible computing allocation and content awareness. Furthermore, the proposed model is optimized to be sparse and lightweight, making it possible to deploy on UAV edge devices. Finally, the effectiveness and superiority of LUD-YOLO were verified on the VisDrone2019 and UAVDT datasets. The results of ablation and comparison experiments show that compared with the original algorithm, LUDY-N and LUDY-S have shown excellent performance in various evaluation indexes, indicating that the proposed improvement strategies make the model have better robustness and generalization. Moreover, compared with multiple other popular competitors, the proposed improvement strategies enable LUD-YOLO to have the best overall performance, providing an effective solution for UAVs object detection while balancing model size and detection accuracy.

无人飞行器(UAV)自主执行任务在很大程度上依赖于目标检测。然而,大多数图像中的物体检测都面临着复杂背景、小目标和障碍物等挑战。此外,无人飞行器处理器有限的计算速度和内存也影响了传统物体检测算法的准确性。本文提出了一种基于 YOLOv8 的小型轻量级无人机目标检测算法 LUD--You Only Look Once (YOLO)。该算法引入了新的多尺度特征融合模式,通过在特征金字塔网络和渐进式特征金字塔网络中引入上采样,解决了特征传播和交互中的退化问题。Cf2 模块中动态稀疏关注机制的应用实现了灵活的计算分配和内容感知。此外,所提出的模型经过优化,具有稀疏性和轻量级的特点,可以部署在无人机边缘设备上。最后,在 VisDrone2019 和 UAVDT 数据集上验证了 LUD-YOLO 的有效性和优越性。消融和对比实验结果表明,与原始算法相比,LUDY-N 和 LUDY-S 在各项评价指标上均表现优异,表明所提出的改进策略使模型具有更好的鲁棒性和泛化能力。此外,与其他多个流行的竞争对手相比,所提出的改进策略使 LUD-YOLO 的整体性能最佳,为无人机物体检测提供了有效的解决方案,同时兼顾了模型大小和检测精度。
{"title":"LUD-YOLO: A novel lightweight object detection network for unmanned aerial vehicle","authors":"","doi":"10.1016/j.ins.2024.121366","DOIUrl":"10.1016/j.ins.2024.121366","url":null,"abstract":"<div><p>Autonomous execution of tasks by unmanned aerial vehicles (UAVs) relies heavily on object detection. However, object detection in most images presents challenges such as complex backgrounds, small targets, and obstructions. Additionally, the limited computing speed and memory of the UAV processor affects the accuracy of conventional object detection algorithms. This paper proposes LUD-You Only Look Once (YOLO), a small and lightweight object detection algorithm for UAVs based on YOLOv8. The proposed algorithm introduces a new multiscale feature fusion mode that solves the degradation in feature propagation and interaction through the introduction of upsampling in the feature pyramid network and the progressive feature pyramid network. The application of the dynamic sparse attention mechanism in the Cf2 module achieves flexible computing allocation and content awareness. Furthermore, the proposed model is optimized to be sparse and lightweight, making it possible to deploy on UAV edge devices. Finally, the effectiveness and superiority of LUD-YOLO were verified on the VisDrone2019 and UAVDT datasets. The results of ablation and comparison experiments show that compared with the original algorithm, LUDY-N and LUDY-S have shown excellent performance in various evaluation indexes, indicating that the proposed improvement strategies make the model have better robustness and generalization. Moreover, compared with multiple other popular competitors, the proposed improvement strategies enable LUD-YOLO to have the best overall performance, providing an effective solution for UAVs object detection while balancing model size and detection accuracy.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0020025524012805/pdfft?md5=d596a6997eac3036bbb68e709c00a107&pid=1-s2.0-S0020025524012805-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An interval number group decision-making method based on the prospect SMAA-2 model and extended cross-entropy 基于前景 SMAA-2 模型和扩展交叉熵的区间数群体决策方法
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-15 DOI: 10.1016/j.ins.2024.121348

The preference information of decision-makers (DMs) often cannot be explicitly expressed in complex decision-making environments. Therefore, to address decision-making problems with uncertain preference information, this paper proposes a method based on the prospect SMAA-2 model and extended cross-entropy in interval number environments. We first construct the prospect SMAA-2 model to simulate preference information. This model incorporates central risk-averse and risk-seeking factors, significantly enhancing the ability to identify alternatives. When DMs’ preference information is unknown or partially known, these factors can determine the appropriate level of risk-averse or risk-seeking for alternatives. Next, we devise an extended cross-entropy algorithm based on the continuous ordered weighted harmonic (C-OWH) averaging operator to handle interval numbers. Subsequently, a comprehensive algorithm is designed to derive the weights of DMs, taking into account the relationships among individuals as well as between individuals and the group. Furthermore, we construct the framework for the proposed method. Finally, the applicability of the developed method can be validated by an illustrative example. Comparative analysis is used to verify the rationality and superiority of this method.

在复杂的决策环境中,决策者(DMs)的偏好信息往往无法明确表达。因此,为了解决偏好信息不确定的决策问题,本文提出了一种基于前景 SMAA-2 模型和区间数环境下扩展交叉熵的方法。我们首先构建了前景 SMAA-2 模型来模拟偏好信息。该模型包含了中心风险规避和风险寻求因素,大大提高了识别备选方案的能力。当管理者的偏好信息未知或部分已知时,这些因素可以决定备选方案的适当风险规避或风险寻求水平。接下来,我们设计了一种基于连续有序加权谐波(C-OWH)平均算子的扩展交叉熵算法来处理区间数。随后,考虑到个体之间以及个体与群体之间的关系,我们设计了一种综合算法来推导 DM 的权重。此外,我们还构建了建议方法的框架。最后,可以通过一个示例来验证所开发方法的适用性。比较分析用于验证该方法的合理性和优越性。
{"title":"An interval number group decision-making method based on the prospect SMAA-2 model and extended cross-entropy","authors":"","doi":"10.1016/j.ins.2024.121348","DOIUrl":"10.1016/j.ins.2024.121348","url":null,"abstract":"<div><p>The preference information of decision-makers (DMs) often cannot be explicitly expressed in complex decision-making environments. Therefore, to address decision-making problems with uncertain preference information, this paper proposes a method based on the prospect SMAA-2 model and extended cross-entropy in interval number environments. We first construct the prospect SMAA-2 model to simulate preference information. This model incorporates central risk-averse and risk-seeking factors, significantly enhancing the ability to identify alternatives. When DMs’ preference information is unknown or partially known, these factors can determine the appropriate level of risk-averse or risk-seeking for alternatives. Next, we devise an extended cross-entropy algorithm based on the continuous ordered weighted harmonic (C-OWH) averaging operator to handle interval numbers. Subsequently, a comprehensive algorithm is designed to derive the weights of DMs, taking into account the relationships among individuals as well as between individuals and the group. Furthermore, we construct the framework for the proposed method. Finally, the applicability of the developed method can be validated by an illustrative example. Comparative analysis is used to verify the rationality and superiority of this method.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142012170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An interval-valued matrix factorization based trust-aware collaborative filtering algorithm for recommendation systems 基于区间值矩阵因式分解的推荐系统信任感知协同过滤算法
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-15 DOI: 10.1016/j.ins.2024.121355

In existing trust-aware collaborative filtering algorithms, each trust relationship between two users is usually represented by a real number, but such a number is neither sufficient to reflect the quantity of the trust relationship existing in the user’s mind nor easy to be given. This leads to the inaccuracy of the trust relationship and poor final recommendations. To solve this problem, we propose an approach to deduce interval-valued trust relationships from the given real-valued trust relationships, which enables the new trust relationships to optimally reflect the true trust relationships existing in users’ minds. The coming problem we face is how to fuse the interval-valued trust relationships and the real-valued ratings. Though most existing trust-aware collaborative filtering algorithms use matrix factorization to fuse the real-valued data, they are not capable of fusing interval-valued trust relationships and real-valued ratings. The reason is that the arithmetic operations on intervals and arithmetic operations on real numbers are different. Therefore, we proposed a novel interval-valued matrix factorization approach. After that, an interval-valued matrix factorization based trust-aware collaborative filtering (IMF_TCF) algorithm is designed. The experiments carried out with open datasets indicate that IMF_TCF achieves the best recommendation performance compared with the state-of-the-art algorithms.

在现有的信任感知协同过滤算法中,两个用户之间的每一种信任关系通常用一个实数来表示,但这样的数字既不足以反映用户心中存在的信任关系的数量,也不容易给出。这就导致信任关系不准确,最终推荐效果不佳。为了解决这个问题,我们提出了一种从给定的实值信任关系中推导出区间值信任关系的方法,从而使新的信任关系能够最佳地反映用户心中存在的真实信任关系。我们即将面临的问题是如何融合区间值信任关系和实值评级。虽然现有的大多数信任感知协同过滤算法都使用矩阵因式分解来融合实值数据,但它们无法融合区间值信任关系和实值评级。原因在于区间的算术运算和实数的算术运算是不同的。因此,我们提出了一种新颖的区间值矩阵因式分解方法。随后,我们设计了一种基于区间值矩阵因式分解的信任感知协同过滤算法(IMF_TCF)。利用开放数据集进行的实验表明,与最先进的算法相比,IMF_TCF 实现了最佳的推荐性能。
{"title":"An interval-valued matrix factorization based trust-aware collaborative filtering algorithm for recommendation systems","authors":"","doi":"10.1016/j.ins.2024.121355","DOIUrl":"10.1016/j.ins.2024.121355","url":null,"abstract":"<div><p>In existing trust-aware collaborative filtering algorithms, each trust relationship between two users is usually represented by a real number, but such a number is neither sufficient to reflect the quantity of the trust relationship existing in the user’s mind nor easy to be given. This leads to the inaccuracy of the trust relationship and poor final recommendations. To solve this problem, we propose an approach to deduce interval-valued trust relationships from the given real-valued trust relationships, which enables the new trust relationships to optimally reflect the true trust relationships existing in users’ minds. The coming problem we face is how to fuse the interval-valued trust relationships and the real-valued ratings. Though most existing trust-aware collaborative filtering algorithms use matrix factorization to fuse the real-valued data, they are not capable of fusing interval-valued trust relationships and real-valued ratings. The reason is that the arithmetic operations on intervals and arithmetic operations on real numbers are different. Therefore, we proposed a novel interval-valued matrix factorization approach. After that, an interval-valued matrix factorization based trust-aware collaborative filtering (IMF_TCF) algorithm is designed. The experiments carried out with open datasets indicate that IMF_TCF achieves the best recommendation performance compared with the state-of-the-art algorithms.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploiting the capacity of deep networks only at the training stage for nonlinear black-box system identification 仅在非线性黑盒系统识别的训练阶段利用深度网络的能力
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-15 DOI: 10.1016/j.ins.2024.121351

To benefit from the modeling capacity of deep models in system identification without worrying about inference time, this study presents a novel training strategy that uses deep models only during the training stage. For this purpose, two separate models with different structures and goals are employed. The first one is a deep generative model aiming at modeling the distribution of system output(s), called the teacher model, and the second one is a shallow basis function model, named the student model, fed by system input(s) to predict the system output(s). That means these isolated paths must reach the same ultimate target. As deep models show a great performance in modeling highly nonlinear systems, aligning the representation space learned by these two models makes the student model inherit the teacher model’s approximation power. The proposed objective function consists of the objective of each student and teacher model, adding up with a distance penalty between the learned latent representations. The simulation results on three nonlinear benchmarks show a comparative performance with examined deep architectures applied on the same benchmarks. Algorithmic transparency and structure efficiency are also achieved as byproducts.

为了在系统识别中受益于深度模型的建模能力,而无需担心推理时间,本研究提出了一种新颖的训练策略,即仅在训练阶段使用深度模型。为此,采用了两个结构和目标不同的独立模型。第一个是深度生成模型,旨在模拟系统输出的分布,称为教师模型;第二个是浅基函数模型,称为学生模型,由系统输入来预测系统输出。这意味着这些孤立的路径必须达到相同的最终目标。由于深度模型在高度非线性系统建模方面表现出色,因此将这两个模型学习到的表示空间对齐,可以使学生模型继承教师模型的近似能力。所提出的目标函数包括每个学生模型和教师模型的目标,再加上所学潜表征之间的距离惩罚。在三个非线性基准上的模拟结果显示,在相同基准上应用的深度架构与经过检验的深度架构性能相当。作为副产品,还实现了算法透明度和结构效率。
{"title":"Exploiting the capacity of deep networks only at the training stage for nonlinear black-box system identification","authors":"","doi":"10.1016/j.ins.2024.121351","DOIUrl":"10.1016/j.ins.2024.121351","url":null,"abstract":"<div><p>To benefit from the modeling capacity of deep models in system identification without worrying about inference time, this study presents a novel training strategy that uses deep models only during the training stage. For this purpose, two separate models with different structures and goals are employed. The first one is a deep generative model aiming at modeling the distribution of system output(s), called the teacher model, and the second one is a shallow basis function model, named the student model, fed by system input(s) to predict the system output(s). That means these isolated paths must reach the same ultimate target. As deep models show a great performance in modeling highly nonlinear systems, aligning the representation space learned by these two models makes the student model inherit the teacher model’s approximation power. The proposed objective function consists of the objective of each student and teacher model, adding up with a distance penalty between the learned latent representations. The simulation results on three nonlinear benchmarks show a comparative performance with examined deep architectures applied on the same benchmarks. Algorithmic transparency and structure efficiency are also achieved as byproducts.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142006895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1