首页 > 最新文献

Information Sciences最新文献

英文 中文
Observer-based reinforcement learning for optimal fault-tolerant consensus control of nonlinear multi-agent systems via a dynamic event-triggered mechanism 基于观察者的强化学习,通过动态事件触发机制实现非线性多代理系统的最佳容错共识控制
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-22 DOI: 10.1016/j.ins.2024.121350

In this article, an adaptive optimized consensus tracking control problem is studied for nonlinear strict-feedback dynamic multi-agent systems (MASs), considering both unmeasurable system states and time-varying bias faults. By utilizing the backstepping technique, we develop an adaptive reinforcement learning (RL) algorithm within the observer-critic-actor architecture, specially designed to compensate for the lack of state information and derive control inputs, thereby achieving approximate optimal control. Moreover, an event-triggered mechanism is introduced in the sensor-to-controller channel, which dynamically adjusts the triggering threshold online and employs event-sampled states to initiate control actions. To address discontinuities caused by state triggering, we construct virtual controllers that continuously sample state signals and reconfigure the actual controller based on previously triggered states. The outputs of the MASs are shown to accurately track the desired reference signals while ensuring the boundedness of all closed-loop signals. Additionally, the proposed controller is verified to be devoid of Zeno behavior. Finally, the effectiveness of our control methodology is demonstrated through numerical simulation.

本文研究了非线性严格反馈动态多代理系统(MAS)的自适应优化共识跟踪控制问题,同时考虑了不可测量的系统状态和时变偏差故障。通过利用反向步进技术,我们在观测器-批判-代理架构内开发了一种自适应强化学习(RL)算法,专门用于补偿状态信息的缺乏和推导控制输入,从而实现近似最优控制。此外,在传感器到控制器的通道中引入了事件触发机制,可在线动态调整触发阈值,并利用事件采样状态启动控制行动。为了解决状态触发引起的不连续性问题,我们构建了虚拟控制器,持续采样状态信号,并根据先前触发的状态重新配置实际控制器。结果表明,MAS 的输出能准确跟踪所需的参考信号,同时确保所有闭环信号的有界性。此外,还验证了所提出的控制器不存在 Zeno 行为。最后,我们通过数值模拟证明了控制方法的有效性。
{"title":"Observer-based reinforcement learning for optimal fault-tolerant consensus control of nonlinear multi-agent systems via a dynamic event-triggered mechanism","authors":"","doi":"10.1016/j.ins.2024.121350","DOIUrl":"10.1016/j.ins.2024.121350","url":null,"abstract":"<div><p>In this article, an adaptive optimized consensus tracking control problem is studied for nonlinear strict-feedback dynamic multi-agent systems (MASs), considering both unmeasurable system states and time-varying bias faults. By utilizing the backstepping technique, we develop an adaptive reinforcement learning (RL) algorithm within the observer-critic-actor architecture, specially designed to compensate for the lack of state information and derive control inputs, thereby achieving approximate optimal control. Moreover, an event-triggered mechanism is introduced in the sensor-to-controller channel, which dynamically adjusts the triggering threshold online and employs event-sampled states to initiate control actions. To address discontinuities caused by state triggering, we construct virtual controllers that continuously sample state signals and reconfigure the actual controller based on previously triggered states. The outputs of the MASs are shown to accurately track the desired reference signals while ensuring the boundedness of all closed-loop signals. Additionally, the proposed controller is verified to be devoid of Zeno behavior. Finally, the effectiveness of our control methodology is demonstrated through numerical simulation.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regression trees for fast and adaptive prediction intervals 用于快速自适应预测区间的回归树
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-22 DOI: 10.1016/j.ins.2024.121369

In predictive modeling, quantifying prediction uncertainty is crucial for reliable decision-making. Traditional conformal inference methods provide marginally valid predictive regions but often produce non-adaptive intervals when naively applied to regression, potentially biasing applications. Recent advances using quantile regressors or conditional density estimators improve adaptability but are typically tied to specific prediction models, limiting their ability to quantify uncertainty around arbitrary models. Similarly, methods based on partitioning the feature space adopt sub-optimal strategies, failing to consistently measure predictive uncertainty across the feature space, especially in adversarial examples. This paper introduces a model-agnostic family of methods to calibrate prediction intervals for regression with local coverage guarantees. By leveraging regression trees and Random Forests, our approach constructs data-adaptive partitions of the feature space to approximate conditional coverage, enhancing the accuracy and scalability of prediction intervals. Our methods outperform established benchmarks on simulated and real-world datasets. They are implemented in the Python package clover, which integrates seamlessly with the scikit-learn interface for practical application.

在预测建模中,量化预测的不确定性对于可靠的决策至关重要。传统的保角推理方法可提供略微有效的预测区域,但在简单地应用于回归时,往往会产生非适应性区间,从而可能使应用产生偏差。使用量化回归或条件密度估算器的最新进展提高了适应性,但这些方法通常与特定的预测模型相绑定,限制了它们量化任意模型不确定性的能力。同样,基于特征空间划分的方法也采用了次优策略,无法一致地测量整个特征空间的预测不确定性,尤其是在对抗性示例中。本文介绍了一系列与模型无关的方法,用于校准具有局部覆盖保证的回归预测区间。通过利用回归树和随机森林,我们的方法构建了特征空间的数据自适应分区来近似条件覆盖率,从而提高了预测区间的准确性和可扩展性。我们的方法在模拟和实际数据集上的表现优于既定基准。这些方法是在 Python 软件包 clover 中实现的,它与 scikit-learn 界面无缝集成,便于实际应用。
{"title":"Regression trees for fast and adaptive prediction intervals","authors":"","doi":"10.1016/j.ins.2024.121369","DOIUrl":"10.1016/j.ins.2024.121369","url":null,"abstract":"<div><p>In predictive modeling, quantifying prediction uncertainty is crucial for reliable decision-making. Traditional conformal inference methods provide marginally valid predictive regions but often produce non-adaptive intervals when naively applied to regression, potentially biasing applications. Recent advances using quantile regressors or conditional density estimators improve adaptability but are typically tied to specific prediction models, limiting their ability to quantify uncertainty around arbitrary models. Similarly, methods based on partitioning the feature space adopt sub-optimal strategies, failing to consistently measure predictive uncertainty across the feature space, especially in adversarial examples. This paper introduces a model-agnostic family of methods to calibrate prediction intervals for regression with local coverage guarantees. By leveraging regression trees and Random Forests, our approach constructs data-adaptive partitions of the feature space to approximate conditional coverage, enhancing the accuracy and scalability of prediction intervals. Our methods outperform established benchmarks on simulated and real-world datasets. They are implemented in the Python package <span>clover</span>, which integrates seamlessly with the <em>scikit-learn</em> interface for practical application.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142084047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FRRI: A novel algorithm for fuzzy-rough rule induction FRRI:一种新颖的模糊粗糙规则归纳算法
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-22 DOI: 10.1016/j.ins.2024.121362

Interpretability is the next frontier in machine learning research. In the search for white box models — as opposed to black box models, like random forests or neural networks — rule induction algorithms are a logical and promising option, since the rules can easily be understood by humans. Fuzzy and rough set theory have been successfully applied to this archetype, almost always separately. As both approaches offer different ways to deal with imprecise and uncertain information, often with the use of an indiscernibility relation, it is natural to combine them. The QuickRules [20] algorithm was a first attempt at using fuzzy rough set theory for rule induction. It is based on QuickReduct, a greedy algorithm for building decision superreducts. QuickRules already showed an improvement over other rule induction methods. However, to evaluate the full potential of a fuzzy rough rule induction algorithm, one needs to start from the foundations. Accordingly, the novel rule induction algorithm, Fuzzy Rough Rule Induction (FRRI), we introduce in this paper, uses an approach that has not yet been utilised in this setting. We provide background and explain the workings of our algorithm. Furthermore, we perform a computational experiment to evaluate the performance of our algorithm and compare it to other state-of-the-art rule induction approaches. We find that our algorithm is more accurate while creating small rulesets consisting of relatively short rules.

可解释性是机器学习研究的下一个前沿领域。相对于随机森林或神经网络等黑盒子模型,在寻找白盒子模型的过程中,规则归纳算法是一个合乎逻辑且前景广阔的选择,因为规则很容易被人类理解。模糊理论和粗糙集理论已成功应用于这一原型,但几乎都是单独应用。由于这两种方法都提供了处理不精确和不确定信息的不同方法,通常还使用了不可辨认关系,因此将它们结合起来是很自然的。QuickRules [20] 算法是利用模糊粗糙集理论进行规则归纳的首次尝试。它基于 QuickReduct 算法,这是一种用于建立决策超归纳的贪婪算法。与其他规则归纳法相比,QuickRules 已经显示出了进步。然而,要评估模糊粗糙规则归纳算法的全部潜力,我们需要从基础开始。因此,我们在本文中介绍的新型规则归纳算法--模糊粗糙规则归纳(FRRI),采用了一种尚未在此环境中使用过的方法。我们提供了背景资料,并解释了我们算法的工作原理。此外,我们还进行了计算实验,以评估我们算法的性能,并将其与其他最先进的规则归纳方法进行比较。我们发现,在创建由相对较短的规则组成的小型规则集时,我们的算法更为准确。
{"title":"FRRI: A novel algorithm for fuzzy-rough rule induction","authors":"","doi":"10.1016/j.ins.2024.121362","DOIUrl":"10.1016/j.ins.2024.121362","url":null,"abstract":"<div><p>Interpretability is the next frontier in machine learning research. In the search for white box models — as opposed to black box models, like random forests or neural networks — rule induction algorithms are a logical and promising option, since the rules can easily be understood by humans. Fuzzy and rough set theory have been successfully applied to this archetype, almost always separately. As both approaches offer different ways to deal with imprecise and uncertain information, often with the use of an indiscernibility relation, it is natural to combine them. The QuickRules <span><span>[20]</span></span> algorithm was a first attempt at using fuzzy rough set theory for rule induction. It is based on QuickReduct, a greedy algorithm for building decision superreducts. QuickRules already showed an improvement over other rule induction methods. However, to evaluate the full potential of a fuzzy rough rule induction algorithm, one needs to start from the foundations. Accordingly, the novel rule induction algorithm, Fuzzy Rough Rule Induction (FRRI), we introduce in this paper, uses an approach that has not yet been utilised in this setting. We provide background and explain the workings of our algorithm. Furthermore, we perform a computational experiment to evaluate the performance of our algorithm and compare it to other state-of-the-art rule induction approaches. We find that our algorithm is more accurate while creating small rulesets consisting of relatively short rules.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142048914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HardVD: High-capacity cross-modal adversarial reprogramming for data-efficient vulnerability detection HardVD:高容量跨模式对抗重编程,实现数据高效漏洞检测
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-22 DOI: 10.1016/j.ins.2024.121370

The substantial proliferation of software vulnerabilities poses a persistent threat to system security, driving increased interest in applying deep learning (DL) for vulnerability detection. However, DL-based detectors often operate with a fixed number of input tokens, leading to semantic loss over large code snippets. Additionally, developing these detectors demands substantial labeled data and training time. To address these limitations, this paper proposes HardVD, which explores High-capacity cross-modal adversarial reprogramming for data-efficient Vulnerability Detection. HardVD devises a high-capacity semantic extractor to capture salient features per line of code, which are then arranged as patches to form an image representing the target function. These images are processed using convolutional filters as universal perturbations and non-parametric label remapping to adapt a pretrained Vision Transformer (ViT) for vulnerability detection, updating only the limited parameters of the perturbation filters during training. Extensive experiments demonstrate that HardVD outperforms DL-based baselines in terms of detection effectiveness, data-limited performance, and computational overhead. The ablation study also confirms the essential role of our high-capacity semantic extractor, without which an averaged relative decrease of 5.87% and 7.98% in accuracy and F1 score is observed, respectively.

软件漏洞的大量涌现对系统安全构成了持续威胁,促使人们对应用深度学习(DL)进行漏洞检测的兴趣与日俱增。然而,基于深度学习的检测器通常使用固定数量的输入标记进行操作,导致大量代码片段的语义损失。此外,开发这些检测器需要大量的标记数据和训练时间。为了解决这些局限性,本文提出了 HardVD,探索高容量跨模式对抗重编程,以实现数据高效的漏洞检测。HardVD 设计了一种大容量语义提取器,用于捕捉每行代码的显著特征,然后将这些特征排列成补丁,形成代表目标函数的图像。在处理这些图像时,使用卷积滤波器作为通用扰动和非参数标签重映射,以调整用于漏洞检测的预训练视觉变换器(ViT),在训练过程中只更新扰动滤波器的有限参数。大量实验证明,HardVD 在检测效果、数据限制性能和计算开销方面都优于基于 DL 的基线。消融研究还证实了我们的大容量语义提取器的重要作用,如果没有它,准确率和 F1 分数的平均相对降幅分别为 5.87% 和 7.98%。
{"title":"HardVD: High-capacity cross-modal adversarial reprogramming for data-efficient vulnerability detection","authors":"","doi":"10.1016/j.ins.2024.121370","DOIUrl":"10.1016/j.ins.2024.121370","url":null,"abstract":"<div><p>The substantial proliferation of software vulnerabilities poses a persistent threat to system security, driving increased interest in applying deep learning (DL) for vulnerability detection. However, DL-based detectors often operate with a fixed number of input tokens, leading to semantic loss over large code snippets. Additionally, developing these detectors demands substantial labeled data and training time. To address these limitations, this paper proposes HardVD, which explores <u>H</u>igh-capacity cross-modal <u>a</u>dversarial <u>r</u>eprogramming for <u>d</u>ata-efficient <u>V</u>ulnerability <u>D</u>etection. HardVD devises a high-capacity semantic extractor to capture salient features per line of code, which are then arranged as patches to form an image representing the target function. These images are processed using convolutional filters as universal perturbations and non-parametric label remapping to adapt a pretrained Vision Transformer (ViT) for vulnerability detection, updating only the limited parameters of the perturbation filters during training. Extensive experiments demonstrate that HardVD outperforms DL-based baselines in terms of detection effectiveness, data-limited performance, and computational overhead. The ablation study also confirms the essential role of our high-capacity semantic extractor, without which an averaged relative decrease of 5.87% and 7.98% in accuracy and F1 score is observed, respectively.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142076830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A two-stage accelerated search strategy for large-scale multi-objective evolutionary algorithm 大规模多目标进化算法的两阶段加速搜索策略
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-22 DOI: 10.1016/j.ins.2024.121347

Since large-scale multi-objective problems (LSMOPs) have huge decision variables, the traditional evolutionary algorithms are facing difficulties of low exploitation efficiency and high exploration costs in solving LSMOPs. Therefore, this paper proposes an evolutionary strategy based on two-stage accelerated search optimizers (ATAES). Specifically, a convergence optimizer is devised in the first stage, while a three-layer lightweight convolutional neural network model is built, and the population is homogenized into two subsets, the diversity subset, and the convergence subset, which serve as input nodes and the expected output nodes of the neural network, respectively. Then, by constantly backpropagating the gradient, a satisfactory individual will be produced. Once exploitation stagnation is discovered in the first phase, the second phase will be run, where a diversity optimizer using a differential optimization algorithm with opposite learning is suggested to increase the exploration range of candidate solutions and thereby increase the population's diversity. Finally, to validate the algorithm's performance, on multi-objective LSMOP and DTLZ benchmark suits with decision variable quantities of 100, 300, 500, and 1000, the ATAES demonstrated its superiority with other advanced multi-objective evolutionary algorithms.

由于大规模多目标问题(LSMOPs)具有巨大的决策变量,传统的进化算法在求解 LSMOPs 时面临着利用效率低、探索成本高的难题。因此,本文提出了一种基于两阶段加速搜索优化器(ATAES)的进化策略。具体来说,第一阶段设计了一个收敛优化器,同时建立了一个三层轻量级卷积神经网络模型,并将种群均质化为两个子集,即多样性子集和收敛子集,分别作为神经网络的输入节点和预期输出节点。然后,通过不断反向传播梯度,就会产生一个令人满意的个体。一旦在第一阶段发现开发停滞,就会运行第二阶段,在第二阶段,建议使用具有相反学习的差分优化算法的多样性优化器来增加候选解的探索范围,从而增加种群的多样性。最后,为了验证该算法的性能,在决策变量数量为 100、300、500 和 1000 的多目标 LSMOP 和 DTLZ 基准套件上,ATAES 展示了其优于其他先进多目标进化算法的性能。
{"title":"A two-stage accelerated search strategy for large-scale multi-objective evolutionary algorithm","authors":"","doi":"10.1016/j.ins.2024.121347","DOIUrl":"10.1016/j.ins.2024.121347","url":null,"abstract":"<div><p>Since large-scale multi-objective problems (LSMOPs) have huge decision variables, the traditional evolutionary algorithms are facing difficulties of low exploitation efficiency and high exploration costs in solving LSMOPs. Therefore, this paper proposes an evolutionary strategy based on two-stage accelerated search optimizers (ATAES). Specifically, a convergence optimizer is devised in the first stage, while a three-layer lightweight convolutional neural network model is built, and the population is homogenized into two subsets, the diversity subset, and the convergence subset, which serve as input nodes and the expected output nodes of the neural network, respectively. Then, by constantly backpropagating the gradient, a satisfactory individual will be produced. Once exploitation stagnation is discovered in the first phase, the second phase will be run, where a diversity optimizer using a differential optimization algorithm with opposite learning is suggested to increase the exploration range of candidate solutions and thereby increase the population's diversity. Finally, to validate the algorithm's performance, on multi-objective LSMOP and DTLZ benchmark suits with decision variable quantities of 100, 300, 500, and 1000, the ATAES demonstrated its superiority with other advanced multi-objective evolutionary algorithms.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancing continual lifelong learning in neural information retrieval: Definition, dataset, framework, and empirical evaluation 推进神经信息检索的终身学习:定义、数据集、框架和实证评估
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-22 DOI: 10.1016/j.ins.2024.121368

Continual learning refers to the capability of a machine learning model to learn and adapt to new information, without compromising its performance on previously learned tasks. Although several studies have investigated continual learning methods for neural information retrieval (NIR) tasks, a well-defined task definition is still lacking, and it is unclear how typical learning strategies perform in this context. To address this challenge, a systematic task definition of continual NIR is presented, along with a multiple-topic dataset that simulates continuous information retrieval. A comprehensive continual neural information retrieval framework consisting of typical retrieval models and continual learning strategies is then proposed. Empirical evaluations illustrate that the proposed framework can successfully prevent catastrophic forgetting in neural information retrieval and enhance performance on previously learned tasks. The results also indicate that embedding-based retrieval models experience a decline in their continual learning performance as the topic shift distance and dataset volume of new tasks increase. In contrast, pretraining-based models do not show any such correlation. Adopting suitable learning strategies can mitigate the effects of topic shift and data augmentation in continual neural information retrieval.

持续学习指的是机器学习模型学习和适应新信息的能力,而不会影响其在先前学习任务中的表现。虽然已有多项研究对神经信息检索(NIR)任务的持续学习方法进行了调查,但目前仍缺乏一个明确的任务定义,也不清楚典型的学习策略在这种情况下的表现如何。为了应对这一挑战,本文介绍了持续神经信息检索的系统任务定义,以及模拟持续信息检索的多主题数据集。然后提出了一个由典型检索模型和持续学习策略组成的综合持续神经信息检索框架。经验评估表明,所提出的框架可以成功地防止神经信息检索中的灾难性遗忘,并提高先前所学任务的性能。结果还表明,随着新任务的主题转移距离和数据集数量的增加,基于嵌入的检索模型的持续学习性能会下降。相比之下,基于预训练的模型则没有表现出这种相关性。采用合适的学习策略可以减轻持续神经信息检索中主题转移和数据增加的影响。
{"title":"Advancing continual lifelong learning in neural information retrieval: Definition, dataset, framework, and empirical evaluation","authors":"","doi":"10.1016/j.ins.2024.121368","DOIUrl":"10.1016/j.ins.2024.121368","url":null,"abstract":"<div><p>Continual learning refers to the capability of a machine learning model to learn and adapt to new information, without compromising its performance on previously learned tasks. Although several studies have investigated continual learning methods for neural information retrieval (NIR) tasks, a well-defined task definition is still lacking, and it is unclear how typical learning strategies perform in this context. To address this challenge, a systematic task definition of continual NIR is presented, along with a multiple-topic dataset that simulates continuous information retrieval. A comprehensive continual neural information retrieval framework consisting of typical retrieval models and continual learning strategies is then proposed. Empirical evaluations illustrate that the proposed framework can successfully prevent catastrophic forgetting in neural information retrieval and enhance performance on previously learned tasks. The results also indicate that embedding-based retrieval models experience a decline in their continual learning performance as the topic shift distance and dataset volume of new tasks increase. In contrast, pretraining-based models do not show any such correlation. Adopting suitable learning strategies can mitigate the effects of topic shift and data augmentation in continual neural information retrieval.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0020025524012829/pdfft?md5=4615ff1900c782ca16ffa814bfc2c2dc&pid=1-s2.0-S0020025524012829-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrastive prototype network with prototype augmentation for few-shot classification 采用原型增强技术的对比原型网络,适用于少镜头分类
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-22 DOI: 10.1016/j.ins.2024.121372

In recent years, metric-based meta-learning methods have received widespread attention because of their effectiveness in solving few-shot classification problems. However, the scarcity of data frequently results in suboptimal embeddings, causing a discrepancy between anticipated class prototypes and those derived from the support set. These problems severely limit the generalizability of such methods, necessitating further development of Few-Shot Learning (FSL). In this study, we propose the Contrastive Prototype Network (CPN) consisting of three components: (1) Contrastive learning proposed as an auxiliary path to reduce the distance between homogeneous samples and amplify the differences between heterogeneous samples, thereby enhancing the effectiveness and quality of embeddings; (2) A pseudo-prototype strategy proposed to address the bias in prototypes, whereby the pseudo prototypes generated using query set samples are integrated with the initial prototypes to obtain more representative prototypes; (3) A new data augmentation technique, mixupPatch, introduced to alleviate the issue of insufficient data samples, whereby enhanced images are generated by blending the images and labels from different samples, to increase the number of samples. Extensive experiments and ablation studies conducted on five datasets demonstrated that CPN achieves robust results against recent solutions.

近年来,基于度量的元学习方法因其在解决少量分类问题方面的有效性而受到广泛关注。然而,数据的稀缺性经常会导致次优嵌入,造成预期的类原型与从支持集得出的类原型之间的差异。这些问题严重限制了此类方法的通用性,因此有必要进一步开发少点学习(FSL)。在本研究中,我们提出了由三个部分组成的对比原型网络(CPN):(1) 提出对比学习作为辅助路径,以缩小同质样本之间的距离,放大异质样本之间的差异,从而提高嵌入的效果和质量;(2) 提出伪原型策略以解决原型的偏差问题,即利用查询集样本生成的伪原型与初始原型进行整合,以获得更具代表性的原型;(3) 引入新的数据增强技术 mixupPatch,以缓解数据样本不足的问题,即通过混合不同样本的图像和标签生成增强图像,从而增加样本数量。在五个数据集上进行的大量实验和消融研究表明,与最新的解决方案相比,CPN 取得了稳健的结果。
{"title":"Contrastive prototype network with prototype augmentation for few-shot classification","authors":"","doi":"10.1016/j.ins.2024.121372","DOIUrl":"10.1016/j.ins.2024.121372","url":null,"abstract":"<div><p>In recent years, metric-based meta-learning methods have received widespread attention because of their effectiveness in solving few-shot classification problems. However, the scarcity of data frequently results in suboptimal embeddings, causing a discrepancy between anticipated class prototypes and those derived from the support set. These problems severely limit the generalizability of such methods, necessitating further development of Few-Shot Learning (FSL). In this study, we propose the Contrastive Prototype Network (CPN) consisting of three components: (1) Contrastive learning proposed as an auxiliary path to reduce the distance between homogeneous samples and amplify the differences between heterogeneous samples, thereby enhancing the effectiveness and quality of embeddings; (2) A pseudo-prototype strategy proposed to address the bias in prototypes, whereby the pseudo prototypes generated using query set samples are integrated with the initial prototypes to obtain more representative prototypes; (3) A new data augmentation technique, mixupPatch, introduced to alleviate the issue of insufficient data samples, whereby enhanced images are generated by blending the images and labels from different samples, to increase the number of samples. Extensive experiments and ablation studies conducted on five datasets demonstrated that CPN achieves robust results against recent solutions.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142084041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hierarchical algorithm with randomized learning for robust tissue segmentation and classification in digital pathology 用于数字病理学中稳健组织分割和分类的随机学习分层算法
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-22 DOI: 10.1016/j.ins.2024.121358

Highly detailed and accurate segmentation and classification of images constitutes an important class of tasks in computer vision. Typical “universal” domain-agnostic methods are known to suffer from instabilities and are prone to adversarial perturbations. Natural heterogeneity inherent in biological tissue structures complicates the interpretation of images even by trained physicians. Yet, algorithms in the medical domain require a high level of stability and interpretability to ensure their adoption by clinical experts and acceptance in clinical decision-making. In this work, we propose a novel method for segmentation and classification to address these challenges. The method is based on a hierarchical approach and biologically-informed feature extraction. The method's technical pipeline includes the automatic extraction of key biologically-informed features typically considered by physicians. This is followed by image classification using these features. Both stages rely on randomized ML techniques. The proposed hierarchical biomedically-informed approach significantly improved the image classification quality compared to the baseline solution of image classification in the task of colorectal cancer (CRC) analysis. The average F1-score for the four tissue types increased from 0.737 to 0.956. Using tumor tissue classification task as an example, we showed that the proposed algorithm offers an effective and practical avenue to solve these challenging issues.

高精细、高精度的图像分割和分类是计算机视觉领域的一类重要任务。众所周知,典型的 "通用 "领域识别方法存在不稳定性,容易受到对抗性扰动的影响。生物组织结构中固有的天然异质性使得即使是训练有素的医生也很难解读图像。然而,医学领域的算法需要高度的稳定性和可解释性,以确保其被临床专家采用并在临床决策中得到认可。在这项工作中,我们提出了一种新的分割和分类方法来应对这些挑战。该方法基于分层方法和生物特征提取。该方法的技术流程包括自动提取医生通常会考虑的关键生物特征。然后利用这些特征进行图像分类。这两个阶段都依赖于随机化的 ML 技术。在结直肠癌(CRC)分析任务中,与图像分类基线解决方案相比,所提出的分层生物信息方法显著提高了图像分类质量。四种组织类型的平均 F1 分数从 0.737 提高到 0.956。以肿瘤组织分类任务为例,我们表明所提出的算法为解决这些具有挑战性的问题提供了一种有效而实用的途径。
{"title":"A hierarchical algorithm with randomized learning for robust tissue segmentation and classification in digital pathology","authors":"","doi":"10.1016/j.ins.2024.121358","DOIUrl":"10.1016/j.ins.2024.121358","url":null,"abstract":"<div><p>Highly detailed and accurate segmentation and classification of images constitutes an important class of tasks in computer vision. Typical “universal” domain-agnostic methods are known to suffer from instabilities and are prone to adversarial perturbations. Natural heterogeneity inherent in biological tissue structures complicates the interpretation of images even by trained physicians. Yet, algorithms in the medical domain require a high level of stability and interpretability to ensure their adoption by clinical experts and acceptance in clinical decision-making. In this work, we propose a novel method for segmentation and classification to address these challenges. The method is based on a hierarchical approach and biologically-informed feature extraction. The method's technical pipeline includes the automatic extraction of key biologically-informed features typically considered by physicians. This is followed by image classification using these features. Both stages rely on randomized ML techniques. The proposed hierarchical biomedically-informed approach significantly improved the image classification quality compared to the baseline solution of image classification in the task of colorectal cancer (CRC) analysis. The average F1-score for the four tissue types increased from 0.737 to 0.956. Using tumor tissue classification task as an example, we showed that the proposed algorithm offers an effective and practical avenue to solve these challenging issues.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142048912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid deep recurrent artificial neural network with a simple exponential smoothing feedback mechanism 具有简单指数平滑反馈机制的混合深度递归人工神经网络
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-22 DOI: 10.1016/j.ins.2024.121356

Both classical forecasting methods and machine learning approaches are used to solve forecasting problems. Deep artificial neural networks, one of the machine learning methods, are widely used today and give very good results. Recurrent neural networks, a type of deep neural network, are very important in forecasting problems. Simple recurrent artificial neural networks, which are the simplest deep recurrent neural networks, are often preferred in solving forecasting problems due to the small number of parameters they use. Simple exponential smoothing, one of the classical forecasting methods, attracts attention with its performance in solving forecasting problems. The motivation of the study is to create a new forecasting method by combining a classical and simple forecasting method with a deep recurrent artificial neural network in an architecture. In this, a new hybrid deep recurrent artificial neural network with a simple exponential smoothing feedback mechanism is proposed. The architecture of the proposed method is created as a combination of simple recurrent artificial neural networks and simple exponential smoothing methods. In the training of the proposed method, two training algorithms based on sine cosine optimization and particle swarm optimization algorithms are proposed. In these training algorithms, two different solution strategies such as restarting, and early stopping rule are used to avoid overfitting and local optimum problems. The performance of the proposed method is analysed using stock market datasets and compared with both different deep and shallow artificial neural networks and classical forecasting methods. As a result of the analyses, it is concluded that the proposed method is successful in one step ahead of forecasting performance.

传统预测方法和机器学习方法都被用来解决预测问题。深度人工神经网络是机器学习方法之一,目前已得到广泛应用,并取得了非常好的效果。递归神经网络是深度神经网络的一种,在预测问题中非常重要。简单递归人工神经网络是最简单的深度递归神经网络,由于其使用的参数数量较少,在解决预测问题时往往是首选。简单指数平滑法是经典的预测方法之一,它在解决预测问题方面的性能备受关注。本研究的动机是通过将经典的简单预测方法与深度递归人工神经网络相结合,创建一种新的预测方法。为此,我们提出了一种具有简单指数平滑反馈机制的新型混合深度递归人工神经网络。所提方法的架构是简单递归人工神经网络和简单指数平滑方法的结合。在所提方法的训练中,提出了基于正弦余弦优化和粒子群优化算法的两种训练算法。在这些训练算法中,使用了两种不同的求解策略,如重新开始和早期停止规则,以避免过拟合和局部最优问题。利用股市数据集分析了所提方法的性能,并与不同的深层和浅层人工神经网络以及经典预测方法进行了比较。分析结果表明,所提出的方法在预测性能方面领先一步,是成功的。
{"title":"A hybrid deep recurrent artificial neural network with a simple exponential smoothing feedback mechanism","authors":"","doi":"10.1016/j.ins.2024.121356","DOIUrl":"10.1016/j.ins.2024.121356","url":null,"abstract":"<div><p>Both classical forecasting methods and machine learning approaches are used to solve forecasting problems. Deep artificial neural networks, one of the machine learning methods, are widely used today and give very good results. Recurrent neural networks, a type of deep neural network, are very important in forecasting problems. Simple recurrent artificial neural networks, which are the simplest deep recurrent neural networks, are often preferred in solving forecasting problems due to the small number of parameters they use. Simple exponential smoothing, one of the classical forecasting methods, attracts attention with its performance in solving forecasting problems. The motivation of the study is to create a new forecasting method by combining a classical and simple forecasting method with a deep recurrent artificial neural network in an architecture. In this, a new hybrid deep recurrent artificial neural network with a simple exponential smoothing feedback mechanism is proposed. The architecture of the proposed method is created as a combination of simple recurrent artificial neural networks and simple exponential smoothing methods. In the training of the proposed method, two training algorithms based on sine cosine optimization and particle swarm optimization algorithms are proposed. In these training algorithms, two different solution strategies such as restarting, and early stopping rule are used to avoid overfitting and local optimum problems. The performance of the proposed method is analysed using stock market datasets and compared with both different deep and shallow artificial neural networks and classical forecasting methods. As a result of the analyses, it is concluded that the proposed method is successful in one step ahead of forecasting performance.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142077489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards forward secure verifiable data streaming with support for keyword query 支持关键字查询的前向安全可验证数据流
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-22 DOI: 10.1016/j.ins.2024.121375

Verifiable data streaming (VDS) allows users to continuously upload their encrypted data items to untrusted cloud servers to reduce the burden of local storage. Meanwhile, it enables users to update outsourced data and initiate queries to verify data integrity. However, most previous works focus on queries for a certain index and are not compatible with general keyword queries. To this end, we propose a VDS protocol that supports keyword queries. Specifically, we first present an efficient dynamic symmetric searchable encryption (DSSE) with range query, which can represent multiple queried keywords within a range as the concise query token, thereby achieving communication-optimized keyword queries while achieving unbounded data appending. Furthermore, we construct a new VDS protocol supporting keyword queries by integrating the proposed range DSSE. Specifically, suppose the user searches for multiple keywords within a certain range. In that case, cloud servers can match the corresponding data items based on the user's query token, and users can verify the integrity of these data items. Security analysis demonstrates that our VDS protocol achieves forward security. Experimental results show that it sacrifices acceptable costs in exchange for keyword retrieval capability.

可验证数据流(VDS)允许用户将加密数据项持续上传到不受信任的云服务器,以减轻本地存储的负担。同时,它还能让用户更新外包数据并发起查询以验证数据的完整性。然而,以前的大多数工作都集中在对某个索引的查询上,与一般的关键字查询不兼容。为此,我们提出了一种支持关键字查询的 VDS 协议。具体来说,我们首先提出了一种具有范围查询功能的高效动态对称可搜索加密(DSSE),它可以将一个范围内的多个查询关键词表示为简洁的查询标记,从而在实现无限制数据追加的同时,实现了通信优化的关键词查询。此外,我们还通过集成所提出的范围 DSSE,构建了一个支持关键字查询的新 VDS 协议。具体来说,假设用户在一定范围内搜索多个关键词。在这种情况下,云服务器可根据用户的查询令牌匹配相应的数据项,而用户则可验证这些数据项的完整性。安全性分析表明,我们的 VDS 协议实现了前向安全性。实验结果表明,它牺牲了可接受的成本,换来了关键词检索能力。
{"title":"Towards forward secure verifiable data streaming with support for keyword query","authors":"","doi":"10.1016/j.ins.2024.121375","DOIUrl":"10.1016/j.ins.2024.121375","url":null,"abstract":"<div><p>Verifiable data streaming (VDS) allows users to continuously upload their encrypted data items to untrusted cloud servers to reduce the burden of local storage. Meanwhile, it enables users to update outsourced data and initiate queries to verify data integrity. However, most previous works focus on queries for a certain index and are not compatible with general keyword queries. To this end, we propose a VDS protocol that supports keyword queries. Specifically, we first present an efficient dynamic symmetric searchable encryption (DSSE) with range query, which can represent multiple queried keywords within a range as the concise query token, thereby achieving communication-optimized keyword queries while achieving unbounded data appending. Furthermore, we construct a new VDS protocol supporting keyword queries by integrating the proposed range DSSE. Specifically, suppose the user searches for multiple keywords within a certain range. In that case, cloud servers can match the corresponding data items based on the user's query token, and users can verify the integrity of these data items. Security analysis demonstrates that our VDS protocol achieves forward security. Experimental results show that it sacrifices acceptable costs in exchange for keyword retrieval capability.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1