Pub Date : 2024-09-03DOI: 10.1007/s12293-024-00423-5
Thanh-Hoang Nguyen-Vo, Trang T. T. Do, Binh P. Nguyen
Molecular property prediction is an important step in the drug discovery pipeline. Numerous computational methods have been developed to predict a wide range of molecular properties. While recent approaches have shown promising results, no single architecture can comprehensively address all tasks, making this area persistently challenging and requiring substantial time and effort. Beyond traditional machine learning and deep learning architectures for regular data, several deep learning architectures have been designed for graph-structured data to overcome the limitations of conventional methods. Utilizing graph-structured data in quantitative structure–activity relationship (QSAR) modeling allows models to effectively extract unique features, especially where connectivity information is crucial. In our study, we developed residual graph attention networks (ResGAT), a deep learning architecture for molecular graph-structured data. This architecture is a combination of graph attention networks and shortcut connections to address both regression and classification problems. It is also customizable to adapt to various dataset sizes, enhancing the learning process based on molecular patterns. When tested multiple times with both random and scaffold sampling strategies on nine benchmark molecular datasets, QSAR models developed using ResGAT demonstrated stability and competitive performance compared to state-of-the-art methods.
{"title":"ResGAT: Residual Graph Attention Networks for molecular property prediction","authors":"Thanh-Hoang Nguyen-Vo, Trang T. T. Do, Binh P. Nguyen","doi":"10.1007/s12293-024-00423-5","DOIUrl":"https://doi.org/10.1007/s12293-024-00423-5","url":null,"abstract":"<p>Molecular property prediction is an important step in the drug discovery pipeline. Numerous computational methods have been developed to predict a wide range of molecular properties. While recent approaches have shown promising results, no single architecture can comprehensively address all tasks, making this area persistently challenging and requiring substantial time and effort. Beyond traditional machine learning and deep learning architectures for regular data, several deep learning architectures have been designed for graph-structured data to overcome the limitations of conventional methods. Utilizing graph-structured data in quantitative structure–activity relationship (QSAR) modeling allows models to effectively extract unique features, especially where connectivity information is crucial. In our study, we developed residual graph attention networks (ResGAT), a deep learning architecture for molecular graph-structured data. This architecture is a combination of graph attention networks and shortcut connections to address both regression and classification problems. It is also customizable to adapt to various dataset sizes, enhancing the learning process based on molecular patterns. When tested multiple times with both random and scaffold sampling strategies on nine benchmark molecular datasets, QSAR models developed using ResGAT demonstrated stability and competitive performance compared to state-of-the-art methods.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"99 1","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142215603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-27DOI: 10.1007/s12293-024-00424-4
Chenwei Jin, Ruibin Bai, Yuyang Zhou, Xinan Chen, Leshan Tan
Over the past decade, the surge in global container port throughput has heightened the demand for terminal efficiency, with the container yard operations being central to the overall port performance. However, the unpredictable arrival of external trucks poses significant challenges for yard cranes which must simultaneously schedule operations for both internal and external tasks. Traditional yard crane scheduling methods often rely on outdated assumptions that fail to account for the dynamic impact of external tasks. In response, container terminals increasingly model the yard crane scheduling as an online problem. A notable advancement in online scheduling is the online rollout method, which evaluates the decisions based on the potential outcomes of their future rollout schedules rather than immediate priorities. Although this method outperforms the previous approach, it faces two main issues: the rollout simulation is time consuming, and decisions based solely on objective value of rollout schedules may not align with long-term scheduling objectives. To overcome these limitations, we have developed a two-stage adaptive rollout decision model. In the first stage, less desirable tasks are dynamically filtered out to reduce the number of rollout simulations required, while the second stage employs a genetic programming evolved evaluation function to infuse more refined forward-looking insights into the scheduling process. This approach has proven to significantly enhance yard scheduling efficiency and performance, as confirmed by experimental validation. Given the dynamic nature of yard crane operations, we believe this method could be beneficially applied to other dynamic scheduling contexts.
{"title":"Enhancing online yard crane scheduling through a two-stage rollout memetic genetic programming","authors":"Chenwei Jin, Ruibin Bai, Yuyang Zhou, Xinan Chen, Leshan Tan","doi":"10.1007/s12293-024-00424-4","DOIUrl":"https://doi.org/10.1007/s12293-024-00424-4","url":null,"abstract":"<p>Over the past decade, the surge in global container port throughput has heightened the demand for terminal efficiency, with the container yard operations being central to the overall port performance. However, the unpredictable arrival of external trucks poses significant challenges for yard cranes which must simultaneously schedule operations for both internal and external tasks. Traditional yard crane scheduling methods often rely on outdated assumptions that fail to account for the dynamic impact of external tasks. In response, container terminals increasingly model the yard crane scheduling as an online problem. A notable advancement in online scheduling is the online rollout method, which evaluates the decisions based on the potential outcomes of their future rollout schedules rather than immediate priorities. Although this method outperforms the previous approach, it faces two main issues: the rollout simulation is time consuming, and decisions based solely on objective value of rollout schedules may not align with long-term scheduling objectives. To overcome these limitations, we have developed a two-stage adaptive rollout decision model. In the first stage, less desirable tasks are dynamically filtered out to reduce the number of rollout simulations required, while the second stage employs a genetic programming evolved evaluation function to infuse more refined forward-looking insights into the scheduling process. This approach has proven to significantly enhance yard scheduling efficiency and performance, as confirmed by experimental validation. Given the dynamic nature of yard crane operations, we believe this method could be beneficially applied to other dynamic scheduling contexts.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"4 1","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142215604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-17DOI: 10.1007/s12293-024-00419-1
Yiming Peng, Gang Chen, Mengjie Zhang, Bing Xue
Evolutionary Algorithms (EAs), including Evolutionary Strategies (ES) and Genetic Algorithms (GAs), have been widely accepted as competitive alternatives to Policy Gradient techniques for Deep Reinforcement Learning (DRL). However, they remain eclipsed by cutting-edge DRL algorithms in terms of time efficiency, sample complexity, and learning effectiveness. In this paper, aiming at advancing evolutionary DRL research, we develop an evolutionary policy optimization algorithm with three key technical improvements. First, we design an efficient layer-wise strategy for training DNNs through Covariance Matrix Adaptation Evolutionary Strategies (CMA-ES) in a highly scalable manner. Second, we establish a surrogate model based on proximal performance lower bound for fitness evaluations with low sample complexity. Third, we embed a gradient-based local search technique within the evolutionary policy optimization process to further improve the learning effectiveness. The three technical innovations jointly forge a new EA for DRL method named Proximal Evolutionary Strategies (PES). Our experiments on ten continuous control problems show that PES with layer-wise training can be more computationally efficient than CMA-ES; our surrogate model can remarkably reduce the sample complexity of PES in comparison to latest EAs for DRL including CMA-ES, OpenAI-ES, and Uber-GA; PES with gradient-based local search can significantly outperform several promising DRL algorithms including TRPO, AKCTR, PPO, OpenAI-ES, and Uber-GA.
进化算法(EAs),包括进化策略(ES)和遗传算法(GAs),已被广泛接受为深度强化学习(DRL)中政策梯度技术的竞争性替代方案。然而,它们在时间效率、样本复杂度和学习效果方面仍然无法与最先进的 DRL 算法相提并论。本文旨在推进进化 DRL 研究,我们开发了一种进化策略优化算法,并在技术上做了三项关键改进。首先,我们通过协方差矩阵适应进化策略(CMA-ES)设计了一种高效的分层策略,以高度可扩展的方式训练 DNN。其次,我们建立了一个基于近端性能下限的代用模型,用于低样本复杂度的适配性评估。第三,我们在进化策略优化过程中嵌入了基于梯度的局部搜索技术,以进一步提高学习效率。这三项技术创新共同打造了 DRL 方法的新 EA,命名为 "近端进化策略"(PES)。我们在 10 个连续控制问题上的实验表明,与 CMA-ES 相比,采用分层训练的 PES 计算效率更高;与 CMA-ES、OpenAI-ES 和 Uber-GA 等最新的 DRL EA 相比,我们的代用模型可以显著降低 PES 的样本复杂度;与 TRPO、AKCTR、PPO、OpenAI-ES 和 Uber-GA 等几种有前途的 DRL 算法相比,采用基于梯度的局部搜索的 PES 可以明显优于它们。
{"title":"Proximal evolutionary strategy: improving deep reinforcement learning through evolutionary policy optimization","authors":"Yiming Peng, Gang Chen, Mengjie Zhang, Bing Xue","doi":"10.1007/s12293-024-00419-1","DOIUrl":"https://doi.org/10.1007/s12293-024-00419-1","url":null,"abstract":"<p>Evolutionary Algorithms (EAs), including Evolutionary Strategies (ES) and Genetic Algorithms (GAs), have been widely accepted as competitive alternatives to Policy Gradient techniques for Deep Reinforcement Learning (DRL). However, they remain eclipsed by cutting-edge DRL algorithms in terms of time efficiency, sample complexity, and learning effectiveness. In this paper, aiming at advancing evolutionary DRL research, we develop an evolutionary policy optimization algorithm with three key technical improvements. First, we design an efficient layer-wise strategy for training DNNs through Covariance Matrix Adaptation Evolutionary Strategies (CMA-ES) in a highly scalable manner. Second, we establish a surrogate model based on proximal performance lower bound for fitness evaluations with low sample complexity. Third, we embed a gradient-based local search technique within the evolutionary policy optimization process to further improve the learning effectiveness. The three technical innovations jointly forge a new EA for DRL method named Proximal Evolutionary Strategies (PES). Our experiments on ten continuous control problems show that PES with layer-wise training can be more computationally efficient than CMA-ES; our surrogate model can remarkably reduce the sample complexity of PES in comparison to latest EAs for DRL including CMA-ES, OpenAI-ES, and Uber-GA; PES with gradient-based local search can significantly outperform several promising DRL algorithms including TRPO, AKCTR, PPO, OpenAI-ES, and Uber-GA.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"1 1","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142215605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-16DOI: 10.1007/s12293-024-00425-3
Shifan Xu, Zhibin Xu, Jiannan Zheng, Hai Lin, Liang Zou, Meng Lei
Accurate tracing of crude oil origins is essential for thwarting deceptive trade practices, including origin falsification to evade taxes, thereby preventing economic losses and security threats for importing nations. Traditional crude oil origin determination methods require complex sample preparation, expensive instrumentation, and stable testing environments, rendering them impractical for real-time analysis at locations such as ports. This paper introduces a novel approach utilizing near-infrared spectroscopy (NIRS) combined with deep learning algorithms to expedite and enhance the precision of crude oil source identification. To effectively eliminate outliers, an improved Mahalanobis distance is introduced, incorporating regularization principles and global-local concepts. This approach addresses the challenges of inverting covariance matrices in high-dimensional spectra and excludes samples with localized aberrations. Furthermore, the integration of multi-receptive fields perception, Transformer-based global information interaction, and the scSE attention mechanism has led to the development of an MG-Unet model, designed to resolve spectral peak overlap issues and capture long-range feature dependencies. The proposed method achieves state-of-the-art accuracy of 96.92%, demonstrating significant potential for reliable crude oil source tracing.
{"title":"Where does the crude oil originate? The role of near-infrared spectroscopy in accurate source detection","authors":"Shifan Xu, Zhibin Xu, Jiannan Zheng, Hai Lin, Liang Zou, Meng Lei","doi":"10.1007/s12293-024-00425-3","DOIUrl":"https://doi.org/10.1007/s12293-024-00425-3","url":null,"abstract":"<p>Accurate tracing of crude oil origins is essential for thwarting deceptive trade practices, including origin falsification to evade taxes, thereby preventing economic losses and security threats for importing nations. Traditional crude oil origin determination methods require complex sample preparation, expensive instrumentation, and stable testing environments, rendering them impractical for real-time analysis at locations such as ports. This paper introduces a novel approach utilizing near-infrared spectroscopy (NIRS) combined with deep learning algorithms to expedite and enhance the precision of crude oil source identification. To effectively eliminate outliers, an improved Mahalanobis distance is introduced, incorporating regularization principles and global-local concepts. This approach addresses the challenges of inverting covariance matrices in high-dimensional spectra and excludes samples with localized aberrations. Furthermore, the integration of multi-receptive fields perception, Transformer-based global information interaction, and the scSE attention mechanism has led to the development of an MG-Unet model, designed to resolve spectral peak overlap issues and capture long-range feature dependencies. The proposed method achieves state-of-the-art accuracy of 96.92%, demonstrating significant potential for reliable crude oil source tracing.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"18 1","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142215606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-13DOI: 10.1007/s12293-024-00422-6
Yan Jia, Yuqing Cheng, Peng Qiao
Self-supervised learning, particularly through contrastive learning, has shown significant promise in vision tasks. Although effective, contrastive learning faces the issue of false negatives, particularly under domain shifts in domain adaptation scenarios. The Bootstrap Your Own Latent approach, with its asymmetric structure and avoidance of unnecessary negative samples, offers a foundation to address this issue, which remains underexplored in domain adaptation. We introduce an asymmetrically structured network, the Bootstrap Contrastive Domain Adaptation (BCDA), that innovatively applies contrastive learning to domain adaptation. BCDA utilizes a bootstrap clustering positive sampling strategy to ensure stable, end-to-end domain adaptation, preventing model collapse often seen in asymmetric networks. This method not only aligns domains internally through mean square loss but also enhances semantic inter-domain alignment, effectively eliminating false negatives. Our approach, BCDA, represents the first foray into non-contrastive domain adaptation and could serve as a foundational model for future studies. It shows potential to supersede contrastive domain adaptation methods in eliminating false negatives, evidenced by high-level results on three well-known domain adaptation benchmark datasets.
自我监督学习,特别是通过对比学习,在视觉任务中显示出了巨大的前景。对比学习虽然有效,但也面临着假阴性的问题,尤其是在领域适应场景中的领域转移情况下。Bootstrap Your Own Latent 方法采用非对称结构,避免了不必要的负样本,为解决这一问题提供了基础,而这一问题在领域适应中仍未得到充分探索。我们引入了一种非对称结构网络--自举对比领域适应(BCDA),它创新性地将对比学习应用于领域适应。BCDA 采用自举聚类正向采样策略,确保稳定的端到端领域适应,防止非对称网络中常见的模型崩溃。这种方法不仅能通过均方损失实现域内部对齐,还能增强域间语义对齐,有效消除假阴性。我们的 BCDA 方法是对非对比域适应性的首次尝试,可作为未来研究的基础模型。在三个著名的领域适应基准数据集上取得的高水平结果表明,它在消除假否定方面具有超越对比领域适应方法的潜力。
{"title":"Bootstrap contrastive domain adaptation","authors":"Yan Jia, Yuqing Cheng, Peng Qiao","doi":"10.1007/s12293-024-00422-6","DOIUrl":"https://doi.org/10.1007/s12293-024-00422-6","url":null,"abstract":"<p>Self-supervised learning, particularly through contrastive learning, has shown significant promise in vision tasks. Although effective, contrastive learning faces the issue of false negatives, particularly under domain shifts in domain adaptation scenarios. The Bootstrap Your Own Latent approach, with its asymmetric structure and avoidance of unnecessary negative samples, offers a foundation to address this issue, which remains underexplored in domain adaptation. We introduce an asymmetrically structured network, the Bootstrap Contrastive Domain Adaptation (BCDA), that innovatively applies contrastive learning to domain adaptation. BCDA utilizes a bootstrap clustering positive sampling strategy to ensure stable, end-to-end domain adaptation, preventing model collapse often seen in asymmetric networks. This method not only aligns domains internally through mean square loss but also enhances semantic inter-domain alignment, effectively eliminating false negatives. Our approach, BCDA, represents the first foray into non-contrastive domain adaptation and could serve as a foundational model for future studies. It shows potential to supersede contrastive domain adaptation methods in eliminating false negatives, evidenced by high-level results on three well-known domain adaptation benchmark datasets.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"10 1","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142215607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-12DOI: 10.1007/s12293-024-00429-z
Pengwei Huang, Kehui Liu
The quality of knowledge graphs (KGs) significantly influences their utility in downstream applications. Traditional methods for enhancing KG quality typically involve manual efforts and knowledge pattern learning to detect errors and complete missing triples. These approaches often incur high manual costs. To address these challenges, this paper proposes a novel “data-driven” approach to KG improvement. This method utilizes numerical data records to validate and enhance the information within KGs, overcoming limitations such as the requirement for a robust internal structure of KGs or the scarcity of expert resources. A pioneering technique that integrates Markov Boundary discovery with correlation analysis of data properties is developed in this study. This technique aims to identify and correct errors, as well as to fill in missing components of the KGs. To evaluate the effectiveness of this approach, experimental analysis was conducted, highlighting its potential to significantly improve KG accuracy and completeness. This data-driven strategy reduces reliance on extensive manual intervention and expert knowledge, introducing a scalable way to refine KGs using empirical data. The results from the experiments demonstrate the capability of this method to enhance the quality of KGs, marking it as a valuable contribution to the field of knowledge management.
知识图谱(KG)的质量极大地影响着其在下游应用中的效用。提高知识图谱质量的传统方法通常涉及人工操作和知识模式学习,以检测错误和补全缺失的三元组。这些方法通常会产生高昂的人工成本。为了应对这些挑战,本文提出了一种新颖的 "数据驱动 "KG 改进方法。这种方法利用数字数据记录来验证和增强知识库中的信息,克服了知识库内部结构不健全或专家资源稀缺等局限性。本研究开发了一种开创性的技术,将马尔可夫边界发现与数据属性的相关性分析相结合。该技术旨在识别和纠正错误,并填补 KGs 中缺失的部分。为了评估这种方法的有效性,我们进行了实验分析,结果表明这种方法具有显著提高 KG 准确性和完整性的潜力。这种数据驱动策略减少了对大量人工干预和专家知识的依赖,引入了一种利用经验数据完善 KG 的可扩展方法。实验结果表明,这种方法有能力提高知识库的质量,是对知识管理领域的宝贵贡献。
{"title":"Can data improve knowledge graph?","authors":"Pengwei Huang, Kehui Liu","doi":"10.1007/s12293-024-00429-z","DOIUrl":"https://doi.org/10.1007/s12293-024-00429-z","url":null,"abstract":"<p>The quality of knowledge graphs (KGs) significantly influences their utility in downstream applications. Traditional methods for enhancing KG quality typically involve manual efforts and knowledge pattern learning to detect errors and complete missing triples. These approaches often incur high manual costs. To address these challenges, this paper proposes a novel “data-driven” approach to KG improvement. This method utilizes numerical data records to validate and enhance the information within KGs, overcoming limitations such as the requirement for a robust internal structure of KGs or the scarcity of expert resources. A pioneering technique that integrates Markov Boundary discovery with correlation analysis of data properties is developed in this study. This technique aims to identify and correct errors, as well as to fill in missing components of the KGs. To evaluate the effectiveness of this approach, experimental analysis was conducted, highlighting its potential to significantly improve KG accuracy and completeness. This data-driven strategy reduces reliance on extensive manual intervention and expert knowledge, introducing a scalable way to refine KGs using empirical data. The results from the experiments demonstrate the capability of this method to enhance the quality of KGs, marking it as a valuable contribution to the field of knowledge management.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"193 1","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141933344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-12DOI: 10.1007/s12293-024-00431-5
Xuan Lu, Lei Chen, Hai-Lin Liu
Evolutionary multi-objective multitasking optimization (MTO) has emerged as a popular research field in evolutionary computation. By simultaneously considering multiple objectives and tasks while identifying valuable knowledge for intertask transfer, MTO aims to discover solutions that deliver optimal performance across all objectives and tasks. Nevertheless, MTO presents a substantial challenge concerning the effective transport of high-quality information between tasks. To handle this challenge, this paper introduces a novel approach named TL-MOMFEA (multi-objective multifactorial evolutionary algorithm based on domain transfer learning) for MTO problems. TL-MOMFEA uses domain-transfer learning to adapt the population from one task to another, resulting in the reproduction of higher-quality solutions. Furthermore, TL-MOMFEA employs a model transfer strategy where population distribution rules learned from one task are succinctly summarized and applied to similar tasks. By capitalizing on the knowledge acquired from solved tasks, TL-MOMFEA effectively circumvents futile searches and accurately identifies global optimum predictions with increased precision. The effectiveness of TL-MOMFEA is evaluated through experimental studies in two widely used test suites, and experimental comparisons have shown that the proposed paradigm achieves excellent results in terms of solution quality and search efficiency, thus demonstrating its clear superiority over other state-of-the-art MTO frameworks.
进化多目标多任务优化(MTO)已成为进化计算的一个热门研究领域。通过同时考虑多个目标和任务,同时识别任务间传输的有价值知识,多任务优化旨在发现能在所有目标和任务中实现最佳性能的解决方案。然而,MTO 在任务间有效传输高质量信息方面提出了巨大挑战。为了应对这一挑战,本文针对 MTO 问题提出了一种名为 TL-MOMFEA(基于领域转移学习的多目标多因素进化算法)的新方法。TL-MOMFEA 利用领域转移学习来调整从一个任务到另一个任务的群体,从而产生更高质量的解决方案。此外,TL-MOMFEA 还采用了模型转移策略,将从一项任务中学到的群体分布规则进行简明总结,并应用于类似任务。通过利用从已解决任务中获得的知识,TL-MOMFEA 有效地避免了徒劳无益的搜索,并以更高的精度准确确定了全局最优预测。通过在两个广泛使用的测试套件中进行实验研究,对 TL-MOMFEA 的有效性进行了评估,实验比较表明,所提出的范式在解决方案质量和搜索效率方面都取得了优异的结果,从而证明了它明显优于其他最先进的 MTO 框架。
{"title":"TL-MOMFEA: a transfer learning-based multi-objective multitasking optimization evolutionary algorithm","authors":"Xuan Lu, Lei Chen, Hai-Lin Liu","doi":"10.1007/s12293-024-00431-5","DOIUrl":"https://doi.org/10.1007/s12293-024-00431-5","url":null,"abstract":"<p>Evolutionary multi-objective multitasking optimization (MTO) has emerged as a popular research field in evolutionary computation. By simultaneously considering multiple objectives and tasks while identifying valuable knowledge for intertask transfer, MTO aims to discover solutions that deliver optimal performance across all objectives and tasks. Nevertheless, MTO presents a substantial challenge concerning the effective transport of high-quality information between tasks. To handle this challenge, this paper introduces a novel approach named TL-MOMFEA (multi-objective multifactorial evolutionary algorithm based on domain transfer learning) for MTO problems. TL-MOMFEA uses domain-transfer learning to adapt the population from one task to another, resulting in the reproduction of higher-quality solutions. Furthermore, TL-MOMFEA employs a model transfer strategy where population distribution rules learned from one task are succinctly summarized and applied to similar tasks. By capitalizing on the knowledge acquired from solved tasks, TL-MOMFEA effectively circumvents futile searches and accurately identifies global optimum predictions with increased precision. The effectiveness of TL-MOMFEA is evaluated through experimental studies in two widely used test suites, and experimental comparisons have shown that the proposed paradigm achieves excellent results in terms of solution quality and search efficiency, thus demonstrating its clear superiority over other state-of-the-art MTO frameworks.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"51 1","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141933305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In the realm of maritime emergencies, unmanned aerial vehicles (UAVs) play a crucial role in enhancing search and rescue (SAR) operations. They help in efficiently rescuing distressed crews, strengthening maritime surveillance, and maintaining national security due to their cost-effectiveness, versatility, and effectiveness. However, the vast expanse of sea territories and the rapid changes in maritime conditions make a single SAR center insufficient for handling complex emergencies. Thus, it is vital to develop strategies for quickly deploying UAV resources from multiple SAR centers for area reconnaissance and supporting maritime rescue operations. This study introduces a graph-structured planning model for the maritime SAR path planning problem, considering multiple rescue centers (MSARPPP-MRC). It incorporates workload distribution among SAR centers and UAV operational constraints. We propose a reinforcement learning-based genetic algorithm (GA-RL) to tackle the MSARPPP-MRC problem. GA-RL uses heuristic rules to initialize the population and employs the Q-learning method to manage the progeny during each generation, including their retention, storage, or disposal. When the elite repository’s capacity is reached, a decision is made on the utilization of these members to refresh the population. Additionally, adaptive crossover and perturbation strategies are applied to develop a more effective SAR scheme. Extensive testing proves that GA-RL surpasses other algorithms in optimization efficacy and efficiency, highlighting the benefits of reinforcement learning in population management.
在海上紧急情况领域,无人驾驶飞行器(UAV)在加强搜救(SAR)行动方面发挥着至关重要的作用。由于其成本效益高、用途广泛且效果显著,它们有助于有效营救遇险船员、加强海上监视和维护国家安全。然而,幅员辽阔的海域和瞬息万变的海况使得单一的 SAR 中心不足以应对复杂的紧急情况。因此,制定从多个 SAR 中心快速部署无人机资源的战略,用于区域侦察和支持海上救援行动至关重要。本研究针对考虑多个救援中心的海上搜救路径规划问题引入了图结构规划模型(MSARPPP-MRC)。该模型纳入了 SAR 中心之间的工作量分配和无人机操作约束。我们提出了一种基于强化学习的遗传算法(GA-RL)来解决 MSARPPP-MRC 问题。GA-RL 使用启发式规则来初始化种群,并采用 Q-learning 方法来管理每一代的后代,包括保留、存储或处置。当精英库的容量达到一定程度时,就会决定是否利用这些成员来刷新种群。此外,还应用了自适应交叉和扰动策略来开发更有效的 SAR 方案。广泛的测试证明,GA-RL 在优化效果和效率方面超越了其他算法,突出了强化学习在种群管理方面的优势。
{"title":"A reinforcement learning-based evolutionary algorithm for the unmanned aerial vehicles maritime search and rescue path planning problem considering multiple rescue centers","authors":"Haowen Zhan, Yue Zhang, Jingbo Huang, Yanjie Song, Lining Xing, Jie Wu, Zengyun Gao","doi":"10.1007/s12293-024-00420-8","DOIUrl":"https://doi.org/10.1007/s12293-024-00420-8","url":null,"abstract":"<p>In the realm of maritime emergencies, unmanned aerial vehicles (UAVs) play a crucial role in enhancing search and rescue (SAR) operations. They help in efficiently rescuing distressed crews, strengthening maritime surveillance, and maintaining national security due to their cost-effectiveness, versatility, and effectiveness. However, the vast expanse of sea territories and the rapid changes in maritime conditions make a single SAR center insufficient for handling complex emergencies. Thus, it is vital to develop strategies for quickly deploying UAV resources from multiple SAR centers for area reconnaissance and supporting maritime rescue operations. This study introduces a graph-structured planning model for the maritime SAR path planning problem, considering multiple rescue centers (MSARPPP-MRC). It incorporates workload distribution among SAR centers and UAV operational constraints. We propose a reinforcement learning-based genetic algorithm (GA-RL) to tackle the MSARPPP-MRC problem. GA-RL uses heuristic rules to initialize the population and employs the Q-learning method to manage the progeny during each generation, including their retention, storage, or disposal. When the elite repository’s capacity is reached, a decision is made on the utilization of these members to refresh the population. Additionally, adaptive crossover and perturbation strategies are applied to develop a more effective SAR scheme. Extensive testing proves that GA-RL surpasses other algorithms in optimization efficacy and efficiency, highlighting the benefits of reinforcement learning in population management.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"55 1","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141933307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper investigates a new problem in an identical parallel machine environment called parallel machine scheduling with job family, release time, and mold availability constraints (PMS-JRM), which is highly challenging from the computational perspective as it extends the basic NP-hard problem (P_m||sum C_j). The mold availability notion, first introduced in this paper, represents the availability relationship between jobs and machines. The PMS-JRM model originates from the imaging data collaborative processing in a low-earth-orbit satellite constellation under a time-varying communication network, and it can represent other multi-resource collaborative scheduling problems with discontinuous communication. An integer programming model was proposed to formulate the PMS-JRM. Due to its NP-hardness, two highly efficient heuristic solution approaches were proposed, namely a greedy algorithm with a hybrid first come first serve (HFCFS) dispatching rule (GA-HFCFS) and a Memetic Algorithm with Heterogeneous swap and Key job insertion operators (MA-HK). Extensive experiments were conducted on a set of test cases with various scales, and the results showed that GA-HFCFS outperforms three classical dispatching rules available in the literature. Taking the results of GA-HFCFS as initial solutions, MA-HK achieves optimal solutions for all small-scale cases while providing superior solutions within the same running time compared to two other competitors for large-scale cases. In particular, MA-HK yields better solutions in less running time than the state-of-the-art CPLEX solver. Additional experiments were conducted to highlight the critical ingredients of MA-HK.
{"title":"Parallel machine scheduling with job family, release time, and mold availability constraints: model and two solution approaches","authors":"Xiang Lin, Yuning Chen, Junhua Xue, Boquan Zhang, Yingwu Chen, Cheng Chen","doi":"10.1007/s12293-024-00421-7","DOIUrl":"https://doi.org/10.1007/s12293-024-00421-7","url":null,"abstract":"<p>This paper investigates a new problem in an identical parallel machine environment called parallel machine scheduling with job family, release time, and mold availability constraints (PMS-JRM), which is highly challenging from the computational perspective as it extends the basic NP-hard problem <span>(P_m||sum C_j)</span>. The mold availability notion, first introduced in this paper, represents the availability relationship between jobs and machines. The PMS-JRM model originates from the imaging data collaborative processing in a low-earth-orbit satellite constellation under a time-varying communication network, and it can represent other multi-resource collaborative scheduling problems with discontinuous communication. An integer programming model was proposed to formulate the PMS-JRM. Due to its NP-hardness, two highly efficient heuristic solution approaches were proposed, namely a greedy algorithm with a hybrid first come first serve (HFCFS) dispatching rule (GA-HFCFS) and a Memetic Algorithm with Heterogeneous swap and Key job insertion operators (MA-HK). Extensive experiments were conducted on a set of test cases with various scales, and the results showed that GA-HFCFS outperforms three classical dispatching rules available in the literature. Taking the results of GA-HFCFS as initial solutions, MA-HK achieves optimal solutions for all small-scale cases while providing superior solutions within the same running time compared to two other competitors for large-scale cases. In particular, MA-HK yields better solutions in less running time than the state-of-the-art CPLEX solver. Additional experiments were conducted to highlight the critical ingredients of MA-HK.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"78 1","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141933306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-06DOI: 10.1007/s12293-024-00428-0
Zhenwei Wang, Ruibin Bai, Fazlullah Khan, Ender Özcan, Tiehua Zhang
Learning-based methods have become increasingly popular for solving vehicle routing problems (VRP) due to their near-optimal performance and fast inference speed. Among them, the combination of deep reinforcement learning and graph representation allows for the abstraction of node topology structures and features in an encoder–decoder style. Such an approach makes it possible to solve routing problems end-to-end without needing complicated heuristic operators designed by domain experts. Existing research studies have been focusing on novel encoding and decoding structures via various neural network models to enhance the node embedding representation. Despite the sophisticated approaches being designed for VRP, there is a noticeable lack of consideration for the graph-theoretic properties inherent to routing problems. Moreover, the potential ramifications of inter-nodal interactions on the decision-making efficacy of the models have not been adequately explored. To bridge this gap, we propose an adaptive graph attention sampling with the edges fusion framework, where nodes’ embedding is determined through attention calculation from certain highly correlated neighborhoods and edges, utilizing a filtered adjacency matrix. In detail, the selections of particular neighbors and adjacency edges are led by a multi-head attention mechanism, contributing directly to the message passing and node embedding in graph attention sampling networks. Furthermore, an adaptive actor-critic algorithm with policy improvements is incorporated to expedite the training convergence. We then conduct comprehensive experiments against baseline methods on learning-based VRP tasks from different perspectives. Our proposed model outperforms the existing methods by 2.08–6.23% and shows stronger generalization ability, achieving the state-of-the-art performance on randomly generated instances and standard benchmark datasets.
{"title":"Gase: graph attention sampling with edges fusion for solving vehicle routing problems","authors":"Zhenwei Wang, Ruibin Bai, Fazlullah Khan, Ender Özcan, Tiehua Zhang","doi":"10.1007/s12293-024-00428-0","DOIUrl":"https://doi.org/10.1007/s12293-024-00428-0","url":null,"abstract":"<p>Learning-based methods have become increasingly popular for solving vehicle routing problems (VRP) due to their near-optimal performance and fast inference speed. Among them, the combination of deep reinforcement learning and graph representation allows for the abstraction of node topology structures and features in an encoder–decoder style. Such an approach makes it possible to solve routing problems end-to-end without needing complicated heuristic operators designed by domain experts. Existing research studies have been focusing on novel encoding and decoding structures via various neural network models to enhance the node embedding representation. Despite the sophisticated approaches being designed for VRP, there is a noticeable lack of consideration for the graph-theoretic properties inherent to routing problems. Moreover, the potential ramifications of inter-nodal interactions on the decision-making efficacy of the models have not been adequately explored. To bridge this gap, we propose an adaptive graph attention sampling with the edges fusion framework, where nodes’ embedding is determined through attention calculation from certain highly correlated neighborhoods and edges, utilizing a filtered adjacency matrix. In detail, the selections of particular neighbors and adjacency edges are led by a multi-head attention mechanism, contributing directly to the message passing and node embedding in graph attention sampling networks. Furthermore, an adaptive actor-critic algorithm with policy improvements is incorporated to expedite the training convergence. We then conduct comprehensive experiments against baseline methods on learning-based VRP tasks from different perspectives. Our proposed model outperforms the existing methods by 2.08–6.23% and shows stronger generalization ability, achieving the state-of-the-art performance on randomly generated instances and standard benchmark datasets.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"79 1","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141933308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}