首页 > 最新文献

Complex & Intelligent Systems最新文献

英文 中文
DCASAM: advancing aspect-based sentiment analysis through a deep context-aware sentiment analysis model DCASAM:通过深度情境感知情感分析模型推进基于方面的情感分析
IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-10 DOI: 10.1007/s40747-024-01570-5
Xiangkui Jiang, Binglong Ren, Qing Wu, Wuwei Wang, Hong Li

Aspect-level sentiment analysis plays a pivotal role in fine-grained sentiment categorization, especially given the rapid expansion of online information. Traditional methods often struggle with accurately determining sentiment polarity when faced with implicit or ambiguous data, leading to limited accuracy and context-awareness. To address these challenges, we propose the Deep Context-Aware Sentiment Analysis Model (DCASAM). This model integrates the capabilities of Deep Bidirectional Long Short-Term Memory Network (DBiLSTM) and Densely Connected Graph Convolutional Network (DGCN), enhancing the ability to capture long-distance dependencies and subtle contextual variations.The DBiLSTM component effectively captures sequential dependencies, while the DGCN component leverages densely connected structures to model intricate relationships within the data. This combination allows DCASAM to maintain a high level of contextual understanding and sentiment detection accuracy.Experimental evaluations on well-known public datasets, including Restaurant14, Laptop14, and Twitter, demonstrate the superior performance of DCASAM over existing models. Our model achieves an average improvement in accuracy by 1.07% and F1 score by 1.68%, showcasing its robustness and efficacy in handling complex sentiment analysis tasks.These results highlight the potential of DCASAM for real-world applications, offering a solid foundation for future research in aspect-level sentiment analysis. By providing a more nuanced understanding of sentiment, our model contributes significantly to the advancement of fine-grained sentiment analysis techniques.

方面级情感分析在细粒度情感分类中起着举足轻重的作用,尤其是在网络信息迅速膨胀的情况下。传统方法在面对隐含或模糊数据时往往难以准确判断情感极性,导致准确性和上下文感知能力有限。为了应对这些挑战,我们提出了深度情境感知情感分析模型(DCASAM)。该模型集成了深度双向长短期记忆网络(DBiLSTM)和密集连接图卷积网络(DGCN)的功能,增强了捕捉长距离依赖关系和微妙上下文变化的能力。DBiLSTM 组件能有效捕捉顺序依赖关系,而 DGCN 组件则利用密集连接结构来模拟数据中的复杂关系。这种组合使 DCASAM 能够保持较高的上下文理解能力和情感检测准确性。在知名公共数据集(包括 Restaurant14、Laptop14 和 Twitter)上进行的实验评估表明,DCASAM 的性能优于现有模型。我们的模型在准确率方面平均提高了 1.07%,在 F1 分数方面平均提高了 1.68%,展示了它在处理复杂情感分析任务时的稳健性和有效性。这些结果凸显了 DCASAM 在现实世界应用中的潜力,为未来的方面级情感分析研究奠定了坚实的基础。通过提供对情感的更细致入微的理解,我们的模型极大地推动了细粒度情感分析技术的发展。
{"title":"DCASAM: advancing aspect-based sentiment analysis through a deep context-aware sentiment analysis model","authors":"Xiangkui Jiang, Binglong Ren, Qing Wu, Wuwei Wang, Hong Li","doi":"10.1007/s40747-024-01570-5","DOIUrl":"https://doi.org/10.1007/s40747-024-01570-5","url":null,"abstract":"<p>Aspect-level sentiment analysis plays a pivotal role in fine-grained sentiment categorization, especially given the rapid expansion of online information. Traditional methods often struggle with accurately determining sentiment polarity when faced with implicit or ambiguous data, leading to limited accuracy and context-awareness. To address these challenges, we propose the Deep Context-Aware Sentiment Analysis Model (DCASAM). This model integrates the capabilities of Deep Bidirectional Long Short-Term Memory Network (DBiLSTM) and Densely Connected Graph Convolutional Network (DGCN), enhancing the ability to capture long-distance dependencies and subtle contextual variations.The DBiLSTM component effectively captures sequential dependencies, while the DGCN component leverages densely connected structures to model intricate relationships within the data. This combination allows DCASAM to maintain a high level of contextual understanding and sentiment detection accuracy.Experimental evaluations on well-known public datasets, including Restaurant14, Laptop14, and Twitter, demonstrate the superior performance of DCASAM over existing models. Our model achieves an average improvement in accuracy by 1.07% and F1 score by 1.68%, showcasing its robustness and efficacy in handling complex sentiment analysis tasks.These results highlight the potential of DCASAM for real-world applications, offering a solid foundation for future research in aspect-level sentiment analysis. By providing a more nuanced understanding of sentiment, our model contributes significantly to the advancement of fine-grained sentiment analysis techniques.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"191 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141910466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integration of attention mechanism and CNN-BiGRU for TDOA/FDOA collaborative mobile underwater multi-scene localization algorithm 将注意力机制和 CNN-BiGRU 集成用于 TDOA/FDOA 协同移动水下多场景定位算法
IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-10 DOI: 10.1007/s40747-024-01583-0
Duo Peng, Ming Shuo Liu, Kun Xie

The aim of this study is to address the issue of TDOA/FDOA measurement accuracy in complex underwater environments, which is affected by multipath effects and variations in water sound velocity induced by the challenging nature of the underwater environment. To this end, a novel cooperative localisation algorithm has been developed, integrating the attention mechanism and convolutional neural network-bidirectional gated recurrent unit (CNN-BiGRU) with TDOA/FDOA and two-step weighted least squares (ImTSWLS). This algorithm is designed to enhance the accuracy of TDOA/FDOA measurements in complex underwater environments. The algorithm initially makes use of the considerable capacity of a convolutional neural network (CNN) to extract profound spatial and frequency domain characteristics from multimodal data. These features are of paramount importance for the characterisation of underwater signal propagation, particularly in complex environments. Subsequently, through the use of a bidirectional gated recurrent unit (BiGRU), the algorithm is able to effectively capture long-term dependencies in time series data. This enables a more comprehensive analysis and understanding of the changing pattern of signals over time. Furthermore, the incorporation of an attention mechanism within the algorithm enables the model to focus more on the signal features that have a significant impact on localisation, while simultaneously suppressing the interference of extraneous information. This further enhances the efficiency of identifying and utilising the key signal features. ImTSWLS is employed to resolve the position and velocity data following the acquisition of the predicted TDOA/FDOA, thereby enabling the accurate estimation of the position and velocity of the mobile radiation source. The algorithm was subjected to a series of tests in a variety of simulated underwater environments, including different sea states, target motion speeds and base station configurations. The experimental results demonstrate that the algorithm exhibits a deviation of only 2.88 m/s in velocity estimation and 2.58 m in position estimation when the noise level is 20 dB. The algorithm presented in this paper demonstrates superior performance in both position and velocity estimation compared to other algorithms.

本研究旨在解决复杂水下环境中的 TDOA/FDOA 测量精度问题,该问题受到多径效应和水下环境的挑战性所引起的水声速度变化的影响。为此,我们开发了一种新型合作定位算法,将注意力机制和卷积神经网络-双向门控递归单元(CNN-BiGRU)与 TDOA/FDOA 和两步加权最小二乘法(ImTSWLS)相结合。该算法旨在提高复杂水下环境中的 TDOA/FDOA 测量精度。该算法最初利用卷积神经网络(CNN)的强大能力,从多模态数据中提取深刻的空间和频域特征。这些特征对于描述水下信号传播,尤其是复杂环境中的信号传播至关重要。随后,通过使用双向门控递归单元(BiGRU),该算法能够有效捕捉时间序列数据中的长期依赖关系。这样就能对信号随时间变化的模式进行更全面的分析和理解。此外,在算法中加入注意力机制,可使模型更加关注对定位有重大影响的信号特征,同时抑制无关信息的干扰。这进一步提高了识别和利用关键信号特征的效率。在获取预测的 TDOA/FDOA 之后,采用 ImTSWLS 解析位置和速度数据,从而能够准确估计移动辐射源的位置和速度。该算法在各种模拟水下环境中进行了一系列测试,包括不同的海况、目标运动速度和基站配置。实验结果表明,当噪声水平为 20 dB 时,该算法的速度估计偏差仅为 2.88 m/s,位置估计偏差仅为 2.58 m。与其他算法相比,本文介绍的算法在位置和速度估计方面都表现出卓越的性能。
{"title":"Integration of attention mechanism and CNN-BiGRU for TDOA/FDOA collaborative mobile underwater multi-scene localization algorithm","authors":"Duo Peng, Ming Shuo Liu, Kun Xie","doi":"10.1007/s40747-024-01583-0","DOIUrl":"https://doi.org/10.1007/s40747-024-01583-0","url":null,"abstract":"<p>The aim of this study is to address the issue of TDOA/FDOA measurement accuracy in complex underwater environments, which is affected by multipath effects and variations in water sound velocity induced by the challenging nature of the underwater environment. To this end, a novel cooperative localisation algorithm has been developed, integrating the attention mechanism and convolutional neural network-bidirectional gated recurrent unit (CNN-BiGRU) with TDOA/FDOA and two-step weighted least squares (ImTSWLS). This algorithm is designed to enhance the accuracy of TDOA/FDOA measurements in complex underwater environments. The algorithm initially makes use of the considerable capacity of a convolutional neural network (CNN) to extract profound spatial and frequency domain characteristics from multimodal data. These features are of paramount importance for the characterisation of underwater signal propagation, particularly in complex environments. Subsequently, through the use of a bidirectional gated recurrent unit (BiGRU), the algorithm is able to effectively capture long-term dependencies in time series data. This enables a more comprehensive analysis and understanding of the changing pattern of signals over time. Furthermore, the incorporation of an attention mechanism within the algorithm enables the model to focus more on the signal features that have a significant impact on localisation, while simultaneously suppressing the interference of extraneous information. This further enhances the efficiency of identifying and utilising the key signal features. ImTSWLS is employed to resolve the position and velocity data following the acquisition of the predicted TDOA/FDOA, thereby enabling the accurate estimation of the position and velocity of the mobile radiation source. The algorithm was subjected to a series of tests in a variety of simulated underwater environments, including different sea states, target motion speeds and base station configurations. The experimental results demonstrate that the algorithm exhibits a deviation of only 2.88 m/s in velocity estimation and 2.58 m in position estimation when the noise level is 20 dB. The algorithm presented in this paper demonstrates superior performance in both position and velocity estimation compared to other algorithms.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"14 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141910474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing long-short term memory neural networks for electroencephalogram anomaly detection using variable neighborhood search with dynamic strategy change 利用动态策略变化的可变邻域搜索优化用于脑电图异常检测的长短期记忆神经网络
IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-10 DOI: 10.1007/s40747-024-01592-z
Branislav Radomirovic, Nebojsa Bacanin, Luka Jovanovic, Vladimir Simic, Angelinu Njegus, Dragan Pamucar, Mario Köppen, Miodrag Zivkovic

Electroencephalography (EEG) serves as a crucial neurodiagnostic tool by recording the electrical brain activity via attached electrodes on the patient’s head. While artificial intelligence (AI) exhibited considerable promise in medical diagnostics, its potential in the realm of neurodiagnostics remains underexplored. This research addresses this gap by proposing an innovative approach employing time-series classification of EEG data, leveraging long-short-term memory (LSTM) neural networks for the identification of abnormal brain activity, particularly seizures. To enhance the performance of the proposed model, metaheuristic algorithms were employed for optimizing hyperparameter collection. Additionally, a tailored modification of the variable neighborhood search (VNS) is introduced, specifically tailored for this neurodiagnostic application. The effectiveness of this methodology is evaluated using a carefully curated dataset comprising real-world EEG recordings from both healthy individuals and those affected by epilepsy. This software-based approach demonstrates noteworthy results, showcasing its efficacy in anomaly and seizure detection, even when working with relatively modest sample sizes. This research contributes to the field by illuminating the potential of AI in neurodiagnostics, presenting a methodology that enhances accuracy in identifying abnormal brain activities, with implications for improved patient care and diagnostic precision.

脑电图(EEG)通过连接在患者头部的电极记录脑电活动,是一种重要的神经诊断工具。虽然人工智能(AI)在医疗诊断领域大有可为,但其在神经诊断领域的潜力仍未得到充分挖掘。本研究针对这一空白,提出了一种采用时间序列分类脑电图数据的创新方法,利用长短期记忆(LSTM)神经网络来识别异常大脑活动,尤其是癫痫发作。为提高所提模型的性能,采用了元启发式算法来优化超参数收集。此外,还引入了变量邻域搜索(VNS)的定制修改,专门针对这一神经诊断应用。该方法的有效性通过一个精心策划的数据集进行了评估,该数据集包括来自健康人和癫痫患者的真实世界脑电图记录。这种基于软件的方法取得了显著的成果,展示了其在异常和癫痫发作检测方面的功效,即使在样本量相对较少的情况下也是如此。这项研究阐明了人工智能在神经诊断领域的潜力,提出了一种能提高识别异常大脑活动准确性的方法,对改善病人护理和提高诊断精确度具有重要意义,从而为该领域做出了贡献。
{"title":"Optimizing long-short term memory neural networks for electroencephalogram anomaly detection using variable neighborhood search with dynamic strategy change","authors":"Branislav Radomirovic, Nebojsa Bacanin, Luka Jovanovic, Vladimir Simic, Angelinu Njegus, Dragan Pamucar, Mario Köppen, Miodrag Zivkovic","doi":"10.1007/s40747-024-01592-z","DOIUrl":"https://doi.org/10.1007/s40747-024-01592-z","url":null,"abstract":"<p>Electroencephalography (EEG) serves as a crucial neurodiagnostic tool by recording the electrical brain activity via attached electrodes on the patient’s head. While artificial intelligence (AI) exhibited considerable promise in medical diagnostics, its potential in the realm of neurodiagnostics remains underexplored. This research addresses this gap by proposing an innovative approach employing time-series classification of EEG data, leveraging long-short-term memory (LSTM) neural networks for the identification of abnormal brain activity, particularly seizures. To enhance the performance of the proposed model, metaheuristic algorithms were employed for optimizing hyperparameter collection. Additionally, a tailored modification of the variable neighborhood search (VNS) is introduced, specifically tailored for this neurodiagnostic application. The effectiveness of this methodology is evaluated using a carefully curated dataset comprising real-world EEG recordings from both healthy individuals and those affected by epilepsy. This software-based approach demonstrates noteworthy results, showcasing its efficacy in anomaly and seizure detection, even when working with relatively modest sample sizes. This research contributes to the field by illuminating the potential of AI in neurodiagnostics, presenting a methodology that enhances accuracy in identifying abnormal brain activities, with implications for improved patient care and diagnostic precision.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"34 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141910477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An improved cross-domain sequential recommendation model based on intra-domain and inter-domain contrastive learning 基于域内和域间对比学习的改进型跨域顺序推荐模型
IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-09 DOI: 10.1007/s40747-024-01590-1
Jianjun Ni, Tong Shen, Yonghao Zhao, Guangyi Tang, Yang Gu

Cross-domain recommendation aims to integrate data from multiple domains and introduce information from source domains, thereby achieving good recommendations on the target domain. Recently, contrastive learning has been introduced into the cross-domain recommendations and has obtained some better results. However, most cross-domain recommendation algorithms based on contrastive learning suffer from the bias problem. In addition, the correlation between the user’s single-domain and cross-domain preferences is not considered. To address these problems, a new recommendation model is proposed for cross-domain scenarios based on intra-domain and inter-domain contrastive learning, which aims to obtain unbiased user preferences in cross-domain scenarios and improve the recommendation performance of both domains. Firstly, a network enhancement module is proposed to capture users’ complete preference by applying a graphical convolution and attentional aggregator. This module can reduce the limitations of only considering user preferences in a single domain. Then, a cross-domain infomax objective with noise contrast is presented to ensure that users’ single-domain and cross-domain preferences are correlated closely in sequential interactions. Finally, a joint training strategy is designed to improve the recommendation performances of two domains, which can achieve unbiased cross-domain recommendation results. At last, extensive experiments are conducted on two real-world cross-domain scenarios. The experimental results show that the proposed model in this paper achieves the best recommendation results in comparison with existing models.

跨领域推荐旨在整合多个领域的数据,引入源领域的信息,从而实现对目标领域的良好推荐。最近,对比学习被引入到跨领域推荐中,并取得了一些较好的效果。然而,大多数基于对比学习的跨域推荐算法都存在偏差问题。此外,用户的单域偏好和跨域偏好之间的相关性也未被考虑在内。针对这些问题,我们提出了一种基于域内和域间对比学习的跨域场景推荐模型,旨在获取跨域场景中无偏见的用户偏好,提高两个域的推荐性能。首先,提出了一个网络增强模块,通过应用图形卷积和注意力聚合器来捕捉用户的完整偏好。该模块可以减少只考虑单一领域用户偏好的局限性。然后,提出了具有噪声对比度的跨域 infomax 目标,以确保用户的单域和跨域偏好在连续交互中密切相关。最后,设计了一种联合训练策略来提高两个域的推荐性能,从而实现无偏的跨域推荐结果。最后,在两个真实的跨域场景中进行了大量实验。实验结果表明,与现有模型相比,本文提出的模型取得了最佳推荐效果。
{"title":"An improved cross-domain sequential recommendation model based on intra-domain and inter-domain contrastive learning","authors":"Jianjun Ni, Tong Shen, Yonghao Zhao, Guangyi Tang, Yang Gu","doi":"10.1007/s40747-024-01590-1","DOIUrl":"https://doi.org/10.1007/s40747-024-01590-1","url":null,"abstract":"<p>Cross-domain recommendation aims to integrate data from multiple domains and introduce information from source domains, thereby achieving good recommendations on the target domain. Recently, contrastive learning has been introduced into the cross-domain recommendations and has obtained some better results. However, most cross-domain recommendation algorithms based on contrastive learning suffer from the bias problem. In addition, the correlation between the user’s single-domain and cross-domain preferences is not considered. To address these problems, a new recommendation model is proposed for cross-domain scenarios based on intra-domain and inter-domain contrastive learning, which aims to obtain unbiased user preferences in cross-domain scenarios and improve the recommendation performance of both domains. Firstly, a network enhancement module is proposed to capture users’ complete preference by applying a graphical convolution and attentional aggregator. This module can reduce the limitations of only considering user preferences in a single domain. Then, a cross-domain infomax objective with noise contrast is presented to ensure that users’ single-domain and cross-domain preferences are correlated closely in sequential interactions. Finally, a joint training strategy is designed to improve the recommendation performances of two domains, which can achieve unbiased cross-domain recommendation results. At last, extensive experiments are conducted on two real-world cross-domain scenarios. The experimental results show that the proposed model in this paper achieves the best recommendation results in comparison with existing models.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"152 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141909284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel iteration scheme with conjugate gradient for faster pruning on transformer models 新颖的共轭梯度迭代方案,可加快变压器模型的剪枝速度
IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-07 DOI: 10.1007/s40747-024-01595-w
Jun Li, Yuchen Zhu, Kexue Sun

Pre-trained models based on the Transformer architecture have significantly advanced research within the domain of Natural Language Processing (NLP) due to their superior performance and extensive applicability across multiple technological sectors. Despite these advantages, there is a significant challenge in optimizing these models for more efficient deployment. To be concrete, the existing post-training pruning frameworks of transformer models suffer from inefficiencies in the crucial stage of pruning accuracy recovery, which impacts the overall pruning efficiency. To address this issue, this paper introduces a novel and efficient iteration scheme with conjugate gradient in the pruning recovery stage. By constructing a series of conjugate iterative directions, this approach ensures each optimization step is orthogonal to the previous ones, which effectively reduces redundant explorations of the search space. Consequently, each iteration progresses effectively towards the global optimum, thereby significantly enhancing search efficiency. The conjugate gradient-based faster-pruner reduces the time expenditure of the pruning process while maintaining accuracy, demonstrating a high degree of solution stability and exceptional model acceleration effects. In pruning experiments conducted on the BERTBASE and DistilBERT models, the faster-pruner exhibited outstanding performance on the GLUE benchmark dataset, achieving a reduction of up to 36.27% in pruning time and a speed increase of up to 1.45× on an RTX 3090 GPU.

基于 Transformer 架构的预训练模型因其卓越的性能和在多个技术领域的广泛适用性,极大地推动了自然语言处理(NLP)领域的研究。尽管有这些优势,但在优化这些模型以提高部署效率方面仍面临巨大挑战。具体来说,现有的转换器模型训练后剪枝框架在剪枝准确性恢复的关键阶段效率低下,影响了整体剪枝效率。针对这一问题,本文在剪枝恢复阶段引入了一种新颖高效的共轭梯度迭代方案。通过构建一系列共轭迭代方向,这种方法确保了每个优化步骤都与之前的步骤正交,从而有效减少了对搜索空间的冗余探索。因此,每次迭代都能有效实现全局最优,从而显著提高搜索效率。基于共轭梯度的快速剪枝器在保持精度的同时,减少了剪枝过程的时间消耗,表现出高度的解稳定性和卓越的模型加速效果。在对 BERTBASE 和 DistilBERT 模型进行的剪枝实验中,更快剪枝器在 GLUE 基准数据集上表现出色,在 RTX 3090 GPU 上实现了高达 36.27% 的剪枝时间缩减和高达 1.45 倍的速度提升。
{"title":"A novel iteration scheme with conjugate gradient for faster pruning on transformer models","authors":"Jun Li, Yuchen Zhu, Kexue Sun","doi":"10.1007/s40747-024-01595-w","DOIUrl":"https://doi.org/10.1007/s40747-024-01595-w","url":null,"abstract":"<p>Pre-trained models based on the Transformer architecture have significantly advanced research within the domain of Natural Language Processing (NLP) due to their superior performance and extensive applicability across multiple technological sectors. Despite these advantages, there is a significant challenge in optimizing these models for more efficient deployment. To be concrete, the existing post-training pruning frameworks of transformer models suffer from inefficiencies in the crucial stage of pruning accuracy recovery, which impacts the overall pruning efficiency. To address this issue, this paper introduces a novel and efficient iteration scheme with conjugate gradient in the pruning recovery stage. By constructing a series of conjugate iterative directions, this approach ensures each optimization step is orthogonal to the previous ones, which effectively reduces redundant explorations of the search space. Consequently, each iteration progresses effectively towards the global optimum, thereby significantly enhancing search efficiency. The conjugate gradient-based faster-pruner reduces the time expenditure of the pruning process while maintaining accuracy, demonstrating a high degree of solution stability and exceptional model acceleration effects. In pruning experiments conducted on the BERT<sub>BASE</sub> and DistilBERT models, the faster-pruner exhibited outstanding performance on the GLUE benchmark dataset, achieving a reduction of up to 36.27% in pruning time and a speed increase of up to 1.45× on an RTX 3090 GPU.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"55 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141899854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A learning-based model predictive control scheme for injection speed tracking in injection molding process 基于学习的注塑成型工艺注塑速度跟踪模型预测控制方案
IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-05 DOI: 10.1007/s40747-024-01588-9
Zhigang Ren, Jianpu Cai, Bo Zhang, Zongze Wu

Injection molding is a pivotal industrial process renowned for its high production speed, efficiency, and automation. Controlling the motion speed of injection molding machines is a crucial factor that influences production processes, directly affecting product quality and efficiency. This paper aims to tackle the challenge of achieving optimal tracking control of injection speed in a standard class of injection molding machines (IMMs) characterized by nonlinear dynamics. To achieve this goal, we propose a learning-based model predictive control (LMPC) scheme that incorporates Gaussian process regression (GPR) to predict and model uncertainty in the injection molding process (IMP). Specifically, the scheme formulates a nonlinear tracking control problem for injection speed, utilizing a GPR-based learning residual model to capture uncertainty and provide accurate predictions. It learns the dynamics model and historical data of the IMM, automatically adjusting the injection speed according to target requirements for optimal production control. Additionally, the optimization problem is efficiently solved using a control-constrained differential dynamic programming approach. Finally, we conduct comprehensive numerical experiments to demonstrate the effectiveness and efficiency of the proposed LMPC scheme for controlling injection speed in IMP.

注塑成型是一种关键的工业流程,以生产速度快、效率高和自动化程度高而著称。注塑机运动速度的控制是影响生产过程的关键因素,直接影响产品质量和效率。本文旨在解决在一类标准注塑机(IMMs)中实现注塑速度最佳跟踪控制这一难题,该注塑机具有非线性动力学特征。为实现这一目标,我们提出了一种基于学习的模型预测控制 (LMPC) 方案,该方案结合了高斯过程回归 (GPR),可对注塑成型过程 (IMP) 中的不确定性进行预测和建模。具体来说,该方案为注塑速度制定了一个非线性跟踪控制问题,利用基于 GPR 的学习残差模型来捕捉不确定性并提供准确预测。它学习 IMM 的动态模型和历史数据,根据目标要求自动调整注塑速度,以实现最佳生产控制。此外,我们还利用控制受限的微分动态编程方法有效地解决了优化问题。最后,我们进行了全面的数值实验,证明了所提出的 LMPC 方案在控制 IMP 喷射速度方面的有效性和效率。
{"title":"A learning-based model predictive control scheme for injection speed tracking in injection molding process","authors":"Zhigang Ren, Jianpu Cai, Bo Zhang, Zongze Wu","doi":"10.1007/s40747-024-01588-9","DOIUrl":"https://doi.org/10.1007/s40747-024-01588-9","url":null,"abstract":"<p>Injection molding is a pivotal industrial process renowned for its high production speed, efficiency, and automation. Controlling the motion speed of injection molding machines is a crucial factor that influences production processes, directly affecting product quality and efficiency. This paper aims to tackle the challenge of achieving optimal tracking control of injection speed in a standard class of injection molding machines (IMMs) characterized by nonlinear dynamics. To achieve this goal, we propose a learning-based model predictive control (LMPC) scheme that incorporates Gaussian process regression (GPR) to predict and model uncertainty in the injection molding process (IMP). Specifically, the scheme formulates a nonlinear tracking control problem for injection speed, utilizing a GPR-based learning residual model to capture uncertainty and provide accurate predictions. It learns the dynamics model and historical data of the IMM, automatically adjusting the injection speed according to target requirements for optimal production control. Additionally, the optimization problem is efficiently solved using a control-constrained differential dynamic programming approach. Finally, we conduct comprehensive numerical experiments to demonstrate the effectiveness and efficiency of the proposed LMPC scheme for controlling injection speed in IMP.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"33 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141891724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HSC: a multi-hierarchy descriptor for loop closure detection in overhead occlusion scenes HSC:用于高空闭塞场景中环路闭合检测的多层描述符
IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-05 DOI: 10.1007/s40747-024-01581-2
Weilong Lv, Wei Zhou, Gang Wang

Loop closure detection is a key technology for robotic navigation. Existing research primarily focuses on feature extraction from global scenes but often neglects local overhead occlusion scenes. In these local scenes, objects such as vehicles, trees, and buildings vary in height, creating a complex multi-layered structure with vertical occlusions. Current methods predominantly employ a single-level extraction strategy to construct descriptors, which fails to capture the characteristics of occluded objects. This limitation results in descriptors with restricted descriptive capabilities. This paper introduces a descriptor named Hierarchy Scan Context (HSC) to address this shortfall. HSC effectively extracts height feature information of objects at different levels in overhead occlusion scenes through hierarchical division, demonstrating enhanced descriptive capabilities. Additionally, a time series enhancement strategy is proposed to reduce the number of algorithmic missed detections. In the experiments, the proposed method is validated using a self-collected dataset and the public KITTI and NCLT datasets, demonstrating superior performance compared to competitive methods. Furthermore, the proposed method also achieves an average maximum F1 score of 0.92 in experiments conducted on nine selected road segments with overhead occlusion.

环路闭合检测是机器人导航的一项关键技术。现有研究主要关注全局场景的特征提取,但往往忽视局部高空遮挡场景。在这些局部场景中,车辆、树木和建筑物等物体的高度各不相同,形成了复杂的多层结构和垂直遮挡。目前的方法主要采用单层提取策略来构建描述符,这种方法无法捕捉到遮挡物体的特征。这种局限性导致描述符的描述能力受到限制。本文引入了一种名为 "层次扫描上下文"(HSC)的描述符来解决这一不足。HSC 通过层次划分,有效地提取了高空遮挡场景中不同层次物体的高度特征信息,显示出更强的描述能力。此外,还提出了一种时间序列增强策略,以减少算法漏检的次数。在实验中,使用自收集的数据集以及公开的 KITTI 和 NCLT 数据集对所提出的方法进行了验证,结果表明该方法的性能优于其他竞争方法。此外,在对九个有高空遮挡的选定路段进行的实验中,所提方法的平均最高 F1 分数也达到了 0.92。
{"title":"HSC: a multi-hierarchy descriptor for loop closure detection in overhead occlusion scenes","authors":"Weilong Lv, Wei Zhou, Gang Wang","doi":"10.1007/s40747-024-01581-2","DOIUrl":"https://doi.org/10.1007/s40747-024-01581-2","url":null,"abstract":"<p>Loop closure detection is a key technology for robotic navigation. Existing research primarily focuses on feature extraction from global scenes but often neglects local overhead occlusion scenes. In these local scenes, objects such as vehicles, trees, and buildings vary in height, creating a complex multi-layered structure with vertical occlusions. Current methods predominantly employ a single-level extraction strategy to construct descriptors, which fails to capture the characteristics of occluded objects. This limitation results in descriptors with restricted descriptive capabilities. This paper introduces a descriptor named Hierarchy Scan Context (HSC) to address this shortfall. HSC effectively extracts height feature information of objects at different levels in overhead occlusion scenes through hierarchical division, demonstrating enhanced descriptive capabilities. Additionally, a time series enhancement strategy is proposed to reduce the number of algorithmic missed detections. In the experiments, the proposed method is validated using a self-collected dataset and the public KITTI and NCLT datasets, demonstrating superior performance compared to competitive methods. Furthermore, the proposed method also achieves an average maximum F1 score of 0.92 in experiments conducted on nine selected road segments with overhead occlusion.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"100 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141891725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HRDLNet: a semantic segmentation network with high resolution representation for urban street view images HRDLNet:为城市街景图像提供高分辨率表示的语义分割网络
IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-05 DOI: 10.1007/s40747-024-01582-1
Wenyi Chen, Zongcheng Miao, Yang Qu, Guokai Shi

Semantic segmentation of urban street scenes has attracted much attention in the field of autonomous driving, which not only helps vehicles perceive the environment in real time, but also significantly improves the decision-making ability of autonomous driving systems. However, most of the current methods based on Convolutional Neural Network (CNN) mainly use coding the input image to a low resolution and then try to recover the high resolution, which leads to problems such as loss of spatial information, accumulation of errors, and difficulty in dealing with large-scale changes. To address these problems, in this paper, we propose a new semantic segmentation network (HRDLNet) for urban street scene images with high-resolution representation, which improves the accuracy of segmentation by always maintaining a high-resolution representation of the image. Specifically, we propose a feature extraction module (FHR) with high-resolution representation, which efficiently handles multi-scale targets and high-resolution image information by efficiently fusing high-resolution information and multi-scale features. Secondly, we design a multi-scale feature extraction enhancement (MFE) module, which significantly expands the sensory field of the network, thus enhancing the ability to capture correlations between image details and global contextual information. In addition, we introduce a dual-attention mechanism module (CSD), which dynamically adjusts the network to more accurately capture subtle features and rich semantic information in images. We trained and evaluated HRDLNet on the Cityscapes Dataset and the PASCAL VOC 2012 Augmented Dataset, and verified the model’s excellent performance in the field of urban streetscape image segmentation. The unique advantages of our proposed HRDLNet in the field of semantic segmentation of urban streetscapes are also verified by comparing it with the state-of-the-art methods.

城市街道场景的语义分割在自动驾驶领域备受关注,它不仅能帮助车辆实时感知环境,还能显著提高自动驾驶系统的决策能力。然而,目前大多数基于卷积神经网络(CNN)的方法主要是将输入图像编码为低分辨率,然后再尝试恢复高分辨率,这导致了空间信息丢失、误差积累、难以处理大规模变化等问题。针对这些问题,本文提出了一种新的城市街道场景图像高分辨率表示语义分割网络(HRDLNet),通过始终保持图像的高分辨率表示来提高分割的准确性。具体而言,我们提出了具有高分辨率表示的特征提取模块(FHR),通过有效融合高分辨率信息和多尺度特征,高效处理多尺度目标和高分辨率图像信息。其次,我们设计了一个多尺度特征提取增强模块(MFE),大大扩展了网络的感知领域,从而增强了捕捉图像细节与全局上下文信息之间相关性的能力。此外,我们还引入了双重关注机制模块(CSD),该模块可动态调整网络,以更准确地捕捉图像中的细微特征和丰富语义信息。我们在城市景观数据集(Cityscapes Dataset)和 PASCAL VOC 2012 增强数据集(PASCAL VOC 2012 Augmented Dataset)上对 HRDLNet 进行了训练和评估,验证了该模型在城市街景图像分割领域的卓越性能。通过与最先进的方法进行比较,我们提出的 HRDLNet 在城市街景语义分割领域的独特优势也得到了验证。
{"title":"HRDLNet: a semantic segmentation network with high resolution representation for urban street view images","authors":"Wenyi Chen, Zongcheng Miao, Yang Qu, Guokai Shi","doi":"10.1007/s40747-024-01582-1","DOIUrl":"https://doi.org/10.1007/s40747-024-01582-1","url":null,"abstract":"<p>Semantic segmentation of urban street scenes has attracted much attention in the field of autonomous driving, which not only helps vehicles perceive the environment in real time, but also significantly improves the decision-making ability of autonomous driving systems. However, most of the current methods based on Convolutional Neural Network (CNN) mainly use coding the input image to a low resolution and then try to recover the high resolution, which leads to problems such as loss of spatial information, accumulation of errors, and difficulty in dealing with large-scale changes. To address these problems, in this paper, we propose a new semantic segmentation network (HRDLNet) for urban street scene images with high-resolution representation, which improves the accuracy of segmentation by always maintaining a high-resolution representation of the image. Specifically, we propose a feature extraction module (FHR) with high-resolution representation, which efficiently handles multi-scale targets and high-resolution image information by efficiently fusing high-resolution information and multi-scale features. Secondly, we design a multi-scale feature extraction enhancement (MFE) module, which significantly expands the sensory field of the network, thus enhancing the ability to capture correlations between image details and global contextual information. In addition, we introduce a dual-attention mechanism module (CSD), which dynamically adjusts the network to more accurately capture subtle features and rich semantic information in images. We trained and evaluated HRDLNet on the Cityscapes Dataset and the PASCAL VOC 2012 Augmented Dataset, and verified the model’s excellent performance in the field of urban streetscape image segmentation. The unique advantages of our proposed HRDLNet in the field of semantic segmentation of urban streetscapes are also verified by comparing it with the state-of-the-art methods.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"44 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141891730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-selective receptive field network for person re-identification 用于人员再识别的自选择感受野网络
IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-05 DOI: 10.1007/s40747-024-01565-2
Shaoqi Hou, Xueting liu, Chenyu Wu, Guangqiang Yin, Xinzhong Wang, Zhiguo Wang

Person Re-identification (Re-ID) technology aims to solve the matching problem of the same pedestrians at different times and places, which has important application value in the field of public safety. At present, most scholars focus on designing complex models to improve the accuracy of Re-ID, but the high complexity of the model further restricts the practical application of Re-ID algorithm. To solve the above problems, this paper designs a lightweight Self-selective Receptive Field (SRF) block instead of directly designing complex models. Specifically, the module can be plug-and-play on the general backbone network, so as to significantly improve the performance of Re-ID while effectively controlling the amount of its own parameter and calculation: (1) the SRF block encodes pedestrian targets and image contexts at different scales by constructing pyramidal convolution group and allows the module to independently select the size of the receptive field through training by means of self-adaptive weighting; (2) in order to reduce the complexity of SRF block, we introduce a "channel scaling factor" and design a "grouped convolution operation" by constraining the channels of the feature map and changing the structure of the convolution kernel respectively. Experiments on multiple datasets show that SRF Network (SRFNet) for Re-ID can achieve a good balance between performance and complexity, which fully demonstrates the effectiveness of SRF block.

人员再识别(Re-ID)技术旨在解决同一行人在不同时间、不同地点的匹配问题,在公共安全领域具有重要的应用价值。目前,大多数学者都侧重于设计复杂的模型来提高 Re-ID 的准确性,但模型的高复杂性进一步限制了 Re-ID 算法的实际应用。为了解决上述问题,本文设计了一种轻量级的自选择接收场(SRF)模块,而不是直接设计复杂的模型。具体来说,该模块可以在一般骨干网络上即插即用,从而在有效控制自身参数量和计算量的同时,显著提高 Re-ID 的性能:(1)SRF 块通过构建金字塔卷积组对不同尺度的行人目标和图像上下文进行编码,并通过自适应加权允许模块通过训练自主选择感受野的大小;(2)为了降低 SRF 块的复杂度,我们引入了 "通道缩放因子",并分别通过约束特征图的通道和改变卷积核的结构设计了 "分组卷积运算"。在多个数据集上的实验表明,用于 Re-ID 的 SRF 网络(SRFNet)可以在性能和复杂度之间取得良好的平衡,这充分证明了 SRF 块的有效性。
{"title":"Self-selective receptive field network for person re-identification","authors":"Shaoqi Hou, Xueting liu, Chenyu Wu, Guangqiang Yin, Xinzhong Wang, Zhiguo Wang","doi":"10.1007/s40747-024-01565-2","DOIUrl":"https://doi.org/10.1007/s40747-024-01565-2","url":null,"abstract":"<p>Person Re-identification (Re-ID) technology aims to solve the matching problem of the same pedestrians at different times and places, which has important application value in the field of public safety. At present, most scholars focus on designing complex models to improve the accuracy of Re-ID, but the high complexity of the model further restricts the practical application of Re-ID algorithm. To solve the above problems, this paper designs a lightweight Self-selective Receptive Field (SRF) block instead of directly designing complex models. Specifically, the module can be plug-and-play on the general backbone network, so as to significantly improve the performance of Re-ID while effectively controlling the amount of its own parameter and calculation: (1) the SRF block encodes pedestrian targets and image contexts at different scales by constructing pyramidal convolution group and allows the module to independently select the size of the receptive field through training by means of self-adaptive weighting; (2) in order to reduce the complexity of SRF block, we introduce a \"channel scaling factor\" and design a \"grouped convolution operation\" by constraining the channels of the feature map and changing the structure of the convolution kernel respectively. Experiments on multiple datasets show that SRF Network (SRFNet) for Re-ID can achieve a good balance between performance and complexity, which fully demonstrates the effectiveness of SRF block.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"22 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141891723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Latent-SDE: guiding stochastic differential equations in latent space for unpaired image-to-image translation Latent-SDE:引导潜空间随机微分方程实现无配对图像到图像的平移
IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-01 DOI: 10.1007/s40747-024-01566-1
Xianjie Zhang, Min Li, Yujie He, Yao Gou, Yusen Zhang

Score-based diffusion models have shown promising results in unpaired image-to-image translation (I2I). However, the existing methods only perform unpaired I2I in pixel space, which requires high computation costs. To this end, we propose guiding stochastic differential equations in latent space (Latent-SDE) that extracts domain-specific and domain-independent features of the image in the latent space to calculate the loss and guides the inference process of a pretrained SDE in the latent space for unpaired I2I. To refine the image in the latent space, we propose a latent time-travel strategy that increases the sampling timestep. Empirically, we compare Latent-SDE to the baseline of the score-based diffusion model on three widely adopted unpaired I2I tasks under two metrics. Latent-SDE achieves state-of-the-art on Cat (rightarrow ) Dog and is competitive on the other two tasks. Our code will be freely available for public use upon acceptance at https://github.com/zhangXJ147/Latent-SDE.

基于分数的扩散模型在无配对图像到图像平移(I2I)中显示出良好的效果。然而,现有方法只能在像素空间执行非配对 I2I,这需要很高的计算成本。为此,我们提出了潜在空间中的指导性随机微分方程(Latent-SDE),它能提取潜在空间中图像的特定领域和独立于领域的特征来计算损失,并指导潜在空间中预训练的 SDE 的推理过程,以实现非配对 I2I。为了完善潜空间中的图像,我们提出了一种增加采样时间步的潜时间旅行策略。在两个指标下,我们将 Latent-SDE 与基于分数的扩散模型基线在三个广泛采用的非配对 I2I 任务中进行了实证比较。Latent-SDE 在 Cat (rightarrow ) Dog 上达到了最先进水平,在其他两个任务上也具有竞争力。我们的代码将在 https://github.com/zhangXJ147/Latent-SDE 上被接受后免费提供给公众使用。
{"title":"Latent-SDE: guiding stochastic differential equations in latent space for unpaired image-to-image translation","authors":"Xianjie Zhang, Min Li, Yujie He, Yao Gou, Yusen Zhang","doi":"10.1007/s40747-024-01566-1","DOIUrl":"https://doi.org/10.1007/s40747-024-01566-1","url":null,"abstract":"<p>Score-based diffusion models have shown promising results in unpaired image-to-image translation (I2I). However, the existing methods only perform unpaired I2I in pixel space, which requires high computation costs. To this end, we propose guiding stochastic differential equations in latent space (Latent-SDE) that extracts domain-specific and domain-independent features of the image in the latent space to calculate the loss and guides the inference process of a pretrained SDE in the latent space for unpaired I2I. To refine the image in the latent space, we propose a latent time-travel strategy that increases the sampling timestep. Empirically, we compare Latent-SDE to the baseline of the score-based diffusion model on three widely adopted unpaired I2I tasks under two metrics. Latent-SDE achieves state-of-the-art on Cat <span>(rightarrow )</span> Dog and is competitive on the other two tasks. Our code will be freely available for public use upon acceptance at https://github.com/zhangXJ147/Latent-SDE.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"356 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141862408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Complex & Intelligent Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1