首页 > 最新文献

Engineering Applications of Artificial Intelligence最新文献

英文 中文
Analysis of computer communication networks based on evaluation of domination and double domination for interval-valued T-spherical fuzzy graphs and their applications in decision-making problems 基于区间值 T 球形模糊图的支配和双重支配评价的计算机通信网络分析及其在决策问题中的应用
IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-15 DOI: 10.1016/j.engappai.2024.109650
Sami Ullah Khan , Fiaz Hussain , Tapan Senapati , Shoukat Hussain , Zeeshan Ali , Domokos Esztergár-Kiss , Sarbast Moslem
This research introduces the Interval-Valued T-Spherical Fuzzy Graph (IVTSFG), a novel extension of fuzzy graph theory designed to address imprecision in decision-making processes, network analysis, and Computer Communication Networks (CCNs). Integrating four types of membership degrees-membership, non-membership, abstinence, and hesitancy-the IVTSFG framework significantly enhances the ability to model and analyze complex systems with uncertain data. The study explores the theories of domination and double domination within the context of IVTSFGs, presenting new methods for evaluating network resilience and optimization. Key findings include the development of innovative techniques for applying domination and double domination in IVTSFGs, demonstrating improved performance in managing CCNs. Comparative analysis with existing fuzzy graph models highlights the advantages of IVTSFGs, particularly in capturing nuanced relationships within network structures. The research provides practical examples and empirical comparisons, showcasing the framework's effectiveness in various decision-making scenarios.
本研究介绍了区间值 T 球形模糊图(IVTSFG),它是模糊图理论的一种新扩展,旨在解决决策过程、网络分析和计算机通信网络(CCN)中的不确定性问题。IVTSFG 框架集成了四种成员度--成员度、非成员度、弃权度和犹豫度,大大提高了对具有不确定数据的复杂系统进行建模和分析的能力。该研究在 IVTSFG 的背景下探讨了支配和双重支配理论,提出了评估网络弹性和优化的新方法。研究的主要发现包括开发了在 IVTSFGs 中应用支配和双重支配的创新技术,从而提高了 CCNs 的管理性能。与现有模糊图模型的对比分析凸显了 IVTSFG 的优势,尤其是在捕捉网络结构中的细微关系方面。研究提供了实际案例和经验比较,展示了该框架在各种决策场景中的有效性。
{"title":"Analysis of computer communication networks based on evaluation of domination and double domination for interval-valued T-spherical fuzzy graphs and their applications in decision-making problems","authors":"Sami Ullah Khan ,&nbsp;Fiaz Hussain ,&nbsp;Tapan Senapati ,&nbsp;Shoukat Hussain ,&nbsp;Zeeshan Ali ,&nbsp;Domokos Esztergár-Kiss ,&nbsp;Sarbast Moslem","doi":"10.1016/j.engappai.2024.109650","DOIUrl":"10.1016/j.engappai.2024.109650","url":null,"abstract":"<div><div>This research introduces the Interval-Valued T-Spherical Fuzzy Graph (IVTSFG), a novel extension of fuzzy graph theory designed to address imprecision in decision-making processes, network analysis, and Computer Communication Networks (CCNs). Integrating four types of membership degrees-membership, non-membership, abstinence, and hesitancy-the IVTSFG framework significantly enhances the ability to model and analyze complex systems with uncertain data. The study explores the theories of domination and double domination within the context of IVTSFGs, presenting new methods for evaluating network resilience and optimization. Key findings include the development of innovative techniques for applying domination and double domination in IVTSFGs, demonstrating improved performance in managing CCNs. Comparative analysis with existing fuzzy graph models highlights the advantages of IVTSFGs, particularly in capturing nuanced relationships within network structures. The research provides practical examples and empirical comparisons, showcasing the framework's effectiveness in various decision-making scenarios.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109650"},"PeriodicalIF":7.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Word-Sequence Entropy: Towards uncertainty estimation in free-form medical question answering applications and beyond 词序熵:在自由格式医学问题解答应用及其他应用中实现不确定性估计
IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-15 DOI: 10.1016/j.engappai.2024.109553
Zhiyuan Wang , Jinhao Duan , Chenxi Yuan , Qingyu Chen , Tianlong Chen , Yue Zhang , Ren Wang , Xiaoshuang Shi , Kaidi Xu
Uncertainty estimation is crucial for the reliability of safety-critical human and artificial intelligence (AI) interaction systems, particularly in the domain of healthcare engineering. However, a robust and general uncertainty measure for free-form answers has not been well-established in open-ended medical question-answering (QA) tasks, where generative inequality introduces a large number of irrelevant words and sequences within the generated set for uncertainty quantification (UQ), which can lead to biases. This paper proposes Word-Sequence Entropy (WSE), which calibrates uncertainty at both the word and sequence levels based on semantic relevance, highlighting keywords and enlarging the generative probability of trustworthy responses when performing UQ. We compare WSE with six baseline methods on five free-form medical QA datasets, utilizing seven popular large language models (LLMs), and demonstrate that WSE exhibits superior performance in accurate UQ under two standard criteria for correctness evaluation. Additionally, in terms of the potential for real-world medical QA applications, we achieve a significant enhancement (e.g., a 6.36% improvement in model accuracy on the COVID-QA dataset) in the performance of LLMs when employing responses with lower uncertainty that are identified by WSE as final answers, without requiring additional task-specific fine-tuning or architectural modifications.
不确定性估计对于安全关键型人类和人工智能(AI)交互系统的可靠性至关重要,尤其是在医疗保健工程领域。然而,在开放式医疗问题解答(QA)任务中,针对自由形式答案的稳健且通用的不确定性度量方法尚未得到很好的确立,因为在不确定性量化(UQ)的生成集合中,生成不等式引入了大量不相关的单词和序列,这可能会导致偏差。本文提出了单词-序列熵(WSE),它能根据语义相关性在单词和序列层面校准不确定性,在进行不确定性量化时突出关键词并扩大可信回答的生成概率。我们利用七种流行的大型语言模型(LLM),在五个自由形式的医疗质量保证数据集上比较了 WSE 和六种基线方法,结果表明 WSE 在两个标准的正确性评估标准下,在准确的 UQ 方面表现出更优越的性能。此外,就实际医疗质量保证应用的潜力而言,当采用 WSE 识别出的不确定性较低的回答作为最终答案时,我们显著提高了 LLM 的性能(例如,在 COVID-QA 数据集上模型准确率提高了 6.36%),而无需额外的特定任务微调或架构修改。
{"title":"Word-Sequence Entropy: Towards uncertainty estimation in free-form medical question answering applications and beyond","authors":"Zhiyuan Wang ,&nbsp;Jinhao Duan ,&nbsp;Chenxi Yuan ,&nbsp;Qingyu Chen ,&nbsp;Tianlong Chen ,&nbsp;Yue Zhang ,&nbsp;Ren Wang ,&nbsp;Xiaoshuang Shi ,&nbsp;Kaidi Xu","doi":"10.1016/j.engappai.2024.109553","DOIUrl":"10.1016/j.engappai.2024.109553","url":null,"abstract":"<div><div>Uncertainty estimation is crucial for the reliability of safety-critical human and artificial intelligence (AI) interaction systems, particularly in the domain of healthcare engineering. However, a robust and general uncertainty measure for free-form answers has not been well-established in open-ended medical question-answering (QA) tasks, where generative inequality introduces a large number of irrelevant words and sequences within the generated set for uncertainty quantification (UQ), which can lead to biases. This paper proposes Word-Sequence Entropy (<em>WSE</em>), which calibrates uncertainty at both the word and sequence levels based on semantic relevance, highlighting keywords and enlarging the generative probability of trustworthy responses when performing UQ. We compare <em>WSE</em> with six baseline methods on five free-form medical QA datasets, utilizing seven popular large language models (LLMs), and demonstrate that <em>WSE</em> exhibits superior performance in accurate UQ under two standard criteria for correctness evaluation. Additionally, in terms of the potential for real-world medical QA applications, we achieve a significant enhancement (e.g., a 6.36% improvement in model accuracy on the COVID-QA dataset) in the performance of LLMs when employing responses with lower uncertainty that are identified by <em>WSE</em> as final answers, without requiring additional task-specific fine-tuning or architectural modifications.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109553"},"PeriodicalIF":7.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attention based network for fusion of polarimetric and contextual features for polarimetric synthetic aperture radar image classification 基于注意力的网络,用于融合偏振和上下文特征,进行偏振合成孔径雷达图像分类
IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-15 DOI: 10.1016/j.engappai.2024.109665
Maryam Imani
Polarimetric synthetic aperture radar (PolSAR) images containing polarimetric, scattering and contextual features are useful radar data for ground surface classification. Appropriate feature extraction and fusion by using a small set of available labeled samples is an important and challenging task. Several transformers with self-attention mechanism have recently achieved great success for PolSAR image classification. While almost all methods just exploit the self-attention features from the PolSAR cube, the feature fusion method proposed in this work, which is called attention based scattering and contextual (ASC) network, utilizes the polarimetric self-attention beside two cross-attention blocks. The cross-attention blocks extract the polarimetric-scattering dependencies and polarimetric-contextual interactions, individually. The proposed ASC network uses three inputs: the PolSAR cube, the scattering feature maps obtained by clustering of the entropy-alpha features, and the segmentation maps obtained by a super-pixel generation algorithm. The features extracted by self- and cross-attention blocks are fused together, and the residual learning improves the feature learning. While transformers and attention-based networks usually need large training sets, the proposed ASC network shows high efficiency with relatively low number of training samples in various real and synthetic PolSAR images. For example, in the Flevoland PolSAR image containing 15 classes acquired by AIRSAR in L-band, with using 100 training samples per class (less than 1% of labeled samples), the ASC network achieves the overall accuracy of 99.51, which is statistically preferred than the self-attention-based network according to the McNemars test.
极坐标合成孔径雷达(PolSAR)图像包含极坐标、散射和上下文特征,是地表分类的有用雷达数据。利用少量可用的标注样本集进行适当的特征提取和融合是一项重要而具有挑战性的任务。最近,一些具有自注意机制的变换器在 PolSAR 图像分类方面取得了巨大成功。几乎所有的方法都只是利用 PolSAR 立方体中的自注意特征,而本研究提出的特征融合方法被称为基于注意散射和上下文(ASC)网络,它利用了偏振自注意和两个交叉注意块。交叉注意块分别提取极坐标-散射相关性和极坐标-上下文相互作用。拟议的 ASC 网络使用三个输入:PolSAR 立方体、通过熵-α 特征聚类获得的散射特征图,以及通过超级像素生成算法获得的分割图。自注意力和交叉注意力区块提取的特征融合在一起,残差学习改进了特征学习。变换器和基于注意力的网络通常需要大量的训练集,而所提出的 ASC 网络在各种真实和合成 PolSAR 图像中,只需要相对较少的训练样本,就能显示出较高的效率。例如,在由 AIRSAR 在 L 波段获取的包含 15 个类别的 Flevoland PolSAR 图像中,ASC 网络在每个类别使用 100 个训练样本(不到标注样本的 1%)的情况下,总体准确率达到 99.51,根据 McNemars 检验,在统计学上优于基于自我注意的网络。
{"title":"Attention based network for fusion of polarimetric and contextual features for polarimetric synthetic aperture radar image classification","authors":"Maryam Imani","doi":"10.1016/j.engappai.2024.109665","DOIUrl":"10.1016/j.engappai.2024.109665","url":null,"abstract":"<div><div>Polarimetric synthetic aperture radar (PolSAR) images containing polarimetric, scattering and contextual features are useful radar data for ground surface classification. Appropriate feature extraction and fusion by using a small set of available labeled samples is an important and challenging task. Several transformers with self-attention mechanism have recently achieved great success for PolSAR image classification. While almost all methods just exploit the self-attention features from the PolSAR cube, the feature fusion method proposed in this work, which is called attention based scattering and contextual (ASC) network, utilizes the polarimetric self-attention beside two cross-attention blocks. The cross-attention blocks extract the polarimetric-scattering dependencies and polarimetric-contextual interactions, individually. The proposed ASC network uses three inputs: the PolSAR cube, the scattering feature maps obtained by clustering of the entropy-alpha features, and the segmentation maps obtained by a super-pixel generation algorithm. The features extracted by self- and cross-attention blocks are fused together, and the residual learning improves the feature learning. While transformers and attention-based networks usually need large training sets, the proposed ASC network shows high efficiency with relatively low number of training samples in various real and synthetic PolSAR images. For example, in the Flevoland PolSAR image containing 15 classes acquired by AIRSAR in L-band, with using 100 training samples per class (less than 1% of labeled samples), the ASC network achieves the overall accuracy of 99.51, which is statistically preferred than the self-attention-based network according to the McNemars test.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109665"},"PeriodicalIF":7.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fine-tuning language model embeddings to reveal domain knowledge: An explainable artificial intelligence perspective on medical decision making 微调语言模型嵌入以揭示领域知识:从可解释的人工智能角度看医疗决策
IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-15 DOI: 10.1016/j.engappai.2024.109561
Ceca Kraišniković , Robert Harb , Markus Plass , Wael Al Zoughbi , Andreas Holzinger , Heimo Müller
Integrating large language models (LLMs) to retrieve targeted medical knowledge from electronic health records enables significant advancements in medical research. However, recognizing the challenges associated with using LLMs in healthcare is essential for successful implementation. One challenge is that medical records combine unstructured textual information with highly sensitive personal data. This, in turn, highlights the need for explainable Artificial Intelligence (XAI) methods to understand better how LLMs function in the medical domain. In this study, we propose a novel XAI tool to accelerate data-driven cancer research. We apply the Bidirectional Encoder Representations from Transformers (BERT) model to German language pathology reports examining the effects of domain-specific language adaptation and fine-tuning. We demonstrate our model on a real-world pathology dataset, analyzing the contextual representations of diagnostic reports. By illustrating decisions made by fine-tuned models, we provide decision values that can be applied in medical research. To address interpretability, we conduct a performance evaluation of the classifications generated by our fine-tuned model, as assessed by an expert pathologist. In domains such as medicine, inspection of the medical knowledge map in conjunction with expert evaluation reveals valuable information about how contextual representations of key disease features are categorized. This ultimately benefits data structuring and labeling and paves the way for even more advanced approaches to XAI, combining text with other input modalities, such as images which are then applicable to various engineering problems.
整合大型语言模型(LLMs),从电子健康记录中检索目标医学知识,可使医学研究取得重大进展。然而,认识到在医疗保健领域使用 LLMs 所面临的挑战对于成功实施 LLMs 至关重要。挑战之一是医疗记录结合了非结构化文本信息和高度敏感的个人数据。这反过来又凸显了对可解释人工智能(XAI)方法的需求,以更好地理解 LLM 在医疗领域的功能。在本研究中,我们提出了一种新型 XAI 工具,以加速数据驱动的癌症研究。我们将来自变换器的双向编码器表征(BERT)模型应用于德语病理学报告,以检验特定领域语言适应和微调的效果。我们在真实病理数据集上演示了我们的模型,分析了诊断报告的上下文表征。通过说明微调模型做出的决策,我们提供了可应用于医学研究的决策值。为了解决可解释性问题,我们对微调模型生成的分类进行了性能评估,由病理专家进行评估。在医学等领域,结合专家评估对医学知识图谱进行检查,可以发现关于关键疾病特征的上下文表示如何分类的宝贵信息。这最终有利于数据的结构化和标签化,并为更先进的 XAI 方法铺平了道路,将文本与其他输入模式(如图像)相结合,从而适用于各种工程问题。
{"title":"Fine-tuning language model embeddings to reveal domain knowledge: An explainable artificial intelligence perspective on medical decision making","authors":"Ceca Kraišniković ,&nbsp;Robert Harb ,&nbsp;Markus Plass ,&nbsp;Wael Al Zoughbi ,&nbsp;Andreas Holzinger ,&nbsp;Heimo Müller","doi":"10.1016/j.engappai.2024.109561","DOIUrl":"10.1016/j.engappai.2024.109561","url":null,"abstract":"<div><div>Integrating large language models (LLMs) to retrieve targeted medical knowledge from electronic health records enables significant advancements in medical research. However, recognizing the challenges associated with using LLMs in healthcare is essential for successful implementation. One challenge is that medical records combine unstructured textual information with highly sensitive personal data. This, in turn, highlights the need for explainable Artificial Intelligence (XAI) methods to understand better how LLMs function in the medical domain. In this study, we propose a novel XAI tool to accelerate data-driven cancer research. We apply the Bidirectional Encoder Representations from Transformers (BERT) model to German language pathology reports examining the effects of domain-specific language adaptation and fine-tuning. We demonstrate our model on a real-world pathology dataset, analyzing the contextual representations of diagnostic reports. By illustrating decisions made by fine-tuned models, we provide decision values that can be applied in medical research. To address interpretability, we conduct a performance evaluation of the classifications generated by our fine-tuned model, as assessed by an expert pathologist. In domains such as medicine, inspection of the medical knowledge map in conjunction with expert evaluation reveals valuable information about how contextual representations of key disease features are categorized. This ultimately benefits data structuring and labeling and paves the way for even more advanced approaches to XAI, combining text with other input modalities, such as images which are then applicable to various engineering problems.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109561"},"PeriodicalIF":7.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142658988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online prediction of hydro-pneumatic tensioner system of floating platform under internal waves 内波下浮式平台水气张紧器系统的在线预测
IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-15 DOI: 10.1016/j.engappai.2024.109656
Jianwei Wang , Xiaofan Jin , Xuchu Liu , Ze He , Jiachen Chai , Pengfa Liu , Yuqing Wang , Wei Cai , Rui Guo
To address the issue of low accuracy in the current motion response prediction model of the floating platform tensioner system, this paper proposes an online prediction method that integrates Empirical Mode Decomposition (EMD), Kernel Principal Component Analysis (KPCA), and Long Short-Term Memory (LSTM). The EMD technique is employed to decompose the sequence of environmental factors, reducing their non-stationarity. Subsequently, KPCA is used to extract key influencing factors and reduce input dimensionality. Finally, LSTM neural networks are applied to capture long-term dependencies in features and make accurate predictions. By validating the model using motion response data from the tensioner platform device under two scenarios with and without internal waves, it is compared against other models. The results show that the EMD-KPCA-LSTM model has high prediction accuracy in both scenarios. In particular, compared with the Convolutional Neural Network (CNN) model, the mean Root Mean Square Error (RMSE), Mean Absolute Error (MAE) and Mean Absolute Percentage Error (MAPE) of the displacement and tension of the system decreased by 52.97%, 55.14%, 56.31%, 68.97%, 71.02% 57.60%, respectively, and R-square (R2) increased by 7.14% and 12.37%. In summary, the model has a good ability for data fitting and high prediction accuracy and has important practical value.
针对目前浮动平台拉伸器系统运动响应预测模型精度较低的问题,本文提出了一种集成了经验模式分解(EMD)、核主成分分析(KPCA)和长短期记忆(LSTM)的在线预测方法。EMD 技术用于分解环境因素序列,减少其非平稳性。随后,利用 KPCA 提取关键影响因素并降低输入维度。最后,应用 LSTM 神经网络捕捉特征中的长期依赖关系,并做出准确预测。通过使用拉伸器平台装置在有内波和无内波两种情况下的运动响应数据对模型进行验证,并与其他模型进行比较。结果表明,EMD-KPCA-LSTM 模型在两种情况下都具有很高的预测精度。其中,与卷积神经网络(CNN)模型相比,系统位移和拉力的平均均方根误差(RMSE)、平均绝对误差(MAE)和平均绝对百分比误差(MAPE)分别降低了 52.97%、55.14%、56.31%、68.97%、71.02%、57.60%,R 平方(R2)分别提高了 7.14%和 12.37%。综上所述,该模型具有良好的数据拟合能力和较高的预测精度,具有重要的实用价值。
{"title":"Online prediction of hydro-pneumatic tensioner system of floating platform under internal waves","authors":"Jianwei Wang ,&nbsp;Xiaofan Jin ,&nbsp;Xuchu Liu ,&nbsp;Ze He ,&nbsp;Jiachen Chai ,&nbsp;Pengfa Liu ,&nbsp;Yuqing Wang ,&nbsp;Wei Cai ,&nbsp;Rui Guo","doi":"10.1016/j.engappai.2024.109656","DOIUrl":"10.1016/j.engappai.2024.109656","url":null,"abstract":"<div><div>To address the issue of low accuracy in the current motion response prediction model of the floating platform tensioner system, this paper proposes an online prediction method that integrates Empirical Mode Decomposition (EMD), Kernel Principal Component Analysis (KPCA), and Long Short-Term Memory (LSTM). The EMD technique is employed to decompose the sequence of environmental factors, reducing their non-stationarity. Subsequently, KPCA is used to extract key influencing factors and reduce input dimensionality. Finally, LSTM neural networks are applied to capture long-term dependencies in features and make accurate predictions. By validating the model using motion response data from the tensioner platform device under two scenarios with and without internal waves, it is compared against other models. The results show that the EMD-KPCA-LSTM model has high prediction accuracy in both scenarios. In particular, compared with the Convolutional Neural Network (CNN) model, the mean Root Mean Square Error (RMSE), Mean Absolute Error (MAE) and Mean Absolute Percentage Error (MAPE) of the displacement and tension of the system decreased by 52.97%, 55.14%, 56.31%, 68.97%, 71.02% 57.60%, respectively, and R-square (R<sup>2</sup>) increased by 7.14% and 12.37%. In summary, the model has a good ability for data fitting and high prediction accuracy and has important practical value.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109656"},"PeriodicalIF":7.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diameter-adjustable mandrel for thin-wall tube bending and its domain knowledge-integrated optimization design framework 用于薄壁管弯曲的直径可调心轴及其整合领域知识的优化设计框架
IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-15 DOI: 10.1016/j.engappai.2024.109634
Zili Wang , Jie Li , Xiaojian Liu , Shuyou Zhang , Yaochen Lin , Jianrong Tan
In response to the growing demand for small-batch bending tube production, traditional bending dies require separate customization for each tube size, resulting in extended design cycles and high costs. To meet bending requirements for tubes of different diameters using a single mandrel, a novel adjustable diameter mechanism (DAM) and its optimization design method are proposed. Initially, the DAM based on a planetary bevel gear-screw transmission set is developed for bending tubes of varying diameters. Subsequently, a domain knowledge-integrated optimization design framework is introduced. To reduce the cost of acquiring training samples for training surrogate models, a monotonicity-constrained neural network based on cascade boosting architecture (CB-MCNN) is introduced that enhances prediction accuracy while maintaining monotonicity. To improve the optimization speed and quality of Evolutionary Algorithms (EAs), a domain knowledge-guided EA (DK-EA) method is proposed, incorporating domain knowledge into the population initialization phase. The results indicate that: (1) CB-MCNN outperforms traditional methods and shows excellent performance on small-sample datasets. (2) DK-EA accelerates optimization processes and produces better outcomes. As a result, the domain knowledge-integrated optimization design framework enables the DAM to achieve a wider diameter variation range and enhanced reliability. The optimized DAM demonstrates the capability to bend tubes with diameters of 46–60 mm.
为满足日益增长的小批量弯管生产需求,传统的弯管模具需要针对每种管材尺寸进行单独定制,导致设计周期延长、成本高昂。为满足使用单一芯轴弯曲不同直径管材的要求,我们提出了一种新型可调直径机构(DAM)及其优化设计方法。首先,开发了基于行星锥齿轮-螺杆传动装置的可调直径机构,用于弯曲不同直径的管材。随后,引入了一个整合领域知识的优化设计框架。为了降低训练代用模型时获取训练样本的成本,引入了基于级联提升架构的单调性受限神经网络(CB-MCNN),在保持单调性的同时提高了预测精度。为了提高进化算法(EA)的优化速度和质量,提出了一种领域知识指导的进化算法(DK-EA)方法,将领域知识纳入种群初始化阶段。结果表明(1) CB-MCNN 优于传统方法,在小样本数据集上表现出色。(2) DK-EA 加快了优化过程,并产生了更好的结果。因此,整合了领域知识的优化设计框架使 DAM 的直径变化范围更广,可靠性更高。优化后的 DAM 能够弯曲直径为 46-60 毫米的管道。
{"title":"Diameter-adjustable mandrel for thin-wall tube bending and its domain knowledge-integrated optimization design framework","authors":"Zili Wang ,&nbsp;Jie Li ,&nbsp;Xiaojian Liu ,&nbsp;Shuyou Zhang ,&nbsp;Yaochen Lin ,&nbsp;Jianrong Tan","doi":"10.1016/j.engappai.2024.109634","DOIUrl":"10.1016/j.engappai.2024.109634","url":null,"abstract":"<div><div>In response to the growing demand for small-batch bending tube production, traditional bending dies require separate customization for each tube size, resulting in extended design cycles and high costs. To meet bending requirements for tubes of different diameters using a single mandrel, a novel adjustable diameter mechanism (DAM) and its optimization design method are proposed. Initially, the DAM based on a planetary bevel gear-screw transmission set is developed for bending tubes of varying diameters. Subsequently, a domain knowledge-integrated optimization design framework is introduced. To reduce the cost of acquiring training samples for training surrogate models, a monotonicity-constrained neural network based on cascade boosting architecture (CB-MCNN) is introduced that enhances prediction accuracy while maintaining monotonicity. To improve the optimization speed and quality of Evolutionary Algorithms (EAs), a domain knowledge-guided EA (DK-EA) method is proposed, incorporating domain knowledge into the population initialization phase. The results indicate that: (1) CB-MCNN outperforms traditional methods and shows excellent performance on small-sample datasets. (2) DK-EA accelerates optimization processes and produces better outcomes. As a result, the domain knowledge-integrated optimization design framework enables the DAM to achieve a wider diameter variation range and enhanced reliability. The optimized DAM demonstrates the capability to bend tubes with diameters of 46–60 mm.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109634"},"PeriodicalIF":7.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Laparoscopic stereo matching using 3-Dimensional Fourier transform with full multi-scale features 利用三维傅立叶变换与全多尺度特征进行腹腔镜立体匹配
IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-15 DOI: 10.1016/j.engappai.2024.109654
Renkai Wu , Pengchen Liang , Yinghao Liu , Yiqi Huang , Wangyan Li , Qing Chang
3-Dimensional (3D) reconstruction of laparoscopic surgical scenes is a key task for future surgical navigation and automated robotic minimally invasive surgery. Binocular laparoscopy with stereo matching enables 3D reconstruction. Stereo matching models used for natural images such as autopilot tend to be less suitable for laparoscopic environments due to the constraints of small samples of laparoscopic images, complex textures, and uneven illumination. In addition, current stereo matching modules use 3D convolutions and transformers in the spatial domain as the base module, which is limited by the ability to learn in the spatial domain. In this paper, we propose a model for laparoscopic stereo matching using 3D Fourier Transform combined with Full Multi-scale Features (FT-FMF Net). Specifically, the proposed Full Multi-scale Fusion Module (FMFM) is able to fuse the full multi-scale feature information from the feature extractor into the stereo matching block, which densely learns the feature information with parallax and FMFM fusion information in the frequency domain using the proposed Dense Fourier Transform Module (DFTM). We validated the proposed method in both the laparoscopic dataset (SCARED) and the endoscopic dataset (SERV-CT). In comparison with other popular and advanced deep learning models available at present, FT-FMF Net achieves the most advanced stereo matching performance available. In the SCARED and SERV-CT public datasets, the End-Point-Error (EPE) was 0.7265 and 2.3119, and the Root Mean Square Error Depth (RMSE Depth) was 4.00 mm and 3.69 mm, respectively. In addition, the inference time is only 0.17s. Our project code is available on https://github.com/wurenkai/FT-FMF.
腹腔镜手术场景的三维(3D)重建是未来手术导航和自动机器人微创手术的关键任务。带有立体匹配功能的双目腹腔镜可实现三维重建。由于腹腔镜图像样本小、纹理复杂、光照不均等限制,用于自动驾驶等自然图像的立体匹配模型往往不太适合腹腔镜环境。此外,目前的立体匹配模块使用空间域的三维卷积和变换器作为基础模块,这就限制了空间域的学习能力。在本文中,我们提出了一种使用三维傅立叶变换结合全多尺度特征(FT-FMF Net)的腹腔镜立体匹配模型。具体来说,所提出的全多尺度融合模块(FMFM)能够将特征提取器中的全多尺度特征信息融合到立体匹配模块中,该模块利用所提出的密集傅立叶变换模块(DFTM)在频域中密集学习带有视差和 FMFM 融合信息的特征信息。我们在腹腔镜数据集(SCARED)和内窥镜数据集(SERV-CT)中验证了所提出的方法。与目前其他流行的高级深度学习模型相比,FT-FMF Net 实现了目前最先进的立体匹配性能。在 SCARED 和 SERV-CT 公共数据集中,端点误差(EPE)分别为 0.7265 和 2.3119,均方根误差深度(RMSE Depth)分别为 4.00 毫米和 3.69 毫米。此外,推理时间仅为 0.17 秒。我们的项目代码可在 https://github.com/wurenkai/FT-FMF 上获取。
{"title":"Laparoscopic stereo matching using 3-Dimensional Fourier transform with full multi-scale features","authors":"Renkai Wu ,&nbsp;Pengchen Liang ,&nbsp;Yinghao Liu ,&nbsp;Yiqi Huang ,&nbsp;Wangyan Li ,&nbsp;Qing Chang","doi":"10.1016/j.engappai.2024.109654","DOIUrl":"10.1016/j.engappai.2024.109654","url":null,"abstract":"<div><div><strong>3</strong>-<strong>D</strong>imensional (3D) reconstruction of laparoscopic surgical scenes is a key task for future surgical navigation and automated robotic minimally invasive surgery. Binocular laparoscopy with stereo matching enables 3D reconstruction. Stereo matching models used for natural images such as autopilot tend to be less suitable for laparoscopic environments due to the constraints of small samples of laparoscopic images, complex textures, and uneven illumination. In addition, current stereo matching modules use 3D convolutions and transformers in the spatial domain as the base module, which is limited by the ability to learn in the spatial domain. In this paper, we propose a model for laparoscopic stereo matching using 3D <strong>F</strong>ourier <strong>T</strong>ransform combined with <strong>F</strong>ull <strong>M</strong>ulti-scale <strong>F</strong>eatures (FT-FMF Net). Specifically, the proposed <strong>F</strong>ull <strong>M</strong>ulti-scale <strong>F</strong>usion <strong>M</strong>odule (FMFM) is able to fuse the full multi-scale feature information from the feature extractor into the stereo matching block, which densely learns the feature information with parallax and FMFM fusion information in the frequency domain using the proposed <strong>D</strong>ense <strong>F</strong>ourier <strong>T</strong>ransform <strong>M</strong>odule (DFTM). We validated the proposed method in both the laparoscopic dataset (SCARED) and the endoscopic dataset (SERV-CT). In comparison with other popular and advanced deep learning models available at present, FT-FMF Net achieves the most advanced stereo matching performance available. In the SCARED and SERV-CT public datasets, the End-Point-Error (EPE) was 0.7265 and 2.3119, and the <strong>R</strong>oot <strong>M</strong>ean <strong>S</strong>quare <strong>E</strong>rror Depth (RMSE Depth) was 4.00 mm and 3.69 mm, respectively. In addition, the inference time is only 0.17s. Our project code is available on <span><span>https://github.com/wurenkai/FT-FMF</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109654"},"PeriodicalIF":7.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142658989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parallax-aware dual-view feature enhancement and adaptive detail compensation for dual-pixel defocus deblurring 视差感知双视角特征增强和自适应细节补偿,用于双像素失焦去模糊
IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-15 DOI: 10.1016/j.engappai.2024.109612
Yuzhen Niu , Yuqi He , Rui Xu , Yuezhou Li , Yuzhong Chen
Defocus deblurring using dual-pixel sensors has gathered significant attention in recent years. However, current methodologies have not adequately addressed the challenge of defocus disparity between dual views, resulting in suboptimal performance in recovering details from severely defocused pixels. To counteract this limitation, we introduce in this paper a parallax-aware dual-view feature enhancement and adaptive detail compensation network (PA-Net), specifically tailored for dual-pixel defocus deblurring task. Our proposed PA-Net leverages an encoder–decoder architecture augmented with skip connections, designed to initially extract distinct features from the left and right views. A pivotal aspect of our model lies at the network’s bottleneck, where we introduce a parallax-aware dual-view feature enhancement based on Transformer blocks, which aims to align and enhance extracted dual-pixel features, aggregating them into a unified feature. Furthermore, taking into account the disparity and the rich details embedded in encoder features, we design an adaptive detail compensation module to adaptively incorporate dual-view encoder features into image reconstruction, aiding in restoring image details. Experimental results demonstrate that our proposed PA-Net exhibits superior performance and visual effects on the real-world dataset.
近年来,使用双像素传感器进行散焦去模糊技术受到了广泛关注。然而,目前的方法并没有充分解决双视图之间的散焦差异问题,导致从严重散焦的像素中恢复细节的性能不理想。为了克服这一局限性,我们在本文中介绍了一种视差感知双视角特征增强和自适应细节补偿网络(PA-Net),它是专门为双像素散焦去模糊任务定制的。我们提出的 PA-Net 采用编码器-解码器架构,并增加了跳转连接,旨在从左右视图中初步提取不同的特征。我们模型的一个关键方面在于网络的瓶颈,我们在此引入了基于变换器块的视差感知双视角特征增强,旨在对齐和增强提取的双像素特征,将它们聚合成一个统一的特征。此外,考虑到差距和编码器特征中蕴含的丰富细节,我们设计了一个自适应细节补偿模块,将双视角编码器特征自适应地纳入图像重建,帮助恢复图像细节。实验结果表明,我们提出的 PA-Net 在实际数据集上表现出了卓越的性能和视觉效果。
{"title":"Parallax-aware dual-view feature enhancement and adaptive detail compensation for dual-pixel defocus deblurring","authors":"Yuzhen Niu ,&nbsp;Yuqi He ,&nbsp;Rui Xu ,&nbsp;Yuezhou Li ,&nbsp;Yuzhong Chen","doi":"10.1016/j.engappai.2024.109612","DOIUrl":"10.1016/j.engappai.2024.109612","url":null,"abstract":"<div><div>Defocus deblurring using dual-pixel sensors has gathered significant attention in recent years. However, current methodologies have not adequately addressed the challenge of defocus disparity between dual views, resulting in suboptimal performance in recovering details from severely defocused pixels. To counteract this limitation, we introduce in this paper a parallax-aware dual-view feature enhancement and adaptive detail compensation network (PA-Net), specifically tailored for dual-pixel defocus deblurring task. Our proposed PA-Net leverages an encoder–decoder architecture augmented with skip connections, designed to initially extract distinct features from the left and right views. A pivotal aspect of our model lies at the network’s bottleneck, where we introduce a parallax-aware dual-view feature enhancement based on Transformer blocks, which aims to align and enhance extracted dual-pixel features, aggregating them into a unified feature. Furthermore, taking into account the disparity and the rich details embedded in encoder features, we design an adaptive detail compensation module to adaptively incorporate dual-view encoder features into image reconstruction, aiding in restoring image details. Experimental results demonstrate that our proposed PA-Net exhibits superior performance and visual effects on the real-world dataset.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109612"},"PeriodicalIF":7.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SenticNet and Abstract Meaning Representation driven Attention-Gate semantic framework for aspect sentiment triplet extraction SenticNet 和抽象意义表征驱动的注意门语义框架用于方面情感三元组提取
IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-15 DOI: 10.1016/j.engappai.2024.109625
Xiaowen Sun, Jiangtao Qi, Zhenfang Zhu, Meng Li, Hongli Pei, Jing Meng
Aspect sentiment triplet extraction aims to analyze aspect-level sentiment in the form of triplets, including extracting aspect-opinion pairs and predicting the sentiment polarities of these pairs. Many recent works rely on syntactic information (e.g. part-of-speech and syntactic dependency relation) to handle this semantic task, which ignores uncommon part-of-speech items and matches semantically unrelated words. To overcome these drawbacks, we propose a SenticNet and Abstract Meaning Representation (AMR) driven Attention-Gate semantic framework (SAAG), which introduces semantic sentiment knowledge SenticNet and semantic structure AMR as semantic information to replace syntactic information. To highlight the affective meanings in words, an affective-driven attention mechanism is designed to emphasizes sentiment intent within word representations. To match semantically related words, the designed AMR-driven gate mechanism balances the word pair expressions under varying semantic contexts. Extensive experiments on two public datasets demonstrate the effectiveness of our approach.
方面情感三连抽取旨在分析三连形式的方面级情感,包括抽取方面-观点对和预测这些对的情感极性。最近的许多研究都依赖句法信息(如语音部分和句法依赖关系)来处理这一语义任务,这就忽略了不常见的语音部分项,并匹配语义上不相关的词。为了克服这些缺点,我们提出了一种由 SenticNet 和抽象意义表示(AMR)驱动的注意门语义框架(SAAG),它引入了语义情感知识 SenticNet 和语义结构 AMR 作为语义信息来替代句法信息。为了突出词语中的情感含义,设计了情感驱动的注意机制,以强调词语表征中的情感意图。为了匹配语义相关的词语,所设计的 AMR 驱动门机制可在不同语义语境下平衡词对表达。在两个公开数据集上进行的广泛实验证明了我们方法的有效性。
{"title":"SenticNet and Abstract Meaning Representation driven Attention-Gate semantic framework for aspect sentiment triplet extraction","authors":"Xiaowen Sun,&nbsp;Jiangtao Qi,&nbsp;Zhenfang Zhu,&nbsp;Meng Li,&nbsp;Hongli Pei,&nbsp;Jing Meng","doi":"10.1016/j.engappai.2024.109625","DOIUrl":"10.1016/j.engappai.2024.109625","url":null,"abstract":"<div><div>Aspect sentiment triplet extraction aims to analyze aspect-level sentiment in the form of triplets, including extracting aspect-opinion pairs and predicting the sentiment polarities of these pairs. Many recent works rely on syntactic information (e.g. part-of-speech and syntactic dependency relation) to handle this semantic task, which ignores uncommon part-of-speech items and matches semantically unrelated words. To overcome these drawbacks, we propose a SenticNet and Abstract Meaning Representation (AMR) driven Attention-Gate semantic framework (SAAG), which introduces semantic sentiment knowledge SenticNet and semantic structure AMR as semantic information to replace syntactic information. To highlight the affective meanings in words, an affective-driven attention mechanism is designed to emphasizes sentiment intent within word representations. To match semantically related words, the designed AMR-driven gate mechanism balances the word pair expressions under varying semantic contexts. Extensive experiments on two public datasets demonstrate the effectiveness of our approach.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109625"},"PeriodicalIF":7.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph regularized discriminative nonnegative matrix factorization 图形正则化鉴别非负矩阵因式分解
IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-15 DOI: 10.1016/j.engappai.2024.109629
Zhonghua Liu , Fa Zhu , Hao Xiong , Xingchi Chen , Danilo Pelusi , Athanasios V. Vasilakos
It is well known that both the label information and the local geometry structure information are very important for image data clustering and classification. However, nonnegative matrix factorization (NMF) and its variants do not fully utilize the information or only use one of them. This paper presents a graph regularized discriminative nonnegative matrix factorization (GDNMF) for image data clustering, in which the local geometrical structure and label information of the observed samples are thoroughly considered. In the objective function of NMF, two constraint terms are added to preserve the above information. One is a sparse graph, which is adaptively constructed to obtain the local geometrical structure information. The other is data label information, which is used to capture discriminative information of the original data. By using local and label information, the proposed regularized discriminative nonnegative matrix factorization indeed improves the discrimination power of matrix decomposition. In addition, the F-norm formulation based cost function of regularized discriminative nonnegative matrix factorization is given, and the update rules for the optimization function of regularized discriminative nonnegative matrix factorization are proved. The experiment results on several public image datasets demonstrate the effectiveness of GDNMF algorithm. The innovation of this paper lies in extending unsupervised NMF to semi-supervised case and adaptively capturing the local structure of data based on sparse graph. However, the proposed method does not take into account the challenges of multiview data processing.
众所周知,标签信息和局部几何结构信息对于图像数据的聚类和分类非常重要。然而,非负矩阵因式分解(NMF)及其变体并不能充分利用这些信息,或者只能利用其中之一。本文提出了一种用于图像数据聚类的图正则化判别非负矩阵因式分解(GDNMF),其中充分考虑了观察样本的局部几何结构和标签信息。在 NMF 的目标函数中,添加了两个约束项以保留上述信息。一个是稀疏图,通过自适应构建来获取局部几何结构信息。另一个是数据标签信息,用于捕捉原始数据的判别信息。通过使用局部信息和标签信息,所提出的正则化判别非负矩阵因式分解确实提高了矩阵分解的判别能力。此外,还给出了正则化判别非负矩阵因式分解的基于 F 准则公式的代价函数,并证明了正则化判别非负矩阵因式分解优化函数的更新规则。在多个公共图像数据集上的实验结果证明了 GDNMF 算法的有效性。本文的创新之处在于将无监督 NMF 扩展到半监督情况,并基于稀疏图自适应地捕捉数据的局部结构。然而,所提出的方法并没有考虑到多视图数据处理所面临的挑战。
{"title":"Graph regularized discriminative nonnegative matrix factorization","authors":"Zhonghua Liu ,&nbsp;Fa Zhu ,&nbsp;Hao Xiong ,&nbsp;Xingchi Chen ,&nbsp;Danilo Pelusi ,&nbsp;Athanasios V. Vasilakos","doi":"10.1016/j.engappai.2024.109629","DOIUrl":"10.1016/j.engappai.2024.109629","url":null,"abstract":"<div><div>It is well known that both the label information and the local geometry structure information are very important for image data clustering and classification. However, nonnegative matrix factorization (NMF) and its variants do not fully utilize the information or only use one of them. This paper presents a graph regularized discriminative nonnegative matrix factorization (GDNMF) for image data clustering, in which the local geometrical structure and label information of the observed samples are thoroughly considered. In the objective function of NMF, two constraint terms are added to preserve the above information. One is a sparse graph, which is adaptively constructed to obtain the local geometrical structure information. The other is data label information, which is used to capture discriminative information of the original data. By using local and label information, the proposed regularized discriminative nonnegative matrix factorization indeed improves the discrimination power of matrix decomposition. In addition, the <em>F</em>-norm formulation based cost function of regularized discriminative nonnegative matrix factorization is given, and the update rules for the optimization function of regularized discriminative nonnegative matrix factorization are proved. The experiment results on several public image datasets demonstrate the effectiveness of GDNMF algorithm. The innovation of this paper lies in extending unsupervised NMF to semi-supervised case and adaptively capturing the local structure of data based on sparse graph. However, the proposed method does not take into account the challenges of multiview data processing.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109629"},"PeriodicalIF":7.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Engineering Applications of Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1