首页 > 最新文献

International Journal of Intelligent Systems最新文献

英文 中文
Optimization Enabled Online Tiger-Claw Fuzzy Region With Clustering Based Neovascularization Segmentation and Classification Using YOLO-V5 From Retinal Fundus Images 基于YOLO-V5优化的基于聚类的在线虎爪模糊区域视网膜眼底图像新生血管分割与分类
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-17 DOI: 10.1155/int/6119924
M. Kathiravan, Ashwini A., Balasubramaniam S., T. D. Subha, Gururama Senthilvel P., Sivakumar T. A.

The pathological development of abnormal blood vessels results in neovascularization as a major vision-threatening condition of diabetic retinopathy. The main factor behind pathological vessel growth results from retinal capillary depletion of oxygen that causes abnormal vascular development patterns. Early detection of these fundus image abnormalities requires precision because it enables ophthalmologists to provide effective treatment and make proper diagnoses. A multiple-step image processing system treats this problem. A fusion-based contrast enhancement method begins the process of enhancing diabetic retinopathy fundus image brightness and contrast. After the initial process, the system applies detail weighted histogram equalization to the green channel for better structural detail visualization. In the second stage, the proposed online tiger-claw algorithm segments abnormal neovascularization from normal blood vessels. Next, the combination of fuzzy zone-based clustering with optimization and classifier thresholding performs local identification along with highlight generation for neovascularized areas. Neovascularization detection makes use of a YOLOv5 neural network in the third stage through feature extraction and classification operations. A refined segmentation process occurs with the application of multistage gray wolf optimization. The proposed algorithm underwent testing through its application to the public datasets STARE, DRIVE, MESSIDOR, and DIARETDB1. Experimental tests indicate that the neovascularization region marking performed with 98.19% sensitivity and 96.56% specificity while reaching 99.27% accuracy. The proposed approach demonstrates 97.03% accuracy and 98.94% sensitivity, together with 97.17% specificity in neovascularization detection.

异常血管的病理发展导致新生血管形成是糖尿病视网膜病变的主要视力威胁条件。病理血管生长背后的主要因素是视网膜毛细血管缺氧,导致血管发育模式异常。早期发现这些眼底图像异常需要精确,因为它使眼科医生能够提供有效的治疗和做出正确的诊断。多步图像处理系统解决了这个问题。一种基于融合的对比度增强方法开始了增强糖尿病视网膜病变眼底图像亮度和对比度的过程。经过初始处理后,系统对绿色通道进行细节加权直方图均衡化,以获得更好的结构细节可视化效果。第二阶段,提出的在线虎爪算法将异常新生血管从正常血管中分割出来。接下来,基于模糊区域的聚类与优化和分类器阈值相结合,对新血管化区域进行局部识别和高光生成。新生血管检测在第三阶段通过特征提取和分类操作使用YOLOv5神经网络。应用多阶段灰狼优化,实现了精细的分割过程。本文提出的算法通过对公共数据集STARE、DRIVE、MESSIDOR和DIARETDB1的应用进行了测试。实验结果表明,新血管区标记的灵敏度为98.19%,特异性为96.56%,准确率为99.27%。该方法在新生血管检测中准确率为97.03%,灵敏度为98.94%,特异性为97.17%。
{"title":"Optimization Enabled Online Tiger-Claw Fuzzy Region With Clustering Based Neovascularization Segmentation and Classification Using YOLO-V5 From Retinal Fundus Images","authors":"M. Kathiravan,&nbsp;Ashwini A.,&nbsp;Balasubramaniam S.,&nbsp;T. D. Subha,&nbsp;Gururama Senthilvel P.,&nbsp;Sivakumar T. A.","doi":"10.1155/int/6119924","DOIUrl":"https://doi.org/10.1155/int/6119924","url":null,"abstract":"<p>The pathological development of abnormal blood vessels results in neovascularization as a major vision-threatening condition of diabetic retinopathy. The main factor behind pathological vessel growth results from retinal capillary depletion of oxygen that causes abnormal vascular development patterns. Early detection of these fundus image abnormalities requires precision because it enables ophthalmologists to provide effective treatment and make proper diagnoses. A multiple-step image processing system treats this problem. A fusion-based contrast enhancement method begins the process of enhancing diabetic retinopathy fundus image brightness and contrast. After the initial process, the system applies detail weighted histogram equalization to the green channel for better structural detail visualization. In the second stage, the proposed online tiger-claw algorithm segments abnormal neovascularization from normal blood vessels. Next, the combination of fuzzy zone-based clustering with optimization and classifier thresholding performs local identification along with highlight generation for neovascularized areas. Neovascularization detection makes use of a YOLOv5 neural network in the third stage through feature extraction and classification operations. A refined segmentation process occurs with the application of multistage gray wolf optimization. The proposed algorithm underwent testing through its application to the public datasets STARE, DRIVE, MESSIDOR, and DIARETDB1. Experimental tests indicate that the neovascularization region marking performed with 98.19% sensitivity and 96.56% specificity while reaching 99.27% accuracy. The proposed approach demonstrates 97.03% accuracy and 98.94% sensitivity, together with 97.17% specificity in neovascularization detection.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/6119924","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145572459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Energy-Aware Regression in Spiking Neural Networks for Autonomous Driving: A Comparative Study With Convolutional Networks 用于自动驾驶的脉冲神经网络能量感知回归:与卷积网络的比较研究
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-14 DOI: 10.1155/int/4879993
Fernando Sevilla Martínez, Jordi Casas-Roma, Laia Subirats, Raúl Parada

As autonomous driving (AD) systems grow more complex, their rising computational demands pose significant energy and sustainability challenges. This paper investigates spiking neural networks (SNNs) as low-power alternatives to convolutional neural networks (CNNs) for regression tasks in AD. We introduce a membrane-potential (Vmem) decoding framework that converts binary spike trains into continuous outputs and propose the energy-to-error ratio (EER), a unified metric combining prediction error with energy consumption. Three CNN architectures (PilotNet, LaksNet, and MiniNet) and their corresponding SNN variants are trained and evaluated using delta, latency, and rate encoding across varied parameter settings, with energy use and emissions logged. Delta-encoded SNNs achieve the highest EER, substantial energy savings with minimal performance loss, whereas CNNs, despite slightly better MSE, incur 10–20 × higher energy costs. Rate encoding underperforms, and latency encoding, though improving relative error, demands excessive energy. Parameter tuning (threshold θ, temporal dynamics (S), membrane time constant (τ), and gain G) directly influences eco-efficiency. All experiments run on standard GPUs, showing SNNs can surpass CNNs in eco-efficiency without specialized hardware. Paired statistical tests confirm that only delta-encoded SNNs achieve significant EER improvements. This work presents a practical, energy-aware evaluation framework for neural architectures, establishing EER as a critical metric for sustainable machine learning in intelligent transport and beyond.

随着自动驾驶(AD)系统变得越来越复杂,其不断增长的计算需求带来了重大的能源和可持续性挑战。本文研究了尖峰神经网络(SNNs)作为卷积神经网络(cnn)的低功耗替代品,用于AD中的回归任务。我们引入了一个膜电位(Vmem)解码框架,将二进制尖峰串转换为连续输出,并提出了能量误差率(EER),这是一个结合预测误差和能量消耗的统一度量。三种CNN架构(PilotNet, LaksNet和MiniNet)及其相应的SNN变体使用delta,延迟和速率编码在不同参数设置下进行训练和评估,并记录能源使用和排放。delta编码snn实现了最高的EER,以最小的性能损失节省了大量的能源,而cnn尽管MSE稍好,但会产生10-20倍的能源成本。速率编码性能不佳,延迟编码虽然可以改善相对误差,但需要过多的能量。参数调整(阈值θ、时间动态(S)、膜时间常数(τ)和增益G)直接影响生态效率。所有实验都在标准gpu上运行,表明SNNs在没有专门硬件的情况下可以超越cnn的生态效率。配对统计检验证实,只有delta编码的snn实现了显著的EER改进。这项工作提出了一个实用的、能源意识的神经架构评估框架,将EER作为智能交通及其他领域可持续机器学习的关键指标。
{"title":"Energy-Aware Regression in Spiking Neural Networks for Autonomous Driving: A Comparative Study With Convolutional Networks","authors":"Fernando Sevilla Martínez,&nbsp;Jordi Casas-Roma,&nbsp;Laia Subirats,&nbsp;Raúl Parada","doi":"10.1155/int/4879993","DOIUrl":"https://doi.org/10.1155/int/4879993","url":null,"abstract":"<p>As autonomous driving (AD) systems grow more complex, their rising computational demands pose significant energy and sustainability challenges. This paper investigates spiking neural networks (SNNs) as low-power alternatives to convolutional neural networks (CNNs) for regression tasks in AD. We introduce a membrane-potential (<i>V</i><sub>mem</sub>) decoding framework that converts binary spike trains into continuous outputs and propose the energy-to-error ratio (EER), a unified metric combining prediction error with energy consumption. Three CNN architectures (PilotNet, LaksNet, and MiniNet) and their corresponding SNN variants are trained and evaluated using delta, latency, and rate encoding across varied parameter settings, with energy use and emissions logged. Delta-encoded SNNs achieve the highest EER, substantial energy savings with minimal performance loss, whereas CNNs, despite slightly better MSE, incur 10–20 × higher energy costs. Rate encoding underperforms, and latency encoding, though improving relative error, demands excessive energy. Parameter tuning (threshold <i>θ</i>, temporal dynamics (<i>S</i>), membrane time constant (<i>τ</i>), and gain <i>G</i>) directly influences eco-efficiency. All experiments run on standard GPUs, showing SNNs can surpass CNNs in eco-efficiency without specialized hardware. Paired statistical tests confirm that only delta-encoded SNNs achieve significant EER improvements. This work presents a practical, energy-aware evaluation framework for neural architectures, establishing EER as a critical metric for sustainable machine learning in intelligent transport and beyond.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/4879993","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145522267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge Distillation in Federated Learning: A Survey on Long Lasting Challenges and New Solutions 联邦学习中的知识提炼:长期挑战与新解决方案综述
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-13 DOI: 10.1155/int/7406934
Laiqiao Qin, Tianqing Zhu, Wanlei Zhou, Philip S. Yu

Federated learning (FL) is a distributed and privacy-preserving machine learning paradigm that coordinates multiple clients to train a model while keeping the raw data localized. However, this traditional FL poses some challenges, including privacy risks, data heterogeneity, communication bottlenecks, and system heterogeneity issues. To tackle these challenges, knowledge distillation (KD) has been widely applied in FL since 2020. KD is a validated and efficacious model compression and enhancement algorithm. The core concept of KD involves facilitating knowledge transfer between models by exchanging logits at intermediate or output layers. These properties make KD an excellent solution for the long-lasting challenges in FL. Up to now, there have been few reviews that summarize and analyze the current trend and methods for how KD can be applied in FL efficiently. This article aims to provide a comprehensive survey of KD-based FL, focusing on addressing the above challenges. First, we provide an overview of KD-based FL, including its motivation, basics, taxonomy, and a comparison with traditional FL and where KD should execute. We also analyze the critical factors in KD-based FL in the Appendix, including teachers, knowledge, data, and methods. We discuss how KD can address the challenges in FL, including privacy protection, data heterogeneity, communication efficiency, and personalization. Finally, we discuss the challenges facing KD-based FL algorithms and future research directions. We hope this survey can provide insights and guidance for researchers and practitioners in the FL area.

联邦学习(FL)是一种分布式和保护隐私的机器学习范例,它协调多个客户端来训练模型,同时保持原始数据的本地化。然而,这种传统的FL带来了一些挑战,包括隐私风险、数据异构、通信瓶颈和系统异构问题。为了应对这些挑战,自2020年以来,知识蒸馏(KD)在FL中得到了广泛应用。KD是一种经过验证的有效的模型压缩和增强算法。KD的核心概念包括通过在中间层或输出层交换逻辑来促进模型之间的知识转移。这些特性使KD成为FL长期挑战的极好解决方案。到目前为止,很少有综述总结和分析KD如何有效应用于FL的当前趋势和方法。本文旨在提供基于kd的FL的全面调查,重点是解决上述挑战。首先,我们概述了基于KD的FL,包括其动机,基础知识,分类法,以及与传统FL的比较,以及KD应该在哪里执行。我们还在附录中分析了基于kd的外语教学的关键因素,包括教师、知识、数据和方法。我们讨论了KD如何解决FL中的挑战,包括隐私保护、数据异构、通信效率和个性化。最后,我们讨论了基于kd的FL算法面临的挑战和未来的研究方向。我们希望这项调查能够为FL领域的研究人员和从业者提供见解和指导。
{"title":"Knowledge Distillation in Federated Learning: A Survey on Long Lasting Challenges and New Solutions","authors":"Laiqiao Qin,&nbsp;Tianqing Zhu,&nbsp;Wanlei Zhou,&nbsp;Philip S. Yu","doi":"10.1155/int/7406934","DOIUrl":"https://doi.org/10.1155/int/7406934","url":null,"abstract":"<p>Federated learning (FL) is a distributed and privacy-preserving machine learning paradigm that coordinates multiple clients to train a model while keeping the raw data localized. However, this traditional FL poses some challenges, including privacy risks, data heterogeneity, communication bottlenecks, and system heterogeneity issues. To tackle these challenges, knowledge distillation (KD) has been widely applied in FL since 2020. KD is a validated and efficacious model compression and enhancement algorithm. The core concept of KD involves facilitating knowledge transfer between models by exchanging logits at intermediate or output layers. These properties make KD an excellent solution for the long-lasting challenges in FL. Up to now, there have been few reviews that summarize and analyze the current trend and methods for how KD can be applied in FL efficiently. This article aims to provide a comprehensive survey of KD-based FL, focusing on addressing the above challenges. First, we provide an overview of KD-based FL, including its motivation, basics, taxonomy, and a comparison with traditional FL and where KD should execute. We also analyze the critical factors in KD-based FL in the Appendix, including teachers, knowledge, data, and methods. We discuss how KD can address the challenges in FL, including privacy protection, data heterogeneity, communication efficiency, and personalization. Finally, we discuss the challenges facing KD-based FL algorithms and future research directions. We hope this survey can provide insights and guidance for researchers and practitioners in the FL area.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/7406934","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Noncontact Fault Diagnosis of Electrical Equipment Using Modified Multiscale Two-Dimensional Color Distribution Entropy and Thermal Imaging 基于改进多尺度二维颜色分布熵和热成像的电气设备非接触故障诊断
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-12 DOI: 10.1155/int/4805844
Shun Wang, Yolanda Vidal, Francesc Pozo

Effective health monitoring of electrical equipment is critical for industrial reliability. Although infrared thermal imaging offers a powerful noncontact diagnostic method, accurately interpreting its complex and often noisy thermal patterns remains a significant challenge. Entropy-based analysis is well suited for quantifying this complexity, but its application to images has been limited. Existing two-dimensional entropy methods are not only less developed than their one-dimensional counterparts but also typically require converting thermal images to grayscale, which discards vital diagnostic information from color channels. To overcome these limitations, this study introduces the modified multiscale two-dimensional color distribution entropy (MMCDEn2D). This novel method directly integrates the attributes of the RGB, preserving a richer feature set for analysis. The effectiveness of the proposed method is demonstrated first through synthetic signals, showing low sensitivity to image size and high computational efficiency. The study further extends the application of entropy-based analysis to noncontact health monitoring scenarios, implementing MMCDEn2D for thermal image-based fault diagnosis of induction motors and power transformers. The method achieves a diagnostic accuracy that exceeds 95%, significantly outperforming traditional approaches. Crucially, it demonstrates superior robustness in challenging scenarios, improving accuracy by 2%–5% under high-noise conditions and with small sample sizes. These results establish MMCDEn2D as a highly effective and reliable tool to advance noncontact fault diagnosis in critical electrical equipment.

有效的电气设备健康监测对工业可靠性至关重要。尽管红外热成像提供了一种强大的非接触式诊断方法,但准确解释其复杂且经常嘈杂的热模式仍然是一个重大挑战。基于熵的分析非常适合量化这种复杂性,但它在图像上的应用受到限制。现有的二维熵方法不仅不如一维熵方法发达,而且通常需要将热图像转换为灰度,从而丢弃了来自颜色通道的重要诊断信息。为了克服这些局限性,本研究引入了改进的多尺度二维颜色分布熵(MMCDEn2D)。该方法直接集成了RGB的属性,为分析保留了更丰富的特征集。首先通过合成信号验证了该方法的有效性,该方法对图像大小的敏感性低,计算效率高。本研究将基于熵的分析方法进一步扩展到非接触健康监测场景,实现了基于MMCDEn2D的感应电机和电力变压器热图像故障诊断。该方法的诊断准确率超过95%,显著优于传统方法。至关重要的是,它在具有挑战性的场景中表现出了卓越的鲁棒性,在高噪声条件下和小样本量下,准确率提高了2%-5%。这些结果表明,MMCDEn2D是一种非常有效和可靠的工具,可以推进关键电气设备的非接触故障诊断。
{"title":"Noncontact Fault Diagnosis of Electrical Equipment Using Modified Multiscale Two-Dimensional Color Distribution Entropy and Thermal Imaging","authors":"Shun Wang,&nbsp;Yolanda Vidal,&nbsp;Francesc Pozo","doi":"10.1155/int/4805844","DOIUrl":"https://doi.org/10.1155/int/4805844","url":null,"abstract":"<p>Effective health monitoring of electrical equipment is critical for industrial reliability. Although infrared thermal imaging offers a powerful noncontact diagnostic method, accurately interpreting its complex and often noisy thermal patterns remains a significant challenge. Entropy-based analysis is well suited for quantifying this complexity, but its application to images has been limited. Existing two-dimensional entropy methods are not only less developed than their one-dimensional counterparts but also typically require converting thermal images to grayscale, which discards vital diagnostic information from color channels. To overcome these limitations, this study introduces the modified multiscale two-dimensional color distribution entropy (MMCDEn<sub>2D</sub>). This novel method directly integrates the attributes of the RGB, preserving a richer feature set for analysis. The effectiveness of the proposed method is demonstrated first through synthetic signals, showing low sensitivity to image size and high computational efficiency. The study further extends the application of entropy-based analysis to noncontact health monitoring scenarios, implementing MMCDEn<sub>2D</sub> for thermal image-based fault diagnosis of induction motors and power transformers. The method achieves a diagnostic accuracy that exceeds 95%, significantly outperforming traditional approaches. Crucially, it demonstrates superior robustness in challenging scenarios, improving accuracy by 2%–5% under high-noise conditions and with small sample sizes. These results establish MMCDEn<sub>2D</sub> as a highly effective and reliable tool to advance noncontact fault diagnosis in critical electrical equipment.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/4805844","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DiffG-MTL: A Dynamic Multidiffusion Graph Network for Multitask Traffic Accident Prediction 多任务交通事故预测的动态多扩散图网络
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-11 DOI: 10.1155/int/8995422
Nana Bu, Zongtao Duan, Wen Dang

Traffic accident prediction serves as a cornerstone of intelligent transportation systems, enabling proactive city-wide control strategies and public safety interventions. Effective models must capture the evolving spatiotemporal propagation of risk while addressing heterogeneous data distributions across urban regions. Current approaches face significant limitations: fixed graph topologies fail to represent nonstationary accident patterns, while uniform task weighting leads to optimization bias toward data-rich areas, ultimately constraining adaptability in adjacency construction and multihop spatial reasoning. To address these challenges, we propose a dynamic multidiffusion graph network with multitask learning (DiffG-MTL) for city-scale accident prediction. Specifically, a dynamic diffusion adjacency generation (DDAG) module constructs time-varying, diffusion-based adjacency matrices through multiple propagation pathways. A multiscale graph structure learning (MGSL) module captures multihop spatial relationships and temporal cues, while effectively highlighting anomalous traffic behaviors. To alleviate regional data imbalance, we introduce a dynamic multitask learning objective that adaptively redistributes learning focus using recall-aware weighting and task-level normalization. Comprehensive evaluations on six widely used datasets demonstrate that DiffG-MTL consistently outperforms state-of-the-art baselines across multiple evaluation metrics. Additional experiments validate its robustness and effectiveness in modeling complex spatiotemporal accident patterns.

交通事故预测是智能交通系统的基石,可以实现城市范围内的主动控制策略和公共安全干预。有效的模型必须捕捉风险的时空传播,同时处理城市区域间的异构数据分布。目前的方法面临着明显的局限性:固定的图拓扑不能表示非平稳的事故模式,而统一的任务加权导致优化偏向于数据丰富的区域,最终限制了邻接构建和多跳空间推理的适应性。为了解决这些挑战,我们提出了一个具有多任务学习的动态多扩散图网络(DiffG-MTL)用于城市规模的事故预测。具体来说,动态扩散邻接生成(DDAG)模块通过多种传播途径构建时变的、基于扩散的邻接矩阵。多尺度图结构学习(MGSL)模块捕获多跳空间关系和时间线索,同时有效地突出异常交通行为。为了缓解区域数据不平衡,我们引入了一个动态多任务学习目标,该目标使用回忆感知加权和任务级归一化自适应地重新分配学习焦点。对六个广泛使用的数据集的综合评估表明,DiffG-MTL在多个评估指标上始终优于最先进的基线。实验验证了该方法对复杂时空事故模式建模的鲁棒性和有效性。
{"title":"DiffG-MTL: A Dynamic Multidiffusion Graph Network for Multitask Traffic Accident Prediction","authors":"Nana Bu,&nbsp;Zongtao Duan,&nbsp;Wen Dang","doi":"10.1155/int/8995422","DOIUrl":"https://doi.org/10.1155/int/8995422","url":null,"abstract":"<p>Traffic accident prediction serves as a cornerstone of intelligent transportation systems, enabling proactive city-wide control strategies and public safety interventions. Effective models must capture the evolving spatiotemporal propagation of risk while addressing heterogeneous data distributions across urban regions. Current approaches face significant limitations: fixed graph topologies fail to represent nonstationary accident patterns, while uniform task weighting leads to optimization bias toward data-rich areas, ultimately constraining adaptability in adjacency construction and multihop spatial reasoning. To address these challenges, we propose a dynamic multidiffusion graph network with multitask learning (DiffG-MTL) for city-scale accident prediction. Specifically, a dynamic diffusion adjacency generation (DDAG) module constructs time-varying, diffusion-based adjacency matrices through multiple propagation pathways. A multiscale graph structure learning (MGSL) module captures multihop spatial relationships and temporal cues, while effectively highlighting anomalous traffic behaviors. To alleviate regional data imbalance, we introduce a dynamic multitask learning objective that adaptively redistributes learning focus using recall-aware weighting and task-level normalization. Comprehensive evaluations on six widely used datasets demonstrate that DiffG-MTL consistently outperforms state-of-the-art baselines across multiple evaluation metrics. Additional experiments validate its robustness and effectiveness in modeling complex spatiotemporal accident patterns.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/8995422","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal Deep Learning for Predicting Cerebral Herniation Using Sagittal CT and Clinical Data 利用矢状位CT和临床数据预测脑疝的多模态深度学习
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-11 DOI: 10.1155/int/9369999
Like Ji, Fuxing Yang, Zicheng Xiong, Jun Qiu, Fang Zuo, Kefan Yi, Shengbo Chen, Wenying Chen, Kai Zhao, Ghulam Mohi-ud-din

Cerebral herniation is a life-threatening neurological emergency, where timely and accurate prediction is crucial for improving patient prognosis. Due to its rapid imaging advantages, CT becomes the preferred choice for cerebral herniation screening. With the continuous development of artificial intelligence technology in the field of neurological diseases, CT-based models provide significant support for computer-aided clinical diagnosis. However, current research on cerebral herniation diagnosis remains limited. Existing methods rely on traditional machine learning or focus solely on midline shift detection, which not only exhibits strong subjectivity but also neglects key structures such as the brainstem and the rich information from sagittal CT images. To address these limitations, this study focuses on mid-sagittal CT images including the brainstem and combines clinical data to construct a multimodal deep learning framework for cerebral herniation prediction. The model integrates mature and advanced deep learning architectures to extract and fuse features from CT images and clinical text data, employing multiscale convolution and attention mechanisms for diagnostic classification. The model is evaluated on datasets from two centers. Results show that on the internal test set, the model achieves accuracy, sensitivity, specificity, and AUC of 89%, 92%, 88%, and 0.94, respectively; on the external test set, it attains accuracy, sensitivity, specificity, and AUC of 81%, 82%, 80%, and 0.89, respectively, outperforming baseline methods and existing state-of-the-art approaches. Additionally, when compared with radiologists on the internal test set, the model’s performance matches or exceeds the consensus of physicians. We also reveal the model’s focus region through visual analysis, which further deepens the understanding of the model’s prediction process and enhances its interpretability. Experiments demonstrate that the proposed method holds significant potential in assisting cerebral herniation diagnosis.

脑疝是一种危及生命的神经急症,及时准确的预测对改善患者预后至关重要。由于其快速成像的优势,CT成为脑疝筛查的首选。随着人工智能技术在神经系统疾病领域的不断发展,基于ct的模型为计算机辅助临床诊断提供了重要支持。然而,目前对脑疝诊断的研究仍然有限。现有方法依赖于传统的机器学习或只关注中线偏移检测,主观性强,忽略了脑干等关键结构和矢状CT图像的丰富信息。为了解决这些局限性,本研究将重点放在包括脑干在内的中矢状位CT图像上,并结合临床数据构建脑疝预测的多模态深度学习框架。该模型集成了成熟和先进的深度学习架构,从CT图像和临床文本数据中提取和融合特征,采用多尺度卷积和注意机制进行诊断分类。该模型在两个中心的数据集上进行了评估。结果表明,在内部测试集上,该模型的准确率为89%,灵敏度为92%,特异度为88%,AUC为0.94;在外部测试集上,其准确性、灵敏度、特异性和AUC分别为81%、82%、80%和0.89,优于基线方法和现有的最先进方法。此外,当与内部测试集的放射科医生进行比较时,该模型的性能达到或超过了医生的共识。我们还通过可视化分析揭示了模型的焦点区域,进一步加深了对模型预测过程的理解,增强了模型的可解释性。实验表明,该方法在辅助脑疝诊断方面具有很大的潜力。
{"title":"Multimodal Deep Learning for Predicting Cerebral Herniation Using Sagittal CT and Clinical Data","authors":"Like Ji,&nbsp;Fuxing Yang,&nbsp;Zicheng Xiong,&nbsp;Jun Qiu,&nbsp;Fang Zuo,&nbsp;Kefan Yi,&nbsp;Shengbo Chen,&nbsp;Wenying Chen,&nbsp;Kai Zhao,&nbsp;Ghulam Mohi-ud-din","doi":"10.1155/int/9369999","DOIUrl":"https://doi.org/10.1155/int/9369999","url":null,"abstract":"<p>Cerebral herniation is a life-threatening neurological emergency, where timely and accurate prediction is crucial for improving patient prognosis. Due to its rapid imaging advantages, CT becomes the preferred choice for cerebral herniation screening. With the continuous development of artificial intelligence technology in the field of neurological diseases, CT-based models provide significant support for computer-aided clinical diagnosis. However, current research on cerebral herniation diagnosis remains limited. Existing methods rely on traditional machine learning or focus solely on midline shift detection, which not only exhibits strong subjectivity but also neglects key structures such as the brainstem and the rich information from sagittal CT images. To address these limitations, this study focuses on mid-sagittal CT images including the brainstem and combines clinical data to construct a multimodal deep learning framework for cerebral herniation prediction. The model integrates mature and advanced deep learning architectures to extract and fuse features from CT images and clinical text data, employing multiscale convolution and attention mechanisms for diagnostic classification. The model is evaluated on datasets from two centers. Results show that on the internal test set, the model achieves accuracy, sensitivity, specificity, and AUC of 89%, 92%, 88%, and 0.94, respectively; on the external test set, it attains accuracy, sensitivity, specificity, and AUC of 81%, 82%, 80%, and 0.89, respectively, outperforming baseline methods and existing state-of-the-art approaches. Additionally, when compared with radiologists on the internal test set, the model’s performance matches or exceeds the consensus of physicians. We also reveal the model’s focus region through visual analysis, which further deepens the understanding of the model’s prediction process and enhances its interpretability. Experiments demonstrate that the proposed method holds significant potential in assisting cerebral herniation diagnosis.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/9369999","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Systematic Review of Intelligent Agents, Language Models, and Recurrent Neural Networks in Industrial Maintenance: Driving Value Creation for the Mining Sector 工业维护中的智能代理、语言模型和递归神经网络的系统综述:推动矿业部门的价值创造
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-11 DOI: 10.1155/int/9953223
Luis Rojas, Beatriz Hernandez, José Garcia

This PRISMA 2020–compliant systematic review examines how intelligent agents, large language models (LLMs), and recurrent neural networks (RNNs) can be combined for industrial maintenance, with a sector-specific focus on mining. Scopus and Web of Science (2018–2025) were searched using replicable queries, and a dual text-representation pipeline (TF–IDF with bi/trigrams and sentence-transformer embeddings) was applied. Model selection scanned k over a predefined grid with internal indices (Silhouette, Davies–Bouldin, and Calinski–Harabasz), and robustness was assessed through multiseed stability, bootstrap consensus, representation-sensitivity checks, and a control run with HDBSCAN. Study quality and risk of bias were appraised with an AI-and-control–oriented matrix (ACE-QA). Two macroclusters emerged. The first centers on distributed control, consensus and formation, fault tolerance, observers, and learning-based designs (fuzzy/neural/RL), including finite/predefined-time and event/dynamic event–triggered mechanisms. The second addresses secure and resilient cooperation under cyber threats (DoS, deception, and FDIA), integrating observer-based estimation and communication-efficient protocols. Cross-cutting findings indicate that event-triggered updates reduce bandwidth and compute requirements, while robust estimation and fault-tolerant control improve availability under harsh conditions and intermittent networks—typical in mining. A maturity map suggests high technical readiness and growing adoption for RNN-based sensing analytics, advancing readiness but emerging adoption for multiagent coordination, and early adoption of LLMs for text-grounded maintenance intelligence. Evidence gaps persist in replicability, cross-site transfer, uncertainty reporting, and mining-grade validation at the edge. A design agenda is outlined that prioritizes digital-twin stress testing, edge-first evaluation of agent coordination, secure-by-design pipelines (authenticated/encrypted messaging and adversarial testing), and shift-aware validation. In sum, a hybrid stack—RNNs for perception, LLMs for knowledge grounding, and agents for coordinated action—offers a practical route to reliable, secure, and communication-efficient predictive maintenance in Mining 4.0.

这份符合PRISMA 2020标准的系统综述研究了智能代理、大型语言模型(llm)和循环神经网络(rnn)如何结合起来进行工业维护,并以特定行业为重点。使用可复制查询对Scopus和Web of Science(2018-2025)进行检索,并采用双文本表示管道(TF-IDF与bi/ triram和句子转换器嵌入)。模型选择扫描k在一个预定义的网格与内部指数(Silhouette, Davies-Bouldin和calinsky - harabasz),鲁棒性是通过多种子稳定性,引导共识,表示敏感性检查,并与HDBSCAN控制运行进行评估。采用人工智能和控制导向矩阵(ACE-QA)评价研究质量和偏倚风险。出现了两个宏观集群。第一个集中在分布式控制、共识和形成、容错、观察者和基于学习的设计(模糊/神经/强化学习),包括有限/预定义时间和事件/动态事件触发机制。第二部分涉及网络威胁(DoS、欺骗和FDIA)下的安全和弹性合作,整合基于观察者的估计和通信高效协议。横切研究结果表明,事件触发的更新减少了带宽和计算需求,而鲁棒估计和容错控制提高了恶劣条件和间歇性网络(采矿中典型的)下的可用性。成熟度图表明,基于rnn的传感分析技术成熟度高,采用度高,多智能体协调技术成熟度高,采用度低,基于文本的维护智能技术早期采用llm。证据差距持续存在于可复制性、跨站点转移、不确定性报告和边缘的采矿品位验证。概述了一个设计议程,优先考虑数字孪生压力测试、代理协调的边缘优先评估、设计安全管道(身份验证/加密消息传递和对抗性测试)和位移感知验证。总之,在Mining 4.0中,混合堆栈——用于感知的rnn,用于知识基础的llm,以及用于协调行动的代理——为可靠、安全和通信高效的预测性维护提供了一条实用的途径。
{"title":"A Systematic Review of Intelligent Agents, Language Models, and Recurrent Neural Networks in Industrial Maintenance: Driving Value Creation for the Mining Sector","authors":"Luis Rojas,&nbsp;Beatriz Hernandez,&nbsp;José Garcia","doi":"10.1155/int/9953223","DOIUrl":"https://doi.org/10.1155/int/9953223","url":null,"abstract":"<p>This PRISMA 2020–compliant systematic review examines how intelligent agents, large language models (LLMs), and recurrent neural networks (RNNs) can be combined for industrial maintenance, with a sector-specific focus on mining. Scopus and Web of Science (2018–2025) were searched using replicable queries, and a dual text-representation pipeline (TF–IDF with bi/trigrams and sentence-transformer embeddings) was applied. Model selection scanned <i>k</i> over a predefined grid with internal indices (Silhouette, Davies–Bouldin, and Calinski–Harabasz), and robustness was assessed through multiseed stability, bootstrap consensus, representation-sensitivity checks, and a control run with HDBSCAN. Study quality and risk of bias were appraised with an AI-and-control–oriented matrix (ACE-QA). Two macroclusters emerged. The first centers on distributed control, consensus and formation, fault tolerance, observers, and learning-based designs (fuzzy/neural/RL), including finite/predefined-time and event/dynamic event–triggered mechanisms. The second addresses secure and resilient cooperation under cyber threats (DoS, deception, and FDIA), integrating observer-based estimation and communication-efficient protocols. Cross-cutting findings indicate that event-triggered updates reduce bandwidth and compute requirements, while robust estimation and fault-tolerant control improve availability under harsh conditions and intermittent networks—typical in mining. A maturity map suggests high technical readiness and growing adoption for RNN-based sensing analytics, advancing readiness but emerging adoption for multiagent coordination, and early adoption of LLMs for text-grounded maintenance intelligence. Evidence gaps persist in replicability, cross-site transfer, uncertainty reporting, and mining-grade validation at the edge. A design agenda is outlined that prioritizes digital-twin stress testing, edge-first evaluation of agent coordination, secure-by-design pipelines (authenticated/encrypted messaging and adversarial testing), and shift-aware validation. In sum, a hybrid stack—RNNs for perception, LLMs for knowledge grounding, and agents for coordinated action—offers a practical route to reliable, secure, and communication-efficient predictive maintenance in Mining 4.0.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/9953223","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Unified Deep Learning Framework for Student Performance Prediction With Multimodal Data in a Blended Learning Environment 在混合学习环境中使用多模态数据预测学生成绩的统一深度学习框架
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-11 DOI: 10.1155/int/7978546
Wu Xiuguo

Prediction of student performance is crucial for enhancing the quality of higher education worldwide by enabling timely interventions and personalized support. The blended learning environment, which integrates online and offline instruction, has become a predominant paradigm, yet it also brings significant challenges for the prediction of student performance due to the inherent complexity and heterogeneity of multimodal data generated across both environments. Specifically, existing approaches often fail to effectively leverage the synergistic potential of numerical behavioral traces and unstructured textual feedback from instructors. To address this problem, this study proposes a unified deep learning framework that integrates convolutional neural networks (CNN) and bidirectional long short-term memory (BiLSTM) to capture both temporal dynamics and spatial correlations among features in blended learning environment. Unlike previous studies, we incorporate teacher comments on student assignments as textual inputs alongside traditional numerical features. Extensive experiments based on a real-world dataset of approximately 14,878 student records from Shandong University of Finance and Economics (SDUFE) demonstrate that the proposed hybrid model outperforms existing approaches in predictive performance. The results also highlight the significant value of leveraging teacher feedback for improving prediction accuracy, offering practical insights for enhancing educational management and supporting student success in blended learning contexts in higher education.

预测学生的表现对于提高全球高等教育质量至关重要,因为它能提供及时的干预和个性化支持。融合线上和线下教学的混合式学习环境已经成为一种主流模式,但由于在这两种环境中生成的多模态数据固有的复杂性和异质性,它也给学生表现的预测带来了重大挑战。具体来说,现有的方法往往不能有效地利用数字行为痕迹和教师的非结构化文本反馈的协同潜力。为了解决这一问题,本研究提出了一个统一的深度学习框架,该框架集成了卷积神经网络(CNN)和双向长短期记忆(BiLSTM),以捕捉混合学习环境中特征之间的时间动态和空间相关性。与以前的研究不同,我们将教师对学生作业的评论作为文本输入与传统的数字特征结合起来。基于山东财经大学(SDUFE)约14,878名学生记录的真实数据集的大量实验表明,所提出的混合模型在预测性能方面优于现有方法。研究结果还强调了利用教师反馈来提高预测准确性的重要价值,为加强教育管理和支持学生在高等教育混合学习环境中取得成功提供了实际见解。
{"title":"A Unified Deep Learning Framework for Student Performance Prediction With Multimodal Data in a Blended Learning Environment","authors":"Wu Xiuguo","doi":"10.1155/int/7978546","DOIUrl":"https://doi.org/10.1155/int/7978546","url":null,"abstract":"<p>Prediction of student performance is crucial for enhancing the quality of higher education worldwide by enabling timely interventions and personalized support. The blended learning environment, which integrates online and offline instruction, has become a predominant paradigm, yet it also brings significant challenges for the prediction of student performance due to the inherent complexity and heterogeneity of multimodal data generated across both environments. Specifically, existing approaches often fail to effectively leverage the synergistic potential of numerical behavioral traces and unstructured textual feedback from instructors. To address this problem, this study proposes a unified deep learning framework that integrates convolutional neural networks (CNN) and bidirectional long short-term memory (BiLSTM) to capture both temporal dynamics and spatial correlations among features in blended learning environment. Unlike previous studies, we incorporate teacher comments on student assignments as textual inputs alongside traditional numerical features. Extensive experiments based on a real-world dataset of approximately 14,878 student records from Shandong University of Finance and Economics (SDUFE) demonstrate that the proposed hybrid model outperforms existing approaches in predictive performance. The results also highlight the significant value of leveraging teacher feedback for improving prediction accuracy, offering practical insights for enhancing educational management and supporting student success in blended learning contexts in higher education.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/7978546","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Task Scheduling for Heterogeneous Multi-Core Processors Based on Deep Reinforcement Learning 基于深度强化学习的异构多核处理器任务调度
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-06 DOI: 10.1155/int/7562400
Qiguang Tan, Wei Chen, Dake Liu

Heterogeneous multicore processor systems are commonly used for scheduling tasks of DAG applications. Deep reinforcement learning, with its superior ability to perceive decisions directly and handle high-dimensional state actions, has become a prevalent solution for scheduling these systems. However, the incomplete environment models and large action spaces of deep reinforcement learning present significant challenges to scheduling. This paper investigates a scheduling problem in a heterogeneous multicore processor environment. Initially, system environment information is extracted and encoded using a graph convolutional neural network based on integrating adapter and AdapterFusion into the transformer architecture. Then, by separating task selection and processor allocation, the decision space is reduced: the former uses a deep neural network to learn to select nodes, and the latter allocates processors using a heuristic scheduling algorithm combining earliest completion time-based node replication and rolling technology. The entire scheduling process is a Markov decision problem. Therefore, the PPO algorithm with dynamic adjustment of the clipping factor, combined with an advantage actor-critic network, is employed for training, optimizing, and evaluating the algorithm to find the optimal scheduling strategy. The training process adopts a reward function for the time and power consumption required for completed task scheduling to ensure that multiple DAG application task scheduling can achieve optimal performance. Experiments conducted in various environments with different parameters show that, compared to other algorithms, this algorithm reduces the overall execution time and power consumption cost of heterogeneous multicore processor tasks by 11.09%.

异构多核处理器系统通常用于DAG应用程序的调度任务。深度强化学习具有直接感知决策和处理高维状态行为的卓越能力,已成为调度这些系统的普遍解决方案。然而,深度强化学习的不完整环境模型和大的动作空间对调度提出了重大挑战。研究了异构多核处理器环境下的调度问题。首先,基于将适配器和AdapterFusion集成到变压器体系结构中的图卷积神经网络对系统环境信息进行提取和编码。然后,通过分离任务选择和处理器分配,减小决策空间:前者使用深度神经网络学习选择节点,后者使用基于最早完成时间的节点复制和滚动技术相结合的启发式调度算法分配处理器。整个调度过程是一个马尔可夫决策问题。因此,采用动态调整裁剪因子的PPO算法,结合优势行为者批判网络,对算法进行训练、优化和评估,寻找最优调度策略。训练过程对完成任务调度所需的时间和功耗采用奖励函数,以确保多DAG应用任务调度能够达到最优性能。在不同参数环境下进行的实验表明,与其他算法相比,该算法将异构多核处理器任务的总体执行时间和功耗成本降低了11.09%。
{"title":"Task Scheduling for Heterogeneous Multi-Core Processors Based on Deep Reinforcement Learning","authors":"Qiguang Tan,&nbsp;Wei Chen,&nbsp;Dake Liu","doi":"10.1155/int/7562400","DOIUrl":"https://doi.org/10.1155/int/7562400","url":null,"abstract":"<p>Heterogeneous multicore processor systems are commonly used for scheduling tasks of DAG applications. Deep reinforcement learning, with its superior ability to perceive decisions directly and handle high-dimensional state actions, has become a prevalent solution for scheduling these systems. However, the incomplete environment models and large action spaces of deep reinforcement learning present significant challenges to scheduling. This paper investigates a scheduling problem in a heterogeneous multicore processor environment. Initially, system environment information is extracted and encoded using a graph convolutional neural network based on integrating adapter and AdapterFusion into the transformer architecture. Then, by separating task selection and processor allocation, the decision space is reduced: the former uses a deep neural network to learn to select nodes, and the latter allocates processors using a heuristic scheduling algorithm combining earliest completion time-based node replication and rolling technology. The entire scheduling process is a Markov decision problem. Therefore, the PPO algorithm with dynamic adjustment of the clipping factor, combined with an advantage actor-critic network, is employed for training, optimizing, and evaluating the algorithm to find the optimal scheduling strategy. The training process adopts a reward function for the time and power consumption required for completed task scheduling to ensure that multiple DAG application task scheduling can achieve optimal performance. Experiments conducted in various environments with different parameters show that, compared to other algorithms, this algorithm reduces the overall execution time and power consumption cost of heterogeneous multicore processor tasks by 11.09%.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/7562400","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145469760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Underactuated Dynamic Visual Servoing of Aerial Mobile Robots Using Adaptive Calibration of Camera 基于摄像机自适应标定的空中移动机器人欠驱动动态视觉伺服
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-06 DOI: 10.1155/int/1464484
Yi Lyu, Aoqi Liu, Zhengfei Wen, Guanyu Lai, Weijun Yang, Qiangqiang Dong

The dynamic visual servoing problem studied in this paper differs from existing approaches in two key aspects: the dynamics of the aerial mobile robot are underactuated, and the onboard camera is adaptively calibrated. To address the first challenge, a novel cascade visual servoing framework is developed, consisting of three control loops: the image loop, the attitude loop, and the angular velocity loop. Based on this framework, an extended eye-in-hand vision system is constructed, in which the perspective projection of feature points onto the image plane is decoupled from the rigid body’s attitude. This design allows the proposed visual controller to effectively compensate for image dynamics. Furthermore, unknown intrinsic and extrinsic camera parameters make compensation for image dynamics more difficult. To overcome this issue, a depth-independent composite matrix is introduced, enabling the unknown visual dynamics to be linearly parameterized and integrated with an adaptive control technique. A novel online algorithm is developed to estimate the unknown camera parameters in real time, and an additional adaptation mechanism is incorporated to estimate the rotational inertia of the rigid body. Using Lyapunov theory and Barbalat’s lemma, it is proven that the image tracking error asymptotically converges to zero while all physical variables remain locally bounded. Experimental results confirm that the image tracking error converges to zero over time, with a maximum deviation of no more than two pixels, thereby validating the effectiveness of the proposed visual controller.

本文研究的动态视觉伺服问题与现有方法的不同之处在于两个关键方面:空中移动机器人的动力学欠驱动和机载摄像机的自适应标定。为了解决第一个挑战,开发了一种新的级联视觉伺服框架,该框架由三个控制回路组成:图像回路、姿态回路和角速度回路。基于该框架,构建了一个扩展的眼手视觉系统,该系统将特征点在图像平面上的透视投影与刚体姿态解耦。该设计允许所提出的视觉控制器有效地补偿图像动态。此外,未知的相机内外参数使图像动力学补偿变得更加困难。为了克服这个问题,引入了一个与深度无关的复合矩阵,使未知的视觉动态能够线性参数化,并与自适应控制技术相结合。提出了一种新的在线实时估计未知摄像机参数的算法,并引入了附加的自适应机制来估计刚体的转动惯量。利用Lyapunov理论和Barbalat引理,证明了当所有物理变量保持局部有界时,图像跟踪误差渐近收敛于零。实验结果证实,随着时间的推移,图像跟踪误差收敛到零,最大偏差不超过两个像素,从而验证了所提出视觉控制器的有效性。
{"title":"Underactuated Dynamic Visual Servoing of Aerial Mobile Robots Using Adaptive Calibration of Camera","authors":"Yi Lyu,&nbsp;Aoqi Liu,&nbsp;Zhengfei Wen,&nbsp;Guanyu Lai,&nbsp;Weijun Yang,&nbsp;Qiangqiang Dong","doi":"10.1155/int/1464484","DOIUrl":"https://doi.org/10.1155/int/1464484","url":null,"abstract":"<p>The dynamic visual servoing problem studied in this paper differs from existing approaches in two key aspects: the dynamics of the aerial mobile robot are underactuated, and the onboard camera is adaptively calibrated. To address the first challenge, a novel cascade visual servoing framework is developed, consisting of three control loops: the image loop, the attitude loop, and the angular velocity loop. Based on this framework, an extended eye-in-hand vision system is constructed, in which the perspective projection of feature points onto the image plane is decoupled from the rigid body’s attitude. This design allows the proposed visual controller to effectively compensate for image dynamics. Furthermore, unknown intrinsic and extrinsic camera parameters make compensation for image dynamics more difficult. To overcome this issue, a depth-independent composite matrix is introduced, enabling the unknown visual dynamics to be linearly parameterized and integrated with an adaptive control technique. A novel online algorithm is developed to estimate the unknown camera parameters in real time, and an additional adaptation mechanism is incorporated to estimate the rotational inertia of the rigid body. Using Lyapunov theory and Barbalat’s lemma, it is proven that the image tracking error asymptotically converges to zero while all physical variables remain locally bounded. Experimental results confirm that the image tracking error converges to zero over time, with a maximum deviation of no more than two pixels, thereby validating the effectiveness of the proposed visual controller.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/1464484","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145469761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Intelligent Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1