首页 > 最新文献

Machine learning with applications最新文献

英文 中文
IF 4.9 Pub Date : 2026-01-01
{"title":"","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100836"},"PeriodicalIF":4.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147055634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IF 4.9 Pub Date : 2026-01-01
{"title":"","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100859"},"PeriodicalIF":4.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147055646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IF 4.9 Pub Date : 2026-01-01
{"title":"","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100841"},"PeriodicalIF":4.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147055651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IF 4.9 Pub Date : 2026-01-01
{"title":"","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100817"},"PeriodicalIF":4.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147055660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-driven detection of tiny pests in foliage: Integrating image processing and deep learning 树叶中微小害虫的人工智能检测:融合图像处理和深度学习
IF 4.9 Pub Date : 2025-12-31 DOI: 10.1016/j.mlwa.2025.100834
Lucía Baeza-Moreno , Pedro Blanco-Carmona , Eduardo Hidalgo-Fort , Rubén Martín-Clemente , Ramón González-Carvajal
We present a novel computer vision method for detecting insect pests on plant and tree leaves under real-world conditions, combining deep learning with classical image processing techniques. Detecting small, sparsely distributed, or camouflaged insects is challenging, as current state-of-the-art object detection methods, primarily designed for larger objects, often overlook them. Our approach to this problem is twofold. First, we employ a deep learning model to analyze suspicious leaves for anomalies (a task well suited to deep learning). However, since deep models struggle with tiny objects in complex backgrounds, we complement them with conventional image processing to pre-identify potentially infested foliage, guiding the model toward relevant areas and mitigating its limitations. This combined strategy proves effective and competitive with other methods across diverse datasets and real-world scenarios. Furthermore, we also conduct a detailed analysis to interpret the model’s predictions, strengthening confidence in its effectiveness.
我们提出了一种新的计算机视觉方法,将深度学习与经典图像处理技术相结合,在现实世界条件下检测植物和树木叶片上的害虫。检测小的、稀疏分布的或伪装的昆虫是具有挑战性的,因为目前最先进的目标检测方法主要是为较大的目标设计的,经常忽略它们。我们解决这个问题的方法是双重的。首先,我们采用深度学习模型来分析可疑叶子的异常(这是一项非常适合深度学习的任务)。然而,由于深度模型在复杂背景下与微小物体作斗争,我们用传统的图像处理来补充它们,以预先识别潜在的受感染的树叶,引导模型走向相关区域并减轻其局限性。事实证明,这种组合策略在不同的数据集和现实场景中与其他方法相比是有效的,并且具有竞争力。此外,我们还进行了详细的分析来解释模型的预测,加强了对其有效性的信心。
{"title":"AI-driven detection of tiny pests in foliage: Integrating image processing and deep learning","authors":"Lucía Baeza-Moreno ,&nbsp;Pedro Blanco-Carmona ,&nbsp;Eduardo Hidalgo-Fort ,&nbsp;Rubén Martín-Clemente ,&nbsp;Ramón González-Carvajal","doi":"10.1016/j.mlwa.2025.100834","DOIUrl":"10.1016/j.mlwa.2025.100834","url":null,"abstract":"<div><div>We present a novel computer vision method for detecting insect pests on plant and tree leaves under real-world conditions, combining deep learning with classical image processing techniques. Detecting small, sparsely distributed, or camouflaged insects is challenging, as current state-of-the-art object detection methods, primarily designed for larger objects, often overlook them. Our approach to this problem is twofold. First, we employ a deep learning model to analyze suspicious leaves for anomalies (a task well suited to deep learning). However, since deep models struggle with tiny objects in complex backgrounds, we complement them with conventional image processing to pre-identify potentially infested foliage, guiding the model toward relevant areas and mitigating its limitations. This combined strategy proves effective and competitive with other methods across diverse datasets and real-world scenarios. Furthermore, we also conduct a detailed analysis to interpret the model’s predictions, strengthening confidence in its effectiveness.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100834"},"PeriodicalIF":4.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145924638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine-interactive decision-assistance using a pre-trained natural language processing model for 4D printing technique selection 使用预训练的自然语言处理模型进行4D打印技术选择的机器交互决策辅助
IF 4.9 Pub Date : 2025-12-30 DOI: 10.1016/j.mlwa.2025.100833
Chandramohan Abhishek , Nadimpalli Raghukiran
The present research showcases a machine-interactive approach for making decisions using a pre-trained natural language processing (NLP) model. The method is developed for 4D (4-dimensional) printing technique selection, as a plurality of variables is involved, such as process, material, design, and sequence selections. Due to the availability of numerous options, arriving at a preferred choice of technique requires expertise and time. The developed method aids in finding assistance from a single source. The approach incorporates bidirectional encoder representations from transformers (BERT), which accommodates parallel meanings of user requests, such as synonyms and adjectives, among others. The closed-loop system is programmed with a set of 7 prompts. It also introduces additional affirmation prompts to navigate both ambiguous phrasing and out-of-scope detection in order to receive a meaningful recommendation from the machine. The rule-governed technique (lightweight rule set) guides the selection of the conformable request during each prompt. The inference-based approach takes user requests, performs objective classification using BERT according to selected criteria, then dynamically filters the data, and recommends suggestions, with an inference time of 0.79 s. The modified model also establishes multi-level relationships among prompts for text classification. k-fold validation reached highest possible accuracy upon training with optimal hyperparameters. The fine-tuned method developed in Python environment can be generalized for other systems. The present research demonstrates the possibility of adapting an openly accessible model for developing a decision-assistance system with minimal personal computational resources.
本研究展示了使用预训练的自然语言处理(NLP)模型进行决策的机器交互方法。该方法是为4D(四维)打印技术选择而开发的,因为涉及多个变量,如工艺,材料,设计和顺序选择。由于可供选择的方法很多,要找到一种最佳的技术需要专业知识和时间。开发的方法有助于从单一来源寻求帮助。该方法结合了来自转换器(BERT)的双向编码器表示,它可以容纳用户请求的并行含义,例如同义词和形容词等。闭环系统由一组7个提示程序编程。它还引入了额外的确认提示,以导航模糊的短语和超出范围的检测,以便从机器接收有意义的推荐。规则控制的技术(轻量级规则集)指导在每个提示期间选择符合的请求。基于推理的方法接受用户请求,根据选择的标准使用BERT进行客观分类,然后动态过滤数据并推荐建议,推理时间为0.79 s。修改后的模型还建立了文本分类提示之间的多级关系。K-fold验证在最优超参数训练后达到最高可能的准确性。在Python环境中开发的微调方法可以推广到其他系统。目前的研究表明,采用开放可访问的模型来开发具有最小个人计算资源的决策辅助系统的可能性。
{"title":"Machine-interactive decision-assistance using a pre-trained natural language processing model for 4D printing technique selection","authors":"Chandramohan Abhishek ,&nbsp;Nadimpalli Raghukiran","doi":"10.1016/j.mlwa.2025.100833","DOIUrl":"10.1016/j.mlwa.2025.100833","url":null,"abstract":"<div><div>The present research showcases a machine-interactive approach for making decisions using a pre-trained natural language processing (NLP) model. The method is developed for 4D (4-dimensional) printing technique selection, as a plurality of variables is involved, such as process, material, design, and sequence selections. Due to the availability of numerous options, arriving at a preferred choice of technique requires expertise and time. The developed method aids in finding assistance from a single source. The approach incorporates bidirectional encoder representations from transformers (BERT), which accommodates parallel meanings of user requests, such as synonyms and adjectives, among others. The closed-loop system is programmed with a set of 7 prompts. It also introduces additional affirmation prompts to navigate both ambiguous phrasing and out-of-scope detection in order to receive a meaningful recommendation from the machine. The rule-governed technique (lightweight rule set) guides the selection of the conformable request during each prompt. The inference-based approach takes user requests, performs objective classification using BERT according to selected criteria, then dynamically filters the data, and recommends suggestions, with an inference time of 0.79 s. The modified model also establishes multi-level relationships among prompts for text classification. k-fold validation reached highest possible accuracy upon training with optimal hyperparameters. The fine-tuned method developed in Python environment can be generalized for other systems. The present research demonstrates the possibility of adapting an openly accessible model for developing a decision-assistance system with minimal personal computational resources.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100833"},"PeriodicalIF":4.9,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145925172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LSEL: A lightweight deep learning model for social-emotional classification of classical music LSEL:用于古典音乐社会情感分类的轻量级深度学习模型
IF 4.9 Pub Date : 2025-12-30 DOI: 10.1016/j.mlwa.2025.100832
Yuan-Jin Lin , Yu-Chi Chou , Shan-Ken Chien , Pen-Chiang Chao , Kuang-Kai Yeh , Yen-Chia Peng , Chen-Hao Tsao , Chih-Yun Chen , Shih-Lun Chen , Kuo-Chen Li , Wei-Chen Tu

Background/Objectives

Social-emotional learning (SEL) plays a crucial role in special education, yet current assessment approaches rely heavily on subjective teacher observation, which can be time-consuming and difficult to standardize. Music provides a meaningful medium for evaluating emotional competencies, creating an opportunity for artificial intelligence to support more objective and scalable SEL assessment.

Methods

We propose a lightweight social-emotional music classification model, termed LSEL, designed to identify three SEL-related competencies: Empathetic Perspective-Taking, Outlook, and Problem-Solving. LSEL utilizes 40×128 mel-frequency cepstral coefficient as input to capture core spectral–temporal characteristics relevant to SEL perception. Moreover, we provided an open-source SEM dataset for domain experts, utilizing 591 samples, which consisted of 194 Empathetic, 214 Outlook, and 183 Perspective-Taking samples, to analyze LSEL performance.

Results

LSEL reaching an average accuracy of 96.55 % and mAP of 99.29 % across experiments. With Cohen’s κ averaging 94.32 % and R² reaching 94.15 %, indicating high consistency with ground-truth. Per-category accuracies were similarly stable, including 96.95 % for Empathetic Perspective-Taking, 95.16 % for Outlook, and 95.36 % for Problem-Solving.

Conclusions

The lightweight LSEL framework offers an effective and robust solution for social-emotional music classification, supporting objective SEL assessment in educational contexts. The release of the SEM dataset further contributes to a valuable resource for advancing AI-assisted SEL research.
背景/目的社会情绪学习(SEL)在特殊教育中起着至关重要的作用,但目前的评估方法严重依赖于教师的主观观察,这既耗时又难以标准化。音乐为评估情感能力提供了一种有意义的媒介,为人工智能支持更客观、可扩展的情感能力评估创造了机会。方法我们提出了一个轻量级的社会情感音乐分类模型,称为LSEL,旨在识别三种与sel相关的能力:移情视角,展望和问题解决。LSEL利用40×128 mel-frequency倒谱系数作为输入,捕捉与SEL感知相关的核心频谱-时间特征。此外,我们为领域专家提供了一个开源的SEM数据集,利用591个样本,其中包括194个移情样本,214个展望样本和183个视角样本,来分析LSEL的表现。结果slsel的平均准确率为96.55%,mAP的平均准确率为99.29%。Cohen’s κ均值为94.32%,R²均值为94.15%,与ground-truth的一致性较高。每个类别的准确性同样稳定,包括共情换位思考的96.95%,展望的95.16%,问题解决的95.36%。结论轻量级LSEL框架为社会情感音乐分类提供了一种有效且稳健的解决方案,支持客观的教育背景下的SEL评估。SEM数据集的发布进一步为推进人工智能辅助SEL研究提供了宝贵的资源。
{"title":"LSEL: A lightweight deep learning model for social-emotional classification of classical music","authors":"Yuan-Jin Lin ,&nbsp;Yu-Chi Chou ,&nbsp;Shan-Ken Chien ,&nbsp;Pen-Chiang Chao ,&nbsp;Kuang-Kai Yeh ,&nbsp;Yen-Chia Peng ,&nbsp;Chen-Hao Tsao ,&nbsp;Chih-Yun Chen ,&nbsp;Shih-Lun Chen ,&nbsp;Kuo-Chen Li ,&nbsp;Wei-Chen Tu","doi":"10.1016/j.mlwa.2025.100832","DOIUrl":"10.1016/j.mlwa.2025.100832","url":null,"abstract":"<div><h3>Background/Objectives</h3><div>Social-emotional learning (SEL) plays a crucial role in special education, yet current assessment approaches rely heavily on subjective teacher observation, which can be time-consuming and difficult to standardize. Music provides a meaningful medium for evaluating emotional competencies, creating an opportunity for artificial intelligence to support more objective and scalable SEL assessment.</div></div><div><h3>Methods</h3><div>We propose a lightweight social-emotional music classification model, termed LSEL, designed to identify three SEL-related competencies: Empathetic Perspective-Taking, Outlook, and Problem-Solving. LSEL utilizes 40×128 mel-frequency cepstral coefficient as input to capture core spectral–temporal characteristics relevant to SEL perception. Moreover, we provided an open-source SEM dataset for domain experts, utilizing 591 samples, which consisted of 194 Empathetic, 214 Outlook, and 183 Perspective-Taking samples, to analyze LSEL performance.</div></div><div><h3>Results</h3><div>LSEL reaching an average accuracy of 96.55 % and mAP of 99.29 % across experiments. With Cohen’s κ averaging 94.32 % and R² reaching 94.15 %, indicating high consistency with ground-truth. Per-category accuracies were similarly stable, including 96.95 % for Empathetic Perspective-Taking, 95.16 % for Outlook, and 95.36 % for Problem-Solving.</div></div><div><h3>Conclusions</h3><div>The lightweight LSEL framework offers an effective and robust solution for social-emotional music classification, supporting objective SEL assessment in educational contexts. The release of the SEM dataset further contributes to a valuable resource for advancing AI-assisted SEL research.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100832"},"PeriodicalIF":4.9,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145925171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PRCSL: A privacy-preserving continual split learning framework for decentralized medical diagnosis PRCSL:用于分散医疗诊断的隐私保护持续分裂学习框架
IF 4.9 Pub Date : 2025-12-29 DOI: 10.1016/j.mlwa.2025.100828
Jungmin Eom , Minjun Kang , Myungkeun Yoon , Nikil Dutt , Jinkyu Kim , Jaekoo Lee
Deep learning-based medical AI systems are increasingly deployed for disease diagnosis in decentralized healthcare environments where data are siloed across hospitals and IoT devices and cannot be freely shared due to strict privacy and security regulations. However, most existing continual learning and distributed learning approaches either assume centrally aggregated data or overlook incremental clinical changes, leading to catastrophic forgetting when applied to real-world medical data streams.
This paper introduces a novel healthcare-specific framework that integrates continual learning and distributed learning methods to utilize medical AI models effectively by addressing the practical constraints of the healthcare and medical ecosystem, such as data privacy, security, and changing clinical environments. Through the proposed framework, medical clients, such as hospital devices and IoT-based smart devices, can collaboratively train deep learning-based models on distributed computing resources without sharing sensitive data. Additionally, by considering incremental characteristics in medical environments such as mutations, new diseases, and abnormalities, the proposed framework can improve the disease diagnosis of medical AI models in actual clinical scenarios.
We propose Privacy-preserving Rehearsal-based Continual Split Learning (PRCSL), a healthcare-specific continual split learning framework that combines differential-privacy-based exemplar sharing, a mutual information alignment (MIA) module to correct representation shifts induced by noisy exemplars, and a parameter-free nearest-mean-of-exemplars (NME) classifier to mitigate task-recency bias under non-IID data distributions. o=Across eight benchmark datasets, including four MedMNIST subsets, HAM10000, CCH5000, c=CIFAR,cp=, p=100, and SVHN, PRCSL achieves competitive performance compared with representative continual learning baselines in terms of average accuracy and average forgetting. In particular, PRCSL achieves up to 3.62%p higher average accuracy than the best baseline. These results indicate that PRCSL enables privacy-preserving, communication-efficient, and continually adaptable medical AI in realistic decentralized clinical and IoT-enabled ecosystems. Our code is publicly available at our repository.
基于深度学习的医疗人工智能系统越来越多地部署在分散的医疗环境中进行疾病诊断,这些环境中的数据分散在医院和物联网设备之间,由于严格的隐私和安全法规,无法自由共享。然而,大多数现有的持续学习和分布式学习方法要么假设集中汇总的数据,要么忽略增量临床变化,在应用于现实世界的医疗数据流时导致灾难性的遗忘。本文介绍了一种新的医疗保健特定框架,该框架集成了持续学习和分布式学习方法,通过解决医疗保健和医疗生态系统的实际限制,如数据隐私、安全性和不断变化的临床环境,有效地利用医疗人工智能模型。通过提出的框架,医疗客户端(如医院设备和基于物联网的智能设备)可以在不共享敏感数据的情况下,在分布式计算资源上协同训练基于深度学习的模型。此外,通过考虑突变、新疾病、异常等医疗环境中的增量特征,该框架可以提高医疗AI模型在实际临床场景中的疾病诊断能力。我们提出了一种基于隐私保护预演的持续分裂学习(PRCSL),这是一种医疗保健特定的持续分裂学习框架,它结合了基于差分隐私的范例共享,一个相互信息校准(MIA)模块来纠正由噪声范例引起的表示移位,以及一个无参数的最接近范例均值(NME)分类器来减轻非iid数据分布下的任务近因偏差。在八个基准数据集上,包括四个MedMNIST子集,HAM10000, CCH5000, c=CIFAR,cp=, p=100和SVHN, PRCSL在平均准确率和平均遗忘方面与代表性的持续学习基线相比具有竞争力。特别是,PRCSL的平均准确度比最佳基线高出3.62%p。这些结果表明,PRCSL能够在现实的分散临床和物联网生态系统中实现隐私保护、通信高效和持续适应性强的医疗人工智能。我们的代码在我们的存储库中是公开的。
{"title":"PRCSL: A privacy-preserving continual split learning framework for decentralized medical diagnosis","authors":"Jungmin Eom ,&nbsp;Minjun Kang ,&nbsp;Myungkeun Yoon ,&nbsp;Nikil Dutt ,&nbsp;Jinkyu Kim ,&nbsp;Jaekoo Lee","doi":"10.1016/j.mlwa.2025.100828","DOIUrl":"10.1016/j.mlwa.2025.100828","url":null,"abstract":"<div><div>Deep learning-based medical AI systems are increasingly deployed for disease diagnosis in decentralized healthcare environments where data are siloed across hospitals and IoT devices and cannot be freely shared due to strict privacy and security regulations. However, most existing continual learning and distributed learning approaches either assume centrally aggregated data or overlook incremental clinical changes, leading to catastrophic forgetting when applied to real-world medical data streams.</div><div>This paper introduces a novel healthcare-specific framework that integrates continual learning and distributed learning methods to utilize medical AI models effectively by addressing the practical constraints of the healthcare and medical ecosystem, such as data privacy, security, and changing clinical environments. Through the proposed framework, medical clients, such as hospital devices and IoT-based smart devices, can collaboratively train deep learning-based models on distributed computing resources without sharing sensitive data. Additionally, by considering incremental characteristics in medical environments such as mutations, new diseases, and abnormalities, the proposed framework can improve the disease diagnosis of medical AI models in actual clinical scenarios.</div><div>We propose Privacy-preserving Rehearsal-based Continual Split Learning (PRCSL), a healthcare-specific continual split learning framework that combines differential-privacy-based exemplar sharing, a mutual information alignment (MIA) module to correct representation shifts induced by noisy exemplars, and a parameter-free nearest-mean-of-exemplars (NME) classifier to mitigate task-recency bias under non-IID data distributions. o=Across eight benchmark datasets, including four MedMNIST subsets, HAM10000, CCH5000, c=CIFAR,cp=, p=100, and SVHN, PRCSL achieves competitive performance compared with representative continual learning baselines in terms of average accuracy and average forgetting. In particular, PRCSL achieves up to 3.62%p higher average accuracy than the best baseline. These results indicate that PRCSL enables privacy-preserving, communication-efficient, and continually adaptable medical AI in realistic decentralized clinical and IoT-enabled ecosystems. Our code is publicly available at our repository.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100828"},"PeriodicalIF":4.9,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A traffic-aware federated learning prediction framework with custom aggregation 具有自定义聚合的流量感知联邦学习预测框架
IF 4.9 Pub Date : 2025-12-29 DOI: 10.1016/j.mlwa.2025.100829
Seerat Kaur, Sukhjit Singh Sehra, Darisuh Ebrahimi, Emad A. Mohammed
Reliable traffic predictions are essential for managing congestion, optimizing routes, improving commuter safety, and advancing the performance of intelligent transportation systems (ITS). However, existing centralized systems often lack adaptability to real-world traffic patterns and fail to capture spatio-temporal variability and client-level heterogeneity. These systems require large amounts of sensitive data to be collected on central servers, intensifying privacy risks. This study proposes a privacy-preserving Federated Learning (FL) framework for traffic flow and speed prediction (5 to 60 mins ahead) using non-independent and identically distributed (non-IID) traffic data. The objectives of this study are threefold: (1) design a client-aware custom FL aggregation strategy that accounts for traffic heterogeneity and client-specific dynamics, ignored in standard FL methods, (2) improve personalization by grouping clients based on real-world traffic pattern similarity via clustering-based approach and, (3) enhance convergence and predictive performance of global aggregation using dynamic, traffic-aware aggregation scores. The proposed framework designs a hybrid FL long-short-term memory (FedLSTM) model augmented with an attention mechanism to effectively model both temporal and spatial traffic variations across junctions, while ensuring that all raw data remains local. To improve learning under traffic diversity and imbalanced traffic distribution patterns, we propose a custom traffic-aware aggregation strategy that dynamically weighs client contributions based on six traffic-based metrics. Evaluations on clustered client partitions demonstrate that our custom aggregation consistently outperformed the baseline strategies across multiple evaluation metrics. These results highlight the effectiveness of integrating traffic-aware aggregation in enhancing the performance and generalization capability of FL-based traffic prediction frameworks.
可靠的交通预测对于管理拥堵、优化路线、提高通勤安全性和提高智能交通系统(ITS)的性能至关重要。然而,现有的集中式系统往往缺乏对现实世界交通模式的适应性,无法捕捉时空变化和客户级异质性。这些系统需要在中央服务器上收集大量敏感数据,从而加剧了隐私风险。本研究提出了一个隐私保护的联邦学习(FL)框架,用于使用非独立和同分布(非iid)交通数据进行交通流量和速度预测(提前5至60分钟)。本研究的目标有三个:(1)设计一个客户感知的自定义FL聚合策略,该策略考虑了标准FL方法中忽略的流量异质性和客户特定动态;(2)通过基于聚类的方法,根据真实交通模式的相似性对客户进行分组,从而提高个性化;(3)使用动态的、流量感知的聚合分数,增强全局聚合的收敛性和预测性能。该框架设计了一个混合FL长短期记忆(FedLSTM)模型,增强了注意机制,以有效地模拟跨路口的时空交通变化,同时确保所有原始数据保持本地。为了提高在流量多样性和不平衡流量分布模式下的学习能力,我们提出了一种自定义流量感知聚合策略,该策略基于六个基于流量的指标动态加权客户端贡献。对集群客户机分区的评估表明,我们的自定义聚合在多个评估指标上的性能始终优于基线策略。这些结果突出了集成流量感知聚合在提高基于fl的流量预测框架的性能和泛化能力方面的有效性。
{"title":"A traffic-aware federated learning prediction framework with custom aggregation","authors":"Seerat Kaur,&nbsp;Sukhjit Singh Sehra,&nbsp;Darisuh Ebrahimi,&nbsp;Emad A. Mohammed","doi":"10.1016/j.mlwa.2025.100829","DOIUrl":"10.1016/j.mlwa.2025.100829","url":null,"abstract":"<div><div>Reliable traffic predictions are essential for managing congestion, optimizing routes, improving commuter safety, and advancing the performance of intelligent transportation systems (ITS). However, existing centralized systems often lack adaptability to real-world traffic patterns and fail to capture spatio-temporal variability and client-level heterogeneity. These systems require large amounts of sensitive data to be collected on central servers, intensifying privacy risks. This study proposes a privacy-preserving Federated Learning (FL) framework for traffic flow and speed prediction (5 to 60 mins ahead) using non-independent and identically distributed (non-IID) traffic data. The objectives of this study are threefold: (1) design a client-aware custom FL aggregation strategy that accounts for traffic heterogeneity and client-specific dynamics, ignored in standard FL methods, (2) improve personalization by grouping clients based on real-world traffic pattern similarity via clustering-based approach and, (3) enhance convergence and predictive performance of global aggregation using dynamic, traffic-aware aggregation scores. The proposed framework designs a hybrid FL long-short-term memory (FedLSTM) model augmented with an attention mechanism to effectively model both temporal and spatial traffic variations across junctions, while ensuring that all raw data remains local. To improve learning under traffic diversity and imbalanced traffic distribution patterns, we propose a custom traffic-aware aggregation strategy that dynamically weighs client contributions based on six traffic-based metrics. Evaluations on clustered client partitions demonstrate that our custom aggregation consistently outperformed the baseline strategies across multiple evaluation metrics. These results highlight the effectiveness of integrating traffic-aware aggregation in enhancing the performance and generalization capability of FL-based traffic prediction frameworks.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100829"},"PeriodicalIF":4.9,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145925173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-time wheat growth stage detection via improved Swin transformer for edge devices 基于改进Swin变压器的边缘设备小麦生长阶段实时检测
IF 4.9 Pub Date : 2025-12-29 DOI: 10.1016/j.mlwa.2025.100831
Xianyuan Zhu
Accurate identification of crop growth stages is crucial for precision agriculture and automated field management. This study designed and developed an improved Swin Transformer-based detection system for wheat growth stages, with an emphasis on real time deployment on embedded edge devices. Specifically, we incorporate a Progressive Transfer Learning strategy to ensure robust generalization on agricultural data and introduce an Ordinal Regression Loss to effectively mitigate misclassifications in transitional growth stages. The proposed approach integrates a hierarchical Transformer backbone with an optimized deployment pipeline for NVIDIA Jetson Orin NX, supporting gallery images, video streams, and live camera inputs. Experimental evaluation demonstrated that the system achieves consistently high recognition accuracy (above 93%) while maintaining real-time performance (above 12FPS) under different modes, with moderate power consumption (6–8 W). Compared with baseline CNNs (ResNet-50, MobileNetV3) and Transformer models (ViT), the proposed design achieves a favorable balance among accuracy, efficiency, and robustness. These results suggest that the system can contribute to the development of practical agricultural monitoring and provide a step toward intelligent control strategies in precision farming.
作物生长阶段的准确识别对于精准农业和自动化田间管理至关重要。本研究设计并开发了一种改进的基于Swin变压器的小麦生长阶段检测系统,重点是在嵌入式边缘设备上的实时部署。具体而言,我们采用渐进迁移学习策略来确保农业数据的鲁棒泛化,并引入序数回归损失来有效减轻过渡生长阶段的错误分类。所提出的方法集成了一个分层Transformer主干和一个针对NVIDIA Jetson Orin NX的优化部署管道,支持图库图像、视频流和实时摄像机输入。实验评估表明,该系统在不同模式下均能保持较高的识别准确率(93%以上),同时保持实时性(12FPS以上),且功耗适中(6-8 W)。与基线cnn (ResNet-50、MobileNetV3)和Transformer模型(ViT)相比,本文提出的设计在准确率、效率和鲁棒性之间取得了良好的平衡。这些结果表明,该系统可以促进实际农业监测的发展,并为精准农业的智能控制策略提供一步。
{"title":"Real-time wheat growth stage detection via improved Swin transformer for edge devices","authors":"Xianyuan Zhu","doi":"10.1016/j.mlwa.2025.100831","DOIUrl":"10.1016/j.mlwa.2025.100831","url":null,"abstract":"<div><div>Accurate identification of crop growth stages is crucial for precision agriculture and automated field management. This study designed and developed an improved Swin Transformer-based detection system for wheat growth stages, with an emphasis on real time deployment on embedded edge devices. Specifically, we incorporate a Progressive Transfer Learning strategy to ensure robust generalization on agricultural data and introduce an Ordinal Regression Loss to effectively mitigate misclassifications in transitional growth stages. The proposed approach integrates a hierarchical Transformer backbone with an optimized deployment pipeline for NVIDIA Jetson Orin NX, supporting gallery images, video streams, and live camera inputs. Experimental evaluation demonstrated that the system achieves consistently high recognition accuracy (above 93%) while maintaining real-time performance (above 12FPS) under different modes, with moderate power consumption (6–8 W). Compared with baseline CNNs (ResNet-50, MobileNetV3) and Transformer models (ViT), the proposed design achieves a favorable balance among accuracy, efficiency, and robustness. These results suggest that the system can contribute to the development of practical agricultural monitoring and provide a step toward intelligent control strategies in precision farming.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100831"},"PeriodicalIF":4.9,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Machine learning with applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1