首页 > 最新文献

Neural Computing & Applications最新文献

英文 中文
Difference rewards policy gradients. 差异奖励政策梯度。
IF 4.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-01 Epub Date: 2022-11-11 DOI: 10.1007/s00521-022-07960-5
Jacopo Castellini, Sam Devlin, Frans A Oliehoek, Rahul Savani

Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however, that is not addressed by many of these methods is multi-agent credit assignment: assessing an agent's contribution to the overall performance, which is crucial for learning good policies. We propose a novel algorithm called Dr.Reinforce that explicitly tackles this by combining difference rewards with policy gradients to allow for learning decentralized policies when the reward function is known. By differencing the reward function directly, Dr.Reinforce avoids difficulties associated with learning the Q-function as done by counterfactual multi-agent policy gradients (COMA), a state-of-the-art difference rewards method. For applications where the reward function is unknown, we show the effectiveness of a version of Dr.Reinforce that learns an additional reward network that is used to estimate the difference rewards.

策略梯度方法已成为多智能体强化学习中最流行的算法之一。然而,许多方法都没有解决的一个关键挑战是多代理信用分配:评估代理对整体性能的贡献,这对于学习好的策略至关重要。我们提出了一种名为Dr.Reinforce的新算法,它通过将差异奖励与策略梯度相结合来明确解决这个问题,以便在奖励函数已知的情况下学习分散的策略。通过直接区分奖励函数,Dr.Reinforce避免了与学习q函数相关的困难,这与反事实多代理策略梯度(反事实多代理策略梯度,一种最先进的差异奖励方法)是一样的。对于奖励函数未知的应用程序,我们展示了Dr.Reinforce的一个版本的有效性,它学习了一个额外的奖励网络,用于估计不同的奖励。
{"title":"Difference rewards policy gradients.","authors":"Jacopo Castellini, Sam Devlin, Frans A Oliehoek, Rahul Savani","doi":"10.1007/s00521-022-07960-5","DOIUrl":"10.1007/s00521-022-07960-5","url":null,"abstract":"<p><p>Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however, that is not addressed by many of these methods is multi-agent credit assignment: assessing an agent's contribution to the overall performance, which is crucial for learning good policies. We propose a novel algorithm called Dr.Reinforce that explicitly tackles this by combining difference rewards with policy gradients to allow for learning decentralized policies when the reward function is known. By differencing the reward function directly, Dr.Reinforce avoids difficulties associated with learning the <i>Q</i>-function as done by counterfactual multi-agent policy gradients (COMA), a state-of-the-art difference rewards method. For applications where the reward function is unknown, we show the effectiveness of a version of Dr.Reinforce that learns an additional reward network that is used to estimate the difference rewards.</p>","PeriodicalId":49766,"journal":{"name":"Neural Computing & Applications","volume":"37 19","pages":"13163-13186"},"PeriodicalIF":4.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12204931/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144530735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Int-HRL: towards intention-based hierarchical reinforcement learning. Int-HRL:迈向基于意图的分层强化学习。
IF 4.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-01 Epub Date: 2024-12-11 DOI: 10.1007/s00521-024-10596-2
Anna Penzkofer, Simon Schaefer, Florian Strohm, Mihai Bâce, Stefan Leutenegger, Andreas Bulling

While deep reinforcement learning (RL) agents outperform humans on an increasing number of tasks, training them requires data equivalent to decades of human gameplay. Recent hierarchical RL methods have increased sample efficiency by incorporating information inherent to the structure of the decision problem but at the cost of having to discover or use human-annotated sub-goals that guide the learning process. We show that intentions of human players, i.e. the precursor of goal-oriented decisions, can be robustly predicted from eye gaze even for the long-horizon sparse rewards task of Montezuma's Revenge-one of the most challenging RL tasks in the Atari2600 game suite. We propose Int-HRL: Hierarchical RL with intention-based sub-goals that are inferred from human eye gaze. Our novel sub-goal extraction pipeline is fully automatic and replaces the need for manual sub-goal annotation by human experts. Our evaluations show that replacing hand-crafted sub-goals with automatically extracted intentions leads to an HRL agent that is significantly more sample efficient than previous methods.

虽然深度强化学习(RL)智能体在越来越多的任务上比人类表现得更好,但训练它们需要的数据相当于人类几十年的游戏经验。最近的分层强化学习方法通过结合决策问题结构固有的信息来提高样本效率,但代价是必须发现或使用人工注释的子目标来指导学习过程。我们的研究表明,人类玩家的意图,即目标导向决策的前身,可以通过眼睛的凝视来预测,即使是在《Montezuma’s revenge》的长期稀疏奖励任务中也是如此——这是Atari2600游戏套件中最具挑战性的强化学习任务之一。我们提出了Int-HRL:具有基于意图的子目标的分层强化学习,这些子目标是从人眼注视中推断出来的。我们的新子目标提取管道是全自动的,取代了人工标注子目标的需要。我们的评估表明,用自动提取的意图取代手工制作的子目标,可以产生比以前的方法更有效的HRL代理。
{"title":"Int-HRL: towards intention-based hierarchical reinforcement learning.","authors":"Anna Penzkofer, Simon Schaefer, Florian Strohm, Mihai Bâce, Stefan Leutenegger, Andreas Bulling","doi":"10.1007/s00521-024-10596-2","DOIUrl":"10.1007/s00521-024-10596-2","url":null,"abstract":"<p><p>While deep reinforcement learning (RL) agents outperform humans on an increasing number of tasks, training them requires data equivalent to decades of human gameplay. Recent hierarchical RL methods have increased sample efficiency by incorporating information inherent to the structure of the decision problem but at the cost of having to discover or use human-annotated sub-goals that guide the learning process. We show that intentions of human players, i.e. the precursor of goal-oriented decisions, can be robustly predicted from eye gaze even for the long-horizon sparse rewards task of Montezuma's Revenge-one of the most challenging RL tasks in the Atari2600 game suite. We propose <i>Int-HRL</i>: Hierarchical RL with intention-based sub-goals that are inferred from human eye gaze. Our novel sub-goal extraction pipeline is fully automatic and replaces the need for manual sub-goal annotation by human experts. Our evaluations show that replacing hand-crafted sub-goals with automatically extracted intentions leads to an HRL agent that is significantly more sample efficient than previous methods.</p>","PeriodicalId":49766,"journal":{"name":"Neural Computing & Applications","volume":"37 23","pages":"18823-18834"},"PeriodicalIF":4.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12313806/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144776756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep multi-objective reinforcement learning for utility-based infrastructural maintenance optimization. 基于公用事业基础设施维护优化的深度多目标强化学习。
IF 4.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-01 Epub Date: 2025-01-10 DOI: 10.1007/s00521-024-10954-0
Jesse van Remmerden, Maurice Kenter, Diederik M Roijers, Charalampos Andriotis, Yingqian Zhang, Zaharah Bukhsh

In this paper, we introduce multi-objective deep centralized multi-agent actor-critic (MO-DCMAC), a multi-objective reinforcement learning method for infrastructural maintenance optimization, an area traditionally dominated by single-objective reinforcement learning (RL) approaches. Previous single-objective RL methods combine multiple objectives, such as probability of collapse and cost, into a singular reward signal through reward-shaping. In contrast, MO-DCMAC can optimize a policy for multiple objectives directly, even when the utility function is nonlinear. We evaluated MO-DCMAC using two utility functions, which use probability of collapse and cost as input. The first utility function is the threshold utility, in which MO-DCMAC should minimize cost so that the probability of collapse is never above the threshold. The second is based on the failure mode, effects, and criticality analysis methodology used by asset managers to assess maintenance plans. We evaluated MO-DCMAC, with both utility functions, in multiple maintenance environments, including ones based on a case study of the historical quay walls of Amsterdam. The performance of MO-DCMAC was compared against multiple rule-based policies based on heuristics currently used for constructing maintenance plans. Our results demonstrate that MO-DCMAC outperforms traditional rule-based policies across various environments and utility functions.

在本文中,我们引入了多目标深度集中式多智能体行为者评价(MO-DCMAC),这是一种多目标强化学习方法,用于基础设施维护优化,这是一个传统上由单目标强化学习(RL)方法主导的领域。以往的单目标强化学习方法通过奖励塑造将崩溃概率和成本等多个目标组合成一个单一的奖励信号。相比之下,MO-DCMAC可以直接针对多个目标优化策略,即使效用函数是非线性的。我们使用两个效用函数来评估MO-DCMAC,它们使用崩溃概率和成本作为输入。第一个实用函数是阈值实用函数,其中MO-DCMAC应该最小化成本,以便崩溃的概率永远不会超过阈值。第二种是基于资产管理公司用来评估维护计划的故障模式、影响和临界性分析方法。我们在多种维护环境中评估了MO-DCMAC,其中包括基于阿姆斯特丹历史码头墙的案例研究。将MO-DCMAC的性能与目前用于构建维护计划的基于启发式的多个基于规则的策略进行了比较。我们的结果表明,MO-DCMAC在各种环境和实用功能中优于传统的基于规则的策略。
{"title":"Deep multi-objective reinforcement learning for utility-based infrastructural maintenance optimization.","authors":"Jesse van Remmerden, Maurice Kenter, Diederik M Roijers, Charalampos Andriotis, Yingqian Zhang, Zaharah Bukhsh","doi":"10.1007/s00521-024-10954-0","DOIUrl":"10.1007/s00521-024-10954-0","url":null,"abstract":"<p><p>In this paper, we introduce multi-objective deep centralized multi-agent actor-critic (MO-DCMAC), a multi-objective reinforcement learning method for infrastructural maintenance optimization, an area traditionally dominated by single-objective reinforcement learning (RL) approaches. Previous single-objective RL methods combine multiple objectives, such as probability of collapse and cost, into a singular reward signal through reward-shaping. In contrast, MO-DCMAC can optimize a policy for multiple objectives directly, even when the utility function is nonlinear. We evaluated MO-DCMAC using two utility functions, which use probability of collapse and cost as input. The first utility function is the threshold utility, in which MO-DCMAC should minimize cost so that the probability of collapse is never above the threshold. The second is based on the failure mode, effects, and criticality analysis methodology used by asset managers to assess maintenance plans. We evaluated MO-DCMAC, with both utility functions, in multiple maintenance environments, including ones based on a case study of the historical quay walls of Amsterdam. The performance of MO-DCMAC was compared against multiple rule-based policies based on heuristics currently used for constructing maintenance plans. Our results demonstrate that MO-DCMAC outperforms traditional rule-based policies across various environments and utility functions.</p>","PeriodicalId":49766,"journal":{"name":"Neural Computing & Applications","volume":"37 30","pages":"24719-24742"},"PeriodicalIF":4.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12511271/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145281527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A fairness scale for real-time recidivism forecasts using a national database of convicted offenders. 一个利用国家罪犯数据库实时预测累犯的公平尺度。
IF 4.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-01 Epub Date: 2025-08-01 DOI: 10.1007/s00521-025-11478-x
Jacob Verrey, Peter Neyroud, Lawrence Sherman, Barak Ariel

This investigation explores whether machine learning can predict recidivism while addressing societal biases. To investigate this, we obtained conviction data from the UK's Police National Computer (PNC) on 346,685 records between January 1, 2000, and February 3, 2006 (His Majesty's Inspectorate of Constabulary in Use of the Police National Computer: An inspection of the ACRO Criminal Records Office. His Majesty's Inspectorate of Constabulary, Birmingham, https://assets-hmicfrs.justiceinspectorates.gov.uk/uploads/police-national-computer-use-acro-criminal-records-office.pdf, 2017). We generate twelve machine learning models-six to forecast general recidivism, and six to forecast violent recidivism-over a 3-year period, evaluated via fivefold cross-validation. Our best-performing models outperform the existing state-of-the-arts, receiving an area under curve (AUC) score of 0.8660 and 0.8375 for general and violent recidivism, respectively. Next, we construct a fairness scale that communicates the semantic and technical trade-offs associated with debiasing a criminal justice forecasting model. We use this scale to debias our best-performing models. Results indicate both models can achieve all five fairness definitions because the metrics measuring these definitions-the statistical range of recall, precision, positive rate, and error balance between demographics-indicate that these scores are within a one percentage point difference of each other. Deployment recommendations and implications are discussed. These include recommended safeguards against false positives, an explication of how these models addressed societal biases, and a case study illustrating how these models can improve existing criminal justice practices. That is, these models may help police identify fewer people in a way less impacted by structural bias while still reducing crime. A randomized control trial is proposed to test this illustrated case study, and further directions explored.

Supplementary information: The online version contains supplementary material available at 10.1007/s00521-025-11478-x.

这项调查探讨了机器学习是否可以在解决社会偏见的同时预测再犯。为了对此进行调查,我们从英国警察国家计算机(PNC)中获得了2000年1月1日至2006年2月3日期间346,685条记录的定罪数据(《使用警察国家计算机的警察陛下检查:对犯罪记录办公室的检查》)。英国皇家警察监察局,伯明翰,https://assets-hmicfrs.justiceinspectorates.gov.uk/uploads/police-national-computer-use-acro-criminal-records-office.pdf, 2017)。我们生成了12个机器学习模型——6个用于预测一般累犯,6个用于预测暴力累犯——在3年的时间里,通过五倍交叉验证进行评估。我们的最佳表现模型优于现有的最先进的技术,对于一般和暴力累犯的曲线下面积(AUC)得分分别为0.8660和0.8375。接下来,我们构建了一个公平量表,该量表传达了与消除刑事司法预测模型偏见相关的语义和技术权衡。我们使用这个尺度来筛选表现最好的模型。结果表明,这两种模型都可以实现所有五个公平定义,因为衡量这些定义的指标——召回率、精确度、正确率和人口学之间的错误平衡的统计范围——表明这些分数彼此之间的差异在一个百分点以内。讨论了部署建议和含义。这些建议包括防止误报的建议措施,对这些模型如何解决社会偏见的解释,以及一个案例研究,说明这些模型如何改善现有的刑事司法实践。也就是说,这些模型可以帮助警察以一种较少受结构性偏见影响的方式识别更少的人,同时仍然减少犯罪。提出了一项随机对照试验来验证这一案例研究,并探索了进一步的方向。补充信息:在线版本包含补充资料,下载地址为10.1007/s00521-025-11478-x。
{"title":"A fairness scale for real-time recidivism forecasts using a national database of convicted offenders.","authors":"Jacob Verrey, Peter Neyroud, Lawrence Sherman, Barak Ariel","doi":"10.1007/s00521-025-11478-x","DOIUrl":"10.1007/s00521-025-11478-x","url":null,"abstract":"<p><p>This investigation explores whether machine learning can predict recidivism while addressing societal biases. To investigate this, we obtained conviction data from the UK's Police National Computer (PNC) on 346,685 records between January 1, 2000, and February 3, 2006 (His Majesty's Inspectorate of Constabulary in Use of the Police National Computer: An inspection of the ACRO Criminal Records Office. His Majesty's Inspectorate of Constabulary, Birmingham, https://assets-hmicfrs.justiceinspectorates.gov.uk/uploads/police-national-computer-use-acro-criminal-records-office.pdf, 2017). We generate twelve machine learning models-six to forecast general recidivism, and six to forecast violent recidivism-over a 3-year period, evaluated via fivefold cross-validation. Our best-performing models outperform the existing state-of-the-arts, receiving an area under curve (AUC) score of 0.8660 and 0.8375 for general and violent recidivism, respectively. Next, we construct a fairness scale that communicates the semantic and technical trade-offs associated with debiasing a criminal justice forecasting model. We use this scale to debias our best-performing models. Results indicate both models can achieve all five fairness definitions because the metrics measuring these definitions-the statistical range of recall, precision, positive rate, and error balance between demographics-indicate that these scores are within a one percentage point difference of each other. Deployment recommendations and implications are discussed. These include recommended safeguards against false positives, an explication of how these models addressed societal biases, and a case study illustrating how these models can improve existing criminal justice practices. That is, these models may help police identify fewer people in a way less impacted by structural bias while still reducing crime. A randomized control trial is proposed to test this illustrated case study, and further directions explored.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s00521-025-11478-x.</p>","PeriodicalId":49766,"journal":{"name":"Neural Computing & Applications","volume":"37 26","pages":"21607-21657"},"PeriodicalIF":4.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12401775/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144994036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling dislocation dynamics data using semantic web technologies. 基于语义web技术的错位动力学数据建模。
IF 4.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-01 Epub Date: 2024-12-14 DOI: 10.1007/s00521-024-10674-5
Ahmad Zainul Ihsan, Said Fathalla, Stefan Sandfeld

The research in Materials Science and Engineering focuses on the design, synthesis, properties, and performance of materials. An important class of materials that is widely investigated are crystalline materials, including metals and semiconductors. Crystalline material typically contains a specific type of defect called "dislocation". This defect significantly affects various material properties, including bending strength, fracture toughness, and ductility. Researchers have devoted a significant effort in recent years to understanding dislocation behaviour through experimental characterization techniques and simulations, e.g., dislocation dynamics simulations. This paper presents how data from dislocation dynamics simulations can be modelled using semantic web technologies through annotating data with ontologies. We extend the dislocation ontology by adding missing concepts and aligning it with two other domain-related ontologies (i.e., the Elementary Multi-perspective Material Ontology and the Materials Design Ontology), allowing for efficiently representing the dislocation simulation data. Moreover, we present a real-world use case for representing the discrete dislocation dynamics data as a knowledge graph (DisLocKG) which can depict the relationship between them. We also developed a SPARQL endpoint that brings extensive flexibility for querying DisLocKG.

材料科学与工程专业主要研究材料的设计、合成、性能和性能。被广泛研究的一类重要材料是晶体材料,包括金属和半导体。晶体材料通常含有一种称为“位错”的特殊缺陷。该缺陷显著影响各种材料性能,包括抗弯强度、断裂韧性和延展性。近年来,研究人员通过实验表征技术和模拟(例如位错动力学模拟)来理解位错行为。本文介绍了如何使用语义网技术通过用本体注释数据来对错位动力学模拟数据进行建模。我们通过添加缺失概念并将其与其他两个领域相关的本体(即基本多角度材料本体和材料设计本体)对齐来扩展位错本体,从而有效地表示位错模拟数据。此外,我们提出了将离散位错动力学数据表示为知识图(DisLocKG)的实际用例,该知识图可以描述它们之间的关系。我们还开发了一个SPARQL端点,它为查询DisLocKG带来了广泛的灵活性。
{"title":"Modeling dislocation dynamics data using semantic web technologies.","authors":"Ahmad Zainul Ihsan, Said Fathalla, Stefan Sandfeld","doi":"10.1007/s00521-024-10674-5","DOIUrl":"10.1007/s00521-024-10674-5","url":null,"abstract":"<p><p>The research in Materials Science and Engineering focuses on the design, synthesis, properties, and performance of materials. An important class of materials that is widely investigated are crystalline materials, including metals and semiconductors. Crystalline material typically contains a specific type of defect called \"dislocation\". This defect significantly affects various material properties, including bending strength, fracture toughness, and ductility. Researchers have devoted a significant effort in recent years to understanding dislocation behaviour through experimental characterization techniques and simulations, e.g., dislocation dynamics simulations. This paper presents how data from dislocation dynamics simulations can be modelled using semantic web technologies through annotating data with ontologies. We extend the dislocation ontology by adding missing concepts and aligning it with two other domain-related ontologies (i.e., the Elementary Multi-perspective Material Ontology and the Materials Design Ontology), allowing for efficiently representing the dislocation simulation data. Moreover, we present a real-world use case for representing the discrete dislocation dynamics data as a knowledge graph (DisLocKG) which can depict the relationship between them. We also developed a SPARQL endpoint that brings extensive flexibility for querying DisLocKG.</p>","PeriodicalId":49766,"journal":{"name":"Neural Computing & Applications","volume":"37 18","pages":"11737-11753"},"PeriodicalIF":4.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12174205/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144334315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Balancing misclassification errors in image-based inference using problem domain semantics and a nested cascade architecture. 利用问题域语义和嵌套级联结构平衡基于图像的推理中的误分类错误。
IF 4.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-01 Epub Date: 2025-09-13 DOI: 10.1007/s00521-025-11613-8
Xin Du, Rajesh Jena, Katayoun Farrahi, Mahesan Niranjan

Pattern recognition models, particularly neural networks, often focus on maximising classification accuracy. However, in practice, the types of errors made (misclassification between different classes) can have varying associated costs. Current methods overlook varying misclassification error types. Misclassification labels can either be available from expert knowledge or derived from semantics of textual descriptions of class labels. Exploiting such misclassification costs can have significant implications when deploying machine learning systems. Here, using five examples from image and tabular domains, we show how a deep neural architecture trained in a nested layer-wise fashion (cascade learning) in which early layers solve easier problems than later ones could exploit such hierarchical aspects of class labels. We employ a measure of performance called "severity" of errors and show how emphasis could be placed on classes that are deeper in the hierarchy, ignoring errors that arise between semantic neighbours.

Supplementary information: The online version contains supplementary material available at 10.1007/s00521-025-11613-8.

模式识别模型,特别是神经网络,通常关注于最大限度地提高分类精度。然而,在实践中,所犯错误的类型(不同类别之间的错误分类)可能会产生不同的相关成本。目前的方法忽略了不同的误分类错误类型。错误分类标签可以从专家知识中获得,也可以从类标签的文本描述的语义中获得。在部署机器学习系统时,利用这种错误分类成本可能会产生重大影响。在这里,使用来自图像和表格领域的五个示例,我们展示了如何以嵌套的分层方式(级联学习)训练深度神经架构,其中早期的层比后期的层解决更容易的问题,可以利用类标签的分层方面。我们采用了一种称为错误“严重性”的性能度量,并展示了如何将重点放在层次结构中更深的类上,而忽略语义相邻之间出现的错误。补充信息:在线版本包含补充资料,可在10.1007/s00521-025-11613-8获得。
{"title":"Balancing misclassification errors in image-based inference using problem domain semantics and a nested cascade architecture.","authors":"Xin Du, Rajesh Jena, Katayoun Farrahi, Mahesan Niranjan","doi":"10.1007/s00521-025-11613-8","DOIUrl":"10.1007/s00521-025-11613-8","url":null,"abstract":"<p><p>Pattern recognition models, particularly neural networks, often focus on maximising classification accuracy. However, in practice, the types of errors made (misclassification between different classes) can have varying associated costs. Current methods overlook varying misclassification error types. Misclassification labels can either be available from expert knowledge or derived from semantics of textual descriptions of class labels. Exploiting such misclassification costs can have significant implications when deploying machine learning systems. Here, using five examples from image and tabular domains, we show how a deep neural architecture trained in a nested layer-wise fashion (cascade learning) in which early layers solve easier problems than later ones could exploit such hierarchical aspects of class labels. We employ a measure of performance called \"severity\" of errors and show how emphasis could be placed on classes that are deeper in the hierarchy, ignoring errors that arise between semantic neighbours.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s00521-025-11613-8.</p>","PeriodicalId":49766,"journal":{"name":"Neural Computing & Applications","volume":"37 31","pages":"26021-26036"},"PeriodicalIF":4.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12535507/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145338090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fourier convolutional decoder: reconstructing solar flare images via deep learning. 傅里叶卷积解码器:通过深度学习重建太阳耀斑图像。
IF 4.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-01 Epub Date: 2025-05-27 DOI: 10.1007/s00521-025-11283-6
Merve Selcuk-Simsek, Paolo Massa, Hualin Xiao, Säm Krucker, André Csillaghy

Reconstructing images from observational data is a complex and time-consuming process, particularly in astronomy, where traditional algorithms like CLEAN require extensive computational resources and expert interpretation to distinguish genuine features from artifacts, especially without ground truth data. To address these challenges, we developed the Fourier convolutional decoder (FCD), a custom-made overcomplete autoencoder trained on simulated data with available ground truth. This enables the network to generate outputs that closely approximate expected ground truth. The model's versatility was demonstrated on both simulated and observational datasets, with a specific application to data from the spectrometer/telescope for imaging X-rays (STIX) on the solar orbiter. In the simulated environment, FCD's performance was evaluated using multiple-image reconstruction metrics, demonstrating its ability to produce accurate images with minimal artifacts. For observational data, FCD was compared with benchmark algorithms, focusing on reconstruction metrics related to Fourier components. Our evaluation found that FCD is the fastest imaging method, with runtimes on the order of milliseconds. Its computational cost is comparable to the most efficient reconstruction algorithm and 280 × faster than the slowest imaging method for single-image reconstruction on a CPU. Additionally, its runtime can be reduced by an order of magnitude for multiple-image reconstruction on a GPU. FCD outperforms or matches state-of-the-art methods on simulated data, achieving a mean MS-SSIM of 0.97, LPIPS of 0.04, PSNR of 35.70 dB, a Dice coefficient of 0.83, and a Hausdorff distance of 5.08. Finally, on experimental STIX observations, FCD remains competitive with top methods despite reduced performance compared to simulated data.

从观测数据中重建图像是一个复杂而耗时的过程,特别是在天文学中,像CLEAN这样的传统算法需要大量的计算资源和专家解释来区分真实特征和人工特征,特别是在没有地面真实数据的情况下。为了解决这些挑战,我们开发了傅里叶卷积解码器(FCD),这是一种定制的超完整自动编码器,使用可用的地面真值模拟数据进行训练。这使得网络产生的输出非常接近预期的真实值。该模型的多功能性在模拟和观测数据集上都得到了证明,并具体应用于太阳轨道器上用于成像x射线的光谱仪/望远镜(STIX)的数据。在模拟环境中,使用多图像重建指标对FCD的性能进行了评估,证明了其能够以最小的伪像生成准确的图像。对于观测数据,FCD与基准算法进行了比较,重点关注与傅里叶分量相关的重建指标。我们的评估发现,FCD是最快的成像方法,运行时间在毫秒量级。它的计算成本与最有效的重建算法相当,比最慢的成像方法在CPU上进行单图像重建快280倍。此外,对于GPU上的多图像重建,它的运行时间可以减少一个数量级。FCD在模拟数据上优于或匹配最先进的方法,平均MS-SSIM为0.97,LPIPS为0.04,PSNR为35.70 dB, Dice系数为0.83,豪斯多夫距离为5.08。最后,在实验STIX观测中,FCD与顶级方法相比仍然具有竞争力,尽管与模拟数据相比性能有所下降。
{"title":"Fourier convolutional decoder: reconstructing solar flare images via deep learning.","authors":"Merve Selcuk-Simsek, Paolo Massa, Hualin Xiao, Säm Krucker, André Csillaghy","doi":"10.1007/s00521-025-11283-6","DOIUrl":"10.1007/s00521-025-11283-6","url":null,"abstract":"<p><p>Reconstructing images from observational data is a complex and time-consuming process, particularly in astronomy, where traditional algorithms like CLEAN require extensive computational resources and expert interpretation to distinguish genuine features from artifacts, especially without ground truth data. To address these challenges, we developed the Fourier convolutional decoder (FCD), a custom-made overcomplete autoencoder trained on simulated data with available ground truth. This enables the network to generate outputs that closely approximate expected ground truth. The model's versatility was demonstrated on both simulated and observational datasets, with a specific application to data from the spectrometer/telescope for imaging X-rays (STIX) on the solar orbiter. In the simulated environment, FCD's performance was evaluated using multiple-image reconstruction metrics, demonstrating its ability to produce accurate images with minimal artifacts. For observational data, FCD was compared with benchmark algorithms, focusing on reconstruction metrics related to Fourier components. Our evaluation found that FCD is the fastest imaging method, with runtimes on the order of milliseconds. Its computational cost is comparable to the most efficient reconstruction algorithm and 280 <math><mo>×</mo></math> faster than the slowest imaging method for single-image reconstruction on a CPU. Additionally, its runtime can be reduced by an order of magnitude for multiple-image reconstruction on a GPU. FCD outperforms or matches state-of-the-art methods on simulated data, achieving a mean MS-SSIM of 0.97, LPIPS of 0.04, PSNR of 35.70 dB, a Dice coefficient of 0.83, and a Hausdorff distance of 5.08. Finally, on experimental STIX observations, FCD remains competitive with top methods despite reduced performance compared to simulated data.</p>","PeriodicalId":49766,"journal":{"name":"Neural Computing & Applications","volume":"37 20","pages":"15573-15604"},"PeriodicalIF":4.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12234595/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144602115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural network-based surrogate model in postprocessing of topology optimized structures. 基于神经网络的拓扑优化结构后处理代理模型。
IF 4.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-01 Epub Date: 2025-02-28 DOI: 10.1007/s00521-025-11039-2
Jude Thaddeus Persia, Myung Kyun Sung, Soobum Lee, Devin E Burns

This paper proposes a general method of creating an accurate neural network-based surrogate model for postprocessing a topologically optimized structure. When topology optimization results are converted into computer-aided design (CAD) files with smooth boundaries for manufacturability, finite element method (FEM) based stresses often do not agree with the topology optimized results due to changes of surface and mesh density. The conversion between topology optimization derived results and CAD files often requires postprocessing, an additional fine tuning of the geometry parameters to reconcile the change of the stress values. In this work, a feedforward, deep artificial neural network (DANN) is presented with varying architecture parameters that are found for each stress output of interest. This network is trained with the data based on a combination of Design of Experiments (DoE) models that have the geometry dimensions as inputs and stress readings under various loads as the outputs. A DANN-based surrogate model is constructed to enable fine tuning of all relevant stress performance metrics. This method of constructing an artificial network-based surrogate model minimizes the number of FEM computations required to generate an optimized, post-processed design. We present a case study of postprocessing a wind tunnel balance, a measurement device that yields the six force and moment components of a test aircraft. It needs to be designed considering multiple stress measures under combinations of the six loading conditions. Excellent performance of a neural network is presented in this paper in terms of accurate prediction of the highly nonlinear stresses under combinations of the six loads. Von Mises stress predictions are within 10% and axial force sensor stress predictions are within 2% for the final post-processed topology. The results support its usefulness for postprocessing of topology optimized structures.

本文提出了一种创建精确的基于神经网络的代理模型的通用方法,用于对拓扑优化结构进行后处理。在将拓扑优化结果转换为具有光滑可制造性边界的计算机辅助设计(CAD)文件时,由于表面和网格密度的变化,基于有限元法(FEM)的应力往往与拓扑优化结果不一致。拓扑优化结果与CAD文件之间的转换通常需要后处理,对几何参数进行额外的微调以协调应力值的变化。在这项工作中,提出了一个前馈深度人工神经网络(DANN),该网络具有针对每个感兴趣的应力输出找到的不同架构参数。该网络使用基于实验设计(DoE)模型组合的数据进行训练,该模型将几何尺寸作为输入,并将各种负载下的应力读数作为输出。构建了基于dann的代理模型,以实现所有相关应力性能指标的微调。这种构建基于人工网络的代理模型的方法最大限度地减少了生成优化后处理设计所需的FEM计算次数。我们提出了一个风洞平衡后处理的案例研究,风洞平衡是一种测量装置,可以产生测试飞机的六个力和力矩分量。在设计时需要考虑六种荷载条件组合下的多种应力措施。本文介绍了神经网络在六种载荷组合作用下高度非线性应力的准确预测方面的优异性能。对于最终的后处理拓扑,Von Mises应力预测在10%以内,轴向力传感器应力预测在2%以内。结果支持了该方法对拓扑优化结构后处理的有效性。
{"title":"Neural network-based surrogate model in postprocessing of topology optimized structures.","authors":"Jude Thaddeus Persia, Myung Kyun Sung, Soobum Lee, Devin E Burns","doi":"10.1007/s00521-025-11039-2","DOIUrl":"https://doi.org/10.1007/s00521-025-11039-2","url":null,"abstract":"<p><p>This paper proposes a general method of creating an accurate neural network-based surrogate model for postprocessing a topologically optimized structure. When topology optimization results are converted into computer-aided design (CAD) files with smooth boundaries for manufacturability, finite element method (FEM) based stresses often do not agree with the topology optimized results due to changes of surface and mesh density. The conversion between topology optimization derived results and CAD files often requires postprocessing, an additional fine tuning of the geometry parameters to reconcile the change of the stress values. In this work, a feedforward, deep artificial neural network (DANN) is presented with varying architecture parameters that are found for each stress output of interest. This network is trained with the data based on a combination of Design of Experiments (DoE) models that have the geometry dimensions as inputs and stress readings under various loads as the outputs. A DANN-based surrogate model is constructed to enable fine tuning of all relevant stress performance metrics. This method of constructing an artificial network-based surrogate model minimizes the number of FEM computations required to generate an optimized, post-processed design. We present a case study of postprocessing a wind tunnel balance, a measurement device that yields the six force and moment components of a test aircraft. It needs to be designed considering multiple stress measures under combinations of the six loading conditions. Excellent performance of a neural network is presented in this paper in terms of accurate prediction of the highly nonlinear stresses under combinations of the six loads. Von Mises stress predictions are within 10% and axial force sensor stress predictions are within 2% for the final post-processed topology. The results support its usefulness for postprocessing of topology optimized structures.</p>","PeriodicalId":49766,"journal":{"name":"Neural Computing & Applications","volume":"37 15","pages":"8845-8867"},"PeriodicalIF":4.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12053174/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144024819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning in public goods games: the effects of uncertainty and communication on cooperation. 公共物品博弈中的学习:不确定性和沟通对合作的影响。
IF 4.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-01 Epub Date: 2025-01-30 DOI: 10.1007/s00521-024-10530-6
Nicole Orzan, Erman Acar, Davide Grossi, Roxana Rădulescu

Communication is a widely used mechanism to promote cooperation in multi-agent systems. In the field of emergent communication, agents are typically trained in specific environments: cooperative, competitive or mixed-motive. Motivated by the idea that real-world settings are characterized by incomplete information and that humans face daily interactions under a wide spectrum of incentives, we aim to explore the role of emergent communication when simultaneously exploited across all these contexts. In this work, we pursue this line of research by focusing on social dilemmas. To do this, we developed an extended version of the Public Goods Game, which allows us to train independent reinforcement learning agents simultaneously in different scenarios where incentives are (mis)aligned to various extents. Additionally, agents experience uncertainty in terms of the alignment of their incentives with those of others. We equip agents with the ability to learn a communication policy and study the impact of emergent communication in the face of uncertainty among agents. Our findings show that in settings where all agents have the same level of uncertainty, communication can enhance the cooperation of the whole group. However, in cases of asymmetric uncertainty, the agents that do not face uncertainty learn to use communication to deceive and exploit their uncertain peers.

通信是多智能体系统中广泛使用的促进合作的机制。在紧急通信领域,智能体通常在特定的环境中训练:合作、竞争或混合动机。考虑到现实环境的特点是信息不完整,以及人类在各种激励下面临日常互动,我们的目标是探索在所有这些环境中同时利用紧急沟通的作用。在这项工作中,我们通过关注社会困境来追求这一研究方向。为此,我们开发了一个扩展版本的公共物品博弈,它允许我们在不同的激励(错误)对齐到不同程度的情况下同时训练独立的强化学习代理。此外,代理体会到他们的激励与他人的激励的一致性方面的不确定性。我们赋予agent学习沟通策略的能力,并研究agent之间面对不确定性时紧急沟通的影响。我们的研究结果表明,在所有代理人都具有相同程度的不确定性的情况下,沟通可以增强整个群体的合作。然而,在不对称不确定性的情况下,不面临不确定性的代理学习使用通信来欺骗和利用他们不确定的同伴。
{"title":"Learning in public goods games: the effects of uncertainty and communication on cooperation.","authors":"Nicole Orzan, Erman Acar, Davide Grossi, Roxana Rădulescu","doi":"10.1007/s00521-024-10530-6","DOIUrl":"10.1007/s00521-024-10530-6","url":null,"abstract":"<p><p>Communication is a widely used mechanism to promote cooperation in multi-agent systems. In the field of emergent communication, agents are typically trained in specific environments: cooperative, competitive or mixed-motive. Motivated by the idea that real-world settings are characterized by incomplete information and that humans face daily interactions under a wide spectrum of incentives, we aim to explore the role of emergent communication when simultaneously exploited across all these contexts. In this work, we pursue this line of research by focusing on social dilemmas. To do this, we developed an extended version of the Public Goods Game, which allows us to train independent reinforcement learning agents simultaneously in different scenarios where incentives are (mis)aligned to various extents. Additionally, agents experience uncertainty in terms of the alignment of their incentives with those of others. We equip agents with the ability to learn a communication policy and study the impact of emergent communication in the face of uncertainty among agents. Our findings show that in settings where all agents have the same level of uncertainty, communication can enhance the cooperation of the whole group. However, in cases of asymmetric uncertainty, the agents that do not face uncertainty learn to use communication to deceive and exploit their uncertain peers.</p>","PeriodicalId":49766,"journal":{"name":"Neural Computing & Applications","volume":"37 23","pages":"18899-18932"},"PeriodicalIF":4.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12313843/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144776757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stress monitoring using wearable sensors: IoT techniques in medical field. 使用可穿戴传感器进行压力监测:医疗领域的物联网技术。
IF 6 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-06-02 DOI: 10.1007/s00521-023-08681-z
Fatma M Talaat, Rana Mohamed El-Balka

The concept "Internet of Things" (IoT), which facilitates communication between linked devices, is relatively new. It refers to the next generation of the Internet. IoT supports healthcare and is essential to numerous applications for tracking medical services. By examining the pattern of observed parameters, the type of the disease can be anticipated. For people with a range of diseases, health professionals and technicians have developed an excellent system that employs commonly utilized techniques like wearable technology, wireless channels, and other remote equipment to give low-cost healthcare monitoring. Whether put in living areas or worn on the body, network-related sensors gather detailed data to evaluate the patient's physical and mental health. The main objective of this study is to examine the current e-health monitoring system using integrated systems. Automatically providing patients with a prescription based on their status is the main goal of the e-health monitoring system. The doctor can keep an eye on the patient's health without having to communicate with them. The purpose of the study is to examine how IoT technologies are applied in the medical industry and how they help to raise the bar of healthcare delivered by healthcare institutions. The study will also include the uses of IoT in the medical area, the degree to which it is used to enhance conventional practices in various health fields, and the degree to which IoT may raise the standard of healthcare services. The main contributions in this paper are as follows: (1) importing signals from wearable devices, extracting signals from non-signals, performing peak enhancement; (2) processing and analyzing the incoming signals; (3) proposing a new stress monitoring algorithm (SMA) using wearable sensors; (4) comparing between various ML algorithms; (5) the proposed stress monitoring algorithm (SMA) is composed of four main phases: (a) data acquisition phase, (b) data and signal processing phase, (c) prediction phase, and (d) model performance evaluation phase; and (6) grid search is used to find the optimal values for hyperparameters of SVM (C and gamma). From the findings, it is shown that random forest is best suited for this classification, with decision tree and XGBoost following closely behind.

“物联网”(IoT)的概念相对较新,它促进了连接设备之间的通信。它指的是下一代互联网。物联网支持医疗保健,对跟踪医疗服务的众多应用程序至关重要。通过检查观察到的参数模式,可以预测疾病的类型。对于患有一系列疾病的人,卫生专业人员和技术人员开发了一个出色的系统,该系统采用了可穿戴技术、无线信道和其他远程设备等常用技术,提供低成本的医疗监测。无论是放在生活区还是戴在身上,与网络相关的传感器都会收集详细的数据,以评估患者的身心健康。本研究的主要目的是使用集成系统来检查当前的电子健康监测系统。根据患者的状态自动为其提供处方是电子健康监测系统的主要目标。医生可以密切关注患者的健康状况,而无需与他们沟通。该研究的目的是研究物联网技术如何应用于医疗行业,以及它们如何帮助提高医疗机构提供的医疗保健标准。该研究还将包括物联网在医疗领域的用途,它在多大程度上被用于加强各个健康领域的传统做法,以及物联网可以在多大限度上提高医疗服务标准。本文的主要贡献如下:(1)从可穿戴设备中导入信号,从非信号中提取信号,进行峰值增强;(2) 处理和分析输入信号;(3) 提出了一种新的使用可穿戴传感器的应力监测算法(SMA);(4) 在各种ML算法之间进行比较;(5) 所提出的应力监测算法由四个主要阶段组成:(a)数据采集阶段、(b)数据和信号处理阶段、(c)预测阶段和(d)模型性能评估阶段;以及(6)网格搜索用于找到SVM(C和伽玛)的超参数的最优值。研究结果表明,随机森林最适合这种分类,决策树和XGBoost紧随其后。
{"title":"Stress monitoring using wearable sensors: IoT techniques in medical field.","authors":"Fatma M Talaat,&nbsp;Rana Mohamed El-Balka","doi":"10.1007/s00521-023-08681-z","DOIUrl":"10.1007/s00521-023-08681-z","url":null,"abstract":"<p><p>The concept \"Internet of Things\" (IoT), which facilitates communication between linked devices, is relatively new. It refers to the next generation of the Internet. IoT supports healthcare and is essential to numerous applications for tracking medical services. By examining the pattern of observed parameters, the type of the disease can be anticipated. For people with a range of diseases, health professionals and technicians have developed an excellent system that employs commonly utilized techniques like wearable technology, wireless channels, and other remote equipment to give low-cost healthcare monitoring. Whether put in living areas or worn on the body, network-related sensors gather detailed data to evaluate the patient's physical and mental health. The main objective of this study is to examine the current e-health monitoring system using integrated systems. Automatically providing patients with a prescription based on their status is the main goal of the e-health monitoring system. The doctor can keep an eye on the patient's health without having to communicate with them. The purpose of the study is to examine how IoT technologies are applied in the medical industry and how they help to raise the bar of healthcare delivered by healthcare institutions. The study will also include the uses of IoT in the medical area, the degree to which it is used to enhance conventional practices in various health fields, and the degree to which IoT may raise the standard of healthcare services. The main contributions in this paper are as follows: (1) importing signals from wearable devices, extracting signals from non-signals, performing peak enhancement; (2) processing and analyzing the incoming signals; (3) proposing a new stress monitoring algorithm (SMA) using wearable sensors; (4) comparing between various ML algorithms; (5) the proposed stress monitoring algorithm (SMA) is composed of four main phases: (a) data acquisition phase, (b) data and signal processing phase, (c) prediction phase, and (d) model performance evaluation phase; and (6) grid search is used to find the optimal values for hyperparameters of SVM (C and gamma). From the findings, it is shown that random forest is best suited for this classification, with decision tree and XGBoost following closely behind.</p>","PeriodicalId":49766,"journal":{"name":"Neural Computing & Applications","volume":" ","pages":"1-14"},"PeriodicalIF":6.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10237081/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9771493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Neural Computing & Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1