Machine learning and knowledge extraction最新文献_第5页

Comparing the Performance of Machine Learning Algorithms in the Automatic Classification of Psychotherapeutic Interactions in Avatar Therapy 比较机器学习算法在阿凡达治疗中心理治疗相互作用自动分类中的性能

Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine learning and knowledge extraction

Pub Date : 2023-08-24 DOI: 10.3390/make5030057

A. Hudon, Kingsada Phraxayavong, S. Potvin, A. Dumais

(1) Background: Avatar Therapy (AT) is currently being studied to help patients suffering from treatment-resistant schizophrenia. Facilitating annotations of immersive verbatims in AT by using classification algorithms could be an interesting avenue to reduce the time and cost of conducting such analysis and adding objective quantitative data in the classification of the different interactions taking place during the therapy. The aim of this study is to compare the performance of machine learning algorithms in the automatic annotation of immersive session verbatims of AT. (2) Methods: Five machine learning algorithms were implemented over a dataset as per the Scikit-Learn library: Support vector classifier, Linear support vector classifier, Multinomial Naïve Bayes, Decision Tree, and Multi-layer perceptron classifier. The dataset consisted of the 27 different types of interactions taking place in AT for the Avatar and the patient for 35 patients who underwent eight immersive sessions as part of their treatment in AT. (3) Results: The Linear SVC performed best over the dataset as compared with the other algorithms with the highest accuracy score, recall score, and F1-Score. The regular SVC performed best for precision. (4) Conclusions: This study presented an objective method for classifying textual interactions based on immersive session verbatims and gave a first comparison of multiple machine learning algorithms on AT.

(1)背景:目前正在研究阿凡达疗法(Avatar Therapy, AT)来帮助难治性精神分裂症患者。通过使用分类算法来促进AT中沉浸式单词的注释可能是一种有趣的途径，可以减少进行此类分析的时间和成本，并在治疗过程中发生的不同相互作用的分类中添加客观的定量数据。本研究的目的是比较机器学习算法在沉浸式会话逐字自动标注中的性能。(2)方法:基于Scikit-Learn库在数据集上实现五种机器学习算法:支持向量分类器、线性支持向量分类器、多项式Naïve贝叶斯、决策树和多层感知器分类器。该数据集包括27种不同类型的交互作用，这些交互作用发生在阿凡达和35名患者身上，这些患者经历了8次沉浸式会话，作为他们在AT治疗中的一部分。(3)结果:与其他算法相比，线性SVC算法在数据集上的表现最好，准确率、召回率和F1-Score得分最高。常规SVC在精度方面表现最好。(4)结论:本研究提出了一种基于沉浸式会话逐字的文本交互分类的客观方法，并首次对AT上的多种机器学习算法进行了比较。

{"title":"Comparing the Performance of Machine Learning Algorithms in the Automatic Classification of Psychotherapeutic Interactions in Avatar Therapy","authors":"A. Hudon, Kingsada Phraxayavong, S. Potvin, A. Dumais","doi":"10.3390/make5030057","DOIUrl":"https://doi.org/10.3390/make5030057","url":null,"abstract":"(1) Background: Avatar Therapy (AT) is currently being studied to help patients suffering from treatment-resistant schizophrenia. Facilitating annotations of immersive verbatims in AT by using classification algorithms could be an interesting avenue to reduce the time and cost of conducting such analysis and adding objective quantitative data in the classification of the different interactions taking place during the therapy. The aim of this study is to compare the performance of machine learning algorithms in the automatic annotation of immersive session verbatims of AT. (2) Methods: Five machine learning algorithms were implemented over a dataset as per the Scikit-Learn library: Support vector classifier, Linear support vector classifier, Multinomial Naïve Bayes, Decision Tree, and Multi-layer perceptron classifier. The dataset consisted of the 27 different types of interactions taking place in AT for the Avatar and the patient for 35 patients who underwent eight immersive sessions as part of their treatment in AT. (3) Results: The Linear SVC performed best over the dataset as compared with the other algorithms with the highest accuracy score, recall score, and F1-Score. The regular SVC performed best for precision. (4) Conclusions: This study presented an objective method for classifying textual interactions based on immersive session verbatims and gave a first comparison of multiple machine learning algorithms on AT.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"56 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41273478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Analyzing Quality Measurements for Dimensionality Reduction 分析降维的质量测量

Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine learning and knowledge extraction

Pub Date : 2023-08-21 DOI: 10.3390/make5030056

Michael C. Thrun, Julian Märte, Quirin Stier

Dimensionality reduction methods can be used to project high-dimensional data into low-dimensional space. If the output space is restricted to two dimensions, the result is a scatter plot whose goal is to present insightful visualizations of distance- and density-based structures. The topological invariance of dimension indicates that the two-dimensional similarities in the scatter plot cannot coercively represent high-dimensional distances. In praxis, projections of several datasets with distance- and density-based structures show a misleading interpretation of the underlying structures. The examples outline that the evaluation of projections remains essential. Here, 19 unsupervised quality measurements (QM) are grouped into semantic classes with the aid of graph theory. We use three representative benchmark datasets to show that QMs fail to evaluate the projections of straightforward structures when common methods such as Principal Component Analysis (PCA), Uniform Manifold Approximation projection, or t-distributed stochastic neighbor embedding (t-SNE) are applied. This work shows that unsupervised QMs are biased towards assumed underlying structures. Based on insights gained from graph theory, we propose a new quality measurement called the Gabriel Classification Error (GCE). This work demonstrates that GCE can make an unbiased evaluation of projections. The GCE is accessible within the R package DR quality available on CRAN.

降维方法可用于将高维数据投影到低维空间中。如果输出空间被限制为二维，结果是一个散点图，其目标是呈现基于距离和密度的结构的深刻可视化。维度的拓扑不变性表明散点图中的二维相似性不能强制表示高维距离。在实践中，具有基于距离和密度的结构的几个数据集的投影显示出对底层结构的误导性解释。这些例子表明，对预测的评估仍然至关重要。这里，借助图论将19个无监督质量测量（QM）分组为语义类。我们使用三个具有代表性的基准数据集来表明，当应用主成分分析（PCA）、均匀流形近似投影或t-分布随机邻居嵌入（t-SNE）等常用方法时，QM无法评估直接结构的投影。这项工作表明，无监督QM偏向于假定的底层结构。基于从图论中获得的见解，我们提出了一种新的质量测量方法，称为Gabriel分类误差（GCE）。这项工作表明，GCE可以对预测做出公正的评估。GCE可在CRAN上提供的R包DR质量内访问。

{"title":"Analyzing Quality Measurements for Dimensionality Reduction","authors":"Michael C. Thrun, Julian Märte, Quirin Stier","doi":"10.3390/make5030056","DOIUrl":"https://doi.org/10.3390/make5030056","url":null,"abstract":"Dimensionality reduction methods can be used to project high-dimensional data into low-dimensional space. If the output space is restricted to two dimensions, the result is a scatter plot whose goal is to present insightful visualizations of distance- and density-based structures. The topological invariance of dimension indicates that the two-dimensional similarities in the scatter plot cannot coercively represent high-dimensional distances. In praxis, projections of several datasets with distance- and density-based structures show a misleading interpretation of the underlying structures. The examples outline that the evaluation of projections remains essential. Here, 19 unsupervised quality measurements (QM) are grouped into semantic classes with the aid of graph theory. We use three representative benchmark datasets to show that QMs fail to evaluate the projections of straightforward structures when common methods such as Principal Component Analysis (PCA), Uniform Manifold Approximation projection, or t-distributed stochastic neighbor embedding (t-SNE) are applied. This work shows that unsupervised QMs are biased towards assumed underlying structures. Based on insights gained from graph theory, we propose a new quality measurement called the Gabriel Classification Error (GCE). This work demonstrates that GCE can make an unbiased evaluation of projections. The GCE is accessible within the R package DR quality available on CRAN.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46759622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Tabular Machine Learning Methods for Predicting Gas Turbine Emissions 预测燃气轮机排放的表格式机器学习方法

Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine learning and knowledge extraction

Pub Date : 2023-08-14 DOI: 10.3390/make5030055

Rebecca Potts, Rick Hackney, Georgios Leontidis

Predicting emissions for gas turbines is critical for monitoring harmful pollutants being released into the atmosphere. In this study, we evaluate the performance of machine learning models for predicting emissions for gas turbines. We compared an existing predictive emissions model, a first-principles-based Chemical Kinetics model, against two machine learning models we developed based on the Self-Attention and Intersample Attention Transformer (SAINT) and eXtreme Gradient Boosting (XGBoost), with the aim to demonstrate the improved predictive performance of nitrogen oxides (NOx) and carbon monoxide (CO) using machine learning techniques and determine whether XGBoost or a deep learning model performs the best on a specific real-life gas turbine dataset. Our analysis utilises a Siemens Energy gas turbine test bed tabular dataset to train and validate the machine learning models. Additionally, we explore the trade-off between incorporating more features to enhance the model complexity, and the resulting presence of increased missing values in the dataset.

预测燃气轮机的排放量对于监测排放到大气中的有害污染物至关重要。在本研究中，我们评估了用于预测燃气轮机排放的机器学习模型的性能。我们将现有的预测排放模型(基于第一性原理的化学动力学模型)与我们基于自注意和样本间注意转换器(SAINT)和极端梯度增强(XGBoost)开发的两种机器学习模型进行了比较。目的是展示使用机器学习技术改进的氮氧化物(NOx)和一氧化碳(CO)预测性能，并确定XGBoost或深度学习模型在特定的现实燃气轮机数据集上表现最佳。我们的分析利用西门子能源燃气轮机试验台表格数据集来训练和验证机器学习模型。此外，我们探讨了合并更多特征以提高模型复杂性与数据集中缺失值增加之间的权衡。

引用次数: 1

Defining a Digital Twin: A Data Science-Based Unification 定义数字孪生:基于数据科学的统一

Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine learning and knowledge extraction

Pub Date : 2023-08-12 DOI: 10.3390/make5030054

F. Emmert-Streib

The concept of a digital twin (DT) has gained significant attention in academia and industry because of its perceived potential to address critical global challenges, such as climate change, healthcare, and economic crises. Originally introduced in manufacturing, many attempts have been made to present proper definitions of this concept. Unfortunately, there remains a great deal of confusion surrounding the underlying concept, with many scientists still uncertain about the distinction between a simulation, a mathematical model and a DT. The aim of this paper is to propose a formal definition of a digital twin. To achieve this goal, we utilize a data science framework that facilitates a functional representation of a DT and other components that can be combined together to form a larger entity we refer to as a digital twin system (DTS). In our framework, a DT is an open dynamical system with an updating mechanism, also referred to as complex adaptive system (CAS). Its primary function is to generate data via simulations, ideally, indistinguishable from its physical counterpart. On the other hand, a DTS provides techniques for analyzing data and decision-making based on the generated data. Interestingly, we find that a DTS shares similarities to the principles of general systems theory. This multi-faceted view of a DTS explains its versatility in adapting to a wide range of problems in various application domains such as engineering, manufacturing, urban planning, and personalized medicine.

数字孪生（DT）的概念在学术界和工业界引起了极大的关注，因为它被认为有潜力应对气候变化、医疗保健和经济危机等关键的全球挑战。最初是在制造业中引入的，已经进行了许多尝试来给出这个概念的正确定义。不幸的是，围绕着基本概念仍然存在很多困惑，许多科学家仍然不确定模拟、数学模型和DT之间的区别。本文的目的是提出数字孪生的正式定义。为了实现这一目标，我们利用了一个数据科学框架，该框架有助于DT和其他组件的功能表示，这些组件可以组合在一起，形成一个更大的实体，我们称之为数字孪生系统（DTS）。在我们的框架中，DT是一个具有更新机制的开放动态系统，也称为复杂自适应系统（CAS）。它的主要功能是通过模拟生成数据，理想情况下，与物理数据无法区分。另一方面，DTS提供了用于分析数据和基于生成的数据进行决策的技术。有趣的是，我们发现DTS与一般系统理论的原理有相似之处。DTS的这种多方面观点解释了它在适应工程、制造、城市规划和个性化医疗等各种应用领域的广泛问题方面的多功能性。

{"title":"Defining a Digital Twin: A Data Science-Based Unification","authors":"F. Emmert-Streib","doi":"10.3390/make5030054","DOIUrl":"https://doi.org/10.3390/make5030054","url":null,"abstract":"The concept of a digital twin (DT) has gained significant attention in academia and industry because of its perceived potential to address critical global challenges, such as climate change, healthcare, and economic crises. Originally introduced in manufacturing, many attempts have been made to present proper definitions of this concept. Unfortunately, there remains a great deal of confusion surrounding the underlying concept, with many scientists still uncertain about the distinction between a simulation, a mathematical model and a DT. The aim of this paper is to propose a formal definition of a digital twin. To achieve this goal, we utilize a data science framework that facilitates a functional representation of a DT and other components that can be combined together to form a larger entity we refer to as a digital twin system (DTS). In our framework, a DT is an open dynamical system with an updating mechanism, also referred to as complex adaptive system (CAS). Its primary function is to generate data via simulations, ideally, indistinguishable from its physical counterpart. On the other hand, a DTS provides techniques for analyzing data and decision-making based on the generated data. Interestingly, we find that a DTS shares similarities to the principles of general systems theory. This multi-faceted view of a DTS explains its versatility in adapting to a wide range of problems in various application domains such as engineering, manufacturing, urban planning, and personalized medicine.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42597676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Artificial Intelligence Ethics and Challenges in Healthcare Applications: A Comprehensive Review in the Context of the European GDPR Mandate 医疗保健应用中的人工智能伦理和挑战:欧洲GDPR授权背景下的全面审查

Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine learning and knowledge extraction

Pub Date : 2023-08-07 DOI: 10.3390/make5030053

Mohammad MohammadAmini, Marcia Jesus, Davood Fanaei Sheikholeslami, Paulo Alves, Aliakbar Hassanzadeh Benam, Fatemeh Hariri

This study examines the ethical issues surrounding the use of Artificial Intelligence (AI) in healthcare, specifically nursing, under the European General Data Protection Regulation (GDPR). The analysis delves into how GDPR applies to healthcare AI projects, encompassing data collection and decision-making stages, to reveal the ethical implications at each step. A comprehensive review of the literature categorizes research investigations into three main categories: Ethical Considerations in AI; Practical Challenges and Solutions in AI Integration; and Legal and Policy Implications in AI. The analysis uncovers a significant research deficit in this field, with a particular focus on data owner rights and AI ethics within GDPR compliance. To address this gap, the study proposes new case studies that emphasize the importance of comprehending data owner rights and establishing ethical norms for AI use in medical applications, especially in nursing. This review makes a valuable contribution to the AI ethics debate and assists nursing and healthcare professionals in developing ethical AI practices. The insights provided help stakeholders navigate the intricate terrain of data protection, ethical considerations, and regulatory compliance in AI-driven healthcare. Lastly, the study introduces a case study of a real AI health-tech project named SENSOMATT, spotlighting GDPR and privacy issues.

本研究根据欧洲通用数据保护条例(GDPR)探讨了在医疗保健，特别是护理中使用人工智能(AI)的伦理问题。该分析深入研究了GDPR如何应用于医疗保健人工智能项目，包括数据收集和决策阶段，以揭示每个步骤的道德影响。对文献的全面回顾将研究调查分为三大类:人工智能中的伦理考虑;人工智能集成的现实挑战与解决方案以及人工智能的法律和政策影响。该分析揭示了该领域的一个重大研究缺陷，特别关注数据所有者权利和GDPR合规性中的人工智能伦理。为了弥补这一差距，该研究提出了新的案例研究，强调了理解数据所有者权利和为人工智能在医疗应用(特别是护理领域)中的使用建立道德规范的重要性。这篇综述对人工智能伦理辩论做出了有价值的贡献，并帮助护理和医疗保健专业人员发展人工智能伦理实践。所提供的见解有助于利益相关者在人工智能驱动的医疗保健中导航复杂的数据保护、道德考虑和法规遵从性。最后，该研究介绍了一个名为SENSOMATT的真实人工智能健康技术项目的案例研究，突出了GDPR和隐私问题。

{"title":"Artificial Intelligence Ethics and Challenges in Healthcare Applications: A Comprehensive Review in the Context of the European GDPR Mandate","authors":"Mohammad MohammadAmini, Marcia Jesus, Davood Fanaei Sheikholeslami, Paulo Alves, Aliakbar Hassanzadeh Benam, Fatemeh Hariri","doi":"10.3390/make5030053","DOIUrl":"https://doi.org/10.3390/make5030053","url":null,"abstract":"This study examines the ethical issues surrounding the use of Artificial Intelligence (AI) in healthcare, specifically nursing, under the European General Data Protection Regulation (GDPR). The analysis delves into how GDPR applies to healthcare AI projects, encompassing data collection and decision-making stages, to reveal the ethical implications at each step. A comprehensive review of the literature categorizes research investigations into three main categories: Ethical Considerations in AI; Practical Challenges and Solutions in AI Integration; and Legal and Policy Implications in AI. The analysis uncovers a significant research deficit in this field, with a particular focus on data owner rights and AI ethics within GDPR compliance. To address this gap, the study proposes new case studies that emphasize the importance of comprehending data owner rights and establishing ethical norms for AI use in medical applications, especially in nursing. This review makes a valuable contribution to the AI ethics debate and assists nursing and healthcare professionals in developing ethical AI practices. The insights provided help stakeholders navigate the intricate terrain of data protection, ethical considerations, and regulatory compliance in AI-driven healthcare. Lastly, the study introduces a case study of a real AI health-tech project named SENSOMATT, spotlighting GDPR and privacy issues.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"28 1","pages":"1023-1035"},"PeriodicalIF":0.0,"publicationDate":"2023-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80681861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Improving Spiking Neural Network Performance with Auxiliary Learning 用辅助学习改进脉冲神经网络性能

Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine learning and knowledge extraction

Pub Date : 2023-08-05 DOI: 10.3390/make5030052

P. G. Cachi, S. Ventura, Krzysztof J. Cios

The use of back propagation through the time learning rule enabled the supervised training of deep spiking neural networks to process temporal neuromorphic data. However, their performance is still below non-spiking neural networks. Previous work pointed out that one of the main causes is the limited number of neuromorphic data currently available, which are also difficult to generate. With the goal of overcoming this problem, we explore the usage of auxiliary learning as a means of helping spiking neural networks to identify more general features. Tests are performed on neuromorphic DVS-CIFAR10 and DVS128-Gesture datasets. The results indicate that training with auxiliary learning tasks improves their accuracy, albeit slightly. Different scenarios, including manual and automatic combination losses using implicit differentiation, are explored to analyze the usage of auxiliary tasks.

通过时间学习规则的反向传播，使深度尖峰神经网络的监督训练能够处理时间神经形态数据。然而，它们的性能仍然低于非尖峰神经网络。先前的研究指出，其中一个主要原因是目前可用的神经形态数据数量有限，这些数据也难以生成。为了克服这个问题，我们探索使用辅助学习作为帮助尖峰神经网络识别更一般特征的手段。在神经形态的DVS-CIFAR10和DVS128-Gesture数据集上进行测试。结果表明，辅助学习任务的训练提高了他们的准确性，尽管幅度很小。探讨了不同的场景，包括使用隐式微分的手动和自动组合损失，以分析辅助任务的使用情况。

引用次数: 0

Identifying the Regions of a Space with the Self-Parameterized Recursively Assessed Decomposition Algorithm (SPRADA) 基于自参数化递归评估分解算法(SPRADA)的空间区域识别

Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine learning and knowledge extraction

Pub Date : 2023-08-04 DOI: 10.3390/make5030051

Dylan Molinié, K. Madani, V. Amarger, A. Chebira

This paper introduces a non-parametric methodology based on classical unsupervised clustering techniques to automatically identify the main regions of a space, without requiring the objective number of clusters, so as to identify the major regular states of unknown industrial systems. Indeed, useful knowledge on real industrial processes entails the identification of their regular states, and their historically encountered anomalies. Since both should form compact and salient groups of data, unsupervised clustering generally performs this task fairly accurately; however, this often requires the number of clusters upstream, knowledge which is rarely available. As such, the proposed algorithm operates a first partitioning of the space, then it estimates the integrity of the clusters, and splits them again and again until every cluster obtains an acceptable integrity; finally, a step of merging based on the clusters’ empirical distributions is performed to refine the partitioning. Applied to real industrial data obtained in the scope of a European project, this methodology proved able to automatically identify the main regular states of the system. Results show the robustness of the proposed approach in the fully-automatic and non-parametric identification of the main regions of a space, knowledge which is useful to industrial anomaly detection and behavioral modeling.

本文介绍了一种基于经典无监督聚类技术的非参数方法，在不需要客观聚类数量的情况下，自动识别空间的主要区域，从而识别未知工业系统的主要规则状态。事实上，关于真实工业过程的有用知识需要识别它们的规则状态，以及它们在历史上遇到的异常。由于两者都应该形成紧凑和显著的数据组，因此无监督聚类通常相当准确地执行此任务;然而，这通常需要上游集群的数量，而这些知识很少可用。因此，该算法首先对空间进行划分，然后对聚类的完整性进行估计，并进行多次分割，直到每个聚类获得可接受的完整性;最后，根据聚类的经验分布进行合并，以细化划分。应用于在欧洲项目范围内获得的实际工业数据，该方法证明能够自动识别系统的主要规则状态。结果表明，该方法在空间主要区域的全自动非参数识别方面具有鲁棒性，可用于工业异常检测和行为建模。

{"title":"Identifying the Regions of a Space with the Self-Parameterized Recursively Assessed Decomposition Algorithm (SPRADA)","authors":"Dylan Molinié, K. Madani, V. Amarger, A. Chebira","doi":"10.3390/make5030051","DOIUrl":"https://doi.org/10.3390/make5030051","url":null,"abstract":"This paper introduces a non-parametric methodology based on classical unsupervised clustering techniques to automatically identify the main regions of a space, without requiring the objective number of clusters, so as to identify the major regular states of unknown industrial systems. Indeed, useful knowledge on real industrial processes entails the identification of their regular states, and their historically encountered anomalies. Since both should form compact and salient groups of data, unsupervised clustering generally performs this task fairly accurately; however, this often requires the number of clusters upstream, knowledge which is rarely available. As such, the proposed algorithm operates a first partitioning of the space, then it estimates the integrity of the clusters, and splits them again and again until every cluster obtains an acceptable integrity; finally, a step of merging based on the clusters’ empirical distributions is performed to refine the partitioning. Applied to real industrial data obtained in the scope of a European project, this methodology proved able to automatically identify the main regular states of the system. Results show the robustness of the proposed approach in the fully-automatic and non-parametric identification of the main regions of a space, knowledge which is useful to industrial anomaly detection and behavioral modeling.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"64 1","pages":"979-1009"},"PeriodicalIF":0.0,"publicationDate":"2023-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84020672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Alternative Formulations of Decision Rule Learning from Neural Networks 基于神经网络的决策规则学习的备选公式

Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine learning and knowledge extraction

Pub Date : 2023-08-03 DOI: 10.3390/make5030049

Litao Qiao, Weijia Wang, Bill Lin

This paper extends recent work on decision rule learning from neural networks for tabular data classification. We propose alternative formulations to trainable Boolean logic operators as neurons with continuous weights, including trainable NAND neurons. These alternative formulations provide uniform treatments to different trainable logic neurons so that they can be uniformly trained, which enables, for example, the direct application of existing sparsity-promoting neural net training techniques like reweighted L1 regularization to derive sparse networks that translate to simpler rules. In addition, we present an alternative network architecture based on trainable NAND neurons by applying De Morgan’s law to realize a NAND-NAND network instead of an AND-OR network, both of which can be readily mapped to decision rule sets. Our experimental results show that these alternative formulations can also generate accurate decision rule sets that achieve state-of-the-art performance in terms of accuracy in tabular learning applications.

本文扩展了基于神经网络的表数据分类决策规则学习的最新研究成果。我们提出了可训练布尔逻辑算子作为连续权重神经元的替代公式，包括可训练的NAND神经元。这些可选的公式为不同的可训练逻辑神经元提供了统一的处理方法，以便它们可以被统一训练，例如，可以直接应用现有的促进稀疏性的神经网络训练技术，如重新加权L1正则化，以派生出可转换为更简单规则的稀疏网络。此外，我们提出了一种基于可训练NAND神经元的替代网络架构，通过应用De Morgan定律来实现NAND-NAND网络，而不是AND-OR网络，两者都可以很容易地映射到决策规则集。我们的实验结果表明，这些替代公式也可以生成准确的决策规则集，在表格学习应用程序的准确性方面达到最先进的性能。

引用次数: 0

Behavior-Aware Pedestrian Trajectory Prediction in Ego-Centric Camera Views with Spatio-Temporal Ego-Motion Estimation 基于时空自我运动估计的以自我为中心的摄像机视图中行为感知的行人轨迹预测

Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine learning and knowledge extraction

Pub Date : 2023-08-03 DOI: 10.3390/make5030050

Phillip Czech, Markus Braun, U. Kressel, Bin Yang

With the ongoing development of automated driving systems, the crucial task of predicting pedestrian behavior is attracting growing attention. The prediction of future pedestrian trajectories from the ego-vehicle camera perspective is particularly challenging due to the dynamically changing scene. Therefore, we present Behavior-Aware Pedestrian Trajectory Prediction (BA-PTP), a novel approach to pedestrian trajectory prediction for ego-centric camera views. It incorporates behavioral features extracted from real-world traffic scene observations such as the body and head orientation of pedestrians, as well as their pose, in addition to positional information from body and head bounding boxes. For each input modality, we employed independent encoding streams that are combined through a modality attention mechanism. To account for the ego-motion of the camera in an ego-centric view, we introduced Spatio-Temporal Ego-Motion Module (STEMM), a novel approach to ego-motion prediction. Compared to the related works, it utilizes spatial goal points of the ego-vehicle that are sampled from its intended route. We experimentally validated the effectiveness of our approach using two datasets for pedestrian behavior prediction in urban traffic scenes. Based on ablation studies, we show the advantages of incorporating different behavioral features for pedestrian trajectory prediction in the image plane. Moreover, we demonstrate the benefit of integrating STEMM into our pedestrian trajectory prediction method, BA-PTP. BA-PTP achieves state-of-the-art performance on the PIE dataset, outperforming prior work by 7% in MSE-1.5 s and CMSE as well as 9% in CFMSE.

随着自动驾驶系统的不断发展，预测行人行为的关键任务越来越受到人们的关注。由于动态变化的场景，从自我车辆摄像机的角度预测未来行人的轨迹尤其具有挑战性。因此，我们提出了行为感知行人轨迹预测(BA-PTP)，这是一种以自我为中心的摄像机视图下行人轨迹预测的新方法。它结合了从现实交通场景观察中提取的行为特征，如行人的身体和头部方向，以及他们的姿势，以及来自身体和头部边界框的位置信息。对于每种输入模态，我们采用了独立的编码流，这些编码流通过模态注意机制组合在一起。为了解释相机在自我中心视角下的自我运动，我们引入了一种新的自我运动预测方法——时空自我运动模块(STEMM)。与相关工作相比，它利用了自驾车从预定路线中采样的空间目标点。我们用两个数据集实验验证了我们的方法在城市交通场景中行人行为预测的有效性。在消融研究的基础上，我们展示了在图像平面上结合不同行为特征进行行人轨迹预测的优势。此外，我们还展示了将STEMM集成到我们的行人轨迹预测方法BA-PTP中的好处。BA-PTP在PIE数据集上实现了最先进的性能，在MSE-1.5 s和CMSE中优于先前的工作7%，在CFMSE中优于9%。

{"title":"Behavior-Aware Pedestrian Trajectory Prediction in Ego-Centric Camera Views with Spatio-Temporal Ego-Motion Estimation","authors":"Phillip Czech, Markus Braun, U. Kressel, Bin Yang","doi":"10.3390/make5030050","DOIUrl":"https://doi.org/10.3390/make5030050","url":null,"abstract":"With the ongoing development of automated driving systems, the crucial task of predicting pedestrian behavior is attracting growing attention. The prediction of future pedestrian trajectories from the ego-vehicle camera perspective is particularly challenging due to the dynamically changing scene. Therefore, we present Behavior-Aware Pedestrian Trajectory Prediction (BA-PTP), a novel approach to pedestrian trajectory prediction for ego-centric camera views. It incorporates behavioral features extracted from real-world traffic scene observations such as the body and head orientation of pedestrians, as well as their pose, in addition to positional information from body and head bounding boxes. For each input modality, we employed independent encoding streams that are combined through a modality attention mechanism. To account for the ego-motion of the camera in an ego-centric view, we introduced Spatio-Temporal Ego-Motion Module (STEMM), a novel approach to ego-motion prediction. Compared to the related works, it utilizes spatial goal points of the ego-vehicle that are sampled from its intended route. We experimentally validated the effectiveness of our approach using two datasets for pedestrian behavior prediction in urban traffic scenes. Based on ablation studies, we show the advantages of incorporating different behavioral features for pedestrian trajectory prediction in the image plane. Moreover, we demonstrate the benefit of integrating STEMM into our pedestrian trajectory prediction method, BA-PTP. BA-PTP achieves state-of-the-art performance on the PIE dataset, outperforming prior work by 7% in MSE-1.5 s and CMSE as well as 9% in CFMSE.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"15 12 1","pages":"957-978"},"PeriodicalIF":0.0,"publicationDate":"2023-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86988912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Achievable Minimally-Contrastive Counterfactual Explanations 可实现的最小对比反事实解释

Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine learning and knowledge extraction

Pub Date : 2023-08-03 DOI: 10.3390/make5030048

H. Barzekar, S. McRoy

Decision support systems based on machine learning models should be able to help users identify opportunities and threats. Popular model-agnostic explanation models can identify factors that support various predictions, answering questions such as “What factors affect sales?” or “Why did sales decline?”, but do not highlight what a person should or could do to get a more desirable outcome. Counterfactual explanation approaches address intervention, and some even consider feasibility, but none consider their suitability for real-time applications, such as question answering. Here, we address this gap by introducing a novel model-agnostic method that provides specific, feasible changes that would impact the outcomes of a complex Black Box AI model for a given instance and assess its real-world utility by measuring its real-time performance and ability to find achievable changes. The method uses the instance of concern to generate high-precision explanations and then applies a secondary method to find achievable minimally-contrastive counterfactual explanations (AMCC) while limiting the search to modifications that satisfy domain-specific constraints. Using a widely recognized dataset, we evaluated the classification task to ascertain the frequency and time required to identify successful counterfactuals. For a 90% accurate classifier, our algorithm identified AMCC explanations in 47% of cases (38 of 81), with an average discovery time of 80 ms. These findings verify the algorithm’s efficiency in swiftly producing AMCC explanations, suitable for real-time systems. The AMCC method enhances the transparency of Black Box AI models, aiding individuals in evaluating remedial strategies or assessing potential outcomes.

基于机器学习模型的决策支持系统应该能够帮助用户识别机会和威胁。流行的模型不可知论解释模型可以识别支持各种预测的因素，回答诸如“什么因素影响销售?”或者“为什么销售额下降了?”，但不要强调一个人应该或可以做什么来获得更理想的结果。反事实解释方法解决干预问题，有些甚至考虑可行性，但没有人考虑它们是否适合实时应用，例如问答。在这里，我们通过引入一种新颖的模型不可知方法来解决这一差距，该方法提供了具体的、可行的变化，这些变化将影响给定实例中复杂黑盒AI模型的结果，并通过测量其实时性能和发现可实现变化的能力来评估其在现实世界中的效用。该方法使用关注的实例来生成高精度的解释，然后应用第二种方法来找到可实现的最小对比反事实解释(AMCC)，同时将搜索限制在满足特定领域约束的修改上。使用广泛认可的数据集，我们评估了分类任务，以确定识别成功的反事实所需的频率和时间。对于准确率为90%的分类器，我们的算法在47%的情况下(81例中的38例)识别出AMCC解释，平均发现时间为80 ms。这些发现验证了该算法在快速生成AMCC解释方面的效率，适用于实时系统。AMCC方法提高了黑匣子人工智能模型的透明度，帮助个人评估补救策略或评估潜在结果。

{"title":"Achievable Minimally-Contrastive Counterfactual Explanations","authors":"H. Barzekar, S. McRoy","doi":"10.3390/make5030048","DOIUrl":"https://doi.org/10.3390/make5030048","url":null,"abstract":"Decision support systems based on machine learning models should be able to help users identify opportunities and threats. Popular model-agnostic explanation models can identify factors that support various predictions, answering questions such as “What factors affect sales?” or “Why did sales decline?”, but do not highlight what a person should or could do to get a more desirable outcome. Counterfactual explanation approaches address intervention, and some even consider feasibility, but none consider their suitability for real-time applications, such as question answering. Here, we address this gap by introducing a novel model-agnostic method that provides specific, feasible changes that would impact the outcomes of a complex Black Box AI model for a given instance and assess its real-world utility by measuring its real-time performance and ability to find achievable changes. The method uses the instance of concern to generate high-precision explanations and then applies a secondary method to find achievable minimally-contrastive counterfactual explanations (AMCC) while limiting the search to modifications that satisfy domain-specific constraints. Using a widely recognized dataset, we evaluated the classification task to ascertain the frequency and time required to identify successful counterfactuals. For a 90% accurate classifier, our algorithm identified AMCC explanations in 47% of cases (38 of 81), with an average discovery time of 80 ms. These findings verify the algorithm’s efficiency in swiftly producing AMCC explanations, suitable for real-time systems. The AMCC method enhances the transparency of Black Box AI models, aiding individuals in evaluating remedial strategies or assessing potential outcomes.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"32 1","pages":"922-936"},"PeriodicalIF":0.0,"publicationDate":"2023-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77499375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0