Alzheimer’s and related diseases are significant health issues of this era. The interdisciplinary use of deep learning in this field has shown great promise and garnered considerable interest. This paper surveys the deep learning literature related to Alzheimer’s disease, mild cognitive impairment, and related diseases from 2010 to early 2023. We identify the major types of unsupervised, supervised, and semi-supervised methods developed for various tasks in this field, including the most recent developments, such as the application of recurrent neural networks, graph neural networks, and generative models. We also provide a summary of data sources, data processing, training protocols, and evaluation methods as a guide for future deep learning research into Alzheimer’s disease. Although deep learning has shown promising performance across various studies and tasks, it is limited by interpretation and generalization challenges. The survey also provides a brief insight into these challenges and possible pathways for future studies.
{"title":"A Survey of Deep Learning for Alzheimer's Disease","authors":"Qinghua Zhou, Jiaji Wang, Xiang Yu, Shuihua Wang, Yudong Zhang","doi":"10.3390/make5020035","DOIUrl":"https://doi.org/10.3390/make5020035","url":null,"abstract":"Alzheimer’s and related diseases are significant health issues of this era. The interdisciplinary use of deep learning in this field has shown great promise and gathered considerable interest. This paper surveys deep learning literature related to Alzheimer’s disease, mild cognitive impairment, and related diseases from 2010 to early 2023. We identify the major types of unsupervised, supervised, and semi-supervised methods developed for various tasks in this field, including the most recent developments, such as the application of recurrent neural networks, graph-neural networks, and generative models. We also provide a summary of data sources, data processing, training protocols, and evaluation methods as a guide for future deep learning research into Alzheimer’s disease. Although deep learning has shown promising performance across various studies and tasks, it is limited by interpretation and generalization challenges. The survey also provides a brief insight into these challenges and the possible pathways for future studies.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"21 1","pages":"611-668"},"PeriodicalIF":0.0,"publicationDate":"2023-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87281088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents a conceptual mathematical framework for developing rich human–machine interactions in order to improve decision-making in a social organisation, S. The idea is to model how S can create a “multi-level artificial cognitive system”, called a data analyser (DA), to collaborate with humans in collecting and learning how to analyse data, to anticipate situations, and to develop new responses, thus improving decision-making. In this model, the DA is “processed” not only to gather data and extend existing knowledge, but also to learn how to act autonomously with its own specific procedures or even to create new ones. An application is given in cases where such rich human–machine interactions are expected to allow the DA+S partnership to acquire deep anticipation capabilities for possible future changes, e.g., to prevent risks or seize opportunities. The way the social organisation S operates over time, including the construction of the DA, is described using the conceptual framework of “memory evolutive systems” (MES), a mathematical approach introduced by Ehresmann and Vanbremeersch for evolutionary multi-scale, multi-agent, and multi-temporality systems. This leads to the definition of a “data analyser–MES”.
{"title":"A Mathematical Framework for Enriching Human-Machine Interactions","authors":"A. Ehresmann, Mathias Béjean, J. Vanbremeersch","doi":"10.3390/make5020034","DOIUrl":"https://doi.org/10.3390/make5020034","url":null,"abstract":"This paper presents a conceptual mathematical framework for developing rich human–machine interactions in order to improve decision-making in a social organisation, S. The idea is to model how S can create a “multi-level artificial cognitive system”, called a data analyser (DA), to collaborate with humans in collecting and learning how to analyse data, to anticipate situations, and to develop new responses, thus improving decision-making. In this model, the DA is “processed” to not only gather data and extend existing knowledge, but also to learn how to act autonomously with its own specific procedures or even to create new ones. An application is given in cases where such rich human–machine interactions are expected to allow the DA+S partnership to acquire deep anticipation capabilities for possible future changes, e.g., to prevent risks or seize opportunities. The way the social organization S operates over time, including the construction of DA, is described using the conceptual framework comprising “memory evolutive systems” (MES), a mathematical theoretical approach introduced by Ehresmann and Vanbremeersch for evolutionary multi-scale, multi-agent and multi-temporality systems. This leads to the definition of a “data analyser–MES”.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"613 1","pages":"597-610"},"PeriodicalIF":0.0,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86676769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Course recommender systems play an increasingly pivotal role in the educational landscape, driving personalization and informed decision-making for students. However, these systems face significant challenges, including managing a large and dynamic decision space and addressing the cold start problem for new students. This article endeavors to provide a comprehensive review and background to fully understand recent research on course recommender systems and their impact on learning. We present a detailed summary of empirical data supporting the use of these systems in educational strategic planning. We examined case studies conducted over the previous six years (2017–2022), with a focus on 35 key studies selected from 1938 academic papers found using the CADIMA tool. This systematic literature review (SLR) assesses various recommender system methodologies used to suggest course selection tracks, aiming to determine the most effective evidence-based approach.
{"title":"Systematic Review of Recommendation Systems for Course Selection","authors":"Shrooq Algarni, Frederick T. Sheldon","doi":"10.3390/make5020033","DOIUrl":"https://doi.org/10.3390/make5020033","url":null,"abstract":"Course recommender systems play an increasingly pivotal role in the educational landscape, driving personalization and informed decision-making for students. However, these systems face significant challenges, including managing a large and dynamic decision space and addressing the cold start problem for new students. This article endeavors to provide a comprehensive review and background to fully understand recent research on course recommender systems and their impact on learning. We present a detailed summary of empirical data supporting the use of these systems in educational strategic planning. We examined case studies conducted over the previous six years (2017–2022), with a focus on 35 key studies selected from 1938 academic papers found using the CADIMA tool. This systematic literature review (SLR) assesses various recommender system methodologies used to suggest course selection tracks, aiming to determine the most effective evidence-based approach.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"1 1","pages":"560-596"},"PeriodicalIF":0.0,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83201618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Saliency methods are designed to provide explainability for deep image processing models by assigning feature-wise importance scores and thus detecting informative regions in the input images. Recently, these methods have been widely adapted to the time series domain, aiming to identify important temporal regions in a time series. This paper extends our previous work on identifying the systematic failure of such methods in the time series domain to produce relevant results when informative patterns are based on underlying latent information rather than temporal regions. First, we both visually and quantitatively assess the quality of explanations provided by multiple state-of-the-art saliency methods, including Integrated Gradients, DeepLIFT, Kernel SHAP, and LIME, using univariate simulated time series data with temporal or latent patterns. In addition, to emphasize the severity of the latent feature saliency detection problem, we also run experiments on a real-world predictive maintenance dataset with known latent patterns. We identify Integrated Gradients, DeepLIFT, and the input-cell attention mechanism as potential candidates for refinement to yield latent saliency scores. Finally, we provide recommendations on using saliency methods for time series classification and suggest a guideline for developing latent saliency methods for time series.
{"title":"What about the Latent Space? The Need for Latent Feature Saliency Detection in Deep Time Series Classification","authors":"Maresa Schröder, Alireza Zamanian, N. Ahmidi","doi":"10.3390/make5020032","DOIUrl":"https://doi.org/10.3390/make5020032","url":null,"abstract":"Saliency methods are designed to provide explainability for deep image processing models by assigning feature-wise importance scores and thus detecting informative regions in the input images. Recently, these methods have been widely adapted to the time series domain, aiming to identify important temporal regions in a time series. This paper extends our former work on identifying the systematic failure of such methods in the time series domain to produce relevant results when informative patterns are based on underlying latent information rather than temporal regions. First, we both visually and quantitatively assess the quality of explanations provided by multiple state-of-the-art saliency methods, including Integrated Gradients, Deep-Lift, Kernel SHAP, and Lime using univariate simulated time series data with temporal or latent patterns. In addition, to emphasize the severity of the latent feature saliency detection problem, we also run experiments on a real-world predictive maintenance dataset with known latent patterns. We identify Integrated Gradients, Deep-Lift, and the input-cell attention mechanism as potential candidates for refinement to yield latent saliency scores. Finally, we provide recommendations on using saliency methods for time series classification and suggest a guideline for developing latent saliency methods for time series.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"23 1","pages":"539-559"},"PeriodicalIF":0.0,"publicationDate":"2023-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90893026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Alzheimer’s disease (AD) is an age-related disease that progresses through different stages and directly affects different regions of the brain. Research into the detection of AD and its stages has seen new advances in both single-modality and multimodality approaches. However, robust techniques for the detection of AD and its stages still require further research. In this study, a multimodal image-fusion method is first proposed for the fusion of two different modalities, i.e., PET (positron emission tomography) and MRI (magnetic resonance imaging). The features obtained from fused and non-fused biomarkers are then passed to an ensemble classifier with a Random Forest-based feature selection strategy. Three classes are used in this work, namely AD, MCI (mild cognitive impairment), and CN (cognitively normal). In the resulting analysis, the binary classifications AD vs. CN and MCI vs. CN both attained an accuracy (Acc) of 99%, while AD vs. MCI detection achieved an accuracy of 91%. Furthermore, the multi-class classification AD vs. MCI vs. CN achieved an accuracy of 96%.
{"title":"Alzheimer's Disease Detection from Fused PET and MRI Modalities Using an Ensemble Classifier","authors":"A. Shukla, Rajeev Tiwari, Shamik Tiwari","doi":"10.3390/make5020031","DOIUrl":"https://doi.org/10.3390/make5020031","url":null,"abstract":"Alzheimer’s disease (AD) is an old-age disease that comes in different stages and directly affects the different regions of the brain. The research into the detection of AD and its stages has new advancements in terms of single-modality and multimodality approaches. However, sustainable techniques for the detection of AD and its stages still require a greater extent of research. In this study, a multimodal image-fusion method is initially proposed for the fusion of two different modalities, i.e., PET (Positron Emission Tomography) and MRI (Magnetic Resonance Imaging). Further, the features obtained from fused and non-fused biomarkers are passed to the ensemble classifier with a Random Forest-based feature selection strategy. Three classes of Alzheimer’s disease are used in this work, namely AD, MCI (Mild Cognitive Impairment) and CN (Cognitive Normal). In the resulting analysis, the Binary classifications, i.e., AD vs. CN and MCI vs. CN, attained an accuracy (Acc) of 99% in both cases. The class AD vs. MCI detection achieved an adequate accuracy (Acc) of 91%. Furthermore, the Multi Class classification, i.e., AD vs. MCI vs. CN, achieved 96% (Acc).","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"8 1","pages":"512-538"},"PeriodicalIF":0.0,"publicationDate":"2023-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80634773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The scores of distance-based outlier detection methods are difficult to interpret, and it is challenging to determine a suitable cut-off threshold between normal and outlier data points without additional context. We describe a generic transformation of distance-based outlier scores into interpretable, probabilistic estimates. The transformation is ranking-stable and increases the contrast between normal and outlier data points. Determining distance relationships between data points is necessary to identify the nearest-neighbor relationships in the data, yet most of the computed distances are typically discarded. We show that the distances to other data points can be used to model distance probability distributions and, subsequently, use the distributions to turn distance-based outlier scores into outlier probabilities. Over a variety of tabular and image benchmark datasets, we show that the probabilistic transformation does not impact outlier ranking (ROC AUC) or detection performance (AP, F1), and increases the contrast between normal and outlier score distributions (statistical distance). The experimental findings indicate that it is possible to transform distance-based outlier scores into interpretable probabilities with increased contrast between normal and outlier samples. Our work generalizes to a wide range of distance-based outlier detection methods, and, because existing distance computations are used, it adds no significant computational overhead.
{"title":"A Probabilistic Transformation of Distance-Based Outliers","authors":"David Muhr, M. Affenzeller, Josef Küng","doi":"10.3390/make5030042","DOIUrl":"https://doi.org/10.3390/make5030042","url":null,"abstract":"The scores of distance-based outlier detection methods are difficult to interpret, and it is challenging to determine a suitable cut-off threshold between normal and outlier data points without additional context. We describe a generic transformation of distance-based outlier scores into interpretable, probabilistic estimates. The transformation is ranking-stable and increases the contrast between normal and outlier data points. Determining distance relationships between data points is necessary to identify the nearest-neighbor relationships in the data, yet most of the computed distances are typically discarded. We show that the distances to other data points can be used to model distance probability distributions and, subsequently, use the distributions to turn distance-based outlier scores into outlier probabilities. Over a variety of tabular and image benchmark datasets, we show that the probabilistic transformation does not impact outlier ranking (ROC AUC) or detection performance (AP, F1), and increases the contrast between normal and outlier score distributions (statistical distance). The experimental findings indicate that it is possible to transform distance-based outlier scores into interpretable probabilities with increased contrast between normal and outlier samples. Our work generalizes to a wide range of distance-based outlier detection methods, and, because existing distance computations are used, it adds no significant computational overhead.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"569 1","pages":"782-802"},"PeriodicalIF":0.0,"publicationDate":"2023-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83496234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Recent technological advancements have fostered human–robot coexistence in work and residential environments. The assistive robot must exhibit humane behavior and consistent care to become an integral part of the human habitat. Furthermore, the robot requires an adaptive unsupervised learning model to explore unfamiliar conditions and collaborate seamlessly. This paper introduces variants of growing hierarchical self-organizing map (GHSOM)-based computational models for assistive robots, which construct knowledge through unsupervised, exploration-based learning. Traditional self-organizing map (SOM) algorithms have shortcomings, including a finite neuron structure, user-defined parameters, and the lack of a hierarchical, adaptive architecture. The proposed models overcome these limitations and grow dynamically to form problem-dependent hierarchical feature clusters, thereby allowing associative learning and symbol grounding. Infants learn from their surroundings through exploration and experience, developing new neuronal connections as they learn, and they can apply prior knowledge to unfamiliar problems. With such infant-like emergent behavior, the presented models can operate on different problems without modification, produce new patterns not present in the input vectors, and allow interactive visualization of results. The proposed models are applied to color clustering, handwritten-digit clustering, finger identification, and image classification problems to evaluate their adaptiveness and infant-like knowledge building. The results show that the proposed models are preferable as generalized models for assistive robots.
{"title":"Biologically Inspired Self-Organizing Computational Model to Mimic Infant Learning","authors":"Karthik Santhanaraj, Dinakaran Devaraj, Ramya Mm, J. Dhanraj, K. Ramanathan","doi":"10.3390/make5020030","DOIUrl":"https://doi.org/10.3390/make5020030","url":null,"abstract":"Recent technological advancements have fostered human–robot coexistence in work and residential environments. The assistive robot must exhibit humane behavior and consistent care to become an integral part of the human habitat. Furthermore, the robot requires an adaptive unsupervised learning model to explore unfamiliar conditions and collaborate seamlessly. This paper introduces variants of the growing hierarchical self-organizing map (GHSOM)-based computational models for assistive robots, which constructs knowledge from unsupervised exploration-based learning. Traditional self-organizing map (SOM) algorithms have shortcomings, including finite neuron structure, user-defined parameters, and non-hierarchical adaptive architecture. The proposed models overcome these limitations and dynamically grow to form problem-dependent hierarchical feature clusters, thereby allowing associative learning and symbol grounding. Infants can learn from their surroundings through exploration and experience, developing new neuronal connections as they learn. They can also apply their prior knowledge to solve unfamiliar problems. With infant-like emergent behavior, the presented models can operate on different problems without modifications, producing new patterns not present in the input vectors and allowing interactive result visualization. The proposed models are applied to the color, handwritten digits clustering, finger identification, and image classification problems to evaluate their adaptiveness and infant-like knowledge building. The results show that the proposed models are the preferred generalized models for assistive robots.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"46 1","pages":"491-511"},"PeriodicalIF":0.0,"publicationDate":"2023-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76167857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fields in the social sciences, such as education research, have started to expand the use of computer-based research methods to supplement traditional research approaches. Natural language processing techniques, such as topic modeling, may support qualitative data analysis by providing early categories that researchers may interpret and refine. This study contributes to this body of research and answers the following research questions: (RQ1) What is the relative coverage of the latent Dirichlet allocation (LDA) topic model and human coding in terms of the breadth of the topics/themes extracted from the text collection? (RQ2) What is the relative depth or level of detail among identified topics using LDA topic models and human coding approaches? A dataset of student reflections was qualitatively analyzed using LDA topic modeling and human coding approaches, and the results were compared. The findings suggest that topic models can provide coverage and depth of themes present in a textual collection that are comparable to human coding, but they require manual interpretation of the topics. The breadth and depth of human coding output are heavily dependent on the expertise of coders and the size of the collection; these factors are better handled in the topic modeling approach.
{"title":"Evaluating the Coverage and Depth of Latent Dirichlet Allocation Topic Model in Comparison with Human Coding of Qualitative Data: The Case of Education Research","authors":"Gaurav Nanda, A. Jaiswal, Hugo Castellanos, Yuzhe Zhou, Alex Choi, Alejandra J. Magana","doi":"10.3390/make5020029","DOIUrl":"https://doi.org/10.3390/make5020029","url":null,"abstract":"Fields in the social sciences, such as education research, have started to expand the use of computer-based research methods to supplement traditional research approaches. Natural language processing techniques, such as topic modeling, may support qualitative data analysis by providing early categories that researchers may interpret and refine. This study contributes to this body of research and answers the following research questions: (RQ1) What is the relative coverage of the latent Dirichlet allocation (LDA) topic model and human coding in terms of the breadth of the topics/themes extracted from the text collection? (RQ2) What is the relative depth or level of detail among identified topics using LDA topic models and human coding approaches? A dataset of student reflections was qualitatively analyzed using LDA topic modeling and human coding approaches, and the results were compared. The findings suggest that topic models can provide reliable coverage and depth of themes present in a textual collection comparable to human coding but require manual interpretation of topics. The breadth and depth of human coding output is heavily dependent on the expertise of coders and the size of the collection; these factors are better handled in the topic modeling approach.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"91 ","pages":"473-490"},"PeriodicalIF":0.0,"publicationDate":"2023-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72422652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sex trafficking victims are often advertised through online escort sites. These ads can be publicly accessed, but law enforcement lacks the resources to comb through hundreds of ads to identify those that may feature sex-trafficked individuals. The purpose of this study was to implement and test multi-input, deep learning (DL) binary classification models to predict the probability of an online escort ad being associated with sex trafficking (ST) activity and aid in the detection and investigation of ST. Data from 12,350 scraped and classified ads were split into training and test sets (80% and 20%, respectively). Multi-input models that included recurrent neural networks (RNN) for text classification, convolutional neural networks (CNN, specifically EfficientNetB6 or ENET) for image/emoji classification, and neural networks (NN) for feature classification were trained and used to classify the 20% test set. The best-performing DL model included text and imagery inputs, resulting in an accuracy of 0.82 and an F1 score of 0.70. More importantly, the best classifier (RNN + ENET) correctly identified 14 of 14 sites that had classification probability estimates of 0.845 or greater (1.0 precision); precision was 96% for the multi-input model (NN + RNN + ENET) when only the ads associated with the highest positive classification probabilities (>0.90) were considered (n = 202 ads). The models developed could be put into production and piloted with criminal investigators, as they could potentially increase their efficiency in identifying potential ST victims.
{"title":"A Multi-Input Machine Learning Approach to Classifying Sex Trafficking from Online Escort Advertisements","authors":"L. Summers, Alyssa N. Shallenberger, John Cruz, Lawrence V. Fulton","doi":"10.3390/make5020028","DOIUrl":"https://doi.org/10.3390/make5020028","url":null,"abstract":"Sex trafficking victims are often advertised through online escort sites. These ads can be publicly accessed, but law enforcement lacks the resources to comb through hundreds of ads to identify those that may feature sex-trafficked individuals. The purpose of this study was to implement and test multi-input, deep learning (DL) binary classification models to predict the probability of an online escort ad being associated with sex trafficking (ST) activity and aid in the detection and investigation of ST. Data from 12,350 scraped and classified ads were split into training and test sets (80% and 20%, respectively). Multi-input models that included recurrent neural networks (RNN) for text classification, convolutional neural networks (CNN, specifically EfficientNetB6 or ENET) for image/emoji classification, and neural networks (NN) for feature classification were trained and used to classify the 20% test set. The best-performing DL model included text and imagery inputs, resulting in an accuracy of 0.82 and an F1 score of 0.70. More importantly, the best classifier (RNN + ENET) correctly identified 14 of 14 sites that had classification probability estimates of 0.845 or greater (1.0 precision); precision was 96% for the multi-input model (NN + RNN + ENET) when only the ads associated with the highest positive classification probabilities (>0.90) were considered (n = 202 ads). The models developed could be productionalized and piloted with criminal investigators, as they could potentially increase their efficiency in identifying potential ST victims.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"101 1","pages":"460-472"},"PeriodicalIF":0.0,"publicationDate":"2023-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80783006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this article, we propose a tree-structured method for either complete or partial rank data that incorporates covariate information into the analysis. We use conditional independence tests based on hierarchical log-linear models for three-way contingency tables to select split variables and cut points, and we apply a simple Bonferroni rule to decide whether a node is worth splitting. Through simulations, we demonstrate that the proposed method is unbiased and effective in selecting informative split variables. Our proposed method can be applied across various fields to provide a flexible and robust framework for analyzing rank data and understanding how various factors affect individual judgments on ranking. This can help improve the quality of products or services and assist with informed decision making.
{"title":"Tree-Structured Model with Unbiased Variable Selection and Interaction Detection for Ranking Data","authors":"Yu-Shan Shih, Yi-Hung Kung","doi":"10.3390/make5020027","DOIUrl":"https://doi.org/10.3390/make5020027","url":null,"abstract":"In this article, we propose a tree-structured method for either complete or partial rank data that incorporates covariate information into the analysis. We use conditional independence tests based on hierarchical log-linear models for three-way contingency tables to select split variables and cut points, and apply a simple Bonferroni rule to declare whether a node worths splitting or not. Through simulations, we also demonstrate that the proposed method is unbiased and effective in selecting informative split variables. Our proposed method can be applied across various fields to provide a flexible and robust framework for analyzing rank data and understanding how various factors affect individual judgments on ranking. This can help improve the quality of products or services and assist with informed decision making.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"10 1","pages":"448-459"},"PeriodicalIF":0.0,"publicationDate":"2023-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72615773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}