首页 > 最新文献

2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
Graph-based Recommendation using Graph Neural Networks 使用图神经网络的基于图的推荐
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00270
Marco Dossena, Christopher Irwin, L. Portinale
Graph based recommendation strategies are recently gaining momentum in connection with the availability of new Graph Neural Network (GNN) architectures. In fact, the interactions between users and products in a recommender system can be naturally expressed in terms of a bipartite graph, where nodes corresponding to users are connected with nodes corresponding to products trough edges representing a user action on the product (usually a purchase). GNNs can then be exploited and trained in order to predict the existence of a specific edge between unconnected users and products, highlighting the interest for a potential purchase of a given product by a given user. In the present paper, we will present an experimental analysis of different GNN architectures in the context of recommender systems. We analyze the impact of different kind of layers such as convolutional, attentional and message-passing, as well as the influence of different embedding size on the performance on the link prediction task. We will also examine the behavior of two of such architectures (the ones relying on the presence of node features) with respect to both transductive and inductive situations.
随着新的图神经网络(GNN)架构的可用性,基于图的推荐策略最近获得了发展势头。实际上,在推荐系统中,用户与产品之间的交互可以很自然地用二部图来表示,其中用户对应的节点与产品对应的节点通过表示用户对产品的操作(通常是购买)的边相连接。然后可以利用和训练gnn,以预测未连接的用户和产品之间存在特定边缘,突出显示给定用户对特定产品的潜在购买兴趣。在本文中,我们将在推荐系统的背景下对不同的GNN架构进行实验分析。我们分析了卷积层、注意力层和消息传递层等不同类型的层对链路预测任务性能的影响,以及不同嵌入大小对链路预测任务性能的影响。我们还将研究两种这样的架构(依赖于节点特征的存在)在转导和归纳情况下的行为。
{"title":"Graph-based Recommendation using Graph Neural Networks","authors":"Marco Dossena, Christopher Irwin, L. Portinale","doi":"10.1109/ICMLA55696.2022.00270","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00270","url":null,"abstract":"Graph based recommendation strategies are recently gaining momentum in connection with the availability of new Graph Neural Network (GNN) architectures. In fact, the interactions between users and products in a recommender system can be naturally expressed in terms of a bipartite graph, where nodes corresponding to users are connected with nodes corresponding to products trough edges representing a user action on the product (usually a purchase). GNNs can then be exploited and trained in order to predict the existence of a specific edge between unconnected users and products, highlighting the interest for a potential purchase of a given product by a given user. In the present paper, we will present an experimental analysis of different GNN architectures in the context of recommender systems. We analyze the impact of different kind of layers such as convolutional, attentional and message-passing, as well as the influence of different embedding size on the performance on the link prediction task. We will also examine the behavior of two of such architectures (the ones relying on the presence of node features) with respect to both transductive and inductive situations.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121500993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using CatBoost and Other Supervised Machine Learning Algorithms to Predict Alzheimer's Disease 使用CatBoost和其他监督机器学习算法预测阿尔茨海默病
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00265
Jessica An
Alzheimer's disease is a progressive neurologic disorder that affects millions of elderly people worldwide. Most affected patients are not formally diagnosed due to the complexity of the disease and the lack of definitive diagnostic tools. Machine learning algorithms are powerful in deciphering complex data patterns. This study applied and evaluated a comprehensive set of nine machine learning techniques in detecting Alzheimer's disease. The model training and testing utilized clinical and brain magnetic resonance imaging features from The Open Access Series of Imaging Studies (OASIS) of Alzheimer's disease. The input data include ordinal data such as cognitive scores and numerical data of imaging measurements. To predict Alzheimer's disease, multiple types of supervised machine learning algorithms were trained, including CatBoost, logistic regression, decision tree, random forest, Naive Bayes, SVM, gradient boosting, XGBoost, and AdaBoost. A set of model performance metrics demonstrated that most algorithms were able to perform very well with high accuracy (92-96% in a longitudinal dataset). The models using CatBoost, SVM and decision tree performed the best. The results of this study suggest that ML algorithms combining clinical cognitive assessment and brain MRI images can assist and improve Alzheimer's disease diagnosis.
阿尔茨海默病是一种进行性神经系统疾病,影响着全世界数百万老年人。由于疾病的复杂性和缺乏明确的诊断工具,大多数受影响的患者未得到正式诊断。机器学习算法在破译复杂的数据模式方面非常强大。本研究应用并评估了一套全面的九种机器学习技术来检测阿尔茨海默病。模型训练和测试利用了阿尔茨海默病开放获取系列成像研究(OASIS)的临床和脑磁共振成像特征。输入数据包括认知得分等序数数据和成像测量的数值数据。为了预测阿尔茨海默病,我们训练了多种监督机器学习算法,包括CatBoost、逻辑回归、决策树、随机森林、朴素贝叶斯、SVM、梯度增强、XGBoost和AdaBoost。一组模型性能指标表明,大多数算法能够以很高的准确率(在纵向数据集中为92-96%)执行得非常好。使用CatBoost、SVM和决策树的模型效果最好。本研究结果表明,结合临床认知评估和脑MRI图像的ML算法可以辅助和改善阿尔茨海默病的诊断。
{"title":"Using CatBoost and Other Supervised Machine Learning Algorithms to Predict Alzheimer's Disease","authors":"Jessica An","doi":"10.1109/ICMLA55696.2022.00265","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00265","url":null,"abstract":"Alzheimer's disease is a progressive neurologic disorder that affects millions of elderly people worldwide. Most affected patients are not formally diagnosed due to the complexity of the disease and the lack of definitive diagnostic tools. Machine learning algorithms are powerful in deciphering complex data patterns. This study applied and evaluated a comprehensive set of nine machine learning techniques in detecting Alzheimer's disease. The model training and testing utilized clinical and brain magnetic resonance imaging features from The Open Access Series of Imaging Studies (OASIS) of Alzheimer's disease. The input data include ordinal data such as cognitive scores and numerical data of imaging measurements. To predict Alzheimer's disease, multiple types of supervised machine learning algorithms were trained, including CatBoost, logistic regression, decision tree, random forest, Naive Bayes, SVM, gradient boosting, XGBoost, and AdaBoost. A set of model performance metrics demonstrated that most algorithms were able to perform very well with high accuracy (92-96% in a longitudinal dataset). The models using CatBoost, SVM and decision tree performed the best. The results of this study suggest that ML algorithms combining clinical cognitive assessment and brain MRI images can assist and improve Alzheimer's disease diagnosis.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126480998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Vision Transformer Architecture for Open Set Recognition 一种面向开放集识别的视觉变换体系结构
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00034
Feiyang Cai, Zhenkai Zhang, Jie Liu
Deep neural networks have demonstrated prominent capacities for image classification tasks in a closed set setting, where the test data come from the same distribution as the training data. However, in a more realistic open set scenario, traditional classifiers with incomplete knowledge cannot tackle test data that are not from the training classes. Open set recognition (OSR) aims to address this problem by both identifying unknown classes and distinguishing known classes simultaneously. In this paper, we propose a novel approach to OSR that is based on the vision transformer (ViT) technique. Specifically, our approach employs two separate training stages. First, a ViT model is trained to perform closed set classification. Then, an additional detection head is attached to the embedded features extracted by the ViT, trained to force the representations of known data to class-specific clusters compactly. Test examples are identified as known or unknown based on their distance to the cluster centers. To the best of our knowledge, this is the first time to leverage ViT for the purpose of OSR, and our extensive evaluation against several OSR benchmark datasets reveals that our approach significantly outperforms other baseline methods and obtains new state-of-the-art performance.
深度神经网络已经证明了在封闭集环境下图像分类任务的突出能力,其中测试数据与训练数据来自相同的分布。然而,在更现实的开放集场景中,具有不完全知识的传统分类器无法处理非训练类的测试数据。开放集识别(OSR)旨在通过同时识别未知类和识别已知类来解决这一问题。在本文中,我们提出了一种基于视觉变压器(ViT)技术的OSR新方法。具体来说,我们的方法采用了两个独立的训练阶段。首先,训练ViT模型进行闭集分类。然后,一个附加的检测头附加到由ViT提取的嵌入特征上,训练以将已知数据的表示紧凑地强制到特定类的聚类。测试样例根据它们到聚类中心的距离被识别为已知或未知。据我们所知,这是第一次将ViT用于OSR,我们对几个OSR基准数据集进行了广泛的评估,结果表明我们的方法明显优于其他基准方法,并获得了最新的性能。
{"title":"A Vision Transformer Architecture for Open Set Recognition","authors":"Feiyang Cai, Zhenkai Zhang, Jie Liu","doi":"10.1109/ICMLA55696.2022.00034","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00034","url":null,"abstract":"Deep neural networks have demonstrated prominent capacities for image classification tasks in a closed set setting, where the test data come from the same distribution as the training data. However, in a more realistic open set scenario, traditional classifiers with incomplete knowledge cannot tackle test data that are not from the training classes. Open set recognition (OSR) aims to address this problem by both identifying unknown classes and distinguishing known classes simultaneously. In this paper, we propose a novel approach to OSR that is based on the vision transformer (ViT) technique. Specifically, our approach employs two separate training stages. First, a ViT model is trained to perform closed set classification. Then, an additional detection head is attached to the embedded features extracted by the ViT, trained to force the representations of known data to class-specific clusters compactly. Test examples are identified as known or unknown based on their distance to the cluster centers. To the best of our knowledge, this is the first time to leverage ViT for the purpose of OSR, and our extensive evaluation against several OSR benchmark datasets reveals that our approach significantly outperforms other baseline methods and obtains new state-of-the-art performance.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121790448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Approach for Synthetic Reduced Nearest-Neighbor Leveraging Neural Networks 一种利用神经网络的合成最近邻化简方法
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00138
A. Alizadeh, Pooya Tavallali, Vahid Behzadan, A. Ranganath, Mukesh Singhal
Synthetic Reduced Nearest Neighbor is a nearest neighbor model that is constrained on synthetic samples (i.e., prototypes). The body of work on such models includes proposals for improving the interpretability and optimization of SRNN models using expectation maximization. Motivated by the promise of this paradigm, we propose a novel Expectation Maximization approach for Synthetic Reduced Nearest Neighbors leveraging neural networks. Furthermore, we compare the performance of our proposed technique to classical state-of-the-art machine learning methods such as random forest and ensemble models. The empirical results demonstrate the advantages of using neural networks in lieu of an expectation maximization algorithm.
合成简化近邻是一种约束在合成样本(即原型)上的最近邻模型。这些模型的工作主体包括使用期望最大化来改进SRNN模型的可解释性和优化的建议。在这种范式的激励下,我们提出了一种利用神经网络的合成减少最近邻的新期望最大化方法。此外,我们将我们提出的技术的性能与经典的最先进的机器学习方法(如随机森林和集成模型)进行了比较。实证结果表明,使用神经网络代替期望最大化算法的优势。
{"title":"A Novel Approach for Synthetic Reduced Nearest-Neighbor Leveraging Neural Networks","authors":"A. Alizadeh, Pooya Tavallali, Vahid Behzadan, A. Ranganath, Mukesh Singhal","doi":"10.1109/ICMLA55696.2022.00138","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00138","url":null,"abstract":"Synthetic Reduced Nearest Neighbor is a nearest neighbor model that is constrained on synthetic samples (i.e., prototypes). The body of work on such models includes proposals for improving the interpretability and optimization of SRNN models using expectation maximization. Motivated by the promise of this paradigm, we propose a novel Expectation Maximization approach for Synthetic Reduced Nearest Neighbors leveraging neural networks. Furthermore, we compare the performance of our proposed technique to classical state-of-the-art machine learning methods such as random forest and ensemble models. The empirical results demonstrate the advantages of using neural networks in lieu of an expectation maximization algorithm.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"352 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115976970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transfer Learning for Bayesian Optimization with Principal Component Analysis 基于主成分分析的贝叶斯优化迁移学习
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00178
Hideyuki Masui, D. Romeres, D. Nikovski
Bayesian Optimization has been widely used for black-box optimization. Especially in the field of machine learning, BO has obtained remarkable results in hyperparameters optimization. However, the best hyperparameters depend on the specific task and traditionally the BO algorithm needs to be repeated for each task. On the other hand, the relationship between hyperparameters and objectives has similar tendency among tasks. Therefore, transfer learning is an important technology to accelerate the optimization of novel task by leveraging the knowledge acquired in prior tasks. In this work, we propose a new transfer learning strategy for BO. We use information geometry based principal component analysis (PCA) to extract a low-dimension manifold from a set of Gaussian process (GP) posteriors that models the objective functions of the prior tasks. Then, the low dimensional parameters of this manifold can be optimized to adapt to a new task and set a prior distribution for the objective function of the novel task. Experiments on hyperparameters optimization benchmarks show that our proposed algorithm, called BO-PCA, accelerates the learning of an unseen task (less data are required) while having low computational cost.
贝叶斯优化被广泛应用于黑盒优化。特别是在机器学习领域,BO在超参数优化方面取得了显著的成果。然而,最佳的超参数依赖于特定的任务,传统的BO算法需要对每个任务重复。另一方面,任务间的超参数与目标的关系也有类似的趋势。因此,迁移学习是利用在先前任务中获得的知识来加速新任务优化的重要技术。在这项工作中,我们提出了一种新的迁移学习策略。我们使用基于信息几何的主成分分析(PCA)从一组高斯过程(GP)后验中提取低维流形,这些后验建模了先验任务的目标函数。然后,优化流形的低维参数以适应新任务,并为新任务的目标函数设定先验分布。在超参数优化基准上的实验表明,我们提出的BO-PCA算法在计算成本低的同时加速了对未知任务的学习(所需的数据更少)。
{"title":"Transfer Learning for Bayesian Optimization with Principal Component Analysis","authors":"Hideyuki Masui, D. Romeres, D. Nikovski","doi":"10.1109/ICMLA55696.2022.00178","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00178","url":null,"abstract":"Bayesian Optimization has been widely used for black-box optimization. Especially in the field of machine learning, BO has obtained remarkable results in hyperparameters optimization. However, the best hyperparameters depend on the specific task and traditionally the BO algorithm needs to be repeated for each task. On the other hand, the relationship between hyperparameters and objectives has similar tendency among tasks. Therefore, transfer learning is an important technology to accelerate the optimization of novel task by leveraging the knowledge acquired in prior tasks. In this work, we propose a new transfer learning strategy for BO. We use information geometry based principal component analysis (PCA) to extract a low-dimension manifold from a set of Gaussian process (GP) posteriors that models the objective functions of the prior tasks. Then, the low dimensional parameters of this manifold can be optimized to adapt to a new task and set a prior distribution for the objective function of the novel task. Experiments on hyperparameters optimization benchmarks show that our proposed algorithm, called BO-PCA, accelerates the learning of an unseen task (less data are required) while having low computational cost.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133864579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cost-Sensitive Ensemble Learning for Highly Imbalanced Classification 高度不平衡分类的代价敏感集成学习
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00225
Justin M. Johnson, T. Khoshgoftaar
There are a variety of data-level and algorithm-level methods available for treating class imbalance. Data-level methods include data sampling strategies that pre-process training data to reduce levels of class imbalance. Algorithm-level methods modify the learning and inference processes to reduce bias towards the majority class. This study evaluates both data-level and algorithm-level methods for class imbalance using a highly imbalanced healthcare fraud data set. We approach the problem from a cost-sensitive learning perspective, and demonstrate how these direct and indirect cost-sensitive methods can be implemented using a common cost matrix. For each method, a wide range of costs are evaluated using three popular ensemble learning algorithms. Initial results show that random undersampling (RUS) and class weighting are both effective ways to improve classification when the default classification threshold is used. Further analysis using the area under the precision-recall curve, however, shows that both RUS and class weighting actually decrease the discriminative power of these learners. Through multiple complementary performance metrics and confidence interval analysis, we find that the best model performance is consistently obtained when RUS and class weighting are not applied, but when output thresholding is used to maximize the confusion matrix instead. Our contributions include various recommendations related to implementing cost-sensitive ensemble learning and effective model evaluation, as well as empirical evidence that contradicts popular beliefs about learning from imbalanced data.
有许多数据级和算法级的方法可用于处理类不平衡。数据级方法包括数据采样策略,该策略对训练数据进行预处理以降低类不平衡水平。算法级方法修改学习和推理过程,以减少对多数类的偏见。本研究使用高度不平衡的医疗保健欺诈数据集,评估了数据级和算法级方法的类不平衡。我们从成本敏感学习的角度来解决这个问题,并演示如何使用一个共同的成本矩阵来实现这些直接和间接的成本敏感方法。对于每种方法,使用三种流行的集成学习算法来评估广泛的成本。初步结果表明,当使用默认分类阈值时,随机欠采样(RUS)和类加权都是改进分类的有效方法。然而,使用精确召回曲线下的面积进一步分析表明,RUS和班级权重实际上降低了这些学习者的判别能力。通过多个互补的性能指标和置信区间分析,我们发现当不使用RUS和类权重时,而是使用输出阈值来最大化混淆矩阵时,模型性能始终保持最佳。我们的贡献包括与实施成本敏感集成学习和有效模型评估相关的各种建议,以及与从不平衡数据中学习的流行观点相矛盾的经验证据。
{"title":"Cost-Sensitive Ensemble Learning for Highly Imbalanced Classification","authors":"Justin M. Johnson, T. Khoshgoftaar","doi":"10.1109/ICMLA55696.2022.00225","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00225","url":null,"abstract":"There are a variety of data-level and algorithm-level methods available for treating class imbalance. Data-level methods include data sampling strategies that pre-process training data to reduce levels of class imbalance. Algorithm-level methods modify the learning and inference processes to reduce bias towards the majority class. This study evaluates both data-level and algorithm-level methods for class imbalance using a highly imbalanced healthcare fraud data set. We approach the problem from a cost-sensitive learning perspective, and demonstrate how these direct and indirect cost-sensitive methods can be implemented using a common cost matrix. For each method, a wide range of costs are evaluated using three popular ensemble learning algorithms. Initial results show that random undersampling (RUS) and class weighting are both effective ways to improve classification when the default classification threshold is used. Further analysis using the area under the precision-recall curve, however, shows that both RUS and class weighting actually decrease the discriminative power of these learners. Through multiple complementary performance metrics and confidence interval analysis, we find that the best model performance is consistently obtained when RUS and class weighting are not applied, but when output thresholding is used to maximize the confusion matrix instead. Our contributions include various recommendations related to implementing cost-sensitive ensemble learning and effective model evaluation, as well as empirical evidence that contradicts popular beliefs about learning from imbalanced data.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132581012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Ontology-based transfer learning method improving classification of medical documents 基于本体的医学文献分类迁移学习方法
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00065
Daniel Bruneß, Matthias Bay, Christian Schulze, Michael Guckert, Mirjam Minor
Automatic classification of documents is a well known problem and can be solved with Machine Learning methods. However, such approaches require large sets of training data which are not always available. Moreover, in data protection sensitive domains, e.g. electronic health records, Machine Learning models often cannot directly be transferred to other environments. We present a transfer learning method which uses ontologies to normalise the feature space of text classifiers. With this we can guarantee that the trained models do not contain any person related data and can therefore be widely reused without raising General Data Protection Regulation (GDPR) issues. Furthermore, we describe a process with which the ontologies can be enriched so that the classifiers can be reused in different contexts with deviating terminology without any additional training of the classifiers. Our transfer learning method follows a combined paradigm of transfer by copy and transfer by enrichment. As proof of concept we apply classifiers trained on hospital medical documents together with appropriately enriched ontologies to medical texts written in colloquial language. The promising results show the potential of our transfer learning approach that respects GDPR requirements and can flexibly be adapted to drifting terminology.
文档的自动分类是一个众所周知的问题,可以用机器学习方法来解决。然而,这种方法需要大量的训练数据,而这些数据并不总是可用的。此外,在数据保护敏感领域,例如电子健康记录,机器学习模型通常不能直接转移到其他环境。提出了一种利用本体对文本分类器特征空间进行规范化的迁移学习方法。这样,我们就可以保证经过训练的模型不包含任何与个人相关的数据,因此可以在不引起通用数据保护条例(GDPR)问题的情况下被广泛重用。此外,我们描述了一个过程,通过该过程可以丰富本体,以便分类器可以在不同的上下文中重用不同的术语,而无需对分类器进行任何额外的训练。我们的迁移学习方法遵循复制迁移和丰富迁移相结合的范式。作为概念的证明,我们将在医院医疗文件上训练的分类器与适当丰富的本体论一起应用于以口语书写的医学文本。这些令人鼓舞的结果显示了我们的迁移学习方法的潜力,该方法尊重GDPR要求,可以灵活地适应漂移术语。
{"title":"An Ontology-based transfer learning method improving classification of medical documents","authors":"Daniel Bruneß, Matthias Bay, Christian Schulze, Michael Guckert, Mirjam Minor","doi":"10.1109/ICMLA55696.2022.00065","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00065","url":null,"abstract":"Automatic classification of documents is a well known problem and can be solved with Machine Learning methods. However, such approaches require large sets of training data which are not always available. Moreover, in data protection sensitive domains, e.g. electronic health records, Machine Learning models often cannot directly be transferred to other environments. We present a transfer learning method which uses ontologies to normalise the feature space of text classifiers. With this we can guarantee that the trained models do not contain any person related data and can therefore be widely reused without raising General Data Protection Regulation (GDPR) issues. Furthermore, we describe a process with which the ontologies can be enriched so that the classifiers can be reused in different contexts with deviating terminology without any additional training of the classifiers. Our transfer learning method follows a combined paradigm of transfer by copy and transfer by enrichment. As proof of concept we apply classifiers trained on hospital medical documents together with appropriately enriched ontologies to medical texts written in colloquial language. The promising results show the potential of our transfer learning approach that respects GDPR requirements and can flexibly be adapted to drifting terminology.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123221144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A CNN-Based Automated Stuttering Identification System 基于cnn的口吃自动识别系统
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00247
YashKiran Prabhu, Naeem Seliya
Stuttering can affect quality of life, resulting in poor social, emotional, and mental health status. Stuttering is diagnosed and managed by speech language pathologists, who are scarce in developing countries. We propose a novel CNN-based Automated Stuttering Identification System (ASIS) to help speech pathologists autonomously diagnose, classify, and log fluency disorders (blocks, prolongations, sound repetitions, word repetitions, and interjections), and monitor patient’s fluency progress over time. A baseline CNN model was created in Tensorflow/Keras and trained and tested using the Sep-28k dataset, an annotated stuttering database of 28,000 3-second clips. We built individual models for each disfluency label and measured accuracy, precision, recall, and F1 measure. The models were built five times, and the averages were taken of each metric. Three different training-validation-test splits were used: 80-10-10, 70-20-10, and 60-20-20. The models performed very well on the public dataset, exceeding the accuracy and F1 measure of other classifiers. The proposed ASIS can help speech pathologists improve the quality of life of stutterers especially in developing countries immensely, and thus it can make a significant difference for millions around the world.
口吃会影响生活质量,导致社交、情感和精神健康状况不佳。口吃是由语言病理学家诊断和治疗的,而这在发展中国家是稀缺的。我们提出一种新的基于cnn的自动口吃识别系统(ASIS),以帮助语言病理学家自主诊断,分类和记录流利性障碍(块,延长,声音重复,单词重复和叹词),并监测患者的流利程度随时间的进展。在Tensorflow/Keras中创建了一个基线CNN模型,并使用Sep-28k数据集进行训练和测试,Sep-28k数据集是一个带有注释的口吃数据库,包含28,000个3秒片段。我们为每个不流畅标签建立了单独的模型,并测量了准确性、精度、召回率和F1测量。这些模型建立了五次,并对每个指标取平均值。使用了三种不同的训练-验证-测试分割:80-10-10、70-20-10和60-20-20。该模型在公共数据集上表现非常好,超过了其他分类器的准确性和F1度量。提出的ASIS可以帮助语言病理学家极大地提高口吃者的生活质量,特别是在发展中国家,因此它可以对全世界数百万人产生重大影响。
{"title":"A CNN-Based Automated Stuttering Identification System","authors":"YashKiran Prabhu, Naeem Seliya","doi":"10.1109/ICMLA55696.2022.00247","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00247","url":null,"abstract":"Stuttering can affect quality of life, resulting in poor social, emotional, and mental health status. Stuttering is diagnosed and managed by speech language pathologists, who are scarce in developing countries. We propose a novel CNN-based Automated Stuttering Identification System (ASIS) to help speech pathologists autonomously diagnose, classify, and log fluency disorders (blocks, prolongations, sound repetitions, word repetitions, and interjections), and monitor patient’s fluency progress over time. A baseline CNN model was created in Tensorflow/Keras and trained and tested using the Sep-28k dataset, an annotated stuttering database of 28,000 3-second clips. We built individual models for each disfluency label and measured accuracy, precision, recall, and F1 measure. The models were built five times, and the averages were taken of each metric. Three different training-validation-test splits were used: 80-10-10, 70-20-10, and 60-20-20. The models performed very well on the public dataset, exceeding the accuracy and F1 measure of other classifiers. The proposed ASIS can help speech pathologists improve the quality of life of stutterers especially in developing countries immensely, and thus it can make a significant difference for millions around the world.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"488 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126109879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
TSEvo: Evolutionary Counterfactual Explanations for Time Series Classification 时间序列分类的进化反事实解释
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00013
Jacqueline Höllig, Cedric Kulbach, Steffen Thoma
With the increasing predominance of deep learning methods on time series classification, interpretability becomes essential, especially in high-stake scenarios. Although many approaches to interpretability have been explored for images and tabular data, time series data has been mostly neglected. We approach the problem of interpretability by proposing TSEvo, a model-agnostic multiobjective evolutionary approach to time series counterfactuals incorporating a variety of time series transformation mechanisms to cope with different types and structures of time series. We evaluate our framework on both uni- and multivariate benchmark datasets.
随着深度学习方法在时间序列分类中的优势日益增强,可解释性变得至关重要,特别是在高风险场景中。虽然已经探索了许多图像和表格数据的可解释性方法,但时间序列数据大多被忽视。我们通过提出TSEvo来解决可解释性问题,TSEvo是一种模型不可知的时间序列反事实多目标进化方法,它结合了多种时间序列转换机制来处理不同类型和结构的时间序列。我们在单变量和多变量基准数据集上评估我们的框架。
{"title":"TSEvo: Evolutionary Counterfactual Explanations for Time Series Classification","authors":"Jacqueline Höllig, Cedric Kulbach, Steffen Thoma","doi":"10.1109/ICMLA55696.2022.00013","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00013","url":null,"abstract":"With the increasing predominance of deep learning methods on time series classification, interpretability becomes essential, especially in high-stake scenarios. Although many approaches to interpretability have been explored for images and tabular data, time series data has been mostly neglected. We approach the problem of interpretability by proposing TSEvo, a model-agnostic multiobjective evolutionary approach to time series counterfactuals incorporating a variety of time series transformation mechanisms to cope with different types and structures of time series. We evaluate our framework on both uni- and multivariate benchmark datasets.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129867375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A New Framework to Assess the Individual Fairness of Probabilistic Classifiers 一种评估概率分类器个体公平性的新框架
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00145
M. F. A. Khan, Hamid Karimi
Fairness in machine learning has become a global concern due to the predominance of ML in automated decision-making systems. In comparison to group fairness, individual fairness, which aspires that similar individuals should be treated similarly, has received limited attention due to some challenges. One major challenge is the availability of a proper metric to evaluate individual fairness, especially for probabilistic classifiers. In this study, we propose a framework PCIndFair to assess the individual fairness of probabilistic classifiers. Unlike current individual fairness measures, our framework considers probability distribution rather than the final classification outcome, which is suitable for capturing the dynamic of probabilistic classifiers, e.g., neural networks. We perform extensive experiments on four standard datasets and discuss the practical benefits of the framework. This study can be helpful for machine learning researchers and practitioners flexibly assess their models' individual fairness. The complete code of the framework is publicly available1.
由于机器学习在自动化决策系统中的主导地位,机器学习中的公平性已经成为全球关注的问题。与群体公平相比,个体公平由于受到一些挑战而受到的关注有限。个体公平要求相似的个体应该得到相似的对待。一个主要的挑战是评估个体公平性的适当度量的可用性,特别是对于概率分类器。在本研究中,我们提出了一个框架PCIndFair来评估概率分类器的个体公平性。与当前的个体公平度量不同,我们的框架考虑概率分布而不是最终分类结果,这适用于捕获概率分类器(例如神经网络)的动态。我们在四个标准数据集上进行了广泛的实验,并讨论了该框架的实际好处。本研究可以帮助机器学习研究者和实践者灵活地评估其模型的个体公平性。该框架的完整代码是公开的。
{"title":"A New Framework to Assess the Individual Fairness of Probabilistic Classifiers","authors":"M. F. A. Khan, Hamid Karimi","doi":"10.1109/ICMLA55696.2022.00145","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00145","url":null,"abstract":"Fairness in machine learning has become a global concern due to the predominance of ML in automated decision-making systems. In comparison to group fairness, individual fairness, which aspires that similar individuals should be treated similarly, has received limited attention due to some challenges. One major challenge is the availability of a proper metric to evaluate individual fairness, especially for probabilistic classifiers. In this study, we propose a framework PCIndFair to assess the individual fairness of probabilistic classifiers. Unlike current individual fairness measures, our framework considers probability distribution rather than the final classification outcome, which is suitable for capturing the dynamic of probabilistic classifiers, e.g., neural networks. We perform extensive experiments on four standard datasets and discuss the practical benefits of the framework. This study can be helpful for machine learning researchers and practitioners flexibly assess their models' individual fairness. The complete code of the framework is publicly available1.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127539726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1