首页 > 最新文献

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
Deep Neural Networks for Detecting Asteroids in the ATLAS Data Pipeline ATLAS数据管道中探测小行星的深度神经网络
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00224
Noa Kaplan, R. Loveland, L. Denneau
The ”Asteroid Terrestrial-impact Last Alert System” (ATLAS) currently uses a two stage binary classifier to filter moving astronomical objects from electronic and optical artifacts, in order to detect asteroids that may eventually pass close to, or impact, Earth. Any detections that pass ATLAS’s filter are examined by human analysts. The results of the current filter contains more false positives than true positives, so that the majority of detections are classified as bogus by the analysts. These bogus tracklets cause unnecessary work for the analysts, and increase the time it takes to classify a detection as a real near Earth object, potentially decreasing warning time for a collision. In order to reduce this unnecessary effort, we extend the current classifier to incorporate dynamic motion data. We develop two engineered features which are combined with the output of the original classifier as the input features of a deep neural network. This network generates the probability of a detected object being designated real (i.e. an actual, moving, astronomical object), as opposed to being classified as bogus (i.e. one of the vast majority of false detections resulting from optical or noise artifacts). The new classifier decreases false positives by 59%, while maintaining a low false negative rate at virtually zero.
“小行星对地撞击最后预警系统”(ATLAS)目前使用两级二元分类器从电子和光学人工制品中过滤移动的天文物体,以检测可能最终接近或撞击地球的小行星。任何通过ATLAS过滤器的检测都由人工分析人员进行检查。当前过滤器的结果包含假阳性多于真阳性,因此大多数检测被分析人员分类为假。这些伪造的轨道给分析人员带来了不必要的工作,并增加了将探测到的物体分类为真正的近地物体所需的时间,潜在地减少了碰撞的警告时间。为了减少这种不必要的努力,我们扩展了当前的分类器以纳入动态运动数据。我们开发了两个工程特征,它们与原始分类器的输出相结合,作为深度神经网络的输入特征。该网络生成被检测对象被指定为真实对象(即实际的,移动的,天文对象)的概率,而不是被归类为伪造对象(即绝大多数由光学或噪声伪影引起的错误检测之一)。新的分类器减少了59%的假阳性,同时保持了几乎为零的低假阴性率。
{"title":"Deep Neural Networks for Detecting Asteroids in the ATLAS Data Pipeline","authors":"Noa Kaplan, R. Loveland, L. Denneau","doi":"10.1109/ICMLA52953.2021.00224","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00224","url":null,"abstract":"The ”Asteroid Terrestrial-impact Last Alert System” (ATLAS) currently uses a two stage binary classifier to filter moving astronomical objects from electronic and optical artifacts, in order to detect asteroids that may eventually pass close to, or impact, Earth. Any detections that pass ATLAS’s filter are examined by human analysts. The results of the current filter contains more false positives than true positives, so that the majority of detections are classified as bogus by the analysts. These bogus tracklets cause unnecessary work for the analysts, and increase the time it takes to classify a detection as a real near Earth object, potentially decreasing warning time for a collision. In order to reduce this unnecessary effort, we extend the current classifier to incorporate dynamic motion data. We develop two engineered features which are combined with the output of the original classifier as the input features of a deep neural network. This network generates the probability of a detected object being designated real (i.e. an actual, moving, astronomical object), as opposed to being classified as bogus (i.e. one of the vast majority of false detections resulting from optical or noise artifacts). The new classifier decreases false positives by 59%, while maintaining a low false negative rate at virtually zero.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"50 1","pages":"1387-1392"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84772376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An HMM–ensemble approach to predict severity progression of ICU treatment for hospitalized COVID–19 patients 预测COVID-19住院患者ICU治疗严重程度进展的hmm集成方法
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00211
F. Mandreoli, Federico Motta, P. Missier
COVID-19-related pneumonia requires different modalities of Intensive Care Unit (ICU) interventions at different times to facilitate breathing, depending on severity progression. The ability for clinical staff to predict how patients admitted to hospital will require more or less ICU treatment on a daily basis is critical to ICU management. For real datasets that are sparse and incomplete and where the most important state transitions (dismissal, death) are rare, a standard Hidden Markov Model (HMM) approach is insufficient, as it is prone to overfitting. In this paper we propose a more sophisticated ensemble–based approach that involves training multiple HMMs, each specialized in a subset of the state transitions, and then selecting the more plausible predictions either by selecting or combining the models. We have validated the approach on a live dataset of about 1,000 patients from a partner hospital. Our results show that rare events, as well as the transitions to the most severe treatments outperform state of the art approaches.
covid -19相关肺炎需要根据严重程度的进展,在不同时间采取不同方式的重症监护病房(ICU)干预措施,以促进呼吸。临床工作人员预测入院患者每天需要更多或更少的ICU治疗的能力对ICU管理至关重要。对于稀疏和不完整的真实数据集,以及最重要的状态转换(解散,死亡)很少的数据集,标准的隐马尔可夫模型(HMM)方法是不够的,因为它容易过度拟合。在本文中,我们提出了一种更复杂的基于集成的方法,该方法包括训练多个hmm,每个hmm专门研究状态转换的一个子集,然后通过选择或组合模型来选择更合理的预测。我们已经在来自合作医院的约1000名患者的实时数据集上验证了该方法。我们的研究结果表明,罕见的事件,以及过渡到最严重的治疗优于最先进的方法。
{"title":"An HMM–ensemble approach to predict severity progression of ICU treatment for hospitalized COVID–19 patients","authors":"F. Mandreoli, Federico Motta, P. Missier","doi":"10.1109/ICMLA52953.2021.00211","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00211","url":null,"abstract":"COVID-19-related pneumonia requires different modalities of Intensive Care Unit (ICU) interventions at different times to facilitate breathing, depending on severity progression. The ability for clinical staff to predict how patients admitted to hospital will require more or less ICU treatment on a daily basis is critical to ICU management. For real datasets that are sparse and incomplete and where the most important state transitions (dismissal, death) are rare, a standard Hidden Markov Model (HMM) approach is insufficient, as it is prone to overfitting. In this paper we propose a more sophisticated ensemble–based approach that involves training multiple HMMs, each specialized in a subset of the state transitions, and then selecting the more plausible predictions either by selecting or combining the models. We have validated the approach on a live dataset of about 1,000 patients from a partner hospital. Our results show that rare events, as well as the transitions to the most severe treatments outperform state of the art approaches.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"39 1","pages":"1299-1306"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88609744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Ensembles of Long Short-Term Memory Experts for Streaming Data with Sudden Concept Drift 具有突然概念漂移的流数据的长短期记忆专家组合
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00120
Sabine Apfeld, A. Charlish, G. Ascheid
One of the challenges encountered when processing streaming data is a change of the data distribution, which is called concept drift. It has been shown that ensemble methods are effective in reacting to such a change. However, so far it has not been investigated how the architecture and configuration of the ensemble, as well as the properties of the scenario, influence the prediction accuracy if the ensemble members (experts) are Long Short-Term Memory networks with an internal state. This paper evaluates six ensemble architectures in several configurations with regards to their suitability for processing streaming data with sudden, recurring concept drift. The evaluation with a public dataset shows the impact of the architecture and configuration on the ensembles’ accuracies, as well as the influence of the concepts’ stability periods and the Long Short-Term Memory experts’ internal states under several conditions.
处理流数据时遇到的挑战之一是数据分布的变化,这被称为概念漂移。已经证明,集成方法在应对这种变化方面是有效的。然而,如果集成成员(专家)是具有内部状态的长短期记忆网络,那么集成的体系结构和配置以及场景的性质如何影响预测精度,目前还没有研究。本文评估了几种配置下的六种集成体系结构对处理具有突然、反复出现的概念漂移的流数据的适用性。利用公共数据集进行评估,显示了在不同条件下,体系结构和配置对集成精度的影响,以及概念稳定期和长短期记忆专家内部状态的影响。
{"title":"Ensembles of Long Short-Term Memory Experts for Streaming Data with Sudden Concept Drift","authors":"Sabine Apfeld, A. Charlish, G. Ascheid","doi":"10.1109/ICMLA52953.2021.00120","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00120","url":null,"abstract":"One of the challenges encountered when processing streaming data is a change of the data distribution, which is called concept drift. It has been shown that ensemble methods are effective in reacting to such a change. However, so far it has not been investigated how the architecture and configuration of the ensemble, as well as the properties of the scenario, influence the prediction accuracy if the ensemble members (experts) are Long Short-Term Memory networks with an internal state. This paper evaluates six ensemble architectures in several configurations with regards to their suitability for processing streaming data with sudden, recurring concept drift. The evaluation with a public dataset shows the impact of the architecture and configuration on the ensembles’ accuracies, as well as the influence of the concepts’ stability periods and the Long Short-Term Memory experts’ internal states under several conditions.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"716-723"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89311452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
PermeabilityNets: Comparing Neural Network Architectures on a Sequence-to-Instance Task in CFRP Manufacturing 渗透性网络:比较CFRP制造中序列到实例任务的神经网络架构
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00116
S. Stieber, N. Schröter, E. Fauster, Alexander Schiendorfer, W. Reif
Carbon fiber reinforced polymers (CFRP) offer highly desirable properties such as weight-specific strength and stiffness. Liquid composite moulding (LCM) processes are prominent, economically efficient, out-of-autoclave manufacturing techniques and, in particular, resin transfer moulding (RTM), allows for a high level of automation. There, fibrous preforms are impregnated by a viscous polymer matrix in a closed mould. Impregnation quality is of crucial importance for the final part quality and is dominated by preform permeability. We propose to learn a map of permeability deviations based on a sequence of camera images acquired in flow experiments. Several ML models are investigated for this task, among which ConvLSTM networks achieve an accuracy of up to 96.56%, showing better performance than the Transformer or pure CNNs. Finally, we demonstrate that models, trained purely on simulated data, achieve qualitatively good results on real data.
碳纤维增强聚合物(CFRP)提供了非常理想的性能,如重量比强度和刚度。液体复合成型(LCM)工艺是突出的,经济高效的,非高压釜制造技术,特别是树脂转移成型(RTM),允许高水平的自动化。在那里,纤维预制体在封闭的模具中由粘性聚合物基质浸渍。浸渍质量对成品质量至关重要,浸渍质量主要由预制体渗透率决定。我们提出了一种基于在流动实验中获得的一系列相机图像的渗透率偏差图。针对该任务研究了几种机器学习模型,其中ConvLSTM网络的准确率高达96.56%,优于Transformer或纯cnn。最后,我们证明了纯粹在模拟数据上训练的模型在真实数据上获得了质量良好的结果。
{"title":"PermeabilityNets: Comparing Neural Network Architectures on a Sequence-to-Instance Task in CFRP Manufacturing","authors":"S. Stieber, N. Schröter, E. Fauster, Alexander Schiendorfer, W. Reif","doi":"10.1109/ICMLA52953.2021.00116","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00116","url":null,"abstract":"Carbon fiber reinforced polymers (CFRP) offer highly desirable properties such as weight-specific strength and stiffness. Liquid composite moulding (LCM) processes are prominent, economically efficient, out-of-autoclave manufacturing techniques and, in particular, resin transfer moulding (RTM), allows for a high level of automation. There, fibrous preforms are impregnated by a viscous polymer matrix in a closed mould. Impregnation quality is of crucial importance for the final part quality and is dominated by preform permeability. We propose to learn a map of permeability deviations based on a sequence of camera images acquired in flow experiments. Several ML models are investigated for this task, among which ConvLSTM networks achieve an accuracy of up to 96.56%, showing better performance than the Transformer or pure CNNs. Finally, we demonstrate that models, trained purely on simulated data, achieve qualitatively good results on real data.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"61 1","pages":"694-697"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88153663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Probabilistic Multi-knowledge Transfer in Reinforcement Learning 强化学习中的概率多知识转移
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00079
Daniel Fernández, F. Fernández, Javier García
Transfer in Reinforcement Learning (RL) aims to remedy the problem of learning complex RL tasks from scratch, which is impractical in most of the cases due to the huge sample requirements. To overcome this problem, transferring the knowledge acquired from a set of source tasks to a new target task is a core idea. This knowledge can be the policy, the model (state transition and/or reward function), or the value function learned in the source tasks. However, algorithms in transfer learning focus on transferring a single type of knowledge at a time, although intuitively it might be interesting to reuse several types of this knowledge. For this reason, in this paper we propose a multi-knowledge transfer RL algorithm which we call Probabilistic Transfer of Policies and Models (PTPM). PTPM, unlike single-knowledge transfer approaches, combines the transfer of two types of knowledge: policies and models. We show through different experiments on two well-known domains (Grid World and Mountain Car) how this novel multi-knowledge transfer algorithm improves the results of the two methods in which it is inspired separately. As an additional result, we show that sequential learning of multiple tasks is generally better than learning from a library of previously learned tasks from scratch.
强化学习中的迁移(Transfer in Reinforcement Learning, RL)旨在解决从头开始学习复杂强化学习任务的问题,由于样本需求巨大,这在大多数情况下是不切实际的。为了克服这一问题,将从一组源任务中获得的知识转移到新的目标任务中是一个核心思想。这些知识可以是策略、模型(状态转换和/或奖励函数),或者在源任务中学习到的价值函数。然而,迁移学习中的算法专注于一次迁移一种类型的知识,尽管从直觉上讲,重用这种知识的几种类型可能会很有趣。为此,本文提出了一种多知识转移强化学习算法,我们称之为策略和模型的概率转移(PTPM)。PTPM与单一知识转移方法不同,它结合了两种类型的知识转移:政策和模型。我们通过在两个众所周知的领域(Grid World和Mountain Car)上的不同实验,展示了这种新颖的多知识转移算法是如何改进两种方法的结果的。作为一个额外的结果,我们表明对多个任务的顺序学习通常比从以前学习过的任务库中从头开始学习要好。
{"title":"Probabilistic Multi-knowledge Transfer in Reinforcement Learning","authors":"Daniel Fernández, F. Fernández, Javier García","doi":"10.1109/ICMLA52953.2021.00079","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00079","url":null,"abstract":"Transfer in Reinforcement Learning (RL) aims to remedy the problem of learning complex RL tasks from scratch, which is impractical in most of the cases due to the huge sample requirements. To overcome this problem, transferring the knowledge acquired from a set of source tasks to a new target task is a core idea. This knowledge can be the policy, the model (state transition and/or reward function), or the value function learned in the source tasks. However, algorithms in transfer learning focus on transferring a single type of knowledge at a time, although intuitively it might be interesting to reuse several types of this knowledge. For this reason, in this paper we propose a multi-knowledge transfer RL algorithm which we call Probabilistic Transfer of Policies and Models (PTPM). PTPM, unlike single-knowledge transfer approaches, combines the transfer of two types of knowledge: policies and models. We show through different experiments on two well-known domains (Grid World and Mountain Car) how this novel multi-knowledge transfer algorithm improves the results of the two methods in which it is inspired separately. As an additional result, we show that sequential learning of multiple tasks is generally better than learning from a library of previously learned tasks from scratch.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"574 1","pages":"471-476"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90421923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automating Questions and Answers of Good and Services Tax system using clustering and embeddings of queries 使用聚类和嵌入查询的商品和服务税系统问答自动化
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00260
Pankaj Dikshit, B. Chandra, M. Gupta
Goods and Services Tax has been introduced for the first time in India in 2017 and it is a major tax reform. There have been a lot of queries posed by the users and response had to be given manually which was a very tedious task. There was a dire need to automate this Question/Answer process in an efficient manner. Embeddings e.g. BERT and ROBERTA have been used for converting the questions to make it efficient for clustering the questions. K-means and Hierarchical clustering techniques have been used for clustering the embeddings of questions, using different distance measures viz. Euclidean and Cosine. Three possible choices for answers for each query have been provided at first, and in the next step the best possible answer has been provided for each test question. Dataset of two months (October and November 2019) is used for automating the process. A high success rate in predicting the answers for the questions has been achieved.
2017年,印度首次引入商品和服务税,这是一项重大的税收改革。用户提出了很多问题,必须手动给出响应,这是一项非常繁琐的任务。我们迫切需要以一种高效的方式自动化这个问答过程。嵌入(例如BERT和ROBERTA)已被用于转换问题,以提高问题聚类的效率。K-means和分层聚类技术已被用于问题嵌入的聚类,使用不同的距离度量,即欧几里得和余弦。首先为每个问题提供了三个可能的答案选择,在下一步中为每个测试问题提供了最佳答案。两个月的数据集(2019年10月和11月)用于自动化流程。在预测问题的答案方面取得了很高的成功率。
{"title":"Automating Questions and Answers of Good and Services Tax system using clustering and embeddings of queries","authors":"Pankaj Dikshit, B. Chandra, M. Gupta","doi":"10.1109/ICMLA52953.2021.00260","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00260","url":null,"abstract":"Goods and Services Tax has been introduced for the first time in India in 2017 and it is a major tax reform. There have been a lot of queries posed by the users and response had to be given manually which was a very tedious task. There was a dire need to automate this Question/Answer process in an efficient manner. Embeddings e.g. BERT and ROBERTA have been used for converting the questions to make it efficient for clustering the questions. K-means and Hierarchical clustering techniques have been used for clustering the embeddings of questions, using different distance measures viz. Euclidean and Cosine. Three possible choices for answers for each query have been provided at first, and in the next step the best possible answer has been provided for each test question. Dataset of two months (October and November 2019) is used for automating the process. A high success rate in predicting the answers for the questions has been achieved.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"1630-1633"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75198162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Analyzing and Improving the Robustness of Tabular Classifiers using Counterfactual Explanations 利用反事实解释分析和改进表分类器的鲁棒性
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00209
P. Rasouli, Ingrid Chieh Yu
Recent studies have revealed that Machine Learning (ML) models are vulnerable to adversarial perturbations. Such perturbations can be intentionally or accidentally added to the original inputs, evading the classifier’s behavior to misclassify the crafted samples. A widely-used solution is to retrain the model using data points generated by various attack strategies. However, this creates a classifier robust to some particular evasions and can not defend unknown or universal perturbations. Counterfactual explanations are a specific class of post-hoc explanation methods that provide minimal modification to the input features in order to obtain a particular outcome from the model. In addition to the resemblance of counterfactual explanations to the universal perturbations, the possibility of generating instances from specific classes makes such approaches suitable for analyzing and improving the model’s robustness. Rather than explaining the model’s decisions in the deployment phase, we utilize the distance information obtained from counterfactuals and propose novel metrics to analyze the robustness of tabular classifiers. Further, we introduce a decision boundary modification approach using customized counterfactual data points to improve the robustness of the models without compromising their accuracy. Our framework addresses the robustness of black-box classifiers in the tabular setting, which is considered an under-explored research area. Through several experiments and evaluations, we demonstrate the efficacy of our approach in analyzing and improving the robustness of black-box tabular classifiers.
最近的研究表明,机器学习(ML)模型容易受到对抗性扰动的影响。这种扰动可以有意或无意地添加到原始输入中,从而避免分类器对精心制作的样本进行错误分类。一种广泛使用的解决方案是使用由各种攻击策略生成的数据点重新训练模型。然而,这创建了一个分类器对某些特定的规避鲁棒性,不能防御未知或普遍的扰动。反事实解释是一种特殊的事后解释方法,它对输入特征进行最小的修改,以便从模型中获得特定的结果。除了反事实解释与普遍扰动的相似性之外,从特定类中生成实例的可能性使得这种方法适合于分析和提高模型的鲁棒性。我们不是在部署阶段解释模型的决策,而是利用从反事实中获得的距离信息并提出新的度量来分析表格分类器的鲁棒性。此外,我们引入了一种使用自定义反事实数据点的决策边界修改方法,以提高模型的鲁棒性而不影响其准确性。我们的框架解决了黑箱分类器在表格设置中的鲁棒性,这被认为是一个尚未开发的研究领域。通过几个实验和评估,我们证明了我们的方法在分析和提高黑箱表分类器的鲁棒性方面的有效性。
{"title":"Analyzing and Improving the Robustness of Tabular Classifiers using Counterfactual Explanations","authors":"P. Rasouli, Ingrid Chieh Yu","doi":"10.1109/ICMLA52953.2021.00209","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00209","url":null,"abstract":"Recent studies have revealed that Machine Learning (ML) models are vulnerable to adversarial perturbations. Such perturbations can be intentionally or accidentally added to the original inputs, evading the classifier’s behavior to misclassify the crafted samples. A widely-used solution is to retrain the model using data points generated by various attack strategies. However, this creates a classifier robust to some particular evasions and can not defend unknown or universal perturbations. Counterfactual explanations are a specific class of post-hoc explanation methods that provide minimal modification to the input features in order to obtain a particular outcome from the model. In addition to the resemblance of counterfactual explanations to the universal perturbations, the possibility of generating instances from specific classes makes such approaches suitable for analyzing and improving the model’s robustness. Rather than explaining the model’s decisions in the deployment phase, we utilize the distance information obtained from counterfactuals and propose novel metrics to analyze the robustness of tabular classifiers. Further, we introduce a decision boundary modification approach using customized counterfactual data points to improve the robustness of the models without compromising their accuracy. Our framework addresses the robustness of black-box classifiers in the tabular setting, which is considered an under-explored research area. Through several experiments and evaluations, we demonstrate the efficacy of our approach in analyzing and improving the robustness of black-box tabular classifiers.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"8 1","pages":"1286-1293"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75449708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
ComFu: Improving Visual Clustering by Commonality Fusion ComFu:通过共性融合改进视觉聚类
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00030
Chunchun Li, Manuel Günther, T. Boult
Clustering has a long history in the computer vision community with a myriad of applications. Clustering is a family of unsupervised machine learning techniques that group samples based on similarity. Multiple ad hoc techniques have been developed to combine or fuse clustering algorithms with dozens of different clustering techniques. This paper presents a new formalization of clustering fusion and introduces the novel Commonality Fusion (ComFu) technique to combine the advantages of different clustering algorithms by fusing their results on datasets. ComFu builds a pairwise commonality matrix of samples by computing how many clustering algorithms group each pair together. Using this matrix, ComFu builds initial clusters of points with high commonality and then assigns points with low commonality to clusters with the highest average commonality to those points with an automatic distance measure selection process. We start experiments by comparing ComFu with the prior state-of-the-art cluster fusion algorithms on eight UCI datasets. We then evaluate ComFu on practical vision clustering problems, advancing the state-of-the-art on a wide range of applications including clustering faces in the IJB-B dataset. We apply ComFu to fuse FINCH, the state-of-the-art ”parameter-free” approach, which returns multiple partitions and can use multiple distance metrics, and show that ComFu improves their result by fusing over metrics and partitions.
聚类在计算机视觉领域有着悠久的历史,有着无数的应用。聚类是一种基于相似性对样本进行分组的无监督机器学习技术。已经开发了多种ad hoc技术,将聚类算法与数十种不同的聚类技术相结合或融合。本文提出了一种新的聚类融合形式化方法,并引入了一种新的共性融合(ComFu)技术,通过在数据集上融合不同聚类算法的结果来结合不同聚类算法的优点。ComFu通过计算有多少聚类算法将每对样本分组在一起来构建样本的成对共性矩阵。使用该矩阵,ComFu构建具有高共性点的初始聚类,然后通过自动距离度量选择过程将低共性点分配给具有最高平均共性点的聚类。我们首先在8个UCI数据集上比较了ComFu和之前最先进的聚类融合算法。然后,我们在实际的视觉聚类问题上评估了ComFu,在广泛的应用中推进了最先进的技术,包括在IJB-B数据集中聚类人脸。我们将ComFu应用于融合FINCH,这是一种最先进的“无参数”方法,它返回多个分区并可以使用多个距离指标,并表明ComFu通过融合指标和分区来改善结果。
{"title":"ComFu: Improving Visual Clustering by Commonality Fusion","authors":"Chunchun Li, Manuel Günther, T. Boult","doi":"10.1109/ICMLA52953.2021.00030","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00030","url":null,"abstract":"Clustering has a long history in the computer vision community with a myriad of applications. Clustering is a family of unsupervised machine learning techniques that group samples based on similarity. Multiple ad hoc techniques have been developed to combine or fuse clustering algorithms with dozens of different clustering techniques. This paper presents a new formalization of clustering fusion and introduces the novel Commonality Fusion (ComFu) technique to combine the advantages of different clustering algorithms by fusing their results on datasets. ComFu builds a pairwise commonality matrix of samples by computing how many clustering algorithms group each pair together. Using this matrix, ComFu builds initial clusters of points with high commonality and then assigns points with low commonality to clusters with the highest average commonality to those points with an automatic distance measure selection process. We start experiments by comparing ComFu with the prior state-of-the-art cluster fusion algorithms on eight UCI datasets. We then evaluate ComFu on practical vision clustering problems, advancing the state-of-the-art on a wide range of applications including clustering faces in the IJB-B dataset. We apply ComFu to fuse FINCH, the state-of-the-art ”parameter-free” approach, which returns multiple partitions and can use multiple distance metrics, and show that ComFu improves their result by fusing over metrics and partitions.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"143-150"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75728483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Condition Monitoring for Power Converters via Deep One-Class Classification 基于深度一类分类的电力变流器状态监测
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00244
Nikola Marković, D. Vahle, V. Staudt, D. Kolossa
We introduce a novel hybrid approach for the early detection of power converter faults, focusing on the use case of modular multilevel converters. The proposed method is based on training a deep one-class classifier, which learns the characteristics of the normal system operation and can hence recognize deviations even without any training on potential fault conditions of the system. In order to achieve robust and reliable performance, the diagnosis of the system state utilizes short sequences of observations, which are combined through a probabilistic model. The decision about the system state can then take the form of monitoring the T2 test statistics, which allows us to control the maximum classification error. This proposed method, Reliability-guided One-Class Classification (ROCC) was tested on data recorded from a Modular Multilevel Converter. The approach is shown to be effective in all test cases, leading to reliable diagnostics even though the classifier is applied to a wide range of unseen conditions.
以模块化多电平变换器为例,介绍了一种新型的混合故障早期检测方法。该方法基于深度单类分类器的训练,该分类器学习系统正常运行的特征,因此即使不需要对系统的潜在故障条件进行任何训练,也可以识别偏差。为了实现鲁棒和可靠的性能,系统状态的诊断利用短序列的观测,这些观测通过概率模型组合。然后,关于系统状态的决策可以采用监视T2测试统计数据的形式,这允许我们控制最大分类错误。在模块化多电平变换器记录的数据上,对所提出的可靠性指导一类分类方法(ROCC)进行了验证。该方法在所有测试用例中都被证明是有效的,即使分类器应用于大量未见过的条件,也会导致可靠的诊断。
{"title":"Condition Monitoring for Power Converters via Deep One-Class Classification","authors":"Nikola Marković, D. Vahle, V. Staudt, D. Kolossa","doi":"10.1109/ICMLA52953.2021.00244","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00244","url":null,"abstract":"We introduce a novel hybrid approach for the early detection of power converter faults, focusing on the use case of modular multilevel converters. The proposed method is based on training a deep one-class classifier, which learns the characteristics of the normal system operation and can hence recognize deviations even without any training on potential fault conditions of the system. In order to achieve robust and reliable performance, the diagnosis of the system state utilizes short sequences of observations, which are combined through a probabilistic model. The decision about the system state can then take the form of monitoring the T2 test statistics, which allows us to control the maximum classification error. This proposed method, Reliability-guided One-Class Classification (ROCC) was tested on data recorded from a Modular Multilevel Converter. The approach is shown to be effective in all test cases, leading to reliable diagnostics even though the classifier is applied to a wide range of unseen conditions.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"34 1","pages":"1513-1520"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72913480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Effects of COVID-19 on individuals in Opioid Addiction Recovery COVID-19对阿片类药物成瘾康复个体的影响
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00216
Khaled Mohammed Saifuddin, Esra Akbas, Max Khanov, J. Beaman
Opioid Use Disorder (OUD) is one of the most severe health care problems in the USA. People addicted to opioids need various treatments, including Medication-Assisted Treatment (MAT), proper counseling, and behavioral therapies. However, during the peak time of the COVID-19 pandemic, the supply of emergency medications was disrupted seriously. Patients faced severe medical care scarcity since many pharmaceutical companies, drugstores, and local pharmacies were closed. Import-export was also canceled to consent to the government emergency law, i.e., lockdown, quarantine, and isolation. These circumstances and their negative effects on OUD patient’s psychology could have led them to a drop out of MAT medications and persuaded to resume illicit opioid use. This project involves collecting and analyzing a large volume of Twitter data related to MAT medications for OUD patients. We discover the Active MAT Medicine Users (AMMUs) on twitter. For this, we build a seed dictionary of words related to OUD and MAT and apply association rules to expand it. Further, AMMUs’ tweet posts are studied ‘before the pandemic’ (BP) and ‘during the pandemic’ (DP) to understand how the drug behaviors and habits have changed due to COVID-19. We also perform sentiment analysis on Tweets to determine the impact of the COVID-19 pandemic on the psychology of AMMUs. Our analysis shows that the use of MAT medications has decreased around 30.54%, where the use of illicit drugs and other prescription opioids increased 18.06% and 12.12%, respectively, based on AMMUs’ tweets posted during the lockdown compared with before the lockdown statistics. The COVID-19 pandemic and lockdown may result in the resumption of illegal and prescription opioid abuse by OUD patients. Necessary steps and precautions should be taken by health care providers to ensure the emergency supply of medicines and also psychological support and thus prevent patients from illicit opioid use.
阿片类药物使用障碍(OUD)是美国最严重的医疗问题之一。对阿片类药物成瘾的人需要各种治疗,包括药物辅助治疗(MAT)、适当的咨询和行为治疗。然而,在新冠肺炎疫情高峰期,应急药品供应严重中断。由于许多制药公司、药店和地方药店关闭,患者面临严重的医疗短缺。根据政府的紧急状态法(封锁、隔离、隔离),进出口也被取消。这些情况及其对OUD患者心理的负面影响可能导致他们放弃MAT药物并被说服重新使用非法阿片类药物。该项目涉及收集和分析大量与OUD患者MAT药物相关的Twitter数据。我们在推特上发现活跃的MAT药物用户(AMMUs)。为此,我们构建了一个与OUD和MAT相关的单词种子字典,并应用关联规则对其进行扩展。此外,研究了“大流行前”(BP)和“大流行期间”(DP) AMMUs的推文,以了解因COVID-19而导致的药物行为和习惯的变化。我们还对推文进行情绪分析,以确定COVID-19大流行对ammu心理的影响。我们的分析显示,根据封锁期间AMMUs发布的推文,与封锁前的统计数据相比,MAT药物的使用减少了约30.54%,其中非法药物和其他处方阿片类药物的使用分别增加了18.06%和12.12%。COVID-19大流行和封锁可能导致OUD患者恢复非法和处方阿片类药物滥用。保健提供者应采取必要步骤和预防措施,确保紧急提供药品和心理支持,从而防止患者非法使用阿片类药物。
{"title":"Effects of COVID-19 on individuals in Opioid Addiction Recovery","authors":"Khaled Mohammed Saifuddin, Esra Akbas, Max Khanov, J. Beaman","doi":"10.1109/ICMLA52953.2021.00216","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00216","url":null,"abstract":"Opioid Use Disorder (OUD) is one of the most severe health care problems in the USA. People addicted to opioids need various treatments, including Medication-Assisted Treatment (MAT), proper counseling, and behavioral therapies. However, during the peak time of the COVID-19 pandemic, the supply of emergency medications was disrupted seriously. Patients faced severe medical care scarcity since many pharmaceutical companies, drugstores, and local pharmacies were closed. Import-export was also canceled to consent to the government emergency law, i.e., lockdown, quarantine, and isolation. These circumstances and their negative effects on OUD patient’s psychology could have led them to a drop out of MAT medications and persuaded to resume illicit opioid use. This project involves collecting and analyzing a large volume of Twitter data related to MAT medications for OUD patients. We discover the Active MAT Medicine Users (AMMUs) on twitter. For this, we build a seed dictionary of words related to OUD and MAT and apply association rules to expand it. Further, AMMUs’ tweet posts are studied ‘before the pandemic’ (BP) and ‘during the pandemic’ (DP) to understand how the drug behaviors and habits have changed due to COVID-19. We also perform sentiment analysis on Tweets to determine the impact of the COVID-19 pandemic on the psychology of AMMUs. Our analysis shows that the use of MAT medications has decreased around 30.54%, where the use of illicit drugs and other prescription opioids increased 18.06% and 12.12%, respectively, based on AMMUs’ tweets posted during the lockdown compared with before the lockdown statistics. The COVID-19 pandemic and lockdown may result in the resumption of illegal and prescription opioid abuse by OUD patients. Necessary steps and precautions should be taken by health care providers to ensure the emergency supply of medicines and also psychological support and thus prevent patients from illicit opioid use.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"2 2 1","pages":"1333-1340"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79473291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1