首页 > 最新文献

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
Modeling approaches for Silent Attrition prediction in Payment networks 支付网络中无声损耗预测的建模方法
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00070
L. Dheekollu, H. Wadhwa, Siddharth Vimal, Anubhav Gupta, Siddhartha Asthana, Ankur Arora, Smriti Gupta
Predicting customer attrition (churn) is a well known problem in industries that provide services, like financial institutions, telecommunications, e-commerce, and retail. There are two kinds of attrition - active and passive (silent). Active attrition is usually associated with subscription-based business models, commonly seen in telecommunications and internet industries like Netflix. In industries like finance, retail, and ecommerce, we see the other kind of attrition - silent attrition where customers stop doing business without formal notice. This makes the silent attrition prediction problem even more challenging because it is difficult to differentiate between attrited and inactive customers. We focus our work on predicting silent attrition which is still under-explored in the payment card industry (i.e. Mastercard, Visa). The contribution of our work is threefold. First, we present a data-driven approach to define silent attrition as customer inactivity. Second, we discussed multiple procedures to generate synthetic data thereby preserving customers’ privacy. At last, we presented a comprehensive view of various machine learning (ML) pathways in which this churn prediction problem can be framed and solved; each requiring a specific feature engineering. We presented experimental results corresponding to each pathway to comparative analysis. We believe that this work to be beneficial to the researchers and ML practitioners who often have to deal with sensitive financial data but have limited permission to use it. In this direction, we demonstrated the use of synthetic data generation to reduce the risk of data leakage and other privacy concerns relating to ML models development.
在金融机构、电信、电子商务和零售等提供服务的行业中,预测客户流失是一个众所周知的问题。有两种损耗——主动损耗和被动损耗(无声损耗)。主动流失通常与基于订阅的商业模式有关,通常见于电信和互联网行业,如Netflix。在金融、零售和电子商务等行业,我们看到了另一种流失——无声流失,即客户在没有正式通知的情况下停止做生意。这使得沉默的流失预测问题更具挑战性,因为很难区分流失和不活跃的客户。我们的工作重点是预测无声损耗,这在支付卡行业(如万事达,Visa)仍未得到充分探索。我们的工作有三方面的贡献。首先,我们提出了一种数据驱动的方法,将沉默的流失定义为客户不活动。其次,我们讨论了生成合成数据从而保护客户隐私的多个程序。最后,我们提出了各种机器学习(ML)途径的综合视图,其中可以构建和解决这种流失预测问题;每个都需要一个特定的特征工程。我们给出了各途径对应的实验结果进行对比分析。我们相信这项工作对研究人员和机器学习从业者是有益的,他们经常需要处理敏感的财务数据,但使用权限有限。在这个方向上,我们演示了使用合成数据生成来降低数据泄露的风险和与ML模型开发相关的其他隐私问题。
{"title":"Modeling approaches for Silent Attrition prediction in Payment networks","authors":"L. Dheekollu, H. Wadhwa, Siddharth Vimal, Anubhav Gupta, Siddhartha Asthana, Ankur Arora, Smriti Gupta","doi":"10.1109/ICMLA52953.2021.00070","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00070","url":null,"abstract":"Predicting customer attrition (churn) is a well known problem in industries that provide services, like financial institutions, telecommunications, e-commerce, and retail. There are two kinds of attrition - active and passive (silent). Active attrition is usually associated with subscription-based business models, commonly seen in telecommunications and internet industries like Netflix. In industries like finance, retail, and ecommerce, we see the other kind of attrition - silent attrition where customers stop doing business without formal notice. This makes the silent attrition prediction problem even more challenging because it is difficult to differentiate between attrited and inactive customers. We focus our work on predicting silent attrition which is still under-explored in the payment card industry (i.e. Mastercard, Visa). The contribution of our work is threefold. First, we present a data-driven approach to define silent attrition as customer inactivity. Second, we discussed multiple procedures to generate synthetic data thereby preserving customers’ privacy. At last, we presented a comprehensive view of various machine learning (ML) pathways in which this churn prediction problem can be framed and solved; each requiring a specific feature engineering. We presented experimental results corresponding to each pathway to comparative analysis. We believe that this work to be beneficial to the researchers and ML practitioners who often have to deal with sensitive financial data but have limited permission to use it. In this direction, we demonstrated the use of synthetic data generation to reduce the risk of data leakage and other privacy concerns relating to ML models development.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"64 1","pages":"409-414"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91086362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trade-offs in Metric Learning for Bearing Fault Diagnosis 度量学习在轴承故障诊断中的权衡
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00180
Tyler Cody, Stephen C. Adams, P. Beling
Metric learning is a well-developed field in machine learning and has seen recent application in the area of prognostics and health management (PHM). Metric learning allows for fault diagnosis or condition monitoring models to be developed with the assumption that a machine- or load-specific similarity metric can be learned after model deployment. Existing literature has used metric learning to fine-tune deep learning models to address machine-to-machine differences and differences in working conditions. Here, we study metric learning in isolation, not as an intermediate step in deep learning, by conducting a comparative study of Principal Component Analysis (PCA), Neighborhood Component Analysis (NCA), Local Fisher Discriminant Analysis (LFDA), and Large Margin Nearest Neighbor (LMNN). We consider performance metrics for prediction performance, cluster performance, feature sensitivity, sample efficiency, and latent space efficiency. We find that linear partitions on the latent spaces learned via metric learning are able to achieve accuracies greater than 90% on Case Western Reserve University’s bearing fault data set using only the drive-end vibration signal. We find PCA to be dominated by metric learning algorithms for all working loads considered. And, in sum, we demonstrate classical metric learning algorithms to be a promising approach for learning machine-and load-specific similarity metrics for PHM with minor data processing and small samples.
度量学习是机器学习中一个发展良好的领域,最近在预测和健康管理(PHM)领域得到了应用。度量学习允许在假设可以在模型部署后学习特定于机器或负载的相似性度量的情况下开发故障诊断或状态监测模型。现有文献已经使用度量学习来微调深度学习模型,以解决机器对机器的差异和工作条件的差异。在这里,我们通过对主成分分析(PCA)、邻域成分分析(NCA)、局部Fisher判别分析(LFDA)和大边际最近邻(LMNN)进行比较研究,孤立地研究度量学习,而不是作为深度学习的中间步骤。我们考虑了预测性能、聚类性能、特征灵敏度、样本效率和潜在空间效率的性能指标。我们发现,通过度量学习获得的潜在空间上的线性划分能够在仅使用驱动端振动信号的Case西储大学轴承故障数据集上实现大于90%的精度。我们发现PCA是由度量学习算法主导的所有工作负载考虑。总而言之,我们证明了经典度量学习算法是一种很有前途的方法,用于学习具有少量数据处理和小样本的PHM的机器和负载特定相似性度量。
{"title":"Trade-offs in Metric Learning for Bearing Fault Diagnosis","authors":"Tyler Cody, Stephen C. Adams, P. Beling","doi":"10.1109/ICMLA52953.2021.00180","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00180","url":null,"abstract":"Metric learning is a well-developed field in machine learning and has seen recent application in the area of prognostics and health management (PHM). Metric learning allows for fault diagnosis or condition monitoring models to be developed with the assumption that a machine- or load-specific similarity metric can be learned after model deployment. Existing literature has used metric learning to fine-tune deep learning models to address machine-to-machine differences and differences in working conditions. Here, we study metric learning in isolation, not as an intermediate step in deep learning, by conducting a comparative study of Principal Component Analysis (PCA), Neighborhood Component Analysis (NCA), Local Fisher Discriminant Analysis (LFDA), and Large Margin Nearest Neighbor (LMNN). We consider performance metrics for prediction performance, cluster performance, feature sensitivity, sample efficiency, and latent space efficiency. We find that linear partitions on the latent spaces learned via metric learning are able to achieve accuracies greater than 90% on Case Western Reserve University’s bearing fault data set using only the drive-end vibration signal. We find PCA to be dominated by metric learning algorithms for all working loads considered. And, in sum, we demonstrate classical metric learning algorithms to be a promising approach for learning machine-and load-specific similarity metrics for PHM with minor data processing and small samples.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"23 1","pages":"1100-1105"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82140624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
GAN Based Approach for Drug Design 基于GAN的药物设计方法
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00136
Aninditha Ramesh, Anusha S. Rao, Sanjana Moudgalya, K. S. Srinivas
Deep Learning models have been a tremendous breakthrough in the field of Drug discovery, greatly simplifying the pre-clinical phase of this intricate task. With an intention to ease this further, we introduce a novel method to generate target-specific molecules using a Generative Adversarial Network (GAN). The dataset consists of drugs whose target proteins belong to the class of Tyrosine kinase and are specifically active against some of the growth factor receptors present in the human body. An Autoencoder network is used to learn the embeddings of the drug which is represented in the SMILES format and the deep neural network GAN is used to generate structurally valid molecules using drug-target interaction as the validating criteria. The model has successfully produced 39 novel structures and 15 of them show satisfactory binding with at least one of the target receptors.
深度学习模型在药物发现领域取得了巨大的突破,极大地简化了这一复杂任务的临床前阶段。为了进一步缓解这一问题,我们引入了一种使用生成对抗网络(GAN)生成目标特异性分子的新方法。该数据集由靶蛋白属于酪氨酸激酶类的药物组成,这些药物对人体中存在的一些生长因子受体具有特异性活性。使用自编码器网络学习以SMILES格式表示的药物嵌入,并使用深度神经网络GAN以药物-靶标相互作用为验证标准生成结构有效的分子。该模型成功地产生了39个新结构,其中15个与至少一种靶受体表现出满意的结合。
{"title":"GAN Based Approach for Drug Design","authors":"Aninditha Ramesh, Anusha S. Rao, Sanjana Moudgalya, K. S. Srinivas","doi":"10.1109/ICMLA52953.2021.00136","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00136","url":null,"abstract":"Deep Learning models have been a tremendous breakthrough in the field of Drug discovery, greatly simplifying the pre-clinical phase of this intricate task. With an intention to ease this further, we introduce a novel method to generate target-specific molecules using a Generative Adversarial Network (GAN). The dataset consists of drugs whose target proteins belong to the class of Tyrosine kinase and are specifically active against some of the growth factor receptors present in the human body. An Autoencoder network is used to learn the embeddings of the drug which is represented in the SMILES format and the deep neural network GAN is used to generate structurally valid molecules using drug-target interaction as the validating criteria. The model has successfully produced 39 novel structures and 15 of them show satisfactory binding with at least one of the target receptors.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"10 1","pages":"825-828"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81938131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Kernel ridge reconstruction for anomaly detection: general and low computational reconstruction 异常检测的核脊重建:一般和低计算重建
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00036
Yasutaka Furusho, Shuhei Nitta, Y. Sakata
Autoencoders (AEs) have been widely used for anomaly detection because models trained to reconstruct a normal data are expected to have a higher reconstruction error for anomalous data than that for normal data, and the higher error is adopted as a criterion for identifying anomalies. However, the high capacity of AEs is sometimes able to reconstruct anomalous data even when trained only on normal data, which leads to overlooked anomalies. To remedy this problem, we propose a kernel ridge reconstruction (KRR) approach for general, high-performance, and low computational anomaly detection. KRR replaces the non-linear decoder network of the AE with a linear regressor, which uses the weighted sum of training normal data for reconstruction, and thus prevents the reconstruction of anomalous data. We also reveal the desired property of the encoder for KRR to achieve high anomaly detection performance and propose an effective training algorithm to realize such property by instance discrimination and feature decorrelation. In addition, KRR reduces the computational cost because it replaces the non-linear decoder network with a linear regressor. Our experiments on MNIST, CIFAR10, and KDDCup99 datasets prove its applicability, high performance, and low computational cost. In particular, KRR achieved an area under the curve (AUC) of 0.670 with 12 millions multiply-accumulate operations (MACs) on the CIFAR10 dataset, outperforming a recent reconstruction-based anomaly detection method (MemAE) with a 1.1-fold higher AUC and 0.291 as many MACs.
自编码器(ae)被广泛用于异常检测,因为用于重建正常数据的训练模型对异常数据的重建误差比正常数据的重建误差要高,并且更高的误差被用作识别异常的标准。然而,即使只在正常数据上训练,高容量的ae有时也能够重建异常数据,这导致了被忽视的异常。为了解决这个问题,我们提出了一种核脊重构(KRR)方法,用于通用、高性能和低计算的异常检测。KRR用线性回归器代替声发射的非线性解码器网络,利用训练正常数据的加权和进行重构,从而避免了异常数据的重构。我们还揭示了KRR编码器为实现高异常检测性能所需要的特性,并提出了一种有效的训练算法,通过实例识别和特征去相关来实现这种特性。此外,由于KRR用线性回归器代替了非线性解码器网络,降低了计算成本。我们在MNIST、CIFAR10和KDDCup99数据集上的实验证明了它的适用性、高性能和低计算成本。特别是,KRR在CIFAR10数据集上通过1200万次乘法累积操作(mac)获得了0.670的曲线下面积(AUC),优于最近基于重建的异常检测方法(MemAE), AUC高1.1倍,mac数为0.291。
{"title":"Kernel ridge reconstruction for anomaly detection: general and low computational reconstruction","authors":"Yasutaka Furusho, Shuhei Nitta, Y. Sakata","doi":"10.1109/ICMLA52953.2021.00036","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00036","url":null,"abstract":"Autoencoders (AEs) have been widely used for anomaly detection because models trained to reconstruct a normal data are expected to have a higher reconstruction error for anomalous data than that for normal data, and the higher error is adopted as a criterion for identifying anomalies. However, the high capacity of AEs is sometimes able to reconstruct anomalous data even when trained only on normal data, which leads to overlooked anomalies. To remedy this problem, we propose a kernel ridge reconstruction (KRR) approach for general, high-performance, and low computational anomaly detection. KRR replaces the non-linear decoder network of the AE with a linear regressor, which uses the weighted sum of training normal data for reconstruction, and thus prevents the reconstruction of anomalous data. We also reveal the desired property of the encoder for KRR to achieve high anomaly detection performance and propose an effective training algorithm to realize such property by instance discrimination and feature decorrelation. In addition, KRR reduces the computational cost because it replaces the non-linear decoder network with a linear regressor. Our experiments on MNIST, CIFAR10, and KDDCup99 datasets prove its applicability, high performance, and low computational cost. In particular, KRR achieved an area under the curve (AUC) of 0.670 with 12 millions multiply-accumulate operations (MACs) on the CIFAR10 dataset, outperforming a recent reconstruction-based anomaly detection method (MemAE) with a 1.1-fold higher AUC and 0.291 as many MACs.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"73 1","pages":"185-190"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76213444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting YOLO Misdetection by Learning Grid Cell Consensus 学习网格单元一致性预测YOLO误检
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00107
B. Paudel, Danushka Senarathna, Haibo Wang, S. Tragoudas, Yao Hu, Shengbing Jiang
Despite the immense performance improvement of deep learning-based object detection, the state-of-the-art object detection systems are still prone to misdetections. This work presents a method to predict such misdetections at run-time by using a small network, referred to as ConsensusNet, to learn the correlation patterns or consensus of neighboring detections before non-maximum suppression (NMS). Based on such correlations, ConsensusNet predicts if there are misdetection failures. The proposed method is experimentally evaluated considering single person class from COCO dataset and using YOLOv3 as the object detection system. It shows the proposed method can achieve accuracy of 84.6% and the performance measured in other metrics are also promising. To the best of our knowledge, ConsensusNet is the first network reported for predicting misdetections in object detection.
尽管基于深度学习的目标检测的性能有了巨大的提高,但最先进的目标检测系统仍然容易出现误检测。这项工作提出了一种在运行时预测这种错误检测的方法,通过使用一个小网络,称为ConsensusNet,在非最大抑制(NMS)之前学习相邻检测的相关模式或一致性。基于这种相关性,ConsensusNet可以预测是否存在误检失败。采用YOLOv3作为目标检测系统,对该方法进行了实验验证。结果表明,该方法的准确率达到84.6%,在其他指标上的测试结果也很有希望。据我们所知,ConsensusNet是第一个用于预测物体检测中的误检的网络。
{"title":"Predicting YOLO Misdetection by Learning Grid Cell Consensus","authors":"B. Paudel, Danushka Senarathna, Haibo Wang, S. Tragoudas, Yao Hu, Shengbing Jiang","doi":"10.1109/ICMLA52953.2021.00107","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00107","url":null,"abstract":"Despite the immense performance improvement of deep learning-based object detection, the state-of-the-art object detection systems are still prone to misdetections. This work presents a method to predict such misdetections at run-time by using a small network, referred to as ConsensusNet, to learn the correlation patterns or consensus of neighboring detections before non-maximum suppression (NMS). Based on such correlations, ConsensusNet predicts if there are misdetection failures. The proposed method is experimentally evaluated considering single person class from COCO dataset and using YOLOv3 as the object detection system. It shows the proposed method can achieve accuracy of 84.6% and the performance measured in other metrics are also promising. To the best of our knowledge, ConsensusNet is the first network reported for predicting misdetections in object detection.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"134 1","pages":"643-648"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77894493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
LE-CapsNet: A Light and Enhanced Capsule Network LE-CapsNet:一个轻型和增强胶囊网络
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00280
Pouya Shiri, A. Baniasadi
Capsule Network (CapsNet) classifier has several advantages over CNNs, including better detection of images containing overlapping categories and higher accuracy on transformed images. Despite the advantages, CapsNet is slow due to its different structure. In addition, CapsNet is resource-hungry, includes many parameters and lags in accuracy compared to CNNs. In this work, we propose LE-CapsNet as a light, enhanced and more accurate variant of CapsNet. Using 3.8M weights, LECapsNet obtains 76.73% accuracy on the CIFAR-10 dataset while performing inference 4x faster than CapsNet. In addition, our proposed network is more robust at detecting images with affine transformations compared to CapsNet. We achieve 94.37% accuracy on the AffNIST dataset (compared to CapsNet’s 90.52%).
胶囊网络(Capsule Network, CapsNet)分类器与cnn相比有几个优点,包括更好地检测包含重叠类别的图像,以及对转换后的图像有更高的准确率。尽管有这些优势,但由于其结构不同,CapsNet速度较慢。此外,CapsNet需要大量的资源,包含了很多参数,并且与cnn相比在精度上有一定的滞后。在这项工作中,我们提出LE-CapsNet作为CapsNet的轻量级,增强和更准确的变体。使用3.8M权值,LECapsNet在CIFAR-10数据集上获得76.73%的准确率,同时执行推理的速度比CapsNet快4倍。此外,与CapsNet相比,我们提出的网络在检测具有仿射变换的图像方面具有更强的鲁棒性。我们在AffNIST数据集上实现了94.37%的准确率(与CapsNet的90.52%相比)。
{"title":"LE-CapsNet: A Light and Enhanced Capsule Network","authors":"Pouya Shiri, A. Baniasadi","doi":"10.1109/ICMLA52953.2021.00280","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00280","url":null,"abstract":"Capsule Network (CapsNet) classifier has several advantages over CNNs, including better detection of images containing overlapping categories and higher accuracy on transformed images. Despite the advantages, CapsNet is slow due to its different structure. In addition, CapsNet is resource-hungry, includes many parameters and lags in accuracy compared to CNNs. In this work, we propose LE-CapsNet as a light, enhanced and more accurate variant of CapsNet. Using 3.8M weights, LECapsNet obtains 76.73% accuracy on the CIFAR-10 dataset while performing inference 4x faster than CapsNet. In addition, our proposed network is more robust at detecting images with affine transformations compared to CapsNet. We achieve 94.37% accuracy on the AffNIST dataset (compared to CapsNet’s 90.52%).","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"2 1","pages":"1767-1772"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73354851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards Sequential Multivariate Fault Prediction for Vehicular Predictive Maintenance 面向车辆预测性维修的序列多变量故障预测研究
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00167
A. Hafeez, Eduardo Alonso, Aram Ter-Sarkisov
Predictive maintenance, which has traditionally used anomaly detection methods on sensory data, is now being replaced by event-based techniques. These methods utilise events with multiple temporal (and often non-numeric) features, produced by diagnostic modules. This raises the need of learning numerical event representations to predict the next fault event in industrial machines, specially vehicles, that use Diagnostic Trouble Codes (DTCs). We propose a predictive maintenance approach, named Sequential Multivariate Fault Prediction (SMFP), for predicting the next multivariate DTC fault in an event sequence, using Long Short-Term Memory Networks (LSTMs) and jointly learned event embeddings. By performing an in-depth comparison of different architectural choices and contextual preprocessing techniques, we provide an initial baseline for SMFP that achieves top-3 accuracy of 63% on predicting multivariate fault with 3 collective output layers, using vehicle maintenance data as a case study.
预测性维护传统上使用基于感官数据的异常检测方法,现在正在被基于事件的技术所取代。这些方法利用由诊断模块产生的具有多个时间(通常是非数字)特征的事件。这就提出了学习数字事件表示来预测工业机器,特别是使用诊断故障代码(dtc)的车辆中的下一个故障事件的需求。我们提出了一种预测性维护方法,称为顺序多变量故障预测(SMFP),用于预测事件序列中的下一个多变量DTC故障,使用长短期记忆网络(LSTMs)和联合学习的事件嵌入。通过对不同的架构选择和上下文预处理技术进行深入比较,我们为SMFP提供了一个初始基线,该基线在预测3个集体输出层的多变量故障时达到了63%的前3名准确率,并以车辆维护数据为例进行了研究。
{"title":"Towards Sequential Multivariate Fault Prediction for Vehicular Predictive Maintenance","authors":"A. Hafeez, Eduardo Alonso, Aram Ter-Sarkisov","doi":"10.1109/ICMLA52953.2021.00167","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00167","url":null,"abstract":"Predictive maintenance, which has traditionally used anomaly detection methods on sensory data, is now being replaced by event-based techniques. These methods utilise events with multiple temporal (and often non-numeric) features, produced by diagnostic modules. This raises the need of learning numerical event representations to predict the next fault event in industrial machines, specially vehicles, that use Diagnostic Trouble Codes (DTCs). We propose a predictive maintenance approach, named Sequential Multivariate Fault Prediction (SMFP), for predicting the next multivariate DTC fault in an event sequence, using Long Short-Term Memory Networks (LSTMs) and jointly learned event embeddings. By performing an in-depth comparison of different architectural choices and contextual preprocessing techniques, we provide an initial baseline for SMFP that achieves top-3 accuracy of 63% on predicting multivariate fault with 3 collective output layers, using vehicle maintenance data as a case study.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"309 1","pages":"1016-1021"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75682546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Detecting Offensive Content on Twitter During Proud Boys Riots 在骄傲男孩骚乱期间检测Twitter上的攻击性内容
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00253
M. Fahim, S. Gokhale
Hateful and offensive speech on online social media platforms has seen a rise in the recent years. Often used to convey humor through sarcasm or to emphasize a point, offensive speech may also be employed to insult, deride and mock alternate points of view. In turbulent and chaotic circumstances, insults and mockery can lead to violence and unrest, and hence, such speech must be identified and tagged to limit its damage. This paper presents an application of machine learning to detect hateful and offensive content from Twitter feeds shared after the protests by Proud Boys, an extremist, ideological and violent hate group. A comprehensive coding guide, consolidating definitions of what constitutes offensive content based on the potential to trigger and incite people is developed and used to label the tweets. Linguistic, auxiliary and social features extracted from these labeled tweets were used to train machine learning classifiers, which detect offensive content with an accuracy of about 92%. An analysis of the importance scores reveals that offensiveness is pre-dominantly a function of words and their combinations, rather than meta features such as punctuations and quotes. This observation can form the foundation of pre-trained classifiers that can be deployed to automatically detect offensive speech in new and unforeseen circumstances.
近年来,在线社交媒体平台上的仇恨和攻击性言论有所增加。攻击性言语通常用于通过讽刺来表达幽默或强调一个观点,也可用于侮辱、嘲笑和嘲笑不同的观点。在动荡和混乱的环境中,侮辱和嘲弄可能导致暴力和动荡,因此,必须识别和标记此类言论,以限制其损害。本文介绍了一种机器学习的应用,用于检测极端主义、意识形态和暴力仇恨团体Proud Boys抗议后分享的Twitter feed中的仇恨和攻击性内容。一份全面的编码指南,根据触发和煽动人们的可能性,巩固了构成冒犯性内容的定义,并用于标记推文。从这些标记的推文中提取的语言、辅助和社交特征被用来训练机器学习分类器,它检测攻击性内容的准确率约为92%。对重要性分数的分析表明,冒犯性主要是单词及其组合的功能,而不是标点和引号等元特征。这种观察可以形成预训练分类器的基础,可以部署在新的和不可预见的情况下自动检测攻击性言论。
{"title":"Detecting Offensive Content on Twitter During Proud Boys Riots","authors":"M. Fahim, S. Gokhale","doi":"10.1109/ICMLA52953.2021.00253","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00253","url":null,"abstract":"Hateful and offensive speech on online social media platforms has seen a rise in the recent years. Often used to convey humor through sarcasm or to emphasize a point, offensive speech may also be employed to insult, deride and mock alternate points of view. In turbulent and chaotic circumstances, insults and mockery can lead to violence and unrest, and hence, such speech must be identified and tagged to limit its damage. This paper presents an application of machine learning to detect hateful and offensive content from Twitter feeds shared after the protests by Proud Boys, an extremist, ideological and violent hate group. A comprehensive coding guide, consolidating definitions of what constitutes offensive content based on the potential to trigger and incite people is developed and used to label the tweets. Linguistic, auxiliary and social features extracted from these labeled tweets were used to train machine learning classifiers, which detect offensive content with an accuracy of about 92%. An analysis of the importance scores reveals that offensiveness is pre-dominantly a function of words and their combinations, rather than meta features such as punctuations and quotes. This observation can form the foundation of pre-trained classifiers that can be deployed to automatically detect offensive speech in new and unforeseen circumstances.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"44 3","pages":"1582-1587"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72631341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Deep Learning Methods for the Prediction of Information Display Type Using Eye Tracking Sequences 基于眼动追踪序列的信息显示类型预测的深度学习方法
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00100
Yuehan Yin, Yahya Alqahtani, Jinjuan Feng, J. Chakraborty, M. P. McGuire
Eye tracking data can help design effective user interfaces by showing how users visually process information. In this study, three neural network models were developed and employed to classify three types of information display methods by using eye gaze data that was collected in visual information processing behavior studies. Eye gaze data was first converted into a sequence and was fed into neural networks to predict the information display type. The results of the study show a comparison between three methods for the creation of eye tracking sequences and how they perform using three neural network models including CNN-LSTM, CNN-GRU, and 3D CNN. The results were positive with all models having an accuracy of higher than 88 percent.
眼动追踪数据可以通过显示用户如何视觉处理信息来帮助设计有效的用户界面。本研究利用视觉信息处理行为研究中收集的眼球注视数据,建立了三种神经网络模型,并对三种信息显示方式进行了分类。首先将眼球注视数据转换成序列,并将其输入神经网络来预测信息显示类型。研究结果显示了三种创建眼动追踪序列的方法之间的比较,以及它们如何使用三种神经网络模型,包括CNN- lstm, CNN- gru和3D CNN。结果是肯定的,所有模型的准确率都高于88%。
{"title":"Deep Learning Methods for the Prediction of Information Display Type Using Eye Tracking Sequences","authors":"Yuehan Yin, Yahya Alqahtani, Jinjuan Feng, J. Chakraborty, M. P. McGuire","doi":"10.1109/ICMLA52953.2021.00100","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00100","url":null,"abstract":"Eye tracking data can help design effective user interfaces by showing how users visually process information. In this study, three neural network models were developed and employed to classify three types of information display methods by using eye gaze data that was collected in visual information processing behavior studies. Eye gaze data was first converted into a sequence and was fed into neural networks to predict the information display type. The results of the study show a comparison between three methods for the creation of eye tracking sequences and how they perform using three neural network models including CNN-LSTM, CNN-GRU, and 3D CNN. The results were positive with all models having an accuracy of higher than 88 percent.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"82 1","pages":"601-605"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72707683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast Tensor Singular Value Decomposition Using the Low-Resolution Features of Tensors 基于张量低分辨率特征的快速张量奇异值分解
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00088
Cagri Ozdemir, R. Hoover, Kyle A. Caudle
The tensor singular value decomposition (t-SVD) based on an algebra of circulants is an effective multilinear sub- space learning technique for dimensionality reduction and data classification. Unfortunately, the computational cost associated with computing the t-SVD can become prohibitively expensive, particularly when dealing with very large data sets. In this paper, we present a computationally efficient approach for estimating the t-SVD by capitalizing on the correlations of the data in the temporal dimension. The approach proceeds by extending our prior work on fast eigenspace decompositions by transforming the tensor data from the spatial domain to the spectral domain in order to obtain reduced order harmonic tensor. The t-SVD can then be applied in the transform domain thereby significantly reducing the computational burden. Experimental results which are presented on the extended Yale-B, COIL-100, and MNIST data sets show the proposed method provides considerable computational savings with the approximated subspaces that are nearly the same as the true subspaces as computed via the t-SVD.
基于循环代数的张量奇异值分解(t-SVD)是一种有效的多线性子空间学习降维和数据分类技术。不幸的是,与计算t-SVD相关的计算成本可能变得非常昂贵,特别是在处理非常大的数据集时。在本文中,我们提出了一种计算效率的方法,通过利用数据在时间维度上的相关性来估计t-SVD。该方法通过将张量数据从空间域转换到谱域来获得降阶谐波张量,从而扩展了我们之前关于快速特征空间分解的工作。然后将t-SVD应用于变换域,从而大大减少了计算负担。在扩展的Yale-B、COIL-100和MNIST数据集上的实验结果表明,所提出的方法可以节省大量的计算量,其近似子空间与通过t-SVD计算的真实子空间几乎相同。
{"title":"Fast Tensor Singular Value Decomposition Using the Low-Resolution Features of Tensors","authors":"Cagri Ozdemir, R. Hoover, Kyle A. Caudle","doi":"10.1109/ICMLA52953.2021.00088","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00088","url":null,"abstract":"The tensor singular value decomposition (t-SVD) based on an algebra of circulants is an effective multilinear sub- space learning technique for dimensionality reduction and data classification. Unfortunately, the computational cost associated with computing the t-SVD can become prohibitively expensive, particularly when dealing with very large data sets. In this paper, we present a computationally efficient approach for estimating the t-SVD by capitalizing on the correlations of the data in the temporal dimension. The approach proceeds by extending our prior work on fast eigenspace decompositions by transforming the tensor data from the spatial domain to the spectral domain in order to obtain reduced order harmonic tensor. The t-SVD can then be applied in the transform domain thereby significantly reducing the computational burden. Experimental results which are presented on the extended Yale-B, COIL-100, and MNIST data sets show the proposed method provides considerable computational savings with the approximated subspaces that are nearly the same as the true subspaces as computed via the t-SVD.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"64 1","pages":"527-533"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81380157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1