首页 > 最新文献

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
How Dense Autoencoders can still Achieve the State-of-the-art in Time-Series Anomaly Detection 如何密集的自编码器仍然可以实现最先进的时间序列异常检测
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00207
Louis Jensen, Jayme Fosa, Ben Teitelbaum, Peter Chin
Time series data has become ubiquitous in the modern era of data collection. With the increase of these time series data streams, the demand for automatic time series anomaly detection has also increased. Automatic monitoring of data allows engineers to investigate only unusual behavior in their data streams. Despite this increase in demand for automatic time series anomaly detection, many popular methods fail to offer a general purpose solution. Some demand expensive labelling of anomalies, others require the data to follow certain assumed patterns, some have long and unstable training, and many suffer from high rates of false alarms. In this paper we demonstrate that simpler is often better, showing that a fully unsupervised multilayer perceptron autoencoder is able to outperform much more complicated models with only a few critical improvements. We offer improvements to help distinguish anomalous subsequences near to each other, and to distinguish anomalies even in the midst of changing distributions of data. We compare our model with state-of-the-art competitors on benchmark datasets sourced from NASA, Yahoo, and Numenta, achieving improvements beyond competitive models in all three datasets.
时间序列数据在现代数据收集时代已经变得无处不在。随着这些时间序列数据流的增加,对时间序列自动异常检测的需求也随之增加。数据的自动监控允许工程师只调查数据流中的异常行为。尽管对自动时间序列异常检测的需求不断增加,但许多流行的方法无法提供通用的解决方案。一些需要昂贵的异常标记,另一些需要数据遵循某些假定的模式,一些有长期和不稳定的训练,许多遭受高误报率。在本文中,我们证明了简单通常是更好的,表明一个完全无监督的多层感知器自编码器仅通过一些关键的改进就能够胜过更复杂的模型。我们提供了改进,以帮助区分彼此靠近的异常子序列,并且即使在数据分布变化的过程中也能区分异常。我们将我们的模型与来自NASA、Yahoo和Numenta的最先进的竞争对手的基准数据集进行比较,在所有三个数据集上都取得了超越竞争对手模型的改进。
{"title":"How Dense Autoencoders can still Achieve the State-of-the-art in Time-Series Anomaly Detection","authors":"Louis Jensen, Jayme Fosa, Ben Teitelbaum, Peter Chin","doi":"10.1109/ICMLA52953.2021.00207","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00207","url":null,"abstract":"Time series data has become ubiquitous in the modern era of data collection. With the increase of these time series data streams, the demand for automatic time series anomaly detection has also increased. Automatic monitoring of data allows engineers to investigate only unusual behavior in their data streams. Despite this increase in demand for automatic time series anomaly detection, many popular methods fail to offer a general purpose solution. Some demand expensive labelling of anomalies, others require the data to follow certain assumed patterns, some have long and unstable training, and many suffer from high rates of false alarms. In this paper we demonstrate that simpler is often better, showing that a fully unsupervised multilayer perceptron autoencoder is able to outperform much more complicated models with only a few critical improvements. We offer improvements to help distinguish anomalous subsequences near to each other, and to distinguish anomalies even in the midst of changing distributions of data. We compare our model with state-of-the-art competitors on benchmark datasets sourced from NASA, Yahoo, and Numenta, achieving improvements beyond competitive models in all three datasets.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"1272-1277"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82949939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Driven football scouting assistance with simulated player performance extrapolation 数据驱动的足球球探协助模拟球员的表现外推
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00189
Shantanu Ghar, Sayali Patil, Venkhatesh Arunachalam
In club football, scouting is a crucial aspect of player recruitment, with elite football clubs investing millions of dollars in scouting and signing the best player for their team every year. Scouting requires great analytical and observational skills from the scout, to find the best player for any position in the team. A scout needs to analyze the player by watching his in-game actions, physical attributes and make a judgement on how the player might fit into the team. Every team has a formation, a style of play and a specific profile of player is required for a given position depending on the aforementioned factors. But scouts only watch a player play a few matches in person, and prepare their scouting report based on a player’s performance in those matches. This process is flawed as the scout is expected to watch a few games and make estimates of the player’s performance in a new team. The player statistics can help the scout in making better data-driven decisions. A player’s career statistics can provide a picture of how the player performs individually, but they fail to predict player chemistry alongside a team. Misjudgement in scouting can lead to losses of millions of dollars to a club. We propose to solve this problem by utilising vast amounts of quantitative and qualitative player statistics (from 3+ sources), and by incorporating data science and machine learning algorithms to simulate real world performances of the team after the addition of the newly scouted player. We take into account specific player requirements and classify a player into one of our specific 15 player types, and use the team’s formation and style of play to predict the players that will have the best chemistry with any given lineup, thereby facilitating scouts in making better decisions.
在俱乐部足球中,球探是球员招募的一个重要方面,精英足球俱乐部每年都会投资数百万美元用于球探并为球队签下最好的球员。球探需要出色的分析和观察能力,以便为球队的任何位置找到最好的球员。侦察员需要通过观察玩家在游戏中的行为、身体属性来分析玩家,并判断玩家是否适合团队。每支球队都有一个阵型,一种比赛风格,一个特定的球员需要一个特定的位置,这取决于上述因素。但球探只会亲自观看球员的几场比赛,并根据球员在这些比赛中的表现准备球探报告。这个过程是有缺陷的,因为球探需要看几场比赛,并对球员在新球队的表现做出估计。球员统计数据可以帮助球探做出更好的数据驱动决策。球员的职业生涯数据可以提供球员个人表现的画面,但它们无法预测球员在球队中的化学反应。球探中的错误判断可能会给俱乐部造成数百万美元的损失。我们建议通过利用大量的定量和定性球员统计数据(来自3个以上的来源)来解决这个问题,并结合数据科学和机器学习算法来模拟新球员加入后球队在现实世界中的表现。我们会考虑特定的球员需求,并将球员分为15种特定的球员类型之一,并使用球队的阵型和打法来预测在任何给定阵容中最能产生最佳化学反应的球员,从而帮助球探做出更好的决策。
{"title":"Data Driven football scouting assistance with simulated player performance extrapolation","authors":"Shantanu Ghar, Sayali Patil, Venkhatesh Arunachalam","doi":"10.1109/ICMLA52953.2021.00189","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00189","url":null,"abstract":"In club football, scouting is a crucial aspect of player recruitment, with elite football clubs investing millions of dollars in scouting and signing the best player for their team every year. Scouting requires great analytical and observational skills from the scout, to find the best player for any position in the team. A scout needs to analyze the player by watching his in-game actions, physical attributes and make a judgement on how the player might fit into the team. Every team has a formation, a style of play and a specific profile of player is required for a given position depending on the aforementioned factors. But scouts only watch a player play a few matches in person, and prepare their scouting report based on a player’s performance in those matches. This process is flawed as the scout is expected to watch a few games and make estimates of the player’s performance in a new team. The player statistics can help the scout in making better data-driven decisions. A player’s career statistics can provide a picture of how the player performs individually, but they fail to predict player chemistry alongside a team. Misjudgement in scouting can lead to losses of millions of dollars to a club. We propose to solve this problem by utilising vast amounts of quantitative and qualitative player statistics (from 3+ sources), and by incorporating data science and machine learning algorithms to simulate real world performances of the team after the addition of the newly scouted player. We take into account specific player requirements and classify a player into one of our specific 15 player types, and use the team’s formation and style of play to predict the players that will have the best chemistry with any given lineup, thereby facilitating scouts in making better decisions.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"19 1","pages":"1160-1167"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84331322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deployment of Embedded Edge-AI for Wildlife Monitoring in Remote Regions 嵌入式边缘人工智能在偏远地区野生动物监测中的部署
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00170
D. Schwartz, Jonathan Michael Gomes Selman, P. Wrege, A. Paepcke
Artificial intelligence is increasingly used in ecological contexts to monitor animal and insect populations. Species of interest are those in danger of extinction, and those that play pivotal roles in agriculture. Noticing population declines or geographical shifts early enough for intervention can prevent local famine and disruption to the global food chain. Traditionally, data are collected in the field using human labor or sensors. Applicable classification models then analyze the data on central servers. The most expensive, and sometimes dangerous part of the remote sensing solution is the human labor of visiting the sensors, retrieving data, and changing batteries. Constantly sending all readings by radio is expensive in power. Instead, having AI in the sensors process readings, and only transmitting results could lead to an indefinitely autonomous, renewably powered solution. We implemented an elephant vocalization detector on a small processor board, and demonstrate that such a device can be operated at low enough power levels with considerable freedom of choice among AI technologies. We achieved a mean of 1.6W, in the best case staying within 75% of memory limits. Measurements covered three inference models, two batch sizes, and two floating point word width settings.
人工智能越来越多地用于生态环境中监测动物和昆虫种群。我们关注的物种是那些濒临灭绝的物种,以及那些在农业中发挥关键作用的物种。及早注意到人口减少或地理变化以便进行干预,可以防止局部饥荒和对全球食物链的破坏。传统上,数据是通过人工或传感器在现场收集的。然后,适用的分类模型分析中央服务器上的数据。遥感解决方案中最昂贵、有时也是最危险的部分是访问传感器、检索数据和更换电池的人力劳动。不断地用无线电发送所有的读数是很昂贵的。相反,在传感器中加入人工智能处理读数,只传输结果,可能会带来无限自主、可再生能源的解决方案。我们在一个小处理器板上实现了一个大象发声探测器,并证明了这样的设备可以在足够低的功率水平下运行,并且在人工智能技术中有相当大的选择自由。我们实现了平均1.6W,在最好的情况下保持在内存限制的75%以内。测量包括三个推理模型、两个批大小和两个浮点字宽设置。
{"title":"Deployment of Embedded Edge-AI for Wildlife Monitoring in Remote Regions","authors":"D. Schwartz, Jonathan Michael Gomes Selman, P. Wrege, A. Paepcke","doi":"10.1109/ICMLA52953.2021.00170","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00170","url":null,"abstract":"Artificial intelligence is increasingly used in ecological contexts to monitor animal and insect populations. Species of interest are those in danger of extinction, and those that play pivotal roles in agriculture. Noticing population declines or geographical shifts early enough for intervention can prevent local famine and disruption to the global food chain. Traditionally, data are collected in the field using human labor or sensors. Applicable classification models then analyze the data on central servers. The most expensive, and sometimes dangerous part of the remote sensing solution is the human labor of visiting the sensors, retrieving data, and changing batteries. Constantly sending all readings by radio is expensive in power. Instead, having AI in the sensors process readings, and only transmitting results could lead to an indefinitely autonomous, renewably powered solution. We implemented an elephant vocalization detector on a small processor board, and demonstrate that such a device can be operated at low enough power levels with considerable freedom of choice among AI technologies. We achieved a mean of 1.6W, in the best case staying within 75% of memory limits. Measurements covered three inference models, two batch sizes, and two floating point word width settings.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"1035-1042"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89243102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Outperforming Clinical Practices in Breast Cancer Detection: A Superior Dense Neural Network in Classification and False Negative Reduction 在乳腺癌检测中表现优异的临床实践:在分类和假阴性减少方面的优越密集神经网络
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00098
Patrick Bujok, Maria Jensen, Steffen M. Larsen, R. A. Alphinas
Machine Learning applications provide a promising method to support clinical practitioners in Breast Cancer (BC) detection. Currently, Fine Needle Aspiration (FNA) is a commonly applied diagnostic method for BC tumors, which, however, is associated with ominous false negative misclassifications. For this purpose, the present study explores Artificial Neural Networks (ANNs) with the aim of outperforming clinical practices via FNA in classifying benign or malignant BC cases with regard to an improved accuracy and reduced False Negative Rate (FNR) using the Breast Cancer Wisconsin (Diagnostic) Dataset (WDBC). The findings reveal that a dense ANN with a single hidden layer including 15 neurons can reach a testing accuracy of 98.60% and a FNR of 0% on a scaled dataset. In combination with several introduced improvement measures, a high degree of generalizability is associated with the model under the consideration of the relatively small dataset. As a result, this model outperforms not only clinical practitioners but also 72 classifiers from the recent literature.
机器学习应用为支持临床医生检测乳腺癌(BC)提供了一种很有前途的方法。目前,细针穿刺(FNA)是一种常用的诊断BC肿瘤的方法,然而,它与不祥的假阴性错误分类有关。为此,本研究探索了人工神经网络(ann),目的是通过FNA在使用乳腺癌威斯康星(诊断)数据集(WDBC)对良性或恶性BC病例进行分类方面优于临床实践,提高准确性并降低假阴性率(FNR)。研究结果表明,包含15个神经元的单个隐藏层的密集神经网络在缩放数据集上的测试准确率为98.60%,FNR为0%。结合引入的几个改进措施,在考虑相对较小的数据集的情况下,该模型具有高度的泛化性。因此,该模型不仅优于临床医生,而且优于最近文献中的72个分类器。
{"title":"Outperforming Clinical Practices in Breast Cancer Detection: A Superior Dense Neural Network in Classification and False Negative Reduction","authors":"Patrick Bujok, Maria Jensen, Steffen M. Larsen, R. A. Alphinas","doi":"10.1109/ICMLA52953.2021.00098","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00098","url":null,"abstract":"Machine Learning applications provide a promising method to support clinical practitioners in Breast Cancer (BC) detection. Currently, Fine Needle Aspiration (FNA) is a commonly applied diagnostic method for BC tumors, which, however, is associated with ominous false negative misclassifications. For this purpose, the present study explores Artificial Neural Networks (ANNs) with the aim of outperforming clinical practices via FNA in classifying benign or malignant BC cases with regard to an improved accuracy and reduced False Negative Rate (FNR) using the Breast Cancer Wisconsin (Diagnostic) Dataset (WDBC). The findings reveal that a dense ANN with a single hidden layer including 15 neurons can reach a testing accuracy of 98.60% and a FNR of 0% on a scaled dataset. In combination with several introduced improvement measures, a high degree of generalizability is associated with the model under the consideration of the relatively small dataset. As a result, this model outperforms not only clinical practitioners but also 72 classifiers from the recent literature.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"25 1","pages":"589-594"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87677746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Sequential Multivariate Fault Prediction for Vehicular Predictive Maintenance 面向车辆预测性维修的序列多变量故障预测研究
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00167
A. Hafeez, Eduardo Alonso, Aram Ter-Sarkisov
Predictive maintenance, which has traditionally used anomaly detection methods on sensory data, is now being replaced by event-based techniques. These methods utilise events with multiple temporal (and often non-numeric) features, produced by diagnostic modules. This raises the need of learning numerical event representations to predict the next fault event in industrial machines, specially vehicles, that use Diagnostic Trouble Codes (DTCs). We propose a predictive maintenance approach, named Sequential Multivariate Fault Prediction (SMFP), for predicting the next multivariate DTC fault in an event sequence, using Long Short-Term Memory Networks (LSTMs) and jointly learned event embeddings. By performing an in-depth comparison of different architectural choices and contextual preprocessing techniques, we provide an initial baseline for SMFP that achieves top-3 accuracy of 63% on predicting multivariate fault with 3 collective output layers, using vehicle maintenance data as a case study.
预测性维护传统上使用基于感官数据的异常检测方法,现在正在被基于事件的技术所取代。这些方法利用由诊断模块产生的具有多个时间(通常是非数字)特征的事件。这就提出了学习数字事件表示来预测工业机器,特别是使用诊断故障代码(dtc)的车辆中的下一个故障事件的需求。我们提出了一种预测性维护方法,称为顺序多变量故障预测(SMFP),用于预测事件序列中的下一个多变量DTC故障,使用长短期记忆网络(LSTMs)和联合学习的事件嵌入。通过对不同的架构选择和上下文预处理技术进行深入比较,我们为SMFP提供了一个初始基线,该基线在预测3个集体输出层的多变量故障时达到了63%的前3名准确率,并以车辆维护数据为例进行了研究。
{"title":"Towards Sequential Multivariate Fault Prediction for Vehicular Predictive Maintenance","authors":"A. Hafeez, Eduardo Alonso, Aram Ter-Sarkisov","doi":"10.1109/ICMLA52953.2021.00167","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00167","url":null,"abstract":"Predictive maintenance, which has traditionally used anomaly detection methods on sensory data, is now being replaced by event-based techniques. These methods utilise events with multiple temporal (and often non-numeric) features, produced by diagnostic modules. This raises the need of learning numerical event representations to predict the next fault event in industrial machines, specially vehicles, that use Diagnostic Trouble Codes (DTCs). We propose a predictive maintenance approach, named Sequential Multivariate Fault Prediction (SMFP), for predicting the next multivariate DTC fault in an event sequence, using Long Short-Term Memory Networks (LSTMs) and jointly learned event embeddings. By performing an in-depth comparison of different architectural choices and contextual preprocessing techniques, we provide an initial baseline for SMFP that achieves top-3 accuracy of 63% on predicting multivariate fault with 3 collective output layers, using vehicle maintenance data as a case study.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"309 1","pages":"1016-1021"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75682546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Improved Attribute Manipulation in the Latent Space of StyleGAN for Semantic Face Editing 面向语义人脸编辑的StyleGAN隐空间属性操作改进
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00014
Aashish Rai, Clara Ducher, J. Cooperstock
With the recent popularization of generative frameworks for producing photorealistic face images, we now have the ability to create a convincing graphical match for any particular individual. It is unrealistic, however, to rely solely on such generative methods to randomly produce the facial characteristics we are seeking. Instead, manipulation of facial attributes in the latent space, enabled by the InterFaceGAN framework, allows us to “tweak” these characteristics in the desired direction to improve the quality of the match. The challenge in this process is that attribute entanglement leads to a change of one feature having an undesirable impact on others. We explore several strategies to improve the results of these manipulations, and demonstrate how the automatic conditioning of attributes can be used to minimize the impact of such entanglement, and further, allow for improved control over complex (non-binary) attributes such as race or face shape.
随着最近用于生成逼真人脸图像的生成框架的普及,我们现在有能力为任何特定个体创建令人信服的图形匹配。然而,仅仅依靠这种生成方法来随机生成我们正在寻找的面部特征是不现实的。相反,通过InterFaceGAN框架在潜在空间中操纵面部属性,允许我们在期望的方向上“调整”这些特征,以提高匹配的质量。这个过程中的挑战是,属性纠缠会导致一个特性的改变对其他特性产生不良影响。我们探索了几种策略来改善这些操作的结果,并展示了如何使用属性的自动条件反射来最小化这种纠缠的影响,并且进一步允许改进对复杂(非二进制)属性(如种族或脸型)的控制。
{"title":"Improved Attribute Manipulation in the Latent Space of StyleGAN for Semantic Face Editing","authors":"Aashish Rai, Clara Ducher, J. Cooperstock","doi":"10.1109/ICMLA52953.2021.00014","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00014","url":null,"abstract":"With the recent popularization of generative frameworks for producing photorealistic face images, we now have the ability to create a convincing graphical match for any particular individual. It is unrealistic, however, to rely solely on such generative methods to randomly produce the facial characteristics we are seeking. Instead, manipulation of facial attributes in the latent space, enabled by the InterFaceGAN framework, allows us to “tweak” these characteristics in the desired direction to improve the quality of the match. The challenge in this process is that attribute entanglement leads to a change of one feature having an undesirable impact on others. We explore several strategies to improve the results of these manipulations, and demonstrate how the automatic conditioning of attributes can be used to minimize the impact of such entanglement, and further, allow for improved control over complex (non-binary) attributes such as race or face shape.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"21 39","pages":"38-43"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91505996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Modeling approaches for Silent Attrition prediction in Payment networks 支付网络中无声损耗预测的建模方法
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00070
L. Dheekollu, H. Wadhwa, Siddharth Vimal, Anubhav Gupta, Siddhartha Asthana, Ankur Arora, Smriti Gupta
Predicting customer attrition (churn) is a well known problem in industries that provide services, like financial institutions, telecommunications, e-commerce, and retail. There are two kinds of attrition - active and passive (silent). Active attrition is usually associated with subscription-based business models, commonly seen in telecommunications and internet industries like Netflix. In industries like finance, retail, and ecommerce, we see the other kind of attrition - silent attrition where customers stop doing business without formal notice. This makes the silent attrition prediction problem even more challenging because it is difficult to differentiate between attrited and inactive customers. We focus our work on predicting silent attrition which is still under-explored in the payment card industry (i.e. Mastercard, Visa). The contribution of our work is threefold. First, we present a data-driven approach to define silent attrition as customer inactivity. Second, we discussed multiple procedures to generate synthetic data thereby preserving customers’ privacy. At last, we presented a comprehensive view of various machine learning (ML) pathways in which this churn prediction problem can be framed and solved; each requiring a specific feature engineering. We presented experimental results corresponding to each pathway to comparative analysis. We believe that this work to be beneficial to the researchers and ML practitioners who often have to deal with sensitive financial data but have limited permission to use it. In this direction, we demonstrated the use of synthetic data generation to reduce the risk of data leakage and other privacy concerns relating to ML models development.
在金融机构、电信、电子商务和零售等提供服务的行业中,预测客户流失是一个众所周知的问题。有两种损耗——主动损耗和被动损耗(无声损耗)。主动流失通常与基于订阅的商业模式有关,通常见于电信和互联网行业,如Netflix。在金融、零售和电子商务等行业,我们看到了另一种流失——无声流失,即客户在没有正式通知的情况下停止做生意。这使得沉默的流失预测问题更具挑战性,因为很难区分流失和不活跃的客户。我们的工作重点是预测无声损耗,这在支付卡行业(如万事达,Visa)仍未得到充分探索。我们的工作有三方面的贡献。首先,我们提出了一种数据驱动的方法,将沉默的流失定义为客户不活动。其次,我们讨论了生成合成数据从而保护客户隐私的多个程序。最后,我们提出了各种机器学习(ML)途径的综合视图,其中可以构建和解决这种流失预测问题;每个都需要一个特定的特征工程。我们给出了各途径对应的实验结果进行对比分析。我们相信这项工作对研究人员和机器学习从业者是有益的,他们经常需要处理敏感的财务数据,但使用权限有限。在这个方向上,我们演示了使用合成数据生成来降低数据泄露的风险和与ML模型开发相关的其他隐私问题。
{"title":"Modeling approaches for Silent Attrition prediction in Payment networks","authors":"L. Dheekollu, H. Wadhwa, Siddharth Vimal, Anubhav Gupta, Siddhartha Asthana, Ankur Arora, Smriti Gupta","doi":"10.1109/ICMLA52953.2021.00070","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00070","url":null,"abstract":"Predicting customer attrition (churn) is a well known problem in industries that provide services, like financial institutions, telecommunications, e-commerce, and retail. There are two kinds of attrition - active and passive (silent). Active attrition is usually associated with subscription-based business models, commonly seen in telecommunications and internet industries like Netflix. In industries like finance, retail, and ecommerce, we see the other kind of attrition - silent attrition where customers stop doing business without formal notice. This makes the silent attrition prediction problem even more challenging because it is difficult to differentiate between attrited and inactive customers. We focus our work on predicting silent attrition which is still under-explored in the payment card industry (i.e. Mastercard, Visa). The contribution of our work is threefold. First, we present a data-driven approach to define silent attrition as customer inactivity. Second, we discussed multiple procedures to generate synthetic data thereby preserving customers’ privacy. At last, we presented a comprehensive view of various machine learning (ML) pathways in which this churn prediction problem can be framed and solved; each requiring a specific feature engineering. We presented experimental results corresponding to each pathway to comparative analysis. We believe that this work to be beneficial to the researchers and ML practitioners who often have to deal with sensitive financial data but have limited permission to use it. In this direction, we demonstrated the use of synthetic data generation to reduce the risk of data leakage and other privacy concerns relating to ML models development.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"64 1","pages":"409-414"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91086362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applications of Mobile Machine Learning for Detecting Bio-energy Crops Flowers 移动机器学习在生物能源作物花卉检测中的应用
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00121
Wenjun Zeng, Bakhtiar Amen
Automated flower detection and control is important to crop production and precision agriculture. Some computer vision methods have been proposed for flower detection, but their performances are not satisfactory on platforms with limited computing ability such as mobile and embedded devices, and thus not suitable for field applications. Herein we demonstrate two de novo approaches that can precisely detect the flowers of two bioenergy crops (potatoes and sweet potatoes) and can distinguish them from similar flowers of relative species (eggplants and Ipomoea triloba) on mobile devices. In this work, a custom dataset containing 495 manually labelled images is constructed for training and testing, and the latest state-of-the-art object detection model, YOLOv4, as well as its lightweight version, YOLOv4-tiny, are selected as the flower detection models. Some other milestone object detection models including YOLOv3, YOLOv3-tiny, SSD and Faster-RCNN are chosen as benchmarks for performance comparison. The comparative experiment results indicate that the retrained YOLOv4 model achieves a considerable high mean average precision (mAP= 91%;) but a slower inference speed (FPS) on a mobile device, while the retrained YOLOv4-tiny has a lower mAP of 87%; but reach a higher FPS of 9 on a mobile device. Two mobile applications are then developed by directly deploying YOLOv4-tiny model on a mobile app and by deploying YOLOv4 on a web API, respectively. The testing experiments indicate that both applications can not only achieve real-time and accurate detection, but also reduce computation burdens on mobile devices.
花卉自动化检测与控制对作物生产和精准农业具有重要意义。目前已经提出了一些用于花卉检测的计算机视觉方法,但在移动和嵌入式设备等计算能力有限的平台上,这些方法的性能并不令人满意,因此不适合现场应用。在这里,我们展示了两种全新的方法,可以精确检测两种生物能源作物(土豆和红薯)的花朵,并可以在移动设备上将它们与相关物种(茄子和三叶马铃薯)的类似花朵区分开来。在这项工作中,我们构建了一个包含495张手动标记图像的自定义数据集进行训练和测试,并选择了最新的最先进的目标检测模型YOLOv4,以及它的轻量级版本YOLOv4-tiny作为花卉检测模型。其他一些具有里程碑意义的目标检测模型包括YOLOv3, YOLOv3-tiny, SSD和Faster-RCNN作为性能比较的基准。对比实验结果表明,经过再训练的YOLOv4模型在移动设备上获得了相当高的平均精度(mAP= 91%),但推理速度(FPS)较慢,而经过再训练的YOLOv4-tiny模型的mAP较低,为87%;但在移动设备上达到更高的FPS(9)。然后分别通过在移动应用程序上直接部署YOLOv4-tiny模型和在web API上部署YOLOv4来开发两个移动应用程序。测试实验表明,这两种应用不仅可以实现实时、准确的检测,还可以减少移动设备的计算负担。
{"title":"Applications of Mobile Machine Learning for Detecting Bio-energy Crops Flowers","authors":"Wenjun Zeng, Bakhtiar Amen","doi":"10.1109/ICMLA52953.2021.00121","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00121","url":null,"abstract":"Automated flower detection and control is important to crop production and precision agriculture. Some computer vision methods have been proposed for flower detection, but their performances are not satisfactory on platforms with limited computing ability such as mobile and embedded devices, and thus not suitable for field applications. Herein we demonstrate two de novo approaches that can precisely detect the flowers of two bioenergy crops (potatoes and sweet potatoes) and can distinguish them from similar flowers of relative species (eggplants and Ipomoea triloba) on mobile devices. In this work, a custom dataset containing 495 manually labelled images is constructed for training and testing, and the latest state-of-the-art object detection model, YOLOv4, as well as its lightweight version, YOLOv4-tiny, are selected as the flower detection models. Some other milestone object detection models including YOLOv3, YOLOv3-tiny, SSD and Faster-RCNN are chosen as benchmarks for performance comparison. The comparative experiment results indicate that the retrained YOLOv4 model achieves a considerable high mean average precision (mAP= 91%;) but a slower inference speed (FPS) on a mobile device, while the retrained YOLOv4-tiny has a lower mAP of 87%; but reach a higher FPS of 9 on a mobile device. Two mobile applications are then developed by directly deploying YOLOv4-tiny model on a mobile app and by deploying YOLOv4 on a web API, respectively. The testing experiments indicate that both applications can not only achieve real-time and accurate detection, but also reduce computation burdens on mobile devices.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"41 1","pages":"724-729"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91011208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Trade-offs in Metric Learning for Bearing Fault Diagnosis 度量学习在轴承故障诊断中的权衡
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00180
Tyler Cody, Stephen C. Adams, P. Beling
Metric learning is a well-developed field in machine learning and has seen recent application in the area of prognostics and health management (PHM). Metric learning allows for fault diagnosis or condition monitoring models to be developed with the assumption that a machine- or load-specific similarity metric can be learned after model deployment. Existing literature has used metric learning to fine-tune deep learning models to address machine-to-machine differences and differences in working conditions. Here, we study metric learning in isolation, not as an intermediate step in deep learning, by conducting a comparative study of Principal Component Analysis (PCA), Neighborhood Component Analysis (NCA), Local Fisher Discriminant Analysis (LFDA), and Large Margin Nearest Neighbor (LMNN). We consider performance metrics for prediction performance, cluster performance, feature sensitivity, sample efficiency, and latent space efficiency. We find that linear partitions on the latent spaces learned via metric learning are able to achieve accuracies greater than 90% on Case Western Reserve University’s bearing fault data set using only the drive-end vibration signal. We find PCA to be dominated by metric learning algorithms for all working loads considered. And, in sum, we demonstrate classical metric learning algorithms to be a promising approach for learning machine-and load-specific similarity metrics for PHM with minor data processing and small samples.
度量学习是机器学习中一个发展良好的领域,最近在预测和健康管理(PHM)领域得到了应用。度量学习允许在假设可以在模型部署后学习特定于机器或负载的相似性度量的情况下开发故障诊断或状态监测模型。现有文献已经使用度量学习来微调深度学习模型,以解决机器对机器的差异和工作条件的差异。在这里,我们通过对主成分分析(PCA)、邻域成分分析(NCA)、局部Fisher判别分析(LFDA)和大边际最近邻(LMNN)进行比较研究,孤立地研究度量学习,而不是作为深度学习的中间步骤。我们考虑了预测性能、聚类性能、特征灵敏度、样本效率和潜在空间效率的性能指标。我们发现,通过度量学习获得的潜在空间上的线性划分能够在仅使用驱动端振动信号的Case西储大学轴承故障数据集上实现大于90%的精度。我们发现PCA是由度量学习算法主导的所有工作负载考虑。总而言之,我们证明了经典度量学习算法是一种很有前途的方法,用于学习具有少量数据处理和小样本的PHM的机器和负载特定相似性度量。
{"title":"Trade-offs in Metric Learning for Bearing Fault Diagnosis","authors":"Tyler Cody, Stephen C. Adams, P. Beling","doi":"10.1109/ICMLA52953.2021.00180","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00180","url":null,"abstract":"Metric learning is a well-developed field in machine learning and has seen recent application in the area of prognostics and health management (PHM). Metric learning allows for fault diagnosis or condition monitoring models to be developed with the assumption that a machine- or load-specific similarity metric can be learned after model deployment. Existing literature has used metric learning to fine-tune deep learning models to address machine-to-machine differences and differences in working conditions. Here, we study metric learning in isolation, not as an intermediate step in deep learning, by conducting a comparative study of Principal Component Analysis (PCA), Neighborhood Component Analysis (NCA), Local Fisher Discriminant Analysis (LFDA), and Large Margin Nearest Neighbor (LMNN). We consider performance metrics for prediction performance, cluster performance, feature sensitivity, sample efficiency, and latent space efficiency. We find that linear partitions on the latent spaces learned via metric learning are able to achieve accuracies greater than 90% on Case Western Reserve University’s bearing fault data set using only the drive-end vibration signal. We find PCA to be dominated by metric learning algorithms for all working loads considered. And, in sum, we demonstrate classical metric learning algorithms to be a promising approach for learning machine-and load-specific similarity metrics for PHM with minor data processing and small samples.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"23 1","pages":"1100-1105"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82140624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Fast Tensor Singular Value Decomposition Using the Low-Resolution Features of Tensors 基于张量低分辨率特征的快速张量奇异值分解
Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00088
Cagri Ozdemir, R. Hoover, Kyle A. Caudle
The tensor singular value decomposition (t-SVD) based on an algebra of circulants is an effective multilinear sub- space learning technique for dimensionality reduction and data classification. Unfortunately, the computational cost associated with computing the t-SVD can become prohibitively expensive, particularly when dealing with very large data sets. In this paper, we present a computationally efficient approach for estimating the t-SVD by capitalizing on the correlations of the data in the temporal dimension. The approach proceeds by extending our prior work on fast eigenspace decompositions by transforming the tensor data from the spatial domain to the spectral domain in order to obtain reduced order harmonic tensor. The t-SVD can then be applied in the transform domain thereby significantly reducing the computational burden. Experimental results which are presented on the extended Yale-B, COIL-100, and MNIST data sets show the proposed method provides considerable computational savings with the approximated subspaces that are nearly the same as the true subspaces as computed via the t-SVD.
基于循环代数的张量奇异值分解(t-SVD)是一种有效的多线性子空间学习降维和数据分类技术。不幸的是,与计算t-SVD相关的计算成本可能变得非常昂贵,特别是在处理非常大的数据集时。在本文中,我们提出了一种计算效率的方法,通过利用数据在时间维度上的相关性来估计t-SVD。该方法通过将张量数据从空间域转换到谱域来获得降阶谐波张量,从而扩展了我们之前关于快速特征空间分解的工作。然后将t-SVD应用于变换域,从而大大减少了计算负担。在扩展的Yale-B、COIL-100和MNIST数据集上的实验结果表明,所提出的方法可以节省大量的计算量,其近似子空间与通过t-SVD计算的真实子空间几乎相同。
{"title":"Fast Tensor Singular Value Decomposition Using the Low-Resolution Features of Tensors","authors":"Cagri Ozdemir, R. Hoover, Kyle A. Caudle","doi":"10.1109/ICMLA52953.2021.00088","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00088","url":null,"abstract":"The tensor singular value decomposition (t-SVD) based on an algebra of circulants is an effective multilinear sub- space learning technique for dimensionality reduction and data classification. Unfortunately, the computational cost associated with computing the t-SVD can become prohibitively expensive, particularly when dealing with very large data sets. In this paper, we present a computationally efficient approach for estimating the t-SVD by capitalizing on the correlations of the data in the temporal dimension. The approach proceeds by extending our prior work on fast eigenspace decompositions by transforming the tensor data from the spatial domain to the spectral domain in order to obtain reduced order harmonic tensor. The t-SVD can then be applied in the transform domain thereby significantly reducing the computational burden. Experimental results which are presented on the extended Yale-B, COIL-100, and MNIST data sets show the proposed method provides considerable computational savings with the approximated subspaces that are nearly the same as the true subspaces as computed via the t-SVD.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"64 1","pages":"527-533"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81380157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1