首页 > 最新文献

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
A Rule-Based Classifier with Accurate and Fast Rule Term Induction for Continuous Attributes 基于规则的连续属性分类器,具有快速准确的规则项归纳
Manal Almutairi, Frederic T. Stahl, M. Bramer
Rule-based classifiers are considered more expressive, human readable and less prone to over-fitting compared with decision trees, especially when there is noise in the data. Furthermore, rule-based classifiers do not suffer from the replicated subtree problem as classifiers induced by top down induction of decision trees (also known as 'Divide and Conquer'). This research explores some recent developments of a family of rulebased classifiers, the Prism family and more particular G-Prism-FB and G-Prism-DB algorithms, in terms of local discretisation methods used to induce rule terms for continuous data. The paper then proposes a new algorithm of the Prism family based on a combination of Gauss Probability Density Distribution (GPDD), InterQuartile Range (IQR) and data transformation methods. This new rule-based algorithm, termed G-Rules-IQR, is evaluated empirically and outperforms other members of the Prism family in execution time, accuracy and tentative accuracy.
与决策树相比,基于规则的分类器被认为更具表现力,更易于人类阅读,并且更不容易过度拟合,特别是当数据中存在噪声时。此外,基于规则的分类器不会像由自上而下的决策树归纳(也称为“分而治之”)引起的分类器那样受到复制子树问题的困扰。本研究探讨了基于规则的分类器家族的一些最新发展,Prism家族和更具体的G-Prism-FB和G-Prism-DB算法,用于为连续数据归纳规则项的局部离散化方法。在此基础上,提出了一种结合高斯概率密度分布(GPDD)、四分位间距(IQR)和数据变换方法的Prism族新算法。这种新的基于规则的算法被称为G-Rules-IQR,经过经验评估,在执行时间、准确性和暂定准确性方面优于Prism家族的其他成员。
{"title":"A Rule-Based Classifier with Accurate and Fast Rule Term Induction for Continuous Attributes","authors":"Manal Almutairi, Frederic T. Stahl, M. Bramer","doi":"10.1109/ICMLA.2018.00068","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00068","url":null,"abstract":"Rule-based classifiers are considered more expressive, human readable and less prone to over-fitting compared with decision trees, especially when there is noise in the data. Furthermore, rule-based classifiers do not suffer from the replicated subtree problem as classifiers induced by top down induction of decision trees (also known as 'Divide and Conquer'). This research explores some recent developments of a family of rulebased classifiers, the Prism family and more particular G-Prism-FB and G-Prism-DB algorithms, in terms of local discretisation methods used to induce rule terms for continuous data. The paper then proposes a new algorithm of the Prism family based on a combination of Gauss Probability Density Distribution (GPDD), InterQuartile Range (IQR) and data transformation methods. This new rule-based algorithm, termed G-Rules-IQR, is evaluated empirically and outperforms other members of the Prism family in execution time, accuracy and tentative accuracy.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"186 1","pages":"413-420"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74949588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Machine Learning for US Army UAVs Sustainment: Assessing Effect of Sensor Frequency and Placement on Damage Information in the Ultrasound Signals 美国陆军无人机维护的机器学习:评估传感器频率和放置对超声信号中损伤信息的影响
R. Valisetty, R. Haynes, R. Namburu, Michael Lee
US Army unmanned aerial vehicles (UAVs) in the future will be sustained for longer durations if damage in structural parts is continuously monitored from the damage-inception stage and continuously through vehicle life. Neural networks based machine learning (ML) are proposed, demonstrating that the length of a developing fatigue crack can be estimated continuously using the ultrasound signals. Using a 0.5-TB data set that was obtained from a carefully selected set of experiments, the ML was developed in three stages: 1) feature development, 2) outlier elimination and 3) role of the excitation frequency and exciter-receiver path in the ML of the crack length. In the first stage, the recorded 8000-point ultrasound signals were reduced, first, to 63 features comprising the major statistical features of the returned signal and the seven scales of a seven scale wavelet decomposition of the returned signal. Using an autoencoder algorithm, outliers in the input were identified and removed. A four-layer, 63-32-16-1 neural network based linear regression algorithm was used to predict the crack length from the input features. The results indicated that the ML algorithm gave correlation in the range of 99.43-99.97% when both the exciter-frequency and the exciter-receiver paths are fixed. For investigating the effects of the excitation frequency and the exciter-receiver path on the crack-length information in the returned signal, a similar neural network algorithm was used. One or two additional variables were added to the incoming samples' feature space depending on whether the excitation frequency or the exciter-receiver path or both were variables. ML for crack-length estimation showed promise for these situations, too. In the more practical first situation, where the exciter frequency is fixed and the exciter-receiver path is uncertain, the algorithm showed an accuracy in the range of 96.97-98.92%. This algorithm still gave a correlation above 85% when there was uncertainty in the excitation frequency and exciter-receiver paths, as well. This work thus demonstrates the potential for monitoring fatigue crack length growth throughout the life of a vehicle for an increased sustainment of the US Army UAVs.
如果从损坏开始阶段就对结构部件的损坏进行连续监测,并在整个车辆寿命期间持续监测,美国陆军无人机(uav)未来的持续时间将更长。提出了基于神经网络的机器学习方法,证明了利用超声信号可以连续估计疲劳裂纹的长度。使用精心挑选的一组实验获得的0.5 tb数据集,ML分为三个阶段:1)特征开发,2)异常值消除和3)激励频率和激励-接收路径在裂纹长度ML中的作用。在第一阶段,将记录的8000点超声信号缩减为63个特征,包括返回信号的主要统计特征和返回信号的七尺度小波分解的七个尺度。使用自动编码器算法,识别并去除输入中的异常值。采用基于63-32-16-1神经网络的四层线性回归算法,根据输入特征预测裂纹长度。结果表明,当激振频率和激振接收机路径固定时,ML算法的相关性在99.43 ~ 99.97%之间。为了研究激励频率和激励-接收路径对返回信号中裂纹长度信息的影响,采用了类似的神经网络算法。根据激励频率或激励-接收路径或两者都是变量,在输入样本的特征空间中添加一个或两个额外的变量。用于裂缝长度估计的ML也显示了对这些情况的希望。在较为实际的第一种情况下,当激振频率固定且激振接收机路径不确定时,该算法的精度在96.97 ~ 98.92%之间。当激励频率和激励-接收路径存在不确定性时,该算法仍能给出85%以上的相关性。因此,这项工作证明了在整个车辆寿命期间监测疲劳裂纹长度增长的潜力,从而增加了美国陆军无人机的维持能力。
{"title":"Machine Learning for US Army UAVs Sustainment: Assessing Effect of Sensor Frequency and Placement on Damage Information in the Ultrasound Signals","authors":"R. Valisetty, R. Haynes, R. Namburu, Michael Lee","doi":"10.1109/ICMLA.2018.00032","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00032","url":null,"abstract":"US Army unmanned aerial vehicles (UAVs) in the future will be sustained for longer durations if damage in structural parts is continuously monitored from the damage-inception stage and continuously through vehicle life. Neural networks based machine learning (ML) are proposed, demonstrating that the length of a developing fatigue crack can be estimated continuously using the ultrasound signals. Using a 0.5-TB data set that was obtained from a carefully selected set of experiments, the ML was developed in three stages: 1) feature development, 2) outlier elimination and 3) role of the excitation frequency and exciter-receiver path in the ML of the crack length. In the first stage, the recorded 8000-point ultrasound signals were reduced, first, to 63 features comprising the major statistical features of the returned signal and the seven scales of a seven scale wavelet decomposition of the returned signal. Using an autoencoder algorithm, outliers in the input were identified and removed. A four-layer, 63-32-16-1 neural network based linear regression algorithm was used to predict the crack length from the input features. The results indicated that the ML algorithm gave correlation in the range of 99.43-99.97% when both the exciter-frequency and the exciter-receiver paths are fixed. For investigating the effects of the excitation frequency and the exciter-receiver path on the crack-length information in the returned signal, a similar neural network algorithm was used. One or two additional variables were added to the incoming samples' feature space depending on whether the excitation frequency or the exciter-receiver path or both were variables. ML for crack-length estimation showed promise for these situations, too. In the more practical first situation, where the exciter frequency is fixed and the exciter-receiver path is uncertain, the algorithm showed an accuracy in the range of 96.97-98.92%. This algorithm still gave a correlation above 85% when there was uncertainty in the excitation frequency and exciter-receiver paths, as well. This work thus demonstrates the potential for monitoring fatigue crack length growth throughout the life of a vehicle for an increased sustainment of the US Army UAVs.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"60 1","pages":"165-172"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73795856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Auction Fraud Classification Based on Clustering and Sampling Techniques 基于聚类和抽样技术的拍卖欺诈分类
Farzana Anowar, S. Sadaoui, Malek Mouhoub
Online auctions created a very attractive environment for dishonest moneymakers who can commit different types of fraud. Shill Bidding (SB) is the most predominant auction fraud and also the most difficult to detect because of its similarity to usual bidding behavior. Based on a newly produced SB dataset, in this study, we devise a fraud classification model that is able to efficiently differentiate between honest and malicious bidders. First, we label the SB data by combining a hierarchical clustering technique and a semi-automated labeling approach. To solve the imbalanced learning problem, we apply several advanced data sampling methods and compare their performance using the SVM model. As a result, we develop an optimal SB classifier that exhibits very satisfactory detection and low misclassification rates.
在线拍卖为不诚实的赚钱者创造了一个非常有吸引力的环境,他们可以实施不同类型的欺诈。欺骗性投标是最主要的拍卖欺诈行为,也是最难以发现的,因为它与通常的投标行为相似。基于新生成的SB数据集,在本研究中,我们设计了一个欺诈分类模型,能够有效区分诚实和恶意的投标人。首先,我们结合层次聚类技术和半自动标记方法对SB数据进行标记。为了解决不平衡学习问题,我们采用了几种先进的数据采样方法,并使用支持向量机模型比较了它们的性能。因此,我们开发了一个最优的SB分类器,它具有非常令人满意的检测和低误分类率。
{"title":"Auction Fraud Classification Based on Clustering and Sampling Techniques","authors":"Farzana Anowar, S. Sadaoui, Malek Mouhoub","doi":"10.1109/ICMLA.2018.00061","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00061","url":null,"abstract":"Online auctions created a very attractive environment for dishonest moneymakers who can commit different types of fraud. Shill Bidding (SB) is the most predominant auction fraud and also the most difficult to detect because of its similarity to usual bidding behavior. Based on a newly produced SB dataset, in this study, we devise a fraud classification model that is able to efficiently differentiate between honest and malicious bidders. First, we label the SB data by combining a hierarchical clustering technique and a semi-automated labeling approach. To solve the imbalanced learning problem, we apply several advanced data sampling methods and compare their performance using the SVM model. As a result, we develop an optimal SB classifier that exhibits very satisfactory detection and low misclassification rates.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"25 1","pages":"366-371"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84780762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Implementation of a Smartphone as a Wearable and Wireless Gyroscope Platform for Machine Learning Classification of Hemiplegic Gait Through a Multilayer Perceptron Neural Network 基于多层感知器神经网络的智能手机可穿戴无线陀螺仪平台偏瘫步态机器学习分类
R. LeMoyne, Timothy Mastroianni
The smartphone represents a wearable and wireless system with the potential to have transformative influence on the biomedical and healthcare industry. An intrinsic feature of the smartphone is a gyroscope sensor, for which with a software application the smartphone functions as a wearable and wireless gyroscope platform. The resultant gyroscope data recording presents a clinical recognizable signal, which has been successful demonstrated to quantify aspects of human movement characteristics, such as the patellar tendon reflex. Gait another associated feature of human movement can be readily quantified by a smartphone functioning as a wearable and wireless gyroscope platform. The research objective is to distinguish between an affected leg and unaffected leg during hemiplegic gait based on a smartphone functioning as a wearable and wireless gyroscope platform though machine learning classification. A single smartphone is applied to quantify hemiplegic gait. The smartphone is first mounted to the affected leg and then the unaffected leg with velocity constrained to a constant velocity by a treadmill. Through wireless connectivity to the Internet the gyroscope signal data is conveyed as an email attachment for post-processing at a remote location. Software automation consolidates the gyroscope signal data of hemiplegic gait to a feature set for machine learning classification. With the application of a multilayer perceptron neural network considerable classification accuracy is attained for distinguishing between the affected leg and unaffected leg of hemiplegic gait. Future implications of the successful implementation of a smartphone as a wearable and wireless gyroscope for machine learning classification of hemiplegic gait through a multilayer perceptron neural network elucidate pathways to highly optimized therapy through machine learning with the potential for patients to reside remote from their therapist.
智能手机代表了一种可穿戴的无线系统,有可能对生物医学和医疗保健行业产生变革性的影响。智能手机的一个固有特征是陀螺仪传感器,通过一个软件应用,智能手机可以作为一个可穿戴的无线陀螺仪平台。由此产生的陀螺仪数据记录呈现出临床可识别的信号,该信号已被成功地证明可以量化人体运动特征的各个方面,如髌骨肌腱反射。步态是人类运动的另一个相关特征,可以很容易地通过智能手机作为可穿戴和无线陀螺仪平台进行量化。研究目标是基于智能手机作为可穿戴无线陀螺仪平台,通过机器学习分类,区分偏瘫步态中受损腿和未受损腿。应用单个智能手机量化偏瘫步态。智能手机首先安装在受影响的腿上,然后安装在未受影响的腿上,通过跑步机将速度限制在恒定速度。通过与互联网的无线连接,陀螺仪信号数据作为电子邮件附件传送,以便在远程位置进行后处理。软件自动化将偏瘫步态的陀螺仪信号数据整合为特征集,用于机器学习分类。应用多层感知器神经网络对偏瘫步态的影响腿和未影响腿进行分类,获得了较高的分类精度。通过多层感知器神经网络成功实现智能手机作为可穿戴和无线陀螺仪,用于偏瘫步态的机器学习分类,这对未来的影响阐明了通过机器学习实现高度优化治疗的途径,并有可能使患者远离他们的治疗师。
{"title":"Implementation of a Smartphone as a Wearable and Wireless Gyroscope Platform for Machine Learning Classification of Hemiplegic Gait Through a Multilayer Perceptron Neural Network","authors":"R. LeMoyne, Timothy Mastroianni","doi":"10.1109/ICMLA.2018.00153","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00153","url":null,"abstract":"The smartphone represents a wearable and wireless system with the potential to have transformative influence on the biomedical and healthcare industry. An intrinsic feature of the smartphone is a gyroscope sensor, for which with a software application the smartphone functions as a wearable and wireless gyroscope platform. The resultant gyroscope data recording presents a clinical recognizable signal, which has been successful demonstrated to quantify aspects of human movement characteristics, such as the patellar tendon reflex. Gait another associated feature of human movement can be readily quantified by a smartphone functioning as a wearable and wireless gyroscope platform. The research objective is to distinguish between an affected leg and unaffected leg during hemiplegic gait based on a smartphone functioning as a wearable and wireless gyroscope platform though machine learning classification. A single smartphone is applied to quantify hemiplegic gait. The smartphone is first mounted to the affected leg and then the unaffected leg with velocity constrained to a constant velocity by a treadmill. Through wireless connectivity to the Internet the gyroscope signal data is conveyed as an email attachment for post-processing at a remote location. Software automation consolidates the gyroscope signal data of hemiplegic gait to a feature set for machine learning classification. With the application of a multilayer perceptron neural network considerable classification accuracy is attained for distinguishing between the affected leg and unaffected leg of hemiplegic gait. Future implications of the successful implementation of a smartphone as a wearable and wireless gyroscope for machine learning classification of hemiplegic gait through a multilayer perceptron neural network elucidate pathways to highly optimized therapy through machine learning with the potential for patients to reside remote from their therapist.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"62 1","pages":"946-950"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85197097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Convolutional Neural Networks for Automatic Threat Detection in Security X-Ray Images 基于卷积神经网络的安全x射线图像威胁自动检测
Trevor Morris, Tiffany Chien, Eric L. Goodman
In this paper we apply Convolutional Neural Networks (CNNs) to the task of automatic threat detection, specifically conventional explosives, in security X-ray scans of passenger baggage. We present the first results of utilizing CNNs for explosives detection, and introduce a dataset, the Passenger Baggage Object Database (PBOD), which can be used by researchers to develop new threat detection algorithms. Using state-of-the-art CNN models and taking advantage of the properties of the Xray scanner, we achieve reliable detection of threats, with the best model achieving an AUC of the ROC of 0.95. We also explore heatmaps as a visualization of the location of the threat.
在本文中,我们将卷积神经网络(cnn)应用于旅客行李安全x射线扫描中的自动威胁检测任务,特别是常规爆炸物。我们提出了利用cnn进行爆炸物检测的第一个结果,并介绍了一个数据集,乘客行李对象数据库(PBOD),研究人员可以使用它来开发新的威胁检测算法。使用最先进的CNN模型并利用x射线扫描仪的特性,我们实现了对威胁的可靠检测,最佳模型的ROC AUC为0.95。我们还探索了热图作为威胁位置的可视化。
{"title":"Convolutional Neural Networks for Automatic Threat Detection in Security X-Ray Images","authors":"Trevor Morris, Tiffany Chien, Eric L. Goodman","doi":"10.1109/ICMLA.2018.00049","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00049","url":null,"abstract":"In this paper we apply Convolutional Neural Networks (CNNs) to the task of automatic threat detection, specifically conventional explosives, in security X-ray scans of passenger baggage. We present the first results of utilizing CNNs for explosives detection, and introduce a dataset, the Passenger Baggage Object Database (PBOD), which can be used by researchers to develop new threat detection algorithms. Using state-of-the-art CNN models and taking advantage of the properties of the Xray scanner, we achieve reliable detection of threats, with the best model achieving an AUC of the ROC of 0.95. We also explore heatmaps as a visualization of the location of the threat.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"8 1","pages":"285-292"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81898814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Lorenz Chaotic System Artificial Neural Network Training with Single Time Series Input and Multiple Time Series Outputs for EEG Prediction 单时间序列输入和多时间序列输出的Lorenz混沌系统人工神经网络训练用于脑电预测
Lei Zhang
The goal of this research is to develop an efficient artificial neural network (ANN) architecture to predict three chaotic time series outputs for Lorenz system using single time series input. The training performances are evaluated and compared for different ANN architectures with multiple hidden layers, as well as for input data with different combination of time series, including the first and second order differences of the time series. It is found that given the same ANN architecture, the training results of multiple time series outputs using single time series (x) input are much worse than those using multiple time series inputs. However, the training results can be improved significantly by increasing the number of ANN hidden layers up to 3; and marginally improved by adding the first and second order differences of the x time series, as well as adding steps for calculating the first and second order differences of the input time series.
本研究的目标是开发一种高效的人工神经网络(ANN)架构,以预测使用单一时间序列输入的洛伦兹系统的三个混沌时间序列输出。对具有多个隐藏层的不同神经网络结构以及不同时间序列组合的输入数据的训练性能进行了评估和比较,包括时间序列的一阶和二阶差异。研究发现,在相同的人工神经网络架构下,使用单个时间序列(x)输入的多时间序列输出的训练结果要比使用多个时间序列输入的训练结果差得多。然而,将人工神经网络隐藏层的数量增加到3层可以显著改善训练结果;并通过增加x时间序列的一阶和二阶差分,以及增加计算输入时间序列的一阶和二阶差分的步骤,略微改进。
{"title":"Lorenz Chaotic System Artificial Neural Network Training with Single Time Series Input and Multiple Time Series Outputs for EEG Prediction","authors":"Lei Zhang","doi":"10.1109/ICMLA.2018.00221","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00221","url":null,"abstract":"The goal of this research is to develop an efficient artificial neural network (ANN) architecture to predict three chaotic time series outputs for Lorenz system using single time series input. The training performances are evaluated and compared for different ANN architectures with multiple hidden layers, as well as for input data with different combination of time series, including the first and second order differences of the time series. It is found that given the same ANN architecture, the training results of multiple time series outputs using single time series (x) input are much worse than those using multiple time series inputs. However, the training results can be improved significantly by increasing the number of ANN hidden layers up to 3; and marginally improved by adding the first and second order differences of the x time series, as well as adding steps for calculating the first and second order differences of the input time series.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"149 1","pages":"1358-1365"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79423964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What are they Researching? Examining Industry-Based Doctoral Dissertation Research through the Lens of Machine Learning 他们在研究什么?从机器学习的角度审视基于行业的博士论文研究
Ion Freeman, Ashley Haigler, Suzanna E. Schmeelk, Lisa Ellrodt, Tonya Fields
This paper examines industry-based doctoral dissertation research in a professional computing doctoral program for full time working professionals through the lens of different machine learning algorithms to understand topics explored by full time working industry professionals. This research paper examines machine learning algorithms and the IBM Watson Discovery machine learning tool to categorize dissertation research topics defended at Pace University. The research provides insights into differences in machine learning algorithm categorization using natural language processing.
本文通过不同的机器学习算法,考察了全职工作专业人员的专业计算博士课程中基于行业的博士论文研究,以理解全职工作行业专业人员探索的主题。本研究论文考察了机器学习算法和IBM Watson Discovery机器学习工具,以对佩斯大学辩护的论文研究主题进行分类。该研究提供了使用自然语言处理的机器学习算法分类差异的见解。
{"title":"What are they Researching? Examining Industry-Based Doctoral Dissertation Research through the Lens of Machine Learning","authors":"Ion Freeman, Ashley Haigler, Suzanna E. Schmeelk, Lisa Ellrodt, Tonya Fields","doi":"10.1109/ICMLA.2018.00217","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00217","url":null,"abstract":"This paper examines industry-based doctoral dissertation research in a professional computing doctoral program for full time working professionals through the lens of different machine learning algorithms to understand topics explored by full time working industry professionals. This research paper examines machine learning algorithms and the IBM Watson Discovery machine learning tool to categorize dissertation research topics defended at Pace University. The research provides insights into differences in machine learning algorithm categorization using natural language processing.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"58 1","pages":"1338-1340"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84353855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Constrained Sparse Dynamic Time Warping 约束稀疏动态时间翘曲
Youngha Hwang, S. Gelfand
Dynamic time warping (DTW) has been applied to a wide range of machine learning problems involving the comparison of time series. An important feature of such time series is that they can sometimes be sparse in the sense that the data takes zero value at many epochs. This corresponds for example to quiet periods in speech or to a lack of physical activity. However, employing conventional DTW for such sparse time series runs a full search ignoring the zero data. So a fast dynamic time warping algorithm that is exactly equivalent to DTW was developed for the unconstrained case where there is no global constraint on the permissible warping path. It was called sparse dynamic time warping (SDTW). In this paper we focus on the development and analysis of a fast dynamic time warping algorithm for the constrained case where there is a global constraint on the permissible warping path, specifically limit the width along the diagonal of the permissible path domain. We call this constrained sparse dynamic time warping (CSDTW). A careful formulation and analysis are performed to determine exactly how CSDTW should treat the zero data. It is shown that CSDTW reduces the computational complexity relative to constrained DTW by about three times the sparsity ratio, which is defined as the arithmetic mean of the fraction of non-zero's in the two time series. Numerical experiments confirm the speed advantage of CSDTW relative to constrained DTW for sparse time series with sparsity ratio up to 0.2-0.3. This study provides a benchmark and also background to potentially understand how to exploit such sparsity when the underlying time series is approximated to reduce complexity.
动态时间翘曲(DTW)已广泛应用于涉及时间序列比较的机器学习问题。这种时间序列的一个重要特征是它们有时可能是稀疏的,即数据在许多epoch取零值。例如,这与说话时的安静期或缺乏身体活动相对应。然而,对于这样的稀疏时间序列,使用传统的DTW会忽略零数据进行完整的搜索。因此,针对允许翘曲路径不存在全局约束的无约束情况,提出了一种与DTW完全等价的快速动态时间翘曲算法。它被称为稀疏动态时间翘曲(SDTW)。本文重点研究了一种快速动态时间翘曲算法的开发和分析,该算法在允许翘曲路径存在全局约束的情况下,即限制了允许翘曲路径域沿对角线的宽度。我们称之为约束稀疏动态时间规整(CSDTW)。进行了仔细的表述和分析,以确定CSDTW应该如何处理零数据。结果表明,相对于约束DTW, CSDTW将计算复杂度降低了约3倍的稀疏度比,稀疏度比定义为两个时间序列中非零分数的算术平均值。数值实验证实,对于稀疏度比在0.2 ~ 0.3之间的稀疏时间序列,CSDTW相对于约束DTW具有速度优势。这项研究提供了一个基准和背景,以潜在地理解如何在近似底层时间序列时利用这种稀疏性来降低复杂性。
{"title":"Constrained Sparse Dynamic Time Warping","authors":"Youngha Hwang, S. Gelfand","doi":"10.1109/ICMLA.2018.00039","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00039","url":null,"abstract":"Dynamic time warping (DTW) has been applied to a wide range of machine learning problems involving the comparison of time series. An important feature of such time series is that they can sometimes be sparse in the sense that the data takes zero value at many epochs. This corresponds for example to quiet periods in speech or to a lack of physical activity. However, employing conventional DTW for such sparse time series runs a full search ignoring the zero data. So a fast dynamic time warping algorithm that is exactly equivalent to DTW was developed for the unconstrained case where there is no global constraint on the permissible warping path. It was called sparse dynamic time warping (SDTW). In this paper we focus on the development and analysis of a fast dynamic time warping algorithm for the constrained case where there is a global constraint on the permissible warping path, specifically limit the width along the diagonal of the permissible path domain. We call this constrained sparse dynamic time warping (CSDTW). A careful formulation and analysis are performed to determine exactly how CSDTW should treat the zero data. It is shown that CSDTW reduces the computational complexity relative to constrained DTW by about three times the sparsity ratio, which is defined as the arithmetic mean of the fraction of non-zero's in the two time series. Numerical experiments confirm the speed advantage of CSDTW relative to constrained DTW for sparse time series with sparsity ratio up to 0.2-0.3. This study provides a benchmark and also background to potentially understand how to exploit such sparsity when the underlying time series is approximated to reduce complexity.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"38 1","pages":"216-222"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82307584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
GAN-Based Super Resolution for Accurate 3D Surface Reconstruction from Light Field Skin Images Towards Haptic Palpation 基于gan的从光场皮肤图像到触觉触诊的精确3D表面重建的超分辨率
Myeongseob Ko, Donghyun Kim, Kwangtaek Kim
The development of vision technology for observation of skin surface and diagnosis of skin disease for preventing secondary infections caused by direct skin touch has consistently been in the medical field spotlight. Many studies have been conducted to acquire three dimensional (3D) data through stereo images, multiple images, and lasers because (3D) data of in-vivo skin image is essential for accurate medical diagnosis. However, stereo vision systems or 3D laser systems for obtaining 3D information require high cost and have high computational complexity, and hence they have not been used universally. Additionally, the use of such systems is still not preferred in the medical field due to limitations on visual decision making. Therefore, a haptic diagnosis system that can blend vision information from a camera and palpation information from a dermatologist has been considered. In this study, we propose a 3D skin surface reconstruction method using a light field camera for haptic rendering and palpation. To achieve this goal, we addressed the low resolution problem, which has been consistently present in light field cameras, through the generative adversarial nets (GANs)-based super resolution method, and exploited the light field system which has been applied only to the object scene for obtaining 3D skin surface texture. Experimental results show that the method proposed in this study is promising and offers sufficient potential for haptic diagnosis.
利用视觉技术对皮肤表面进行观察和诊断,预防皮肤直接接触引起的继发感染,一直是医学界关注的焦点。由于活体皮肤图像的三维数据对于准确的医学诊断至关重要,因此许多研究通过立体图像、多图像和激光获取三维数据。然而,用于获取三维信息的立体视觉系统或三维激光系统成本高、计算复杂度高,尚未得到普遍应用。此外,由于视觉决策的限制,在医疗领域使用这种系统仍然不是首选。因此,我们考虑了一种融合了相机视觉信息和皮肤科医生触诊信息的触觉诊断系统。在这项研究中,我们提出了一种使用光场相机进行触觉渲染和触诊的3D皮肤表面重建方法。为了实现这一目标,我们通过基于生成对抗网络(GANs)的超分辨率方法解决了光场相机一直存在的低分辨率问题,并利用仅应用于物体场景的光场系统来获取3D皮肤表面纹理。实验结果表明,该方法在触觉诊断中具有广阔的应用前景。
{"title":"GAN-Based Super Resolution for Accurate 3D Surface Reconstruction from Light Field Skin Images Towards Haptic Palpation","authors":"Myeongseob Ko, Donghyun Kim, Kwangtaek Kim","doi":"10.1109/ICMLA.2018.00065","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00065","url":null,"abstract":"The development of vision technology for observation of skin surface and diagnosis of skin disease for preventing secondary infections caused by direct skin touch has consistently been in the medical field spotlight. Many studies have been conducted to acquire three dimensional (3D) data through stereo images, multiple images, and lasers because (3D) data of in-vivo skin image is essential for accurate medical diagnosis. However, stereo vision systems or 3D laser systems for obtaining 3D information require high cost and have high computational complexity, and hence they have not been used universally. Additionally, the use of such systems is still not preferred in the medical field due to limitations on visual decision making. Therefore, a haptic diagnosis system that can blend vision information from a camera and palpation information from a dermatologist has been considered. In this study, we propose a 3D skin surface reconstruction method using a light field camera for haptic rendering and palpation. To achieve this goal, we addressed the low resolution problem, which has been consistently present in light field cameras, through the generative adversarial nets (GANs)-based super resolution method, and exploited the light field system which has been applied only to the object scene for obtaining 3D skin surface texture. Experimental results show that the method proposed in this study is promising and offers sufficient potential for haptic diagnosis.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"227 1","pages":"392-397"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80172944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature Extraction Using Apparent Power and Real Power for Smart Home Data Classification 基于视在功率和实际功率的智能家居数据分类特征提取
V. Vadakattu, S. Suthaharan
The goal of this paper is to perform an experimental research and show that simple statistical predictors can reveal usage patterns of the electrical appliances from smart meter and sensor readings. We used an open data set of Smart* project and its real power and apparent power variability to accomplish this goal. We generated the predictors using block-based statistical information of the real power and apparent power associated with each appliance class type. We constructed five machine learning models using these predictors and evaluated them using random forest classification and the qualitative measures – classification accuracy, out-of-bag error, and misclassification error. Our finding is that the simple statistical predictors that reveal smart home occupants appliance usage patterns and energy consumption details can be obtained through smart home data analytics. Our finding includes that the statistical predictors generated from apparent power can improve the accuracy of the significantly-imbalanced smart home data classification.
本文的目的是进行一项实验研究,并表明简单的统计预测可以从智能电表和传感器读数中揭示电器的使用模式。我们使用了Smart*项目的开放数据集及其实际功率和视在功率变异性来实现这一目标。我们使用基于块的实际功率和视在功率的统计信息来生成预测器,这些信息与每个电器类别类型相关。我们使用这些预测因子构建了5个机器学习模型,并使用随机森林分类和定性指标(分类精度、袋外误差和误分类误差)对它们进行了评估。我们的发现是,通过智能家居数据分析,可以获得揭示智能家居居住者家电使用模式和能源消耗细节的简单统计预测。我们的发现包括,视在功率产生的统计预测因子可以提高显著不平衡的智能家居数据分类的准确性。
{"title":"Feature Extraction Using Apparent Power and Real Power for Smart Home Data Classification","authors":"V. Vadakattu, S. Suthaharan","doi":"10.1109/ICMLA.2018.00209","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00209","url":null,"abstract":"The goal of this paper is to perform an experimental research and show that simple statistical predictors can reveal usage patterns of the electrical appliances from smart meter and sensor readings. We used an open data set of Smart* project and its real power and apparent power variability to accomplish this goal. We generated the predictors using block-based statistical information of the real power and apparent power associated with each appliance class type. We constructed five machine learning models using these predictors and evaluated them using random forest classification and the qualitative measures – classification accuracy, out-of-bag error, and misclassification error. Our finding is that the simple statistical predictors that reveal smart home occupants appliance usage patterns and energy consumption details can be obtained through smart home data analytics. Our finding includes that the statistical predictors generated from apparent power can improve the accuracy of the significantly-imbalanced smart home data classification.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"2 1","pages":"1290-1295"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80752821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1