首页 > 最新文献

2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
Multiple Kernel Learning Using Sparse Representation 基于稀疏表示的多核学习
N. Klausner, M. Azimi-Sadjadi
This paper introduces a kernel machine for multiclass discrimination where the scoring function for each class is constructed using a linear combination over a predefined diverse library of kernel functions. The scoring function is built using an expanded set of the kernel library hence increasing the number of degrees of freedom to analyze the information content of each data sample. To choose the smallest set of kernels that best match desirable first-order moment properties of the class-conditional distribution a regularized linear least-squares problem is solved. The proposed multi-kernel machine is then demonstrated and benchmarked against similar techniques which rely on the use of a single kernel using a satellite imagery dataset for the purposes of discriminating among several vegetation and soil types.
本文介绍了一种用于多类判别的核机,其中每个类的评分函数使用预定义的多种核函数库上的线性组合来构造。评分函数使用扩展的内核库集构建,从而增加了分析每个数据样本信息内容的自由度。为了选择最符合类条件分布一阶矩特性的最小核集,解决了一个正则化线性最小二乘问题。然后对所提出的多核机器进行了演示,并对类似的技术进行了基准测试,这些技术依赖于使用单个核,使用卫星图像数据集来区分几种植被和土壤类型。
{"title":"Multiple Kernel Learning Using Sparse Representation","authors":"N. Klausner, M. Azimi-Sadjadi","doi":"10.1109/ICMLA.2017.00-79","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-79","url":null,"abstract":"This paper introduces a kernel machine for multiclass discrimination where the scoring function for each class is constructed using a linear combination over a predefined diverse library of kernel functions. The scoring function is built using an expanded set of the kernel library hence increasing the number of degrees of freedom to analyze the information content of each data sample. To choose the smallest set of kernels that best match desirable first-order moment properties of the class-conditional distribution a regularized linear least-squares problem is solved. The proposed multi-kernel machine is then demonstrated and benchmarked against similar techniques which rely on the use of a single kernel using a satellite imagery dataset for the purposes of discriminating among several vegetation and soil types.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"78 1","pages":"695-700"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78218681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Medicare Fraud Detection Using Machine Learning Methods 使用机器学习方法检测医疗保险欺诈
Richard A. Bauder, T. Khoshgoftaar
Healthcare is an integral component in people’s lives, especially for the rising elderly population, and must be affordable. Medicare is one such healthcare program. Claims fraud is a major contributor to increased healthcare costs, but its impact can be lessened through fraud detection. In this paper, we compare several machine learning methods to detect Medicare fraud. We perform a comparative study with supervised, unsupervised, and hybrid machine learning approaches using four performance metrics and class imbalance reduction via oversampling and an 80-20 undersampling method. We group the 2015 Medicare data into provider types, with fraud labels from the List of Excluded Individuals/Entities database. Our results show that the successful detection of fraudulent providers is possible, with the 80-20 sampling method demonstrating the best performance across the learners. Furthermore, supervised methods performed better than unsupervised or hybrid methods, but these results varied based on the class imbalance sampling technique and provider type.
医疗保健是人们生活中不可或缺的组成部分,特别是对不断增加的老年人口来说,必须负担得起。医疗保险就是这样一个医疗保健项目。索赔欺诈是增加医疗保健成本的一个主要因素,但可以通过欺诈检测来减轻其影响。在本文中,我们比较了几种机器学习方法来检测医疗保险欺诈。我们对有监督、无监督和混合机器学习方法进行了比较研究,使用了四个性能指标,并通过过采样和80-20欠采样方法减少了类不平衡。我们将2015年医疗保险数据分组为提供者类型,并使用排除个人/实体数据库列表中的欺诈标签。我们的结果表明,欺诈性提供者的成功检测是可能的,80-20抽样方法在学习器中展示了最佳性能。此外,监督方法比非监督方法或混合方法表现得更好,但这些结果因类不平衡采样技术和提供者类型而异。
{"title":"Medicare Fraud Detection Using Machine Learning Methods","authors":"Richard A. Bauder, T. Khoshgoftaar","doi":"10.1109/ICMLA.2017.00-48","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-48","url":null,"abstract":"Healthcare is an integral component in people’s lives, especially for the rising elderly population, and must be affordable. Medicare is one such healthcare program. Claims fraud is a major contributor to increased healthcare costs, but its impact can be lessened through fraud detection. In this paper, we compare several machine learning methods to detect Medicare fraud. We perform a comparative study with supervised, unsupervised, and hybrid machine learning approaches using four performance metrics and class imbalance reduction via oversampling and an 80-20 undersampling method. We group the 2015 Medicare data into provider types, with fraud labels from the List of Excluded Individuals/Entities database. Our results show that the successful detection of fraudulent providers is possible, with the 80-20 sampling method demonstrating the best performance across the learners. Furthermore, supervised methods performed better than unsupervised or hybrid methods, but these results varied based on the class imbalance sampling technique and provider type.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"83 1","pages":"858-865"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77642549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 60
Malicious Software Classification Using Transfer Learning of ResNet-50 Deep Neural Network 基于ResNet-50深度神经网络迁移学习的恶意软件分类
Edmar R. S. Rezende, Guilherme C. S. Ruppert, T. Carvalho, F. Ramos, Paulo Lício de Geus
Malicious software (malware) has been extensively used for illegal activity and new malware variants are discovered at an alarmingly high rate. The ability to group malware variants into families with similar characteristics makes possible to create mitigation strategies that work for a whole class of programs. In this paper, we present a malware family classification approach using a deep neural network based on the ResNet-50 architecture. Malware samples are represented as byteplot grayscale images and a deep neural network is trained freezing the convolutional layers of ResNet-50 pre-trained on the ImageNet dataset and adapting the last layer to malware family classification. The experimental results on a dataset comprising 9,339 samples from 25 different families showed that our approach can effectively be used to classify malware families with an accuracy of 98.62%.
恶意软件(恶意软件)已被广泛用于非法活动,新的恶意软件变种被发现的速度高得惊人。将恶意软件变体分成具有相似特征的家族的能力,使得创建适用于整个程序类的缓解策略成为可能。在本文中,我们提出了一种基于ResNet-50架构的深度神经网络恶意软件家族分类方法。恶意软件样本被表示为字节图灰度图像,深度神经网络被训练,冻结在ImageNet数据集上预训练的ResNet-50的卷积层,并使最后一层适应恶意软件家族分类。在包含25个不同家族的9339个样本的数据集上的实验结果表明,我们的方法可以有效地用于恶意软件家族的分类,准确率为98.62%。
{"title":"Malicious Software Classification Using Transfer Learning of ResNet-50 Deep Neural Network","authors":"Edmar R. S. Rezende, Guilherme C. S. Ruppert, T. Carvalho, F. Ramos, Paulo Lício de Geus","doi":"10.1109/ICMLA.2017.00-19","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-19","url":null,"abstract":"Malicious software (malware) has been extensively used for illegal activity and new malware variants are discovered at an alarmingly high rate. The ability to group malware variants into families with similar characteristics makes possible to create mitigation strategies that work for a whole class of programs. In this paper, we present a malware family classification approach using a deep neural network based on the ResNet-50 architecture. Malware samples are represented as byteplot grayscale images and a deep neural network is trained freezing the convolutional layers of ResNet-50 pre-trained on the ImageNet dataset and adapting the last layer to malware family classification. The experimental results on a dataset comprising 9,339 samples from 25 different families showed that our approach can effectively be used to classify malware families with an accuracy of 98.62%.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"61 26 1","pages":"1011-1014"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78856722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 179
An Ensembled RBF Extreme Learning Machine to Forecast Road Surface Temperature 一种集成RBF极限学习机预测路面温度
Bo Liu, Shuo Yan, Huanling You, Yan Dong, Jianqiang Li, Yong Li, Jianlei Lang, Rentao Gu
At present, high road surface temperature (RST) is threatening the safety of expressway transportation. It can lead to accidents and damages to road, accordingly, people have paid more attention to RST forecasting. Numerical methods on RST prediction are often hard to obtain precise parameters, whereas statistical methods cannot achieve desired accuracy. To address these problems, this paper proposes GBELM-RBF method that utilizes gradient boosting to ensemble Radial Basis Function Extreme Learning Machine. To evaluate the performance of the proposed method, GBELM-RBF is compared with other ELM algorithms on the datasets of airport expressway and Badaling expressway during November 2012 and September 2014. The root mean squared error (RMSE), accuracy and Pearson Correlation Coefficient (PCC) of these methods are analyzed. The experimental results show that GBELM-RBF has the best performance. For airport expressway dataset, the RMSE is less than 3, the accuracy is 78.8% and PCC is 0.94. For Badaling expressway dataset, the RMSE is less than 3, the accuracy is 81.2% and PCC is 0.921.
目前,路面温度过高正威胁着高速公路的交通安全。它会导致交通事故和对道路的破坏,因此RST的预测越来越受到人们的重视。数值方法预测RST往往难以获得精确的参数,而统计方法则无法达到预期的精度。针对这些问题,本文提出了利用梯度增强集成径向基函数极限学习机的GBELM-RBF方法。为了评价该方法的性能,将GBELM-RBF与其他ELM算法在2012年11月和2014年9月的机场高速公路和八达岭高速公路数据集上进行了比较。对这些方法的均方根误差(RMSE)、准确度和Pearson相关系数(PCC)进行了分析。实验结果表明,GBELM-RBF具有较好的性能。对于机场高速公路数据集,RMSE小于3,准确率为78.8%,PCC为0.94。对于八达岭高速公路数据,RMSE小于3,准确率为81.2%,PCC为0.921。
{"title":"An Ensembled RBF Extreme Learning Machine to Forecast Road Surface Temperature","authors":"Bo Liu, Shuo Yan, Huanling You, Yan Dong, Jianqiang Li, Yong Li, Jianlei Lang, Rentao Gu","doi":"10.1109/ICMLA.2017.00-26","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-26","url":null,"abstract":"At present, high road surface temperature (RST) is threatening the safety of expressway transportation. It can lead to accidents and damages to road, accordingly, people have paid more attention to RST forecasting. Numerical methods on RST prediction are often hard to obtain precise parameters, whereas statistical methods cannot achieve desired accuracy. To address these problems, this paper proposes GBELM-RBF method that utilizes gradient boosting to ensemble Radial Basis Function Extreme Learning Machine. To evaluate the performance of the proposed method, GBELM-RBF is compared with other ELM algorithms on the datasets of airport expressway and Badaling expressway during November 2012 and September 2014. The root mean squared error (RMSE), accuracy and Pearson Correlation Coefficient (PCC) of these methods are analyzed. The experimental results show that GBELM-RBF has the best performance. For airport expressway dataset, the RMSE is less than 3, the accuracy is 78.8% and PCC is 0.94. For Badaling expressway dataset, the RMSE is less than 3, the accuracy is 81.2% and PCC is 0.921.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"65 1","pages":"977-980"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87002288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Unsupervised Anomaly Detection for Digital Radio Frequency Transmissions 数字射频传输的无监督异常检测
Michael Walton, M. Ayache, Logan Straatemeier, Daniel Gebhardt, Benjamin Migliori
We present a novel method of unsupervised anomaly detection using long-short-term memory mixture density networks (LSTM-MDN), applied to timeseries data of digital radio transmissions. The modern radio frequency (RF) environment is a dynamic and ever-changing complex milieu of signals, environmental effects, unintentional interference, and intentional jamming. A consequence of this complex mix is that RF receivers must become better and better at rejecting anomalous signals in order to recover the transmitted information. However, it is not always possible to know a priori what constitutes a valid signal and what constitutes an anomaly (intentional or otherwise), especially with the adoption of cognitive radio techniques. We show that an LSTM-MDN model is able to rapidly learn the training set and produce probability distribution functions for the expected signal as a function of time. We then demonstrate that the negative log likelihood of an incoming test transmission, conditioned on the training set, provides a metric that allows anomalous signals to be detected and labeled. We demonstrate this method for eight popular modulations and for three different anomaly types. By applying unsupervised learning in the temporal domain, we report a fully-generalizable anomaly detection method that may be applied to signals for which the transmission parameters may be unknown or obscured.
提出了一种基于长短期记忆混合密度网络(LSTM-MDN)的无监督异常检测方法,并将其应用于数字无线电传输的时间序列数据。现代射频(RF)环境是一个动态的、不断变化的复杂信号环境,环境影响、无意干扰和故意干扰。这种复杂混合的结果是射频接收器必须越来越好地拒绝异常信号,以便恢复传输的信息。然而,并不总是有可能先验地知道什么构成有效信号,什么构成异常(有意或无意),特别是采用认知无线电技术。我们证明了LSTM-MDN模型能够快速学习训练集并产生期望信号作为时间函数的概率分布函数。然后,我们证明了输入测试传输的负对数似然,以训练集为条件,提供了一个允许检测和标记异常信号的度量。我们对八种常见的调制和三种不同的异常类型演示了这种方法。通过在时域中应用无监督学习,我们报告了一种完全可推广的异常检测方法,该方法可以应用于传输参数未知或模糊的信号。
{"title":"Unsupervised Anomaly Detection for Digital Radio Frequency Transmissions","authors":"Michael Walton, M. Ayache, Logan Straatemeier, Daniel Gebhardt, Benjamin Migliori","doi":"10.1109/ICMLA.2017.00-54","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-54","url":null,"abstract":"We present a novel method of unsupervised anomaly detection using long-short-term memory mixture density networks (LSTM-MDN), applied to timeseries data of digital radio transmissions. The modern radio frequency (RF) environment is a dynamic and ever-changing complex milieu of signals, environmental effects, unintentional interference, and intentional jamming. A consequence of this complex mix is that RF receivers must become better and better at rejecting anomalous signals in order to recover the transmitted information. However, it is not always possible to know a priori what constitutes a valid signal and what constitutes an anomaly (intentional or otherwise), especially with the adoption of cognitive radio techniques. We show that an LSTM-MDN model is able to rapidly learn the training set and produce probability distribution functions for the expected signal as a function of time. We then demonstrate that the negative log likelihood of an incoming test transmission, conditioned on the training set, provides a metric that allows anomalous signals to be detected and labeled. We demonstrate this method for eight popular modulations and for three different anomaly types. By applying unsupervised learning in the temporal domain, we report a fully-generalizable anomaly detection method that may be applied to signals for which the transmission parameters may be unknown or obscured.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"28 1","pages":"826-832"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86145491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
RBF-FIRMLP Architecture for Digit Recognition 数字识别的RBF-FIRMLP体系结构
Cristinel Codrescu
The finite impulse response multilayer perceptron (FIRMLP) is a multilayer perceptron where the static weights have been replaced by finite impulse response filters. Hereby, it represents a model for spatio-temporal processing. In this paper we present a temporal processing neural network which is based on the FIRMLP, but some layers have been replaced by temporal radial basis function (RBF) units. As training algorithm we used the temporal backpropagation not just for adapting the weights but also for finding the centers and widths of the RBF layers as well. The performance comparison have been done for the task of handwritten digit ecognition by using the MNIST and MNIST-Variations databases.
有限脉冲响应多层感知器(FIRMLP)是一种用有限脉冲响应滤波器代替静态权重的多层感知器。因此,它代表了一个时空处理模型。本文提出了一种基于FIRMLP的时间处理神经网络,但其中一些层被时间径向基函数(RBF)单元所取代。作为训练算法,我们不仅使用时间反向传播来调整权重,而且还用于寻找RBF层的中心和宽度。用MNIST和MNIST- variation数据库对手写数字识别任务进行了性能比较。
{"title":"RBF-FIRMLP Architecture for Digit Recognition","authors":"Cristinel Codrescu","doi":"10.1109/ICMLA.2017.0-125","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-125","url":null,"abstract":"The finite impulse response multilayer perceptron (FIRMLP) is a multilayer perceptron where the static weights have been replaced by finite impulse response filters. Hereby, it represents a model for spatio-temporal processing. In this paper we present a temporal processing neural network which is based on the FIRMLP, but some layers have been replaced by temporal radial basis function (RBF) units. As training algorithm we used the temporal backpropagation not just for adapting the weights but also for finding the centers and widths of the RBF layers as well. The performance comparison have been done for the task of handwritten digit ecognition by using the MNIST and MNIST-Variations databases.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"62 1","pages":"420-425"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76212335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Anytime Exploitation of Stragglers in Synchronous Stochastic Gradient Descent 同步随机梯度下降中离散机的随时开发
Nuwan S. Ferdinand, Benjamin Gharachorloo, S. Draper
In this paper we propose an approach to parallelizing synchronous stochastic gradient descent (SGD) that we term “Anytime-Gradients”. The Anytime-Gradients is designed to exploit the work completed by slow compute nodes or “stragglers”. In many approaches work completed by these nodes, while only partial, is discarded completely. To maintain synchronization in our approach, each computational epoch is of fixed duration, and at the end of each epoch, workers send updated parameter vectors to a master mode for combination. The master weights each update by the amount of work done. The Anytime-Gradients scheme is robust to both persistent and non-persistent stragglers and requires no prior knowledge about processor abilities. We show that the scheme effectively exploits stragglers and outperforms existing methods.
在本文中,我们提出了一种并行化同步随机梯度下降(SGD)的方法,我们称之为“任意时间梯度”。任意时间梯度的设计是为了利用缓慢的计算节点或“掉队者”完成的工作。在许多方法中,这些节点完成的工作虽然只是部分完成,但被完全丢弃。为了在我们的方法中保持同步,每个计算历元都是固定的持续时间,并且在每个历元结束时,工作人员将更新的参数向量发送到主模式进行组合。主服务器根据完成的工作量对每次更新进行加权。Anytime-Gradients方案对持久性和非持久性掉队者都具有鲁棒性,并且不需要事先了解处理器的能力。我们证明了该方案有效地利用了离散子,并且优于现有的方法。
{"title":"Anytime Exploitation of Stragglers in Synchronous Stochastic Gradient Descent","authors":"Nuwan S. Ferdinand, Benjamin Gharachorloo, S. Draper","doi":"10.1109/ICMLA.2017.0-166","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-166","url":null,"abstract":"In this paper we propose an approach to parallelizing synchronous stochastic gradient descent (SGD) that we term “Anytime-Gradients”. The Anytime-Gradients is designed to exploit the work completed by slow compute nodes or “stragglers”. In many approaches work completed by these nodes, while only partial, is discarded completely. To maintain synchronization in our approach, each computational epoch is of fixed duration, and at the end of each epoch, workers send updated parameter vectors to a master mode for combination. The master weights each update by the amount of work done. The Anytime-Gradients scheme is robust to both persistent and non-persistent stragglers and requires no prior knowledge about processor abilities. We show that the scheme effectively exploits stragglers and outperforms existing methods.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"42 1","pages":"141-146"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77637100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
An Exploratory Study of Oral and Dental Health in Canada 加拿大口腔和牙齿健康的探索性研究
Andrei Belcin, Sean L. A. Floyd, A. Asiri, H. Viktor
Healthcare practitioners agree that good oral health is a critical indicator of general health and wellness of a population. The lack of access to mandatory coverage for common issues such as cavities and non-surgical periodontal care often lead not only to medical problems, but also to loss of productivity. This trend is especially evident for older individuals and lower-income families. This paper discusses the results of our exploration of the annual Canadian Community Health Survey (CCHS), in order to further study the interplay between socio-economic factors and oral and dental health. To this end, we present the results when applying a number of machine learning algorithms to a CCHS data mart. Our results reaffirm that individuals' levels and sources of income are strong indicators of the number of dental visits per year. In addition, we found that younger adults and youth, who usually live in larger households, visit the dentist less frequently than all other survey respondents.
保健从业人员一致认为,良好的口腔健康是总体健康和人口健康的关键指标。缺乏诸如蛀牙和非手术牙周护理等常见问题的强制性保险,往往不仅导致医疗问题,而且导致生产力损失。这一趋势在老年人和低收入家庭中尤为明显。本文讨论了我们对年度加拿大社区健康调查(CCHS)的探索结果,以进一步研究社会经济因素与口腔和牙齿健康之间的相互作用。为此,我们展示了将许多机器学习算法应用于CCHS数据集市时的结果。我们的研究结果重申,个人的收入水平和来源是每年看牙医次数的有力指标。此外,我们发现,通常生活在大家庭中的年轻人和年轻人看牙医的频率低于所有其他调查对象。
{"title":"An Exploratory Study of Oral and Dental Health in Canada","authors":"Andrei Belcin, Sean L. A. Floyd, A. Asiri, H. Viktor","doi":"10.1109/ICMLA.2017.00010","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00010","url":null,"abstract":"Healthcare practitioners agree that good oral health is a critical indicator of general health and wellness of a population. The lack of access to mandatory coverage for common issues such as cavities and non-surgical periodontal care often lead not only to medical problems, but also to loss of productivity. This trend is especially evident for older individuals and lower-income families. This paper discusses the results of our exploration of the annual Canadian Community Health Survey (CCHS), in order to further study the interplay between socio-economic factors and oral and dental health. To this end, we present the results when applying a number of machine learning algorithms to a CCHS data mart. Our results reaffirm that individuals' levels and sources of income are strong indicators of the number of dental visits per year. In addition, we found that younger adults and youth, who usually live in larger households, visit the dentist less frequently than all other survey respondents.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"24 1","pages":"1114-1119"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82551351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Empirical Study of Cross-Lingual Transfer Learning Techniques for Small-Footprint Keyword Spotting 跨语言迁移学习技术在小足迹关键词识别中的实证研究
Ming Sun, A. Schwarz, Minhua Wu, N. Strom, S. Matsoukas, S. Vitaladevuni
This paper presents our work on building a small-footprint keyword spotting system for a resource-limited language, which requires low CPU, memory and latency. Our keyword spotting system consists of deep neural network (DNN) and hidden Markov model (HMM), which is a hybrid DNN-HMM decoder. We investigate different transfer learning techniques to leverage knowledge and data from a resource-abundant source language to improve the keyword DNN training for a target language which has limited in-domain data. The approaches employed in this paper include training a DNN using source language data to initialize the target language DNN training, mixing data from source and target languages together in a multi-task DNN training setup, using logits computed from a DNN trained on the source language data to regularize the keyword DNN training in the target language, as well as combinations of these techniques. Given different amounts of target language training data, our experimental results show that these transfer learning techniques successfully improve keyword spotting performance for the target language, measured by the area under the curve (AUC) of DNN-HMM decoding detection error tradeoff (DET) curves using a large in-house far-field test set.
本文介绍了我们为资源有限的语言构建一个小占用的关键字定位系统的工作,该系统需要低CPU,低内存和低延迟。我们的关键词识别系统由深度神经网络(DNN)和隐马尔可夫模型(HMM)组成,隐马尔可夫模型是一种混合的DNN-HMM解码器。我们研究了不同的迁移学习技术,以利用资源丰富的源语言的知识和数据来改进域内数据有限的目标语言的关键字DNN训练。本文采用的方法包括使用源语言数据训练DNN来初始化目标语言DNN训练,在多任务DNN训练设置中将源语言和目标语言的数据混合在一起,使用从源语言数据训练的DNN计算的logits来正则化目标语言中的关键字DNN训练,以及这些技术的组合。给定不同数量的目标语言训练数据,我们的实验结果表明,这些迁移学习技术成功地提高了目标语言的关键字识别性能,通过使用大型内部远场测试集的DNN-HMM解码检测误差权衡(DET)曲线下面积(AUC)来衡量。
{"title":"An Empirical Study of Cross-Lingual Transfer Learning Techniques for Small-Footprint Keyword Spotting","authors":"Ming Sun, A. Schwarz, Minhua Wu, N. Strom, S. Matsoukas, S. Vitaladevuni","doi":"10.1109/ICMLA.2017.0-150","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-150","url":null,"abstract":"This paper presents our work on building a small-footprint keyword spotting system for a resource-limited language, which requires low CPU, memory and latency. Our keyword spotting system consists of deep neural network (DNN) and hidden Markov model (HMM), which is a hybrid DNN-HMM decoder. We investigate different transfer learning techniques to leverage knowledge and data from a resource-abundant source language to improve the keyword DNN training for a target language which has limited in-domain data. The approaches employed in this paper include training a DNN using source language data to initialize the target language DNN training, mixing data from source and target languages together in a multi-task DNN training setup, using logits computed from a DNN trained on the source language data to regularize the keyword DNN training in the target language, as well as combinations of these techniques. Given different amounts of target language training data, our experimental results show that these transfer learning techniques successfully improve keyword spotting performance for the target language, measured by the area under the curve (AUC) of DNN-HMM decoding detection error tradeoff (DET) curves using a large in-house far-field test set.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"22 1","pages":"255-260"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84316817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Learning Long-Term Situation Prediction for Automated Driving 学习自动驾驶的长期情况预测
S. Hörmann, Martin Bach, K. Dietmayer
A major challenge in autonomous driving is the prediction of complex downtown scenarios with mutiple road users. This contribution tackles this challenge by combining,,,,,,,, a Bayesian filtering technique for environment representation and machine learning as long-term predictor. Therefore, a dynamic occupancy grid map representing the static and dynamic environment around the ego-vehicle is utilized as input to a deep convolutional neural network. This yields the advantage of using data from a single timestamp for prediction, rather than an entire time series. Furthermore, convolutional neural networks have the inherent characteristic of using context information, enabling the implicit modeling of road user interaction. One of the major advantages is the unsupervised learning character due to fully automatic label generation. The presented algorithm is trained and evaluated on multiple hours of recorded sensor data containing multiple road users, e.g., pedestrians, bikes and vehicles.
自动驾驶面临的一个主要挑战是预测有多个道路使用者的复杂市中心场景。该贡献通过将,,,,,,,,(用于环境表示的贝叶斯过滤技术)和作为长期预测器的机器学习相结合,解决了这一挑战。因此,利用代表自我车辆周围静态和动态环境的动态占用网格图作为深度卷积神经网络的输入。这样做的好处是可以使用来自单个时间戳的数据进行预测,而不是整个时间序列。此外,卷积神经网络具有使用上下文信息的固有特性,可以对道路用户交互进行隐式建模。其中一个主要优点是由于全自动标签生成而具有无监督学习特性。所提出的算法是在包含多个道路使用者(如行人、自行车和车辆)的多个小时记录的传感器数据上进行训练和评估的。
{"title":"Learning Long-Term Situation Prediction for Automated Driving","authors":"S. Hörmann, Martin Bach, K. Dietmayer","doi":"10.1109/ICMLA.2017.00-21","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-21","url":null,"abstract":"A major challenge in autonomous driving is the prediction of complex downtown scenarios with mutiple road users. This contribution tackles this challenge by combining,,,,,,,, a Bayesian filtering technique for environment representation and machine learning as long-term predictor. Therefore, a dynamic occupancy grid map representing the static and dynamic environment around the ego-vehicle is utilized as input to a deep convolutional neural network. This yields the advantage of using data from a single timestamp for prediction, rather than an entire time series. Furthermore, convolutional neural networks have the inherent characteristic of using context information, enabling the implicit modeling of road user interaction. One of the major advantages is the unsupervised learning character due to fully automatic label generation. The presented algorithm is trained and evaluated on multiple hours of recorded sensor data containing multiple road users, e.g., pedestrians, bikes and vehicles.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"103 1","pages":"1000-1005"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80297256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1