首页 > 最新文献

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)最新文献

英文 中文
Hypergraph Link Prediction: Learning Drug Interaction Networks Embeddings 超图链接预测:学习药物相互作用网络嵌入
M. Vaida, Kevin Purcell
Graph neural networks (GNNs) have revolutionized deep learning on non-Euclidean data domains, and are extensively used in fields such as social media and recommendation systems. However, complex relational data structures such as hypergraphs, pose challenges for GNNs in terms of their ability to model, embed, and learn relational complexities of multigraphs. Most GNNs focus on capturing flat local neighborhoods of a node thus failing to account for structural properties of multi-relational graphs. This paper introduces Hypergraph Link Prediction (HLP), a novel approach of encoding the multilink structure of graphs. HLP allows pooling operations to incorporate a 360 degrees overview of a node interaction profile, by learning local neighborhood and global hypergraph structure simultaneously. Global graph information is injected into node representations, such that unique global structural patterns of every node are encoded at the node level. HLP leverages the augmented hypergraph adjacency matrix to incorporate the depth of the hypergraph in the convolutional layers. The model is applied to the task of predicting multi-drug interactions, by modeling relations between pairs of drugs as a hypergraph. The existence and the type of drug interactions between the same pair of drugs are mapped as multiple edges, and can be inferred by learning the multigraph local and global structure concurrently. To account for molecular graph properties of a drug, additional drug chemical graph structural fingerprints are included as node attributes.
图神经网络(gnn)已经彻底改变了非欧几里得数据领域的深度学习,并被广泛应用于社交媒体和推荐系统等领域。然而,复杂的关系数据结构(如超图)在建模、嵌入和学习多图关系复杂性的能力方面给gnn带来了挑战。大多数gnn专注于捕获节点的平面局部邻域,因此无法考虑多关系图的结构属性。介绍了一种对图的多链接结构进行编码的新方法——超图链接预测(Hypergraph Link Prediction, HLP)。通过同时学习局部邻域和全局超图结构,HLP允许池化操作合并节点交互配置文件的360度概述。全局图形信息被注入到节点表示中,这样每个节点的唯一全局结构模式就在节点级别被编码。HLP利用增广的超图邻接矩阵将超图的深度合并到卷积层中。该模型通过将药物对之间的关系建模为超图,应用于预测多种药物相互作用的任务。同一对药物之间相互作用的存在和类型被映射为多个边,并可以通过同时学习多图局部和全局结构来推断。为了说明药物的分子图属性,额外的药物化学图结构指纹被包括作为节点属性。
{"title":"Hypergraph Link Prediction: Learning Drug Interaction Networks Embeddings","authors":"M. Vaida, Kevin Purcell","doi":"10.1109/ICMLA.2019.00299","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00299","url":null,"abstract":"Graph neural networks (GNNs) have revolutionized deep learning on non-Euclidean data domains, and are extensively used in fields such as social media and recommendation systems. However, complex relational data structures such as hypergraphs, pose challenges for GNNs in terms of their ability to model, embed, and learn relational complexities of multigraphs. Most GNNs focus on capturing flat local neighborhoods of a node thus failing to account for structural properties of multi-relational graphs. This paper introduces Hypergraph Link Prediction (HLP), a novel approach of encoding the multilink structure of graphs. HLP allows pooling operations to incorporate a 360 degrees overview of a node interaction profile, by learning local neighborhood and global hypergraph structure simultaneously. Global graph information is injected into node representations, such that unique global structural patterns of every node are encoded at the node level. HLP leverages the augmented hypergraph adjacency matrix to incorporate the depth of the hypergraph in the convolutional layers. The model is applied to the task of predicting multi-drug interactions, by modeling relations between pairs of drugs as a hypergraph. The existence and the type of drug interactions between the same pair of drugs are mapped as multiple edges, and can be inferred by learning the multigraph local and global structure concurrently. To account for molecular graph properties of a drug, additional drug chemical graph structural fingerprints are included as node attributes.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132927363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Stochastic Coordinate Descent for 01 Loss and Its Sensitivity to Adversarial Attacks 01损失的随机坐标下降及其对抗性攻击的敏感性
Meiyan Xie, Yunzhe Xue, Usman Roshan
The 01 loss while hard to optimize is least sensitive to outliers compared to its continuous differentiable counterparts, namely hinge and logistic loss. Recently the 01 loss has been shown to be most robust compared to surrogate losses against corrupted labels which can be interpreted as adversarial attacks. Here we propose a stochastic coordinate descent heuristic for linear 01 loss classification. We implement and study our heuristic on real datasets from the UCI machine learning archive and find our method to be comparable to the support vector machine in accuracy and tractable in training time. We conjecture that the 01 loss may be harder to attack in a black box setting due to its non-continuity and infinite solution space. We train our linear classifier in a one-vs-one multi-class strategy on CIFAR10 and STL10 image benchmark datasets. In both cases we find our classifier to have the same accuracy as the linear support vector machine but more resilient to black box attacks. On CIFAR10 the linear support vector machine has 0% on adversarial examples while the 01 loss classifier hovers about 10%. On STL10 the linear support vector machine has 0% accuracy whereas 01 loss is at 10%. Our work here suggests that 01 loss may be more resilient to adversarial attacks than the hinge loss and further work is required.
01损失虽然难以优化,但与连续可微损失相比,它对异常值最不敏感,即铰链损失和逻辑损失。最近,与可解释为对抗性攻击的损坏标签的代理损失相比,01损失已被证明是最稳健的。本文提出了一种线性01损失分类的随机坐标下降启发式算法。我们在UCI机器学习档案的真实数据集上实现和研究了我们的启发式方法,发现我们的方法在准确性和训练时间上与支持向量机相当。我们推测,由于01损失的非连续性和无限的解空间,在黑箱设置中可能更难攻击。我们在CIFAR10和STL10图像基准数据集上以一对一的多类策略训练我们的线性分类器。在这两种情况下,我们发现我们的分类器具有与线性支持向量机相同的精度,但对黑盒攻击更具弹性。在CIFAR10上,线性支持向量机对对抗样本的准确率为0%,而01损失分类器在10%左右徘徊。在STL10上,线性支持向量机的准确率为0%,而01的损失为10%。我们在这里的工作表明,01损失可能比铰链损失对对抗性攻击更有弹性,需要进一步的工作。
{"title":"Stochastic Coordinate Descent for 01 Loss and Its Sensitivity to Adversarial Attacks","authors":"Meiyan Xie, Yunzhe Xue, Usman Roshan","doi":"10.1109/ICMLA.2019.00056","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00056","url":null,"abstract":"The 01 loss while hard to optimize is least sensitive to outliers compared to its continuous differentiable counterparts, namely hinge and logistic loss. Recently the 01 loss has been shown to be most robust compared to surrogate losses against corrupted labels which can be interpreted as adversarial attacks. Here we propose a stochastic coordinate descent heuristic for linear 01 loss classification. We implement and study our heuristic on real datasets from the UCI machine learning archive and find our method to be comparable to the support vector machine in accuracy and tractable in training time. We conjecture that the 01 loss may be harder to attack in a black box setting due to its non-continuity and infinite solution space. We train our linear classifier in a one-vs-one multi-class strategy on CIFAR10 and STL10 image benchmark datasets. In both cases we find our classifier to have the same accuracy as the linear support vector machine but more resilient to black box attacks. On CIFAR10 the linear support vector machine has 0% on adversarial examples while the 01 loss classifier hovers about 10%. On STL10 the linear support vector machine has 0% accuracy whereas 01 loss is at 10%. Our work here suggests that 01 loss may be more resilient to adversarial attacks than the hinge loss and further work is required.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117020413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Deep Learning and Thresholding with Class-Imbalanced Big Data 类不平衡大数据下的深度学习与阈值
Justin M. Johnson, T. Khoshgoftaar
Class imbalance is a regularly occurring problem in machine learning that has been studied extensively over the last two decades. Various methods for addressing class imbalance have been introduced, including algorithm-level methods, datalevel methods, and hybrid methods. While these methods are well studied using traditional machine learning algorithms, there are relatively few studies that explore their application to deep neural networks. Thresholding, in particular, is rarely discussed in the deep learning with class imbalance literature. This paper addresses this gap by conducting a systematic study on the application of thresholding with deep neural networks using a Big Data Medicare fraud data set. We use random oversampling (ROS), random under-sampling (RUS), and a hybrid ROS-RUS to create 15 training distributions with varying levels of class imbalance. With the fraudulent class size ranging from 0.03%–60%, we identify optimal classification thresholds for each distribution on random validation sets and then score the thresholds on a 20% holdout test set. Through repetition and statistical analysis, confidence intervals show that the default threshold is never optimal when training data is imbalanced. Results also show that the optimal threshold outperforms the default threshold in nearly all cases, and linear models indicate a strong linear relationship between the minority class size and the optimal decision threshold. To the best of our knowledge, this is the first study to provide statistical results that describe optimal classification thresholds for deep neural networks over a range of class distributions.
类不平衡是机器学习中经常出现的问题,在过去的二十年里得到了广泛的研究。介绍了解决类不平衡的各种方法,包括算法级方法、数据级方法和混合方法。虽然使用传统的机器学习算法对这些方法进行了很好的研究,但探索其在深度神经网络中的应用的研究相对较少。特别是阈值,在具有阶级不平衡的深度学习文献中很少被讨论。本文通过使用大数据医疗欺诈数据集对阈值与深度神经网络的应用进行系统研究,解决了这一差距。我们使用随机过采样(ROS)、随机欠采样(RUS)和混合ROS-RUS来创建15个具有不同等级不平衡的训练分布。在欺诈类大小范围为0.03%-60%的情况下,我们在随机验证集上为每个分布确定了最佳分类阈值,然后在20%的拒绝测试集上对阈值进行评分。通过重复和统计分析,置信区间表明,当训练数据不平衡时,默认阈值永远不是最优的。结果还表明,在几乎所有情况下,最优阈值都优于默认阈值,线性模型表明,少数类大小与最优决策阈值之间存在很强的线性关系。据我们所知,这是第一个提供统计结果来描述深度神经网络在一系列类别分布上的最佳分类阈值的研究。
{"title":"Deep Learning and Thresholding with Class-Imbalanced Big Data","authors":"Justin M. Johnson, T. Khoshgoftaar","doi":"10.1109/ICMLA.2019.00134","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00134","url":null,"abstract":"Class imbalance is a regularly occurring problem in machine learning that has been studied extensively over the last two decades. Various methods for addressing class imbalance have been introduced, including algorithm-level methods, datalevel methods, and hybrid methods. While these methods are well studied using traditional machine learning algorithms, there are relatively few studies that explore their application to deep neural networks. Thresholding, in particular, is rarely discussed in the deep learning with class imbalance literature. This paper addresses this gap by conducting a systematic study on the application of thresholding with deep neural networks using a Big Data Medicare fraud data set. We use random oversampling (ROS), random under-sampling (RUS), and a hybrid ROS-RUS to create 15 training distributions with varying levels of class imbalance. With the fraudulent class size ranging from 0.03%–60%, we identify optimal classification thresholds for each distribution on random validation sets and then score the thresholds on a 20% holdout test set. Through repetition and statistical analysis, confidence intervals show that the default threshold is never optimal when training data is imbalanced. Results also show that the optimal threshold outperforms the default threshold in nearly all cases, and linear models indicate a strong linear relationship between the minority class size and the optimal decision threshold. To the best of our knowledge, this is the first study to provide statistical results that describe optimal classification thresholds for deep neural networks over a range of class distributions.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114950196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Deep Ensemble Network for Quantification and Severity Assessment of Knee Osteoarthritis 用于膝骨关节炎量化和严重程度评估的深度集成网络
Mohammed Bany Muhammad, A. Moinuddin, M. Lee, Yanfei Zhang, V. Abedi, R. Zand, M. Yeasin
The assessment of knee joint gap and severity of Osteoarthritis (OA) is subjective and often inaccurate. The main source of error is due to the judgement of human expert from low resolution images (i.e., X-ray images). To address the problem, we developed an ensemble of Deep Learning (DL) model to objectively score the severity of OA only from the radiometric images. The proposed method consists of two main modules. First, we developed a scale invariant and aspect ratio preserving automatic localization and characterization of the kneecap area. Second, we developed multiple instances of "hyper parameter optimized" DL models and fused them using ensemble classification to score the severity of OA. In this implementation, we used three convolutional neural networks to improve the bias-variance trade-off, and boost accuracy and generalization. We tested our modeling framework using a collection of 4,796 X-ray images from Osteoarthritis Initiative (OAI). Our results show a higher performance (~ 2-8%) when compared to the state-of-the-art methods. Finally, this machine learning-based methodology provides a pipeline in decision support system for assessing and quantifying the OA severity.
评估膝关节间隙和骨关节炎(OA)的严重程度是主观的,往往不准确。误差的主要来源是人类专家对低分辨率图像(即x射线图像)的判断。为了解决这个问题,我们开发了一个深度学习(DL)集成模型,仅从放射图像中客观地评分OA的严重程度。该方法包括两个主要模块。首先,我们开发了一种保留尺度不变量和纵横比的膝盖骨区域自动定位和表征方法。其次,我们开发了多个“超参数优化”深度学习模型实例,并使用集成分类对它们进行融合,对OA的严重程度进行评分。在这个实现中,我们使用了三个卷积神经网络来改善偏差-方差权衡,并提高准确性和泛化。我们使用骨关节炎倡议(OAI)收集的4,796张x射线图像来测试我们的建模框架。与最先进的方法相比,我们的结果显示出更高的性能(~ 2-8%)。最后,基于机器学习的方法为OA严重程度的评估和量化提供了决策支持系统的管道。
{"title":"Deep Ensemble Network for Quantification and Severity Assessment of Knee Osteoarthritis","authors":"Mohammed Bany Muhammad, A. Moinuddin, M. Lee, Yanfei Zhang, V. Abedi, R. Zand, M. Yeasin","doi":"10.1109/ICMLA.2019.00163","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00163","url":null,"abstract":"The assessment of knee joint gap and severity of Osteoarthritis (OA) is subjective and often inaccurate. The main source of error is due to the judgement of human expert from low resolution images (i.e., X-ray images). To address the problem, we developed an ensemble of Deep Learning (DL) model to objectively score the severity of OA only from the radiometric images. The proposed method consists of two main modules. First, we developed a scale invariant and aspect ratio preserving automatic localization and characterization of the kneecap area. Second, we developed multiple instances of \"hyper parameter optimized\" DL models and fused them using ensemble classification to score the severity of OA. In this implementation, we used three convolutional neural networks to improve the bias-variance trade-off, and boost accuracy and generalization. We tested our modeling framework using a collection of 4,796 X-ray images from Osteoarthritis Initiative (OAI). Our results show a higher performance (~ 2-8%) when compared to the state-of-the-art methods. Finally, this machine learning-based methodology provides a pipeline in decision support system for assessing and quantifying the OA severity.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115078464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A Deep Learning Approach to Distributed Anomaly Detection for Edge Computing 边缘计算分布式异常检测的深度学习方法
Okwudili M. Ezeme, Q. Mahmoud, Akramul Azim
One of the multiplier effects of the boom in mobile technologies ranging from cell phones to computers and wearables like smart watches is that every public and private common spaces are now dotted with Wi-Fi hotspots. These hotspots provide the convenience of accessing the internet on-the-go for either play or work. Also, with the increased automation of our daily routines by our mobile devices via a multitude of applications, our vulnerability to cyber fraud or attacks becomes higher too. Hence, the need for heightened security that is capable of detecting anomalies on-the-fly. However, these edge devices connected to the local area network come with diverse capabilities with varying degrees of limitations in compute and energy resources. Therefore, running a process-based anomaly detector is not given a high priority in these devices because; a) the primary functions of the applications running on the devices is not security; therefore, the device allocates much of its resources into satisfying the primary duty of the applications. b) the volume and velocity of the data are high. Therefore, in this paper, we introduce a multi-node (nodes and devices are used interchangeably in the paper) ad-hoc network that uses a novel offloading scheme to bring an online anomaly detection capability on the kernel events to the nodes in the network. We test the framework in a Wi-Fi-based ad-hoc network made up of several devices, and the results confirm our hypothesis that the scheme can reduce latency and increase the throughput of the anomaly detector, thereby making online anomaly detection in the edge possible without sacrificing the accuracy of the deep recurrent neural network.
从手机到电脑,再到智能手表等可穿戴设备,移动技术的蓬勃发展带来的一个乘数效应是,每一个公共和私人公共空间现在都点缀着Wi-Fi热点。这些热点提供了方便的上网,无论是玩还是工作。此外,随着我们的移动设备通过大量应用程序提高了日常生活的自动化程度,我们对网络欺诈或攻击的脆弱性也变得更高。因此,需要提高安全性,能够在飞行中检测异常情况。然而,这些连接到局域网的边缘设备具有不同的功能,在计算和能源资源方面有不同程度的限制。因此,在这些设备中,运行基于进程的异常检测器没有被赋予高优先级,因为;A)设备上运行的应用程序的主要功能不是安全;因此,设备将其大部分资源分配给满足应用程序的主要任务。B)数据的量和速度都很高。因此,在本文中,我们引入了一个多节点(节点和设备在本文中互换使用)自组织网络,该网络使用一种新颖的卸载方案,为网络中的节点提供对内核事件的在线异常检测能力。我们在由多个设备组成的基于wi - fi的ad-hoc网络中对该框架进行了测试,结果证实了我们的假设,即该方案可以减少延迟并增加异常检测器的吞吐量,从而在不牺牲深度递归神经网络准确性的情况下实现边缘的在线异常检测。
{"title":"A Deep Learning Approach to Distributed Anomaly Detection for Edge Computing","authors":"Okwudili M. Ezeme, Q. Mahmoud, Akramul Azim","doi":"10.1109/ICMLA.2019.00169","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00169","url":null,"abstract":"One of the multiplier effects of the boom in mobile technologies ranging from cell phones to computers and wearables like smart watches is that every public and private common spaces are now dotted with Wi-Fi hotspots. These hotspots provide the convenience of accessing the internet on-the-go for either play or work. Also, with the increased automation of our daily routines by our mobile devices via a multitude of applications, our vulnerability to cyber fraud or attacks becomes higher too. Hence, the need for heightened security that is capable of detecting anomalies on-the-fly. However, these edge devices connected to the local area network come with diverse capabilities with varying degrees of limitations in compute and energy resources. Therefore, running a process-based anomaly detector is not given a high priority in these devices because; a) the primary functions of the applications running on the devices is not security; therefore, the device allocates much of its resources into satisfying the primary duty of the applications. b) the volume and velocity of the data are high. Therefore, in this paper, we introduce a multi-node (nodes and devices are used interchangeably in the paper) ad-hoc network that uses a novel offloading scheme to bring an online anomaly detection capability on the kernel events to the nodes in the network. We test the framework in a Wi-Fi-based ad-hoc network made up of several devices, and the results confirm our hypothesis that the scheme can reduce latency and increase the throughput of the anomaly detector, thereby making online anomaly detection in the edge possible without sacrificing the accuracy of the deep recurrent neural network.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115715692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
IoT Environmental Analyzer using Sensors and Machine Learning for Migraine Occurrence Prevention 使用传感器和机器学习预防偏头痛的物联网环境分析仪
Rosemarie J. Day, H. Salehi, Mahsa Javadi
Everyday migraines are affecting more than one billion people worldwide. This headache disorder is classified as the sixth most disabling disease in the world. Migraines are just one chronic illness affected by environmental triggers due to changes that occur inside the home. Migraines share this characteristic with sinus headaches and thus are often misdiagnosed. In this research work, an iOS-based environmental analyzer was designed, implemented and evaluated for migraine sufferers with the use of sensors. After the data collection and cleaning, five machine learning model were used to estimate prediction accuracy of migraines in terms of the environment. The data was evaluated against the models using K-Fold cross validation. The algorithm accuracy comparison showed that Linear Discriminant Analysis (LDA) produced highest accuracy for the testing data at a mean of 0.938. Preliminary results demonstrate the feasibility of using machine learning algorithms to perform the automated recognition of migraine trigger areas in the environment.
日常偏头痛影响着全球超过10亿人。这种头痛疾病被列为世界上第六大致残疾病。偏头痛只是一种受环境因素影响的慢性疾病,由于家庭内部发生的变化。偏头痛与窦性头痛有相同的特点,因此常被误诊。在本研究中,设计了一种基于ios的环境分析仪,并利用传感器对偏头痛患者进行了环境分析仪的设计、实现和评估。在数据收集和清理后,使用5个机器学习模型来估计偏头痛在环境方面的预测精度。使用K-Fold交叉验证对模型进行数据评估。算法精度比较表明,线性判别分析(Linear Discriminant Analysis, LDA)对检测数据的准确率最高,均值为0.938。初步结果表明,使用机器学习算法对环境中的偏头痛触发区域进行自动识别是可行的。
{"title":"IoT Environmental Analyzer using Sensors and Machine Learning for Migraine Occurrence Prevention","authors":"Rosemarie J. Day, H. Salehi, Mahsa Javadi","doi":"10.1109/ICMLA.2019.00239","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00239","url":null,"abstract":"Everyday migraines are affecting more than one billion people worldwide. This headache disorder is classified as the sixth most disabling disease in the world. Migraines are just one chronic illness affected by environmental triggers due to changes that occur inside the home. Migraines share this characteristic with sinus headaches and thus are often misdiagnosed. In this research work, an iOS-based environmental analyzer was designed, implemented and evaluated for migraine sufferers with the use of sensors. After the data collection and cleaning, five machine learning model were used to estimate prediction accuracy of migraines in terms of the environment. The data was evaluated against the models using K-Fold cross validation. The algorithm accuracy comparison showed that Linear Discriminant Analysis (LDA) produced highest accuracy for the testing data at a mean of 0.938. Preliminary results demonstrate the feasibility of using machine learning algorithms to perform the automated recognition of migraine trigger areas in the environment.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121335751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Widened Learning of Index Tracking Portfolios 指数跟踪投资组合的拓宽学习
Iuliia Gavriushina, Oliver R. Sampson, M. Berthold, W. Pohlmeier, C. Borgelt
Index investing has an advantage over active investment strategies, because less frequent trading results in lower expenses, yielding higher long-term returns. Index tracking is a popular investment strategy that attempts to find a portfolio replicating the performance of a collection of investment vehicles. This paper considers index tracking from the perspective of solution space exploration. Three search space heuristics in combination with three portfolio tracking error methods are compared in order to select a tracking portfolio with returns that mimic a benchmark index. Experimental results conducted on real-world datasets show that Widening, a metaheuristic using diverse parallel search paths, finds superior solutions than those found by the reference heuristics. Presented here are the first results using Widening on time-series data.
指数投资比主动投资策略有优势,因为较少的交易导致较低的费用,产生较高的长期回报。指数跟踪是一种流行的投资策略,它试图找到一个能复制一系列投资工具表现的投资组合。本文从解空间探索的角度考虑索引跟踪。比较了三种搜索空间启发式方法与三种投资组合跟踪误差方法的结合,以选择具有模拟基准指数收益的跟踪投资组合。在实际数据集上进行的实验结果表明,使用多种并行搜索路径的元启发式算法“扩大”比参考启发式算法找到的解更优。这里展示的是在时间序列数据上使用扩展的第一个结果。
{"title":"Widened Learning of Index Tracking Portfolios","authors":"Iuliia Gavriushina, Oliver R. Sampson, M. Berthold, W. Pohlmeier, C. Borgelt","doi":"10.1109/ICMLA.2019.00291","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00291","url":null,"abstract":"Index investing has an advantage over active investment strategies, because less frequent trading results in lower expenses, yielding higher long-term returns. Index tracking is a popular investment strategy that attempts to find a portfolio replicating the performance of a collection of investment vehicles. This paper considers index tracking from the perspective of solution space exploration. Three search space heuristics in combination with three portfolio tracking error methods are compared in order to select a tracking portfolio with returns that mimic a benchmark index. Experimental results conducted on real-world datasets show that Widening, a metaheuristic using diverse parallel search paths, finds superior solutions than those found by the reference heuristics. Presented here are the first results using Widening on time-series data.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125235128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forcing Interpretability for Deep Neural Networks through Rule-Based Regularization 基于规则的正则化深度神经网络的强制可解释性
Nadia Burkart, Marco F. Huber, Phillip Faller
Remarkable progress in the field of machine learning strongly drives the research in many application domains. For some domains, it is mandatory that the output of machine learning algorithms needs to be interpretable. In this paper, we propose a rule-based regularization technique to enforce interpretability for neural networks (NN). For this purpose, we train a rule-based surrogate model simultaneously with the NN. From the surrogate, a metric quantifying its degree of explainability is derived and fed back to the training of the NN as a regularization term. We evaluate our model on four datasets and compare it to unregularized models as well as a decision tree (DT) based baseline. The rule-based regularization approach achieves interpretability and competitive accuracy.
机器学习领域的显著进步有力地推动了许多应用领域的研究。对于某些领域,机器学习算法的输出必须是可解释的。在本文中,我们提出了一种基于规则的正则化技术来增强神经网络的可解释性。为此,我们与神经网络同时训练基于规则的代理模型。从代理中,导出量化其可解释程度的度量,并作为正则化项反馈给神经网络的训练。我们在四个数据集上评估我们的模型,并将其与非正则化模型以及基于决策树(DT)的基线进行比较。基于规则的正则化方法实现了可解释性和竞争精度。
{"title":"Forcing Interpretability for Deep Neural Networks through Rule-Based Regularization","authors":"Nadia Burkart, Marco F. Huber, Phillip Faller","doi":"10.1109/ICMLA.2019.00126","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00126","url":null,"abstract":"Remarkable progress in the field of machine learning strongly drives the research in many application domains. For some domains, it is mandatory that the output of machine learning algorithms needs to be interpretable. In this paper, we propose a rule-based regularization technique to enforce interpretability for neural networks (NN). For this purpose, we train a rule-based surrogate model simultaneously with the NN. From the surrogate, a metric quantifying its degree of explainability is derived and fed back to the training of the NN as a regularization term. We evaluate our model on four datasets and compare it to unregularized models as well as a decision tree (DT) based baseline. The rule-based regularization approach achieves interpretability and competitive accuracy.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123988394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Deep Neuronal Based Classifiers for Wireless Multi-hop Network Mobility Models 基于深度神经元的无线多跳网络移动模型分类器
Daniel Gutiérrez, S. Toral
Mobility plays an important role in the performance of wireless multi-hop networks. Since communications are established in a multi-hop fashion, the mobility of nodes can cause a significant degradation of the performance. Therefore, the analysis of nodes' mobility is relevant to improve the performance of the applications implemented over wireless multi-hop networks. This work evaluates two neuronal network models, such as fully connected or multi-layer perceptron and 1D convolutional models, for the classification of up to four widely used mobility models for wireless multi-hop networks. Several architectures are evaluated and parametrized for both models. The results indicate a considerable better performance of an architecture with 1D convolutional layers. The test results show that the best convolutional 1D model is able to reach an accuracy level of 0.91, outperforming the best multi-layer perceptron model in 13,9 %.
移动性对无线多跳网络的性能起着至关重要的作用。由于通信是以多跳方式建立的,因此节点的移动性可能会导致性能的显著降低。因此,分析节点的移动性对于提高在无线多跳网络上实现的应用程序的性能具有重要意义。这项工作评估了两种神经网络模型,如全连接或多层感知器和1D卷积模型,用于多达四种广泛使用的无线多跳网络移动模型的分类。对两种模型的几种体系结构进行了评估和参数化。结果表明,具有一维卷积层的体系结构具有相当好的性能。测试结果表明,最佳卷积一维模型能够达到0.91的准确率水平,比最佳多层感知器模型高出13.9%。
{"title":"Deep Neuronal Based Classifiers for Wireless Multi-hop Network Mobility Models","authors":"Daniel Gutiérrez, S. Toral","doi":"10.1109/ICMLA.2019.00111","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00111","url":null,"abstract":"Mobility plays an important role in the performance of wireless multi-hop networks. Since communications are established in a multi-hop fashion, the mobility of nodes can cause a significant degradation of the performance. Therefore, the analysis of nodes' mobility is relevant to improve the performance of the applications implemented over wireless multi-hop networks. This work evaluates two neuronal network models, such as fully connected or multi-layer perceptron and 1D convolutional models, for the classification of up to four widely used mobility models for wireless multi-hop networks. Several architectures are evaluated and parametrized for both models. The results indicate a considerable better performance of an architecture with 1D convolutional layers. The test results show that the best convolutional 1D model is able to reach an accuracy level of 0.91, outperforming the best multi-layer perceptron model in 13,9 %.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122261454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Low-Bit Quantization and Quantization-Aware Training for Small-Footprint Keyword Spotting 小尺寸关键字识别的低比特量化和量化感知训练
Yuriy Mishchenko, Yusuf Goren, Ming Sun, Chris Beauchene, Spyros Matsoukas, Oleg Rybakov, S. Vitaladevuni
In this paper, we investigate novel quantization approaches to reduce memory and computational footprint of deep neural network (DNN) based keyword spotters (KWS). We propose a new method for KWS offline and online quantization, which we call dynamic quantization, where we quantize DNN weight matrices column-wise, using each column's exact individual min-max range, and the DNN layers' inputs and outputs are quantized for every input audio frame individually, using the exact min-max range of each input and output vector. We further apply a new quantization-aware training approach that allows us to incorporate quantization errors into KWS model during training. Together, these approaches allow us to significantly improve the performance of KWS in 4-bit and 8-bit quantized precision, achieving the end-to-end accuracy close to that of full precision models while reducing the models' on-device memory footprint by up to 80%.
在本文中,我们研究了新的量化方法来减少基于深度神经网络(DNN)的关键词定位器(KWS)的内存和计算足迹。我们提出了一种KWS离线和在线量化的新方法,我们称之为动态量化,其中我们按列量化DNN权重矩阵,使用每个列的精确单个最小-最大范围,并且DNN层的输入和输出分别量化每个输入音频帧,使用每个输入和输出向量的精确最小-最大范围。我们进一步应用了一种新的量化感知训练方法,该方法允许我们在训练期间将量化误差纳入KWS模型。总之,这些方法使我们能够显着提高KWS在4位和8位量化精度方面的性能,实现接近全精度模型的端到端精度,同时将模型的设备上内存占用减少高达80%。
{"title":"Low-Bit Quantization and Quantization-Aware Training for Small-Footprint Keyword Spotting","authors":"Yuriy Mishchenko, Yusuf Goren, Ming Sun, Chris Beauchene, Spyros Matsoukas, Oleg Rybakov, S. Vitaladevuni","doi":"10.1109/ICMLA.2019.00127","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00127","url":null,"abstract":"In this paper, we investigate novel quantization approaches to reduce memory and computational footprint of deep neural network (DNN) based keyword spotters (KWS). We propose a new method for KWS offline and online quantization, which we call dynamic quantization, where we quantize DNN weight matrices column-wise, using each column's exact individual min-max range, and the DNN layers' inputs and outputs are quantized for every input audio frame individually, using the exact min-max range of each input and output vector. We further apply a new quantization-aware training approach that allows us to incorporate quantization errors into KWS model during training. Together, these approaches allow us to significantly improve the performance of KWS in 4-bit and 8-bit quantized precision, achieving the end-to-end accuracy close to that of full precision models while reducing the models' on-device memory footprint by up to 80%.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128708618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
期刊
2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1