首页 > 最新文献

2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
Tree-Structured Curriculum Learning Based on Semantic Similarity of Text 基于文本语义相似度的树状结构课程学习
Sanggyu Han, Sung-Hyon Myaeng
Inspired by the notion of a curriculum that allows human learners to acquire knowledge from easy to difficult materials, curriculum learning (CL) has been applied to many areas including Natural Language Processing (NLP). Most previous CL methods in NLP learn texts according to their lengths. We posit, however, that learning semantically similar texts is more effective than simply relying on superficial easiness such as text lengths. As such, we propose a new CL method that considers semantic dissimilarity as the complexity measure and a tree-structured curriculum as the organization method. The proposed CL method shows better performance than previous CL methods on a sentiment analysis task in an experiment.
受课程概念的启发,课程学习允许人类学习者从简单到困难的材料中获取知识,课程学习(CL)已应用于许多领域,包括自然语言处理(NLP)。在NLP中,以前的大多数CL方法都是根据文本的长度来学习文本的。然而,我们假设,学习语义相似的文本比简单地依赖于表面的容易程度(如文本长度)更有效。因此,我们提出了一种以语义不相似度作为复杂性度量,以树状结构课程作为组织方法的CL方法。实验表明,该方法在情感分析任务上的表现优于以往的方法。
{"title":"Tree-Structured Curriculum Learning Based on Semantic Similarity of Text","authors":"Sanggyu Han, Sung-Hyon Myaeng","doi":"10.1109/ICMLA.2017.00-27","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-27","url":null,"abstract":"Inspired by the notion of a curriculum that allows human learners to acquire knowledge from easy to difficult materials, curriculum learning (CL) has been applied to many areas including Natural Language Processing (NLP). Most previous CL methods in NLP learn texts according to their lengths. We posit, however, that learning semantically similar texts is more effective than simply relying on superficial easiness such as text lengths. As such, we propose a new CL method that considers semantic dissimilarity as the complexity measure and a tree-structured curriculum as the organization method. The proposed CL method shows better performance than previous CL methods on a sentiment analysis task in an experiment.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"37 1","pages":"971-976"},"PeriodicalIF":0.0,"publicationDate":"2017-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90103826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Predicting Psychosis Using the Experience Sampling Method with Mobile Apps 使用手机应用程序的经验抽样方法预测精神病
D. Stamate, Andrea Katrinecz, W. Alghamdi, D. Ståhl, P. Delespaul, J. Os, S. Guloksuz
Smart phones have become ubiquitous in the recent years, which opened up a new opportunity for rediscovering the Experience Sampling Method (ESM) in a new efficient form using mobile apps, and provides great prospects to become a low cost and high impact mHealth tool for psychiatry practice. The method is used to collect longitudinal data of participants' daily life experiences, and is ideal to capture fluctuations in emotions (momentary mental states) as an early indicator for later mental health disorder. In this study ESM data of patients with psychosis and controls were used to examine emotion changes and identify patterns. This paper attempts to determine whether aggregated ESM data, in which statistical measures represent the distribution and dynamics of the original data, are able to distinguish patients from controls. Variable importance, recursive feature elimination and ReliefF methods were used for feature selection. Model training and tuning, and testing were performed in nested cross-validation, and were based on algorithms such as Random Forests, Support Vector Machines, Gaussian Processes, Logistic Regression and Neural Networks. ROC analysis was used to post-process these models. Stability of model performances was studied using Monte Carlo simulations. The results provide evidence that pattern in mood changes can be captured with the combination of techniques used. The best results were achieved by SVM with radial kernel, where the best model performed with 82% accuracy and 82% sensitivity.
近年来,智能手机变得无处不在,这为使用移动应用程序以一种新的高效形式重新发现体验抽样方法(ESM)开辟了新的机会,并为精神病学实践提供了一个低成本和高影响力的移动健康工具。该方法用于收集参与者日常生活经历的纵向数据,非常适合捕捉情绪波动(瞬间精神状态),作为后期精神健康障碍的早期指标。在这项研究中,精神病患者和对照组的ESM数据被用来检查情绪变化和识别模式。本文试图确定汇总的ESM数据,其中统计度量代表原始数据的分布和动态,是否能够区分患者和对照组。采用变重要度、递归特征消除和ReliefF方法进行特征选择。模型训练、调优和测试在嵌套交叉验证中进行,并基于随机森林、支持向量机、高斯过程、逻辑回归和神经网络等算法。采用ROC分析对模型进行后处理。采用蒙特卡罗仿真方法研究了模型性能的稳定性。研究结果提供了证据,表明可以通过结合使用的技术来捕捉情绪变化的模式。采用径向核支持向量机(SVM)的结果最好,模型的准确率为82%,灵敏度为82%。
{"title":"Predicting Psychosis Using the Experience Sampling Method with Mobile Apps","authors":"D. Stamate, Andrea Katrinecz, W. Alghamdi, D. Ståhl, P. Delespaul, J. Os, S. Guloksuz","doi":"10.1109/ICMLA.2017.00-84","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-84","url":null,"abstract":"Smart phones have become ubiquitous in the recent years, which opened up a new opportunity for rediscovering the Experience Sampling Method (ESM) in a new efficient form using mobile apps, and provides great prospects to become a low cost and high impact mHealth tool for psychiatry practice. The method is used to collect longitudinal data of participants' daily life experiences, and is ideal to capture fluctuations in emotions (momentary mental states) as an early indicator for later mental health disorder. In this study ESM data of patients with psychosis and controls were used to examine emotion changes and identify patterns. This paper attempts to determine whether aggregated ESM data, in which statistical measures represent the distribution and dynamics of the original data, are able to distinguish patients from controls. Variable importance, recursive feature elimination and ReliefF methods were used for feature selection. Model training and tuning, and testing were performed in nested cross-validation, and were based on algorithms such as Random Forests, Support Vector Machines, Gaussian Processes, Logistic Regression and Neural Networks. ROC analysis was used to post-process these models. Stability of model performances was studied using Monte Carlo simulations. The results provide evidence that pattern in mood changes can be captured with the combination of techniques used. The best results were achieved by SVM with radial kernel, where the best model performed with 82% accuracy and 82% sensitivity.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"26 1","pages":"667-673"},"PeriodicalIF":0.0,"publicationDate":"2017-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81882553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Direct Multiclass Boosting Using Base Classifiers' Posterior Probabilities Estimates 基于基分类器后验概率估计的直接多类提升
M. Bourel, B. Ghattas
We present a new multiclass boosting algorithm called Adaboost.BG. Like the original Freund and Shapire's Adaboost algorithm, it aggregates trees but instead of using their misclassification error it takes into account the margins of the observations, which may be seen as confidence measures of their prediction, rather then their correctness. We prove the efficiency of our algorithm by simulation and compare it to similar approaches known to minimize the global margins of the final classifier.
提出了一种新的多类增强算法Adaboost.BG。像最初的Freund和Shapire的Adaboost算法一样,它聚合了树,但没有使用他们的误分类误差,而是考虑了观察结果的边缘,这可能被视为他们预测的置信度,而不是他们的正确性。我们通过仿真证明了算法的有效性,并将其与已知的最小化最终分类器全局边缘的类似方法进行了比较。
{"title":"Direct Multiclass Boosting Using Base Classifiers' Posterior Probabilities Estimates","authors":"M. Bourel, B. Ghattas","doi":"10.1109/ICMLA.2017.0-154","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-154","url":null,"abstract":"We present a new multiclass boosting algorithm called Adaboost.BG. Like the original Freund and Shapire's Adaboost algorithm, it aggregates trees but instead of using their misclassification error it takes into account the margins of the observations, which may be seen as confidence measures of their prediction, rather then their correctness. We prove the efficiency of our algorithm by simulation and compare it to similar approaches known to minimize the global margins of the final classifier.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"15 1","pages":"228-233"},"PeriodicalIF":0.0,"publicationDate":"2017-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73301143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human Action Recognition from Body-Part Directional Velocity Using Hidden Markov Models 基于隐马尔可夫模型的人体动作识别
Sid Ahmed Walid Talha, A. Fleury, S. Ambellouis
This paper introduces a novel approach for early recognition of human actions using 3D skeleton joints extracted from 3D depth data. We propose a novel, frame-by-frame and real-time descriptor called Body-part Directional Velocity (BDV) calculated by considering the algebraic velocity produced by different body-parts. A real-time Hidden Markov Models algorithm with Gaussian Mixture Models state-output distributions is used to carry out the classification. We show that our method outperforms various state-of-the-art skeleton-based human action recognition approaches on MSRAction3D and Florence3D datasets. We also proved the suitability of our approach for early human action recognition by deducing the decision from a partial analysis of the sequence.
本文介绍了一种利用三维深度数据提取的三维骨骼关节进行人体动作早期识别的新方法。我们提出了一种新颖的、逐帧实时描述符,称为身体部位定向速度(BDV),该描述符通过考虑不同身体部位产生的代数速度来计算。采用基于高斯混合模型状态输出分布的实时隐马尔可夫模型算法进行分类。我们的方法在MSRAction3D和Florence3D数据集上优于各种最先进的基于骨骼的人体动作识别方法。我们还通过从序列的部分分析中推断出决策,证明了我们的方法对早期人类行为识别的适用性。
{"title":"Human Action Recognition from Body-Part Directional Velocity Using Hidden Markov Models","authors":"Sid Ahmed Walid Talha, A. Fleury, S. Ambellouis","doi":"10.1109/ICMLA.2017.00-14","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-14","url":null,"abstract":"This paper introduces a novel approach for early recognition of human actions using 3D skeleton joints extracted from 3D depth data. We propose a novel, frame-by-frame and real-time descriptor called Body-part Directional Velocity (BDV) calculated by considering the algebraic velocity produced by different body-parts. A real-time Hidden Markov Models algorithm with Gaussian Mixture Models state-output distributions is used to carry out the classification. We show that our method outperforms various state-of-the-art skeleton-based human action recognition approaches on MSRAction3D and Florence3D datasets. We also proved the suitability of our approach for early human action recognition by deducing the decision from a partial analysis of the sequence.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"8 1","pages":"1035-1040"},"PeriodicalIF":0.0,"publicationDate":"2017-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84132570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Realistic Traffic Generation for Web Robots 现实的流量生成网络机器人
Kyle Brown, Derek Doran
Critical to evaluating the capacity, scalability, and availability of web systems are realistic web traffic generators. Web traffic generation is a classic research problem, no generator accounts for the characteristics of web robots or crawlers that are now the dominant source of traffic to a web server. Administrators are thus unable to test, stress, and evaluate how their systems perform in the face of ever increasing levels of web robot traffic. To resolve this problem, this paper introduces a novel approach to generate synthetic web robot traffic with high fidelity. It generates traffic that accounts for both the temporal and behavioral qualities of robot traffic by statistical and Bayesian models that are fitted to the properties of robot traffic seen in web logs from North America and Europe. We evaluate our traffic generator by comparing the characteristics of generated traffic to those of the original data. We look at session arrival rates, inter-arrival times and session lengths, comparing and contrasting them between generated and real traffic. Finally, we show that our generated traffic affects cache performance similarly to actual traffic, using the common LRU and LFU eviction policies.
评估网络系统的容量、可扩展性和可用性的关键是现实的网络流量生成器。网络流量生成是一个经典的研究问题,没有生成器能够解释网络机器人或爬虫的特征,而它们现在是网络服务器流量的主要来源。因此,管理员无法测试、强调和评估他们的系统在面对不断增加的网络机器人流量时的表现。为了解决这一问题,本文提出了一种生成高保真合成网络机器人流量的新方法。它通过统计和贝叶斯模型生成的流量同时考虑了机器人流量的时间和行为质量,这些模型与北美和欧洲的网络日志中看到的机器人流量属性相匹配。我们通过比较生成的流量与原始数据的特征来评估我们的流量生成器。我们着眼于会话到达率、间隔到达时间和会话长度,并在生成流量和真实流量之间进行比较和对比。最后,我们将展示,使用常见的LRU和LFU清除策略,生成的流量对缓存性能的影响与实际流量类似。
{"title":"Realistic Traffic Generation for Web Robots","authors":"Kyle Brown, Derek Doran","doi":"10.1109/ICMLA.2017.0-161","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-161","url":null,"abstract":"Critical to evaluating the capacity, scalability, and availability of web systems are realistic web traffic generators. Web traffic generation is a classic research problem, no generator accounts for the characteristics of web robots or crawlers that are now the dominant source of traffic to a web server. Administrators are thus unable to test, stress, and evaluate how their systems perform in the face of ever increasing levels of web robot traffic. To resolve this problem, this paper introduces a novel approach to generate synthetic web robot traffic with high fidelity. It generates traffic that accounts for both the temporal and behavioral qualities of robot traffic by statistical and Bayesian models that are fitted to the properties of robot traffic seen in web logs from North America and Europe. We evaluate our traffic generator by comparing the characteristics of generated traffic to those of the original data. We look at session arrival rates, inter-arrival times and session lengths, comparing and contrasting them between generated and real traffic. Finally, we show that our generated traffic affects cache performance similarly to actual traffic, using the common LRU and LFU eviction policies.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"341 1","pages":"178-185"},"PeriodicalIF":0.0,"publicationDate":"2017-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73754074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Hierarchical, Bulk-Synchronous Stochastic Gradient Descent Algorithm for Deep-Learning Applications on GPU Clusters 面向GPU集群深度学习应用的分层、批量同步随机梯度下降算法
Guojing Cong, Onkar Bhardwaj
The training data and models are becoming increasingly large in many deep-learning applications. Large-scale distributed processing is employed to accelerate training. Increasing the number of learners in synchronous and asynchronous stochastic gradient descent presents challenges to convergence and communication performance. We present our hierarchical, bulk-synchronous stochastic gradient algorithm that effectively balances execution time and accuracy for training in deep-learning applications on GPU clusters. It achieves much better convergence and execution time at scale in comparison to asynchronous stochastic gradient descent implementations. When deployed on a cluster of 128 GPUs, our implementation achieves up to 56 times speedups over the sequential stochastic gradient descent with similar test accuracy for our target application.
在许多深度学习应用中,训练数据和模型变得越来越大。采用大规模分布式处理,加快训练速度。同步和异步随机梯度下降中学习器数量的增加对收敛性和通信性能提出了挑战。我们提出了我们的分层、批量同步随机梯度算法,该算法有效地平衡了GPU集群上深度学习应用程序训练的执行时间和准确性。与异步随机梯度下降实现相比,它在规模上实现了更好的收敛性和执行时间。当部署在128个gpu的集群上时,我们的实现在顺序随机梯度下降的基础上实现了高达56倍的加速,并具有类似的目标应用程序测试精度。
{"title":"A Hierarchical, Bulk-Synchronous Stochastic Gradient Descent Algorithm for Deep-Learning Applications on GPU Clusters","authors":"Guojing Cong, Onkar Bhardwaj","doi":"10.1109/ICMLA.2017.00-56","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-56","url":null,"abstract":"The training data and models are becoming increasingly large in many deep-learning applications. Large-scale distributed processing is employed to accelerate training. Increasing the number of learners in synchronous and asynchronous stochastic gradient descent presents challenges to convergence and communication performance. We present our hierarchical, bulk-synchronous stochastic gradient algorithm that effectively balances execution time and accuracy for training in deep-learning applications on GPU clusters. It achieves much better convergence and execution time at scale in comparison to asynchronous stochastic gradient descent implementations. When deployed on a cluster of 128 GPUs, our implementation achieves up to 56 times speedups over the sequential stochastic gradient descent with similar test accuracy for our target application.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"30 1","pages":"818-821"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73810257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Advanced ECHMM-Based Machine Learning Tools for Complex Big Data Applications 用于复杂大数据应用的先进的基于echmm的机器学习工具
A. Cuzzocrea, E. Mumolo, G. Vercelli
We present a novel approach for accurate characterization of workloads, which is relevant in the context of complex big data applications.Workloads are generally described with statistical models and are based on the analysis of resource requests measurements of a running program. In this paper we propose to consider the sequence of virtual memory references generated from a program during its execution as a temporal series, and to use spectral analysis principles to process the sequence. However, the sequence is time-varying, so we employed processing approaches based on Ergodic Continuous Hidden Markov Models (ECHMMs) which extend conventional stationary spectral analysis approaches to the analysis of time-varying sequences. In this work, we describe two applications of the proposed approach: the on-line classification of a running process and the generation of synthetic traces of a given workload. The first step was to show that ECHMMs accurately describe virtual memory sequences; to this goal a different ECHMM was trained for each sequence and the related run-time average process classification accuracy, evaluated using trace driven simulations over a wide range of traces of SPEC2000, was about 82%. Then, a single ECHMM was trained using all the sequences obtained from a given running application; again, the classification accuracy has been evaluated using the same traces and it resulted about 76%. As regards the synthetic trace generation, a single ECHMM characterizing a given application has been used as a stochastic generator to produce benchmarks for spanning a large application space.
我们提出了一种准确表征工作负载的新方法,这与复杂的大数据应用相关。工作负载通常用统计模型来描述,并基于对正在运行的程序的资源请求度量的分析。在本文中,我们建议将程序在执行过程中产生的虚拟内存引用序列视为一个时间序列,并使用谱分析原理来处理该序列。然而,序列是时变的,因此我们采用了基于遍历连续隐马尔可夫模型(echmm)的处理方法,将传统的平稳谱分析方法扩展到时变序列的分析。在这项工作中,我们描述了所提出方法的两种应用:运行过程的在线分类和给定工作负载的合成轨迹的生成。第一步是证明echmm准确地描述了虚拟内存序列;为了实现这一目标,每个序列都训练了不同的ECHMM,相关的运行时平均过程分类精度(在SPEC2000的大范围轨迹上使用轨迹驱动模拟进行评估)约为82%。然后,使用从给定运行应用程序中获得的所有序列训练单个ECHMM;同样,使用相同的轨迹评估了分类精度,结果约为76%。至于合成跟踪生成,表征给定应用程序的单个ECHMM已被用作随机生成器,以生成跨越大型应用程序空间的基准。
{"title":"Advanced ECHMM-Based Machine Learning Tools for Complex Big Data Applications","authors":"A. Cuzzocrea, E. Mumolo, G. Vercelli","doi":"10.1109/ICMLA.2017.00-86","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-86","url":null,"abstract":"We present a novel approach for accurate characterization of workloads, which is relevant in the context of complex big data applications.Workloads are generally described with statistical models and are based on the analysis of resource requests measurements of a running program. In this paper we propose to consider the sequence of virtual memory references generated from a program during its execution as a temporal series, and to use spectral analysis principles to process the sequence. However, the sequence is time-varying, so we employed processing approaches based on Ergodic Continuous Hidden Markov Models (ECHMMs) which extend conventional stationary spectral analysis approaches to the analysis of time-varying sequences. In this work, we describe two applications of the proposed approach: the on-line classification of a running process and the generation of synthetic traces of a given workload. The first step was to show that ECHMMs accurately describe virtual memory sequences; to this goal a different ECHMM was trained for each sequence and the related run-time average process classification accuracy, evaluated using trace driven simulations over a wide range of traces of SPEC2000, was about 82%. Then, a single ECHMM was trained using all the sequences obtained from a given running application; again, the classification accuracy has been evaluated using the same traces and it resulted about 76%. As regards the synthetic trace generation, a single ECHMM characterizing a given application has been used as a stochastic generator to produce benchmarks for spanning a large application space.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"38 1","pages":"655-660"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75660425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-Sensitive Behavior Prediction in a Health Social Network 健康社会网络中的时间敏感行为预测
Amnay Amimeur, Nhathai Phan, D. Dou, David Kil, B. Piniewski
Human behavior prediction is critical in understanding and addressing large scale health and social issues in online communities. Specifically, predicting when in the future a user will engage in a behavior as opposed to whether a user will behave at a particular time is a less studied subproblem of behavior prediction. Further lacking is exploration of how social context affects personal behavior and the exploitation of network structure information in behavior and time prediction. To address these problems we propose a novel semi-supervised deep learning model for prediction of return time to personal behavior. A carefully designed objective function ensures the model learns good social context embeddings and historical behavior embeddings in order to capture the effects of social influence on personal behavior. Our model is validated on a unique health social network dataset by predicting when users will engage in physical activities. We show our model outperforms relevant time prediction baselines.
人类行为预测对于理解和解决在线社区中的大规模健康和社会问题至关重要。具体来说,预测用户在未来什么时候会做出某种行为,而不是用户在某个特定时间是否会做出某种行为,这是行为预测中一个研究较少的子问题。更缺乏的是对社会环境如何影响个人行为的探索,以及在行为和时间预测中利用网络结构信息。为了解决这些问题,我们提出了一种新的半监督深度学习模型来预测个人行为的回归时间。精心设计的目标函数确保模型学习良好的社会情境嵌入和历史行为嵌入,以捕捉社会影响对个人行为的影响。我们的模型在一个独特的健康社交网络数据集上通过预测用户什么时候会参加体育活动来验证。我们表明我们的模型优于相关的时间预测基线。
{"title":"Time-Sensitive Behavior Prediction in a Health Social Network","authors":"Amnay Amimeur, Nhathai Phan, D. Dou, David Kil, B. Piniewski","doi":"10.1109/ICMLA.2017.000-4","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.000-4","url":null,"abstract":"Human behavior prediction is critical in understanding and addressing large scale health and social issues in online communities. Specifically, predicting when in the future a user will engage in a behavior as opposed to whether a user will behave at a particular time is a less studied subproblem of behavior prediction. Further lacking is exploration of how social context affects personal behavior and the exploitation of network structure information in behavior and time prediction. To address these problems we propose a novel semi-supervised deep learning model for prediction of return time to personal behavior. A carefully designed objective function ensures the model learns good social context embeddings and historical behavior embeddings in order to capture the effects of social influence on personal behavior. Our model is validated on a unique health social network dataset by predicting when users will engage in physical activities. We show our model outperforms relevant time prediction baselines.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"31 1","pages":"1083-1088"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75751864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Anomaly Prediction Based on k-Means Clustering for Memory-Constrained Embedded Devices 基于k均值聚类的内存受限嵌入式设备异常预测
Yuto Kitagawa, Tasuku Ishigoka, Takuya Azumi
This paper proposes an anomaly prediction method based on k-means clustering that assumes embedded devices with memory constraints to predict control system anomalies. With this method, by checking control system behavior, it is possible to predict anomalies. However, continuing clustering is difficult because data accumulate in memory similar to existing k-means clustering method, which is problematic for embedded devices with low memory capacity. Therefore, we also propose k-means clustering to continue clustering for infinite stream data. The proposed k-means clustering method is based on online k-means clustering of sequential processing. The proposed k-means clustering method only stores data required for anomaly prediction and releases other data from memory. Experimental results show that anomalies can be predicted by k-means clustering, and the proposed method can predict anomalies similar to standard k-means clustering while reducing memory consumption. Moreover, the proposed k-means clustering demonstrates better results of anomaly prediction than existing online k-means clustering.
本文提出了一种基于k均值聚类的异常预测方法,该方法假设嵌入式设备具有内存约束来预测控制系统异常。用这种方法,通过检查控制系统的行为,可以预测异常。然而,持续聚类是困难的,因为数据像现有的k-means聚类方法一样在内存中积累,这对于内存容量小的嵌入式设备来说是有问题的。因此,我们也提出了k-means聚类来对无限流数据继续聚类。本文提出的k-means聚类方法是基于序列处理的在线k-means聚类。提出的k-means聚类方法只存储异常预测所需的数据,而将其他数据从内存中释放出来。实验结果表明,k-means聚类可以预测异常,并且该方法在减少内存消耗的同时可以预测与标准k-means聚类相似的异常。此外,所提出的k-means聚类比现有的在线k-means聚类具有更好的异常预测效果。
{"title":"Anomaly Prediction Based on k-Means Clustering for Memory-Constrained Embedded Devices","authors":"Yuto Kitagawa, Tasuku Ishigoka, Takuya Azumi","doi":"10.1109/ICMLA.2017.0-182","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-182","url":null,"abstract":"This paper proposes an anomaly prediction method based on k-means clustering that assumes embedded devices with memory constraints to predict control system anomalies. With this method, by checking control system behavior, it is possible to predict anomalies. However, continuing clustering is difficult because data accumulate in memory similar to existing k-means clustering method, which is problematic for embedded devices with low memory capacity. Therefore, we also propose k-means clustering to continue clustering for infinite stream data. The proposed k-means clustering method is based on online k-means clustering of sequential processing. The proposed k-means clustering method only stores data required for anomaly prediction and releases other data from memory. Experimental results show that anomalies can be predicted by k-means clustering, and the proposed method can predict anomalies similar to standard k-means clustering while reducing memory consumption. Moreover, the proposed k-means clustering demonstrates better results of anomaly prediction than existing online k-means clustering.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"22 1","pages":"26-33"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74774948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Exploring the Impact of Clone Refactoring on Test Code Size in Object-Oriented Software 探讨面向对象软件中克隆重构对测试代码大小的影响
M. Badri, L. Badri, Oussama Hachemane, Alexandre Ouellet
This paper aims at exploring the impact of clone refactoring on the test code size, in terms of number of operations, in object-oriented software. We investigated three research questions: (1) the impact of clone refactoring on three important source code attributes (coupling, complexity and size) that are related to unit testability of classes, (2) the impact of clone refactoring on the test code size, and (3) the variations after clone refactoring in the source code attributes that have the most important impact on the test code size. We used linear regression and three popular machine learning techniques (i.e., k-Nearest Neighbors, Naïve Bayes and Random Forest) to develop predictive and explanatory models. We used data collected from an open source Java software system (ANT) that has been refactored using clone-refactoring techniques. The analyses indicate that there is a strong and positive relationship between clone refactoring and the reduction of the test code size. Results show that: (1) the source code attributes of refactored classes have been significantly improved, (2) the test code size of refactored classes has been significantly reduced, and (3) the variations of the test code size are more influenced by the variations of the complexity and size of refactored classes compared to coupling.
本文旨在探讨在面向对象的软件中,克隆重构对测试代码大小(按操作次数)的影响。我们研究了三个问题:(1)克隆重构对与类的单元可测试性相关的三个重要源代码属性(耦合、复杂性和大小)的影响;(2)克隆重构对测试代码大小的影响;(3)克隆重构后对测试代码大小影响最大的源代码属性的变化。我们使用线性回归和三种流行的机器学习技术(即k近邻,Naïve贝叶斯和随机森林)来开发预测和解释模型。我们使用了从开源Java软件系统(ANT)收集的数据,该系统已经使用克隆重构技术进行了重构。分析表明,在克隆重构和减少测试代码大小之间存在着强烈的正相关关系。结果表明:(1)重构类的源代码属性得到了显著改善;(2)重构类的测试代码大小得到了显著减少;(3)与耦合相比,测试代码大小的变化更受重构类复杂度和大小变化的影响。
{"title":"Exploring the Impact of Clone Refactoring on Test Code Size in Object-Oriented Software","authors":"M. Badri, L. Badri, Oussama Hachemane, Alexandre Ouellet","doi":"10.1109/ICMLA.2017.00098","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00098","url":null,"abstract":"This paper aims at exploring the impact of clone refactoring on the test code size, in terms of number of operations, in object-oriented software. We investigated three research questions: (1) the impact of clone refactoring on three important source code attributes (coupling, complexity and size) that are related to unit testability of classes, (2) the impact of clone refactoring on the test code size, and (3) the variations after clone refactoring in the source code attributes that have the most important impact on the test code size. We used linear regression and three popular machine learning techniques (i.e., k-Nearest Neighbors, Naïve Bayes and Random Forest) to develop predictive and explanatory models. We used data collected from an open source Java software system (ANT) that has been refactored using clone-refactoring techniques. The analyses indicate that there is a strong and positive relationship between clone refactoring and the reduction of the test code size. Results show that: (1) the source code attributes of refactored classes have been significantly improved, (2) the test code size of refactored classes has been significantly reduced, and (3) the variations of the test code size are more influenced by the variations of the complexity and size of refactored classes compared to coupling.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"63 1","pages":"586-592"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75030536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1