首页 > 最新文献

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
Robust Modeling of Continuous 4-D Affective Space from EEG Recording 基于脑电记录的连续四维情感空间鲁棒建模
Rakib Al-Fahad, M. Yeasin
The inherent intangible nature, complexity, context-specific interpretations of emotions make it difficult to quantify and model affective space. Dimensional theory is one of the effective methods to describe and model emotions. Despite recent advances in affective computing, modeling continuous affective space remains a challenge. Here, we present a computational framework to study the role of functional areas of brain and band frequencies in modeling 4-D continuous affective space (Valence, Arousal, Like and Dominance). In particular, we used Electroencephalogram (EEG) recordings and adopted a recursive feature elimination (RFE) approach to select band frequencies and electrode locations (functional areas) that are most relevant for predicting affective space. Empirical analyses on DEAP dataset [1] reveals that only a small number of locations (7-12) and certain band frequencies carry most discriminative information. Using the selected features, we modeled 4-D affective space using Support Vector Regression (SVR). Regression analysis show that Root Mean Square Error (RMSE) for Valence, Arousal, Dominance, Like are 1.40, 1.23, 1.24 and, 1.24, respectively. Besides SVR, the performance of feature fusion and ensemble classifiers were also compared to determine the robust model against technical noise and individual variations. It was observed that the prediction accuracy of the final model is up to 37% better than human judgment evaluated on same data set. Spillover effect of our approach may include design of task-specific (i.e., emotion, memory capacity) EEG headset with a minimal number of electrodes.
情感固有的无形性、复杂性和特定情境的解释使得情感空间难以量化和建模。维度理论是描述和模拟情绪的有效方法之一。尽管情感计算最近取得了进展,但对连续情感空间的建模仍然是一个挑战。在这里,我们提出了一个计算框架来研究脑功能区和频带频率在建模4-D连续情感空间(效价、唤醒、喜欢和支配)中的作用。特别是,我们使用脑电图(EEG)记录并采用递归特征消除(RFE)方法来选择与预测情感空间最相关的频带频率和电极位置(功能区)。对DEAP数据集的实证分析[1]表明,只有少数位置(7-12)和某些频带频率携带最多的判别信息。利用选择的特征,我们使用支持向量回归(SVR)对4-D情感空间进行建模。回归分析结果表明,效价、觉醒、优势、喜欢的均方根误差(RMSE)分别为1.40、1.23、1.24和1.24。除了SVR之外,还比较了特征融合和集成分类器的性能,以确定对技术噪声和个体变化的鲁棒模型。结果表明,在相同的数据集上,最终模型的预测精度比人工判断提高了37%。我们方法的溢出效应可能包括设计具有最小电极数量的特定任务(即情感,记忆容量)脑电图耳机。
{"title":"Robust Modeling of Continuous 4-D Affective Space from EEG Recording","authors":"Rakib Al-Fahad, M. Yeasin","doi":"10.1109/ICMLA.2016.0188","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0188","url":null,"abstract":"The inherent intangible nature, complexity, context-specific interpretations of emotions make it difficult to quantify and model affective space. Dimensional theory is one of the effective methods to describe and model emotions. Despite recent advances in affective computing, modeling continuous affective space remains a challenge. Here, we present a computational framework to study the role of functional areas of brain and band frequencies in modeling 4-D continuous affective space (Valence, Arousal, Like and Dominance). In particular, we used Electroencephalogram (EEG) recordings and adopted a recursive feature elimination (RFE) approach to select band frequencies and electrode locations (functional areas) that are most relevant for predicting affective space. Empirical analyses on DEAP dataset [1] reveals that only a small number of locations (7-12) and certain band frequencies carry most discriminative information. Using the selected features, we modeled 4-D affective space using Support Vector Regression (SVR). Regression analysis show that Root Mean Square Error (RMSE) for Valence, Arousal, Dominance, Like are 1.40, 1.23, 1.24 and, 1.24, respectively. Besides SVR, the performance of feature fusion and ensemble classifiers were also compared to determine the robust model against technical noise and individual variations. It was observed that the prediction accuracy of the final model is up to 37% better than human judgment evaluated on same data set. Spillover effect of our approach may include design of task-specific (i.e., emotion, memory capacity) EEG headset with a minimal number of electrodes.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"155 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122033613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
An Empirical Study on Machine Learning Models for Wind Power Predictions 风电预测机器学习模型的实证研究
Yiqian Liu, Huajie Zhang
Wind power prediction is of great importance in the utilization of renewable wind power. A lot of research has been done attempting to improve the accuracy of wind power predictions and has achieved desirable performance. However, there is no complete performance evaluation of machine learning methods. This paper presents an extensive empirical study of machine learning methods for wind power predictions. Nine various models are considered in this study which also includes the application and evaluation of deep learning techniques. The experimental data consists of seven datasets based on wind farms in Ontario, Canada. The results indicate that SVM, followed by ANN, has the best overall performance, and that k-NN method is suitable for longer ahead predictions. Despite the findings that deep learning fails to give improvement in basic predictions, it shows the potential for more abstract prediction tasks, such as spatial correlation predictions.
风电功率预测在可再生风电利用中具有重要意义。为了提高风电预测的准确性,人们进行了大量的研究,并取得了令人满意的效果。然而,目前还没有对机器学习方法进行完整的性能评估。本文对风力发电预测的机器学习方法进行了广泛的实证研究。本研究考虑了九种不同的模型,其中还包括深度学习技术的应用和评估。实验数据由基于加拿大安大略省风力发电场的七个数据集组成。结果表明,支持向量机的综合性能最好,其次是人工神经网络,k-NN方法适用于较长的提前预测。尽管研究发现深度学习在基本预测方面没有改善,但它显示了更抽象的预测任务的潜力,比如空间相关性预测。
{"title":"An Empirical Study on Machine Learning Models for Wind Power Predictions","authors":"Yiqian Liu, Huajie Zhang","doi":"10.1109/ICMLA.2016.0135","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0135","url":null,"abstract":"Wind power prediction is of great importance in the utilization of renewable wind power. A lot of research has been done attempting to improve the accuracy of wind power predictions and has achieved desirable performance. However, there is no complete performance evaluation of machine learning methods. This paper presents an extensive empirical study of machine learning methods for wind power predictions. Nine various models are considered in this study which also includes the application and evaluation of deep learning techniques. The experimental data consists of seven datasets based on wind farms in Ontario, Canada. The results indicate that SVM, followed by ANN, has the best overall performance, and that k-NN method is suitable for longer ahead predictions. Despite the findings that deep learning fails to give improvement in basic predictions, it shows the potential for more abstract prediction tasks, such as spatial correlation predictions.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"154 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123742975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Identifying Nontechnical Power Loss via Spatial and Temporal Deep Learning 通过时空深度学习识别非技术功率损失
Rajendra Rana Bhat, R. Trevizan, Rahul Sengupta, Xiaolin Li, A. Bretas
Fraud detection in electricity consumption is a major challenge for power distribution companies. While many pattern recognition techniques have been applied to identify electricity theft, they often require extensive handcrafted feature engineering. Instead, through deep layers of transformation, nonlinearity, and abstraction, Deep Learning (DL) automatically extracts key features from data. In this paper, we design spatial and temporal deep learning solutions to identify nontechnical power losses (NTL), including Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM) and Stacked Autoencoder. These models are evaluated in a modified IEEE 123-bus test feeder. For the same tests, we also conduct comparison experiments using three conventional machine learning approaches: Random Forest, Decision Trees and shallow Neural Networks. Experimental results demonstrate that the spatiotemporal deep learning approaches outperform conventional machine learning approaches.
电力消费中的欺诈检测是配电公司面临的主要挑战。虽然许多模式识别技术已被应用于识别电力盗窃,但它们通常需要大量的手工特征工程。相反,通过深层的转换、非线性和抽象,深度学习(DL)可以自动从数据中提取关键特征。在本文中,我们设计了空间和时间深度学习解决方案来识别非技术功率损耗(NTL),包括卷积神经网络(CNN),长短期记忆(LSTM)和堆叠自编码器。这些模型在改进的IEEE 123总线测试馈线中进行了评估。对于相同的测试,我们还使用三种传统的机器学习方法进行比较实验:随机森林,决策树和浅神经网络。实验结果表明,时空深度学习方法优于传统的机器学习方法。
{"title":"Identifying Nontechnical Power Loss via Spatial and Temporal Deep Learning","authors":"Rajendra Rana Bhat, R. Trevizan, Rahul Sengupta, Xiaolin Li, A. Bretas","doi":"10.1109/ICMLA.2016.0052","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0052","url":null,"abstract":"Fraud detection in electricity consumption is a major challenge for power distribution companies. While many pattern recognition techniques have been applied to identify electricity theft, they often require extensive handcrafted feature engineering. Instead, through deep layers of transformation, nonlinearity, and abstraction, Deep Learning (DL) automatically extracts key features from data. In this paper, we design spatial and temporal deep learning solutions to identify nontechnical power losses (NTL), including Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM) and Stacked Autoencoder. These models are evaluated in a modified IEEE 123-bus test feeder. For the same tests, we also conduct comparison experiments using three conventional machine learning approaches: Random Forest, Decision Trees and shallow Neural Networks. Experimental results demonstrate that the spatiotemporal deep learning approaches outperform conventional machine learning approaches.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124865226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 58
Inferring Hearing Loss from Learned Speech Kernels 从习得语言核推断听力损失
Bonny Banerjee, Masoumeh Heidari Kapourchali, S. Najnin, L. L. Mendel, Sungmin Lee, Chhayakanta Patro, Monique Pousson
Does a hearing-impaired individual's speech reflect his hearing loss, and if it does, can the nature of hearing loss be inferred from his speech? To investigate these questions, at least four hours of speech data were recorded from each of 37 adult individuals, both male and female, belonging to four classes: 7 normal, and 30 severely-to-profoundly hearing impaired with high, medium or low speech intelligibility. Acoustic kernels were learned for each individual by capturing the distribution of his speech data points represented as 20 ms duration windows. These kernels were evaluated using a set of neurophysiological metrics, namely, distribution of characteristic frequencies, equal loudness contour, bandwidth and Q10 value of tuning curve. Our experimental results reveal that a hearing-impaired individual's speech does reflect his hearing loss provided his loss of hearing has considerably affected the intelligibility of his speech. For such individuals, the lack of tuning in any frequency range can be inferred from his learned speech kernels.
听障人士的言语是否反映了他的听力损失?如果是,是否可以从他的言语推断出听力损失的性质?为了调查这些问题,研究人员对37名成年男性和女性进行了至少4小时的语音数据记录,这些人分别属于四个类别:7名正常听力障碍患者和30名重度到重度听力障碍患者,他们的语音清晰度分别为高、中、低。通过捕获每个人的语音数据点的分布来学习每个人的声学核,这些数据点表示为20 ms持续时间窗口。利用特征频率分布、等响度轮廓、带宽和调谐曲线Q10值等神经生理指标对这些核进行评价。我们的实验结果表明,听力受损个体的语言确实反映了他的听力损失,前提是他的听力损失在很大程度上影响了他的语言的可理解性。对于这样的人来说,在任何频率范围内缺乏调谐都可以从他学习的语音内核中推断出来。
{"title":"Inferring Hearing Loss from Learned Speech Kernels","authors":"Bonny Banerjee, Masoumeh Heidari Kapourchali, S. Najnin, L. L. Mendel, Sungmin Lee, Chhayakanta Patro, Monique Pousson","doi":"10.1109/ICMLA.2016.0014","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0014","url":null,"abstract":"Does a hearing-impaired individual's speech reflect his hearing loss, and if it does, can the nature of hearing loss be inferred from his speech? To investigate these questions, at least four hours of speech data were recorded from each of 37 adult individuals, both male and female, belonging to four classes: 7 normal, and 30 severely-to-profoundly hearing impaired with high, medium or low speech intelligibility. Acoustic kernels were learned for each individual by capturing the distribution of his speech data points represented as 20 ms duration windows. These kernels were evaluated using a set of neurophysiological metrics, namely, distribution of characteristic frequencies, equal loudness contour, bandwidth and Q10 value of tuning curve. Our experimental results reveal that a hearing-impaired individual's speech does reflect his hearing loss provided his loss of hearing has considerably affected the intelligibility of his speech. For such individuals, the lack of tuning in any frequency range can be inferred from his learned speech kernels.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124927270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cyberbullying Detection with a Pronunciation Based Convolutional Neural Network 基于语音卷积神经网络的网络欺凌检测
Xiang Zhang, Jonathan Tong, Nishant Vishwamitra, E. Whittaker, Joseph P. Mazer, Robin M. Kowalski, Hongxin Hu, Feng Luo, J. Macbeth, Edward C. Dillon
Cyberbullying can have a deep and long lasting impact on its victims, who are often adolescents. Accurately detecting cyberbullying helps prevent it. However, the noise and errors in social media posts and messages make detecting cyberbullying very challenging. In this paper, we propose a novel pronunciation based convolutional neural network (PCNN) to address this challenge. Upon observing that the pronunciation of misspelled words in informal online conversations is often unchanged, we used the phoneme codes of the text as the features for a convolutional neural network. This procedure corrects spelling errors that did not alter the pronunciation, thereby alleviating the problem of noise and bullying data sparsity. To overcome class imbalance, a common problem in cyberbullying datasets, we implement three techniques that include threshold-moving, cost function adjusting, and a hybrid solution in our model. We evaluate the performance of our models using two cyberbullying datasets collected from Twitter and Formspring.me. The results of our experiment show that PCNN can achieve improved recall and precision compared to baseline convolutional neural networks.
网络欺凌会对受害者(通常是青少年)产生深远而持久的影响。准确检测网络欺凌有助于预防它。然而,社交媒体帖子和信息中的噪音和错误使得检测网络欺凌非常具有挑战性。在本文中,我们提出了一种新颖的基于发音的卷积神经网络(PCNN)来解决这一挑战。在观察到非正式在线对话中拼写错误的单词的发音通常是不变的之后,我们使用文本的音素代码作为卷积神经网络的特征。这个过程纠正了没有改变发音的拼写错误,从而减轻了噪声和欺凌数据稀疏性的问题。为了克服网络欺凌数据集中常见的类失衡问题,我们在模型中实现了三种技术,包括阈值移动、成本函数调整和混合解决方案。我们使用从Twitter和Formspring.me收集的两个网络欺凌数据集来评估我们模型的性能。实验结果表明,与基线卷积神经网络相比,PCNN可以获得更高的查全率和查准率。
{"title":"Cyberbullying Detection with a Pronunciation Based Convolutional Neural Network","authors":"Xiang Zhang, Jonathan Tong, Nishant Vishwamitra, E. Whittaker, Joseph P. Mazer, Robin M. Kowalski, Hongxin Hu, Feng Luo, J. Macbeth, Edward C. Dillon","doi":"10.1109/ICMLA.2016.0132","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0132","url":null,"abstract":"Cyberbullying can have a deep and long lasting impact on its victims, who are often adolescents. Accurately detecting cyberbullying helps prevent it. However, the noise and errors in social media posts and messages make detecting cyberbullying very challenging. In this paper, we propose a novel pronunciation based convolutional neural network (PCNN) to address this challenge. Upon observing that the pronunciation of misspelled words in informal online conversations is often unchanged, we used the phoneme codes of the text as the features for a convolutional neural network. This procedure corrects spelling errors that did not alter the pronunciation, thereby alleviating the problem of noise and bullying data sparsity. To overcome class imbalance, a common problem in cyberbullying datasets, we implement three techniques that include threshold-moving, cost function adjusting, and a hybrid solution in our model. We evaluate the performance of our models using two cyberbullying datasets collected from Twitter and Formspring.me. The results of our experiment show that PCNN can achieve improved recall and precision compared to baseline convolutional neural networks.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129704547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 78
Short-Term Urban Rail Passenger Flow Forecasting: A Dynamic Bayesian Network Approach 城市轨道交通短期客流预测:动态贝叶斯网络方法
J. Roos, S. Bonnevay, G. Gavin
We propose a dynamic Bayesian network approach to forecast the short-term passenger flows of the urban rail network of Paris. This approach can deal with the incompleteness of the data caused by failures or lack of collection systems. The structure of the model is based on the causal relationships between the adjacent flows and is designed to take into account the transport service. To reduce the number of arcs and find the maximum likelihood estimate of the parameters, we perform the structural expectation-maximization (EM) algorithm. Then short-term forecasting is conducted by inference, using the bootstrap filter. An experiment is carried out on an entire metro line, using ticket validation, count and transport service data. Overall, the forecasting results outperform historical average and last observation carried forward (LOCF). They illustrate the potential of the approach, as well as the key role of the transport service.
本文提出了一种动态贝叶斯网络方法来预测巴黎城市轨道网络的短期客流。这种方法可以处理由于故障或缺乏收集系统而导致的数据不完整。该模型的结构基于相邻流之间的因果关系,并设计为考虑运输服务。为了减少弧的数量并找到参数的最大似然估计,我们使用了结构期望最大化(EM)算法。然后利用自举滤波进行短期预测。实验在整条地铁线路上进行,使用票务验证,计数和运输服务数据。总体而言,预测结果优于历史平均水平和最后观测结转(LOCF)。它们说明了这种方法的潜力,以及运输服务的关键作用。
{"title":"Short-Term Urban Rail Passenger Flow Forecasting: A Dynamic Bayesian Network Approach","authors":"J. Roos, S. Bonnevay, G. Gavin","doi":"10.1109/ICMLA.2016.0187","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0187","url":null,"abstract":"We propose a dynamic Bayesian network approach to forecast the short-term passenger flows of the urban rail network of Paris. This approach can deal with the incompleteness of the data caused by failures or lack of collection systems. The structure of the model is based on the causal relationships between the adjacent flows and is designed to take into account the transport service. To reduce the number of arcs and find the maximum likelihood estimate of the parameters, we perform the structural expectation-maximization (EM) algorithm. Then short-term forecasting is conducted by inference, using the bootstrap filter. An experiment is carried out on an entire metro line, using ticket validation, count and transport service data. Overall, the forecasting results outperform historical average and last observation carried forward (LOCF). They illustrate the potential of the approach, as well as the key role of the transport service.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128990961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Automatic Object Detection Using DBSCAN for Counting Intoxicated Flies in the FLORIDA Assay 利用DBSCAN自动目标检测计数佛罗里达实验中的中毒蝇
Christian Bodenstein, Markus Goetz, Annika Jansen, H. Scholz, M. Riedel
In this paper, we propose an instrumentation and computer vision pipeline that allows automatic object detection on images taken from multiple experimental set ups. We demonstrate the approach by autonomously counting intoxicated flies in the FLORIDA assay. The assay measures the effect of ethanol exposure onto the ability of a vinegar fly Drosophila melanogaster to right itself. The analysis consists of a three-step approach. First, obtaining an image of a large set of individual experiments, second, identify areas containing a single experiment, and third, discover the searched objects within the experiment. For the analysis we facilitate well-known computer vision and machine learning algorithms - namely color segmentation, threshold imaging and DBSCAN. The automation of the experiment enables an unprecedented reproducibility and consistency, while significantly decreasing the manual labor.
在本文中,我们提出了一种仪器和计算机视觉管道,可以对从多个实验装置拍摄的图像进行自动目标检测。我们通过在佛罗里达实验中自动计数中毒苍蝇来证明这种方法。该实验测量了乙醇暴露对黑腹果蝇自我纠正能力的影响。分析包括三个步骤。首先,获取大量单个实验的图像,其次,识别包含单个实验的区域,第三,发现实验内的搜索对象。为了进行分析,我们采用了众所周知的计算机视觉和机器学习算法-即颜色分割,阈值成像和DBSCAN。实验的自动化实现了前所未有的重复性和一致性,同时显著减少了手工劳动。
{"title":"Automatic Object Detection Using DBSCAN for Counting Intoxicated Flies in the FLORIDA Assay","authors":"Christian Bodenstein, Markus Goetz, Annika Jansen, H. Scholz, M. Riedel","doi":"10.1109/ICMLA.2016.0133","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0133","url":null,"abstract":"In this paper, we propose an instrumentation and computer vision pipeline that allows automatic object detection on images taken from multiple experimental set ups. We demonstrate the approach by autonomously counting intoxicated flies in the FLORIDA assay. The assay measures the effect of ethanol exposure onto the ability of a vinegar fly Drosophila melanogaster to right itself. The analysis consists of a three-step approach. First, obtaining an image of a large set of individual experiments, second, identify areas containing a single experiment, and third, discover the searched objects within the experiment. For the analysis we facilitate well-known computer vision and machine learning algorithms - namely color segmentation, threshold imaging and DBSCAN. The automation of the experiment enables an unprecedented reproducibility and consistency, while significantly decreasing the manual labor.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127603293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Privacy-Preserving Solution for the Bipartite Ranking Problem 二部排序问题的隐私保护解
N. Faramarzi, Erman Ayday, H. Altay Güvenir
In this paper, we propose an efficient solution for the privacy-preserving of a bipartite ranking algorithm. The bipartite ranking problem can be considered as finding a function that ranks positive instances (in a dataset) higher than the negative ones. However, one common concern for all the existing schemes is the privacy of individuals in the dataset. That is, one (e.g., a researcher) needs to access the records of all individuals in the dataset in order to run the algorithm. This privacy concern puts limitations on the use of sensitive personal data for such analysis. The RIMARC (Ranking Instances by Maximizing Area under the ROC Curve) algorithm solves the bipartite ranking problem by learning a model to rank instances. As part of the model, it learns weights for each feature by analyzing the area under receiver operating characteristic (ROC) curve. RIMARC algorithm is shown to be more accurate and efficient than its counterparts. Thus, we use this algorithm as a building-block and provide a privacy-preserving version of the RIMARC algorithm using homomorphic encryption and secure multi-party computation. Our proposed algorithm lets a data owner outsource the storage and processing of its encrypted dataset to a semi-trusted cloud. Then, a researcher can get the results of his/her queries (to learn the ranking function) on the dataset by interacting with the cloud. During this process, neither the researcher nor the cloud learns any information about the raw dataset. We prove the security of the proposed algorithm and show its efficiency via experiments on real data.
在本文中,我们提出了一个二部排序算法的隐私保护的有效解决方案。二部排序问题可以被认为是找到一个函数,该函数将数据集中的正实例排序高于负实例。然而,所有现有方案的一个共同问题是数据集中个人的隐私。也就是说,一个人(例如,研究人员)需要访问数据集中所有个人的记录才能运行算法。出于对隐私的考虑,对使用敏感个人资料进行此类分析施加了限制。RIMARC(通过最大化ROC曲线下的面积来排序实例)算法通过学习一个模型对实例进行排序来解决二部排序问题。作为模型的一部分,它通过分析接收者工作特征(ROC)曲线下的面积来学习每个特征的权重。结果表明,RIMARC算法比同类算法更准确、更高效。因此,我们使用该算法作为构建块,并使用同态加密和安全多方计算提供了RIMARC算法的隐私保护版本。我们提出的算法允许数据所有者将其加密数据集的存储和处理外包给半可信的云。然后,研究人员可以通过与云交互,在数据集上获得他/她的查询结果(以学习排名函数)。在这个过程中,研究人员和云都没有学习到关于原始数据集的任何信息。通过对实际数据的实验,证明了该算法的安全性和有效性。
{"title":"A Privacy-Preserving Solution for the Bipartite Ranking Problem","authors":"N. Faramarzi, Erman Ayday, H. Altay Güvenir","doi":"10.1109/ICMLA.2016.0067","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0067","url":null,"abstract":"In this paper, we propose an efficient solution for the privacy-preserving of a bipartite ranking algorithm. The bipartite ranking problem can be considered as finding a function that ranks positive instances (in a dataset) higher than the negative ones. However, one common concern for all the existing schemes is the privacy of individuals in the dataset. That is, one (e.g., a researcher) needs to access the records of all individuals in the dataset in order to run the algorithm. This privacy concern puts limitations on the use of sensitive personal data for such analysis. The RIMARC (Ranking Instances by Maximizing Area under the ROC Curve) algorithm solves the bipartite ranking problem by learning a model to rank instances. As part of the model, it learns weights for each feature by analyzing the area under receiver operating characteristic (ROC) curve. RIMARC algorithm is shown to be more accurate and efficient than its counterparts. Thus, we use this algorithm as a building-block and provide a privacy-preserving version of the RIMARC algorithm using homomorphic encryption and secure multi-party computation. Our proposed algorithm lets a data owner outsource the storage and processing of its encrypted dataset to a semi-trusted cloud. Then, a researcher can get the results of his/her queries (to learn the ranking function) on the dataset by interacting with the cloud. During this process, neither the researcher nor the cloud learns any information about the raw dataset. We prove the security of the proposed algorithm and show its efficiency via experiments on real data.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127901713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Detecting Smooth Cluster Changes in Evolving Graphs 进化图中平滑聚类变化的检测
Sohei Okui, Kaho Osamura, Akihiro Inokuchi
Clustering vertices in graphs or in sequences of graphs has important applications in network science, bioinformatics, and other areas. Most research to date has focused on static graphs or sequences where the number of vertices does not change. We propose a new algorithm that successfully partitions the vertices of a graph sequence into smooth clusters, even when the number of vertices is allowed to vary over time. Our approach uses spectral clustering and relies on applying the k partition problem to a graph constructed from the input graph sequence. Several experiments demonstrate the performance of our method and its advantages over existing methods.
图或图序列中的聚类顶点在网络科学、生物信息学和其他领域具有重要的应用。到目前为止,大多数研究都集中在顶点数量不变的静态图或序列上。我们提出了一种新的算法,即使允许顶点的数量随时间变化,也能成功地将图序列的顶点划分为光滑的簇。我们的方法使用谱聚类,并依赖于将k划分问题应用于由输入图序列构建的图。几个实验证明了该方法的性能及其相对于现有方法的优势。
{"title":"Detecting Smooth Cluster Changes in Evolving Graphs","authors":"Sohei Okui, Kaho Osamura, Akihiro Inokuchi","doi":"10.1109/ICMLA.2016.0066","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0066","url":null,"abstract":"Clustering vertices in graphs or in sequences of graphs has important applications in network science, bioinformatics, and other areas. Most research to date has focused on static graphs or sequences where the number of vertices does not change. We propose a new algorithm that successfully partitions the vertices of a graph sequence into smooth clusters, even when the number of vertices is allowed to vary over time. Our approach uses spectral clustering and relies on applying the k partition problem to a graph constructed from the input graph sequence. Several experiments demonstrate the performance of our method and its advantages over existing methods.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121312206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Demographic Group Prediction Based on Smart Device User Recognition Gestures 基于智能设备用户识别手势的人口统计群体预测
Adel R. Alharbi, M. Thornton
We propose a novel demographic group prediction mechanism for smart device users based upon the recognition of user gestures. The core idea of our proposed approach is to utilize data from a variety of the internal environmental sensors in the device to predict useful demographics information. In order to achieve this objective, an application with several intuitive user interfaces was implemented and used to capture user data. The results presented here are based upon the data from fifty users. These captured data are integrated or fused, pre-processed, analyzed, and used as training data for a supervised machine learning predictive approach. The data reduction methods are based upon principal component analysis (PCA) and linear discriminant analysis (LDA). PCA/LDA were implemented to reduce the data feature dimensions and to improve the k-nearest neighbors (KNN) supervised classification predictions. The results of our experiment indicate that high accuracy is achieved from this method. To the best of our knowledge, this is the first research that uses user recognition gestures to predict multiple demographic groups.
我们提出了一种基于用户手势识别的智能设备用户人口群体预测机制。我们提出的方法的核心思想是利用设备中各种内部环境传感器的数据来预测有用的人口统计信息。为了实现这一目标,实现了一个具有几个直观用户界面的应用程序,并使用它来捕获用户数据。这里给出的结果是基于50个用户的数据。这些捕获的数据被集成或融合、预处理、分析,并用作监督机器学习预测方法的训练数据。数据约简方法主要基于主成分分析(PCA)和线性判别分析(LDA)。采用PCA/LDA方法降低数据特征维数,提高k近邻监督分类预测的准确性。实验结果表明,该方法具有较高的精度。据我们所知,这是第一个使用用户识别手势来预测多个人口群体的研究。
{"title":"Demographic Group Prediction Based on Smart Device User Recognition Gestures","authors":"Adel R. Alharbi, M. Thornton","doi":"10.1109/ICMLA.2016.0025","DOIUrl":"https://doi.org/10.1109/ICMLA.2016.0025","url":null,"abstract":"We propose a novel demographic group prediction mechanism for smart device users based upon the recognition of user gestures. The core idea of our proposed approach is to utilize data from a variety of the internal environmental sensors in the device to predict useful demographics information. In order to achieve this objective, an application with several intuitive user interfaces was implemented and used to capture user data. The results presented here are based upon the data from fifty users. These captured data are integrated or fused, pre-processed, analyzed, and used as training data for a supervised machine learning predictive approach. The data reduction methods are based upon principal component analysis (PCA) and linear discriminant analysis (LDA). PCA/LDA were implemented to reduce the data feature dimensions and to improve the k-nearest neighbors (KNN) supervised classification predictions. The results of our experiment indicate that high accuracy is achieved from this method. To the best of our knowledge, this is the first research that uses user recognition gestures to predict multiple demographic groups.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123241604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1