首页 > 最新文献

International Conference on Signal Processing and Machine Learning最新文献

英文 中文
Detecting Blind Cross-Site Scripting Attacks Using Machine Learning 利用机器学习检测盲跨站脚本攻击
Pub Date : 2018-11-28 DOI: 10.1145/3297067.3297096
Gurpreet Kaur, Yasir Malik, Hamman W. Samuel, Fehmi Jaafar
Cross-site scripting (XSS) is a scripting attack targeting web applications by injecting malicious scripts into web pages. Blind XSS is a subset of stored XSS, where an attacker blindly deploys malicious payloads in web pages that are stored in a persistent manner on target servers. Most of the XSS detection techniques used to detect the XSS vulnerabilities are inadequate to detect blind XSS attacks. In this research, we present machine learning based approach to detect blind XSS attacks. Testing results help to identify malicious payloads that are likely to get stored in databases through web applications.
跨站脚本(XSS)是一种针对web应用程序的脚本攻击,通过在web页面中注入恶意脚本。盲目XSS是存储XSS的一个子集,攻击者在以持久方式存储在目标服务器上的网页中盲目地部署恶意有效负载。大多数用于检测XSS漏洞的XSS检测技术都不足以检测盲目的XSS攻击。在这项研究中,我们提出了一种基于机器学习的盲XSS攻击检测方法。测试结果有助于识别可能通过web应用程序存储在数据库中的恶意有效负载。
{"title":"Detecting Blind Cross-Site Scripting Attacks Using Machine Learning","authors":"Gurpreet Kaur, Yasir Malik, Hamman W. Samuel, Fehmi Jaafar","doi":"10.1145/3297067.3297096","DOIUrl":"https://doi.org/10.1145/3297067.3297096","url":null,"abstract":"Cross-site scripting (XSS) is a scripting attack targeting web applications by injecting malicious scripts into web pages. Blind XSS is a subset of stored XSS, where an attacker blindly deploys malicious payloads in web pages that are stored in a persistent manner on target servers. Most of the XSS detection techniques used to detect the XSS vulnerabilities are inadequate to detect blind XSS attacks. In this research, we present machine learning based approach to detect blind XSS attacks. Testing results help to identify malicious payloads that are likely to get stored in databases through web applications.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"139 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131427974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Brain Function Networks Reveal Movement-related EEG Potentials Associated with Exercise-induced Fatigue 脑功能网络揭示与运动诱发疲劳相关的运动相关脑电图电位
Pub Date : 2018-11-28 DOI: 10.1145/3297067.3297074
Jiahui Wang, Kun Yang, Jianhai Zhang, N. Zhang, Bin Chen
The present research was aimed to find out EEG potentials related to movement in exercise-induced fatigue task using brain function network analysis, so that future researchers can find more accurate mutual informations between these potentials to detect fatigue to make healthy people exercise better and especially improve the effectiveness of rehabilitation in patients with motor dysfunction. EEG signals from 32 electrode sites of 20 subjects(10 adults (5 females and 5 males) and 10 children (6 females and 4 males) were recorded. We applied network topologies extracted from brain function networks constructed by phase synchronization to identify movement-related electrode sites. We first found that there were significant differences on the global network topologies of subjects of different ages and genders, and the difference between subjects of different ages was greater, so adults and children in the subjects were separated to discuss potential selection related to movement. The following finding illustrated that local network topologies of some electrode sites correlated significantly with the degree of fatigue, we thought and selected such electrode sites to be movement-related. Results showed that 17 potentials in adults, 6 most relevant potentials as important potentials(CP5,C3,AF4,CZ,PZ,C4), and 4 potentials (F4,F8,F3,FC5) in children were selected as movement-related EEG potentials associated with exercise-induced fatigue in rotating the forearm repetitively task. We demonstrated that the credibility of our selections by observing the classification accuracy of local network topologies of non-fatigue state and fatigue state in our selected electrode sites was higher than that of local network topologies of non-fatigue state and fatigue state in our unselected electrode sites, which suggested that our selected movement-related electrode sites were more able to detect non-fatigue state and fatigue state.
本研究旨在通过脑功能网络分析,找出运动性疲劳任务中与运动相关的脑电图电位,以便未来研究人员能够更准确地发现这些电位之间的相互信息,从而检测疲劳,使健康人更好地运动,特别是提高运动功能障碍患者的康复效果。记录20名被试(10名成人(5女5男)和10名儿童(6女4男)32个电极的脑电信号。我们应用从相同步构建的脑功能网络中提取的网络拓扑来识别与运动相关的电极位置。我们首先发现不同年龄和性别的被试在全局网络拓扑结构上存在显著差异,且不同年龄的被试之间的差异更大,因此将被试中的成人和儿童分开讨论与运动相关的潜在选择。以下发现表明,某些电极位置的局部网络拓扑结构与疲劳程度显著相关,我们认为并选择了与运动相关的这些电极位置。结果表明,成人有17个脑电位,儿童有4个脑电位(F4、F8、F3、FC5),儿童有6个脑电位(CP5、C3、AF4、CZ、PZ、C4)为前臂重复性旋转运动疲劳相关脑电位。通过观察我们选择的电极位置的非疲劳状态和疲劳状态的局部网络拓扑的分类精度,我们证明了我们选择的可信度高于我们未选择的电极位置的非疲劳状态和疲劳状态的局部网络拓扑,这表明我们选择的运动相关电极位置更能够检测非疲劳状态和疲劳状态。
{"title":"Brain Function Networks Reveal Movement-related EEG Potentials Associated with Exercise-induced Fatigue","authors":"Jiahui Wang, Kun Yang, Jianhai Zhang, N. Zhang, Bin Chen","doi":"10.1145/3297067.3297074","DOIUrl":"https://doi.org/10.1145/3297067.3297074","url":null,"abstract":"The present research was aimed to find out EEG potentials related to movement in exercise-induced fatigue task using brain function network analysis, so that future researchers can find more accurate mutual informations between these potentials to detect fatigue to make healthy people exercise better and especially improve the effectiveness of rehabilitation in patients with motor dysfunction. EEG signals from 32 electrode sites of 20 subjects(10 adults (5 females and 5 males) and 10 children (6 females and 4 males) were recorded. We applied network topologies extracted from brain function networks constructed by phase synchronization to identify movement-related electrode sites.\u0000 We first found that there were significant differences on the global network topologies of subjects of different ages and genders, and the difference between subjects of different ages was greater, so adults and children in the subjects were separated to discuss potential selection related to movement. The following finding illustrated that local network topologies of some electrode sites correlated significantly with the degree of fatigue, we thought and selected such electrode sites to be movement-related. Results showed that 17 potentials in adults, 6 most relevant potentials as important potentials(CP5,C3,AF4,CZ,PZ,C4), and 4 potentials (F4,F8,F3,FC5) in children were selected as movement-related EEG potentials associated with exercise-induced fatigue in rotating the forearm repetitively task. We demonstrated that the credibility of our selections by observing the classification accuracy of local network topologies of non-fatigue state and fatigue state in our selected electrode sites was higher than that of local network topologies of non-fatigue state and fatigue state in our unselected electrode sites, which suggested that our selected movement-related electrode sites were more able to detect non-fatigue state and fatigue state.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131202418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Pedestrian Detection in Fish-eye Images using Deep Learning: Combine Faster R-CNN with an effective Cutting Method 使用深度学习的鱼眼图像行人检测:将更快的R-CNN与有效的切割方法相结合
Pub Date : 2018-11-28 DOI: 10.1145/3297067.3297069
Hongli Lin, Zhenzhen Kong, Weisheng Wang, K. Liang, Jun Chen
With the development of artificial intelligence, pedestrian detection has become an important research topic in the field of intelligent video surveillance. Fish-eye camera is a useful tool for video monitoring. However, due to the edge distortion of the fish-eye image, which puts higher requirements and challenges on the pedestrian detection technology of fish-eye images. In this paper, an effective method is proposed by rotating cutting to address the problem, a fish-eye image is divided into an edge portion and a center portion. The effectiveness and performance of our method is verified by the traditional pedestrian detection method HOG+SVM and the Faster R-CNN based on convolutional neural network. The experimental results demonstrate the efficacy of the proposed approach, and Faster R-CNN achieves better performance than traditional method.
随着人工智能的发展,行人检测已成为智能视频监控领域的重要研究课题。鱼眼摄像机是一种非常有用的视频监控工具。然而,由于鱼眼图像的边缘失真,这对鱼眼图像的行人检测技术提出了更高的要求和挑战。本文提出了一种有效的方法——旋转切割,将鱼眼图像分割为边缘部分和中心部分。通过传统的行人检测方法HOG+SVM和基于卷积神经网络的Faster R-CNN验证了本文方法的有效性和性能。实验结果证明了该方法的有效性,更快的R-CNN比传统方法取得了更好的性能。
{"title":"Pedestrian Detection in Fish-eye Images using Deep Learning: Combine Faster R-CNN with an effective Cutting Method","authors":"Hongli Lin, Zhenzhen Kong, Weisheng Wang, K. Liang, Jun Chen","doi":"10.1145/3297067.3297069","DOIUrl":"https://doi.org/10.1145/3297067.3297069","url":null,"abstract":"With the development of artificial intelligence, pedestrian detection has become an important research topic in the field of intelligent video surveillance. Fish-eye camera is a useful tool for video monitoring. However, due to the edge distortion of the fish-eye image, which puts higher requirements and challenges on the pedestrian detection technology of fish-eye images. In this paper, an effective method is proposed by rotating cutting to address the problem, a fish-eye image is divided into an edge portion and a center portion. The effectiveness and performance of our method is verified by the traditional pedestrian detection method HOG+SVM and the Faster R-CNN based on convolutional neural network. The experimental results demonstrate the efficacy of the proposed approach, and Faster R-CNN achieves better performance than traditional method.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121729387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Feature Selection by Maximizing Part Mutual Information 最大化零件互信息的特征选择
Pub Date : 2018-11-28 DOI: 10.1145/3297067.3297068
Wanfu Gao, Liang Hu, Ping Zhang
Feature selection is an important preprocessing stage in signal processing and machine learning. Feature selection methods choose the most informative feature subset for classification. Mutual information and conditional mutual information are used extensively in feature selection methods. However, mutual information suffers from an overestimation problem, with conditional mutual information suffering from a problem of underestimation. To address the issues of overestimation and underestimation, we introduce a new measure named part mutual information that could accurately quantify direct association among variables. The proposed method selects the maximal value of cumulative summation of the part mutual information between candidate features and class labels when each selected feature is known. To evaluate the classification performance of the proposed method, our method is compared with four state-of the-art feature selection methods on twelve real-world data sets. Extensive studies demonstrate that our method outperforms the four compared methods in terms of average classification accuracy and the highest classification accuracy.
特征选择是信号处理和机器学习中一个重要的预处理阶段。特征选择方法选择信息量最大的特征子集进行分类。互信息和条件互信息在特征选择方法中得到了广泛的应用。然而,互信息存在高估的问题,而条件互信息存在低估的问题。为了解决高估和低估的问题,我们引入了一种新的度量,即部分互信息,它可以准确地量化变量之间的直接关联。该方法选取已知候选特征与类标号之间部分互信息累积和的最大值。为了评估所提出方法的分类性能,将我们的方法与四种最先进的特征选择方法在12个真实数据集上进行了比较。广泛的研究表明,我们的方法在平均分类精度和最高分类精度方面优于四种比较方法。
{"title":"Feature Selection by Maximizing Part Mutual Information","authors":"Wanfu Gao, Liang Hu, Ping Zhang","doi":"10.1145/3297067.3297068","DOIUrl":"https://doi.org/10.1145/3297067.3297068","url":null,"abstract":"Feature selection is an important preprocessing stage in signal processing and machine learning. Feature selection methods choose the most informative feature subset for classification. Mutual information and conditional mutual information are used extensively in feature selection methods. However, mutual information suffers from an overestimation problem, with conditional mutual information suffering from a problem of underestimation. To address the issues of overestimation and underestimation, we introduce a new measure named part mutual information that could accurately quantify direct association among variables. The proposed method selects the maximal value of cumulative summation of the part mutual information between candidate features and class labels when each selected feature is known. To evaluate the classification performance of the proposed method, our method is compared with four state-of the-art feature selection methods on twelve real-world data sets. Extensive studies demonstrate that our method outperforms the four compared methods in terms of average classification accuracy and the highest classification accuracy.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116923773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Target-depth Estimation for Active Towed Array Sonar in Shallow Sea base on Matched Field Processing 基于匹配场处理的浅海有源拖曳阵声呐目标深度估计
Pub Date : 2018-11-28 DOI: 10.1145/3297067.3297087
Jun Wang, Fuchen Liu
Target depth estimation can facilitate classification of surface ships or water-column targets thus reducing the false rates in active surveillance systems. Active sonar mainly determines the distance of the target by measuring the roundtrip time of the transmitted signal to the received echo, but it can't determine the depth of the target. For the long distance sound field, the echo is regarded as a point source sound field emitted from the reflector, the distance-depth space is divided into grids, and the sound field at each grid point is calculated according to the parameters of the ocean environment, and then matched with the received echoes, the best match point is the distance and depth of the target. In the active matched field depth-estimation algorithm, the pulse signal generated by the active sonar is sent to the transmitter to generate sound wave, at the same time, it is sent to the emission model to calculate the copy field of the hypothetical target point, and then the reflected sound field of the hypothetical target is calculated through the reflection model, finally, calculate the total copy vector at the receiving hydrophone array. The active matched field processor matches the received echo signal with the calculated total copy vector and outputs an ambiguity surface, it can be seen that the active matched field processing makes full use of the ocean environment. Since the active sonar has estimated the distance of the target according to the arrival time of the echo, the matched field depth estimation is to search for the target depth in a small range so as to determine the depth of the target. Sea trial data show that under good hydrological conditions, when the SNR of target echo is relatively high, the low-frequency active towed array sonar has good depth estimation capability.
目标深度估计有助于水面舰艇或水柱目标的分类,从而降低主动监视系统的误报率。主动声呐主要通过测量发射信号与接收回波的往返时间来确定目标的距离,但不能确定目标的深度。对于长距离声场,将回波视为反射器发射的点源声场,将距离-深度空间划分为网格,根据海洋环境参数计算每个网格点处的声场,然后与接收到的回波进行匹配,最佳匹配点为目标的距离和深度。在主动匹配场深度估计算法中,主动声纳产生的脉冲信号被发送到发射机产生声波,同时被发送到发射模型计算假设目标点的复制场,然后通过反射模型计算假设目标点的反射声场,最后计算接收水听器阵列处的总复制矢量。主动匹配场处理器将接收到的回波信号与计算得到的总复制向量进行匹配,输出歧义面,可见主动匹配场处理充分利用了海洋环境。由于主动声纳根据回波到达时间估计目标距离,匹配场深度估计就是在小范围内搜索目标深度,从而确定目标深度。海试数据表明,在良好的水文条件下,当目标回波信噪比较高时,低频主动拖曳阵声呐具有较好的深度估计能力。
{"title":"Target-depth Estimation for Active Towed Array Sonar in Shallow Sea base on Matched Field Processing","authors":"Jun Wang, Fuchen Liu","doi":"10.1145/3297067.3297087","DOIUrl":"https://doi.org/10.1145/3297067.3297087","url":null,"abstract":"Target depth estimation can facilitate classification of surface ships or water-column targets thus reducing the false rates in active surveillance systems. Active sonar mainly determines the distance of the target by measuring the roundtrip time of the transmitted signal to the received echo, but it can't determine the depth of the target. For the long distance sound field, the echo is regarded as a point source sound field emitted from the reflector, the distance-depth space is divided into grids, and the sound field at each grid point is calculated according to the parameters of the ocean environment, and then matched with the received echoes, the best match point is the distance and depth of the target. In the active matched field depth-estimation algorithm, the pulse signal generated by the active sonar is sent to the transmitter to generate sound wave, at the same time, it is sent to the emission model to calculate the copy field of the hypothetical target point, and then the reflected sound field of the hypothetical target is calculated through the reflection model, finally, calculate the total copy vector at the receiving hydrophone array. The active matched field processor matches the received echo signal with the calculated total copy vector and outputs an ambiguity surface, it can be seen that the active matched field processing makes full use of the ocean environment. Since the active sonar has estimated the distance of the target according to the arrival time of the echo, the matched field depth estimation is to search for the target depth in a small range so as to determine the depth of the target. Sea trial data show that under good hydrological conditions, when the SNR of target echo is relatively high, the low-frequency active towed array sonar has good depth estimation capability.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129305764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Unsupervised Depth Estimation from Monocular Video based on Relative Motion 基于相对运动的单目视频无监督深度估计
Pub Date : 2018-11-28 DOI: 10.1145/3297067.3297094
H. Cao, Chao Wang, Ping Wang, Qingquan Zou, Xiao Xiao
In this paper, we present an unsupervised learning based approach to conduct depth estimation for monocular camera video images. Our system is formed by two convolutional neural networks (CNNs). A Depth-net is applied to estimate the depth information of objects in the target frame, and a Pose-net tends to estimate the relative motion of the camera from multiple adjacent video frames. Different from most previous works, which normally assume that all objects captured by the images are static so that a frame-level camera pose is generated by the Pose-net, we take into account of the motions of all objects and require the Pose-net to estimate the pixel-level relative pose. The outputs of the two networks are then combined to formulate a synthetic view loss function, through which the two CNNs are optimized to provide accurate depth estimation. Experimental test results show that our method can provide better performance than most conventional approaches.
在本文中,我们提出了一种基于无监督学习的方法来对单目摄像机视频图像进行深度估计。我们的系统由两个卷积神经网络(cnn)组成。深度网络用于估计目标帧中物体的深度信息,而姿态网络则倾向于从多个相邻视频帧中估计摄像机的相对运动。与以往大多数工作不同的是,通常假设图像捕获的所有物体都是静态的,从而由pose -net生成帧级相机姿态,我们考虑了所有物体的运动,并要求pose -net估计像素级的相对姿态。然后将两个网络的输出组合成一个合成的视图损失函数,通过该函数对两个cnn进行优化以提供准确的深度估计。实验测试结果表明,该方法比大多数传统方法具有更好的性能。
{"title":"Unsupervised Depth Estimation from Monocular Video based on Relative Motion","authors":"H. Cao, Chao Wang, Ping Wang, Qingquan Zou, Xiao Xiao","doi":"10.1145/3297067.3297094","DOIUrl":"https://doi.org/10.1145/3297067.3297094","url":null,"abstract":"In this paper, we present an unsupervised learning based approach to conduct depth estimation for monocular camera video images. Our system is formed by two convolutional neural networks (CNNs). A Depth-net is applied to estimate the depth information of objects in the target frame, and a Pose-net tends to estimate the relative motion of the camera from multiple adjacent video frames. Different from most previous works, which normally assume that all objects captured by the images are static so that a frame-level camera pose is generated by the Pose-net, we take into account of the motions of all objects and require the Pose-net to estimate the pixel-level relative pose. The outputs of the two networks are then combined to formulate a synthetic view loss function, through which the two CNNs are optimized to provide accurate depth estimation. Experimental test results show that our method can provide better performance than most conventional approaches.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123231843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimality Analysis of Boundary-Uncertainty-Based Classifier Model Parameter Status Selection Method 基于边界不确定性的分类器模型参数状态选择方法的最优性分析
Pub Date : 2018-11-28 DOI: 10.1145/3297067.3297076
David R Ha, Hideyuki Watanabe, Yuya Tomotoshi, Emilie Delattre, S. Katagiri
We proposed a novel method that selects an optimal classifier model's parameter status through the uncertainty measure evaluation of the estimated class boundaries instead of an estimation of the classification error probability. A key feature of our method is its potential to perform a classifier parameter status selection without a separate validation sample set that can be easily applied to any reasonable type of classifier model, unlike traditional approaches that often need a validation sample set or are sometimes less practical. In this paper, we first summarize our method and its experimental evaluation results and introduce the mathematical formalization for the posterior probability estimation procedure adopted in it. Then we show the convergence property of the estimation procedure and finally demonstrate our method's optimality in a practical situation where only a finite number of training samples are available.
本文提出了一种新的方法,通过对估计的类边界的不确定性度量评价来选择最优的分类器模型的参数状态,而不是估计分类错误概率。我们的方法的一个关键特征是它有可能在没有单独的验证样本集的情况下执行分类器参数状态选择,这可以很容易地应用于任何合理类型的分类器模型,而不像传统方法通常需要验证样本集或有时不太实用。本文首先总结了我们的方法及其实验评价结果,并介绍了该方法所采用的后验概率估计过程的数学形式化。然后我们证明了估计过程的收敛性,最后证明了我们的方法在训练样本数量有限的实际情况下的最优性。
{"title":"Optimality Analysis of Boundary-Uncertainty-Based Classifier Model Parameter Status Selection Method","authors":"David R Ha, Hideyuki Watanabe, Yuya Tomotoshi, Emilie Delattre, S. Katagiri","doi":"10.1145/3297067.3297076","DOIUrl":"https://doi.org/10.1145/3297067.3297076","url":null,"abstract":"We proposed a novel method that selects an optimal classifier model's parameter status through the uncertainty measure evaluation of the estimated class boundaries instead of an estimation of the classification error probability. A key feature of our method is its potential to perform a classifier parameter status selection without a separate validation sample set that can be easily applied to any reasonable type of classifier model, unlike traditional approaches that often need a validation sample set or are sometimes less practical. In this paper, we first summarize our method and its experimental evaluation results and introduce the mathematical formalization for the posterior probability estimation procedure adopted in it. Then we show the convergence property of the estimation procedure and finally demonstrate our method's optimality in a practical situation where only a finite number of training samples are available.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"171 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117302536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Speech Emotion Classification using Raw Audio Input and Transcriptions 使用原始音频输入和转录的语音情感分类
Pub Date : 2018-11-28 DOI: 10.1145/3297067.3297089
Gabriel Lima, Jinyeong Bak
As new gadgets that interact with the user through voice become accessible, the importance of not only the content of the speech increases, but also the significance of the way the user has spoken. Even though many techniques have been developed to indicate emotion on speech, none of them can fully grasp the real emotion of the speaker. This paper presents a neural network model capable of predicting emotions in conversations by analyzing transcriptions and raw audio waveforms, focusing on feature extraction using convolutional layers and feature combination. The model achieves an accuracy of over 71% across four classes: Anger, Happiness, Neutrality and Sadness. We also analyze the effect of audio and textual features on the classification task, by interpreting attention scores and parts of speech. This paper explores the use of raw audio waveforms, that in the best of our knowledge, have not yet been used deeply in the emotion classification task, achieving close to state of art results.
随着通过语音与用户互动的新设备变得触手可及,不仅语音内容的重要性增加了,而且用户说话方式的重要性也增加了。尽管已经发展了许多技术来表达言语中的情感,但没有一种技术能够完全把握说话人的真实情感。本文提出了一个神经网络模型,能够通过分析转录和原始音频波形来预测对话中的情绪,重点是使用卷积层和特征组合进行特征提取。该模型在四个类别(愤怒、快乐、中立和悲伤)中达到了超过71%的准确率。我们还通过解释注意分数和词性来分析音频和文本特征对分类任务的影响。本文探索了原始音频波形的使用,据我们所知,这些波形尚未在情感分类任务中被深入使用,取得了接近最先进的结果。
{"title":"Speech Emotion Classification using Raw Audio Input and Transcriptions","authors":"Gabriel Lima, Jinyeong Bak","doi":"10.1145/3297067.3297089","DOIUrl":"https://doi.org/10.1145/3297067.3297089","url":null,"abstract":"As new gadgets that interact with the user through voice become accessible, the importance of not only the content of the speech increases, but also the significance of the way the user has spoken. Even though many techniques have been developed to indicate emotion on speech, none of them can fully grasp the real emotion of the speaker. This paper presents a neural network model capable of predicting emotions in conversations by analyzing transcriptions and raw audio waveforms, focusing on feature extraction using convolutional layers and feature combination. The model achieves an accuracy of over 71% across four classes: Anger, Happiness, Neutrality and Sadness. We also analyze the effect of audio and textual features on the classification task, by interpreting attention scores and parts of speech. This paper explores the use of raw audio waveforms, that in the best of our knowledge, have not yet been used deeply in the emotion classification task, achieving close to state of art results.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133684582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Object Detection Based on Binocular Vision with Convolutional Neural Network 基于卷积神经网络的双目视觉目标检测
Pub Date : 2018-11-28 DOI: 10.1145/3297067.3297081
Zekun Luo, Xia Wu, Qingquan Zou, Xiao Xiao
Autonomous vehicles are widely accepted as one of the most potential technologies in alleviating traffic problems. In most existing autonomous vehicles for object detection and distance measurement, compared with radar or LIDAR which obviously increases the cost, camera combined with Convolutional Neural Network (CNN) has advantage in accuracy and low cost. However, most object detection methods applied on camera cannot perform distance measurement. In this paper, we simultaneously carry out real-time object detection and distance measurement (DDM) in one system by utilizing CNN on a binocular camera. Firstly, a binocular camera is used to acquire disparity maps. Secondly, a set of high-quality region proposals is generated by those disparity maps and the number of region proposals is reduced. Thirdly, CNN is utilized to classify those region proposals and get the bounding box of detected objects. Consequently, those reduced region proposals generated by disparity maps lead to improved computational efficiency. Finally, the object distance is measured by the disparity map and the bounding box. The experiment results show that the proposed method can achieve an accuracy of 87.2% on KITTI dataset and an accuracy of 68% in the real environment for object detection. The average relative error of the distance measurement is 0.85% within 10 meters in real environment. The operation time of the whole DDM system is less than 80 ms.
自动驾驶汽车被广泛认为是缓解交通问题的最具潜力的技术之一。在现有的大多数自动驾驶汽车的目标检测和距离测量中,相对于雷达或激光雷达的成本明显增加,摄像头结合卷积神经网络(CNN)具有精度高、成本低的优势。然而,大多数应用于相机的目标检测方法都不能进行距离测量。在本文中,我们利用双目摄像机上的CNN在一个系统中同时进行实时目标检测和距离测量(DDM)。首先,利用双目摄像机获取视差图;其次,利用视差图生成一组高质量的区域建议,并减少区域建议的数量;第三,利用CNN对这些区域建议进行分类,得到检测目标的边界框。因此,视差图生成的减少区域建议提高了计算效率。最后,通过视差图和边界框测量目标距离。实验结果表明,该方法在KITTI数据集上的准确率为87.2%,在真实环境下的目标检测准确率为68%。在实际环境中,距离测量的平均相对误差为0.85%,误差范围为10米。整个DDM系统的运行时间小于80ms。
{"title":"Object Detection Based on Binocular Vision with Convolutional Neural Network","authors":"Zekun Luo, Xia Wu, Qingquan Zou, Xiao Xiao","doi":"10.1145/3297067.3297081","DOIUrl":"https://doi.org/10.1145/3297067.3297081","url":null,"abstract":"Autonomous vehicles are widely accepted as one of the most potential technologies in alleviating traffic problems. In most existing autonomous vehicles for object detection and distance measurement, compared with radar or LIDAR which obviously increases the cost, camera combined with Convolutional Neural Network (CNN) has advantage in accuracy and low cost. However, most object detection methods applied on camera cannot perform distance measurement. In this paper, we simultaneously carry out real-time object detection and distance measurement (DDM) in one system by utilizing CNN on a binocular camera. Firstly, a binocular camera is used to acquire disparity maps. Secondly, a set of high-quality region proposals is generated by those disparity maps and the number of region proposals is reduced. Thirdly, CNN is utilized to classify those region proposals and get the bounding box of detected objects. Consequently, those reduced region proposals generated by disparity maps lead to improved computational efficiency. Finally, the object distance is measured by the disparity map and the bounding box. The experiment results show that the proposed method can achieve an accuracy of 87.2% on KITTI dataset and an accuracy of 68% in the real environment for object detection. The average relative error of the distance measurement is 0.85% within 10 meters in real environment. The operation time of the whole DDM system is less than 80 ms.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124931248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Expressway Crash Prediction based on Traffic Big Data 基于交通大数据的高速公路碰撞预测
Pub Date : 2018-11-28 DOI: 10.1145/3297067.3297093
Hailang Meng, Xinhong Wang, X. Wang
With the development of society, the number of vehicles increases rapidly. The vehicle plays an important role in people's life, however the problem of traffic safety caused by vehicles has also become increasingly prominent. In China, the high crash rate and casualty rate on expressways have always troubled traffic management department. So crash prediction on expressway becomes vital. Conventionally, crash prediction is based on traffic flow data. These data do not contain all the necessary factors. In this paper, we propose a method of prediction using real-world data, including historical accident data, road geometry data, vehicle speed data, and weather data. We treat the crash prediction problem as a binary classification problem. For classification, sample imbalanced is a great challenge in practice. Modifying sample weights is applied to handle this challenge. Three machine learning classification techniques, namely Random Forest (RF), Gradient Boosting Decision Tree (GBDT) and Xgboost, are considered to carry out the crash prediction task respectively. The best recall and precision rate of these models are respectively 0.764253 and 0.01062. The proposed method can be integrated into urban traffic control systems toward police dispatch and crash prevention.
随着社会的发展,车辆的数量迅速增加。车辆在人们的生活中扮演着重要的角色,然而由车辆引起的交通安全问题也日益突出。在中国,高速公路的高事故率和伤亡率一直困扰着交通管理部门。因此,高速公路碰撞预测变得至关重要。传统上,碰撞预测是基于交通流量数据的。这些数据不包括所有必要的因素。在本文中,我们提出了一种使用现实世界数据的预测方法,包括历史事故数据、道路几何数据、车速数据和天气数据。我们将碰撞预测问题视为一个二分类问题。对于分类来说,样本不平衡在实践中是一个很大的挑战。修改样本权重可以解决这个问题。采用随机森林(Random Forest, RF)、梯度提升决策树(Gradient Boosting Decision Tree, GBDT)和Xgboost三种机器学习分类技术分别完成碰撞预测任务。这些模型的最佳查全率和准确率分别为0.764253和0.01062。该方法可以集成到城市交通控制系统中,用于警察调度和事故预防。
{"title":"Expressway Crash Prediction based on Traffic Big Data","authors":"Hailang Meng, Xinhong Wang, X. Wang","doi":"10.1145/3297067.3297093","DOIUrl":"https://doi.org/10.1145/3297067.3297093","url":null,"abstract":"With the development of society, the number of vehicles increases rapidly. The vehicle plays an important role in people's life, however the problem of traffic safety caused by vehicles has also become increasingly prominent. In China, the high crash rate and casualty rate on expressways have always troubled traffic management department. So crash prediction on expressway becomes vital. Conventionally, crash prediction is based on traffic flow data. These data do not contain all the necessary factors. In this paper, we propose a method of prediction using real-world data, including historical accident data, road geometry data, vehicle speed data, and weather data. We treat the crash prediction problem as a binary classification problem. For classification, sample imbalanced is a great challenge in practice. Modifying sample weights is applied to handle this challenge. Three machine learning classification techniques, namely Random Forest (RF), Gradient Boosting Decision Tree (GBDT) and Xgboost, are considered to carry out the crash prediction task respectively. The best recall and precision rate of these models are respectively 0.764253 and 0.01062. The proposed method can be integrated into urban traffic control systems toward police dispatch and crash prevention.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114248185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
International Conference on Signal Processing and Machine Learning
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1