首页 > 最新文献

Proceedings of the 9th International Symposium on Information and Communication Technology最新文献

英文 中文
Calf Robust Weight Estimation Using 3D Contiguous Cylindrical Model and Directional Orientation from Stereo Images 基于三维连续圆柱模型和立体图像定向的小牛鲁棒权重估计
Ryo Nishide, Ayumi Yamashita, Yumi Takaki, C. Ohta, K. Oyama, T. Ohkawa
Calving interval is often used as an indicator for fertility of beef cattle, however, maternal abilities are also required because the value of breeding cows depends on how efficiently the healthy and growing calves are produced. The calf's weight has been used as an indicator of maternity ability since the past few decades. We propose a method to estimate body weight by modeling the shape of calf using 3D information extracted from the stereo images. This method enables to predict the swelling of the cattle's body by creating a 3D model, which cannot be obtained solely from a 2D image. In addition, it is possible to estimate robust weight regardless of different shooting conditions toward cattle's posture and orientation. An image suitable for estimation is selected from motion images taken by the camera installed in the barn, and 3D coordinates are calculated by the images. Then, only the body is developed with a 3D model as it has the highest correlation with the body weight. Considering that the side of cattle's body may not be exactly perpendicular to the camera's shooting direction, a symmetric axis is extracted to find the inclination of cattle body from the camera in order to generate a 3D model based on the symmetric axis. 3D contiguous cylindrical model is used for the body of a cattle which has a rounded shape. In order to manipulate the shapes of the cylindrical surface, the circle and ellipse fittings are applied and compared. The linear regression equation of the volume of the cylindrical model and the actually measured body weight are used to estimate the cattle weight. As a result of modeling with the proposed method using the actual camera images, the correlation coefficient between the body weight and the model volume was at the best value, 0.9107. Even when experimentally examined with the different 3D coordinates obtained from other types of camera, the MAPE (Mean Absolute Percentage Error) was as low as 6.39%.
产犊间隔常被用作肉牛生育力的指标,然而,由于繁殖奶牛的价值取决于健康和生长的小牛的生产效率,因此也需要母亲的能力。过去几十年来,小牛的体重一直被用作衡量母性能力的指标。本文提出了一种利用立体图像中提取的三维信息对小腿形状进行建模来估计体重的方法。这种方法可以通过创建3D模型来预测牛的身体肿胀,这是无法从2D图像中获得的。此外,无论不同的射击条件对牛的姿势和方向,都可以估计健壮的重量。从安装在谷仓内的摄像机拍摄的运动图像中选择适合估计的图像,并根据图像计算三维坐标。然后,只有身体是用3D模型开发的,因为它与体重的相关性最高。考虑到牛身体的侧面可能并不完全垂直于摄像机的拍摄方向,提取对称轴来寻找牛身体与摄像机的倾斜度,并基于对称轴生成三维模型。牛体为圆形,采用三维连续圆柱形模型。为了控制圆柱形表面的形状,采用了圆形和椭圆形管件,并进行了比较。利用圆柱形模型的体积和实际测量的体重的线性回归方程来估计牛的体重。采用该方法对实际摄像机图像进行建模后,体重与模型体积的相关系数达到最佳值,为0.9107。即使使用其他类型相机获得的不同三维坐标进行实验检验,MAPE(平均绝对百分比误差)也低至6.39%。
{"title":"Calf Robust Weight Estimation Using 3D Contiguous Cylindrical Model and Directional Orientation from Stereo Images","authors":"Ryo Nishide, Ayumi Yamashita, Yumi Takaki, C. Ohta, K. Oyama, T. Ohkawa","doi":"10.1145/3287921.3287923","DOIUrl":"https://doi.org/10.1145/3287921.3287923","url":null,"abstract":"Calving interval is often used as an indicator for fertility of beef cattle, however, maternal abilities are also required because the value of breeding cows depends on how efficiently the healthy and growing calves are produced. The calf's weight has been used as an indicator of maternity ability since the past few decades. We propose a method to estimate body weight by modeling the shape of calf using 3D information extracted from the stereo images. This method enables to predict the swelling of the cattle's body by creating a 3D model, which cannot be obtained solely from a 2D image. In addition, it is possible to estimate robust weight regardless of different shooting conditions toward cattle's posture and orientation. An image suitable for estimation is selected from motion images taken by the camera installed in the barn, and 3D coordinates are calculated by the images. Then, only the body is developed with a 3D model as it has the highest correlation with the body weight. Considering that the side of cattle's body may not be exactly perpendicular to the camera's shooting direction, a symmetric axis is extracted to find the inclination of cattle body from the camera in order to generate a 3D model based on the symmetric axis. 3D contiguous cylindrical model is used for the body of a cattle which has a rounded shape. In order to manipulate the shapes of the cylindrical surface, the circle and ellipse fittings are applied and compared. The linear regression equation of the volume of the cylindrical model and the actually measured body weight are used to estimate the cattle weight. As a result of modeling with the proposed method using the actual camera images, the correlation coefficient between the body weight and the model volume was at the best value, 0.9107. Even when experimentally examined with the different 3D coordinates obtained from other types of camera, the MAPE (Mean Absolute Percentage Error) was as low as 6.39%.","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128816245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A New Framework For Crowded Scene Counting Based On Weighted Sum Of Regressors and Human Classifier 基于回归量加权和和人类分类器的拥挤场景计数新框架
P. Do, N. Ly
Crowd density estimation is an important task in the surveillance camera system, it serves in security, traffic, business etc. At the present, the trend of monitoring is moving from individual to crowd, but traditional counting techniques will be inefficient in this case because of issues such as scale, clutter background and occlusion. Most of the previous methods have focused on modeling work to accurately estimate the density map and thus infer the count. However, with non-human scenes, which have many clouds, trees, houses, seas etc, these models are often confused, resulting in inaccurate count estimates. To overcome this problem, we propose the "Weighted Sum of Regressors and Human Classifier" (WSRHC) method. Our model consists of two main parts: human -- non-human classification and crowd counting estimation. First of all, we built a Human Classifier, which filters out negative sample images (non-human images) before entering into the regressors. Then, the count estimation is based on the regressors. The difference between regressors is the size of the filters. The essence of this method is the count depends on the weighted average of the density map obtained from these regressors. This is to overcome the defects of the previous model, Switching Convolutional Neural Network (Switch-CNN) select the count as the output of one of the regressors. Multi-Column Convolutional Neural Network (MCNN) combines the count and the weight of the Regressors by fixed weights from MCNN, while our approach is adapted for individual images. Our experiments have shown that our method outperform Switch-CNN, MCNN on ShanghaiTech dataset and UCF_CC_50 dataset.
人群密度估计是监控摄像系统中的一项重要工作,它服务于安防、交通、商业等领域。目前,监测的趋势正在从个体向群体发展,但传统的计数技术在这种情况下由于规模、背景杂波和遮挡等问题将会效率低下。以前的大多数方法都集中在建模工作上,以准确估计密度图,从而推断计数。然而,对于非人类场景,其中有许多云,树木,房屋,海洋等,这些模型经常混淆,导致计数估计不准确。为了克服这个问题,我们提出了“回归量和人类分类器的加权和”(WSRHC)方法。我们的模型由两个主要部分组成:人类-非人类分类和人群计数估计。首先,我们建立了一个人类分类器,它在进入回归量之前过滤掉负样本图像(非人类图像)。然后,基于回归量进行计数估计。回归量之间的区别在于过滤器的大小。该方法的本质是计数依赖于从这些回归量得到的密度图的加权平均值。这是为了克服之前模型的缺陷,切换卷积神经网络(Switch-CNN)选择计数作为其中一个回归量的输出。多列卷积神经网络(MCNN)通过MCNN的固定权重将回归量的计数和权重结合起来,而我们的方法适用于单个图像。实验结果表明,该方法在上海科技数据集和UCF_CC_50数据集上优于Switch-CNN、MCNN。
{"title":"A New Framework For Crowded Scene Counting Based On Weighted Sum Of Regressors and Human Classifier","authors":"P. Do, N. Ly","doi":"10.1145/3287921.3287980","DOIUrl":"https://doi.org/10.1145/3287921.3287980","url":null,"abstract":"Crowd density estimation is an important task in the surveillance camera system, it serves in security, traffic, business etc. At the present, the trend of monitoring is moving from individual to crowd, but traditional counting techniques will be inefficient in this case because of issues such as scale, clutter background and occlusion. Most of the previous methods have focused on modeling work to accurately estimate the density map and thus infer the count. However, with non-human scenes, which have many clouds, trees, houses, seas etc, these models are often confused, resulting in inaccurate count estimates. To overcome this problem, we propose the \"Weighted Sum of Regressors and Human Classifier\" (WSRHC) method. Our model consists of two main parts: human -- non-human classification and crowd counting estimation. First of all, we built a Human Classifier, which filters out negative sample images (non-human images) before entering into the regressors. Then, the count estimation is based on the regressors. The difference between regressors is the size of the filters. The essence of this method is the count depends on the weighted average of the density map obtained from these regressors. This is to overcome the defects of the previous model, Switching Convolutional Neural Network (Switch-CNN) select the count as the output of one of the regressors. Multi-Column Convolutional Neural Network (MCNN) combines the count and the weight of the Regressors by fixed weights from MCNN, while our approach is adapted for individual images. Our experiments have shown that our method outperform Switch-CNN, MCNN on ShanghaiTech dataset and UCF_CC_50 dataset.","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"50 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126351341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
CVSS: A Blockchainized Certificate Verifying Support System CVSS:区块链证书验证支持系统
Duc-Hiep Nguyen, Dinh-Nghia Nguyen-Duc, Nguyen Huynh-Tuong, Hoang-Anh Pham
By using a decentralized peer-to-peer network together with public and distributed ledger to decentralize the central authority, Blockchain has shown its great potential with the success of Bitcoin. However, the blockchain technology can go beyond financial transactions. In this paper, we propose an approach that utilizes the blockchain technology to issue immutable digital certificates and improve the current limitations of the existing certificate verifying systems such as faster, more trusted, and independent of the central authority. Our prototype has been successfully deployed for several short-term courses at the Center of Computer Engineering, HCMC University of Technology, Vietnam. This result indicates that our proposed system is an appropriate solution adopting ICT for e-government, especially in certificate and diploma management.
通过使用去中心化的点对点网络以及公共和分布式账本来分散中央权力,区块链随着比特币的成功显示出了巨大的潜力。然而,区块链技术可以超越金融交易。在本文中,我们提出了一种利用区块链技术颁发不可变数字证书的方法,并改进了现有证书验证系统当前的局限性,例如更快,更可信和独立于中央权威机构。我们的样机已经成功地用于越南胡志明市工业大学计算机工程中心的几个短期课程。结果表明,本文提出的系统是采用信息通信技术(ICT)进行电子政务,特别是证书和文凭管理的一种合适的解决方案。
{"title":"CVSS: A Blockchainized Certificate Verifying Support System","authors":"Duc-Hiep Nguyen, Dinh-Nghia Nguyen-Duc, Nguyen Huynh-Tuong, Hoang-Anh Pham","doi":"10.1145/3287921.3287968","DOIUrl":"https://doi.org/10.1145/3287921.3287968","url":null,"abstract":"By using a decentralized peer-to-peer network together with public and distributed ledger to decentralize the central authority, Blockchain has shown its great potential with the success of Bitcoin. However, the blockchain technology can go beyond financial transactions. In this paper, we propose an approach that utilizes the blockchain technology to issue immutable digital certificates and improve the current limitations of the existing certificate verifying systems such as faster, more trusted, and independent of the central authority. Our prototype has been successfully deployed for several short-term courses at the Center of Computer Engineering, HCMC University of Technology, Vietnam. This result indicates that our proposed system is an appropriate solution adopting ICT for e-government, especially in certificate and diploma management.","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131039234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Aspect Based Sentiment Analysis Using NeuroNER and Bidirectional Recurrent Neural Network 基于神经元和双向递归神经网络的面向情感分析
N. Tran
Nowadays, understanding sentiments of what customers say, think and review plays an important part in the success of every business. In consequence, Sentiment Analysis (SA) has been becoming a vital part in both academic and commercial standpoint in recent years. However, most of the current sentiment analysis approaches only focus on detecting the overall polarity of the whole sentence or paragraph. That is the reason why this work focuses on another approach to this task, which is Aspect Based Sentiment Analysis (ABSA). The proposed ABSA system in this paper has two main phases: aspect term extraction and aspect sentiment prediction. For the first phase, as to deal with the named-entity recognition (NER) task, it is performed by reusing the NeuroNER [1] program without any modifications because it is currently one of the best NER tool available. For the sentiment prediction task, a bidirectional gated recurrent unit (BiGRU) Recurrent Neural Network (RNN) model which processes 4 features as input: word embeddings, SenticNet [2], Part of Speech and Distance is implemented. However, this network architecture performance on SemEval 2016 [3] dataset showed some drawbacks and limitations that influenced the polarity prediction result. For this reason, this work proposes some adjustments to the mentioned model to solve the current problems and improve the accuracy of the second task.
如今,了解顾客所说、所想和评论的情绪对每项业务的成功都起着重要的作用。因此,情感分析(SA)近年来已成为学术界和商业观点的重要组成部分。然而,目前大多数情感分析方法只关注于检测整个句子或段落的整体极性。这就是为什么这项工作关注于这项任务的另一种方法,即基于方面的情感分析(ABSA)。本文提出的ABSA系统主要分为两个阶段:方面术语提取和方面情感预测。对于第一阶段,处理命名实体识别(NER)任务,通过重用NeuroNER[1]程序而不做任何修改来执行,因为它是目前可用的最好的NER工具之一。对于情感预测任务,实现了双向门控循环单元(BiGRU)循环神经网络(RNN)模型,该模型处理4个特征作为输入:词嵌入、SenticNet[2]、部分语音和距离。然而,这种网络架构在SemEval 2016[3]数据集上的性能表现出一些缺陷和局限性,影响了极性预测结果。为此,本工作对上述模型提出了一些调整,以解决目前存在的问题,提高第二项任务的准确性。
{"title":"Aspect Based Sentiment Analysis Using NeuroNER and Bidirectional Recurrent Neural Network","authors":"N. Tran","doi":"10.1145/3287921.3287922","DOIUrl":"https://doi.org/10.1145/3287921.3287922","url":null,"abstract":"Nowadays, understanding sentiments of what customers say, think and review plays an important part in the success of every business. In consequence, Sentiment Analysis (SA) has been becoming a vital part in both academic and commercial standpoint in recent years. However, most of the current sentiment analysis approaches only focus on detecting the overall polarity of the whole sentence or paragraph. That is the reason why this work focuses on another approach to this task, which is Aspect Based Sentiment Analysis (ABSA). The proposed ABSA system in this paper has two main phases: aspect term extraction and aspect sentiment prediction. For the first phase, as to deal with the named-entity recognition (NER) task, it is performed by reusing the NeuroNER [1] program without any modifications because it is currently one of the best NER tool available. For the sentiment prediction task, a bidirectional gated recurrent unit (BiGRU) Recurrent Neural Network (RNN) model which processes 4 features as input: word embeddings, SenticNet [2], Part of Speech and Distance is implemented. However, this network architecture performance on SemEval 2016 [3] dataset showed some drawbacks and limitations that influenced the polarity prediction result. For this reason, this work proposes some adjustments to the mentioned model to solve the current problems and improve the accuracy of the second task.","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"os-44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127782520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Combined Objective Function in Deep Learning Model for Abstractive Summarization 面向抽象摘要的深度学习模型中的组合目标函数
Tung Le, Le-Minh Nguyen
Abstractive Summarization is the specific task in text generation whose popular approaches are based on the strength of Recurrent Neural Network. With the purpose to take advantages of Convolution Neural Network in text representation, we propose to combine these above networks in our encoder to capture both the global and local features from the input documents. Simultaneously, our model also integrates the reinforced mechanism with the novel reward function to get the closer direction between the learning and evaluating process. Through the experiments in CNN/Daily Mail, our models gains the significant results. Especially, in ROUGE-1 and ROUGE-L, it outperforms the previous works in this task with the expressive improvement (39.09% in ROUGE-L F1-score).
摘要摘要是文本生成中的一项特殊任务,其常用的方法是基于递归神经网络的强度。为了充分利用卷积神经网络在文本表示中的优势,我们建议在编码器中结合上述网络,以从输入文档中捕获全局和局部特征。同时,我们的模型还将强化机制与新的奖励函数相结合,使学习过程与评价过程之间的方向更加紧密。通过CNN/Daily Mail的实验,我们的模型得到了显著的结果。特别是在ROUGE-1和ROUGE-L中,它在该任务中表现优于以往的作品,表现力提高(ROUGE-L f1得分为39.09%)。
{"title":"Combined Objective Function in Deep Learning Model for Abstractive Summarization","authors":"Tung Le, Le-Minh Nguyen","doi":"10.1145/3287921.3287952","DOIUrl":"https://doi.org/10.1145/3287921.3287952","url":null,"abstract":"Abstractive Summarization is the specific task in text generation whose popular approaches are based on the strength of Recurrent Neural Network. With the purpose to take advantages of Convolution Neural Network in text representation, we propose to combine these above networks in our encoder to capture both the global and local features from the input documents. Simultaneously, our model also integrates the reinforced mechanism with the novel reward function to get the closer direction between the learning and evaluating process. Through the experiments in CNN/Daily Mail, our models gains the significant results. Especially, in ROUGE-1 and ROUGE-L, it outperforms the previous works in this task with the expressive improvement (39.09% in ROUGE-L F1-score).","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117239965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cow estrus detection via Discrete Wavelet Transformation and Unsupervised Clustering 基于离散小波变换和无监督聚类的奶牛发情检测
Le Tien Thanh, Rin Nishikawa, Masashi Takemoto, Huynh Thi Thanh Binh, H. Nakajo
Estrus is a special periods in the life cycle of female cows. Within this period, they have much more chance to become pregnant. Successfully detecting this period increase the milk and meat productivity of the whole farm. Recently, a potential approach is unsupervised learning on motion data of the cows, similar to human activity recognition based on motion. In particular, an accelerometer is attached to the neck of the cows to measure their acceleration, then the unsupervised algorithm group the measured acceleration time-series. Recent study adopted bag-of-feature and Discrete Fourier Transform for feature extraction, yet it may not reflect the nature of motion data. Thus, we proposed a method based on Discrete Wavelet Transform to get the multi-resolution feature, Dynamic Time Wraping as clustering distance and Iterative-K-Means as clustering algorithm, to better match with the characteristic of cowsâĂŹ movement. The proposed methods demonstrated higher score on human activity recognition dataset with ground truth and more reliable prediction on cow motion dataset.
发情期是母牛生命周期中的一个特殊时期。在这段时间内,她们有更多的机会怀孕。成功发现这一时期可以提高整个农场的奶和肉产量。最近,一种潜在的方法是对奶牛的运动数据进行无监督学习,类似于基于运动的人类活动识别。特别是,在奶牛的脖子上安装一个加速度计来测量它们的加速度,然后无监督算法将测量到的加速度时间序列进行分组。最近的研究采用特征袋变换和离散傅里叶变换进行特征提取,但这些方法可能不能反映运动数据的本质。为此,我们提出了一种基于离散小波变换获得多分辨率特征的方法,采用动态时间包裹作为聚类距离,采用迭代k - means作为聚类算法,以更好地匹配cowsâĂŹ运动的特征。该方法在具有地面真实性的人类活动识别数据集上表现出较高的得分,在奶牛运动数据集上表现出更可靠的预测。
{"title":"Cow estrus detection via Discrete Wavelet Transformation and Unsupervised Clustering","authors":"Le Tien Thanh, Rin Nishikawa, Masashi Takemoto, Huynh Thi Thanh Binh, H. Nakajo","doi":"10.1145/3287921.3287973","DOIUrl":"https://doi.org/10.1145/3287921.3287973","url":null,"abstract":"Estrus is a special periods in the life cycle of female cows. Within this period, they have much more chance to become pregnant. Successfully detecting this period increase the milk and meat productivity of the whole farm. Recently, a potential approach is unsupervised learning on motion data of the cows, similar to human activity recognition based on motion. In particular, an accelerometer is attached to the neck of the cows to measure their acceleration, then the unsupervised algorithm group the measured acceleration time-series. Recent study adopted bag-of-feature and Discrete Fourier Transform for feature extraction, yet it may not reflect the nature of motion data. Thus, we proposed a method based on Discrete Wavelet Transform to get the multi-resolution feature, Dynamic Time Wraping as clustering distance and Iterative-K-Means as clustering algorithm, to better match with the characteristic of cowsâĂŹ movement. The proposed methods demonstrated higher score on human activity recognition dataset with ground truth and more reliable prediction on cow motion dataset.","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126760993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Two-stream Deep Residual Learning with Fisher Criterion for Human Action Recognition 基于Fisher准则的两流深度残差学习人体动作识别
D. V. Sang, Hoang Trung Dung
Action recognition is one of the most important areas in the computer vision community. Many previous work use two-stream CNN model to obtain both spatial and temporal clues for predicting task. However, two stream are trained separately and combined later by late fusion. This strategy has overlooked the spatial-temporal features interaction. In this paper, we propose new two-stream CNN architectures that are able to learn the relation between two kinds of features. Furthermore, they can be trained end-to-end with standard back propagation algorithm. We also introduce a Fisher loss that makes features more discriminative. The experiments show that Fisher loss yields higher accuracy than using only the softmax loss.
动作识别是计算机视觉领域最重要的领域之一。以前的许多工作都是使用双流CNN模型来获得空间和时间线索来预测任务。但是,两个流分别训练,然后通过后期融合合并。这一策略忽略了时空特征的相互作用。在本文中,我们提出了一种新的双流CNN架构,它能够学习两种特征之间的关系。此外,还可以使用标准的反向传播算法进行端到端训练。我们还引入了费雪损失,使特征更具判别性。实验表明,Fisher损失比仅使用softmax损失具有更高的精度。
{"title":"Two-stream Deep Residual Learning with Fisher Criterion for Human Action Recognition","authors":"D. V. Sang, Hoang Trung Dung","doi":"10.1145/3287921.3287972","DOIUrl":"https://doi.org/10.1145/3287921.3287972","url":null,"abstract":"Action recognition is one of the most important areas in the computer vision community. Many previous work use two-stream CNN model to obtain both spatial and temporal clues for predicting task. However, two stream are trained separately and combined later by late fusion. This strategy has overlooked the spatial-temporal features interaction. In this paper, we propose new two-stream CNN architectures that are able to learn the relation between two kinds of features. Furthermore, they can be trained end-to-end with standard back propagation algorithm. We also introduce a Fisher loss that makes features more discriminative. The experiments show that Fisher loss yields higher accuracy than using only the softmax loss.","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126766670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Pavement Crack Detection using Convolutional Neural Network 基于卷积神经网络的路面裂缝检测
N. H. T. Nguyen, T. Lê, S. Perry, Thuy Thi Nguyen
Pavement crack detection is an important problem in road maintenance. There are many processing methods, including traditional and modern methods, solving this issue. Traditional methods use edge detection or some other digital image processing for crack detection, but these approaches are sensitive to many types of noise and unwanted objects on the road. For the purpose of increasing accuracy, image pre-processing methods are required for many of these techniques. Recently, some techniques that utilize deep learning to detect cracks in images have achieved high accuracy, without pre-processing. However, some of them are very complicated, some make use of manually collected data and some methods still need some form of pre-processing. In this paper, we propose a method that applies a convolutional neural networks to detect cracks in pavement images. Our research uses two data sets, one public data set and the other collected by ourselves. We also experimentally compare our method with some exiting methods and the experiments show that the proposed approach achieves high accuracy and generates stable models.
路面裂缝检测是道路养护中的一个重要问题。解决这一问题的方法有很多,包括传统方法和现代方法。传统的方法使用边缘检测或一些其他数字图像处理来检测裂纹,但这些方法对许多类型的噪声和道路上不需要的物体很敏感。为了提高精度,许多这些技术都需要图像预处理方法。最近,一些利用深度学习来检测图像裂缝的技术已经达到了很高的精度,而不需要预处理。然而,有些方法非常复杂,有些方法需要人工采集数据,有些方法还需要进行某种形式的预处理。在本文中,我们提出了一种应用卷积神经网络检测路面图像裂缝的方法。我们的研究使用了两个数据集,一个是公共数据集,另一个是我们自己收集的。实验结果表明,该方法具有较高的精度和稳定的模型生成能力。
{"title":"Pavement Crack Detection using Convolutional Neural Network","authors":"N. H. T. Nguyen, T. Lê, S. Perry, Thuy Thi Nguyen","doi":"10.1145/3287921.3287949","DOIUrl":"https://doi.org/10.1145/3287921.3287949","url":null,"abstract":"Pavement crack detection is an important problem in road maintenance. There are many processing methods, including traditional and modern methods, solving this issue. Traditional methods use edge detection or some other digital image processing for crack detection, but these approaches are sensitive to many types of noise and unwanted objects on the road. For the purpose of increasing accuracy, image pre-processing methods are required for many of these techniques. Recently, some techniques that utilize deep learning to detect cracks in images have achieved high accuracy, without pre-processing. However, some of them are very complicated, some make use of manually collected data and some methods still need some form of pre-processing. In this paper, we propose a method that applies a convolutional neural networks to detect cracks in pavement images. Our research uses two data sets, one public data set and the other collected by ourselves. We also experimentally compare our method with some exiting methods and the experiments show that the proposed approach achieves high accuracy and generates stable models.","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116387962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 37
Automated Large Program Repair based on Big Code 基于大代码的大程序自动修复
H. V. Thuy, Phan Viet Anh, N. X. Hoai
The task of automatic program repair is to automatically localize and generate the correct patches for the bugs. A prominent approach is to produce a space of candidate patches, then find and validate candidates on test case sets. However, searching for the correct candidates is really challenging, since the search space is dominated by incorrect patches and its size is huge. This paper presents several methods to improve the automated program repair system Prophet, called Prophet+. Our approach contributes three improvements over Prophet: 1) extract twelve relations of statements and blocks for Bi-gram model using Big code, 2) prune the search space, 3) develop an algorithm to re-rank candidate patches in the search space. The experimental results show that our proposed system enhances the performance of Prophet, recognized as the state-of-the-art system, significantly. Specifically, for the top 1, our system generates the correct patches for 17 over 69 bugs while the number achieved by Prophet is 15.
自动程序修复的任务是自动定位并生成错误的正确补丁。一个突出的方法是生成候选补丁的空间,然后在测试用例集上找到并验证候选补丁。然而,搜索正确的候选是非常具有挑战性的,因为搜索空间被不正确的补丁所主导,而且它的大小非常大。本文介绍了对自动程序修复系统Prophet (Prophet+)进行改进的几种方法。我们的方法对Prophet有三个改进:1)使用Big code为Bi-gram模型提取12个语句和块的关系,2)修剪搜索空间,3)开发一种算法对搜索空间中的候选补丁进行重新排序。实验结果表明,该系统显著提高了目前公认的最先进的系统Prophet的性能。具体来说,对于前1名,我们的系统为69个bug中的17个生成了正确的补丁,而Prophet则为15个。
{"title":"Automated Large Program Repair based on Big Code","authors":"H. V. Thuy, Phan Viet Anh, N. X. Hoai","doi":"10.1145/3287921.3287958","DOIUrl":"https://doi.org/10.1145/3287921.3287958","url":null,"abstract":"The task of automatic program repair is to automatically localize and generate the correct patches for the bugs. A prominent approach is to produce a space of candidate patches, then find and validate candidates on test case sets. However, searching for the correct candidates is really challenging, since the search space is dominated by incorrect patches and its size is huge. This paper presents several methods to improve the automated program repair system Prophet, called Prophet+. Our approach contributes three improvements over Prophet: 1) extract twelve relations of statements and blocks for Bi-gram model using Big code, 2) prune the search space, 3) develop an algorithm to re-rank candidate patches in the search space. The experimental results show that our proposed system enhances the performance of Prophet, recognized as the state-of-the-art system, significantly. Specifically, for the top 1, our system generates the correct patches for 17 over 69 bugs while the number achieved by Prophet is 15.","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"249 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132707207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Spatial Decision Tree Analysis to Identify Location Pattern 空间决策树分析识别区位格局
D. L. Widaningrum, I. Surjandari, D. Sudiana
Jakarta has become a megacity with elaborate service network activities. Fast food restaurants as a type of food service provider have a role in supporting urban lifestyles. Despite the growth of value and transaction volume, there are some fast food categories in Indonesia which have a negative percentage of outlets growth. In general, the location of fast food restaurants divides into two categories. The first one is stand-alone restaurants, and the second is restaurants which located in other public facilities, such as malls, supermarket, and market area. According to the first law of Tobler, closer public facilities will have activity relatedness. This study aims to examine whether proximity between fast food restaurant locations and other public facilities affect categories of fast food restaurants, using spatial decision tree analysis approach. The public facilities examined for proximity to fast food restaurants consist of 11 criteria, which are considered to have a co-location pattern from previous research results. The results will be spatial characteristics of public facilities which expected to be indicators of consumer movement behavior, especially from and to fast food restaurant.
雅加达已经成为一个拥有完善的服务网络活动的超大型城市。快餐店作为一种食品服务提供者,在支持城市生活方式方面发挥着作用。尽管价值和交易量有所增长,但印度尼西亚的一些快餐类别的门店增长百分比为负。一般来说,快餐店的位置分为两类。第一种是独立餐厅,第二种是位于其他公共设施内的餐厅,如商场、超市、市场区域。根据托布勒第一定律,距离较近的公共设施具有活动关联性。本研究旨在利用空间决策树分析方法,探讨快餐店位置与其他公共设施的接近程度是否会影响快餐店的类别。调查的公共设施是否靠近快餐店,共有11项标准。根据以往的研究结果,这些标准被认为是“共址模式”。结果将是公共设施的空间特征,有望成为消费者移动行为的指标,特别是进出快餐店。
{"title":"Spatial Decision Tree Analysis to Identify Location Pattern","authors":"D. L. Widaningrum, I. Surjandari, D. Sudiana","doi":"10.1145/3287921.3287956","DOIUrl":"https://doi.org/10.1145/3287921.3287956","url":null,"abstract":"Jakarta has become a megacity with elaborate service network activities. Fast food restaurants as a type of food service provider have a role in supporting urban lifestyles. Despite the growth of value and transaction volume, there are some fast food categories in Indonesia which have a negative percentage of outlets growth. In general, the location of fast food restaurants divides into two categories. The first one is stand-alone restaurants, and the second is restaurants which located in other public facilities, such as malls, supermarket, and market area. According to the first law of Tobler, closer public facilities will have activity relatedness. This study aims to examine whether proximity between fast food restaurant locations and other public facilities affect categories of fast food restaurants, using spatial decision tree analysis approach. The public facilities examined for proximity to fast food restaurants consist of 11 criteria, which are considered to have a co-location pattern from previous research results. The results will be spatial characteristics of public facilities which expected to be indicators of consumer movement behavior, especially from and to fast food restaurant.","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130848356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 9th International Symposium on Information and Communication Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1