首页 > 最新文献

2019 International Conference on Machine Learning and Cybernetics (ICMLC)最新文献

英文 中文
Phase Retrieval via Wirtinger Flow Algorithm and Its Variants 基于Wirtinger流算法及其变体的相位检索
Pub Date : 2019-07-01 DOI: 10.1109/ICMLC48188.2019.8949170
Jian-wei Liu, Zhi Cao, Jing Liu, Xiong-lin Luo, Wei-min Li, Nobuyasu Ito, Longteng Guo
Almost three-quarters of the underling information in the light wave field is embodied in the phase. However, the early optical detectors can only record the intensity or amplitude of the light wave field and cannot directly extract the phase information of the light wave field. Therefore, it is necessary to use the measured amplitude or strength to reconstruct the phase information of the object, this problem is denoted phase retrieval. Phase retrieval is a matter of cardinal significance in signal processing and machine learning. The phase retrieval by convex optimization algorithm is ideal but the computational complexity is high. In 2015, Candès proposed a very effective non-convex optimization algorithm-Wirtinger flow algorithm which used spectral initialization to get a better initial value and then gradient iteration to get a promised recovery effect. Subsequently, in line with the idea, a large number of variants are devised, such as: Wirtinger flow(WF), Truncated Wirtinger Flow (TWF), Truncated Amplitude Flow (TAF), Reshaped Wirtinger Flow (RWF), Incremental Truncated Wirtinger Flow (ITWF), Incremental Reshaped Wirtinger Flow (IRWF), Robust Wirtinger Flow (Robust-WF), Sparse Wirtinger Flow (SWF), Median-TWF, Median-RWF, Generalized Wirtinger Flow (GWF), Accelerated Wirtinger Flow (AWF), Thresholded Wirtinger Flow Revisited (THWFR), Thresholded Wirtinger Flow (THWF), Reweighted Wirtinger Flow (REWF), Wirtinger Flow Method With Optimal Stepsize (WFOS), Stochastic Truncated Wirtinger Flow Algorithm (STWF), Stochastic Truncated Amplitude Flow (STAF), Reweighted Amplitude Flow (RAF), Compressive Reweighted Amplitude Flow (CRAF), SPARse Truncated Amplitude flow (SPARTA) and Sparse Wirtinger Flow Algorithm with Optimal Stepsize (SWFOS), etc. This paper analyzes and summarizes these algorithms according to their characteristics such as: initialization method, step size, iteration times, sample complexity, computational complexity, etc., so that readers can intuitively and clearly see the characteristics of each algorithm. Finally, we provide the website of the source code of some algorithms, facilitate to access and use it for readers.
光波场中几乎四分之三的底层信息体现在相位中。然而,早期的光学探测器只能记录光波场的强度或振幅,不能直接提取光波场的相位信息。因此,需要利用测量到的振幅或强度来重建目标的相位信息,这一问题称为相位恢复。相位检索在信号处理和机器学习中具有重要的意义。采用凸优化算法进行相位检索是理想的,但计算量较大。2015年,cand提出了一种非常有效的非凸优化算法——wirtinger flow算法,该算法通过谱初始化得到较好的初值,再通过梯度迭代得到较好的恢复效果。随后,根据这个想法,设计了大量的变体,例如:维丁格流(WF)、截断维丁格流(TWF)、截断幅值流(TAF)、重塑维丁格流(RWF)、递增截断维丁格流(ITWF)、递增重塑维丁格流(IRWF)、稳健维丁格流(Robust-WF)、稀疏维丁格流(SWF)、中位数维丁格流、中位数维丁格流、广义维丁格流(GWF)、加速维丁格流(AWF)、重新访问阈值维丁格流(THWFR)、阈值维丁格流(THWF)、重加权维丁格流(REWF)、最优步长Wirtinger流方法(WFOS)、随机截断Wirtinger流算法(STWF)、随机截断幅值流(STAF)、重加权幅值流(RAF)、压缩重加权幅值流(CRAF)、稀疏截断幅值流(SPARTA)和最优步长稀疏Wirtinger流算法(SWFOS)等。本文根据这些算法的特点,如:初始化方法、步长、迭代次数、样本复杂度、计算复杂度等进行分析总结,使读者能够直观、清晰地看到每种算法的特点。最后,我们提供了部分算法的源代码网站,方便读者访问和使用。
{"title":"Phase Retrieval via Wirtinger Flow Algorithm and Its Variants","authors":"Jian-wei Liu, Zhi Cao, Jing Liu, Xiong-lin Luo, Wei-min Li, Nobuyasu Ito, Longteng Guo","doi":"10.1109/ICMLC48188.2019.8949170","DOIUrl":"https://doi.org/10.1109/ICMLC48188.2019.8949170","url":null,"abstract":"Almost three-quarters of the underling information in the light wave field is embodied in the phase. However, the early optical detectors can only record the intensity or amplitude of the light wave field and cannot directly extract the phase information of the light wave field. Therefore, it is necessary to use the measured amplitude or strength to reconstruct the phase information of the object, this problem is denoted phase retrieval. Phase retrieval is a matter of cardinal significance in signal processing and machine learning. The phase retrieval by convex optimization algorithm is ideal but the computational complexity is high. In 2015, Candès proposed a very effective non-convex optimization algorithm-Wirtinger flow algorithm which used spectral initialization to get a better initial value and then gradient iteration to get a promised recovery effect. Subsequently, in line with the idea, a large number of variants are devised, such as: Wirtinger flow(WF), Truncated Wirtinger Flow (TWF), Truncated Amplitude Flow (TAF), Reshaped Wirtinger Flow (RWF), Incremental Truncated Wirtinger Flow (ITWF), Incremental Reshaped Wirtinger Flow (IRWF), Robust Wirtinger Flow (Robust-WF), Sparse Wirtinger Flow (SWF), Median-TWF, Median-RWF, Generalized Wirtinger Flow (GWF), Accelerated Wirtinger Flow (AWF), Thresholded Wirtinger Flow Revisited (THWFR), Thresholded Wirtinger Flow (THWF), Reweighted Wirtinger Flow (REWF), Wirtinger Flow Method With Optimal Stepsize (WFOS), Stochastic Truncated Wirtinger Flow Algorithm (STWF), Stochastic Truncated Amplitude Flow (STAF), Reweighted Amplitude Flow (RAF), Compressive Reweighted Amplitude Flow (CRAF), SPARse Truncated Amplitude flow (SPARTA) and Sparse Wirtinger Flow Algorithm with Optimal Stepsize (SWFOS), etc. This paper analyzes and summarizes these algorithms according to their characteristics such as: initialization method, step size, iteration times, sample complexity, computational complexity, etc., so that readers can intuitively and clearly see the characteristics of each algorithm. Finally, we provide the website of the source code of some algorithms, facilitate to access and use it for readers.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125432289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Domain Adaption for Facial Expression Recognition 面部表情识别的领域自适应
Pub Date : 2019-07-01 DOI: 10.1109/ICMLC48188.2019.8949178
Juntong Liu, F. Wu, Wenjin Lu, Bai-Ling Zhang
Facial expression recognition (FER) is a task that recognizes human emotions from their facial expressions. Owing to the lack of large datasets, a FER system is difficult to design, especially for real world environment. In this paper, we propose a new dataset augmentation method for FER and the corresponding training strategy by using similarity preserving generative adversarial network (SPGAN). By borrowing the idea from person re-ID field, we consider dataset augmentation as a domain adaptation task. The SPGAN is first trained on a lab condition dataset and a real world condition dataset to generate domain adapted images, and then CNN models are subsequently trained on the domain adapted images. We test our models on the RAF-DB and SFEW 2.0 datasets to show the improvement when compared it to our baseline. We also report our competitive accuracy when compared it with other state of the art works, which shows promissing results.
面部表情识别(FER)是一项从面部表情中识别人类情绪的任务。由于缺乏大型数据集,FER系统的设计非常困难,特别是在现实环境中。本文提出了一种基于相似保持生成对抗网络(SPGAN)的FER数据集增强方法和相应的训练策略。我们借鉴了个人id字段的思想,将数据集扩充看作是一个领域自适应任务。首先在实验室条件数据集和现实世界条件数据集上训练SPGAN生成域适应图像,然后在域适应图像上训练CNN模型。我们在RAF-DB和SFEW 2.0数据集上测试了我们的模型,以显示与基线相比的改进。我们还报告了与其他艺术作品相比,我们的竞争准确性,这显示了有希望的结果。
{"title":"Domain Adaption for Facial Expression Recognition","authors":"Juntong Liu, F. Wu, Wenjin Lu, Bai-Ling Zhang","doi":"10.1109/ICMLC48188.2019.8949178","DOIUrl":"https://doi.org/10.1109/ICMLC48188.2019.8949178","url":null,"abstract":"Facial expression recognition (FER) is a task that recognizes human emotions from their facial expressions. Owing to the lack of large datasets, a FER system is difficult to design, especially for real world environment. In this paper, we propose a new dataset augmentation method for FER and the corresponding training strategy by using similarity preserving generative adversarial network (SPGAN). By borrowing the idea from person re-ID field, we consider dataset augmentation as a domain adaptation task. The SPGAN is first trained on a lab condition dataset and a real world condition dataset to generate domain adapted images, and then CNN models are subsequently trained on the domain adapted images. We test our models on the RAF-DB and SFEW 2.0 datasets to show the improvement when compared it to our baseline. We also report our competitive accuracy when compared it with other state of the art works, which shows promissing results.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115333366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Autonomous Cross-Floor Navigation System for a ROS-Based Modular Service Robot 基于ros的模块化服务机器人自主跨楼层导航系统
Pub Date : 2019-07-01 DOI: 10.1109/ICMLC48188.2019.8949176
Wenhui Wang, Yi-Hsing Chien, H. Chiang, Wei-Yen Wang, C. Hsu
In this paper, we present an autonomous cross-floor navigation system including mapping, localization, path planning, and scene recognition based on robot operating system (ROS) architecture. The Gmapping algorithm is utilized to build a 2D map with a laser range-finder, and AMCL algorithm is utilized in the robot localization. Moreover, an improved A* algorithm is proposed to prevent robot from getting too close to the wall. Because our robot needs to navigate in the multi-floor environment, a decision system using deep convolutional neural network (DCNN) is also designed to recognize the current floor and the associated map can be download to the robot system. By training with the scene images of the featured location in each floor, the robot can recognize the current floor and then complete the navigation task. Finally, real test of our robot is conducted to demonstrate the feasibility of the proposed method.
本文提出了一种基于机器人操作系统(ROS)架构的自主跨楼层导航系统,包括绘图、定位、路径规划和场景识别。利用gmap算法建立激光测距仪的二维地图,利用AMCL算法实现机器人定位。此外,提出了一种改进的A*算法,以防止机器人过于靠近墙壁。由于我们的机器人需要在多楼层环境中导航,我们还设计了一个基于深度卷积神经网络(DCNN)的决策系统来识别当前楼层,并将相关地图下载到机器人系统中。通过对每层楼的特色位置的场景图像进行训练,机器人可以识别当前的楼层,从而完成导航任务。最后,对机器人进行了实际测试,验证了所提方法的可行性。
{"title":"Autonomous Cross-Floor Navigation System for a ROS-Based Modular Service Robot","authors":"Wenhui Wang, Yi-Hsing Chien, H. Chiang, Wei-Yen Wang, C. Hsu","doi":"10.1109/ICMLC48188.2019.8949176","DOIUrl":"https://doi.org/10.1109/ICMLC48188.2019.8949176","url":null,"abstract":"In this paper, we present an autonomous cross-floor navigation system including mapping, localization, path planning, and scene recognition based on robot operating system (ROS) architecture. The Gmapping algorithm is utilized to build a 2D map with a laser range-finder, and AMCL algorithm is utilized in the robot localization. Moreover, an improved A* algorithm is proposed to prevent robot from getting too close to the wall. Because our robot needs to navigate in the multi-floor environment, a decision system using deep convolutional neural network (DCNN) is also designed to recognize the current floor and the associated map can be download to the robot system. By training with the scene images of the featured location in each floor, the robot can recognize the current floor and then complete the navigation task. Finally, real test of our robot is conducted to demonstrate the feasibility of the proposed method.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126679400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A Development of a System to Measure Radioulnar Distance in Wrist-Joint Rotation Using Three-Dimensional Electromagnetic Sensor 基于三维电磁传感器的腕关节旋转中尺桡距离测量系统的研制
Pub Date : 2019-07-01 DOI: 10.1109/ICMLC48188.2019.8949284
K. Nagamune, Akito Nakano
For sports such as baseball and tennis, there are actions to throw the ball and swing the racket. There are cases injured the wrist joint by repeating this action. One such injury to the wrist joint is triangular fibrocartilage complex (TFCC) injury. TFCC is a part that keeps stability on the ulnar side of the wrist joint scale. So, if the TFCC is injured, the distance between the ulna and the radius will widen due to the wrist rotation, when the injury is severe, pain occurs on the ulnar side of the wrist joint. In the current diagnosis, there is no diagnosis to evaluate the change in distance between the ulna and the radius in the wrist rotation. Therefore, in this study, to quantitatively evaluate the change of distance between the ulna and the radius in TFCC injury, we develop a system to measure the distance between the ulna and the radius in the wrist rotation.
对于像棒球和网球这样的运动,有投球和挥动球拍的动作。有重复此动作损伤腕关节的病例。其中一种对腕关节的损伤是三角纤维软骨复合体(TFCC)损伤。TFCC是保持腕关节尺侧稳定的部件。因此,如果TFCC受伤,由于手腕的旋转,尺骨与桡骨之间的距离会变宽,当损伤严重时,疼痛发生在手腕关节的尺侧。在目前的诊断中,没有诊断来评估腕关节旋转过程中尺骨与桡骨之间距离的变化。因此,在本研究中,为了定量评价TFCC损伤中尺骨与桡骨之间距离的变化,我们开发了一套测量腕关节旋转过程中尺骨与桡骨之间距离的系统。
{"title":"A Development of a System to Measure Radioulnar Distance in Wrist-Joint Rotation Using Three-Dimensional Electromagnetic Sensor","authors":"K. Nagamune, Akito Nakano","doi":"10.1109/ICMLC48188.2019.8949284","DOIUrl":"https://doi.org/10.1109/ICMLC48188.2019.8949284","url":null,"abstract":"For sports such as baseball and tennis, there are actions to throw the ball and swing the racket. There are cases injured the wrist joint by repeating this action. One such injury to the wrist joint is triangular fibrocartilage complex (TFCC) injury. TFCC is a part that keeps stability on the ulnar side of the wrist joint scale. So, if the TFCC is injured, the distance between the ulna and the radius will widen due to the wrist rotation, when the injury is severe, pain occurs on the ulnar side of the wrist joint. In the current diagnosis, there is no diagnosis to evaluate the change in distance between the ulna and the radius in the wrist rotation. Therefore, in this study, to quantitatively evaluate the change of distance between the ulna and the radius in TFCC injury, we develop a system to measure the distance between the ulna and the radius in the wrist rotation.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124966343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Water Level Prediction at TICH-BUI river in Vietnam Using Support Vector Regression 基于支持向量回归的越南TICH-BUI河水位预测
Pub Date : 2019-07-01 DOI: 10.1109/ICMLC48188.2019.8949273
Thanh-Tung Nguyen, Hien T. T. Le
In this paper, the support vector regression model is used to predict water levels at a downstream station of the Tich-Bui river basin. The study investigated the effects of rainfall data collected from eight gauging stations and water levels at the downstream station for the performance forecast. The model was set up to forecast water levels at the downstream station before 6-lead-hour, 12-lead-hour, 18-lead-hour and 24-lead-hour. Although the model does not require data on the climate, terrain but the forecast results are accurate. In the case of a water level forecast before 6 hours and 12 hours, the Nash coefficient gives a value of over 98.81% and the RMSE value is less than 0.20 m. This results suggest that the support vector regression model, which the authors use to accurately predict water levels in real time, can be used to warn of floods in Vietnam's rivers.
本文采用支持向量回归模型对堤布河流域下游站水位进行了预测。研究调查了八个测量站收集的降雨数据和下游站的水位对业绩预测的影响。建立了6、12、18、24铅前下游站水位预报模型。虽然该模式不需要有关气候、地形的资料,但预报结果是准确的。在6 h和12 h前的水位预报中,Nash系数的值大于98.81%,RMSE值小于0.20 m。这一结果表明,这组作者用来实时准确预测水位的支持向量回归模型可以用来警告越南河流的洪水。
{"title":"Water Level Prediction at TICH-BUI river in Vietnam Using Support Vector Regression","authors":"Thanh-Tung Nguyen, Hien T. T. Le","doi":"10.1109/ICMLC48188.2019.8949273","DOIUrl":"https://doi.org/10.1109/ICMLC48188.2019.8949273","url":null,"abstract":"In this paper, the support vector regression model is used to predict water levels at a downstream station of the Tich-Bui river basin. The study investigated the effects of rainfall data collected from eight gauging stations and water levels at the downstream station for the performance forecast. The model was set up to forecast water levels at the downstream station before 6-lead-hour, 12-lead-hour, 18-lead-hour and 24-lead-hour. Although the model does not require data on the climate, terrain but the forecast results are accurate. In the case of a water level forecast before 6 hours and 12 hours, the Nash coefficient gives a value of over 98.81% and the RMSE value is less than 0.20 m. This results suggest that the support vector regression model, which the authors use to accurately predict water levels in real time, can be used to warn of floods in Vietnam's rivers.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130411081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
News Recommendation Based on Collaborative Semantic Topic Models and Recommendation Adjustment 基于协同语义主题模型的新闻推荐及推荐调整
Pub Date : 2019-07-01 DOI: 10.1109/ICMLC48188.2019.8949259
Yu-Shan Liao, Jun-Yi Lu, Duen-Ren Liu
Providing news recommendations is an important trend for online news websites to attract more users and create more benefits. In this research, we propose a novel recommendation approach for recommending news articles. We propose A Collaborative Semantic Topic Model and an ensemble model to predict user preferences based on combining Matrix Factorization with articles' semantic latent topics derived from word embedding and Latent Dirichlet Allocation. The proposed ensemble model is further integrated with a recommendation adjustment mechanism to adjust users' online recommendation lists. We evaluate the proposed approach via offline experiments and online evaluation on a real news website. The experimental result demonstrates that our proposed approach can improve the recommendation quality of recommending news articles.
提供新闻推荐是在线新闻网站吸引更多用户、创造更多效益的重要趋势。在这项研究中,我们提出了一种新的推荐方法来推荐新闻文章。本文提出了一种基于矩阵分解的协同语义主题模型和一种集成模型来预测用户偏好,该模型结合词嵌入和潜在狄利克雷分配得到的文章语义潜在主题。该集成模型进一步集成了推荐调整机制,以调整用户的在线推荐列表。我们通过离线实验和在真实新闻网站上的在线评估来评估所提出的方法。实验结果表明,该方法可以提高新闻文章推荐的质量。
{"title":"News Recommendation Based on Collaborative Semantic Topic Models and Recommendation Adjustment","authors":"Yu-Shan Liao, Jun-Yi Lu, Duen-Ren Liu","doi":"10.1109/ICMLC48188.2019.8949259","DOIUrl":"https://doi.org/10.1109/ICMLC48188.2019.8949259","url":null,"abstract":"Providing news recommendations is an important trend for online news websites to attract more users and create more benefits. In this research, we propose a novel recommendation approach for recommending news articles. We propose A Collaborative Semantic Topic Model and an ensemble model to predict user preferences based on combining Matrix Factorization with articles' semantic latent topics derived from word embedding and Latent Dirichlet Allocation. The proposed ensemble model is further integrated with a recommendation adjustment mechanism to adjust users' online recommendation lists. We evaluate the proposed approach via offline experiments and online evaluation on a real news website. The experimental result demonstrates that our proposed approach can improve the recommendation quality of recommending news articles.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132962104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An Improved Siamese Network for Face Sketch Recognition 一种改进的Siamese网络用于人脸素描识别
Pub Date : 2019-07-01 DOI: 10.1109/ICMLC48188.2019.8949231
Liang Fan, Han Liu, Y. Hou
Face sketch recognition identifies the face photo from a large face sketch dataset. Some traditional methods are typically used to reduce the modality gap between face photos and sketches and gain excellent recognition rate based on a pseudo image which is synthesized using the corresponded face photo. However, these methods cannot obtain better high recognition rate for all face sketch datasets, because the use of extracted features cannot lead to the elimination of the effect of different modalities' images. The feature representation of the deep convolutional neural networks as a feasible approach for identification involves wider applications than other methods. It is adapted to extract the features which eliminate the difference between face photos and sketches. The recognition rate is high for neural networks constructed by learning optimal local features, even if the input image shows geometric distortions. However, the case of overfitting leads to the unsatisfactory performance of deep learning methods on face sketch recognition tasks. Also, the sketch images are too simple to be used for extracting effective features. This paper aims to increase the matching rate using the Siamese convolution network architecture. The framework is used to extract useful features from each image pair to reduce the modality gap. Moreover, data augmentation is used to avoid overfitting. We explore the performance of three loss functions and compare the similarity between each image pair. The experimental results show that our framework is adequate for a composite sketch dataset. In addition, it reduces the influence of overfitting by using data augmentation and modifying the network structure.
人脸素描识别从大型人脸素描数据集中识别人脸照片。传统方法主要是利用人脸照片合成的伪图像来减小人脸照片与草图之间的模态差距,从而获得较好的识别率。然而,这些方法并不能对所有的人脸草图数据集获得更好的高识别率,因为提取的特征的使用并不能消除不同模态图像的影响。深度卷积神经网络的特征表示作为一种可行的识别方法有着比其他方法更广泛的应用。它适用于提取特征,消除人脸照片和草图之间的差异。通过学习最优局部特征构建的神经网络,即使输入图像显示几何畸变,识别率也很高。然而,过度拟合的情况导致深度学习方法在人脸草图识别任务上的性能不理想。此外,草图图像过于简单,无法用于提取有效的特征。本文旨在利用Siamese卷积网络架构来提高匹配率。该框架用于从每个图像对中提取有用的特征,以减小模态差距。此外,使用数据增强来避免过拟合。我们探讨了三种损失函数的性能,并比较了每个图像对之间的相似度。实验结果表明,该框架适用于复合草图数据集。此外,通过数据扩充和网络结构的修改,降低了过拟合的影响。
{"title":"An Improved Siamese Network for Face Sketch Recognition","authors":"Liang Fan, Han Liu, Y. Hou","doi":"10.1109/ICMLC48188.2019.8949231","DOIUrl":"https://doi.org/10.1109/ICMLC48188.2019.8949231","url":null,"abstract":"Face sketch recognition identifies the face photo from a large face sketch dataset. Some traditional methods are typically used to reduce the modality gap between face photos and sketches and gain excellent recognition rate based on a pseudo image which is synthesized using the corresponded face photo. However, these methods cannot obtain better high recognition rate for all face sketch datasets, because the use of extracted features cannot lead to the elimination of the effect of different modalities' images. The feature representation of the deep convolutional neural networks as a feasible approach for identification involves wider applications than other methods. It is adapted to extract the features which eliminate the difference between face photos and sketches. The recognition rate is high for neural networks constructed by learning optimal local features, even if the input image shows geometric distortions. However, the case of overfitting leads to the unsatisfactory performance of deep learning methods on face sketch recognition tasks. Also, the sketch images are too simple to be used for extracting effective features. This paper aims to increase the matching rate using the Siamese convolution network architecture. The framework is used to extract useful features from each image pair to reduce the modality gap. Moreover, data augmentation is used to avoid overfitting. We explore the performance of three loss functions and compare the similarity between each image pair. The experimental results show that our framework is adequate for a composite sketch dataset. In addition, it reduces the influence of overfitting by using data augmentation and modifying the network structure.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133923776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Deep License Plate Recognition in Ill-Conditioned Environments With Ill-Conditional Data Augmentation 基于病态数据增强的病态环境下车牌深度识别
Pub Date : 2019-07-01 DOI: 10.1109/ICMLC48188.2019.8949248
C. Lien, Yu-Chun Chien, Fu-Yu Teng, Chih-Chieh Yang
In general, the conventional LPR systems consist of the following modules: feature extraction, license plate locating, character segmentation, and character recognition. The performances of these module are strongly correlated with some low level image features, e.g., edges, colors, and textures. These low level image features can be influenced significantly by the illumination and view angle variations such that the recognition accuracy is degraded. Recently, the deep learning technologies make the conventional vision-based recognition technologies getting significant improvement in terms of feature discrimination and recognition accuracy. In this paper, we aim to develop a novel deep learning based LPR system with the ill-conditional data augmentation. Therefore, this paper is expected to the following contributions. First, we apply the WebGL technology to augment the training database for the ill-conditioned outdoor environments. Second, we apply the YOLOv2 DNN architecture to develop deep license plate recognition system in the ill-conditioned environments with recognition accuracy 98%.
一般来说,传统的车牌识别系统包括以下几个模块:特征提取、车牌定位、字符分割和字符识别。这些模块的性能与一些低级图像特征密切相关,例如边缘、颜色和纹理。这些低水平的图像特征会受到光照和视角变化的显著影响,从而降低识别精度。近年来,深度学习技术使传统的基于视觉的识别技术在特征识别和识别精度方面得到了显著的提高。在本文中,我们的目标是开发一种新的基于深度学习的LPR系统。因此,本文预计将做出以下贡献。首先,我们应用WebGL技术对恶劣室外环境下的训练数据库进行扩充。其次,应用YOLOv2深度神经网络架构开发了病态环境下深度车牌识别系统,识别准确率达到98%。
{"title":"Deep License Plate Recognition in Ill-Conditioned Environments With Ill-Conditional Data Augmentation","authors":"C. Lien, Yu-Chun Chien, Fu-Yu Teng, Chih-Chieh Yang","doi":"10.1109/ICMLC48188.2019.8949248","DOIUrl":"https://doi.org/10.1109/ICMLC48188.2019.8949248","url":null,"abstract":"In general, the conventional LPR systems consist of the following modules: feature extraction, license plate locating, character segmentation, and character recognition. The performances of these module are strongly correlated with some low level image features, e.g., edges, colors, and textures. These low level image features can be influenced significantly by the illumination and view angle variations such that the recognition accuracy is degraded. Recently, the deep learning technologies make the conventional vision-based recognition technologies getting significant improvement in terms of feature discrimination and recognition accuracy. In this paper, we aim to develop a novel deep learning based LPR system with the ill-conditional data augmentation. Therefore, this paper is expected to the following contributions. First, we apply the WebGL technology to augment the training database for the ill-conditioned outdoor environments. Second, we apply the YOLOv2 DNN architecture to develop deep license plate recognition system in the ill-conditioned environments with recognition accuracy 98%.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132219127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Single-Image Super-Resolution via Multiple Matrix-Valued Kernel Regression 基于多矩阵值核回归的单图像超分辨率
Pub Date : 2019-07-01 DOI: 10.1109/ICMLC48188.2019.8949261
Yi Tang, Zuo Jiang, Junhua Chen
Single-image super-resolution focuses on learning a mapping to recover high-resolution images from given low-resolution images with the help of a set of paired images. Matrix-valued operators serve as an efficient mapping to super-resolve low-resolution images. However, most existed matrix-valued based super-resolution algorithms limit matrix-valued operators as linear mappings. Multiple matrix-valued operators based algorithm is introduced for improving the performance of matrix-value operators in single-image super-resolution. Taking advantages of the non-linear style of multiple matrix-valued operators, we have more accurate super-resolved images. The experimental results show the efficiency and effectiveness of the reported multiple matrix-valued operator learning based super-resolution algorithm.
单图像超分辨率的重点是学习映射,通过一组配对图像从给定的低分辨率图像中恢复高分辨率图像。矩阵值运算符作为超分辨率低分辨率图像的有效映射。然而,大多数现有的基于矩阵值的超分辨算法将矩阵值算子限制为线性映射。为了提高矩阵值算子在单幅图像超分辨中的性能,提出了基于多矩阵值算子的算法。利用多矩阵值算子的非线性风格,我们获得了更精确的超分辨图像。实验结果表明了本文提出的基于多矩阵值算子学习的超分辨算法的有效性和有效性。
{"title":"Single-Image Super-Resolution via Multiple Matrix-Valued Kernel Regression","authors":"Yi Tang, Zuo Jiang, Junhua Chen","doi":"10.1109/ICMLC48188.2019.8949261","DOIUrl":"https://doi.org/10.1109/ICMLC48188.2019.8949261","url":null,"abstract":"Single-image super-resolution focuses on learning a mapping to recover high-resolution images from given low-resolution images with the help of a set of paired images. Matrix-valued operators serve as an efficient mapping to super-resolve low-resolution images. However, most existed matrix-valued based super-resolution algorithms limit matrix-valued operators as linear mappings. Multiple matrix-valued operators based algorithm is introduced for improving the performance of matrix-value operators in single-image super-resolution. Taking advantages of the non-linear style of multiple matrix-valued operators, we have more accurate super-resolved images. The experimental results show the efficiency and effectiveness of the reported multiple matrix-valued operator learning based super-resolution algorithm.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133717505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stabilization of Inertia Wheel Inverted Pendulum Using Fuzzy-Based Hybrid Control 基于模糊混合控制的惯性轮倒立摆镇定
Pub Date : 2019-07-01 DOI: 10.1109/ICMLC48188.2019.8949281
Bo-Rui Chen, Chun-Fei Hsu, Tsu-Tian Lee
It is known that the inertia wheel inverted pendulum (IWIP) is a nonlinear underactuated system. Since the unavoidable friction or unclear interference of the IWIP system, designing a controller for the IWIP is a challenging task. In this paper, a fuzzy-based hybrid control (FBHC) is proposed to make the IWIP system can be stably balanced around the upright position. The FBHC system is comprised of a feedback linearization controller, a fuzzy logic controller and a speed compensated controller. The feedback linearization controller with a fuzzy logic controller can control the priority parameter at the non-actuated joint; however, it does not ensure the control of the inertia wheel speed. The speed compensated controller is designed to stabilize the speed of the inertia wheel once the body angle is stable. Thus, the IWIP system can be stably balanced around the upright position and the disk speed is gradually reduced. Finally, the experimental results are verified that the proposed FBHC can achieve a good dynamic balance effect for the IWIP system, even when there is an external force to push the IWIP system.
惯性轮倒立摆是一个非线性欠驱动系统。由于IWIP系统存在不可避免的摩擦或不明确的干扰,因此设计IWIP控制器是一项具有挑战性的任务。本文提出了一种基于模糊的混合控制(FBHC),使IWIP系统能够在垂直位置稳定平衡。FBHC系统由反馈线性化控制器、模糊控制器和速度补偿控制器组成。带模糊控制器的反馈线性化控制器可以控制非驱动关节处的优先级参数;但是,它不能保证对惯性轮速的控制。速度补偿控制器的设计是为了在车身角度稳定的情况下稳定惯性轮的速度。因此,IWIP系统可以在直立位置周围稳定平衡,磁盘速度逐渐降低。最后,实验结果验证了所提出的FBHC在有外力推动IWIP系统的情况下,仍能取得良好的动态平衡效果。
{"title":"Stabilization of Inertia Wheel Inverted Pendulum Using Fuzzy-Based Hybrid Control","authors":"Bo-Rui Chen, Chun-Fei Hsu, Tsu-Tian Lee","doi":"10.1109/ICMLC48188.2019.8949281","DOIUrl":"https://doi.org/10.1109/ICMLC48188.2019.8949281","url":null,"abstract":"It is known that the inertia wheel inverted pendulum (IWIP) is a nonlinear underactuated system. Since the unavoidable friction or unclear interference of the IWIP system, designing a controller for the IWIP is a challenging task. In this paper, a fuzzy-based hybrid control (FBHC) is proposed to make the IWIP system can be stably balanced around the upright position. The FBHC system is comprised of a feedback linearization controller, a fuzzy logic controller and a speed compensated controller. The feedback linearization controller with a fuzzy logic controller can control the priority parameter at the non-actuated joint; however, it does not ensure the control of the inertia wheel speed. The speed compensated controller is designed to stabilize the speed of the inertia wheel once the body angle is stable. Thus, the IWIP system can be stably balanced around the upright position and the disk speed is gradually reduced. Finally, the experimental results are verified that the proposed FBHC can achieve a good dynamic balance effect for the IWIP system, even when there is an external force to push the IWIP system.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117095208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2019 International Conference on Machine Learning and Cybernetics (ICMLC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1