首页 > 最新文献

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)最新文献

英文 中文
A Survey of Object Detection Based on CNN and Transformer 基于CNN和Transformer的目标检测综述
Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520732
Ershat Arkin, Nurbiya Yadikar, Yusnur Muhtar, K. Ubul
The task of object detection is to find all the objects of interest in the image, and to determine their classifications and positions, which is one of the core problems in the field of computer vision. Since the emergence of AlexNet, convolutional neural networks have an absolute position in the field of computer vision, and the research on convolutional neural networks and algorithm structures has become more and more in-depth. Object detection algorithms can be roughly divided into two categories: candidate-based(two stage) and regression-based(one stage). The object detection algorithm based on the candidate area has high accuracy, but the structure is complex and the detection speed is slow. The regression-based object detection algorithm has a simple structure and fast detection speed. It has high application value in the field of real-time object detection, but the detection accuracy is relatively low. With the pursuit of the speed and accuracy of object detection, researchers try to apply mainstream methods in different fields. Therefore, recently Transformers in the NLP field has been used in computer vision, such as ViT, Swin Transformer, etc. It showed transformer-based models perform similar to or better than neural network algorithms, and pointed out new paths for researchers. This paper introduces classic neural networks, discusses the advantages and disadvantages of convolutional neural networks used in object detection algorithms, and introduces the latest innovative methods of Transformer used in computer vision. Finally, the difficulties, challenges and future development of convolutional neural networks and Transformers in object detection are considered.
目标检测的任务是找到图像中所有感兴趣的目标,并确定它们的分类和位置,这是计算机视觉领域的核心问题之一。自AlexNet出现以来,卷积神经网络在计算机视觉领域占据了绝对的地位,对卷积神经网络和算法结构的研究也越来越深入。目标检测算法大致可分为两类:基于候选对象的(两阶段)和基于回归的(一阶段)。基于候选区域的目标检测算法精度高,但结构复杂,检测速度慢。基于回归的目标检测算法结构简单,检测速度快。它在实时目标检测领域具有很高的应用价值,但检测精度相对较低。随着对目标检测速度和准确性的追求,研究人员试图将主流方法应用于不同的领域。因此,近年来NLP领域的变压器被应用到计算机视觉中,如ViT、Swin变压器等。它表明,基于变压器的模型的性能与神经网络算法相似或更好,并为研究人员指出了新的途径。本文介绍了经典神经网络,讨论了卷积神经网络用于目标检测算法的优缺点,并介绍了Transformer在计算机视觉中的最新创新方法。最后,对卷积神经网络和变压器在目标检测中的难点、挑战和未来发展进行了展望。
{"title":"A Survey of Object Detection Based on CNN and Transformer","authors":"Ershat Arkin, Nurbiya Yadikar, Yusnur Muhtar, K. Ubul","doi":"10.1109/PRML52754.2021.9520732","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520732","url":null,"abstract":"The task of object detection is to find all the objects of interest in the image, and to determine their classifications and positions, which is one of the core problems in the field of computer vision. Since the emergence of AlexNet, convolutional neural networks have an absolute position in the field of computer vision, and the research on convolutional neural networks and algorithm structures has become more and more in-depth. Object detection algorithms can be roughly divided into two categories: candidate-based(two stage) and regression-based(one stage). The object detection algorithm based on the candidate area has high accuracy, but the structure is complex and the detection speed is slow. The regression-based object detection algorithm has a simple structure and fast detection speed. It has high application value in the field of real-time object detection, but the detection accuracy is relatively low. With the pursuit of the speed and accuracy of object detection, researchers try to apply mainstream methods in different fields. Therefore, recently Transformers in the NLP field has been used in computer vision, such as ViT, Swin Transformer, etc. It showed transformer-based models perform similar to or better than neural network algorithms, and pointed out new paths for researchers. This paper introduces classic neural networks, discusses the advantages and disadvantages of convolutional neural networks used in object detection algorithms, and introduces the latest innovative methods of Transformer used in computer vision. Finally, the difficulties, challenges and future development of convolutional neural networks and Transformers in object detection are considered.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114599606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Intelligent Robot for Cleaning Garbage Based on OpenCV 基于OpenCV的智能垃圾清扫机器人
Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520722
Shuang Pan, Zihui Xie, Xianao Yang, G. Lin, Yulian Jiang
In order to clean the garbage effectively in small areas, such as communities, gardens and squares, and save the cost of garbage cleaning, this paper developed an intelligent robot, which can clean garbage independently outdoors. The vacuum cleaner on the chassis of the robot can inhale small garbage, and the flexible manipulator can grab big garbage. The robot identifies garbage based on OpenCV. To improve the accuracy of garbage recognition, the method of edge detection and contour detection is used in the process of image recognition. The test of cleaning efficiency based on the Yolo model shows that the recognition accuracy of the robot is 95%. It can clean garbage well.
为了有效地清扫小区、花园、广场等小区域的垃圾,节约清扫垃圾的成本,本文研制了一种能够在室外独立清扫垃圾的智能机器人。机器人底盘上的吸尘器可以吸入小垃圾,灵活的机械手可以抓取大垃圾。机器人基于OpenCV进行垃圾识别。为了提高垃圾识别的准确性,在图像识别过程中采用了边缘检测和轮廓检测的方法。基于Yolo模型的清洁效率测试表明,该机器人的识别准确率达到95%。它可以很好地清理垃圾。
{"title":"Intelligent Robot for Cleaning Garbage Based on OpenCV","authors":"Shuang Pan, Zihui Xie, Xianao Yang, G. Lin, Yulian Jiang","doi":"10.1109/PRML52754.2021.9520722","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520722","url":null,"abstract":"In order to clean the garbage effectively in small areas, such as communities, gardens and squares, and save the cost of garbage cleaning, this paper developed an intelligent robot, which can clean garbage independently outdoors. The vacuum cleaner on the chassis of the robot can inhale small garbage, and the flexible manipulator can grab big garbage. The robot identifies garbage based on OpenCV. To improve the accuracy of garbage recognition, the method of edge detection and contour detection is used in the process of image recognition. The test of cleaning efficiency based on the Yolo model shows that the recognition accuracy of the robot is 95%. It can clean garbage well.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"15 1-2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114111863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review of Segmentation and Classification for Retinal Optical Coherence Tomography Images 视网膜光学相干断层扫描图像分割与分类研究进展
Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520706
Zhijun Gao, Jian Wang, Xingle Wang, Xichao Dong, Yi Li
Optical coherence tomography (OCT) is one of the important auxiliary tools for ophthalmologists to screen and diagnose human retinal diseases. In this paper, from three aspects: the segmentation method of retinal OCT image macular edema, the segmentation method of retinal layer and the classification method of retinal macular degeneration. Firstly, the current representative research methods are classified, and then summarized and discussed. Finally, the current problems in OCT medical image processing are briefly summarized and prospected.
光学相干断层扫描(OCT)是眼科医生筛查和诊断人类视网膜疾病的重要辅助工具之一。本文从视网膜OCT图像黄斑水肿的分割方法、视网膜层的分割方法和视网膜黄斑变性的分类方法三个方面进行了研究。首先对目前具有代表性的研究方法进行分类,然后进行总结和讨论。最后,对目前OCT医学图像处理中存在的问题进行了简要总结和展望。
{"title":"A Review of Segmentation and Classification for Retinal Optical Coherence Tomography Images","authors":"Zhijun Gao, Jian Wang, Xingle Wang, Xichao Dong, Yi Li","doi":"10.1109/PRML52754.2021.9520706","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520706","url":null,"abstract":"Optical coherence tomography (OCT) is one of the important auxiliary tools for ophthalmologists to screen and diagnose human retinal diseases. In this paper, from three aspects: the segmentation method of retinal OCT image macular edema, the segmentation method of retinal layer and the classification method of retinal macular degeneration. Firstly, the current representative research methods are classified, and then summarized and discussed. Finally, the current problems in OCT medical image processing are briefly summarized and prospected.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114715456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effects of Pre-processing on the Performance of Transfer Learning Based Person Detection in Thermal Images 预处理对基于迁移学习的热图像人检测性能的影响
Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520729
Noor Ul Huda, Rikke Gade, T. Moeslund
Thermal images have the property of identifying objects even in low light conditions. However, person detection in thermal is tricky, due to varying person representations depending upon the surrounding temperature. Three major polarities are commonly observed in these representations i.e., 1. person warmer than the background, 2. person colder than the background and 3. person’s body temperature is similar to background. In this work, we have studied and analyzed the performance of the detection network by using the data in its original form and by harmonizing the person representation in two ways i.e., dark persons in the light background and light persons in a darker background. The data passed to each testing scenario was first pre-processed using histogram stretching to enhance the contrast. The work also presents the method to separate the three kinds of images from thermal data. The analysis is performed on publicly available outdoor AAUPD-T and OSU-T datasets. Precision, recall, and F1 score is used to evaluate network performance. The results have shown that network performance is not enhanced by performing the mentioned pre-processing. Best results are obtained by using the data in its original form.
热图像即使在弱光条件下也具有识别物体的特性。然而,由于人的表征随周围温度的变化而变化,热检测是棘手的。在这些表征中通常观察到三种主要的极性,即:1。人比背景温暖,2。人比背景冷3。人的体温与背景相似。在这项工作中,我们研究和分析了检测网络的性能,通过使用原始形式的数据,并以两种方式协调人物表示,即浅色背景中的深色人物和深色背景中的浅色人物。传递到每个测试场景的数据首先使用直方图拉伸进行预处理以增强对比度。本文还提出了从热数据中分离这三种图像的方法。分析是在公开可用的室外AAUPD-T和OSU-T数据集上进行的。精度、召回率和F1分数用于评估网络性能。结果表明,进行上述预处理并没有提高网络性能。使用原始形式的数据可获得最佳结果。
{"title":"Effects of Pre-processing on the Performance of Transfer Learning Based Person Detection in Thermal Images","authors":"Noor Ul Huda, Rikke Gade, T. Moeslund","doi":"10.1109/PRML52754.2021.9520729","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520729","url":null,"abstract":"Thermal images have the property of identifying objects even in low light conditions. However, person detection in thermal is tricky, due to varying person representations depending upon the surrounding temperature. Three major polarities are commonly observed in these representations i.e., 1. person warmer than the background, 2. person colder than the background and 3. person’s body temperature is similar to background. In this work, we have studied and analyzed the performance of the detection network by using the data in its original form and by harmonizing the person representation in two ways i.e., dark persons in the light background and light persons in a darker background. The data passed to each testing scenario was first pre-processed using histogram stretching to enhance the contrast. The work also presents the method to separate the three kinds of images from thermal data. The analysis is performed on publicly available outdoor AAUPD-T and OSU-T datasets. Precision, recall, and F1 score is used to evaluate network performance. The results have shown that network performance is not enhanced by performing the mentioned pre-processing. Best results are obtained by using the data in its original form.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127698695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Using Denoised LSTM Network for Tourist Arrivals Prediction 基于去噪LSTM网络的游客预测
Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520695
Junke Wang, Peng Ge, Zhusheng Liu
Precise tourist arrivals prediction is required since tourism products are perishable and vulnerable to environmental change. Many studies have been pursuing more effective techniques to forecast tourist arrivals after the worldwide COVID-19. A hybrid method based on singular spectrum analysis (SSA) and long short-term memory network (LSTM) that incorporates various varieties of time series, containing historical tourist arrivals and search intensity indices (SII), is proposed to make tourist arrivals predictions. The proposed method is applied to the empirical studies and its results outperform all baseline models which verifies the effectiveness of the denoised deep learning method for high-frequency predictions. In addition, experimental results on independent SII variables reveal that SII data is of great significance to tourist arrivals predictions and provides practitioners with deeper comprehension of potential tourism forecasting factors.
由于旅游产品易腐烂,易受环境变化的影响,因此需要精确的游客到达预测。许多研究一直在寻求更有效的技术来预测全球COVID-19后的游客人数。提出了一种基于奇异谱分析(SSA)和长短期记忆网络(LSTM)的混合方法,该方法结合了包含历史游客数量和搜索强度指数(SII)的各种时间序列进行游客数量预测。将该方法应用于实证研究,其结果优于所有基线模型,验证了去噪深度学习方法用于高频预测的有效性。此外,SII独立变量的实验结果表明,SII数据对游客数量预测具有重要意义,可以让从业者更深入地了解潜在的旅游预测因素。
{"title":"Using Denoised LSTM Network for Tourist Arrivals Prediction","authors":"Junke Wang, Peng Ge, Zhusheng Liu","doi":"10.1109/PRML52754.2021.9520695","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520695","url":null,"abstract":"Precise tourist arrivals prediction is required since tourism products are perishable and vulnerable to environmental change. Many studies have been pursuing more effective techniques to forecast tourist arrivals after the worldwide COVID-19. A hybrid method based on singular spectrum analysis (SSA) and long short-term memory network (LSTM) that incorporates various varieties of time series, containing historical tourist arrivals and search intensity indices (SII), is proposed to make tourist arrivals predictions. The proposed method is applied to the empirical studies and its results outperform all baseline models which verifies the effectiveness of the denoised deep learning method for high-frequency predictions. In addition, experimental results on independent SII variables reveal that SII data is of great significance to tourist arrivals predictions and provides practitioners with deeper comprehension of potential tourism forecasting factors.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132543804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An Efficient Co-location Pattern Approximation Algorithm Based on Clustering Branches 一种基于聚类分支的高效协同定位模式逼近算法
Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520713
Duan Duanping
The spatial co-location pattern represents a set of spatial features, whose instances are frequently associated in the space. However, due to the exponential time complexity of the traditional algorithm, the operation efficiency of the algorithm is not high, especially in the face of massive data mining, it is unable to complete the mining task normally. Therefore, an efficient co-location pattern approximation algorithm is proposed. The new algorithm first clusters according to the feature instances, takes each center as the new instance coordinates, and associates the number of instances of each family. On this basis, the mining area is divided into branches, and the distance threshold is taken for the row spacing, so as to achieve the purpose of fast pruning. On the premise of ensuring high accuracy, the algorithm effectively solves the efficiency problem of traditional algorithms, and effectively solves the spatial colocation pattern mining of massive data. A large number of experiments show that the new algorithm has the advantages of high efficiency, stability and high accuracy.
空间共定位模式表示一组空间特征,这些特征的实例经常在空间中关联。然而,由于传统算法的指数时间复杂度,使得算法的运算效率不高,特别是面对海量数据挖掘时,无法正常完成挖掘任务。为此,提出了一种高效的共定位模式近似算法。该算法首先根据特征实例进行聚类,以每个中心作为新的实例坐标,并关联每个族的实例个数。在此基础上,将矿区划分为分支,行距采用距离阈值,从而达到快速剪枝的目的。该算法在保证高精度的前提下,有效解决了传统算法的效率问题,有效解决了海量数据的空间协同模式挖掘问题。大量实验表明,新算法具有效率高、稳定性好、精度高等优点。
{"title":"An Efficient Co-location Pattern Approximation Algorithm Based on Clustering Branches","authors":"Duan Duanping","doi":"10.1109/PRML52754.2021.9520713","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520713","url":null,"abstract":"The spatial co-location pattern represents a set of spatial features, whose instances are frequently associated in the space. However, due to the exponential time complexity of the traditional algorithm, the operation efficiency of the algorithm is not high, especially in the face of massive data mining, it is unable to complete the mining task normally. Therefore, an efficient co-location pattern approximation algorithm is proposed. The new algorithm first clusters according to the feature instances, takes each center as the new instance coordinates, and associates the number of instances of each family. On this basis, the mining area is divided into branches, and the distance threshold is taken for the row spacing, so as to achieve the purpose of fast pruning. On the premise of ensuring high accuracy, the algorithm effectively solves the efficiency problem of traditional algorithms, and effectively solves the spatial colocation pattern mining of massive data. A large number of experiments show that the new algorithm has the advantages of high efficiency, stability and high accuracy.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133716926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Super Resolution of Single Image Based on Multi Level Residual Self Attention Mechanism 基于多级残差自注意机制的单幅图像超分辨率
Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520742
Junfeng Mao, Yaqi Hu
The existing network models achieve good reconstruction effect by deepening the network depth, but most of them have problems such as insufficient feature information extraction, single scale of feature information, weak perception of valuable information and so on. In order to solve this problem, this paper proposes a single image super-resolution network based on multi-level residual self attention mechanism. Firstly, shallow features and deep features are extracted from the input low resolution image hierarchically, and then convolution operation is performed on the deep features and shallow features to obtain high resolution image. Compared with the existing comparison methods, the reconstruction effect of this method is better, and the objective evaluation indexes PSNR and SSIM are also improved.
现有的网络模型通过加深网络深度获得了良好的重构效果,但大多存在特征信息提取不足、特征信息尺度单一、对有价值信息感知能力弱等问题。为了解决这一问题,本文提出了一种基于多级残差自注意机制的单幅图像超分辨网络。首先从输入的低分辨率图像中分层提取浅特征和深特征,然后对深特征和浅特征进行卷积运算,得到高分辨率图像。与现有的对比方法相比,该方法的重建效果更好,客观评价指标PSNR和SSIM也有所提高。
{"title":"Super Resolution of Single Image Based on Multi Level Residual Self Attention Mechanism","authors":"Junfeng Mao, Yaqi Hu","doi":"10.1109/PRML52754.2021.9520742","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520742","url":null,"abstract":"The existing network models achieve good reconstruction effect by deepening the network depth, but most of them have problems such as insufficient feature information extraction, single scale of feature information, weak perception of valuable information and so on. In order to solve this problem, this paper proposes a single image super-resolution network based on multi-level residual self attention mechanism. Firstly, shallow features and deep features are extracted from the input low resolution image hierarchically, and then convolution operation is performed on the deep features and shallow features to obtain high resolution image. Compared with the existing comparison methods, the reconstruction effect of this method is better, and the objective evaluation indexes PSNR and SSIM are also improved.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133724979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Damage Detection in Switch Rails via Machine Learning 基于机器学习的交换轨道损伤检测
Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520705
Weixu Liu, Zhifeng Tang, Pengfei Zhang, Xiangxian Chen, Bin Yang
Switch rail is a weak but essential component of high-speed rail (HSR) systems. Due to aging and the potential of fatigue damage accumulation, it has an urgent requirement for damage detection. An automatic classification method of switch rail damage based on feature integration and machine learning is proposed. According to the characteristics of switch rail and guided wave, several features extracted from different signal processing domains (such as time domain, power spectrum domain and time-frequency domain) are proposed and defined to characterize the complexity of switch rail damage. A damage index is defined to eliminate the effects of various environmental and operational conditions. A feature selection method based on binary particle swarm optimization (BPSO) is proposed. This method uses a new fitness function to select the most damage-sensitive features, eliminate the irrelevant and redundant features, and improve the classification performance. The least-squares support-vector machine (LS-SVM) is adopted to build an automatic classification model to reduce the probability of artificial error diagnosis and improve the generalization ability. Finally, experiment on the switch rail foot is conducted to verify the proposed method. The results show that the method has the ability of damage identification, which is better than traditional methods.
开关柜轨道是高速铁路系统中一个薄弱但必不可少的部件。由于老化和潜在的疲劳损伤积累,对损伤检测提出了迫切的要求。提出了一种基于特征集成和机器学习的开关轨损伤自动分类方法。根据开关柜轨和导波的特点,提出并定义了从不同信号处理域(如时域、功率谱域和时频域)提取的特征来表征开关柜轨损伤的复杂性。定义损伤指数以消除各种环境和操作条件的影响。提出了一种基于二元粒子群优化(BPSO)的特征选择方法。该方法利用一种新的适应度函数来选择对损伤最敏感的特征,剔除不相关和冗余的特征,提高分类性能。采用最小二乘支持向量机(least-squares support-vector machine, LS-SVM)建立自动分类模型,降低人工错误诊断的概率,提高泛化能力。最后,对开关轨脚进行了实验验证。结果表明,该方法具有较传统方法更好的损伤识别能力。
{"title":"Damage Detection in Switch Rails via Machine Learning","authors":"Weixu Liu, Zhifeng Tang, Pengfei Zhang, Xiangxian Chen, Bin Yang","doi":"10.1109/PRML52754.2021.9520705","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520705","url":null,"abstract":"Switch rail is a weak but essential component of high-speed rail (HSR) systems. Due to aging and the potential of fatigue damage accumulation, it has an urgent requirement for damage detection. An automatic classification method of switch rail damage based on feature integration and machine learning is proposed. According to the characteristics of switch rail and guided wave, several features extracted from different signal processing domains (such as time domain, power spectrum domain and time-frequency domain) are proposed and defined to characterize the complexity of switch rail damage. A damage index is defined to eliminate the effects of various environmental and operational conditions. A feature selection method based on binary particle swarm optimization (BPSO) is proposed. This method uses a new fitness function to select the most damage-sensitive features, eliminate the irrelevant and redundant features, and improve the classification performance. The least-squares support-vector machine (LS-SVM) is adopted to build an automatic classification model to reduce the probability of artificial error diagnosis and improve the generalization ability. Finally, experiment on the switch rail foot is conducted to verify the proposed method. The results show that the method has the ability of damage identification, which is better than traditional methods.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115503597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Efficient and Bias-aware Recommendation with Two-side Relevance for Implicit Feedback 基于隐性反馈的双向关联的高效、偏见感知推荐
Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520701
Guanyu Lin, Lei Huang, Yuting Yin, Chengmin Zhang, Feng Zhu, Lingqi Kong, Zhiheng Li
Today’s wide-spread recommendation is usually constructed based on implicit data such as click for easy collection but whether the no clicked data is negative feedback or unobserved positive feedback confuses the model construction. As a response, Relevance Matrix Factorization (Rel-MF) is recently proposed to tackle this problem as well as the missing-not-at-random (MNAR) problem ignored by previous studies. However, Rel-MF meets three problems: limited assumption (LA), negative square loss (NSL) and indiscriminate no click data (INCD). In this paper, we first get rid of Rel-MF’s limited assumption and establish a more general theory by incorporating a defined transformation function which captures the relevance level to our two-side relevance ideal loss, containing Rel-MF’s theory. To resolve the INCD problem and NSL problem, we introduce an adjusting variable and perform normalization, respectively, which is called Naive Solution with Normalization for Rel-MF (NRel-MF). But we then analytically discover that the clipped function proposed by Rel-MF meets the high variance problem. To overcome it, we design a power clipped function and further propose Improved Solution with Power Function for Rel-MF (PRel-MF). Besides, we also explore propensity score estimation from user and hybrid perspectives in contrast to Rel-MF’s sole item perspective. Finally, we also consider and address the computational problem caused by the Rel-MF’s non-sampling strategy. Empirical results verify the effectiveness of our solutions from both performance even in rare items and loss decrease. In broader perspective experiment, decent performance is seen in item perspective with fewer recommended items while in user perspective with more recommended items and hybrid perspective outperforms them in more situations.
目前广泛传播的推荐通常是基于点击等隐式数据构建的,便于收集,但未点击的数据是负反馈还是未观察到的正反馈,使模型构建变得混乱。为了解决这一问题,相关矩阵分解(Rel-MF)以及之前研究忽略的缺失非随机(MNAR)问题最近被提出。然而,Rel-MF存在三个问题:有限假设(LA)、负平方损失(NSL)和无点击数据(INCD)。在本文中,我们首先摆脱了Rel-MF的有限假设,并通过包含Rel-MF理论的定义转换函数建立了一个更一般的理论,该转换函数捕获了我们的双边相关理想损失的相关水平。为了解决INCD问题和NSL问题,我们分别引入了一个调节变量并进行了归一化处理,称为正则化朴素解(NRel-MF)。但我们分析发现,由Rel-MF提出的裁剪函数满足高方差问题。为了克服这个问题,我们设计了一个功率截断函数,并进一步提出了基于功率函数的Rel-MF (PRel-MF)改进方案。此外,我们还探讨了从用户和混合的角度来估计倾向得分,而不是从Rel-MF的单一项目角度。最后,我们还考虑并解决了Rel-MF的非采样策略带来的计算问题。实证结果从稀有物品的性能和减少损失两方面验证了我们的解决方案的有效性。在宽视角实验中,项目视角在推荐项目较少的情况下表现良好,而用户视角在推荐项目较多的情况下表现较好,混合视角在更多情况下表现较好。
{"title":"Efficient and Bias-aware Recommendation with Two-side Relevance for Implicit Feedback","authors":"Guanyu Lin, Lei Huang, Yuting Yin, Chengmin Zhang, Feng Zhu, Lingqi Kong, Zhiheng Li","doi":"10.1109/PRML52754.2021.9520701","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520701","url":null,"abstract":"Today’s wide-spread recommendation is usually constructed based on implicit data such as click for easy collection but whether the no clicked data is negative feedback or unobserved positive feedback confuses the model construction. As a response, Relevance Matrix Factorization (Rel-MF) is recently proposed to tackle this problem as well as the missing-not-at-random (MNAR) problem ignored by previous studies. However, Rel-MF meets three problems: limited assumption (LA), negative square loss (NSL) and indiscriminate no click data (INCD). In this paper, we first get rid of Rel-MF’s limited assumption and establish a more general theory by incorporating a defined transformation function which captures the relevance level to our two-side relevance ideal loss, containing Rel-MF’s theory. To resolve the INCD problem and NSL problem, we introduce an adjusting variable and perform normalization, respectively, which is called Naive Solution with Normalization for Rel-MF (NRel-MF). But we then analytically discover that the clipped function proposed by Rel-MF meets the high variance problem. To overcome it, we design a power clipped function and further propose Improved Solution with Power Function for Rel-MF (PRel-MF). Besides, we also explore propensity score estimation from user and hybrid perspectives in contrast to Rel-MF’s sole item perspective. Finally, we also consider and address the computational problem caused by the Rel-MF’s non-sampling strategy. Empirical results verify the effectiveness of our solutions from both performance even in rare items and loss decrease. In broader perspective experiment, decent performance is seen in item perspective with fewer recommended items while in user perspective with more recommended items and hybrid perspective outperforms them in more situations.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127236619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How Does Chinese Segmentation Strategy Effect on Sentiment Analysis of Short Text? 汉语分词策略对短文本情感分析的影响
Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520738
Qing Lei, Haifeng Li, Yanxi Chen
In term of Chinese natural language processing, it exits one particular problem that how to choose the strategy of word segmentation, which commonly includes char-based and word-based. Targeted at sentiment analysis of short text comparing with long text, the word-based segmentation faces the other problem that there are the more ambiguous or unregistered words in context of short text. The feature extraction done by the different Chinese Word Segmentation impact the statistic distribution of features, and further the accuracy of sentiment analysis. This paper evaluates five Chinese segmentation strategy effect on Sentiment Analysis of Short Text. We chose two word-based Chinese Word Segmentation (CWS), and three char-based n-gram, then transformed Bag-of-Word (BOW) to Vector Space Model (VSM) which finally was fed into several classifiers to predict sentiment polarity of short text. To reduce the impact of corpora, the study is based a collection of five public corpora.
在汉语自然语言处理中,如何选择分词策略是一个特别的问题,分词策略一般有基于字符和基于词两种。基于词的分词方法针对短文本与长短文本的情感分析,面临着短文本语境中歧义词或未注册词较多的另一个问题。不同的中文分词方式所提取的特征会影响特征的统计分布,进而影响情感分析的准确性。本文评价了五种汉语分词策略对短文本情感分析的影响。我们选择了两个基于词的中文分词(CWS)和三个基于字符的n-gram,然后将词袋模型(BOW)转化为向量空间模型(VSM),最后将其输入到多个分类器中进行短文本情感极性预测。为了减少语料库的影响,本研究基于五个公共语料库的集合。
{"title":"How Does Chinese Segmentation Strategy Effect on Sentiment Analysis of Short Text?","authors":"Qing Lei, Haifeng Li, Yanxi Chen","doi":"10.1109/PRML52754.2021.9520738","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520738","url":null,"abstract":"In term of Chinese natural language processing, it exits one particular problem that how to choose the strategy of word segmentation, which commonly includes char-based and word-based. Targeted at sentiment analysis of short text comparing with long text, the word-based segmentation faces the other problem that there are the more ambiguous or unregistered words in context of short text. The feature extraction done by the different Chinese Word Segmentation impact the statistic distribution of features, and further the accuracy of sentiment analysis. This paper evaluates five Chinese segmentation strategy effect on Sentiment Analysis of Short Text. We chose two word-based Chinese Word Segmentation (CWS), and three char-based n-gram, then transformed Bag-of-Word (BOW) to Vector Space Model (VSM) which finally was fed into several classifiers to predict sentiment polarity of short text. To reduce the impact of corpora, the study is based a collection of five public corpora.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130686674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1