2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)最新文献

英文中文

A Camera-Aware Three-Stage Method for Fully Unsupervised Person Re-identification 一种摄像机感知的完全无监督人员再识别三阶段方法

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520689

Guyu Fang, Hongtao Lu

Most of existing unsupervised person re-identification methods focus on cross-domain adaptation. In order to further relieve the dependence on manual labels, we propose a camera-aware three-stage method for fully unsupervised person re-identification which only requires the unlabeled target dataset. We exploit camera labels and divide the learning process into three relatively easy sub-tasks: initialization by instance discrimination, intra-camera learning and inter-camera learning. The first stage regards each person image as an instance and tries to distinguish each image. The second stage performs intra-camera clustering while the last stage performs clustering and training on the whole dataset. These three stages share the backbone network. Finally, our method substantially boosts the performance stage by stage without any manual ID annotation. We conduct extensive experiments on three large-scale image-based datasets, including Market-1501, DukeMTMC-reID and MSMT17. The results demonstrate that our method achieves the state-of-the-art performance.

现有的无监督人再识别方法多侧重于跨域自适应。为了进一步减轻对人工标签的依赖，我们提出了一种摄像机感知的三阶段完全无监督人再识别方法，该方法只需要未标记的目标数据集。我们利用相机标签并将学习过程分为三个相对简单的子任务:通过实例识别初始化，相机内学习和相机间学习。第一阶段将每个人的形象作为一个实例，并试图区分每个形象。第二阶段进行相机内聚类，最后阶段对整个数据集进行聚类和训练。这三个阶段共用骨干网。最后，我们的方法在不需要任何手动ID注释的情况下逐步提高了性能。我们在Market-1501、DukeMTMC-reID和MSMT17三个大规模基于图像的数据集上进行了广泛的实验。结果表明，我们的方法达到了最先进的性能。

引用次数: 1

Research on Pre-training of Tibetan Natural Language Processing 藏文自然语言处理的预训练研究

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520714

Zhensong Li, Jie Zhu, Hong Cao

In the field of natural language processing, pre-training can effectively improve the performance of downstream tasks. In recent years, pre-training has been continuously developed in Tibetan NLP. We built three pre-trained models of Tibetan Word2Vec, Tibetan ELMo, and Tibetan ALBERT, and applied them to the two downstream tasks of Tibetan text classification and Tibetan part-of-speech tagging. Comparing them with the baseline models of these two downstream tasks, it is found that the performance of the downstream tasks using the pre-training is significantly better than the baseline model. The three pre-trained models have also brought a gradual improvement in performance for Tibetan downstream tasks.

在自然语言处理领域，预训练可以有效地提高下游任务的性能。近年来，预训练在藏语自然语言处理中不断发展。构建了藏文Word2Vec、藏文ELMo和藏文ALBERT三个预训练模型，并将其应用于藏文文本分类和藏文词性标注两个下游任务。将它们与这两个下游任务的基线模型进行比较，发现使用预训练的下游任务的性能明显优于基线模型。这三种预训练模型也使藏区下游任务的性能逐步提高。

引用次数: 1

A VideoSAR Moving Target Detection Method Based on GMM 基于GMM的视频sar运动目标检测方法

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520711

Meng Yan, L. Li, Haochuan Chen

In VideoSAR circle trace imaging mode, the energy of moving target is defocused and shifted. However, due to the occlusion of target height, there is shadow in its real position, which represents the lack of energy. In addition, there is a strong correlation between adjacent frames of VideoSAR image sequence, and the shadow also moves with the movement of the target. Based on this property, a new method for moving object detection in VideoSAR image sequences is proposed. This method is based on Gaussian mixture model. Firstly, it preprocesses the image sequence, uses sift + RANSAC algorithm and median filter processing, then uses Otsu threshold segmentation algorithm to transform the image into binary image, uses Gaussian mixture model to detect moving objects, and finally carries out morphological processing. Using VideoSAR image sequence of Sandia National Laboratory, the moving target can be detected effectively.

在视频sar圆迹成像模式下，运动目标的能量会发生离焦和偏移。然而，由于目标高度的遮挡，在其真实位置存在阴影，代表能量不足。此外，VideoSAR图像序列的相邻帧之间存在很强的相关性，阴影也会随着目标的移动而移动。基于这一特性，提出了一种新的视频sar图像序列运动目标检测方法。该方法基于高斯混合模型。首先对图像序列进行预处理，使用sift + RANSAC算法和中值滤波处理，然后使用Otsu阈值分割算法将图像转换为二值图像，使用高斯混合模型检测运动目标，最后进行形态学处理。利用桑迪亚国家实验室的视频sar图像序列，可以有效地检测出运动目标。

引用次数: 0

An Improved Bounding Box Regression Loss Function Based on CIOU Loss for Multi-scale Object Detection 基于CIOU损失的改进边界盒回归损失函数用于多尺度目标检测

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520717

Shuangjiang Du, Baofu Zhang, Pin Zhang, Peng Xiang

The regression loss function is a key factor in the training and optimization process of object detection. The current mainstream regression loss functions are Ln norm loss, IOU loss and CIOU loss. This paper proposes the Scale-Sensitive IOU(SIOU) loss, a new loss function different from the above all, which could solve the issues that the current loss functions cannot distinguish the two bounding boxes in some special cases when the target area scales in one image vary greatly during training process, thereby leading to the improper regression loss calculation and the slowing down of the optimization. An area scale regulating factor Y is added on the basis of CIOU loss to adjust the loss values of the bounding boxes, which could distinguish all the boxes quantitatively in theory thus gets a faster converging speed and better optimization. Through analysis and simulation comparison among the several loss functions, the superiority of SIOU loss is verified. Furthermore, by incorporating SIOU loss into YOLO v4, Faster R-CNN and SSD on the two mainstream aerial remote sensing datasets, i.e., DIOR and NWPU VHR-10, the detection precisions improve by 10.2% than IOU loss and 2.8% than CIOU loss respectively.

回归损失函数是目标检测训练和优化过程中的关键因素。目前主流的回归损失函数有Ln范数损失、IOU损失和CIOU损失。本文提出了Scale-Sensitive IOU(SIOU)损失函数，这是一种不同于上述损失函数的新型损失函数，它可以解决当前损失函数在训练过程中，当一幅图像的目标区域尺度变化较大时，在某些特殊情况下无法区分两个边界框，从而导致回归损失计算不当和优化速度变慢的问题。在CIOU损失的基础上加入面积尺度调节因子Y来调节边界框的损失值，理论上可以定量区分所有的边界框，从而获得更快的收敛速度和更好的优化效果。通过对几种损耗函数的分析和仿真比较，验证了SIOU损耗的优越性。此外，在迪奥和NWPU VHR-10两种主流航空遥感数据集上，将SIOU损耗纳入YOLO v4、Faster R-CNN和SSD，探测精度分别比IOU损耗提高10.2%和2.8%。

{"title":"An Improved Bounding Box Regression Loss Function Based on CIOU Loss for Multi-scale Object Detection","authors":"Shuangjiang Du, Baofu Zhang, Pin Zhang, Peng Xiang","doi":"10.1109/PRML52754.2021.9520717","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520717","url":null,"abstract":"The regression loss function is a key factor in the training and optimization process of object detection. The current mainstream regression loss functions are Ln norm loss, IOU loss and CIOU loss. This paper proposes the Scale-Sensitive IOU(SIOU) loss, a new loss function different from the above all, which could solve the issues that the current loss functions cannot distinguish the two bounding boxes in some special cases when the target area scales in one image vary greatly during training process, thereby leading to the improper regression loss calculation and the slowing down of the optimization. An area scale regulating factor Y is added on the basis of CIOU loss to adjust the loss values of the bounding boxes, which could distinguish all the boxes quantitatively in theory thus gets a faster converging speed and better optimization. Through analysis and simulation comparison among the several loss functions, the superiority of SIOU loss is verified. Furthermore, by incorporating SIOU loss into YOLO v4, Faster R-CNN and SSD on the two mainstream aerial remote sensing datasets, i.e., DIOR and NWPU VHR-10, the detection precisions improve by 10.2% than IOU loss and 2.8% than CIOU loss respectively.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124215119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

INFIN: An Efficient Algorithm for Fast Mining Frequent Itemsets 一种快速挖掘频繁项集的有效算法

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520736

Shaopeng Wang, Yufei Wang, Chunkai Feng, ChaoYu Niu

The negFIN is the current state-of-the art algorithm for frequent itemsets mining. It employs a novel BMC (bitmap code) encoding model for nodes in a prefix tree based on the bitmap representation of sets. The encoding of each node is a binary number of which bit number is the number of frequent items, and is stored in the form of decimal integer number. The key operations of negFIN are all performed based on the bitwise operation of the encoding. The main problem of BMC is that the maximal bit number of the data type which is used to store the decimal integer number in current general compiling systems is 64, so if the number of frequent items exceeds 64, the encoding cannot work effectively. In this work, we propose B-BMC (block bitmap code) encoding model, a more efficient encoding model. The B-BMC is a dividing of BMC based on the block size in essential. For facilitating the work of B-BMC, the B-BMC tree and TNC(terminal node code) table are devised as an alternative to the BMC tree of negFIN. Based on these two structures, we present an efficient algorithm called INFIN (improved negFIN) to mining frequent itemsets. Our experiments illustrate that the B-BMC can overcome the drawback of BMC, and the INFIN is the most efficient one in time and space when the block size takes value 64 on condition that the number of frequent items exceeds 64.

negFIN是当前最先进的频繁项集挖掘算法。基于集合的位图表示，对前缀树中的节点采用了一种新颖的BMC (bitmap code)编码模型。每个节点的编码是一个二进制数，其中位数是频繁项的个数，并以十进制整数的形式存储。negFIN的键操作都是基于编码的位操作来执行的。BMC的主要问题是目前一般编译系统中用于存储十进制整数的数据类型的最大位数为64位，因此如果频繁项的数量超过64位，则无法有效地进行编码。在这项工作中，我们提出了B-BMC(块位图码)编码模型，这是一种更有效的编码模型。B-BMC本质上是一种基于块大小的BMC划分。为了方便B-BMC的工作，设计了B-BMC树和TNC(终端节点代码)表，作为negFIN的BMC树的替代方案。在这两种结构的基础上，我们提出了一种高效的挖掘频繁项集的算法，称为INFIN (improved negFIN)。实验表明，B-BMC可以克服BMC的缺点，当块大小为64且频繁项数超过64时，INFIN在时间和空间上是最有效的。

{"title":"INFIN: An Efficient Algorithm for Fast Mining Frequent Itemsets","authors":"Shaopeng Wang, Yufei Wang, Chunkai Feng, ChaoYu Niu","doi":"10.1109/PRML52754.2021.9520736","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520736","url":null,"abstract":"The negFIN is the current state-of-the art algorithm for frequent itemsets mining. It employs a novel BMC (bitmap code) encoding model for nodes in a prefix tree based on the bitmap representation of sets. The encoding of each node is a binary number of which bit number is the number of frequent items, and is stored in the form of decimal integer number. The key operations of negFIN are all performed based on the bitwise operation of the encoding. The main problem of BMC is that the maximal bit number of the data type which is used to store the decimal integer number in current general compiling systems is 64, so if the number of frequent items exceeds 64, the encoding cannot work effectively. In this work, we propose B-BMC (block bitmap code) encoding model, a more efficient encoding model. The B-BMC is a dividing of BMC based on the block size in essential. For facilitating the work of B-BMC, the B-BMC tree and TNC(terminal node code) table are devised as an alternative to the BMC tree of negFIN. Based on these two structures, we present an efficient algorithm called INFIN (improved negFIN) to mining frequent itemsets. Our experiments illustrate that the B-BMC can overcome the drawback of BMC, and the INFIN is the most efficient one in time and space when the block size takes value 64 on condition that the number of frequent items exceeds 64.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131938185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Anti-Corner Reflector Array Method Based on Pauli Polarization Decomposition and BP Neural Network 基于泡利极化分解和BP神经网络的反角反射器阵列方法

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520744

Liang Ziyao, Yu Yong, Zhang Bin

The radar echoes of the corner reflector array and the ship target are very similar, and the existing algorithms are difficult to identify them effectively in time, frequency and spatial domain. Aiming at the problem that the terminal guidance radar of anti-ship missile can’t detect and track the real target effectively under the deception jamming of corner reflector array, this paper designs a countermeasure method based on Pauli polarization decomposition and BP neural network. Firstly, the Pauli polarization decomposition of the full polarization scattering matrix of the target measured in the fixed angle window is used to obtain four normalized coefficients and form the eigenvector, and the differences between the ship target and the corner reflector are analyzed. Then, the BP neural network model is trained and optimized as the training sample. The simulation and test results show that the feature vectors can distinguish the two kinds of targets, and the trained network can identify the ship and the corner reflector Array effectively, and the overall success rate is close to 97%.

角阵雷达回波与舰船目标回波非常相似，现有算法难以在时域、频域和空域进行有效识别。针对反舰导弹末制导雷达在角反射阵欺骗干扰下不能有效探测和跟踪真实目标的问题，设计了一种基于泡利极化分解和BP神经网络的对抗方法。首先，对固定角窗测量目标的全极化散射矩阵进行泡利极化分解，得到4个归一化系数并形成特征向量，分析舰船目标与角反射器的差异;然后，将BP神经网络模型作为训练样本进行训练和优化。仿真和测试结果表明，特征向量能有效区分两类目标，训练后的网络能有效识别舰船和角反射器阵列，总体成功率接近97%。

{"title":"Anti-Corner Reflector Array Method Based on Pauli Polarization Decomposition and BP Neural Network","authors":"Liang Ziyao, Yu Yong, Zhang Bin","doi":"10.1109/PRML52754.2021.9520744","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520744","url":null,"abstract":"The radar echoes of the corner reflector array and the ship target are very similar, and the existing algorithms are difficult to identify them effectively in time, frequency and spatial domain. Aiming at the problem that the terminal guidance radar of anti-ship missile can’t detect and track the real target effectively under the deception jamming of corner reflector array, this paper designs a countermeasure method based on Pauli polarization decomposition and BP neural network. Firstly, the Pauli polarization decomposition of the full polarization scattering matrix of the target measured in the fixed angle window is used to obtain four normalized coefficients and form the eigenvector, and the differences between the ship target and the corner reflector are analyzed. Then, the BP neural network model is trained and optimized as the training sample. The simulation and test results show that the feature vectors can distinguish the two kinds of targets, and the trained network can identify the ship and the corner reflector Array effectively, and the overall success rate is close to 97%.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"202 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134480157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Tree-Based Models Using Random Grid Search Optimization for Disease Classification Based on Environmental Factors: A Case Study on Asthma Hospitalizations 基于环境因素的随机网格搜索优化疾病分类树模型:以哮喘住院病例为例

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520720

P. Nanthakumaran, L. Liyanage

An understanding on the exposure to environmental factors aggravating global disease burden can aid mitigating it. Generally, a class of generalized linear models and generalized additive models are used in predicting disease burden whereas, tree-based models are underused. The objective of this paper is to evaluate the performance of different tree-based models namely decision tree, random forest, gradient boosted tree and stochastic gradient boosted trees in predicting asthma attack based on short-term exposure to environmental factors and to examine the environmental factors triggering asthma attack. A sample of patients during 2013 - 2015 from different parts of Victoria was considered. The study area for the considered study period had reasonably good air quality and relatively humid environment. The tree-based models were tuned using random grid search optimization with bootstrapping to address over-fitting. The models considered performed well in predicting asthma attacks in terms of area under the receiver operating curve (ROC AUC) (>0.82). All the gradient boosted trees (accuracy = 76%; recall = 63%; F2-score = 64%) showed better overall prediction whereas decision tree (accuracy = 71%; recall = 75%; F2-score = 71%) outperformed other models in identifying the positive cases. Tree-based models revealed that O3 exposure consistently influence Asthma. Further, decision tree revealed O3 exposure < 13 ppb or with high O3 exposure >= 13 ppb, and with [SO2 exposure < 0.5 ppb and maximum wind speed > 5.4. km/hr.] influenced Asthma. In addition, relative humidity and exposure to CO were also detected in other tree-based models as relevant predictors triggering asthma attacks.

了解环境因素对加重全球疾病负担的影响有助于减轻疾病负担。在疾病负担预测中，一般采用广义线性模型和广义加性模型，而基于树的模型应用较少。本文的目的是评估决策树、随机森林、梯度增强树和随机梯度增强树等不同的基于树的模型在基于短期暴露于环境因素的哮喘发作预测中的性能，并研究引发哮喘发作的环境因素。在2013年至2015年期间，来自维多利亚州不同地区的患者样本被考虑。研究区在考虑的研究期间空气质量较好，环境相对潮湿。基于树的模型使用随机网格搜索优化和自举来解决过拟合问题。所考虑的模型在受试者工作曲线下面积(ROC AUC)方面预测哮喘发作的效果良好(>0.82)。所有的梯度增强树(准确率= 76%;召回率= 63%;F2-score = 64%)表现出更好的整体预测能力，而决策树(准确率= 71%;召回率= 75%;F2-score = 71%)在识别阳性病例方面优于其他模型。基于树的模型显示，臭氧暴露持续影响哮喘。此外，决策树显示O3暴露< 13 ppb或高O3暴露>= 13 ppb, SO2暴露< 0.5 ppb，最大风速> 5.4。公里/小时。影响哮喘。此外，在其他基于树木的模型中，相对湿度和CO暴露也被检测到是引发哮喘发作的相关预测因素。

{"title":"Tree-Based Models Using Random Grid Search Optimization for Disease Classification Based on Environmental Factors: A Case Study on Asthma Hospitalizations","authors":"P. Nanthakumaran, L. Liyanage","doi":"10.1109/PRML52754.2021.9520720","DOIUrl":"https://doi.org/10.1109/PRML52754.2021.9520720","url":null,"abstract":"An understanding on the exposure to environmental factors aggravating global disease burden can aid mitigating it. Generally, a class of generalized linear models and generalized additive models are used in predicting disease burden whereas, tree-based models are underused. The objective of this paper is to evaluate the performance of different tree-based models namely decision tree, random forest, gradient boosted tree and stochastic gradient boosted trees in predicting asthma attack based on short-term exposure to environmental factors and to examine the environmental factors triggering asthma attack. A sample of patients during 2013 - 2015 from different parts of Victoria was considered. The study area for the considered study period had reasonably good air quality and relatively humid environment. The tree-based models were tuned using random grid search optimization with bootstrapping to address over-fitting. The models considered performed well in predicting asthma attacks in terms of area under the receiver operating curve (ROC AUC) (>0.82). All the gradient boosted trees (accuracy = 76%; recall = 63%; F2-score = 64%) showed better overall prediction whereas decision tree (accuracy = 71%; recall = 75%; F2-score = 71%) outperformed other models in identifying the positive cases. Tree-based models revealed that O3 exposure consistently influence Asthma. Further, decision tree revealed O3 exposure < 13 ppb or with high O3 exposure >= 13 ppb, and with [SO2 exposure < 0.5 ppb and maximum wind speed > 5.4. km/hr.] influenced Asthma. In addition, relative humidity and exposure to CO were also detected in other tree-based models as relevant predictors triggering asthma attacks.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131194625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Evolutionary Parameter-Free Clustering Algorithm 进化无参数聚类算法

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520724

Z. Ding, Haibin Xie, Peng Li

The performance of the clustering algorithms depends mainly on the setting of artificial parameter values which is usually difficult in practical application. In addition, the dataset is usually incremental, and the clustering algorithm applied to the static dataset cannot develop with the change of the dataset. If new sample points are added, algorithm parameters need to be readjusted to cluster again, leading to a great time cost. This paper proposed an evolutionary parameter-free clustering algorithm (EPFC) for the above problems, which imitates the human clustering mechanism of objective things. EPFC algorithm takes the average distance between each sample and its nearest neighbour sample as the threshold value to judge whether the sample can be grouped into one cluster. The threshold value is adaptively updated without setting an artificially parameter value as the samples increase. A large number of experiments on benchmark datasets show that EPFC is effective on datasets with different characteristics, and the algorithm has strong robustness.

聚类算法的性能主要取决于人工参数值的设置，这在实际应用中往往是一个难点。此外，数据集通常是增量的，应用于静态数据集的聚类算法不能随着数据集的变化而发展。如果增加新的样本点，需要重新调整算法参数进行聚类，时间开销很大。针对上述问题，本文提出了一种模拟人类对客观事物聚类机制的无参数进化聚类算法(EPFC)。EPFC算法以每个样本与最近邻样本之间的平均距离作为阈值，判断样本是否可以归为一个聚类。随着样本的增加，阈值可以自适应更新，而不需要人为设置参数值。在基准数据集上的大量实验表明，EPFC对不同特征的数据集都是有效的，并且该算法具有较强的鲁棒性。

引用次数: 0

Improving Relation Classification with Multi-graph GCN 基于多图GCN的关系分类改进

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520688

Ya Zhang, Shuai Qin

As a basis task in the field of Natural Language Processing (NLP), relation extraction task aims to extract the relation between two entities in a text. Most existing models rely on a single semantic feature of the sentence for relation classification. In this paper, we present MGGCM model, a novel neural network to classify the relation of two entities in a sentence. Our neural architecture leverages two distinct graphs which are the dependency tree path and the relation-entity graph respectively. In this model, we integrate both semantic features and structural features to enhance the performance of relation extraction model. We encode the sentence through BiLSTM, obtain its structural features by GCN, and pay more attention to the entity information which is related to the target entity pair, and finally fuse the features to obtain the classification results. We test our model on the SemEval 2010 relation classification task, and achieve an F1-score of 85.7%, higher than competing methods in literature.

关系提取任务是自然语言处理(NLP)领域的一项基础任务，旨在提取文本中两个实体之间的关系。大多数现有模型依赖于句子的单个语义特征来进行关系分类。本文提出了一种新的神经网络MGGCM模型，用于对句子中两个实体之间的关系进行分类。我们的神经结构利用了两个不同的图，分别是依赖树路径和关系实体图。在该模型中，我们将语义特征和结构特征相结合，提高了关系抽取模型的性能。我们通过BiLSTM对句子进行编码，通过GCN获得句子的结构特征，并更加关注与目标实体对相关的实体信息，最后融合特征得到分类结果。我们在SemEval 2010关系分类任务上测试了我们的模型，并获得了85.7%的f1得分，高于文献中的竞争方法。

引用次数: 0

Combined Channel and Spatial Attention for YOLOv5 during Target Detection YOLOv5在目标检测过程中的信道和空间注意组合

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

Pub Date : 2021-07-16 DOI: 10.1109/PRML52754.2021.9520728

Gui-Hong Shi, Jiezhong Huang, Junhua Zhang, Guoqin Tan, Gaoli Sang

Accuracy target detection can benefit many target detection applications. The latest YOLOv5 method has faster detection speed and better accuracy in target detection. However, there are still insufficient on bounding box positioning and it is difficult to distinguish overlapping objects. This paper proposes an improved Attention-YOLO v5, which adds channel attention and spatial attention mechanisms to the feature extraction. Furthermore, a squeeze and excitation(SE) module is applied to improve the resolution of the input image. Experiments on two public datasets show that our proposed method effectively reduces the positioning error of the bounding box and improves the detection accuracy. The accuracy on INRIA and PnPLO datasets are 97.9% and 96.2%.

准确的目标检测可以使许多目标检测应用受益。最新的YOLOv5方法具有更快的检测速度和更好的目标检测精度。但是，在边界盒定位方面仍然存在不足，难以区分重叠对象。本文提出了一种改进的attention - yolo v5，在特征提取中加入了通道注意和空间注意机制。此外，还采用了挤压激励(SE)模块来提高输入图像的分辨率。在两个公开数据集上的实验表明，该方法有效地降低了边界盒的定位误差，提高了检测精度。在INRIA和PnPLO数据集上的准确率分别为97.9%和96.2%。

引用次数: 3

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀