首页 > 最新文献

International Conference on Pattern Recognition Applications and Methods最新文献

英文 中文
Implications of Z-Normalization in the Matrix Profile 矩阵剖面中z -归一化的含义
Pub Date : 2019-02-19 DOI: 10.1007/978-3-030-40014-9_5
Dieter De Paepe, Diego Nieves Avendano, S. Hoecke
{"title":"Implications of Z-Normalization in the Matrix Profile","authors":"Dieter De Paepe, Diego Nieves Avendano, S. Hoecke","doi":"10.1007/978-3-030-40014-9_5","DOIUrl":"https://doi.org/10.1007/978-3-030-40014-9_5","url":null,"abstract":"","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134036188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Eliminating Noise in the Matrix Profile 消除矩阵轮廓中的噪声
Pub Date : 2019-02-19 DOI: 10.5220/0007314100830093
Dieter De Paepe, Olivier Janssens, S. Hoecke
As companies are increasingly measuring their products and services, the amount of time series data is rising and techniques to extract usable information are needed. One recently developed data mining technique for time series is the Matrix Profile. It consists of the smallest z-normalized Euclidean distance of each subsequence of a time series to all other subsequences of another series. It has been used for motif and discord discovery, for segmentation and as building block for other techniques. One side effect of the z-normalization used is that small fluctuations on flat signals are upscaled. This can lead to high and unintuitive distances for very similar subsequences from noisy data. We determined an analytic method to estimate and remove the effects of this noise, adding only a single, intuitive parameter to the calculation of the Matrix Profile. This paper explains our method and demonstrates it by performing discord discovery on the Numenta Anomaly Benchmark and by segmenting the PAMAP2 activity dataset. We find that our technique results in a more intuitive Matrix Profile and provides improved results in both usecases for series containing many flat, noisy subsequences. Since our technique is an extension of the Matrix Profile, it can be applied to any of the various tasks that could be solved by it, improving results where data contains flat and noisy sequences.
随着公司越来越多地测量他们的产品和服务,时间序列数据的数量正在增加,需要提取可用信息的技术。最近开发的一种时间序列数据挖掘技术是矩阵剖面。它由一个时间序列的每个子序列到另一个序列的所有其他子序列的最小z归一化欧氏距离组成。它已被用于motif和discord的发现,分割和作为其他技术的构建块。使用z归一化的一个副作用是平坦信号上的小波动被放大。这可能会导致噪声数据中非常相似的子序列的高且不直观的距离。我们确定了一种分析方法来估计和消除这种噪声的影响,只添加一个单一的,直观的参数来计算矩阵轮廓。本文解释了我们的方法,并通过在Numenta异常基准上执行不和谐发现和分割PAMAP2活动数据集来演示它。我们发现我们的技术产生了一个更直观的矩阵轮廓,并且在包含许多平坦、噪声子序列的序列的两种使用情况下都提供了改进的结果。由于我们的技术是Matrix Profile的扩展,因此它可以应用于任何可以通过它解决的各种任务,从而改善数据包含平坦和噪声序列的结果。
{"title":"Eliminating Noise in the Matrix Profile","authors":"Dieter De Paepe, Olivier Janssens, S. Hoecke","doi":"10.5220/0007314100830093","DOIUrl":"https://doi.org/10.5220/0007314100830093","url":null,"abstract":"As companies are increasingly measuring their products and services, the amount of time series data is rising and techniques to extract usable information are needed. One recently developed data mining technique for time series is the Matrix Profile. It consists of the smallest z-normalized Euclidean distance of each subsequence of a time series to all other subsequences of another series. It has been used for motif and discord discovery, for segmentation and as building block for other techniques. One side effect of the z-normalization used is that small fluctuations on flat signals are upscaled. This can lead to high and unintuitive distances for very similar subsequences from noisy data. We determined an analytic method to estimate and remove the effects of this noise, adding only a single, intuitive parameter to the calculation of the Matrix Profile. This paper explains our method and demonstrates it by performing discord discovery on the Numenta Anomaly Benchmark and by segmenting the PAMAP2 activity dataset. We find that our technique results in a more intuitive Matrix Profile and provides improved results in both usecases for series containing many flat, noisy subsequences. Since our technique is an extension of the Matrix Profile, it can be applied to any of the various tasks that could be solved by it, improving results where data contains flat and noisy sequences.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"17 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114017170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Comparison between Supervised and Unsupervised Feature Selection Methods 有监督与无监督特征选择方法的比较
Pub Date : 2019-02-19 DOI: 10.5220/0007385305820589
L. Haar, K. Anding, K. Trambitckii, G. Notni
The reduction of the feature set by selecting relevant features for the classification process is an important step within the image processing chain, but sometimes too little attention is paid to it. Such a reduction has many advantages. It can remove irrelevant and redundant data, improve recognition performance, reduce storage capacity requirements, computational time of calculations and also the complexity of the model. Within this paper supervised and unsupervised feature selection methods are compared with respect to the achievable recognition accuracy. Supervised Methods include information of the given classes in the selection, whereas unsupervised ones can be used for tasks without known class labels. Feature clustering is an unsupervised method. For this type of feature reduction, mainly hierarchical methods, but also k-means are used. Instead of this two clustering methods, the Expectation Maximization (EM) algorithm was used in this paper. The aim is to investigate whether this type of clustering algorithm can provide a proper feature vector using feature clustering. There is no feature reduction technique that provides equally best results for all datasets and classifiers. However, for all datasets, it was possible to reduce the feature set to a specific number of useful features without losses and often even with improvements in recognition performance.
通过选择相关特征对特征集进行约简进行分类是图像处理链中的一个重要步骤,但有时人们对它的关注太少。这样的降价有很多好处。它可以去除不相关和冗余的数据,提高识别性能,降低对存储容量的要求,减少计算的计算时间,降低模型的复杂度。本文比较了有监督和无监督两种特征选择方法的识别精度。监督方法包括选择中给定类的信息,而非监督方法可用于没有已知类标签的任务。特征聚类是一种无监督方法。对于这种类型的特征约简,主要使用分层方法,但也使用k-means。本文采用期望最大化(EM)算法代替这两种聚类方法。目的是研究这种类型的聚类算法是否可以使用特征聚类提供合适的特征向量。没有一种特征约简技术可以为所有数据集和分类器提供相同的最佳结果。然而,对于所有数据集,都可以将特征集减少到特定数量的有用特征而不会损失,甚至可以提高识别性能。
{"title":"Comparison between Supervised and Unsupervised Feature Selection Methods","authors":"L. Haar, K. Anding, K. Trambitckii, G. Notni","doi":"10.5220/0007385305820589","DOIUrl":"https://doi.org/10.5220/0007385305820589","url":null,"abstract":"The reduction of the feature set by selecting relevant features for the classification process is an important step within the image processing chain, but sometimes too little attention is paid to it. Such a reduction has many advantages. It can remove irrelevant and redundant data, improve recognition performance, reduce storage capacity requirements, computational time of calculations and also the complexity of the model. Within this paper supervised and unsupervised feature selection methods are compared with respect to the achievable recognition accuracy. Supervised Methods include information of the given classes in the selection, whereas unsupervised ones can be used for tasks without known class labels. Feature clustering is an unsupervised method. For this type of feature reduction, mainly hierarchical methods, but also k-means are used. Instead of this two clustering methods, the Expectation Maximization (EM) algorithm was used in this paper. The aim is to investigate whether this type of clustering algorithm can provide a proper feature vector using feature clustering. There is no feature reduction technique that provides equally best results for all datasets and classifiers. However, for all datasets, it was possible to reduce the feature set to a specific number of useful features without losses and often even with improvements in recognition performance.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116037640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Approximation of the Distance from a Point to an Algebraic Manifold 从点到代数流形的距离的近似
Pub Date : 2019-02-19 DOI: 10.5220/0007483007150720
A. Uteshev, M. Goncharova
The problem of geometric distance d evaluation from a point X0 to an algebraic curve in R2 or manifold G(X) = 0 in R3 is treated in the form of comparison of exact value with two its successive approximations d(1) and d(2). The geometric distance is evaluated from the univariate distance equation possessing the zero set coinciding with that of critical values of the function d2(X0), while d(1)(X0) and d(2)(X0) are obtained via expansion of d2(X0) into the power series of the algebraic distance G(X0). We estimate the quality of approximation comparing the relative positions of the level sets of d(X), d(1)(X) and d(2)(X).
用精确值与其两个连续近似值d(1)和d(2)的比较的形式处理了R2中点X0到代数曲线或R3中流形G(X) = 0的几何距离d的求值问题。几何距离由具有与函数d2(X0)的临界值重合的零集的单变量距离方程计算,而d(1)(X0)和d(2)(X0)是通过将d2(X0)展开成代数距离G(X0)的幂级数得到的。我们通过比较d(X)、d(1)(X)和d(2)(X)的水平集的相对位置来估计近似的质量。
{"title":"Approximation of the Distance from a Point to an Algebraic Manifold","authors":"A. Uteshev, M. Goncharova","doi":"10.5220/0007483007150720","DOIUrl":"https://doi.org/10.5220/0007483007150720","url":null,"abstract":"The problem of geometric distance d evaluation from a point X0 to an algebraic curve in R2 or manifold G(X) = 0 in R3 is treated in the form of comparison of exact value with two its successive approximations d(1) and d(2). The geometric distance is evaluated from the univariate distance equation possessing the zero set coinciding with that of critical values of the function d2(X0), while d(1)(X0) and d(2)(X0) are obtained via expansion of d2(X0) into the power series of the algebraic distance G(X0). We estimate the quality of approximation comparing the relative positions of the level sets of d(X), d(1)(X) and d(2)(X).","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127982940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Learning Ensembles in the Presence of Imbalanced Classes 在班级不平衡的情况下学习合奏
Pub Date : 2019-02-19 DOI: 10.5220/0007681508660873
A. Saadallah, N. Piatkowski, Felix Finkeldey, P. Wiederkehr, K. Morik
Class imbalance occurs when data classes are not equally represented. Generally, it occurs when some classes represent rare events, while the other classes represent the counterpart of these events. Rare events, especially those that may have a negative impact, often require informed decision-making in a timely manner. However, class imbalance is known to induce a learning bias towards majority classes which implies a poor detection of minority classes. Thus, we propose a new ensemble method to handle class imbalance explicitly at training time. In contrast to existing ensemble methods for class imbalance that use either data driven or randomized approaches for their constructions, our method exploits both directions. On the one hand, ensemble members are built from randomized subsets of training data. On the other hand, we construct different scenarios of class imbalance for the unknown test data. An ensemble is built for each resulting scenario by combining random sampling with the estimation of the relative importance of specific loss functions. Final predictions are generated by a weighted average of each ensemble prediction. As opposed to existing methods, our approach does not try to fix imbalanced data sets. Instead, we show how imbalanced data sets can make classification easier, due to a limited range of true class frequencies. Our procedure promotes diversity among the ensemble members and is not sensitive to specific parameter settings. An experimental demonstration shows, that our new method outperforms or is on par with state-of-the-art ensembles and class imbalance techniques.
当数据类没有平等地表示时,就会发生类不平衡。通常,当一些类表示罕见事件,而另一些类表示这些事件的对应时,就会发生这种情况。罕见事件,特别是那些可能产生负面影响的事件,往往需要及时做出明智的决策。然而,我们知道,班级失衡会导致对多数班级的学习偏见,这意味着对少数班级的检测不足。因此,我们提出了一种新的集成方法来显式处理训练时的类不平衡。与使用数据驱动或随机方法构建类不平衡的现有集成方法相比,我们的方法利用了两个方向。一方面,集成成员是从训练数据的随机子集中构建的。另一方面,对于未知的测试数据,我们构建了不同的类不平衡场景。通过将随机抽样与特定损失函数的相对重要性估计相结合,为每个结果场景构建一个集成。最终的预测是由每个集合预测的加权平均值生成的。与现有方法相反,我们的方法并不试图修复不平衡的数据集。相反,我们展示了由于真实类别频率的有限范围,不平衡数据集如何使分类更容易。我们的程序促进了集合成员之间的多样性,并且对特定的参数设置不敏感。实验证明,我们的新方法优于或与最先进的合奏和班级不平衡技术相当。
{"title":"Learning Ensembles in the Presence of Imbalanced Classes","authors":"A. Saadallah, N. Piatkowski, Felix Finkeldey, P. Wiederkehr, K. Morik","doi":"10.5220/0007681508660873","DOIUrl":"https://doi.org/10.5220/0007681508660873","url":null,"abstract":"Class imbalance occurs when data classes are not equally represented. Generally, it occurs when some classes represent rare events, while the other classes represent the counterpart of these events. Rare events, especially those that may have a negative impact, often require informed decision-making in a timely manner. However, class imbalance is known to induce a learning bias towards majority classes which implies a poor detection of minority classes. Thus, we propose a new ensemble method to handle class imbalance explicitly at training time. In contrast to existing ensemble methods for class imbalance that use either data driven or randomized approaches for their constructions, our method exploits both directions. On the one hand, ensemble members are built from randomized subsets of training data. On the other hand, we construct different scenarios of class imbalance for the unknown test data. An ensemble is built for each resulting scenario by combining random sampling with the estimation of the relative importance of specific loss functions. Final predictions are generated by a weighted average of each ensemble prediction. As opposed to existing methods, our approach does not try to fix imbalanced data sets. Instead, we show how imbalanced data sets can make classification easier, due to a limited range of true class frequencies. Our procedure promotes diversity among the ensemble members and is not sensitive to specific parameter settings. An experimental demonstration shows, that our new method outperforms or is on par with state-of-the-art ensembles and class imbalance techniques.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"205 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129535420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Two-layer Residual Feature Fusion for Object Detection
Pub Date : 2019-02-19 DOI: 10.5220/0007306803520359
Jaeseok Choi, Kyoungmin Lee, Jisoo Jeong, Nojun Kwak
Recently, a lot of single stage detectors using multi-scale features have been actively proposed. They are much faster than two stage detectors that use region proposal networks (RPN) without much degradation in the detection performances. However, the feature maps in the lower layers close to the input which are responsible for detecting small objects in a single stage detector have a problem of insufficient representation power because they are too shallow. There is also a structural contradiction that the feature maps not only have to deliver low-level information to next layers but also have to contain high-level abstraction for prediction. In this paper, we propose a method to enrich the representation power of feature maps using a new feature fusion method which makes use of the information from the consecutive layer. It also adopts a unified prediction module which has an enhanced generalization performance. The proposed method enables more precise prediction, which achieved higher or compatible score than other competitors such as SSD and DSSD on PASCAL VOC and MS COCO. In addition, it maintains the advantage of fast computation of a single stage detector, which requires much less computation than other detectors with similar performance.
近年来,人们积极地提出了许多基于多尺度特征的单级检测器。它们比使用区域建议网络(RPN)的两级检测器快得多,且检测性能没有明显下降。然而,在单级检测器中,靠近输入的低层负责检测小物体的特征映射由于太浅而存在表示能力不足的问题。还有一个结构上的矛盾,即特征映射不仅要向下一层传递低级信息,还必须包含用于预测的高级抽象。本文提出了一种利用连续层信息的特征融合方法来增强特征映射的表示能力。采用了统一的预测模块,提高了泛化性能。该方法预测精度更高,在PASCAL VOC和MS COCO上取得了比SSD和DSSD等竞争对手更高或兼容的分数。此外,它还保持了单级检测器计算速度快的优点,与同类性能的检测器相比,计算量要少得多。
{"title":"Two-layer Residual Feature Fusion for Object Detection","authors":"Jaeseok Choi, Kyoungmin Lee, Jisoo Jeong, Nojun Kwak","doi":"10.5220/0007306803520359","DOIUrl":"https://doi.org/10.5220/0007306803520359","url":null,"abstract":"Recently, a lot of single stage detectors using multi-scale features have been actively proposed. They are much faster than two stage detectors that use region proposal networks (RPN) without much degradation in the detection performances. However, the feature maps in the lower layers close to the input which are responsible for detecting small objects in a single stage detector have a problem of insufficient representation power because they are too shallow. There is also a structural contradiction that the feature maps not only have to deliver low-level information to next layers but also have to contain high-level abstraction for prediction. In this paper, we propose a method to enrich the representation power of feature maps using a new feature fusion method which makes use of the information from the consecutive layer. It also adopts a unified prediction module which has an enhanced generalization performance. The proposed method enables more precise prediction, which achieved higher or compatible score than other competitors such as SSD and DSSD on PASCAL VOC and MS COCO. In addition, it maintains the advantage of fast computation of a single stage detector, which requires much less computation than other detectors with similar performance.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124203848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
All Together Now! The Benefits of Adaptively Fusing Pre-trained Deep Representations 现在一起来!自适应融合预训练深度表征的好处
Pub Date : 2019-02-19 DOI: 10.5220/0007367301350144
Yehezkel S. Resheff, I. Lieder, Tom Hope
Pre-trained deep neural networks, powerful models trained on large datasets, have become a popular tool in computer vision for transfer learning. However, the standard approach of using a single network potentially misses out on valuable information contained in other readily available models. In this work, we study the Mixture of Experts (MoE) approach for adaptively fusing multiple pre-trained models for each individual input image. In particular, we explore how far we can get by combining diverse pre-trained representations in a customized way that maximizes their potential in a lightweight framework. Our approach is motivated by an empirical study of the predictions made by popular pre-trained nets across various datasets, finding that both performance and agreement between models vary across datasets. We further propose a miniature CNN gating mechanism operating on a thumbnail version of the input image, and show this is enough to guide a good fusion. Finally, we explore a multi-modal blend of visual and natural-language representations, using a label-space embedding to inject pre-trained word-vectors. Across multiple datasets, we demonstrate that an adaptive fusion of pre-trained models can obtain favorable results.
预训练深度神经网络是在大数据集上训练的强大模型,已经成为计算机视觉中迁移学习的流行工具。然而,使用单一网络的标准方法可能会错过包含在其他现成可用模型中的有价值的信息。在这项工作中,我们研究了混合专家(MoE)方法,用于自适应融合每个单独输入图像的多个预训练模型。特别地,我们探索了通过以定制的方式组合各种预训练的表示,在轻量级框架中最大限度地发挥其潜力,我们可以走多远。我们的方法的动机是对流行的预训练网络在各种数据集上所做的预测进行实证研究,发现模型之间的性能和一致性在不同的数据集上都是不同的。我们进一步提出了一种微型CNN门控机制,该机制在输入图像的缩略图上运行,并表明这足以指导良好的融合。最后,我们探索了视觉和自然语言表示的多模态混合,使用标签空间嵌入注入预训练的词向量。在多个数据集上,我们证明了预训练模型的自适应融合可以获得良好的结果。
{"title":"All Together Now! The Benefits of Adaptively Fusing Pre-trained Deep Representations","authors":"Yehezkel S. Resheff, I. Lieder, Tom Hope","doi":"10.5220/0007367301350144","DOIUrl":"https://doi.org/10.5220/0007367301350144","url":null,"abstract":"Pre-trained deep neural networks, powerful models trained on large datasets, have become a popular tool in computer vision for transfer learning. However, the standard approach of using a single network potentially misses out on valuable information contained in other readily available models. In this work, we study the Mixture of Experts (MoE) approach for adaptively fusing multiple pre-trained models for each individual input image. In particular, we explore how far we can get by combining diverse pre-trained representations in a customized way that maximizes their potential in a lightweight framework. Our approach is motivated by an empirical study of the predictions made by popular pre-trained nets across various datasets, finding that both performance and agreement between models vary across datasets. We further propose a miniature CNN gating mechanism operating on a thumbnail version of the input image, and show this is enough to guide a good fusion. Finally, we explore a multi-modal blend of visual and natural-language representations, using a label-space embedding to inject pre-trained word-vectors. Across multiple datasets, we demonstrate that an adaptive fusion of pre-trained models can obtain favorable results.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"337 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122976053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Feedforward and Feedback Processing of Spatiotemporal Tubes for Efficient Object Localization 有效目标定位的时空管前馈与反馈处理
Pub Date : 2019-02-19 DOI: 10.5220/0007313603770387
Khari Jarrett, Joachim Lohn-Jaramillo, E. Bowen, Laura Ray, R. Granger
We introduce a new set of mechanisms for tracking entities through videos, at substantially less expense than required by standard methods. The approach combines inexpensive initial processing of individual frames together with integration of information across long time spans (multiple frames), resulting in the recognition and tracking of spatially and temporally contiguous entities, rather than focusing on the individual pixels that comprise those entities.
我们引入了一套新的机制,通过视频跟踪实体,比标准方法所需的费用少得多。该方法将单个帧的低成本初始处理与跨长时间跨度(多帧)的信息集成相结合,从而识别和跟踪空间和时间上连续的实体,而不是专注于组成这些实体的单个像素。
{"title":"Feedforward and Feedback Processing of Spatiotemporal Tubes for Efficient Object Localization","authors":"Khari Jarrett, Joachim Lohn-Jaramillo, E. Bowen, Laura Ray, R. Granger","doi":"10.5220/0007313603770387","DOIUrl":"https://doi.org/10.5220/0007313603770387","url":null,"abstract":"We introduce a new set of mechanisms for tracking entities through videos, at substantially less expense than required by standard methods. The approach combines inexpensive initial processing of individual frames together with integration of information across long time spans (multiple frames), resulting in the recognition and tracking of spatially and temporally contiguous entities, rather than focusing on the individual pixels that comprise those entities.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125223428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Stochastic Phase Estimation and Unwrapping 随机相位估计和展开
Pub Date : 2019-02-19 DOI: 10.5220/0007389402000209
Mara Pistellato, Filippo Bergamasco, A. Albarelli, L. Cosmo, A. Gasparetto, A. Torsello
Phase-shift is one of the most effective techniques in 3D structured-light scanning for its accuracy and noise resilience. However, the periodic nature of the signal causes a spatial ambiguity when the fringe periods are shorter than the projector resolution. To solve this, many techniques exploit multiple combined signals to unwrap the phases and thus recovering a unique consistent code. In this paper, we study the phase estimation and unwrapping problem in a stochastic context. Assuming the acquired fringe signal to be affected by additive white Gaussian noise, we start by modelling each estimated phase as a zero-mean Wrapped Normal distribution with variance σ2. Then, our contributions are twofolds. First, we show how to recover the best projector code given multiple phase observations by means of a ML estimation over the combined fringe distributions. Second, we exploit the Cramer-Rao bounds to relate the phase variance σ2 to the variance of the observed signal, that can be easily estimated online during the fringe acquisition. An extensive set of experiments demonstrate that our approach outperforms other methods in terms of code recovery accuracy and ratio of faulty unwrappings.
相移是三维结构光扫描中最有效的技术之一,具有精度高、抗噪性好等优点。然而,当条纹周期短于投影仪分辨率时,信号的周期性会导致空间模糊。为了解决这个问题,许多技术利用多个组合信号来解开相位,从而恢复唯一的一致代码。本文研究了随机环境下的相位估计和解包裹问题。假设采集到的条纹信号受到加性高斯白噪声的影响,我们首先将每个估计相位建模为方差为σ2的零均值包裹正态分布。那么,我们的贡献是双重的。首先,我们展示了如何通过对组合条纹分布的ML估计来恢复给定多相观测的最佳投影仪代码。其次,我们利用Cramer-Rao边界将相位方差σ2与观测信号的方差联系起来,可以在条纹采集过程中轻松地在线估计。大量的实验表明,我们的方法在代码恢复精度和错误展开率方面优于其他方法。
{"title":"Stochastic Phase Estimation and Unwrapping","authors":"Mara Pistellato, Filippo Bergamasco, A. Albarelli, L. Cosmo, A. Gasparetto, A. Torsello","doi":"10.5220/0007389402000209","DOIUrl":"https://doi.org/10.5220/0007389402000209","url":null,"abstract":"Phase-shift is one of the most effective techniques in 3D structured-light scanning for its accuracy and noise resilience. However, the periodic nature of the signal causes a spatial ambiguity when the fringe periods are shorter than the projector resolution. To solve this, many techniques exploit multiple combined signals to unwrap the phases and thus recovering a unique consistent code. In this paper, we study the phase estimation and unwrapping problem in a stochastic context. Assuming the acquired fringe signal to be affected by additive white Gaussian noise, we start by modelling each estimated phase as a zero-mean Wrapped Normal distribution with variance σ2. Then, our contributions are twofolds. First, we show how to recover the best projector code given multiple phase observations by means of a ML estimation over the combined fringe distributions. Second, we exploit the Cramer-Rao bounds to relate the phase variance σ2 to the variance of the observed signal, that can be easily estimated online during the fringe acquisition. An extensive set of experiments demonstrate that our approach outperforms other methods in terms of code recovery accuracy and ratio of faulty unwrappings.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115517341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multimodal Sentiment Analysis: A Multitask Learning Approach 多模态情感分析:多任务学习方法
Pub Date : 2019-02-19 DOI: 10.5220/0007313503680376
M. Fortin, B. Chaib-draa
Multimodal sentiment analysis has recently received an increasing interest. However, most methods have considered that text and image modalities are always available at test time. This assumption is often violated in real environments (e.g. social media) since users do not always publish a text with an image. In this paper we propose a method based on a multitask framework to combine multimodal information when it is available, while being able to handle the cases where a modality is missing. Our model contains one classifier for analyzing the text, another for analyzing the image, and another performing the prediction by fusing both modalities. In addition to offer a solution to the problem of a missing modality, our experiments show that this multitask framework improves generalization by acting as a regularization mechanism. We also demonstrate that the model can handle a missing modality at training time, thus being able to be trained with image-only and text-only examples.
多模态情感分析近年来受到越来越多的关注。然而,大多数方法都认为文本和图像模式在测试时总是可用的。这个假设在现实环境中经常被违反(例如社交媒体),因为用户并不总是发布带有图像的文本。在本文中,我们提出了一种基于多任务框架的方法,在多模态信息可用时进行组合,同时能够处理模态缺失的情况。我们的模型包含一个用于分析文本的分类器,另一个用于分析图像的分类器,另一个通过融合两种模式来执行预测。除了为缺少模态的问题提供解决方案之外,我们的实验表明,这个多任务框架通过充当正则化机制来提高泛化。我们还证明了该模型可以在训练时处理缺失的模态,从而能够使用纯图像和纯文本示例进行训练。
{"title":"Multimodal Sentiment Analysis: A Multitask Learning Approach","authors":"M. Fortin, B. Chaib-draa","doi":"10.5220/0007313503680376","DOIUrl":"https://doi.org/10.5220/0007313503680376","url":null,"abstract":"Multimodal sentiment analysis has recently received an increasing interest. However, most methods have considered that text and image modalities are always available at test time. This assumption is often violated in real environments (e.g. social media) since users do not always publish a text with an image. In this paper we propose a method based on a multitask framework to combine multimodal information when it is available, while being able to handle the cases where a modality is missing. Our model contains one classifier for analyzing the text, another for analyzing the image, and another performing the prediction by fusing both modalities. In addition to offer a solution to the problem of a missing modality, our experiments show that this multitask framework improves generalization by acting as a regularization mechanism. We also demonstrate that the model can handle a missing modality at training time, thus being able to be trained with image-only and text-only examples.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116325066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
期刊
International Conference on Pattern Recognition Applications and Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1