首页 > 最新文献

International Conference on Pattern Recognition Applications and Methods最新文献

英文 中文
Empirical Evaluation on Utilizing CNN-features for Seismic Patch Classification 利用cnn特征进行地震斑块分类的实证评价
Pub Date : 2021-12-25 DOI: 10.5220/0010185701660173
Chun‐Xia Zhang, Xiaoli Wei, Sang-Woon Kim
This paper empirically evaluates two kinds of features, which are extracted, respectively, with traditional statistical methods and convolutional neural networks (CNNs), in order to improve the performance of seismic patch image classification. In the latter case, feature vectors, named “CNN-features”, were extracted from one trained CNN model, and were then used to learn existing classifiers, such as support vector machines. In this case, to learn the CNN model, a technique of transfer learning using synthetic seismic patch data in the source domain, and real-world patch data in the target domain, was applied. The experimental results show that CNN-features lead to some improvements in the classification performance. By analyzing the data complexity measures, the CNN-features are found to have the strongest discriminant capabilities. Furthermore, the transfer learning technique alleviates the problems of long processing times and the lack of learning data.
本文对传统统计方法和卷积神经网络(cnn)分别提取的两类特征进行了实证评价,以提高地震斑块图像分类的性能。在后一种情况下,从一个训练好的CNN模型中提取特征向量,称为“CNN-features”,然后用于学习现有的分类器,如支持向量机。在这种情况下,为了学习CNN模型,我们采用了一种迁移学习技术,在源域使用合成的地震patch数据,在目标域使用真实的patch数据。实验结果表明,cnn特征对分类性能有一定的改善。通过对数据复杂度度量的分析,发现cnn特征具有最强的判别能力。此外,迁移学习技术还缓解了处理时间长和学习数据缺乏的问题。
{"title":"Empirical Evaluation on Utilizing CNN-features for Seismic Patch Classification","authors":"Chun‐Xia Zhang, Xiaoli Wei, Sang-Woon Kim","doi":"10.5220/0010185701660173","DOIUrl":"https://doi.org/10.5220/0010185701660173","url":null,"abstract":"This paper empirically evaluates two kinds of features, which are extracted, respectively, with traditional statistical methods and convolutional neural networks (CNNs), in order to improve the performance of seismic patch image classification. In the latter case, feature vectors, named “CNN-features”, were extracted from one trained CNN model, and were then used to learn existing classifiers, such as support vector machines. In this case, to learn the CNN model, a technique of transfer learning using synthetic seismic patch data in the source domain, and real-world patch data in the target domain, was applied. The experimental results show that CNN-features lead to some improvements in the classification performance. By analyzing the data complexity measures, the CNN-features are found to have the strongest discriminant capabilities. Furthermore, the transfer learning technique alleviates the problems of long processing times and the lack of learning data.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"168 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121250651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On the Choice of General Purpose Classifiers in Learned Bloom Filters: An Initial Analysis Within Basic Filters 学习布隆过滤器中通用分类器的选择:对基本过滤器的初步分析
Pub Date : 2021-12-13 DOI: 10.5220/0010889000003122
G. Fumagalli, Davide Raimondi, R. Giancarlo, D. Malchiodi, Marco Frasca
Bloom Filters are a fundamental and pervasive data structure. Within the growing area of Learned Data Structures, several Learned versions of Bloom Filters have been considered, yielding advantages over classic Filters. Each of them uses a classifier, which is the Learned part of the data structure. Although it has a central role in those new filters, and its space footprint as well as classification time may affect the performance of the Learned Filter, no systematic study of which specific classifier to use in which circumstances is available. We report progress in this area here, providing also initial guidelines on which classifier to choose among five classic classification paradigms.
布隆过滤器是一种基本且普遍的数据结构。在不断增长的学习数据结构领域,已经考虑了几个学习版本的布隆过滤器,产生了优于经典过滤器的优点。它们每个都使用一个分类器,这是数据结构的学习部分。尽管它在这些新的过滤器中起着核心作用,而且它的空间占用和分类时间可能会影响学习过滤器的性能,但是没有系统的研究表明在什么情况下使用哪个特定的分类器是可用的。我们在这里报告了这一领域的进展,并提供了在五个经典分类范式中选择哪个分类器的初步指南。
{"title":"On the Choice of General Purpose Classifiers in Learned Bloom Filters: An Initial Analysis Within Basic Filters","authors":"G. Fumagalli, Davide Raimondi, R. Giancarlo, D. Malchiodi, Marco Frasca","doi":"10.5220/0010889000003122","DOIUrl":"https://doi.org/10.5220/0010889000003122","url":null,"abstract":"Bloom Filters are a fundamental and pervasive data structure. Within the growing area of Learned Data Structures, several Learned versions of Bloom Filters have been considered, yielding advantages over classic Filters. Each of them uses a classifier, which is the Learned part of the data structure. Although it has a central role in those new filters, and its space footprint as well as classification time may affect the performance of the Learned Filter, no systematic study of which specific classifier to use in which circumstances is available. We report progress in this area here, providing also initial guidelines on which classifier to choose among five classic classification paradigms.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123718110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Single-Step Adversarial Training for Semantic Segmentation 语义分割的单步对抗训练
Pub Date : 2021-06-30 DOI: 10.5220/0010788400003122
D. Wiens, B. Hammer
Even though deep neural networks succeed on many different tasks including semantic segmentation, they lack on robustness against adversarial examples. To counteract this exploit, often adversarial training is used. However, it is known that adversarial training with weak adversarial attacks (e.g. using the Fast Gradient Method) does not improve the robustness against stronger attacks. Recent research shows that it is possible to increase the robustness of such single-step methods by choosing an appropriate step size during the training. Finding such a step size, without increasing the computational effort of single-step adversarial training, is still an open challenge. In this work we address the computationally particularly demanding task of semantic segmentation and propose a new step size control algorithm that increases the robustness of single-step adversarial training. The proposed algorithm does not increase the computational effort of single-step adversarial training considerably and also simplifies training, because it is free of meta-parameter. We show that the robustness of our approach can compete with multi-step adversarial training on two popular benchmarks for semantic segmentation.
尽管深度神经网络在许多不同的任务上都取得了成功,包括语义分割,但它们对对抗示例缺乏鲁棒性。为了对抗这种攻击,通常会使用对抗性训练。然而,众所周知,使用弱对抗性攻击的对抗性训练(例如使用快速梯度方法)并不能提高对更强攻击的鲁棒性。最近的研究表明,可以通过在训练过程中选择适当的步长来增加这种单步方法的鲁棒性。在不增加单步对抗训练的计算工作量的情况下,找到这样一个步长仍然是一个开放的挑战。在这项工作中,我们解决了计算上特别苛刻的语义分割任务,并提出了一种新的步长控制算法,增加了单步对抗训练的鲁棒性。由于该算法不含元参数,因此不会大大增加单步对抗训练的计算量,并且简化了训练。我们表明,我们的方法的鲁棒性可以在两个流行的语义分割基准上与多步骤对抗训练相竞争。
{"title":"Single-Step Adversarial Training for Semantic Segmentation","authors":"D. Wiens, B. Hammer","doi":"10.5220/0010788400003122","DOIUrl":"https://doi.org/10.5220/0010788400003122","url":null,"abstract":"Even though deep neural networks succeed on many different tasks including semantic segmentation, they lack on robustness against adversarial examples. To counteract this exploit, often adversarial training is used. However, it is known that adversarial training with weak adversarial attacks (e.g. using the Fast Gradient Method) does not improve the robustness against stronger attacks. Recent research shows that it is possible to increase the robustness of such single-step methods by choosing an appropriate step size during the training. Finding such a step size, without increasing the computational effort of single-step adversarial training, is still an open challenge. In this work we address the computationally particularly demanding task of semantic segmentation and propose a new step size control algorithm that increases the robustness of single-step adversarial training. The proposed algorithm does not increase the computational effort of single-step adversarial training considerably and also simplifies training, because it is free of meta-parameter. We show that the robustness of our approach can compete with multi-step adversarial training on two popular benchmarks for semantic segmentation.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129415008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reduced Precision Strategies for Deep Learning: A High Energy Physics Generative Adversarial Network Use Case 深度学习的低精度策略:一个高能物理生成对抗网络用例
Pub Date : 2021-03-18 DOI: 10.5220/0010245002510258
F. Rehm, S. Vallecorsa, V. Saletore, Hans Pabst, Adel Chaibi, V. Codreanu, K. Borras, D. Krücker
Deep learning is finding its way into high energy physics by replacing traditional Monte Carlo simulations. However, deep learning still requires an excessive amount of computational resources. A promising approach to make deep learning more efficient is to quantize the parameters of the neural networks to reduced precision. Reduced precision computing is extensively used in modern deep learning and results to lower execution inference time, smaller memory footprint and less memory bandwidth. In this paper we analyse the effects of low precision inference on a complex deep generative adversarial network model. The use case which we are addressing is calorimeter detector simulations of subatomic particle interactions in accelerator based high energy physics. We employ the novel Intel low precision optimization tool (iLoT) for quantization and compare the results to the quantized model from TensorFlow Lite. In the performance benchmark we gain a speed-up of 1.73x on Intel hardware for the quantized iLoT model compared to the initial, not quantized, model. With different physics-inspired self-developed metrics, we validate that the quantized iLoT model shows a lower loss of physical accuracy in comparison to the TensorFlow Lite model.
通过取代传统的蒙特卡罗模拟,深度学习正在进入高能物理领域。然而,深度学习仍然需要大量的计算资源。使深度学习更有效的一种有前途的方法是量化神经网络的参数以降低精度。低精度计算在现代深度学习中得到了广泛的应用,其结果是降低了执行推理时间,减少了内存占用和内存带宽。本文分析了低精度推理对复杂深度生成对抗网络模型的影响。我们正在处理的用例是基于加速器的高能物理中亚原子粒子相互作用的量热计探测器模拟。我们采用英特尔低精度优化工具(iLoT)进行量化,并将结果与TensorFlow Lite的量化模型进行比较。在性能基准测试中,与初始的、未量化的模型相比,我们在英特尔硬件上获得了1.73倍的加速提升。使用不同的物理启发的自开发指标,我们验证了量化的iLoT模型与TensorFlow Lite模型相比,显示出更低的物理精度损失。
{"title":"Reduced Precision Strategies for Deep Learning: A High Energy Physics Generative Adversarial Network Use Case","authors":"F. Rehm, S. Vallecorsa, V. Saletore, Hans Pabst, Adel Chaibi, V. Codreanu, K. Borras, D. Krücker","doi":"10.5220/0010245002510258","DOIUrl":"https://doi.org/10.5220/0010245002510258","url":null,"abstract":"Deep learning is finding its way into high energy physics by replacing traditional Monte Carlo simulations. However, deep learning still requires an excessive amount of computational resources. A promising approach to make deep learning more efficient is to quantize the parameters of the neural networks to reduced precision. Reduced precision computing is extensively used in modern deep learning and results to lower execution inference time, smaller memory footprint and less memory bandwidth. In this paper we analyse the effects of low precision inference on a complex deep generative adversarial network model. The use case which we are addressing is calorimeter detector simulations of subatomic particle interactions in accelerator based high energy physics. We employ the novel Intel low precision optimization tool (iLoT) for quantization and compare the results to the quantized model from TensorFlow Lite. In the performance benchmark we gain a speed-up of 1.73x on Intel hardware for the quantized iLoT model compared to the initial, not quantized, model. With different physics-inspired self-developed metrics, we validate that the quantized iLoT model shows a lower loss of physical accuracy in comparison to the TensorFlow Lite model.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124597840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
The U-Net based GLOW for Optical-Flow-Free Video Interframe Generation 基于U-Net的无光流视频帧间生成技术
Pub Date : 2021-03-17 DOI: 10.5220/0010869400003122
Saem Park, D. Han, Nojun Kwak
Video frame interpolation is the task of creating an interframe between two adjacent frames along the time axis. So, instead of simply averaging two adjacent frames to create an intermediate image, this operation should maintain semantic continuity with the adjacent frames. Most conventional methods use optical flow, and various tools such as occlusion handling and object smoothing are indispensable. Since the use of these various tools leads to complex problems, we tried to tackle the video interframe generation problem without using problematic optical flow . To enable this , we have tried to use a deep neural network with an invertible structure, and developed an U-Net based Generative Flow which is a modified normalizing flow. In addition, we propose a learning method with a new consistency loss in the latent space to maintain semantic temporal consistency between frames. The resolution of the generated image is guaranteed to be identical to that of the original images by using an invertible network. Furthermore, as it is not a random image like the ones by generative models, our network guarantees stable outputs without flicker. Through experiments, we sam {confirmed the feasibility of the proposed algorithm and would like to suggest the U-Net based Generative Flow as a new possibility for baseline in video frame interpolation. This paper is meaningful in that it is the world's first attempt to use invertible networks instead of optical flows for video interpolation.
视频帧插值是沿时间轴在两个相邻帧之间创建帧间的任务。因此,不是简单地取两个相邻帧的平均值来创建中间图像,这个操作应该保持相邻帧的语义连续性。传统的方法大多使用光流,各种工具如遮挡处理和物体平滑是必不可少的。由于使用这些不同的工具会导致复杂的问题,我们尝试在不使用有问题的光流的情况下解决视频帧间生成问题。为了实现这一点,我们尝试使用具有可逆结构的深度神经网络,并开发了基于U-Net的生成流,这是一种改进的归一化流。此外,我们提出了一种在潜在空间中引入新的一致性损失的学习方法,以保持帧间的语义时间一致性。通过使用可逆网络,保证生成图像的分辨率与原始图像相同。此外,由于它不是像生成模型那样的随机图像,我们的网络保证了稳定的输出,没有闪烁。通过实验,我们证实了所提出算法的可行性,并建议基于U-Net的生成流作为视频帧插值中基线的一种新的可能性。这篇论文的意义在于,它是世界上第一次尝试用可逆网络代替光流进行视频插值。
{"title":"The U-Net based GLOW for Optical-Flow-Free Video Interframe Generation","authors":"Saem Park, D. Han, Nojun Kwak","doi":"10.5220/0010869400003122","DOIUrl":"https://doi.org/10.5220/0010869400003122","url":null,"abstract":"Video frame interpolation is the task of creating an interframe between two adjacent frames along the time axis. So, instead of simply averaging two adjacent frames to create an intermediate image, this operation should maintain semantic continuity with the adjacent frames. Most conventional methods use optical flow, and various tools such as occlusion handling and object smoothing are indispensable. Since the use of these various tools leads to complex problems, we tried to tackle the video interframe generation problem without using problematic optical flow . To enable this , we have tried to use a deep neural network with an invertible structure, and developed an U-Net based Generative Flow which is a modified normalizing flow. In addition, we propose a learning method with a new consistency loss in the latent space to maintain semantic temporal consistency between frames. The resolution of the generated image is guaranteed to be identical to that of the original images by using an invertible network. Furthermore, as it is not a random image like the ones by generative models, our network guarantees stable outputs without flicker. Through experiments, we sam {confirmed the feasibility of the proposed algorithm and would like to suggest the U-Net based Generative Flow as a new possibility for baseline in video frame interpolation. This paper is meaningful in that it is the world's first attempt to use invertible networks instead of optical flows for video interpolation.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127019897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning Based Model Identification System Exploits the Modular Structure of a Bio-Inspired Posture Control Model for Humans and Humanoids 基于深度学习的模型识别系统利用仿生人体和类人姿态控制模型的模块化结构
Pub Date : 2021-02-04 DOI: 10.5220/0010245405400547
Vittorio Lippi
This work presents a system identification procedure based on Convolutional Neural Networks (CNN) for human posture control using the DEC (Disturbance Estimation and Compensation) parametric model. The modular structure of the proposed control model inspired the design of a modular identification procedure, in the sense that the same neural network is used to identify the parameters of the modules controlling different degrees of freedom. In this way the presented examples of body sway induced by external stimuli provide several training samples at once
本文提出了一种基于卷积神经网络(CNN)的系统识别方法,用于使用DEC(扰动估计和补偿)参数模型进行人体姿态控制。所提出的控制模型的模块化结构启发了模块化识别过程的设计,即使用相同的神经网络来识别控制不同自由度的模块的参数。通过这种方式,所提出的由外部刺激引起的身体摇摆的例子同时提供了几个训练样本
{"title":"Deep Learning Based Model Identification System Exploits the Modular Structure of a Bio-Inspired Posture Control Model for Humans and Humanoids","authors":"Vittorio Lippi","doi":"10.5220/0010245405400547","DOIUrl":"https://doi.org/10.5220/0010245405400547","url":null,"abstract":"This work presents a system identification procedure based on Convolutional Neural Networks (CNN) for human posture control using the DEC (Disturbance Estimation and Compensation) parametric model. The modular structure of the proposed control model inspired the design of a modular identification procedure, in the sense that the same neural network is used to identify the parameters of the modules controlling different degrees of freedom. In this way the presented examples of body sway induced by external stimuli provide several training samples at once","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128850877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Importance of Models in Data Analysis with Small Human Movement Datasets - Inspirations from Neurorobotics Applied to Posture Control of Humanoids and Humans 模型在小型人体运动数据集数据分析中的重要性——来自神经机器人应用于人形和人类姿态控制的启示
Pub Date : 2021-02-04 DOI: 10.5220/0010297005790585
Vittorio Lippi, C. Maurer, T. Mergner
Machine learning has shown impressive improvements recently, thanks especially to the results shown in deep learning applications. Besides important advancements in the theory, such improvements have been associated with an increment in the complexity of the used models (i.e. the numbers of neurons and connections in neural networks). Bigger models are possible given the amount of data used in the training process is increased accordingly. In medical applications, however, the size of datasets is often limited by the availability of human subjects and the effort required to perform human experiments. This position paper proposes the integration of bioinspired models with machine learning.
机器学习最近取得了令人印象深刻的进步,尤其是在深度学习应用中所显示的结果。除了理论上的重要进步外,这种改进还与所用模型复杂性的增加(即神经网络中神经元和连接的数量)有关。考虑到训练过程中使用的数据量相应增加,更大的模型是可能的。然而,在医学应用中,数据集的大小往往受到人类受试者的可用性和进行人体实验所需的努力的限制。本意见书提出将生物启发模型与机器学习相结合。
{"title":"The Importance of Models in Data Analysis with Small Human Movement Datasets - Inspirations from Neurorobotics Applied to Posture Control of Humanoids and Humans","authors":"Vittorio Lippi, C. Maurer, T. Mergner","doi":"10.5220/0010297005790585","DOIUrl":"https://doi.org/10.5220/0010297005790585","url":null,"abstract":"Machine learning has shown impressive improvements recently, thanks especially to the results shown in deep learning applications. Besides important advancements in the theory, such improvements have been associated with an increment in the complexity of the used models (i.e. the numbers of neurons and connections in neural networks). Bigger models are possible given the amount of data used in the training process is increased accordingly. In medical applications, however, the size of datasets is often limited by the availability of human subjects and the effort required to perform human experiments. This position paper proposes the integration of bioinspired models with machine learning.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117267633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Active Region Detection in Multi-spectral Solar Images 太阳多光谱图像的活动区域检测
Pub Date : 2021-02-01 DOI: 10.5220/0010310504520459
Majedaldein Almahasneh, A. Paiement, Xianghua Xie, J. Aboudarham
Precisely detecting solar Active Regions (AR) from multi-spectral images is a challenging task yet important in understanding solar activity and its influence on space weather. A main challenge comes from each modality capturing a different location of these 3D objects, as opposed to more traditional multi-spectral imaging scenarios where all image bands observe the same scene. We present a multi-task deep learning framework that exploits the dependencies between image bands to produce 3D AR detection where different image bands (and physical locations) each have their own set of results. We compare our detection method against baseline approaches for solar image analysis (multi-channel coronal hole detection, SPOCA for ARs (Verbeeck et al., 2013)) and a state-of-the-art deep learning method (Faster RCNN) and show enhanced performances in detecting ARs jointly from multiple bands.
从多光谱图像中精确探测太阳活动区(AR)是一项具有挑战性的任务,但对于了解太阳活动及其对空间天气的影响至关重要。一个主要的挑战来自于每种模式捕获这些3D物体的不同位置,而不是更传统的多光谱成像场景,所有图像带观察相同的场景。我们提出了一个多任务深度学习框架,利用图像波段之间的依赖关系来产生3D AR检测,其中不同的图像波段(和物理位置)每个都有自己的一组结果。我们将我们的检测方法与太阳图像分析的基线方法(多通道日冕洞检测,用于ar的SPOCA (Verbeeck等人,2013))和最先进的深度学习方法(Faster RCNN)进行了比较,并在多波段联合检测ar方面显示出增强的性能。
{"title":"Active Region Detection in Multi-spectral Solar Images","authors":"Majedaldein Almahasneh, A. Paiement, Xianghua Xie, J. Aboudarham","doi":"10.5220/0010310504520459","DOIUrl":"https://doi.org/10.5220/0010310504520459","url":null,"abstract":"Precisely detecting solar Active Regions (AR) from multi-spectral images is a challenging task yet important in understanding solar activity and its influence on space weather. A main challenge comes from each modality capturing a different location of these 3D objects, as opposed to more traditional multi-spectral imaging scenarios where all image bands observe the same scene. We present a multi-task deep learning framework that exploits the dependencies between image bands to produce 3D AR detection where different image bands (and physical locations) each have their own set of results. We compare our detection method against baseline approaches for solar image analysis (multi-channel coronal hole detection, SPOCA for ARs (Verbeeck et al., 2013)) and a state-of-the-art deep learning method (Faster RCNN) and show enhanced performances in detecting ARs jointly from multiple bands.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132410566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
DenseHMM: Learning Hidden Markov Models by Learning Dense Representations DenseHMM:通过学习密集表示来学习隐马尔可夫模型
Pub Date : 2020-12-17 DOI: 10.5220/0010821800003122
Joachim Sicking, Maximilian Pintz, M. Akila, Tim Wirtz
We propose DenseHMM - a modification of Hidden Markov Models (HMMs) that allows to learn dense representations of both the hidden states and the observables. Compared to the standard HMM, transition probabilities are not atomic but composed of these representations via kernelization. Our approach enables constraint-free and gradient-based optimization. We propose two optimization schemes that make use of this: a modification of the Baum-Welch algorithm and a direct co-occurrence optimization. The latter one is highly scalable and comes empirically without loss of performance compared to standard HMMs. We show that the non-linearity of the kernelization is crucial for the expressiveness of the representations. The properties of the DenseHMM like learned co-occurrences and log-likelihoods are studied empirically on synthetic and biomedical datasets.
我们提出了DenseHMM——隐马尔可夫模型(hmm)的一种修改,允许学习隐藏状态和可观察对象的密集表示。与标准HMM相比,跃迁概率不是原子的,而是由这些表示通过核化组成的。我们的方法可以实现无约束和基于梯度的优化。我们提出了两种利用这一点的优化方案:鲍姆-韦尔奇算法的修改和直接共现优化。后者具有高度可扩展性,并且与标准hmm相比没有性能损失。我们证明了核化的非线性对于表示的表达性是至关重要的。在合成和生物医学数据集上对DenseHMM类学习共现和对数似然的性质进行了实证研究。
{"title":"DenseHMM: Learning Hidden Markov Models by Learning Dense Representations","authors":"Joachim Sicking, Maximilian Pintz, M. Akila, Tim Wirtz","doi":"10.5220/0010821800003122","DOIUrl":"https://doi.org/10.5220/0010821800003122","url":null,"abstract":"We propose DenseHMM - a modification of Hidden Markov Models (HMMs) that allows to learn dense representations of both the hidden states and the observables. Compared to the standard HMM, transition probabilities are not atomic but composed of these representations via kernelization. Our approach enables constraint-free and gradient-based optimization. We propose two optimization schemes that make use of this: a modification of the Baum-Welch algorithm and a direct co-occurrence optimization. The latter one is highly scalable and comes empirically without loss of performance compared to standard HMMs. We show that the non-linearity of the kernelization is crucial for the expressiveness of the representations. The properties of the DenseHMM like learned co-occurrences and log-likelihoods are studied empirically on synthetic and biomedical datasets.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123716505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FLIC: Fast Lidar Image Clustering 快速激光雷达图像聚类
Pub Date : 2020-12-08 DOI: 10.5220/0010193700250035
Frederik Hasecke, Lukas Hahn, A. Kummert
Lidar sensors are widely used in various applications, ranging from scientific fields over industrial use to integration in consumer products. With an ever growing number of different driver assistance systems, they have been introduced to automotive series production in recent years and are considered an important building block for the practical realisation of autonomous driving. However, due to the potentially large amount of Lidar points per scan, tailored algorithms are required to identify objects (e.g. pedestrians or vehicles) with high precision in a very short time. In this work, we propose an algorithmic approach for real-time instance segmentation of Lidar sensor data. We show how our method leverages the properties of the Euclidean distance to retain three-dimensional measurement information, while being narrowed down to a two-dimensional representation for fast computation. We further introduce what we call "skip connections", to make our approach robust against over-segmentation and improve assignment in cases of partial occlusion. Through detailed evaluation on public data and comparison with established methods, we show how these aspects enable state-of-the-art performance and runtime on a single CPU core.
激光雷达传感器广泛应用于各种应用,从科学领域的工业应用到消费产品的集成。随着不同驾驶辅助系统的数量不断增加,近年来它们已被引入汽车批量生产,并被认为是实际实现自动驾驶的重要组成部分。然而,由于每次扫描可能需要大量的激光雷达点,因此需要定制算法来在很短的时间内高精度地识别物体(例如行人或车辆)。在这项工作中,我们提出了一种激光雷达传感器数据的实时实例分割算法。我们展示了我们的方法如何利用欧几里得距离的属性来保留三维测量信息,同时被缩小到二维表示以进行快速计算。我们进一步引入了我们所谓的“跳过连接”,使我们的方法对过度分割具有鲁棒性,并改善了部分遮挡情况下的分配。通过对公开数据的详细评估以及与现有方法的比较,我们展示了这些方面如何在单个CPU核心上实现最先进的性能和运行时。
{"title":"FLIC: Fast Lidar Image Clustering","authors":"Frederik Hasecke, Lukas Hahn, A. Kummert","doi":"10.5220/0010193700250035","DOIUrl":"https://doi.org/10.5220/0010193700250035","url":null,"abstract":"Lidar sensors are widely used in various applications, ranging from scientific fields over industrial use to integration in consumer products. With an ever growing number of different driver assistance systems, they have been introduced to automotive series production in recent years and are considered an important building block for the practical realisation of autonomous driving. However, due to the potentially large amount of Lidar points per scan, tailored algorithms are required to identify objects (e.g. pedestrians or vehicles) with high precision in a very short time. In this work, we propose an algorithmic approach for real-time instance segmentation of Lidar sensor data. We show how our method leverages the properties of the Euclidean distance to retain three-dimensional measurement information, while being narrowed down to a two-dimensional representation for fast computation. We further introduce what we call \"skip connections\", to make our approach robust against over-segmentation and improve assignment in cases of partial occlusion. Through detailed evaluation on public data and comparison with established methods, we show how these aspects enable state-of-the-art performance and runtime on a single CPU core.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114757142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
International Conference on Pattern Recognition Applications and Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1