首页 > 最新文献

2020 28th European Signal Processing Conference (EUSIPCO)最新文献

英文 中文
Coupled Tensor Model of Atrial Fibrillation ECG 心房颤动心电图的耦合张量模型
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287494
Pedro Marinho R. de Oliveira, V. Zarzoso, C. A. R. Fernandes
Atrial fibrillation (AF) is the most frequent cardiac arrhythmia diagnosed in clinical practice, identified by an uncoordinated and irregular atrial depolarization. However, its electrophysiological mechanisms are still not clearly understood, increasing the intensive clinical research into this challenging cardiac condition in the past few years. The noninvasive extraction of the atrial activity (AA) from multi-lead electrocardiogram (ECG) recordings by signal processing techniques has helped in better understanding this complex arrhythmia. In particular, tensor decomposition techniques have proven to be powerful tools in this task, overcoming the limitations of matrix factorization methods. Exploring the spatial as well as the temporal diversity of ECG recordings, this contribution puts forward a novel noninvasive AA extraction method that models consecutive AF ECG segments as a coupled block-term tensor decomposition, assuming that they share the same spatial signatures. Experiments on synthetic and real data, the latter acquired from persistent AF patients, validate the proposed coupled tensor approach, which provides satisfactory performance with reduced computational cost.
心房颤动(AF)是临床上最常见的心律失常,主要表现为心房去极化不协调和不规则。然而,其电生理机制仍不清楚,这增加了近年来对这一具有挑战性的心脏疾病的深入临床研究。通过信号处理技术从多导联心电图(ECG)记录中无创提取心房活动(AA)有助于更好地理解这种复杂的心律失常。特别是,张量分解技术已被证明是这项任务的强大工具,克服了矩阵分解方法的局限性。研究了ECG记录的空间和时间多样性,提出了一种新的无创AA提取方法,该方法将连续AF ECG段建模为耦合块项张量分解,假设它们具有相同的空间特征。在合成数据和真实数据上的实验验证了所提出的耦合张量方法,该方法在降低计算成本的同时提供了令人满意的性能。
{"title":"Coupled Tensor Model of Atrial Fibrillation ECG","authors":"Pedro Marinho R. de Oliveira, V. Zarzoso, C. A. R. Fernandes","doi":"10.23919/Eusipco47968.2020.9287494","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287494","url":null,"abstract":"Atrial fibrillation (AF) is the most frequent cardiac arrhythmia diagnosed in clinical practice, identified by an uncoordinated and irregular atrial depolarization. However, its electrophysiological mechanisms are still not clearly understood, increasing the intensive clinical research into this challenging cardiac condition in the past few years. The noninvasive extraction of the atrial activity (AA) from multi-lead electrocardiogram (ECG) recordings by signal processing techniques has helped in better understanding this complex arrhythmia. In particular, tensor decomposition techniques have proven to be powerful tools in this task, overcoming the limitations of matrix factorization methods. Exploring the spatial as well as the temporal diversity of ECG recordings, this contribution puts forward a novel noninvasive AA extraction method that models consecutive AF ECG segments as a coupled block-term tensor decomposition, assuming that they share the same spatial signatures. Experiments on synthetic and real data, the latter acquired from persistent AF patients, validate the proposed coupled tensor approach, which provides satisfactory performance with reduced computational cost.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"42 1","pages":"915-919"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83111205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Low-Complexity HEVC Transrating Based on Prediction Unit Mode Inheritance 基于预测单元模式继承的HEVC低复杂度翻译
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287789
Matheus Lindino, Thiago Bubolz, B. Zatt, D. Palomino, G. Corrêa
Video transcoding for bit rate adaptation has become mandatory for over-the-top applications that deliver multimedia content in heterogeneous environments under different network conditions and user capabilities. As transcoding requires sequentially decoding and re-encoding the video bitstream, the computational cost involved in the process is too high, especially when considering current state-of-the-art codecs, such as the High Efficiency Video Coding (HEVC). This work presents a fast HEVC transcoder for bit rate adaptation based on Prediction Unit (PU) mode inheritance, which uses information gathered from the HEVC decoding process to accelerate PU mode decision in the re-encoding process. Experimental results show that the proposed method achieves an average transrating time reduction of 42% at the cost of a bitrate increase of 0.54%.
为适应比特率而进行的视频转码已经成为在不同网络条件和用户能力的异构环境中交付多媒体内容的顶级应用程序的必备条件。由于转码需要依次解码和重新编码视频比特流,因此该过程所涉及的计算成本太高,特别是考虑到当前最先进的编解码器,如高效视频编码(HEVC)。本文提出了一种基于预测单元(PU)模式继承的快速HEVC转码器,该转码器利用HEVC解码过程中收集的信息来加速重编码过程中PU模式的决策。实验结果表明,该方法平均翻译时间缩短42%,比特率提高0.54%。
{"title":"Low-Complexity HEVC Transrating Based on Prediction Unit Mode Inheritance","authors":"Matheus Lindino, Thiago Bubolz, B. Zatt, D. Palomino, G. Corrêa","doi":"10.23919/Eusipco47968.2020.9287789","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287789","url":null,"abstract":"Video transcoding for bit rate adaptation has become mandatory for over-the-top applications that deliver multimedia content in heterogeneous environments under different network conditions and user capabilities. As transcoding requires sequentially decoding and re-encoding the video bitstream, the computational cost involved in the process is too high, especially when considering current state-of-the-art codecs, such as the High Efficiency Video Coding (HEVC). This work presents a fast HEVC transcoder for bit rate adaptation based on Prediction Unit (PU) mode inheritance, which uses information gathered from the HEVC decoding process to accelerate PU mode decision in the re-encoding process. Experimental results show that the proposed method achieves an average transrating time reduction of 42% at the cost of a bitrate increase of 0.54%.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"14 1","pages":"550-554"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80928905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Comparison of Convolution Types in CNN-based Feature Extraction for Sound Source Localization 基于cnn的声源定位特征提取中卷积类型的比较
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287344
D. Krause, A. Politis, K. Kowalczyk
This paper presents an overview of several approaches to convolutional feature extraction in the context of deep neural network (DNN) based sound source localization. Different ways of processing multichannel audio data in the time-frequency domain using convolutional neural networks (CNNs) are described and tested with the aim to provide a comparative study of their performance. In most considered approaches, models are trained with phase and magnitude components of the Short-Time Fourier Transform (STFT). In addition to state-of-the-art 2D convolutional layers, we investigate several solutions for the processing of 3D matrices containing multichannel complex representation of the microphone signals. The first two proposed approaches are the 3D convolutions and depthwise separable convolutions in which two types of filters are used to exploit information within and between the channels. Note that this paper presents the first application of depthwise separable convolutions in a task of sound source localization. The third approach is based on complex-valued neural networks which allows for performing convolutions directly on complex signal representations. Experiments are conducted using two synthetic datasets containing noise and speech signals recorded using a tetrahedral microphone array. The paper presents the results obtained using all investigated model types and discusses the resulting accuracy and computational complexity in DNN-based source localization.
本文综述了基于深度神经网络(DNN)的声源定位中卷积特征提取的几种方法。描述并测试了使用卷积神经网络(cnn)在时频域处理多通道音频数据的不同方法,目的是对它们的性能进行比较研究。在大多数考虑的方法中,模型是用短时傅里叶变换(STFT)的相位和幅度分量来训练的。除了最先进的2D卷积层,我们还研究了几种用于处理包含麦克风信号的多通道复杂表示的3D矩阵的解决方案。提出的前两种方法是3D卷积和深度可分离卷积,其中使用两种类型的滤波器来利用通道内部和通道之间的信息。值得注意的是,本文首次提出了深度可分离卷积在声源定位任务中的应用。第三种方法是基于复值神经网络,它允许在复杂信号表示上直接执行卷积。实验采用四面体麦克风阵列记录的噪声和语音信号合成数据集进行。本文给出了使用所有研究模型类型获得的结果,并讨论了基于dnn的源定位的精度和计算复杂度。
{"title":"Comparison of Convolution Types in CNN-based Feature Extraction for Sound Source Localization","authors":"D. Krause, A. Politis, K. Kowalczyk","doi":"10.23919/Eusipco47968.2020.9287344","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287344","url":null,"abstract":"This paper presents an overview of several approaches to convolutional feature extraction in the context of deep neural network (DNN) based sound source localization. Different ways of processing multichannel audio data in the time-frequency domain using convolutional neural networks (CNNs) are described and tested with the aim to provide a comparative study of their performance. In most considered approaches, models are trained with phase and magnitude components of the Short-Time Fourier Transform (STFT). In addition to state-of-the-art 2D convolutional layers, we investigate several solutions for the processing of 3D matrices containing multichannel complex representation of the microphone signals. The first two proposed approaches are the 3D convolutions and depthwise separable convolutions in which two types of filters are used to exploit information within and between the channels. Note that this paper presents the first application of depthwise separable convolutions in a task of sound source localization. The third approach is based on complex-valued neural networks which allows for performing convolutions directly on complex signal representations. Experiments are conducted using two synthetic datasets containing noise and speech signals recorded using a tetrahedral microphone array. The paper presents the results obtained using all investigated model types and discusses the resulting accuracy and computational complexity in DNN-based source localization.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"138 1","pages":"820-824"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80988283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Robust EEG Source Localization Using Subspace Principal Vector Projection Technique 基于子空间主向量投影技术的鲁棒脑电信号源定位
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287482
Amita Giri, L. Kumar, T. Gandhi
ElectroEncephaloGram (EEG) signals based Brain Source Localization (BSL) has been an active area of research. The performance of BSL algorithms is severely degraded in the presence of background interferences. Pre-Whitening (PW) based approach to deal with such interference assumes temporal stationarity of the data which does not hold good for EEG based processing. Null Projection (NP) based approach relaxes the temporal stationarity. However, the strict spatial stationarity of the number of interfering sources is maintained between control state and activity state measurement. In practical scenarios where an interference source that exists only in the control state, and does not appear in activity state, NP based approach removes a higher dimension space from the activity data leading to its poor performance. The proposed Subspace Principal Vector Projection (SPVP) based approach utilizes subspace correlation based common interference statistics and thus relaxing the strict spatial stationarity condition. In particular, SPVP based MUltiple SIgnal Classification (MUSIC) and Linearly Constrained Minimum Variance (LCMV) algorithms are presented for BSL. Simulation and experiment with real EEG data from Physionet dataset involving motor imagery task illustrate the effectiveness of the proposed algorithms in robust BSL with interference suppression.
基于脑电图(EEG)信号的脑源定位(BSL)一直是一个活跃的研究领域。在存在背景干扰的情况下,BSL算法的性能严重下降。基于预白化(PW)的处理这种干扰的方法假定数据具有时间平稳性,这不利于基于脑电的处理。基于零投影(NP)的方法放宽了时间平稳性。然而,在控制状态和活动状态测量之间保持了干扰源数量的严格空间平稳性。在实际场景中,干扰源只存在于控制状态,而不出现在活动状态,基于NP的方法从活动数据中删除了高维空间,导致其性能不佳。提出的基于子空间主向量投影(SPVP)的方法利用基于子空间相关的共同干扰统计量,从而放宽了严格的空间平稳性条件。特别提出了基于SPVP的多信号分类(MUSIC)和线性约束最小方差(LCMV)算法。利用Physionet数据集的真实脑电数据(包括运动图像任务)进行仿真和实验,验证了该算法在抑制干扰的鲁棒性脑电信号处理中的有效性。
{"title":"Robust EEG Source Localization Using Subspace Principal Vector Projection Technique","authors":"Amita Giri, L. Kumar, T. Gandhi","doi":"10.23919/Eusipco47968.2020.9287482","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287482","url":null,"abstract":"ElectroEncephaloGram (EEG) signals based Brain Source Localization (BSL) has been an active area of research. The performance of BSL algorithms is severely degraded in the presence of background interferences. Pre-Whitening (PW) based approach to deal with such interference assumes temporal stationarity of the data which does not hold good for EEG based processing. Null Projection (NP) based approach relaxes the temporal stationarity. However, the strict spatial stationarity of the number of interfering sources is maintained between control state and activity state measurement. In practical scenarios where an interference source that exists only in the control state, and does not appear in activity state, NP based approach removes a higher dimension space from the activity data leading to its poor performance. The proposed Subspace Principal Vector Projection (SPVP) based approach utilizes subspace correlation based common interference statistics and thus relaxing the strict spatial stationarity condition. In particular, SPVP based MUltiple SIgnal Classification (MUSIC) and Linearly Constrained Minimum Variance (LCMV) algorithms are presented for BSL. Simulation and experiment with real EEG data from Physionet dataset involving motor imagery task illustrate the effectiveness of the proposed algorithms in robust BSL with interference suppression.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"50 1","pages":"1075-1079"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81038212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Deep Learning Method with CRF for Instance Segmentation of Metal-Organic Frameworks in Scanning Electron Microscopy Images 基于CRF的扫描电镜图像中金属-有机框架实例分割的深度学习方法
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287366
Ilyes Batatia
This paper proposes an integrated method for recognizing special crystals, called metal-organic frameworks (MOF), in scanning electron microscopy images (SEM). The proposed approach combines two deep learning networks and a dense conditional random field (CRF) to perform image segmentation. A modified Unet-like convolutional neural network (CNN), incorporating dilatation techniques using atrous convolution, is designed to segment cluttered objects in the SEM image. The dense CRF is tailored to enhance object boundaries and recover small objects. The unary energy of the CRF is obtained from the CNN. And the pairwise energy is estimated using mean field approximation. The resulting segmented regions are fed to a fully connected CNN that performs instance recognition. The method has been trained on a dataset of 500 images with 3200 objects from 3 classes. Testing achieves an overall accuracy of 95.7% MOF recognition. The proposed method opens up the possibility for developing automated chemical process monitoring that allows researchers to optimize conditions of MOF synthesis.
本文提出了一种识别扫描电子显微镜图像中特殊晶体(金属有机框架)的集成方法。该方法结合了两个深度学习网络和一个密集条件随机场(CRF)来进行图像分割。一种改进的类unet卷积神经网络(CNN),结合使用亚特卷积的扩张技术,设计用于分割扫描电镜图像中的杂乱物体。密集CRF是针对增强目标边界和恢复小目标而定制的。CRF的一元能量由CNN得到。并利用平均场近似法对能量进行了估计。得到的分割区域被馈送到执行实例识别的全连接CNN。该方法在500张图像的数据集上进行了训练,其中包含来自3个类的3200个对象。测试结果表明,MOF识别的总体准确率为95.7%。所提出的方法开辟了开发自动化化学过程监测的可能性,使研究人员能够优化MOF合成的条件。
{"title":"A Deep Learning Method with CRF for Instance Segmentation of Metal-Organic Frameworks in Scanning Electron Microscopy Images","authors":"Ilyes Batatia","doi":"10.23919/Eusipco47968.2020.9287366","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287366","url":null,"abstract":"This paper proposes an integrated method for recognizing special crystals, called metal-organic frameworks (MOF), in scanning electron microscopy images (SEM). The proposed approach combines two deep learning networks and a dense conditional random field (CRF) to perform image segmentation. A modified Unet-like convolutional neural network (CNN), incorporating dilatation techniques using atrous convolution, is designed to segment cluttered objects in the SEM image. The dense CRF is tailored to enhance object boundaries and recover small objects. The unary energy of the CRF is obtained from the CNN. And the pairwise energy is estimated using mean field approximation. The resulting segmented regions are fed to a fully connected CNN that performs instance recognition. The method has been trained on a dataset of 500 images with 3200 objects from 3 classes. Testing achieves an overall accuracy of 95.7% MOF recognition. The proposed method opens up the possibility for developing automated chemical process monitoring that allows researchers to optimize conditions of MOF synthesis.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"85 1","pages":"625-629"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78831563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Fast Ray Space Transform for Wave Field Processing using Acoustic Arrays 利用声阵列进行波场处理的快速射线空间变换
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287380
Federico Borra, Mirco Pezzoli, Luca Comanducci, A. Bernardini, F. Antonacci, S. Tubaro, A. Sarti
The importance of soundfield imaging techniques is expected to further increase in the next few years thanks to the ever-increasing availability of low-cost sensors such as MEMS microphones. When it comes to processing a relevant number of sensor signals, however, the computational load of space-time processing algorithms easily grows to unmanageable levels. The Ray Space Transform (RST) was recently introduced as a promising tool for soundfield analysis. Given the collection of signals captured by a uniform linear array of microphones, the RST allows us to collect and map the directional components of the acoustic field onto a domain called "ray space", where relevant acoustic objects are represented as linear patterns for advanced acoustic analysis and synthesis applications. So far the computational complexity of the RST linearly increases with the number of microphones. In order to alleviate this problem, in this paper we propose an alternative efficient implementation of the RST based on the Non Uniform Fast Fourier Transform.
由于MEMS麦克风等低成本传感器的不断普及,声场成像技术的重要性预计将在未来几年内进一步提高。然而,当涉及到处理相关数量的传感器信号时,时空处理算法的计算量很容易增长到难以控制的水平。射线空间变换(RST)作为声场分析的一种很有前途的工具最近被引入。考虑到由麦克风的均匀线性阵列捕获的信号的收集,RST允许我们收集声场的方向分量并将其映射到称为“射线空间”的域中,其中相关的声学对象表示为高级声学分析和合成应用的线性模式。到目前为止,RST的计算复杂度随着麦克风数量的增加而线性增加。为了缓解这一问题,本文提出了一种基于非均匀快速傅里叶变换的RST的高效实现方法。
{"title":"A Fast Ray Space Transform for Wave Field Processing using Acoustic Arrays","authors":"Federico Borra, Mirco Pezzoli, Luca Comanducci, A. Bernardini, F. Antonacci, S. Tubaro, A. Sarti","doi":"10.23919/Eusipco47968.2020.9287380","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287380","url":null,"abstract":"The importance of soundfield imaging techniques is expected to further increase in the next few years thanks to the ever-increasing availability of low-cost sensors such as MEMS microphones. When it comes to processing a relevant number of sensor signals, however, the computational load of space-time processing algorithms easily grows to unmanageable levels. The Ray Space Transform (RST) was recently introduced as a promising tool for soundfield analysis. Given the collection of signals captured by a uniform linear array of microphones, the RST allows us to collect and map the directional components of the acoustic field onto a domain called \"ray space\", where relevant acoustic objects are represented as linear patterns for advanced acoustic analysis and synthesis applications. So far the computational complexity of the RST linearly increases with the number of microphones. In order to alleviate this problem, in this paper we propose an alternative efficient implementation of the RST based on the Non Uniform Fast Fourier Transform.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"26 1","pages":"186-190"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82788335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Detection of Package Edges in Distance Maps 距离图中包边的检测
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287558
E. Vasileva, Nenad Avramovski, Z. Ivanovski
This paper presents a CNN-based algorithm for detecting package edges in a scene represented with a distance map (range image), trained on a custom dataset of packaging scenarios. The proposed algorithm represents the basis for package recognition for automatic trailer loading/unloading. The main focus of this paper is designing a semantic segmentation CNN model capable of detecting different types of package edges in a distance map containing distance errors characteristic of Time-of-Flight (ToF) scanning, and differentiating box edges from edges belonging to other types of packaging objects (bags, irregular objects, etc.). The proposed CNN is optimized for training with a limited number of samples containing heavily imbalanced classes. Generating a binary mask of edges with 1-pixel thickness from the probability maps outputted from the CNN is achieved through a custom non-maximum suppression-based edge thinning algorithm. The proposed algorithm shows promising results in detecting box edges.
本文提出了一种基于cnn的算法,用于在包装场景的自定义数据集上训练的距离图(距离图像)表示的场景中检测包装边缘。该算法为拖车自动装卸货物的识别奠定了基础。本文的主要重点是设计一个语义分割CNN模型,该模型能够在包含飞行时间(ToF)扫描距离误差特征的距离图中检测不同类型的包装边缘,并将盒子边缘与属于其他类型包装物体(袋子、不规则物体等)的边缘区分开来。所提出的CNN针对有限数量的样本进行了优化,样本中包含严重不平衡的类别。通过自定义的基于非最大抑制的边缘细化算法,从CNN输出的概率图中生成厚度为1像素的边缘二进制掩码。该算法在检测盒边缘方面取得了良好的效果。
{"title":"Detection of Package Edges in Distance Maps","authors":"E. Vasileva, Nenad Avramovski, Z. Ivanovski","doi":"10.23919/Eusipco47968.2020.9287558","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287558","url":null,"abstract":"This paper presents a CNN-based algorithm for detecting package edges in a scene represented with a distance map (range image), trained on a custom dataset of packaging scenarios. The proposed algorithm represents the basis for package recognition for automatic trailer loading/unloading. The main focus of this paper is designing a semantic segmentation CNN model capable of detecting different types of package edges in a distance map containing distance errors characteristic of Time-of-Flight (ToF) scanning, and differentiating box edges from edges belonging to other types of packaging objects (bags, irregular objects, etc.). The proposed CNN is optimized for training with a limited number of samples containing heavily imbalanced classes. Generating a binary mask of edges with 1-pixel thickness from the probability maps outputted from the CNN is achieved through a custom non-maximum suppression-based edge thinning algorithm. The proposed algorithm shows promising results in detecting box edges.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"24 1","pages":"600-604"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89963709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Feature-based Response Prediction to Immunotherapy of late-stage Melanoma Patients Using PET/MR Imaging 利用PET/MR成像预测晚期黑色素瘤患者免疫治疗的特征反应
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287571
Annika Liebgott, S. Gatidis, V. Vu, Tobias Haueise, K. Nikolaou, Bin Yang
The treatment of malignant melanoma with immunotherapy is a promising approach to treat advanced stages of the disease. However, the treatment can cause serious side effects and not every patient responds to it. This means, crucial time may be wasted on an ineffective treatment. Assessment of the possible therapy response is hence an important research issue. The research presented in this study focuses on the investigation of the potential of medical imaging and machine learning to solve this task. To this end, we extracted image features from multi-modal images and trained a classifier to differentiate non-responsive patients from responsive ones.
用免疫疗法治疗恶性黑色素瘤是一种治疗晚期疾病的有希望的方法。然而,这种疗法会引起严重的副作用,并不是每个病人都对它有反应。这意味着,关键的时间可能会浪费在无效的治疗上。因此,评估可能的治疗反应是一个重要的研究问题。本研究中提出的研究重点是研究医学成像和机器学习解决这一任务的潜力。为此,我们从多模态图像中提取图像特征,并训练分类器来区分无反应患者和有反应患者。
{"title":"Feature-based Response Prediction to Immunotherapy of late-stage Melanoma Patients Using PET/MR Imaging","authors":"Annika Liebgott, S. Gatidis, V. Vu, Tobias Haueise, K. Nikolaou, Bin Yang","doi":"10.23919/Eusipco47968.2020.9287571","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287571","url":null,"abstract":"The treatment of malignant melanoma with immunotherapy is a promising approach to treat advanced stages of the disease. However, the treatment can cause serious side effects and not every patient responds to it. This means, crucial time may be wasted on an ineffective treatment. Assessment of the possible therapy response is hence an important research issue. The research presented in this study focuses on the investigation of the potential of medical imaging and machine learning to solve this task. To this end, we extracted image features from multi-modal images and trained a classifier to differentiate non-responsive patients from responsive ones.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"54 1","pages":"1229-1233"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88116257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
One or Two Frequencies? The Scattering Transform Answers 一个还是两个频率?散射变换答案
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287216
V. Lostanlen, Alice Cohen-Hadria, J. Bello
With the aim of constructing a biologically plausible model of machine listening, we study the representation of a multicomponent stationary signal by a wavelet scattering network. First, we show that renormalizing second-order nodes by their first-order parents gives a simple numerical criterion to assess whether two neighboring components will interfere psychoacoustically. Secondly, we run a manifold learning algorithm (Isomap) on scattering coefficients to visualize the similarity space underlying parametric additive synthesis. Thirdly, we generalize the “one or two components” framework to three sine waves or more, and prove that the effective scattering depth of a Fourier series grows in logarithmic proportion to its bandwidth.
为了构建一个生物学上合理的机器听力模型,我们研究了用小波散射网络表示多分量平稳信号。首先,我们证明了二阶节点的一阶父节点的重规格化给出了一个简单的数值准则来评估两个相邻分量是否会产生心理声学干扰。其次,我们在散射系数上运行流形学习算法(Isomap)来可视化参数加性合成的相似空间。第三,我们将“一或两分量”框架推广到三个或更多正弦波,并证明了傅里叶级数的有效散射深度与带宽成对数比例增长。
{"title":"One or Two Frequencies? The Scattering Transform Answers","authors":"V. Lostanlen, Alice Cohen-Hadria, J. Bello","doi":"10.23919/Eusipco47968.2020.9287216","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287216","url":null,"abstract":"With the aim of constructing a biologically plausible model of machine listening, we study the representation of a multicomponent stationary signal by a wavelet scattering network. First, we show that renormalizing second-order nodes by their first-order parents gives a simple numerical criterion to assess whether two neighboring components will interfere psychoacoustically. Secondly, we run a manifold learning algorithm (Isomap) on scattering coefficients to visualize the similarity space underlying parametric additive synthesis. Thirdly, we generalize the “one or two components” framework to three sine waves or more, and prove that the effective scattering depth of a Fourier series grows in logarithmic proportion to its bandwidth.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"11 1","pages":"2205-2209"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87877775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Data-driven Kernel-based Probabilistic SAX for Time Series Dimensionality Reduction 基于数据驱动核的时间序列降维概率SAX
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287311
Konstantinos Bountrogiannis, G. Tzagkarakis, P. Tsakalides
The ever-increasing volume and complexity of time series data, emerging in various application domains, necessitate efficient dimensionality reduction for facilitating data mining tasks. Symbolic representations, among them symbolic aggregate approximation (SAX), have proven very effective in compacting the information content of time series while exploiting the wealth of search algorithms used in bioinformatics and text mining communities. However, typical SAX-based techniques rely on a Gaussian assumption for the underlying data statistics, which often deteriorates their performance in practical scenarios. To overcome this limitation, this work introduces a method that negates any assumption on the probability distribution of time series. Specifically, a data-driven kernel density estimator is first applied on the data, followed by Lloyd-Max quantization to determine the optimal horizontal segmentation breakpoints. Experimental evaluation on distinct datasets demonstrates the superiority of our method, in terms of reconstruction accuracy and tightness of lower bound, when compared against the conventional and a modified SAX method.
随着时间序列数据在各个应用领域的不断增长和复杂性,需要有效的降维来促进数据挖掘任务。符号表示,其中包括符号聚合近似(SAX),已被证明在压缩时间序列的信息内容方面非常有效,同时利用了生物信息学和文本挖掘社区中使用的丰富搜索算法。然而,典型的基于sax的技术依赖于底层数据统计的高斯假设,这通常会降低它们在实际场景中的性能。为了克服这一限制,本工作引入了一种方法,该方法否定了对时间序列概率分布的任何假设。具体而言,首先对数据应用数据驱动的核密度估计器,然后使用Lloyd-Max量化来确定最佳的水平分割断点。不同数据集的实验评估表明,与传统的和改进的SAX方法相比,我们的方法在重建精度和下界紧密性方面具有优势。
{"title":"Data-driven Kernel-based Probabilistic SAX for Time Series Dimensionality Reduction","authors":"Konstantinos Bountrogiannis, G. Tzagkarakis, P. Tsakalides","doi":"10.23919/Eusipco47968.2020.9287311","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287311","url":null,"abstract":"The ever-increasing volume and complexity of time series data, emerging in various application domains, necessitate efficient dimensionality reduction for facilitating data mining tasks. Symbolic representations, among them symbolic aggregate approximation (SAX), have proven very effective in compacting the information content of time series while exploiting the wealth of search algorithms used in bioinformatics and text mining communities. However, typical SAX-based techniques rely on a Gaussian assumption for the underlying data statistics, which often deteriorates their performance in practical scenarios. To overcome this limitation, this work introduces a method that negates any assumption on the probability distribution of time series. Specifically, a data-driven kernel density estimator is first applied on the data, followed by Lloyd-Max quantization to determine the optimal horizontal segmentation breakpoints. Experimental evaluation on distinct datasets demonstrates the superiority of our method, in terms of reconstruction accuracy and tightness of lower bound, when compared against the conventional and a modified SAX method.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"59 1","pages":"2343-2347"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85829075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2020 28th European Signal Processing Conference (EUSIPCO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1