首页 > 最新文献

2022 30th European Signal Processing Conference (EUSIPCO)最新文献

英文 中文
Hypernetworks for Sound event Detection: a Proof-of-Concept 用于声音事件检测的超网络:概念验证
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909716
Shubhr Singh, Huy Phan, Emmanouil Benetos
Polyphonic sound event detection (SED) involves the pre-diction of sound events present in an audio recording along with their onset and offset times. Recently, Deep Neural Net-works, specifically convolutional recurrent neural networks (CRNN) have achieved impressive results for this task. The convolution part of the architecture is used to extract trans-lational invariant features from the input and the recurrent part learns the underlying temporal relationship between au-dio frames. Recent studies showed that the weight sharing paradigm of recurrent networks might be a hindering factor in certain kinds of time series data, specifically where there is a temporal conditional shift, i.e. the conditional distribution of a label changes across the temporal scale. This warrants a relevant question - is there a similar phenomenon in poly-phonic sound events due to dynamic polyphony level across the temporal axis? In this work, we explore this question and inquire if relaxed weight sharing improves performance of a CRNN for polyphonic SED. We propose to use hyper-networks to relax weight sharing in the recurrent part and show that the CRNN's performance is improved by ≈ 3% across two datasets, thus paving the way for further explo-ration of the existence of temporal conditional shift for poly-phonic SED.
复调声音事件检测(SED)涉及到对音频记录中出现的声音事件及其开始和偏移时间的预测。最近,深度神经网络,特别是卷积递归神经网络(CRNN)在这项任务上取得了令人印象深刻的成果。该体系结构的卷积部分用于从输入中提取平移不变特征,循环部分用于学习au- audio帧之间的潜在时间关系。最近的研究表明,在某些类型的时间序列数据中,循环网络的权重共享范式可能是一个阻碍因素,特别是在存在时间条件转移的情况下,即标签的条件分布在时间尺度上发生了变化。这就提出了一个相关的问题——在复音事件中,由于动态复音水平跨越时间轴,是否也存在类似的现象?在这项工作中,我们探讨了这个问题,并询问放宽权值共享是否可以提高重音SED的CRNN性能。我们建议使用超网络来放松循环部分的权值共享,并表明CRNN的性能在两个数据集上提高了约3%,从而为进一步探索多音SED的时间条件移位的存在铺平了道路。
{"title":"Hypernetworks for Sound event Detection: a Proof-of-Concept","authors":"Shubhr Singh, Huy Phan, Emmanouil Benetos","doi":"10.23919/eusipco55093.2022.9909716","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909716","url":null,"abstract":"Polyphonic sound event detection (SED) involves the pre-diction of sound events present in an audio recording along with their onset and offset times. Recently, Deep Neural Net-works, specifically convolutional recurrent neural networks (CRNN) have achieved impressive results for this task. The convolution part of the architecture is used to extract trans-lational invariant features from the input and the recurrent part learns the underlying temporal relationship between au-dio frames. Recent studies showed that the weight sharing paradigm of recurrent networks might be a hindering factor in certain kinds of time series data, specifically where there is a temporal conditional shift, i.e. the conditional distribution of a label changes across the temporal scale. This warrants a relevant question - is there a similar phenomenon in poly-phonic sound events due to dynamic polyphony level across the temporal axis? In this work, we explore this question and inquire if relaxed weight sharing improves performance of a CRNN for polyphonic SED. We propose to use hyper-networks to relax weight sharing in the recurrent part and show that the CRNN's performance is improved by ≈ 3% across two datasets, thus paving the way for further explo-ration of the existence of temporal conditional shift for poly-phonic SED.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127374303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semi-Supervised Online Speaker Diarization using Vector Quantization with Alternative Codebooks 基于矢量量化的可选码本半监督在线说话人二分化
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909891
Mahmoud El-Hindi, Michael Muma, A. Zoubir
Speaker diarization systems process audio files by labelling speech segments according to speakers' identities. Many speaker diarization systems work offline and are not suited for online applications. We present a semi-supervised, online, low-complexity system. While, in general, speaker diarization operates in an unsupervised manner, the presented system relies on the enrollment of the participating speakers in the conversation. The diarization system has two main novel aspects. The first one is a proposed online learning strategy that evaluates processed segments according to their usefulness for learning a speaker, i.e. update a speaker model with it. The segment is evaluated using two metrics to determine whether to use the segment to update the system. The second novel aspect is a proposed vector quantization approach that models the score not only depending on the target speaker codebook but also takes an alternative codebook into account. We also present an approach to compute the alternative codebook. Simulation results show that the proposed system outperforms a comparable system without the proposed online learning strategy and shows benefits, especially for short training lengths.
说话人分类系统通过根据说话人的身份标记语音片段来处理音频文件。许多扬声器拨号系统离线工作,不适合在线应用。我们提出了一个半监督、在线、低复杂度的系统。虽然一般来说,说话人分界是以一种无监督的方式进行的,但所呈现的系统依赖于对话中参与说话人的登记。该系统有两个主要的新颖之处。第一个是提出的在线学习策略,该策略根据处理后的片段对学习说话人的有用性进行评估,即用它更新说话人模型。使用两个度量来评估段,以确定是否使用该段更新系统。第二个新颖的方面是提出的矢量量化方法,该方法不仅根据目标说话人的码本建模得分,而且还考虑了另一个码本。我们还提出了一种计算替代码本的方法。仿真结果表明,所提出的系统优于没有提出在线学习策略的同类系统,并显示出优势,特别是在短训练时间下。
{"title":"Semi-Supervised Online Speaker Diarization using Vector Quantization with Alternative Codebooks","authors":"Mahmoud El-Hindi, Michael Muma, A. Zoubir","doi":"10.23919/eusipco55093.2022.9909891","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909891","url":null,"abstract":"Speaker diarization systems process audio files by labelling speech segments according to speakers' identities. Many speaker diarization systems work offline and are not suited for online applications. We present a semi-supervised, online, low-complexity system. While, in general, speaker diarization operates in an unsupervised manner, the presented system relies on the enrollment of the participating speakers in the conversation. The diarization system has two main novel aspects. The first one is a proposed online learning strategy that evaluates processed segments according to their usefulness for learning a speaker, i.e. update a speaker model with it. The segment is evaluated using two metrics to determine whether to use the segment to update the system. The second novel aspect is a proposed vector quantization approach that models the score not only depending on the target speaker codebook but also takes an alternative codebook into account. We also present an approach to compute the alternative codebook. Simulation results show that the proposed system outperforms a comparable system without the proposed online learning strategy and shows benefits, especially for short training lengths.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127203912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross-Level Semantic Segmentation Guided Feature Space Decoupling And Augmentation for Fine-Grained Ship Detection 基于跨层次语义分割的细粒度船舶检测特征空间解耦与增强
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909586
Zhengning Zhang, Lin Zhang, Yue Wang, P. Feng, Shaobo Liu, Jian Wang
Fine-grained ship detection in optical remote sensing images is a challenging problem due to its long-tailed distributed dataset, which is often coupled with the multi-scale of ship and complex environment. In this paper, a novel average instance area imbalance ratio (AIAIR) is firstly used for quantitatively evaluating long-tailed distribution and multi-scale coupled problem. Based on which, we propose the idea of feature space decoupling and augmentation guided by cross-Level semantic segmentation, where features on different classwise-balance level are scheduled. On this basis, a Siamese Semantic Segmentation Guided Ship Detection Network (SGSDet) is proposed to effectively facilitate fine-grained ship detection performance. Our proposed method can be easily plugged into existing object detection models. Numerical experiments show that the proposed method outperforms the baseline by 2.32% mAP on the ShipRSImageNet dataset without extra annotations.
由于光学遥感图像的分布式数据集长尾,加之船舶的多尺度和环境的复杂性,使得细粒度船舶检测成为一个具有挑战性的问题。本文首次将一种新的平均实例面积不平衡比(AIAIR)用于定量评价长尾分布和多尺度耦合问题。在此基础上,提出了基于跨层语义分割的特征空间解耦和增强思想,对不同类别平衡级别的特征进行调度。在此基础上,提出了一种暹罗语义分割引导船舶检测网络(Siamese Semantic Segmentation Guided Ship Detection Network, SGSDet),有效提升细粒度船舶检测性能。我们的方法可以很容易地插入到现有的目标检测模型中。数值实验表明,在没有额外标注的情况下,该方法在ShipRSImageNet数据集上的mAP值比基线值高出2.32%。
{"title":"Cross-Level Semantic Segmentation Guided Feature Space Decoupling And Augmentation for Fine-Grained Ship Detection","authors":"Zhengning Zhang, Lin Zhang, Yue Wang, P. Feng, Shaobo Liu, Jian Wang","doi":"10.23919/eusipco55093.2022.9909586","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909586","url":null,"abstract":"Fine-grained ship detection in optical remote sensing images is a challenging problem due to its long-tailed distributed dataset, which is often coupled with the multi-scale of ship and complex environment. In this paper, a novel average instance area imbalance ratio (AIAIR) is firstly used for quantitatively evaluating long-tailed distribution and multi-scale coupled problem. Based on which, we propose the idea of feature space decoupling and augmentation guided by cross-Level semantic segmentation, where features on different classwise-balance level are scheduled. On this basis, a Siamese Semantic Segmentation Guided Ship Detection Network (SGSDet) is proposed to effectively facilitate fine-grained ship detection performance. Our proposed method can be easily plugged into existing object detection models. Numerical experiments show that the proposed method outperforms the baseline by 2.32% mAP on the ShipRSImageNet dataset without extra annotations.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128915271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An efficient clustering-based non-fiducial approach for ECG biometric recognition 一种有效的基于聚类的心电生物特征识别非基准方法
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909751
David Meltzer, D. Luengo
Recognition of individuals through different bio-metric traits is becoming increasingly important. Apart from traditional biomarkers (like fingerprints), many alternative traits have been proposed during the last two decades: ECG and EEG signals, iris or facial recognition, behavioral traits, etc. Several works have shown that ECG-based recognition is a feasible alternative for either stand-alone or multibiometric recognition systems. In this paper, we propose a novel, efficient and scalable clustering-based method for ECG biometric recognition. First of all, fixed length segments of the ECG are extracted without the need of computing any fiducial points. Unique traits for each subject are then obtained by computing the autocorrelation (AC) of each segment, followed by the discrete cosine transform (DCT) to compress the available information. Finally, hierarchical ag-glomerative clustering (HAC) is applied to generate the groups that will be used later on for classification. The combination of the DCT to reduce the feature dimensionality and the HAC to decrease the number of features required by the classifier results in an efficient method both from the memory storage and computational point of view. Furthermore, the proposed AC/DCT-HAC (ADH) approach is robust, since no fiducial points (which may be difficult to extract reliably) are required, scalable and attains a better performance than other approaches with higher storage/computational cost.
通过不同的生物特征来识别个体变得越来越重要。除了传统的生物标记(如指纹),在过去的二十年里,许多替代特征被提出:心电图和脑电图信号,虹膜或面部识别,行为特征等。一些工作表明,基于脑电图的识别是独立或多生物识别系统的可行替代方案。本文提出了一种新颖、高效、可扩展的基于聚类的心电生物特征识别方法。首先,在不计算任何基点的情况下提取心电固定长度的片段。然后通过计算每个片段的自相关(AC)来获得每个主题的独特特征,然后通过离散余弦变换(DCT)来压缩可用信息。最后,应用分层聚类(HAC)来生成稍后将用于分类的组。结合DCT来降低特征维数和HAC来减少分类器所需的特征数量,从内存存储和计算的角度来看都是一种有效的方法。此外,所提出的AC/DCT-HAC (ADH)方法具有鲁棒性,因为不需要基准点(可能难以可靠地提取),具有可扩展性,并且比其他具有更高存储/计算成本的方法具有更好的性能。
{"title":"An efficient clustering-based non-fiducial approach for ECG biometric recognition","authors":"David Meltzer, D. Luengo","doi":"10.23919/eusipco55093.2022.9909751","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909751","url":null,"abstract":"Recognition of individuals through different bio-metric traits is becoming increasingly important. Apart from traditional biomarkers (like fingerprints), many alternative traits have been proposed during the last two decades: ECG and EEG signals, iris or facial recognition, behavioral traits, etc. Several works have shown that ECG-based recognition is a feasible alternative for either stand-alone or multibiometric recognition systems. In this paper, we propose a novel, efficient and scalable clustering-based method for ECG biometric recognition. First of all, fixed length segments of the ECG are extracted without the need of computing any fiducial points. Unique traits for each subject are then obtained by computing the autocorrelation (AC) of each segment, followed by the discrete cosine transform (DCT) to compress the available information. Finally, hierarchical ag-glomerative clustering (HAC) is applied to generate the groups that will be used later on for classification. The combination of the DCT to reduce the feature dimensionality and the HAC to decrease the number of features required by the classifier results in an efficient method both from the memory storage and computational point of view. Furthermore, the proposed AC/DCT-HAC (ADH) approach is robust, since no fiducial points (which may be difficult to extract reliably) are required, scalable and attains a better performance than other approaches with higher storage/computational cost.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130553894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model-Based Online Learning for Joint Radar-Communication Systems Operating in Dynamic Interference 动态干扰下联合雷达-通信系统基于模型的在线学习
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909601
Petteri Pulkkinen, V. Koivunen
This paper addresses the problems of co-design and cooperation among radar and communication systems operating in a shared spectrum scenario. Online learning facilitates using the spectrum flexibly while managing and mitigating rapidly time-frequency-space varying interference. We extend the previously proposed Model-Based Online Learning (MBOL) algorithm [1] to allocate frequency and power resources among co-designed and collaborating sensing and communication systems in dynamic interference scenarios. The proposed MBOL algorithm learns a predictive spectrum model using online convex optimization (OCO), assigns sub-bands between sensing and communications tasks, and optimizes their power for the tasks at hand. The performance of the proposed MBOL method is evaluated in simulations using the proposed constrained regret criterion and shown to improve the sensing and communications performance compared to the baseline method in terms of lower and sub-linear constrained regret.
本文讨论了在共享频谱情况下雷达和通信系统之间的协同设计和合作问题。在线学习有助于灵活地使用频谱,同时管理和减轻快速的时频空间变化干扰。我们扩展了先前提出的基于模型的在线学习(MBOL)算法[1],以便在动态干扰场景中在共同设计和协作的传感和通信系统之间分配频率和功率资源。提出的MBOL算法利用在线凸优化(OCO)学习预测频谱模型,在传感和通信任务之间分配子带,并根据手头的任务优化其功率。利用所提出的约束后悔准则在仿真中评估了所提出的MBOL方法的性能,并表明与基线方法相比,在较低和次线性约束后悔方面提高了传感和通信性能。
{"title":"Model-Based Online Learning for Joint Radar-Communication Systems Operating in Dynamic Interference","authors":"Petteri Pulkkinen, V. Koivunen","doi":"10.23919/eusipco55093.2022.9909601","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909601","url":null,"abstract":"This paper addresses the problems of co-design and cooperation among radar and communication systems operating in a shared spectrum scenario. Online learning facilitates using the spectrum flexibly while managing and mitigating rapidly time-frequency-space varying interference. We extend the previously proposed Model-Based Online Learning (MBOL) algorithm [1] to allocate frequency and power resources among co-designed and collaborating sensing and communication systems in dynamic interference scenarios. The proposed MBOL algorithm learns a predictive spectrum model using online convex optimization (OCO), assigns sub-bands between sensing and communications tasks, and optimizes their power for the tasks at hand. The performance of the proposed MBOL method is evaluated in simulations using the proposed constrained regret criterion and shown to improve the sensing and communications performance compared to the baseline method in terms of lower and sub-linear constrained regret.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130840149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Low-Complexity Double EP-Based DFE for Turbo Equalization 基于低复杂度双ep的Turbo均衡DFE
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909782
Congji Yin, Wenjiang Feng, Junbing Li, Guojun Li
In this paper, a new double expectation propagation-based decision feedback equalizer (DEP-DFE) for server inter-symbol interference (ISI) channels employing turbo equalization is proposed. The EP algorithm is used at the equalizer output and the channel decoder output. The proposed DEP-DFE offers a new approach to alleviate error propagation. Additionally, its computational complexity is nearly half of the EP-based minimum mean square error (MMSE)-based linear equalizer (EP-MMSE-LE) proposed by Santos et al. The bit error ratio performance of the proposed equalizer is verified through simulation in the well-known severely frequency selective Proakis-C channel for different scenarios. Simulation results demonstrate that the proposed DEP-DFE can achieve similar or better performance than the EP-MMSE-LE. Moreover, it has significant improvement over the double expectation propagation-based MMSE-LE (DEP-MMSE-LE).
本文提出了一种新的基于双期望传播的决策反馈均衡器(deep - dfe),用于服务器码间干扰(ISI)信道。EP算法用于均衡器输出和信道解码器输出。所提出的deep - dfe为减小误差传播提供了一种新的方法。此外,它的计算复杂度几乎是Santos等人提出的基于ep的最小均方误差(MMSE)线性均衡器(EP-MMSE-LE)的一半。在众所周知的严重选频的Proakis-C信道中,通过不同场景的仿真验证了该均衡器的误码率性能。仿真结果表明,所提出的deep - dfe可以达到与EP-MMSE-LE相似或更好的性能。此外,它比基于双期望传播的MMSE-LE (deep -MMSE-LE)有显著的改进。
{"title":"A Low-Complexity Double EP-Based DFE for Turbo Equalization","authors":"Congji Yin, Wenjiang Feng, Junbing Li, Guojun Li","doi":"10.23919/eusipco55093.2022.9909782","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909782","url":null,"abstract":"In this paper, a new double expectation propagation-based decision feedback equalizer (DEP-DFE) for server inter-symbol interference (ISI) channels employing turbo equalization is proposed. The EP algorithm is used at the equalizer output and the channel decoder output. The proposed DEP-DFE offers a new approach to alleviate error propagation. Additionally, its computational complexity is nearly half of the EP-based minimum mean square error (MMSE)-based linear equalizer (EP-MMSE-LE) proposed by Santos et al. The bit error ratio performance of the proposed equalizer is verified through simulation in the well-known severely frequency selective Proakis-C channel for different scenarios. Simulation results demonstrate that the proposed DEP-DFE can achieve similar or better performance than the EP-MMSE-LE. Moreover, it has significant improvement over the double expectation propagation-based MMSE-LE (DEP-MMSE-LE).","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130840759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Cover Source Mismatch Problem in Deep-Learning Steganalysis 深度学习隐写分析中的覆盖源不匹配问题
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909553
Quentin Giboulot, Patrick Bas, R. Cogranne, Dirk Borghys
This paper studies the problem of Cover Source Mismatch (CSM) in steganalysis, i.e. the impact of a testing set which does not originate from the same source than the training set. In this study, the trained steganalyzer uses state of the art deep-learning architecture prone to better generalization than feature-based steganalysis. Different sources such as the sensor model, the ISO sensitivity, the processing pipeline and the content, are investigated. Our conclusions are that, on one hand, deep learning steganalysis is still very sensitive to the CSM, on the other hand, the holistic strategy leverages the good generalization properties of deep learning to reduce the CSM with a relatively small number of training samples.
本文研究了隐写分析中的覆盖源不匹配问题,即测试集与训练集的来源不同所产生的影响。在本研究中,经过训练的隐写分析器使用最先进的深度学习架构,比基于特征的隐写分析更容易泛化。不同的来源,如传感器的型号,ISO灵敏度,处理流程和内容,进行了研究。我们的结论是,一方面,深度学习隐写分析对CSM仍然非常敏感,另一方面,整体策略利用深度学习良好的泛化特性,在训练样本数量相对较少的情况下减少CSM。
{"title":"The Cover Source Mismatch Problem in Deep-Learning Steganalysis","authors":"Quentin Giboulot, Patrick Bas, R. Cogranne, Dirk Borghys","doi":"10.23919/eusipco55093.2022.9909553","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909553","url":null,"abstract":"This paper studies the problem of Cover Source Mismatch (CSM) in steganalysis, i.e. the impact of a testing set which does not originate from the same source than the training set. In this study, the trained steganalyzer uses state of the art deep-learning architecture prone to better generalization than feature-based steganalysis. Different sources such as the sensor model, the ISO sensitivity, the processing pipeline and the content, are investigated. Our conclusions are that, on one hand, deep learning steganalysis is still very sensitive to the CSM, on the other hand, the holistic strategy leverages the good generalization properties of deep learning to reduce the CSM with a relatively small number of training samples.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126686706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Deep Residual Learning Based Localization of Near-Field Sources in Unknown Spatially Colored Noise Fields 基于深度残差学习的未知空间有色噪声场近场源定位
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909877
Zhuoqian Jiang, J. Xin, Weiliang Zuo, Nanning Zheng, A. Sano
In this paper, we explore the problem of near-field source localization in an unknown spatially colored noise environment using an end-to-end neural network which is based on deep residual learning. Specifically, the proposed approach uses the multi-dimensional information of the array covariance as input, and finally directly outputs the location information of the near-field sources through the regression structure. The architecture of deep neural network is well designed taking into account the trade-off between the expression ability and compu-tational complexity. In addition, benefiting from the method of generating training data that combines the degree of separation to traverse the spatial location, the proposed approach has a robust performance for different location parameter separation. The simulation results demonstrate that the proposed approach outperforms the existing model-driven methods under various conditions, especially for the adverse scenes with low SNRs, small number of snapshots, or correlated sources.
本文利用基于深度残差学习的端到端神经网络,研究了未知空间彩色噪声环境下的近场源定位问题。具体而言,该方法以阵列协方差的多维信息为输入,最后通过回归结构直接输出近场源的位置信息。深度神经网络的结构设计很好地考虑了表达能力和计算复杂度之间的权衡。此外,得益于结合分离度遍历空间位置生成训练数据的方法,该方法对不同的位置参数分离具有鲁棒性。仿真结果表明,该方法在各种条件下都优于现有的模型驱动方法,特别是在低信噪比、快照数量少或相关源的不利场景下。
{"title":"Deep Residual Learning Based Localization of Near-Field Sources in Unknown Spatially Colored Noise Fields","authors":"Zhuoqian Jiang, J. Xin, Weiliang Zuo, Nanning Zheng, A. Sano","doi":"10.23919/eusipco55093.2022.9909877","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909877","url":null,"abstract":"In this paper, we explore the problem of near-field source localization in an unknown spatially colored noise environment using an end-to-end neural network which is based on deep residual learning. Specifically, the proposed approach uses the multi-dimensional information of the array covariance as input, and finally directly outputs the location information of the near-field sources through the regression structure. The architecture of deep neural network is well designed taking into account the trade-off between the expression ability and compu-tational complexity. In addition, benefiting from the method of generating training data that combines the degree of separation to traverse the spatial location, the proposed approach has a robust performance for different location parameter separation. The simulation results demonstrate that the proposed approach outperforms the existing model-driven methods under various conditions, especially for the adverse scenes with low SNRs, small number of snapshots, or correlated sources.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"238 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123317499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Ensemble Link Learning for Large State Space Multiple Access Communications 面向大状态空间多址通信的集成链路学习
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909958
Talha Bozkus, U. Mitra
Wireless communication networks are well-modeled by Markov Decision Processes (MDPs), but induce a large state space which challenges policy optimization. Reinforcement learning such as Q-learning enables the solution of policy opti-mization problems in unknown environments. Herein a graph-learning algorithm is proposed to improve the accuracy and complexity performance of Q-learning algorithm for a multiple access communications problem. By exploiting the structural properties of the wireless network MDP, several structurally related Markov chains are created and these multiple chains are sampled to learn multiple policies which are fused. Furthermore, a state-action aggregation method is proposed to reduce the time and memory complexity of the algorithm. Numerical results show that the proposed algorithm achieves a reduction of 80% with respect to the policy error and a reduction of 70% for the runtime versus other state-of-the-art $Q$ learning algorithms.
无线通信网络是由马尔可夫决策过程(mdp)很好地建模的,但会产生一个大的状态空间,这给策略优化带来了挑战。Q-learning等强化学习可以解决未知环境下的策略优化问题。针对多址通信问题,提出了一种改进q -学习算法精度和复杂度的图学习算法。利用无线网络MDP的结构特性,建立了若干个结构相关的马尔可夫链,并对这些马尔可夫链进行采样,学习多条融合策略。在此基础上,提出了一种状态-动作聚合方法,降低了算法的时间复杂度和内存复杂度。数值结果表明,与其他先进的$Q$学习算法相比,所提出的算法在策略误差方面减少了80%,在运行时减少了70%。
{"title":"Ensemble Link Learning for Large State Space Multiple Access Communications","authors":"Talha Bozkus, U. Mitra","doi":"10.23919/eusipco55093.2022.9909958","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909958","url":null,"abstract":"Wireless communication networks are well-modeled by Markov Decision Processes (MDPs), but induce a large state space which challenges policy optimization. Reinforcement learning such as Q-learning enables the solution of policy opti-mization problems in unknown environments. Herein a graph-learning algorithm is proposed to improve the accuracy and complexity performance of Q-learning algorithm for a multiple access communications problem. By exploiting the structural properties of the wireless network MDP, several structurally related Markov chains are created and these multiple chains are sampled to learn multiple policies which are fused. Furthermore, a state-action aggregation method is proposed to reduce the time and memory complexity of the algorithm. Numerical results show that the proposed algorithm achieves a reduction of 80% with respect to the policy error and a reduction of 70% for the runtime versus other state-of-the-art $Q$ learning algorithms.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126209361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Neural Network Approach for Ultrasound Attenuation Coefficient Estimation 超声衰减系数估计的神经网络方法
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909948
J. Birdi, J. D’hooge, A. Bertrand
Quantitative ultrasound (QUS) imaging complements the standard B-mode images with a quantitative represen-tation of the target's acoustic properties. Attenuation coefficient is an important parameter characterizing these properties, with applications in medical diagnosis and tissue characterization. Traditional QUS methods use analytical models to estimate this coefficient from the acquired signal. Propagation effects, such as diffraction, which are difficult to model analytically are usually ignored, affecting their estimation accuracy. To tackle this issue, reference phantom measurements are commonly used. These are, however, time-consuming and may not always be feasible, limiting the existing approaches' practical applicability. To overcome these challenges, we leverage recent advances in the deep learning field and propose a neural network approach which takes the magnitude spectra of the backscattered ultrasound signal at different axial depths as the input and provides the target's attenuation coefficient as the output. For the presented proof-of-concept study, the network was trained on a simulated dataset, and learnt a proper model from the training data, thereby avoiding the need for an analytical model. The trained network was tested on both simulated and tissue-mimicking phantom datasets, demonstrating the capability of neural networks to provide accurate attenuation estimates from diffraction affected recordings without a reference phantom measurement.
定量超声(QUS)成像补充了标准的b模式图像与目标的声学特性的定量表示。衰减系数是表征这些特性的重要参数,在医学诊断和组织表征中有着广泛的应用。传统的QUS方法使用解析模型从采集的信号中估计该系数。衍射等难以解析建模的传播效应通常被忽略,影响了其估计精度。为了解决这个问题,通常使用参考幻像测量。然而,这些都是耗时的,可能并不总是可行的,限制了现有方法的实际适用性。为了克服这些挑战,我们利用深度学习领域的最新进展,提出了一种神经网络方法,该方法以不同轴向深度的后向散射超声信号的幅度谱作为输入,并提供目标的衰减系数作为输出。对于提出的概念验证研究,网络在模拟数据集上进行训练,并从训练数据中学习适当的模型,从而避免了对分析模型的需要。经过训练的网络在模拟和组织模拟的幻影数据集上进行了测试,证明了神经网络能够在没有参考幻影测量的情况下,从衍射影响的记录中提供准确的衰减估计。
{"title":"A Neural Network Approach for Ultrasound Attenuation Coefficient Estimation","authors":"J. Birdi, J. D’hooge, A. Bertrand","doi":"10.23919/eusipco55093.2022.9909948","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909948","url":null,"abstract":"Quantitative ultrasound (QUS) imaging complements the standard B-mode images with a quantitative represen-tation of the target's acoustic properties. Attenuation coefficient is an important parameter characterizing these properties, with applications in medical diagnosis and tissue characterization. Traditional QUS methods use analytical models to estimate this coefficient from the acquired signal. Propagation effects, such as diffraction, which are difficult to model analytically are usually ignored, affecting their estimation accuracy. To tackle this issue, reference phantom measurements are commonly used. These are, however, time-consuming and may not always be feasible, limiting the existing approaches' practical applicability. To overcome these challenges, we leverage recent advances in the deep learning field and propose a neural network approach which takes the magnitude spectra of the backscattered ultrasound signal at different axial depths as the input and provides the target's attenuation coefficient as the output. For the presented proof-of-concept study, the network was trained on a simulated dataset, and learnt a proper model from the training data, thereby avoiding the need for an analytical model. The trained network was tested on both simulated and tissue-mimicking phantom datasets, demonstrating the capability of neural networks to provide accurate attenuation estimates from diffraction affected recordings without a reference phantom measurement.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126498877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2022 30th European Signal Processing Conference (EUSIPCO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1