首页 > 最新文献

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献

英文 中文
Hypergraphs with Edge-Dependent Vertex Weights: Spectral Clustering Based on the 1-Laplacian 边缘依赖顶点权值的超图:基于1-拉普拉斯的谱聚类
Yu Zhu, Boning Li, Santiago Segarra
We propose a flexible framework for defining the 1-Laplacian of a hypergraph that incorporates edge-dependent vertex weights. These weights are able to reflect varying importance of vertices within a hyperedge, thus conferring the hypergraph model higher expressivity than homogeneous hypergraphs. We then utilize the eigenvector associated with the second smallest eigenvalue of the hypergraph 1-Laplacian to cluster the vertices. From a theoretical standpoint based on an adequately defined normalized Cheeger cut, this procedure is expected to achieve higher clustering accuracy than that based on the traditional Laplacian. Indeed, we confirm that this is the case using real-world datasets to demonstrate the effectiveness of the proposed spectral clustering approach. Moreover, we show that for a special case within our framework, the corresponding hypergraph 1-Laplacian is equivalent to the 1-Laplacian of a related graph, whose eigenvectors can be computed more efficiently, facilitating the adoption on larger datasets.
我们提出了一个灵活的框架来定义包含边相关顶点权重的超图的1-拉普拉斯算子。这些权重能够反映超边缘中顶点的不同重要性,从而赋予超图模型比齐次超图更高的表达性。然后我们利用与超图1-拉普拉斯的第二个最小特征值相关联的特征向量来聚类顶点。从理论的角度来看,基于充分定义的归一化Cheeger切割,该过程有望获得比基于传统拉普拉斯的聚类精度更高的聚类精度。事实上,我们使用真实世界的数据集证实了这种情况,以证明所提出的光谱聚类方法的有效性。此外,我们证明了在我们的框架内的一个特殊情况下,相应的超图1-拉普拉斯等价于相关图的1-拉普拉斯,其特征向量可以更有效地计算,便于在更大的数据集上采用。
{"title":"Hypergraphs with Edge-Dependent Vertex Weights: Spectral Clustering Based on the 1-Laplacian","authors":"Yu Zhu, Boning Li, Santiago Segarra","doi":"10.1109/icassp43922.2022.9746363","DOIUrl":"https://doi.org/10.1109/icassp43922.2022.9746363","url":null,"abstract":"We propose a flexible framework for defining the 1-Laplacian of a hypergraph that incorporates edge-dependent vertex weights. These weights are able to reflect varying importance of vertices within a hyperedge, thus conferring the hypergraph model higher expressivity than homogeneous hypergraphs. We then utilize the eigenvector associated with the second smallest eigenvalue of the hypergraph 1-Laplacian to cluster the vertices. From a theoretical standpoint based on an adequately defined normalized Cheeger cut, this procedure is expected to achieve higher clustering accuracy than that based on the traditional Laplacian. Indeed, we confirm that this is the case using real-world datasets to demonstrate the effectiveness of the proposed spectral clustering approach. Moreover, we show that for a special case within our framework, the corresponding hypergraph 1-Laplacian is equivalent to the 1-Laplacian of a related graph, whose eigenvectors can be computed more efficiently, facilitating the adoption on larger datasets.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133537029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Low Resources Online Single-Microphone Speech Enhancement with Harmonic Emphasis 低资源在线单麦克风语音增强与谐波重点
Nir Raviv, Ofer Schwartz, S. Gannot
In this paper, we propose a deep neural network (DNN)-based single-microphone speech enhancement algorithm characterized by a short latency and low computational resources. Many speech enhancement algorithms suffer from low noise reduction capabilities between pitch harmonics, and in severe cases, the harmonic structure may even be lost. Recognizing this drawback, we propose a new weighted loss that emphasizes pitch-dominated frequency bands. For that, we propose a method, applied only at the training stage, to detect these frequency bands. The proposed method is applied to speech signals contaminated by several noise types, and in particular, typical domestic noise drawn from ESC-50 and DE-MAND databases, demonstrating its applicability to ‘stay-at-home’ scenarios.
在本文中,我们提出了一种基于深度神经网络(DNN)的单麦克风语音增强算法,该算法具有短延迟和低计算资源的特点。许多语音增强算法在基音谐波之间的降噪能力较低,严重时甚至可能丢失谐波结构。认识到这一缺点,我们提出了一种新的加权损失,强调音调主导的频带。为此,我们提出了一种仅在训练阶段应用的方法来检测这些频段。该方法应用于受多种噪声污染的语音信号,特别是来自ESC-50和DE-MAND数据库的典型家庭噪声,证明其适用于“呆在家里”的场景。
{"title":"Low Resources Online Single-Microphone Speech Enhancement with Harmonic Emphasis","authors":"Nir Raviv, Ofer Schwartz, S. Gannot","doi":"10.1109/icassp43922.2022.9747656","DOIUrl":"https://doi.org/10.1109/icassp43922.2022.9747656","url":null,"abstract":"In this paper, we propose a deep neural network (DNN)-based single-microphone speech enhancement algorithm characterized by a short latency and low computational resources. Many speech enhancement algorithms suffer from low noise reduction capabilities between pitch harmonics, and in severe cases, the harmonic structure may even be lost. Recognizing this drawback, we propose a new weighted loss that emphasizes pitch-dominated frequency bands. For that, we propose a method, applied only at the training stage, to detect these frequency bands. The proposed method is applied to speech signals contaminated by several noise types, and in particular, typical domestic noise drawn from ESC-50 and DE-MAND databases, demonstrating its applicability to ‘stay-at-home’ scenarios.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133542953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Confidence Estimation for Speech Emotion Recognition Based on the Relationship Between Emotion Categories and Primitives 基于情感类别与原语关系的语音情感识别置信度估计
Y. Li, C. Papayiannis, Viktor Rozgic, Elizabeth Shriberg, Chao Wang
Confidence estimation for Speech Emotion Recognition (SER) is instrumental in improving the reliability in the behavior of downstream applications. In this work we propose (1) a novel confidence metric for SER based on the relationship between emotion primitives: arousal, valence, and dominance (AVD) and emotion categories (ECs), (2) EmoConfidNet - a DNN trained alongside the EC recognizer to predict the proposed confidence metric, and (3) a data filtering technique used to enhance the training of EmoConfidNet and the EC recognizer. For each training sample, we calculate distances from corresponding AVD annotation vectors to centroids of each EC in the AVD space, and define EC confidences as functions of the evaluated distances. EmoConfidNet is trained to predict confidence from the same acoustic representations used to train the EC recognizer. EmoConfidNet outperforms state-of-the-art confidence estimation methods on the MSP-Podcast and IEMOCAP datasets. For a fixed EC recognizer, after we reject the same number of low confidence predictions using EmoConfidNet, we achieve a higher F1 and unweighted average recall (UAR) than when rejecting using other methods.
语音情感识别(SER)的置信度估计有助于提高下游应用行为的可靠性。在这项工作中,我们提出了(1)基于情绪原语:唤醒、效价和优势(AVD)和情绪类别(ECs)之间关系的新的SER置信度量,(2)emoconfetnet -与EC识别器一起训练的DNN来预测拟议的置信度量,以及(3)用于增强emoconfetnet和EC识别器训练的数据过滤技术。对于每个训练样本,我们计算对应的AVD注释向量到AVD空间中每个EC质心的距离,并将EC置信度定义为评估距离的函数。emoconfetnet经过训练,可以从与训练EC识别器相同的声学表示中预测置信度。在MSP-Podcast和IEMOCAP数据集上,emoconfnet优于最先进的置信度估计方法。对于一个固定的EC识别器,当我们使用emoconfnet拒绝相同数量的低置信度预测后,我们获得了比使用其他方法拒绝时更高的F1和未加权平均召回率(UAR)。
{"title":"Confidence Estimation for Speech Emotion Recognition Based on the Relationship Between Emotion Categories and Primitives","authors":"Y. Li, C. Papayiannis, Viktor Rozgic, Elizabeth Shriberg, Chao Wang","doi":"10.1109/ICASSP43922.2022.9746930","DOIUrl":"https://doi.org/10.1109/ICASSP43922.2022.9746930","url":null,"abstract":"Confidence estimation for Speech Emotion Recognition (SER) is instrumental in improving the reliability in the behavior of downstream applications. In this work we propose (1) a novel confidence metric for SER based on the relationship between emotion primitives: arousal, valence, and dominance (AVD) and emotion categories (ECs), (2) EmoConfidNet - a DNN trained alongside the EC recognizer to predict the proposed confidence metric, and (3) a data filtering technique used to enhance the training of EmoConfidNet and the EC recognizer. For each training sample, we calculate distances from corresponding AVD annotation vectors to centroids of each EC in the AVD space, and define EC confidences as functions of the evaluated distances. EmoConfidNet is trained to predict confidence from the same acoustic representations used to train the EC recognizer. EmoConfidNet outperforms state-of-the-art confidence estimation methods on the MSP-Podcast and IEMOCAP datasets. For a fixed EC recognizer, after we reject the same number of low confidence predictions using EmoConfidNet, we achieve a higher F1 and unweighted average recall (UAR) than when rejecting using other methods.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133566899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Robust Collaborative Learning for Sequence Modelling 鲁棒协同学习序列建模
Francois Buet-Golfouse, Hans Roggeman, Islam Utyagulov
Current deep learning techniques for RNA classification suffer from over-fitting and lack of reproducibility. We show that by introducing robustness by design in both CNN and RNN algorithms, we are able to achieve standalone state-of-the-art accuracy. By constructing model-agnostic robustness checks and reusing features obtained from both architectures, we build a collaborative framework that improves performance and stability.
目前用于RNA分类的深度学习技术存在过拟合和缺乏可重复性的问题。我们表明,通过在CNN和RNN算法中引入鲁棒性设计,我们能够实现独立的最先进的精度。通过构建与模型无关的鲁棒性检查和重用从两个体系结构中获得的特征,我们构建了一个提高性能和稳定性的协作框架。
{"title":"Robust Collaborative Learning for Sequence Modelling","authors":"Francois Buet-Golfouse, Hans Roggeman, Islam Utyagulov","doi":"10.1109/icassp43922.2022.9746494","DOIUrl":"https://doi.org/10.1109/icassp43922.2022.9746494","url":null,"abstract":"Current deep learning techniques for RNA classification suffer from over-fitting and lack of reproducibility. We show that by introducing robustness by design in both CNN and RNN algorithms, we are able to achieve standalone state-of-the-art accuracy. By constructing model-agnostic robustness checks and reusing features obtained from both architectures, we build a collaborative framework that improves performance and stability.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"136 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133271264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DeepHull: Fast Convex Hull Approximation in High Dimensions DeepHull:高维快速凸壳近似
Randall Balestriero, Zichao Wang, Richard Baraniuk
Computing or approximating the convex hull of a dataset plays a role in a wide range of applications, including economics, statistics, and physics, to name just a few. However, convex hull computation and approximation is exponentially complex, in terms of both memory and computation, as the ambient space dimension increases. In this paper, we propose DeepHull, a new convex hull approximation algorithm based on convex deep networks (DNs) with continuous piecewise-affine nonlinearities and nonnegative weights. The idea is that binary classification between true data samples and adversarially generated samples with such a DN naturally induces a polytope decision boundary that approximates the true data convex hull. A range of exploratory experiments demonstrates that DeepHull efficiently produces a meaningful convex hull approximation, even in a high-dimensional ambient space.
计算或近似数据集的凸包在广泛的应用中发挥作用,包括经济学,统计学和物理学,仅举几例。然而,随着环境空间维度的增加,凸包计算和近似在内存和计算方面呈指数级复杂。本文提出了一种新的基于连续分段仿射非线性和非负权凸深度网络的凸壳近似算法DeepHull。其思想是,真实数据样本和具有这种DN的对抗性生成样本之间的二元分类自然会产生近似真实数据凸包的多面体决策边界。一系列探索性实验表明,即使在高维环境空间中,DeepHull也能有效地产生有意义的凸壳近似。
{"title":"DeepHull: Fast Convex Hull Approximation in High Dimensions","authors":"Randall Balestriero, Zichao Wang, Richard Baraniuk","doi":"10.1109/ICASSP43922.2022.9746031","DOIUrl":"https://doi.org/10.1109/ICASSP43922.2022.9746031","url":null,"abstract":"Computing or approximating the convex hull of a dataset plays a role in a wide range of applications, including economics, statistics, and physics, to name just a few. However, convex hull computation and approximation is exponentially complex, in terms of both memory and computation, as the ambient space dimension increases. In this paper, we propose DeepHull, a new convex hull approximation algorithm based on convex deep networks (DNs) with continuous piecewise-affine nonlinearities and nonnegative weights. The idea is that binary classification between true data samples and adversarially generated samples with such a DN naturally induces a polytope decision boundary that approximates the true data convex hull. A range of exploratory experiments demonstrates that DeepHull efficiently produces a meaningful convex hull approximation, even in a high-dimensional ambient space.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133346541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Unsupervised Anomaly Detection for Container Cloud Via BILSTM-Based Variational Auto-Encoder 基于bilstm的变分自编码器的容器云无监督异常检测
Yulong Wang, Xingshu Chen, Qixu Wang, Run Yang, Bangzhou Xin
The appearance of container technology has profoundly changed the development and deployment of multi-tier distributed applications. However, the imperfect system resource isolation features and the kernel-sharing mechanism will introduce significant security risks to the container-based cloud. In this paper, we propose a real-time unsupervised anomaly detection system for monitoring system calls in container cloud via BiLSTM-based variational auto-encoder (VAE). Our proposed BiLSTM-based VAE network leverages the generative characteristics of VAE to learn the robust representations of normal patterns by reconstruction probabilities while being sensitive to long-term dependencies. Our evaluations using real-world datasets show that the BiLSTM-based VAE network achieves excellent detection performance without introducing significant running performance overhead to the container platform.
容器技术的出现深刻地改变了多层分布式应用程序的开发和部署。然而,不完善的系统资源隔离特性和内核共享机制将给基于容器的云带来重大的安全风险。本文提出了一种基于bilstm的变分自编码器(VAE)的实时无监督异常检测系统,用于监控容器云中的系统调用。我们提出的基于bilstm的VAE网络利用VAE的生成特性,通过重构概率学习正常模式的鲁棒表示,同时对长期依赖关系敏感。我们使用真实数据集进行的评估表明,基于bilstm的VAE网络在不给容器平台带来显著运行性能开销的情况下实现了出色的检测性能。
{"title":"Unsupervised Anomaly Detection for Container Cloud Via BILSTM-Based Variational Auto-Encoder","authors":"Yulong Wang, Xingshu Chen, Qixu Wang, Run Yang, Bangzhou Xin","doi":"10.1109/icassp43922.2022.9747341","DOIUrl":"https://doi.org/10.1109/icassp43922.2022.9747341","url":null,"abstract":"The appearance of container technology has profoundly changed the development and deployment of multi-tier distributed applications. However, the imperfect system resource isolation features and the kernel-sharing mechanism will introduce significant security risks to the container-based cloud. In this paper, we propose a real-time unsupervised anomaly detection system for monitoring system calls in container cloud via BiLSTM-based variational auto-encoder (VAE). Our proposed BiLSTM-based VAE network leverages the generative characteristics of VAE to learn the robust representations of normal patterns by reconstruction probabilities while being sensitive to long-term dependencies. Our evaluations using real-world datasets show that the BiLSTM-based VAE network achieves excellent detection performance without introducing significant running performance overhead to the container platform.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133631090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders 用变分自编码器学习语音诱发脑电的主体不变表征
Lies Bollens, T. Francart, H. V. hamme
The electroencephalogram (EEG) is a powerful method to understand how the brain processes speech. Linear models have recently been replaced for this purpose with deep neural networks and yield promising results. In related EEG classification fields, it is shown that explicitly modeling subject-invariant features improves generalization of models across subjects and benefits classification accuracy. In this work, we adapt factorized hierarchical variational autoencoders to exploit parallel EEG recordings of the same stimuli. We model EEG into two disentangled latent spaces. Subject accuracy reaches 98.96% and 1.60% on respectively the subject and content latent space, whereas binary content classification experiments reach an accuracy of 51.51% and 62.91% on respectively the subject and content latent space.
脑电图(EEG)是了解大脑如何处理语言的有力方法。线性模型最近被深度神经网络所取代,并产生了令人鼓舞的结果。在相关的脑电分类领域中,明确建模主题不变性特征可以提高模型跨主题的泛化性,提高分类精度。在这项工作中,我们采用因式分层变分自编码器来利用相同刺激的并行脑电图记录。我们将EEG建模为两个分离的潜在空间。主题和内容潜空间的准确率分别达到98.96%和1.60%,而二元内容分类实验在主题和内容潜空间上的准确率分别达到51.51%和62.91%。
{"title":"Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders","authors":"Lies Bollens, T. Francart, H. V. hamme","doi":"10.1109/ICASSP43922.2022.9747297","DOIUrl":"https://doi.org/10.1109/ICASSP43922.2022.9747297","url":null,"abstract":"The electroencephalogram (EEG) is a powerful method to understand how the brain processes speech. Linear models have recently been replaced for this purpose with deep neural networks and yield promising results. In related EEG classification fields, it is shown that explicitly modeling subject-invariant features improves generalization of models across subjects and benefits classification accuracy. In this work, we adapt factorized hierarchical variational autoencoders to exploit parallel EEG recordings of the same stimuli. We model EEG into two disentangled latent spaces. Subject accuracy reaches 98.96% and 1.60% on respectively the subject and content latent space, whereas binary content classification experiments reach an accuracy of 51.51% and 62.91% on respectively the subject and content latent space.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133668656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Controlled Sensing and Anomaly Detection Via Soft Actor-Critic Reinforcement Learning 基于软Actor-Critic强化学习的受控传感和异常检测
Chen Zhong, M. C. Gursoy, Senem Velipasalar
To address the anomaly detection problem in the presence of noisy observations and to tackle the tuning and efficient exploration challenges that arise in deep reinforcement learning algorithms, we in this paper propose a soft actor-critic deep reinforcement learning framework. To evaluate the proposed framework, we measure its performance in terms of detection accuracy, stopping time, and the total number of samples needed for detection. Via simulation results, we demonstrate the performance when soft actor-critic algorithms are employed, and identify the impact of key parameters, such as the sensing cost, on the performance. In all results, we further provide comparisons between the performances of the proposed soft actor-critic and conventional actor-critic algorithms.
为了解决存在噪声观测的异常检测问题,并解决深度强化学习算法中出现的调优和高效探索挑战,我们在本文中提出了一个软行为者批评深度强化学习框架。为了评估所提出的框架,我们根据检测精度、停止时间和检测所需的样本总数来衡量其性能。通过仿真结果,我们展示了采用软行为评价算法时的性能,并确定了关键参数(如传感成本)对性能的影响。在所有结果中,我们进一步比较了所提出的软演员评论和传统演员评论算法的性能。
{"title":"Controlled Sensing and Anomaly Detection Via Soft Actor-Critic Reinforcement Learning","authors":"Chen Zhong, M. C. Gursoy, Senem Velipasalar","doi":"10.1109/icassp43922.2022.9747436","DOIUrl":"https://doi.org/10.1109/icassp43922.2022.9747436","url":null,"abstract":"To address the anomaly detection problem in the presence of noisy observations and to tackle the tuning and efficient exploration challenges that arise in deep reinforcement learning algorithms, we in this paper propose a soft actor-critic deep reinforcement learning framework. To evaluate the proposed framework, we measure its performance in terms of detection accuracy, stopping time, and the total number of samples needed for detection. Via simulation results, we demonstrate the performance when soft actor-critic algorithms are employed, and identify the impact of key parameters, such as the sensing cost, on the performance. In all results, we further provide comparisons between the performances of the proposed soft actor-critic and conventional actor-critic algorithms.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"163 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132137080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Physical Layer Anonymous Communications: An Anonymity Entropy Oriented Precoding Design (Invited Paper) 物理层匿名通信:面向匿名熵的预编码设计(特邀论文)
Zhongxiang Wei, C. Masouros, Sumei Sun
Different from traditional security-oriented designs, the aim of anonymizing techniques is to mask users' identities during communication, thereby providing users with unidentifiability and unlinkability. The existing anonymizing techniques are only designated at upper layers of networks, ignoring the risk of anonymity leakage at physical layer (PHY). In this paper, we address the PHY anonymity design with focus on a typical uplink scenario where the receiver is equipped with more antennas than the sender. With the increased degrees-of-freedom at the receiver side, we first propose a maximum likelihood estimation (MLE) signal trace-back detector, which only analyzes the signaling pattern of the received signal to disclose the sender's identity. Accordingly, an anonymity entropy anonymous (AEA) precoder is proposed, which manipulates the transmitted signalling pattern to counteract the receiver's trace-back detector and meanwhile to guarantee high receive signal-to-interference-plus-noise ratio for communication. More importantly, more data streams can be multiplexed than the number of transmit antennas, which is particularly suitable for the strong receiver configuration. Simulation demonstrates that the proposed AEA precoder can simultaneously provide high anonymity and communication performance.
与传统的面向安全的设计不同,匿名化技术的目的是在通信过程中掩盖用户的身份,从而为用户提供不可识别性和不可链接性。现有的匿名化技术只针对网络的上层,忽略了物理层匿名泄露的风险。在本文中,我们讨论了PHY匿名设计,重点关注典型的上行场景,其中接收器配备的天线比发送者多。随着接收端自由度的增加,我们首先提出了一种最大似然估计(MLE)信号跟踪检测器,它只分析接收信号的信令模式来揭示发送者的身份。在此基础上,提出了一种匿名熵匿名(AEA)预编码器,该预编码器对发送的信号模式进行处理,以抵消接收端跟踪检测器的干扰,同时保证较高的接收信噪比。更重要的是,可以复用的数据流比发射天线的数量多,这特别适合于强接收配置。仿真结果表明,该预编码器能够同时提供较高的匿名性和通信性能。
{"title":"Physical Layer Anonymous Communications: An Anonymity Entropy Oriented Precoding Design (Invited Paper)","authors":"Zhongxiang Wei, C. Masouros, Sumei Sun","doi":"10.1109/icassp43922.2022.9746100","DOIUrl":"https://doi.org/10.1109/icassp43922.2022.9746100","url":null,"abstract":"Different from traditional security-oriented designs, the aim of anonymizing techniques is to mask users' identities during communication, thereby providing users with unidentifiability and unlinkability. The existing anonymizing techniques are only designated at upper layers of networks, ignoring the risk of anonymity leakage at physical layer (PHY). In this paper, we address the PHY anonymity design with focus on a typical uplink scenario where the receiver is equipped with more antennas than the sender. With the increased degrees-of-freedom at the receiver side, we first propose a maximum likelihood estimation (MLE) signal trace-back detector, which only analyzes the signaling pattern of the received signal to disclose the sender's identity. Accordingly, an anonymity entropy anonymous (AEA) precoder is proposed, which manipulates the transmitted signalling pattern to counteract the receiver's trace-back detector and meanwhile to guarantee high receive signal-to-interference-plus-noise ratio for communication. More importantly, more data streams can be multiplexed than the number of transmit antennas, which is particularly suitable for the strong receiver configuration. Simulation demonstrates that the proposed AEA precoder can simultaneously provide high anonymity and communication performance.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132270137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised Hierarchical Translation-Based Model for Multi-Modal Medical Image Registration 基于无监督分层翻译的多模态医学图像配准模型
X. Dai, Tai Ma, Haibin Cai, Ying Wen
Deformable registration of multi-modal medical images is a challenging task in medical image processing due to the differences in both appearance and structure. We propose an unsupervised hierarchical translation-based model to perform a coarse to fine registration of multi-modal medical images. The proposed model consists of three parts: a coarse registration network, a modal translation network and a fine registration network. First, the coarse registration network learns to obtain the coarse deformation field, which is applied as structure-preserving information to generate a translated image by the modal translation network. Then, the translated image as enhancing information combined with the original images are used to derive a fine deformation field in the fine registration network. Furthermore, the final deformation field is composed from the coarse and the fine deformation fields. In this way, the proposed model can learn high accurate deformation field to implement multi-modal medical image registration. Experiments on two multi-modal brain image datasets demonstrate the effectiveness of this model.
由于多模态医学图像在外观和结构上的差异,多模态医学图像的形变配准是医学图像处理中的一个难题。我们提出了一种基于无监督分层翻译的模型来对多模态医学图像进行从粗到精的配准。该模型由三部分组成:粗配准网络、模态翻译网络和精细配准网络。首先,粗配准网络学习获取粗变形场,并将其作为保持结构的信息,通过模态平移网络生成翻译图像;然后,将翻译后的图像作为增强信息与原始图像结合,在精细配准网络中得到精细形变场;最终变形场由粗变形场和细变形场组成。这样,该模型可以学习高精度的形变场,实现多模态医学图像配准。在两个多模态脑图像数据集上的实验验证了该模型的有效性。
{"title":"Unsupervised Hierarchical Translation-Based Model for Multi-Modal Medical Image Registration","authors":"X. Dai, Tai Ma, Haibin Cai, Ying Wen","doi":"10.1109/ICASSP43922.2022.9746324","DOIUrl":"https://doi.org/10.1109/ICASSP43922.2022.9746324","url":null,"abstract":"Deformable registration of multi-modal medical images is a challenging task in medical image processing due to the differences in both appearance and structure. We propose an unsupervised hierarchical translation-based model to perform a coarse to fine registration of multi-modal medical images. The proposed model consists of three parts: a coarse registration network, a modal translation network and a fine registration network. First, the coarse registration network learns to obtain the coarse deformation field, which is applied as structure-preserving information to generate a translated image by the modal translation network. Then, the translated image as enhancing information combined with the original images are used to derive a fine deformation field in the fine registration network. Furthermore, the final deformation field is composed from the coarse and the fine deformation fields. In this way, the proposed model can learn high accurate deformation field to implement multi-modal medical image registration. Experiments on two multi-modal brain image datasets demonstrate the effectiveness of this model.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132275886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1