首页 > 最新文献

2020 International Conference on Signal Processing and Communications (SPCOM)最新文献

英文 中文
Signal Representation Using Ramanujan Subspaces Utilizing A Prior Signal Information 利用先验信号信息的Ramanujan子空间信号表示
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179618
Shaik Basheeruddin Shah, Vijay Kumar Chakka
In signal processing applications the information about the signal such as frequency (or) period is known a prior for most of the practical signals like ECG, EEG, speech, etc. Inspired by this, in this paper, we propose a new signal representation to estimate the period and frequency information of a given signal with low computational complexity. We achieve this by representing a finite-length discrete-time signal as a linear combination of signals belongs to Ramanujan subspaces. Further, we evaluate the performance of the proposed representation with a simulated example and also by addressing the problem of reducing Power Line Interference (PLI) in an ECG signal. Finally, for a given integer-valued signal, we show that the computational complexity of the proposed transform is quite low in comparison with the existing transforms, and it is quite comparable for a given real (or) complex-valued signal.
在信号处理应用中,对于大多数实际信号,如心电、脑电图、语音等,有关信号的频率(或)周期等信息是已知的。受此启发,本文提出了一种新的信号表示,以较低的计算复杂度估计给定信号的周期和频率信息。我们通过将有限长度的离散时间信号表示为属于拉马努金子空间的信号的线性组合来实现这一点。此外,我们通过一个模拟示例评估了所提出的表示的性能,并通过解决减少心电信号中的电源线干扰(PLI)的问题。最后,对于给定的整数值信号,我们表明,与现有的变换相比,所提出的变换的计算复杂度相当低,并且对于给定的实(或)复值信号具有相当的可比性。
{"title":"Signal Representation Using Ramanujan Subspaces Utilizing A Prior Signal Information","authors":"Shaik Basheeruddin Shah, Vijay Kumar Chakka","doi":"10.1109/SPCOM50965.2020.9179618","DOIUrl":"https://doi.org/10.1109/SPCOM50965.2020.9179618","url":null,"abstract":"In signal processing applications the information about the signal such as frequency (or) period is known a prior for most of the practical signals like ECG, EEG, speech, etc. Inspired by this, in this paper, we propose a new signal representation to estimate the period and frequency information of a given signal with low computational complexity. We achieve this by representing a finite-length discrete-time signal as a linear combination of signals belongs to Ramanujan subspaces. Further, we evaluate the performance of the proposed representation with a simulated example and also by addressing the problem of reducing Power Line Interference (PLI) in an ECG signal. Finally, for a given integer-valued signal, we show that the computational complexity of the proposed transform is quite low in comparison with the existing transforms, and it is quite comparable for a given real (or) complex-valued signal.","PeriodicalId":208527,"journal":{"name":"2020 International Conference on Signal Processing and Communications (SPCOM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120988710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diffusion Channel Characterization for A Cuboid Container: Some Insights into The Role of Dimensionality and Fluid Boundaries 长方体容器的扩散通道表征:对维度和流体边界作用的一些见解
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179630
Ankit, M. Bhatnagar
Dimensionality of the fluid environment plays a crucial role in characterizing the diffusive channel. It is generally believed that increasing the dimensionality of the fluid medium should negatively affect the hitting probabilities as the degrees of freedom of propagating molecules have been enhanced. This paper has twofold objectives, it provides the diffusion channel characterization of a molecular communication (MC) system in an enclosed cuboid geometry and then studies the effect of dimensionality and the size of the receiver on the obtained channel statistics. The motility probability distribution function (PDF) of the molecules in a constrained cuboid environment with five reflecting and one absorbing wall is derived. The first hitting time (FHT) PDF and the hitting probabilities of the molecules to the absorbing wall are deduced from the same. A comparative analytical study of the derived FHT PDF against the diffusion channel statistics of various bounded and unbounded environments is presented. The comparison quantitatively establishes that an MC system with suitably configured fluid boundaries and transmitter and receiver arrangement can completely eliminate the effect of dimensionality and the size of the receiver on the hitting probabilities. The study may be of use in designing practically efficient and economic MC systems.
流体环境的维度在表征扩散通道中起着至关重要的作用。一般认为,增加流体介质的维数会对碰撞概率产生负面影响,因为传播分子的自由度增加了。本文有两个目的,首先给出了分子通信系统在封闭长方体中的扩散通道表征,然后研究了维数和接收器尺寸对得到的通道统计量的影响。导出了分子在具有5个反射壁和1个吸收壁的约束长方体环境中的运动概率分布函数。由此推导出分子对吸波壁的首次撞击时间(FHT) PDF和撞击概率。本文给出了推导出的FHT PDF与各种有界和无界环境下的扩散通道统计量的比较分析研究。定量比较表明,适当配置流体边界和发射器和接收器的MC系统可以完全消除维数和接收器尺寸对命中概率的影响。研究结果可为设计实际高效、经济的MC系统提供参考。
{"title":"Diffusion Channel Characterization for A Cuboid Container: Some Insights into The Role of Dimensionality and Fluid Boundaries","authors":"Ankit, M. Bhatnagar","doi":"10.1109/SPCOM50965.2020.9179630","DOIUrl":"https://doi.org/10.1109/SPCOM50965.2020.9179630","url":null,"abstract":"Dimensionality of the fluid environment plays a crucial role in characterizing the diffusive channel. It is generally believed that increasing the dimensionality of the fluid medium should negatively affect the hitting probabilities as the degrees of freedom of propagating molecules have been enhanced. This paper has twofold objectives, it provides the diffusion channel characterization of a molecular communication (MC) system in an enclosed cuboid geometry and then studies the effect of dimensionality and the size of the receiver on the obtained channel statistics. The motility probability distribution function (PDF) of the molecules in a constrained cuboid environment with five reflecting and one absorbing wall is derived. The first hitting time (FHT) PDF and the hitting probabilities of the molecules to the absorbing wall are deduced from the same. A comparative analytical study of the derived FHT PDF against the diffusion channel statistics of various bounded and unbounded environments is presented. The comparison quantitatively establishes that an MC system with suitably configured fluid boundaries and transmitter and receiver arrangement can completely eliminate the effect of dimensionality and the size of the receiver on the hitting probabilities. The study may be of use in designing practically efficient and economic MC systems.","PeriodicalId":208527,"journal":{"name":"2020 International Conference on Signal Processing and Communications (SPCOM)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130425554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Steady-State Performance of the UT-ZA-PNLMS Algorithm for Sparse Systems 改进稀疏系统UT-ZA-PNLMS算法的稳态性能
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179566
K. S. S. Anudeep, Kuldeep Khoria, R. Das
For identifying sparse systems, a recently proposed algorithm called upper threshold based zero attracting proportionate normalized least mean square (UT-ZA-PNLMS) algorithm has shown improved performance in terms of both the convergence rate and steady-state error in comparison to the ZAPNLMS algorithm. The UT-ZA-PNLMS algorithm employs adaptive threshold based gain function in order to improve convergence rate of the active taps, especially the taps with low magnitude, and appends zero attracting term in the update equation in order to bring the inactive taps to their optimum zero level. However, as the UT-ZA-PNLMS algorithm uses uniform shrinkage for that zero attraction, the active taps experience significant bias which limits overall steady-state performance. In this paper, we introduce selective shrinkage for the zero attracting term so that the inactive taps get strong attractive force whereas the active taps would experience negligibly small attractive force, and thus the bias in the active tap is reduced. In particular, we propose three different algorithms incorporating log-sum, $ell_{p^{-}}$ norm and $ell_{0}$-norm penalties to the cost function of the upper threshold based PNLMS algorithm. The resulting algorithms are studied extensively and the simulation results show their improved steady-state performances.
为了识别稀疏系统,最近提出了一种基于上阈值的零吸引比例归一化最小均方(utza - pnlms)算法,与ZAPNLMS算法相比,该算法在收敛速度和稳态误差方面都有提高。UT-ZA-PNLMS算法采用基于自适应阈值的增益函数来提高有源抽头特别是低幅值抽头的收敛速度,并在更新方程中增加零吸引项,使无活动抽头达到最佳零水平。然而,由于UT-ZA-PNLMS算法对零吸引力使用均匀收缩,因此主动水龙头会经历明显的偏差,从而限制了整体稳态性能。本文引入零吸引项的选择性收缩,使非活动丝锥受到强大的吸引力,而活动丝锥受到可忽略的小吸引力,从而减小了活动丝锥的偏置。特别地,我们提出了三种不同的算法,将对数和、$ell_{p^{-}}$ norm和$ell_{0}$-norm对基于上限阈值的PNLMS算法的代价函数进行惩罚。对所得到的算法进行了广泛的研究,仿真结果表明它们改善了稳态性能。
{"title":"Improving Steady-State Performance of the UT-ZA-PNLMS Algorithm for Sparse Systems","authors":"K. S. S. Anudeep, Kuldeep Khoria, R. Das","doi":"10.1109/SPCOM50965.2020.9179566","DOIUrl":"https://doi.org/10.1109/SPCOM50965.2020.9179566","url":null,"abstract":"For identifying sparse systems, a recently proposed algorithm called upper threshold based zero attracting proportionate normalized least mean square (UT-ZA-PNLMS) algorithm has shown improved performance in terms of both the convergence rate and steady-state error in comparison to the ZAPNLMS algorithm. The UT-ZA-PNLMS algorithm employs adaptive threshold based gain function in order to improve convergence rate of the active taps, especially the taps with low magnitude, and appends zero attracting term in the update equation in order to bring the inactive taps to their optimum zero level. However, as the UT-ZA-PNLMS algorithm uses uniform shrinkage for that zero attraction, the active taps experience significant bias which limits overall steady-state performance. In this paper, we introduce selective shrinkage for the zero attracting term so that the inactive taps get strong attractive force whereas the active taps would experience negligibly small attractive force, and thus the bias in the active tap is reduced. In particular, we propose three different algorithms incorporating log-sum, $ell_{p^{-}}$ norm and $ell_{0}$-norm penalties to the cost function of the upper threshold based PNLMS algorithm. The resulting algorithms are studied extensively and the simulation results show their improved steady-state performances.","PeriodicalId":208527,"journal":{"name":"2020 International Conference on Signal Processing and Communications (SPCOM)","volume":"318 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134262498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PhaseSense — Signal Reconstruction from Phase-Only Measurements via Quadratic Programming 相位感知-信号重建从相位测量通过二次规划
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179523
Vinith Kishore, Subhadip Mukherjee, C. Seelamantula
We consider the problem of reconstructing a complex-valued signal from its phase-only measurements. This framework can be considered as a generalization of the well-known one-bit compressed sensing paradigm where the underlying signal is known to be sparse. In contrast, the proposed formalism does not rely on the assumption of sparsity and hence applies to a broader class of signals. The optimization problem for signal reconstruction is formulated by first splitting the linear measurement vector into its phase and magnitude components and subsequently using the non-negativity property of the magnitude component as a constraint. The resulting optimization problem turns out to be a quadratic program (QP) and is solved using two algorithms: (i) alternating directions method of multipliers; and (ii) projected gradient-descent with Nesterov’s momentum. Due to the inherent scale ambiguity of the phase-only measurement, the underlying signal can be reconstructed only up to a global scale-factor. We obtain high accuracy for reconstructing 1–D synthetic signals in the absence of noise. We also show an application of the proposed approach in reconstructing images from the phase of their measurement coefficients. The underlying image is recovered up to a peak signal-to-noise ratio exceeding 30 dB in several examples, indicating an accurate reconstruction.
我们考虑了从相位测量中重建复值信号的问题。该框架可以被认为是众所周知的一位压缩感知范式的推广,其中底层信号已知是稀疏的。相反,所提出的形式主义不依赖于稀疏性假设,因此适用于更广泛的信号类别。首先将线性测量向量分解为相位分量和幅度分量,然后利用幅度分量的非负性作为约束,从而制定了信号重构的优化问题。所得到的优化问题是一个二次规划(QP),并使用两种算法求解:(i)乘法器交替方向法;(ii)利用Nesterov动量预测梯度下降。由于纯相位测量固有的尺度模糊性,底层信号只能重构到一个全局尺度因子。我们获得了在无噪声情况下重建一维合成信号的高精度。我们还展示了该方法在从测量系数的相位重建图像中的应用。在几个例子中,底层图像恢复到峰值信噪比超过30 dB,表明重建准确。
{"title":"PhaseSense — Signal Reconstruction from Phase-Only Measurements via Quadratic Programming","authors":"Vinith Kishore, Subhadip Mukherjee, C. Seelamantula","doi":"10.1109/SPCOM50965.2020.9179523","DOIUrl":"https://doi.org/10.1109/SPCOM50965.2020.9179523","url":null,"abstract":"We consider the problem of reconstructing a complex-valued signal from its phase-only measurements. This framework can be considered as a generalization of the well-known one-bit compressed sensing paradigm where the underlying signal is known to be sparse. In contrast, the proposed formalism does not rely on the assumption of sparsity and hence applies to a broader class of signals. The optimization problem for signal reconstruction is formulated by first splitting the linear measurement vector into its phase and magnitude components and subsequently using the non-negativity property of the magnitude component as a constraint. The resulting optimization problem turns out to be a quadratic program (QP) and is solved using two algorithms: (i) alternating directions method of multipliers; and (ii) projected gradient-descent with Nesterov’s momentum. Due to the inherent scale ambiguity of the phase-only measurement, the underlying signal can be reconstructed only up to a global scale-factor. We obtain high accuracy for reconstructing 1–D synthetic signals in the absence of noise. We also show an application of the proposed approach in reconstructing images from the phase of their measurement coefficients. The underlying image is recovered up to a peak signal-to-noise ratio exceeding 30 dB in several examples, indicating an accurate reconstruction.","PeriodicalId":208527,"journal":{"name":"2020 International Conference on Signal Processing and Communications (SPCOM)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133439124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
On Impact of Imperfect CSI over SWIPT Device-to-Device (D2D) MIMO Relay Systems 不完善的CSI对SWIPT设备对设备(D2D) MIMO中继系统的影响
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179633
Parvez Shaik, P. Singya, Nagendra Kumar, Kamal K. Garg, V. Bhatia
Simultaneous wireless information and power transfer (SWIPT) is an efficient solution for the power scarce wireless communications. In this paper, we consider a relay assisted multi-input and multi-output (MIMO) device-to-device (D2D) communications with a practical scenario of imperfect channel state information (CSI) over a generalized Nakagami-m fading channels. Further, at the relay node, energy from the received radio-frequency (RF) signals is harvested by adopting time-switch (TS) protocol for broadcasting a signal to the destination. In a resource limited environment, it is deterrent to use all the MIMO antennas due to increased system complexity with the dedicated RF chain for each active antenna. Thus, transmit antenna selection strategy (TAS) is considered in this work. Framework for the outage probability and asymptotic outage probability of TAS based MIMO D2D relay system is provided. It is observed that the diversity order of the system gets affected with small variation in imperfect CSI correlation coefficient and throughput of the system gets affected severely with increasing rates with imperfect CSI. Further, Monte-Carlo simulations are performed to validate the derived analytical expressions.
同时无线信息与电力传输(SWIPT)是解决电力短缺的无线通信的一种有效方法。在本文中,我们考虑了一种中继辅助的多输入多输出(MIMO)设备对设备(D2D)通信,该通信具有不完全信道状态信息(CSI)在广义Nakagami-m衰落信道上的实际场景。此外,在中继节点,从接收的射频(RF)信号中获取能量,采用时间开关(TS)协议将信号广播到目的地。在资源有限的环境中,由于每个有源天线的专用射频链增加了系统复杂性,因此使用所有MIMO天线是一种威慑。因此,本文考虑了发射天线选择策略(TAS)。给出了基于TAS的MIMO D2D中继系统的中断概率和渐近中断概率的框架。结果表明,不完全CSI相关系数变化较小时,系统的分集顺序受到影响;不完全CSI增加时,系统的吞吐量受到严重影响。此外,还进行了蒙特卡罗模拟来验证推导出的解析表达式。
{"title":"On Impact of Imperfect CSI over SWIPT Device-to-Device (D2D) MIMO Relay Systems","authors":"Parvez Shaik, P. Singya, Nagendra Kumar, Kamal K. Garg, V. Bhatia","doi":"10.1109/SPCOM50965.2020.9179633","DOIUrl":"https://doi.org/10.1109/SPCOM50965.2020.9179633","url":null,"abstract":"Simultaneous wireless information and power transfer (SWIPT) is an efficient solution for the power scarce wireless communications. In this paper, we consider a relay assisted multi-input and multi-output (MIMO) device-to-device (D2D) communications with a practical scenario of imperfect channel state information (CSI) over a generalized Nakagami-m fading channels. Further, at the relay node, energy from the received radio-frequency (RF) signals is harvested by adopting time-switch (TS) protocol for broadcasting a signal to the destination. In a resource limited environment, it is deterrent to use all the MIMO antennas due to increased system complexity with the dedicated RF chain for each active antenna. Thus, transmit antenna selection strategy (TAS) is considered in this work. Framework for the outage probability and asymptotic outage probability of TAS based MIMO D2D relay system is provided. It is observed that the diversity order of the system gets affected with small variation in imperfect CSI correlation coefficient and throughput of the system gets affected severely with increasing rates with imperfect CSI. Further, Monte-Carlo simulations are performed to validate the derived analytical expressions.","PeriodicalId":208527,"journal":{"name":"2020 International Conference on Signal Processing and Communications (SPCOM)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128991829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Effectiveness of Transfer Learning on Singing Voice Conversion in the Presence of Background Music 背景音乐下迁移学习对歌唱声音转换的影响
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179583
Divyesh G. Rajpura, Jui Shah, Maitreya Patel, Harshit Malaviya, K. Phatnani, H. Patil
Singing voice conversion (SVC) is a task of converting the perception of the source speaker’s identity to the target speaker without changing lyrics and rhythm. Recent approaches in traditional voice conversion involve the use of the generative models, such as Variational Autoencoders (VAE), and Generative Adversarial Networks (GANs). However, in the case of SVC, GANs are not explored much. The only system that has been proposed in the literature uses traditional GAN on the parallel data. The parallel data collection for real scenarios (with the same background music) is not feasible. Moreover, in the presence of background music, SVC is one of the most challenging tasks as it involves the source separation of vocals from the inputs, which will have some noise. Therefore, in this paper, we propose transfer learning, and fine-tuning-based Cycle consistent GAN (CycleGAN) model for non-parallel SVC, where music source separation is done using Deep Attractor Network (DANet). We designed seven different possible systems to identify the best possible combination of transfer learning and fine-tuning. Here, we use a more challenging database, MUSDB18, as our primary dataset, and we also use the NUS-48E database to pre-train CycleGAN. We perform extensive analysis via objective and subjective measures and report that with a 4.14 MOS score out of 5 for naturalness, the CycleGAN model pre-trained on NUS-48E corpus performs the best compared to the other systems described in the paper.
歌唱声音转换(SVC)是在不改变歌词和节奏的情况下,将源说话者的身份感知转换为目标说话者的任务。传统语音转换的最新方法包括使用生成模型,如变分自编码器(VAE)和生成对抗网络(gan)。然而,在SVC的情况下,gan的探索并不多。文献中唯一提出的系统是在并行数据上使用传统GAN。真实场景的并行数据收集(具有相同的背景音乐)是不可行的。此外,在背景音乐存在的情况下,SVC是最具挑战性的任务之一,因为它涉及到从输入中分离人声的源,这将有一些噪声。因此,在本文中,我们提出了非并行SVC的迁移学习和基于微调的循环一致GAN (CycleGAN)模型,其中音乐源分离使用深度吸引器网络(DANet)完成。我们设计了七种不同的可能系统,以确定迁移学习和微调的最佳组合。在这里,我们使用更具挑战性的数据库MUSDB18作为我们的主要数据集,我们还使用NUS-48E数据库来预训练CycleGAN。我们通过客观和主观测量进行了广泛的分析,并报告说,与论文中描述的其他系统相比,在NUS-48E语料库上预训练的CycleGAN模型在自然度方面的MOS得分为4.14(满分为5)。
{"title":"Effectiveness of Transfer Learning on Singing Voice Conversion in the Presence of Background Music","authors":"Divyesh G. Rajpura, Jui Shah, Maitreya Patel, Harshit Malaviya, K. Phatnani, H. Patil","doi":"10.1109/SPCOM50965.2020.9179583","DOIUrl":"https://doi.org/10.1109/SPCOM50965.2020.9179583","url":null,"abstract":"Singing voice conversion (SVC) is a task of converting the perception of the source speaker’s identity to the target speaker without changing lyrics and rhythm. Recent approaches in traditional voice conversion involve the use of the generative models, such as Variational Autoencoders (VAE), and Generative Adversarial Networks (GANs). However, in the case of SVC, GANs are not explored much. The only system that has been proposed in the literature uses traditional GAN on the parallel data. The parallel data collection for real scenarios (with the same background music) is not feasible. Moreover, in the presence of background music, SVC is one of the most challenging tasks as it involves the source separation of vocals from the inputs, which will have some noise. Therefore, in this paper, we propose transfer learning, and fine-tuning-based Cycle consistent GAN (CycleGAN) model for non-parallel SVC, where music source separation is done using Deep Attractor Network (DANet). We designed seven different possible systems to identify the best possible combination of transfer learning and fine-tuning. Here, we use a more challenging database, MUSDB18, as our primary dataset, and we also use the NUS-48E database to pre-train CycleGAN. We perform extensive analysis via objective and subjective measures and report that with a 4.14 MOS score out of 5 for naturalness, the CycleGAN model pre-trained on NUS-48E corpus performs the best compared to the other systems described in the paper.","PeriodicalId":208527,"journal":{"name":"2020 International Conference on Signal Processing and Communications (SPCOM)","volume":"123 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129005201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A ‘Complete Blind’ No-Reference Stereoscopic Image Quality Assessment Algorithm 一种“完全盲”无参考立体图像质量评估算法
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179556
Balasubramanyam Appina
We propose a complete blind no-reference (NR) image quality assessment algorithm for assessing the perceptual quality of natural stereoscopic (S3D) images. Towards this end, we have generated an intermediate image from the left and right views, and hypothesize that the perceived quality of the S3D view close to that cyclopean image. We perform multi-steerable decomposition on cyclopean images and we compute the naturalness image quality evaluator (NIQE) score [1] and entropy score from each subband. Finally, the primitive quality scores of steerable subbands are pooled to obtain the overall perceptual quality score of an S3D image. The proposed algorithm is evaluated on the LIVE Phase I [2] and LIVE Phase II [3] stereoscopic image datasets and demonstrates its robust performance on both the datasets and across distortions. The proposed algorithm, which is a ‘complete blind’ model (neither requires pristine S3D images nor requires training on human opinion scores), is called the Multi-Orient NIQE based 3D image quality evaluator (MO-NIQE).
提出了一种完全盲无参考(NR)图像质量评估算法,用于评估自然立体(S3D)图像的感知质量。为此,我们从左视图和右视图中生成了一个中间图像,并假设S3D视图的感知质量接近该单眼图像。我们对cyclopean图像进行多导向分解,并从每个子带计算自然图像质量评估器(NIQE)分数[1]和熵分数。最后,对可控制子带的原始质量分数进行汇总,得到S3D图像的整体感知质量分数。该算法在LIVE Phase I[2]和LIVE Phase II[3]立体图像数据集上进行了评估,并证明了其在数据集和跨失真上的鲁棒性。所提出的算法是一个“完全盲”模型(既不需要原始的S3D图像,也不需要对人类意见评分进行训练),被称为基于Multi-Orient NIQE的3D图像质量评估器(MO-NIQE)。
{"title":"A ‘Complete Blind’ No-Reference Stereoscopic Image Quality Assessment Algorithm","authors":"Balasubramanyam Appina","doi":"10.1109/SPCOM50965.2020.9179556","DOIUrl":"https://doi.org/10.1109/SPCOM50965.2020.9179556","url":null,"abstract":"We propose a complete blind no-reference (NR) image quality assessment algorithm for assessing the perceptual quality of natural stereoscopic (S3D) images. Towards this end, we have generated an intermediate image from the left and right views, and hypothesize that the perceived quality of the S3D view close to that cyclopean image. We perform multi-steerable decomposition on cyclopean images and we compute the naturalness image quality evaluator (NIQE) score [1] and entropy score from each subband. Finally, the primitive quality scores of steerable subbands are pooled to obtain the overall perceptual quality score of an S3D image. The proposed algorithm is evaluated on the LIVE Phase I [2] and LIVE Phase II [3] stereoscopic image datasets and demonstrates its robust performance on both the datasets and across distortions. The proposed algorithm, which is a ‘complete blind’ model (neither requires pristine S3D images nor requires training on human opinion scores), is called the Multi-Orient NIQE based 3D image quality evaluator (MO-NIQE).","PeriodicalId":208527,"journal":{"name":"2020 International Conference on Signal Processing and Communications (SPCOM)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123550399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Dynamic Subarray Beamforming for Angular Superresolution of Coherent Targets 相干目标角超分辨的动态子阵列波束形成
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179526
M. Sivasankar, R. Hegde
Development of angular superresolution methods for resolving targets using multifunction phased array radar is challenging. Angular superresolution of closely spaced coherent targets with strong interferences in the context of phased array radar has hitherto not been addressed. In this paper a novel beamforming method with angular superresolution is proposed for resolving closely spaced coherent targets in the presence of interferences. A dynamic subarray beamforming framework is first developed based on the knowledge of the number of interferences. The output obtained from the dynamic subarray beamformer is then smoothed using an augmented covariance method to account for the coherence of targets. Superresolution method is then used to obtain robust DOA estimates even at low SNR. Experiments on DOA estimation are conducted in typical target detection scenarios and the results are evaluated using several performance metrics to illustrate the significance of the proposed method.
利用多功能相控阵雷达进行角超分辨目标的研究是一个具有挑战性的课题。在相控阵雷达的背景下,具有强干扰的近距离相干目标的角超分辨问题迄今尚未得到解决。本文提出了一种角超分辨率波束形成方法,用于分辨存在干扰的近距离相干目标。基于对干扰数的了解,提出了一种动态子阵列波束形成框架。从动态子阵列波束形成器获得的输出然后使用增广协方差方法进行平滑,以考虑目标的相干性。然后使用超分辨率方法在低信噪比下获得鲁棒的DOA估计。在典型的目标检测场景中进行了DOA估计实验,并使用几个性能指标对结果进行了评估,以说明所提出方法的重要性。
{"title":"Dynamic Subarray Beamforming for Angular Superresolution of Coherent Targets","authors":"M. Sivasankar, R. Hegde","doi":"10.1109/SPCOM50965.2020.9179526","DOIUrl":"https://doi.org/10.1109/SPCOM50965.2020.9179526","url":null,"abstract":"Development of angular superresolution methods for resolving targets using multifunction phased array radar is challenging. Angular superresolution of closely spaced coherent targets with strong interferences in the context of phased array radar has hitherto not been addressed. In this paper a novel beamforming method with angular superresolution is proposed for resolving closely spaced coherent targets in the presence of interferences. A dynamic subarray beamforming framework is first developed based on the knowledge of the number of interferences. The output obtained from the dynamic subarray beamformer is then smoothed using an augmented covariance method to account for the coherence of targets. Superresolution method is then used to obtain robust DOA estimates even at low SNR. Experiments on DOA estimation are conducted in typical target detection scenarios and the results are evaluated using several performance metrics to illustrate the significance of the proposed method.","PeriodicalId":208527,"journal":{"name":"2020 International Conference on Signal Processing and Communications (SPCOM)","volume":"36 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124507378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards Emotion Independent Language Identification System 面向情感独立语言识别系统
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179550
P. Jain, K. Gurugubelli, A. Vuppala
Language Identification (LID) is an integral part of multilingual speech systems. There are various conditions under which the performance of LID systems are sub-optimal, such as short duration, noise, channel variation, and so on. There has been effort to improve performance under these conditions, but the impact of speaker emotion variation on the performance of LID systems has not been studied. It is observed that the performance of LID systems degrade in the presence of emotional mismatch between train and test conditions. To that effect, we investigated adaptation approaches for improving the performance of LID systems by incorporating emotional utterances in form of adaptation dataset. Hence, we studied a prosody modification technique called Flexible Analysis Synthesis Tool (FAST) to vary the emotional characteristics of an utterance in order to improve the performance, but the results were inconsistent and not satisfactory. In this work, we propose a combination of Recurrent Convolutional Neural Network (RCNN) based architecture with multi stage training methodology, which outperformed state-ofart LID systems such as i-vectors, time delay neural network, long short term memory, and deep neural network x-vector.
语言识别是多语言语音系统的重要组成部分。在各种条件下,LID系统的性能不是最优的,例如持续时间短、噪声、信道变化等。人们一直在努力提高这些条件下的性能,但尚未研究说话者情绪变化对LID系统性能的影响。观察到,在训练和测试条件之间存在情感不匹配时,LID系统的性能会下降。为此,我们研究了适应方法,通过将情感话语以适应数据集的形式纳入来提高LID系统的性能。因此,我们研究了一种叫做FAST的韵律修饰技术,通过改变话语的情感特征来提高表现,但结果并不一致,也不令人满意。在这项工作中,我们提出了一种基于循环卷积神经网络(RCNN)的体系结构与多阶段训练方法的结合,其性能优于目前最先进的LID系统,如i向量、时滞神经网络、长短期记忆和深度神经网络x向量。
{"title":"Towards Emotion Independent Language Identification System","authors":"P. Jain, K. Gurugubelli, A. Vuppala","doi":"10.1109/SPCOM50965.2020.9179550","DOIUrl":"https://doi.org/10.1109/SPCOM50965.2020.9179550","url":null,"abstract":"Language Identification (LID) is an integral part of multilingual speech systems. There are various conditions under which the performance of LID systems are sub-optimal, such as short duration, noise, channel variation, and so on. There has been effort to improve performance under these conditions, but the impact of speaker emotion variation on the performance of LID systems has not been studied. It is observed that the performance of LID systems degrade in the presence of emotional mismatch between train and test conditions. To that effect, we investigated adaptation approaches for improving the performance of LID systems by incorporating emotional utterances in form of adaptation dataset. Hence, we studied a prosody modification technique called Flexible Analysis Synthesis Tool (FAST) to vary the emotional characteristics of an utterance in order to improve the performance, but the results were inconsistent and not satisfactory. In this work, we propose a combination of Recurrent Convolutional Neural Network (RCNN) based architecture with multi stage training methodology, which outperformed state-ofart LID systems such as i-vectors, time delay neural network, long short term memory, and deep neural network x-vector.","PeriodicalId":208527,"journal":{"name":"2020 International Conference on Signal Processing and Communications (SPCOM)","volume":"145 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121190203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Joint User Scheduling, and Precoding for Multicast Spectral Efficiency in Multigroup Multicast Systems 多组组播系统中联合用户调度和预编码对组播频谱效率的影响
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179596
Ashok Bandi, R. B. S. Mysore, S. Chatzinotas, B. Ottersten
This paper studies the joint design of user scheduling and precoding for the maximization of spectral efficiency (SE) for a multigroup multicast scenario in multiuser MISO downlink channels. Noticing that the existing definition of SE fails to account for group sizes, a new metric called multicast spectral efficiency (MC-SE) is proposed. In this context, the joint design is considered for the maximization of MC-SE. Firstly, with the help of binary scheduling variables, the joint design problem is formulated as a mixed-integer non-linear programming problem such that it facilitates the joint update of scheduling and precoding variables. Further, useful reformulations are proposed to reveal the hidden difference-of-convex/concave structure of the problem. Thereafter, we propose a convex-concave procedure based iterative algorithm with convergence guarantees to a stationary point. Finally, we compare different aspects namely MC-SE, SE and number of scheduled users through Monte-Carlo simulations.
针对多用户MISO下行信道下多组多播场景,研究了以频谱效率最大化为目标的用户调度和预编码联合设计。注意到现有的多播频谱效率的定义没有考虑到组的大小,提出了一个新的度量称为多播频谱效率(MC-SE)。在这种情况下,考虑关节设计以最大化MC-SE。首先,利用二元调度变量,将联合设计问题化为混合整数非线性规划问题,便于调度变量和预编码变量的联合更新;此外,提出了有用的重新表述,以揭示问题的凸/凹结构的隐藏差异。在此基础上,提出了一种基于凸凹过程的迭代算法,该算法具有收敛到平稳点的保证。最后,我们通过蒙特卡洛仿真比较了MC-SE、SE和调度用户数量。
{"title":"Joint User Scheduling, and Precoding for Multicast Spectral Efficiency in Multigroup Multicast Systems","authors":"Ashok Bandi, R. B. S. Mysore, S. Chatzinotas, B. Ottersten","doi":"10.1109/SPCOM50965.2020.9179596","DOIUrl":"https://doi.org/10.1109/SPCOM50965.2020.9179596","url":null,"abstract":"This paper studies the joint design of user scheduling and precoding for the maximization of spectral efficiency (SE) for a multigroup multicast scenario in multiuser MISO downlink channels. Noticing that the existing definition of SE fails to account for group sizes, a new metric called multicast spectral efficiency (MC-SE) is proposed. In this context, the joint design is considered for the maximization of MC-SE. Firstly, with the help of binary scheduling variables, the joint design problem is formulated as a mixed-integer non-linear programming problem such that it facilitates the joint update of scheduling and precoding variables. Further, useful reformulations are proposed to reveal the hidden difference-of-convex/concave structure of the problem. Thereafter, we propose a convex-concave procedure based iterative algorithm with convergence guarantees to a stationary point. Finally, we compare different aspects namely MC-SE, SE and number of scheduled users through Monte-Carlo simulations.","PeriodicalId":208527,"journal":{"name":"2020 International Conference on Signal Processing and Communications (SPCOM)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116650142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2020 International Conference on Signal Processing and Communications (SPCOM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1