首页 > 最新文献

2022 IEEE International Conference on Signal Processing and Communications (SPCOM)最新文献

英文 中文
On the Performance of Mixed RF-VLC Relaying Systems for HST Communication 用于HST通信的RF-VLC混合中继系统性能研究
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840803
Rupender Singh, E. Yaacoub
The emerging fifth generation (5G) wireless communication systems are a promising solution to support the high data rate for high-speed trains (HSTs). In this article, we propose a cooperative mixed radio frequency (RF)-visible light communication (VLC) system to avoid the high penetration loss of the direct link between the end-users (UE) and the base station (BS). Our motivation is mainly based on the suitability of VLC technologies for indoor systems and their advantages to tackle forthcoming spectrum crunch, wide spectral availability, and easy bandwidth reuse. In the proposed system setup, the outdoor BS-relay is served by the backhaul RF links subject to double shadowing due to slow moving obstacles and pedestrian, while the data traffic inside the HST is conveyed by an indoor VLC system. Moreover, it is assumed that the relay node has imperfect channel state information (CSI) due to the mobility of the HST. We first statistically characterize the end-to-end signal-to-noise ratios (SNRs). Then, the performance is analyzed by deriving the exact closed-form expressions for key metrics such as outage probability and average bit error rate (BER).
新兴的第五代(5G)无线通信系统是支持高速列车(HSTs)高数据速率的有前途的解决方案。为了避免终端用户(UE)和基站(BS)之间直接链路的高穿透损耗,本文提出了一种协同混合射频(RF)-可见光通信(VLC)系统。我们的动机主要是基于VLC技术对室内系统的适用性,以及它们在解决即将到来的频谱紧张、广泛的频谱可用性和易于带宽重用方面的优势。在提出的系统设置中,室外bs中继由回程RF链路提供服务,由于缓慢移动的障碍物和行人,回程RF链路会受到双重阴影的影响,而HST内部的数据流量由室内VLC系统传输。此外,由于HST的移动性,假定中继节点具有不完善的信道状态信息(CSI)。我们首先统计表征端到端信噪比(SNRs)。然后,通过导出停机概率和平均误码率(BER)等关键指标的精确封闭表达式来分析性能。
{"title":"On the Performance of Mixed RF-VLC Relaying Systems for HST Communication","authors":"Rupender Singh, E. Yaacoub","doi":"10.1109/SPCOM55316.2022.9840803","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840803","url":null,"abstract":"The emerging fifth generation (5G) wireless communication systems are a promising solution to support the high data rate for high-speed trains (HSTs). In this article, we propose a cooperative mixed radio frequency (RF)-visible light communication (VLC) system to avoid the high penetration loss of the direct link between the end-users (UE) and the base station (BS). Our motivation is mainly based on the suitability of VLC technologies for indoor systems and their advantages to tackle forthcoming spectrum crunch, wide spectral availability, and easy bandwidth reuse. In the proposed system setup, the outdoor BS-relay is served by the backhaul RF links subject to double shadowing due to slow moving obstacles and pedestrian, while the data traffic inside the HST is conveyed by an indoor VLC system. Moreover, it is assumed that the relay node has imperfect channel state information (CSI) due to the mobility of the HST. We first statistically characterize the end-to-end signal-to-noise ratios (SNRs). Then, the performance is analyzed by deriving the exact closed-form expressions for key metrics such as outage probability and average bit error rate (BER).","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130706370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving the Performance of Zero-Resource Children’s ASR System through Formant and Duration Modification based Data Augmentation 基于形成峰和持续时间修改的数据增强提高零资源儿童ASR系统的性能
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840767
S. Shahnawazuddin, Vinit Kumar, Avinash Kumar, Waquar Ahmad
Developing an automatic speech recognition (ASR) system for children’s speech is extremely challenging due to the unavailability of data from the child domain for the majority of the languages. Consequently, in such zero-resource scenarios, we are forced to develop an ASR system using adults’ speech for transcribing data from child speakers. However, differences in formant frequencies and speaking-rate between the two groups of speakers degrade recognition performance. To reduce the said mismatch, out-of-domain data augmentation approaches based on formant and duration modification are proposed in this work. For that purpose, formant frequencies of adults’ speech training data are up-scaled using warping of linear predictive coding coefficients. Next, the speaking-rate of adults’ data is also increased through time-scale modification. Due to simultaneous altering of formant frequencies and duration of adults’ speech and then pooling the modified data into training, the acoustic mismatch due to the aforementioned factors gets reduced. This, in turn, enhances the recognition performance significantly. Additional improvement is obtained by combining the recently reported voice-conversion-based data augmentation technique with the proposed ones. On combining the proposed and voice-conversion-based data augmentation techniques, a relative reduction of nearly 32.3% in word error rate over the baseline is obtained.
由于大多数语言的儿童领域数据不可用,开发儿童语音自动识别(ASR)系统极具挑战性。因此,在这种零资源的情况下,我们被迫开发一个ASR系统,使用成人的语言来转录儿童说话者的数据。然而,两组说话者在共振频率和语速上的差异会降低识别性能。为了减少这种不匹配,本文提出了基于形成峰和持续时间修改的域外数据增强方法。为此,使用线性预测编码系数的扭曲来放大成人语音训练数据的形成峰频率。其次,通过时间尺度修正,也提高了成人数据的说话率。通过同时改变成人语音的共振峰频率和持续时间,然后将修改后的数据汇集到训练中,减少了由于上述因素造成的声学失配。这反过来又大大提高了识别性能。将最近报道的基于语音转换的数据增强技术与所提出的数据增强技术相结合,获得了额外的改进。将所提出的方法与基于语音转换的数据增强技术相结合,在基线的基础上,错误率相对降低了近32.3%。
{"title":"Improving the Performance of Zero-Resource Children’s ASR System through Formant and Duration Modification based Data Augmentation","authors":"S. Shahnawazuddin, Vinit Kumar, Avinash Kumar, Waquar Ahmad","doi":"10.1109/SPCOM55316.2022.9840767","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840767","url":null,"abstract":"Developing an automatic speech recognition (ASR) system for children’s speech is extremely challenging due to the unavailability of data from the child domain for the majority of the languages. Consequently, in such zero-resource scenarios, we are forced to develop an ASR system using adults’ speech for transcribing data from child speakers. However, differences in formant frequencies and speaking-rate between the two groups of speakers degrade recognition performance. To reduce the said mismatch, out-of-domain data augmentation approaches based on formant and duration modification are proposed in this work. For that purpose, formant frequencies of adults’ speech training data are up-scaled using warping of linear predictive coding coefficients. Next, the speaking-rate of adults’ data is also increased through time-scale modification. Due to simultaneous altering of formant frequencies and duration of adults’ speech and then pooling the modified data into training, the acoustic mismatch due to the aforementioned factors gets reduced. This, in turn, enhances the recognition performance significantly. Additional improvement is obtained by combining the recently reported voice-conversion-based data augmentation technique with the proposed ones. On combining the proposed and voice-conversion-based data augmentation techniques, a relative reduction of nearly 32.3% in word error rate over the baseline is obtained.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114701727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Long Short-Term Memory Model Based Microaneurysm Sequence Classification in Fundus Images 基于长短期记忆模型的眼底图像微动脉瘤序列分类
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840789
Renuka Acharya, N. Puhan
Diabetic Retinopathy (DR) has emerged as one of the serious medical conditions over the years leading to blindness among patients. Microaneurysms (MAs) are generally the earliest objective evidence of DR captured in fundus imaging. This work proposes a novel methodology based on long short-term memory (LSTM) to exploit the sequence dependencies of 1-D feature signals extracted from MAs and aid in their classification in colour fundus images. The model is trained using 1-dimensional intensity based signals generated from various patches of preprocessed fundus images. The model is tested on e-ophtha & ROC datasets and sensitivity scores are computed against seven unique values of false positive per image. The average of these scores is utilized as performance measurement of the proposed model which shows 66.6% and 60.5% sensitivity for e-ophtha and ROC datasets, respectively.
近年来,糖尿病视网膜病变(DR)已成为导致患者失明的严重疾病之一。微动脉瘤(MAs)通常是眼底成像中最早发现的DR客观证据。这项工作提出了一种基于长短期记忆(LSTM)的新方法,以利用从MAs提取的1-D特征信号的序列依赖性,并帮助它们在彩色眼底图像中的分类。该模型使用从预处理后的眼底图像的不同斑块产生的一维强度信号进行训练。该模型在e-ophtha和ROC数据集上进行了测试,并根据每个图像的七个唯一假阳性值计算灵敏度分数。这些分数的平均值被用来衡量所提出的模型的性能,该模型对e-ophtha和ROC数据集的灵敏度分别为66.6%和60.5%。
{"title":"Long Short-Term Memory Model Based Microaneurysm Sequence Classification in Fundus Images","authors":"Renuka Acharya, N. Puhan","doi":"10.1109/SPCOM55316.2022.9840789","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840789","url":null,"abstract":"Diabetic Retinopathy (DR) has emerged as one of the serious medical conditions over the years leading to blindness among patients. Microaneurysms (MAs) are generally the earliest objective evidence of DR captured in fundus imaging. This work proposes a novel methodology based on long short-term memory (LSTM) to exploit the sequence dependencies of 1-D feature signals extracted from MAs and aid in their classification in colour fundus images. The model is trained using 1-dimensional intensity based signals generated from various patches of preprocessed fundus images. The model is tested on e-ophtha & ROC datasets and sensitivity scores are computed against seven unique values of false positive per image. The average of these scores is utilized as performance measurement of the proposed model which shows 66.6% and 60.5% sensitivity for e-ophtha and ROC datasets, respectively.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115850597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Automatic Depression Detection Based on Merged Convolutional Neural Networks using Facial Features 基于融合卷积神经网络的面部特征自动抑郁检测
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840812
Renuka Acharya, Soumya P. Dash
Clinical depression is one of the crucial medical conditions affecting a substantial portion of today’s world population. This paper proposes a novel technology based on deep learning tools for efficient detection of clinical depression in potential patients. A novel algorithm is proposed to label a recent AFEW-VA dataset in terms of creating the depressed class and the non-depressed class based on the valence and arousal values for various individuals from their video frames. Furthermore, the full facial regions, the eye regions, and the mouth regions from the classified dataset are extracted as the regions of interest (ROIs) to be utilized to train three different pre-trained 2DCNN models, namely, ResNet50, VGG16, and InceptionV3 by using transfer learning. For each 2D-CNN architecture, a novel algorithm is proposed to merge the models trained on the three ROIs. It is observed that the merged model, combining all the three ROIs outperforms the individual models or a merged model merging only two of the three ROIs in terms of obtaining a higher accuracy of depression detection. It is also observed that the merged models based on the ResNet50 architecture results in the best accuracy value of 0.95 as compared to the VGG16 and InceptionV3 architectures.
临床抑郁症是影响当今世界相当一部分人口的关键医疗条件之一。本文提出了一种基于深度学习工具的新技术,用于有效检测潜在患者的临床抑郁症。提出了一种新的算法来标记最近的AFEW-VA数据集,根据不同个体的视频帧的效价和唤醒值来创建抑郁类和非抑郁类。此外,从分类数据集中提取全面部区域、眼睛区域和嘴巴区域作为感兴趣区域(roi),利用迁移学习方法训练三个不同的预训练2DCNN模型,即ResNet50、VGG16和InceptionV3。针对每一种2D-CNN结构,提出了一种新的算法来合并在三个roi上训练的模型。可以看出,在获得更高的抑郁检测精度方面,合并所有三个roi的合并模型优于单个模型或仅合并三个roi中的两个的合并模型。我们还观察到,与VGG16和InceptionV3架构相比,基于ResNet50架构的合并模型获得了0.95的最佳精度值。
{"title":"Automatic Depression Detection Based on Merged Convolutional Neural Networks using Facial Features","authors":"Renuka Acharya, Soumya P. Dash","doi":"10.1109/SPCOM55316.2022.9840812","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840812","url":null,"abstract":"Clinical depression is one of the crucial medical conditions affecting a substantial portion of today’s world population. This paper proposes a novel technology based on deep learning tools for efficient detection of clinical depression in potential patients. A novel algorithm is proposed to label a recent AFEW-VA dataset in terms of creating the depressed class and the non-depressed class based on the valence and arousal values for various individuals from their video frames. Furthermore, the full facial regions, the eye regions, and the mouth regions from the classified dataset are extracted as the regions of interest (ROIs) to be utilized to train three different pre-trained 2DCNN models, namely, ResNet50, VGG16, and InceptionV3 by using transfer learning. For each 2D-CNN architecture, a novel algorithm is proposed to merge the models trained on the three ROIs. It is observed that the merged model, combining all the three ROIs outperforms the individual models or a merged model merging only two of the three ROIs in terms of obtaining a higher accuracy of depression detection. It is also observed that the merged models based on the ResNet50 architecture results in the best accuracy value of 0.95 as compared to the VGG16 and InceptionV3 architectures.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117067810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Gait Recognition under Different Covariate Conditions using Deep Learning Technique 基于深度学习技术的不同协变量条件下步态识别
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840818
Iman Junaid, S. Ari
Gait as a biometric has become a popular research topic in recent years as a result of its numerous applications in sectors such as surveillance, authentication, and so on. It is capable of achieving detection at a distance that few other technologies can equal. It is still a difficult problem to solve since real human gait is influenced by several variable elements such as alterations in clothing, speed, and carrying situation. Also, unknown covariate circumstances may impact the training and testing settings for a specific individual in gait recognition. Image sequences are typically used by computer-aided gait recognition systems without taking into account variables such as clothes and the contents of carrier bags while on the move. In this work, we provide a technique for selecting gait energy image-based (GEI) features, that is both effective and robust. The covariate factors have less impact on the given gait representation. A simple ten-layered convolutional neural network (CNN) is designed which intakes GEI as input. Several typical variations and occlusions that impact and worsen gait recognition ability are less susceptible to the suggested method. For both clothing and mobility variations, we used the CASIA datasets to assess our observations. The experimental findings reveal that in numerous circumstances, the deep neural network model created in this study achieved better results when compared with other existing algorithms.
步态作为一种生物特征识别技术,近年来在监控、身份认证等领域得到了广泛的应用,已成为一个热门的研究课题。它能够实现其他技术无法企及的远距离探测。由于真实的人的步态受到许多可变因素的影响,如衣服的改变、速度和携带环境,因此仍然是一个难以解决的问题。此外,未知的协变量环境可能会影响步态识别中特定个体的训练和测试设置。计算机辅助步态识别系统通常使用图像序列,而不考虑移动时的衣服和手提袋内容等变量。在这项工作中,我们提供了一种既有效又鲁棒的基于步态能量图像(GEI)特征选择技术。协变量因素对给定步态表示的影响较小。以GEI为输入,设计了一个简单的十层卷积神经网络(CNN)。几种影响和恶化步态识别能力的典型变异和闭塞不太容易受到建议的方法的影响。对于服装和活动的变化,我们使用CASIA数据集来评估我们的观察结果。实验结果表明,在许多情况下,与其他现有算法相比,本研究创建的深度神经网络模型取得了更好的结果。
{"title":"Gait Recognition under Different Covariate Conditions using Deep Learning Technique","authors":"Iman Junaid, S. Ari","doi":"10.1109/SPCOM55316.2022.9840818","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840818","url":null,"abstract":"Gait as a biometric has become a popular research topic in recent years as a result of its numerous applications in sectors such as surveillance, authentication, and so on. It is capable of achieving detection at a distance that few other technologies can equal. It is still a difficult problem to solve since real human gait is influenced by several variable elements such as alterations in clothing, speed, and carrying situation. Also, unknown covariate circumstances may impact the training and testing settings for a specific individual in gait recognition. Image sequences are typically used by computer-aided gait recognition systems without taking into account variables such as clothes and the contents of carrier bags while on the move. In this work, we provide a technique for selecting gait energy image-based (GEI) features, that is both effective and robust. The covariate factors have less impact on the given gait representation. A simple ten-layered convolutional neural network (CNN) is designed which intakes GEI as input. Several typical variations and occlusions that impact and worsen gait recognition ability are less susceptible to the suggested method. For both clothing and mobility variations, we used the CASIA datasets to assess our observations. The experimental findings reveal that in numerous circumstances, the deep neural network model created in this study achieved better results when compared with other existing algorithms.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"174 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117194507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Binaural Spatial Transform for Multi-source Localization determining Angular Extent of Ensemble Source Width 多源定位的双耳空间变换确定集合源宽度角范围
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840782
S. Arthi, T. Sreenivas
In the case of ensemble like distributed presentation, inter-aural cross-correlation (IACC) is correlated with the measure for source width extension. We question the validity of this measure and develop an angular measure for ensemble source width. In this work, we distinguish the binaural correlation functions of localized source, source with reverberation and ensemble source. We also develop a novel phase-only spatial transform to localize as many sources as possible. The angular separation between the spatial extrema sources can be a physical measure for ensemble source width. We also observe that phase-only HRIR cross-correlation functions act as listener dependent functional bases for localizing multiple sources. We observe these functional bases are wavelet-like and their signature are listener dependent and direction dependent. We extend the spatial transform to time-varying short-time spatial transform and define “Spatio-gram” to understand the effect of time-varying nature of the signal.
在集成式分布式表示中,耳间互相关(IACC)与源宽度扩展度量相关。我们对这种测量方法的有效性提出了质疑,并提出了一种集成源宽度的角度测量方法。本文区分了局域声源、混响声源和合奏声源的双耳相关函数。我们还开发了一种新的纯相位空间变换来定位尽可能多的源。空间极值源之间的角间距可以作为系综源宽度的物理度量。我们还观察到,仅相位的HRIR互相关函数作为侦听器依赖的功能基础,用于定位多个源。我们观察到这些功能基是小波样的,它们的特征依赖于听者和方向。我们将空间变换扩展到时变短时空间变换,并定义了“空间图”来理解信号时变特性的影响。
{"title":"Binaural Spatial Transform for Multi-source Localization determining Angular Extent of Ensemble Source Width","authors":"S. Arthi, T. Sreenivas","doi":"10.1109/SPCOM55316.2022.9840782","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840782","url":null,"abstract":"In the case of ensemble like distributed presentation, inter-aural cross-correlation (IACC) is correlated with the measure for source width extension. We question the validity of this measure and develop an angular measure for ensemble source width. In this work, we distinguish the binaural correlation functions of localized source, source with reverberation and ensemble source. We also develop a novel phase-only spatial transform to localize as many sources as possible. The angular separation between the spatial extrema sources can be a physical measure for ensemble source width. We also observe that phase-only HRIR cross-correlation functions act as listener dependent functional bases for localizing multiple sources. We observe these functional bases are wavelet-like and their signature are listener dependent and direction dependent. We extend the spatial transform to time-varying short-time spatial transform and define “Spatio-gram” to understand the effect of time-varying nature of the signal.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123567931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement Learning for Spectrum Prediction and EE Maximization in D2D Communication 强化学习用于 D2D 通信中的频谱预测和 EE 最大化
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840772
S. Maity, K. Sinha, B. Sinha, Reema Kumari
This paper proposes a reinforcement learning (RL) based Q-learning to address the issues of joint spectrum prediction (SP) and device-to-device (D2D) data communication in cognitive radio (CR) framework. An optimization problem is formulated that addresses energy efficiency (EE) maximization of D2D communications under the constraints of its total transmit power and a certain data transmission rate while meeting an interference threshold and cooperation rate in primary user (PU) transmission. The high accuracy in SP offers reward as an improvement on EE while a compulsion of meeting an interference threshold and a penalty on PU data transmission are made based on the relative degree of wrong prediction. A large set of simulation results shows that the proposed method offers 30% gain in EE while 20% reduction in data collision with PU over the existing works.
本文提出了一种基于强化学习(RL)的 Q-learning,以解决认知无线电(CR)框架中联合频谱预测(SP)和设备到设备(D2D)数据通信的问题。本文提出了一个优化问题,即在满足干扰阈值和主用户(PU)传输合作率的同时,在总发射功率和一定数据传输速率的约束条件下实现 D2D 通信的能效(EE)最大化。SP 的高精确度可作为对 EE 的改进提供奖励,而满足干扰阈值的强制要求和对 PU 数据传输的惩罚则基于错误预测的相对程度。大量仿真结果表明,与现有方法相比,所提出的方法在 EE 方面提高了 30%,而在与 PU 的数据碰撞方面减少了 20%。
{"title":"Reinforcement Learning for Spectrum Prediction and EE Maximization in D2D Communication","authors":"S. Maity, K. Sinha, B. Sinha, Reema Kumari","doi":"10.1109/SPCOM55316.2022.9840772","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840772","url":null,"abstract":"This paper proposes a reinforcement learning (RL) based Q-learning to address the issues of joint spectrum prediction (SP) and device-to-device (D2D) data communication in cognitive radio (CR) framework. An optimization problem is formulated that addresses energy efficiency (EE) maximization of D2D communications under the constraints of its total transmit power and a certain data transmission rate while meeting an interference threshold and cooperation rate in primary user (PU) transmission. The high accuracy in SP offers reward as an improvement on EE while a compulsion of meeting an interference threshold and a penalty on PU data transmission are made based on the relative degree of wrong prediction. A large set of simulation results shows that the proposed method offers 30% gain in EE while 20% reduction in data collision with PU over the existing works.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126784659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sum Rate and Outage Performance of Relay-Aided NOMA Over Power Line Communication 电力线上中继辅助NOMA通信的和速率和中断性能
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840845
Roopesh Ramesh, Sanjeev Gurugopinath
We present a non-orthogonal multiple access (NOMA)-based coooperative scheme for relay-assisted power line communication (PLC) systems. The network consists of a source (S) modem, a decode-and-forward relay (R) modem and a destination (D) modem. In the first time slot, S communicates to both R and D using NOMA, while S and R simultaneously communicate with D in the second time slot. We derive closed form expressions for the approximate average sum rate and overall outage probability of the network. Through Monte Carlo simulations and numerical techniques we show that the approximations used in our analysis are tight. Furthermore, we show that our scheme outperforms a single-stage cooperative relaying NOMA scheme for PLC proposed in the earlier literature, in terms of outage probability and sum rate.
提出了一种基于非正交多址(NOMA)的中继辅助电力线通信(PLC)系统协同方案。该网络由源(S)调制解调器、解码转发中继(R)调制解调器和目的(D)调制解调器组成。在第一个时隙,S使用NOMA与R和D同时通信,而在第二个时隙,S和R同时与D通信。我们导出了网络的近似平均和率和总中断概率的封闭表达式。通过蒙特卡罗模拟和数值技术,我们表明在我们的分析中使用的近似是紧密的。此外,我们表明,我们的方案优于先前文献中提出的PLC单级合作中继NOMA方案,在停电概率和求和率方面。
{"title":"Sum Rate and Outage Performance of Relay-Aided NOMA Over Power Line Communication","authors":"Roopesh Ramesh, Sanjeev Gurugopinath","doi":"10.1109/SPCOM55316.2022.9840845","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840845","url":null,"abstract":"We present a non-orthogonal multiple access (NOMA)-based coooperative scheme for relay-assisted power line communication (PLC) systems. The network consists of a source (S) modem, a decode-and-forward relay (R) modem and a destination (D) modem. In the first time slot, S communicates to both R and D using NOMA, while S and R simultaneously communicate with D in the second time slot. We derive closed form expressions for the approximate average sum rate and overall outage probability of the network. Through Monte Carlo simulations and numerical techniques we show that the approximations used in our analysis are tight. Furthermore, we show that our scheme outperforms a single-stage cooperative relaying NOMA scheme for PLC proposed in the earlier literature, in terms of outage probability and sum rate.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134322762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Real Time Smart Music Player Using Facial Expression 使用面部表情的实时智能音乐播放器
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840806
Natha Harika, T. Kumar
According to the World Happiness Report 2021, created by the World Happiness Council, it has ranked 140 countries based on how satisfied their citizens are. Due to COVID-19 pandemic, they observed there was large decline in mental health because of unemployment and decrease inperson gatherings resulting in a decrease in the happiness index of countries. Therefore, in this work we targeted to improve the emotional state of the person and make him happy, by recognizing the persons emotion and plays corresponding music will help user in changing their mood. Music's magical power has been scientifically established and people enjoy listening to music that reflects their emotional feelings, it is a stress-relieving tool and has the ability to control a wide range of psychological states. We used Viola Jones algorithm, Data augmentation and CoAtNet algorithm to detect the emotion of a person. A high accuracy is achieved with proposed CoAtNet model when compared to other methods like Conventional CNN, Principal Component Analysis (PCA) and SVM etc. We have also deployed the model on the STM32H747I Board.
根据世界幸福理事会发布的《2021年世界幸福报告》,该报告根据各国公民的幸福程度对140个国家进行了排名。他们观察到,由于新冠肺炎大流行,失业和面对面聚会的减少导致各国的幸福指数下降,导致心理健康大幅下降。因此,在这个作品中,我们的目标是改善人的情绪状态,让他快乐,通过识别人的情绪并播放相应的音乐来帮助用户改变他们的情绪。音乐的神奇力量已经被科学地证实,人们喜欢听反映他们情绪感受的音乐,它是一种缓解压力的工具,有能力控制广泛的心理状态。我们使用Viola Jones算法、Data augmentation和CoAtNet算法来检测一个人的情绪。与传统的CNN、主成分分析(PCA)和支持向量机(SVM)等方法相比,本文提出的CoAtNet模型具有较高的准确率。我们还将该模型部署在STM32H747I板上。
{"title":"Real Time Smart Music Player Using Facial Expression","authors":"Natha Harika, T. Kumar","doi":"10.1109/SPCOM55316.2022.9840806","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840806","url":null,"abstract":"According to the World Happiness Report 2021, created by the World Happiness Council, it has ranked 140 countries based on how satisfied their citizens are. Due to COVID-19 pandemic, they observed there was large decline in mental health because of unemployment and decrease inperson gatherings resulting in a decrease in the happiness index of countries. Therefore, in this work we targeted to improve the emotional state of the person and make him happy, by recognizing the persons emotion and plays corresponding music will help user in changing their mood. Music's magical power has been scientifically established and people enjoy listening to music that reflects their emotional feelings, it is a stress-relieving tool and has the ability to control a wide range of psychological states. We used Viola Jones algorithm, Data augmentation and CoAtNet algorithm to detect the emotion of a person. A high accuracy is achieved with proposed CoAtNet model when compared to other methods like Conventional CNN, Principal Component Analysis (PCA) and SVM etc. We have also deployed the model on the STM32H747I Board.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"140 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132499381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On Effect of Different Sequence Distributions on ISI in an MCvD System MCvD系统中不同序列分布对ISI的影响
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840783
Tamoghno Nath, K. Benerjee, Adrish Banerjee
In a Molecular-Communication-via-Diffusion (MCvD) channel, the molecules follow a simple Brownian motion that leads to an irregular arrival of the molecules at the receiver and introduces Inter-Symbol-Interference (ISI) in the channel. In this work, we have used different sequence distributions to analyze the effect of ISI in an MCvD channel. It has been shown that the ISI strictly depends on the location of bit-1s in the sequence, and accordingly, the expected ISI has been computed for all the proposed sequences based on the bit-1 positions in the sequence. We have also derived an upper bound on the expected ISI for the proposed sequences. We have shown that One-at-Starting-Position (OSP) sequence shows the best performance among all the proposed sequence distributions, with the expected ISI converging to a constant value. Simulation results also corroborate that the OSP sequence provides the lowest ISI in an MCvD channel compared to other codes studied in the literature.
在分子扩散通信(MCvD)通道中,分子遵循简单的布朗运动,导致分子不规则到达接收器,并在通道中引入符号间干扰(ISI)。在这项工作中,我们使用不同的序列分布来分析ISI在MCvD通道中的影响。结果表明,ISI严格依赖于序列中bit-1的位置,因此,根据序列中bit-1的位置计算了所有提议序列的期望ISI。我们还推导了所提出序列的期望ISI的上界。我们已经证明,在所有提出的序列分布中,1 -at- startingposition (OSP)序列表现出最好的性能,期望ISI收敛到一个常数值。仿真结果还证实,与文献中研究的其他代码相比,OSP序列在MCvD通道中提供了最低的ISI。
{"title":"On Effect of Different Sequence Distributions on ISI in an MCvD System","authors":"Tamoghno Nath, K. Benerjee, Adrish Banerjee","doi":"10.1109/SPCOM55316.2022.9840783","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840783","url":null,"abstract":"In a Molecular-Communication-via-Diffusion (MCvD) channel, the molecules follow a simple Brownian motion that leads to an irregular arrival of the molecules at the receiver and introduces Inter-Symbol-Interference (ISI) in the channel. In this work, we have used different sequence distributions to analyze the effect of ISI in an MCvD channel. It has been shown that the ISI strictly depends on the location of bit-1s in the sequence, and accordingly, the expected ISI has been computed for all the proposed sequences based on the bit-1 positions in the sequence. We have also derived an upper bound on the expected ISI for the proposed sequences. We have shown that One-at-Starting-Position (OSP) sequence shows the best performance among all the proposed sequence distributions, with the expected ISI converging to a constant value. Simulation results also corroborate that the OSP sequence provides the lowest ISI in an MCvD channel compared to other codes studied in the literature.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116032464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2022 IEEE International Conference on Signal Processing and Communications (SPCOM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1