2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)最新文献

英文中文

Real Time Audio Synchronization Using Audio Fingerprinting Techniques 使用音频指纹技术的实时音频同步

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

Pub Date : 2022-05-06 DOI: 10.1109/PCEMS55161.2022.9808050

Tarun Kumar Yadav, Gautam Sanjeev Bidari, Adwait Abhay Pande, K. Surender

In this day and age of widespread multimedia content, it is quite common to listen to a song, wish to identify it and also continue to listen to it in sync, even in the absence of the original sound source. However, this is quite difficult to achieve in noisy environments or in radio transmissions, due to features like time-stretching. Despite the presence of popular query-by-example apps like shazam, there are no applications that combine audio recognition with real-time synchronization of the song. This paper attempts to present a novel method of realtime audio synchronization by making use of established audio fingerprinting techniques and proposing a scaleable distributed handling mechanism for handling larger databases.

在这个多媒体内容广泛传播的时代，即使在没有原始声源的情况下，听一首歌，希望识别它并继续同步听它是相当普遍的。然而，这在嘈杂的环境或无线电传输中很难实现，因为有时间拉伸等特征。尽管有shazam等流行的按例查询应用程序，但还没有将音频识别与歌曲实时同步结合起来的应用程序。本文试图利用现有的音频指纹技术，提出一种新的实时音频同步方法，并提出一种可扩展的分布式处理机制，用于处理大型数据库。

引用次数: 1

Intelligent Billing system using Object Detection 使用对象检测的智能计费系统

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

Pub Date : 2022-05-06 DOI: 10.1109/PCEMS55161.2022.9807953

Neeraj Chidella, N. K. Reddy, Nicole Reddy, Maddi Mohan, Joydeep Sengupta

With the rapidly increasing technology and development in machine learning, deep learning and artificial intelligence, improving the billing system is an effective means of reducing wastage of time. Nowadays, even though barcode scanners have become as fast as ever but for fruits and vegetables, it still needs to be entered manually into the computer which is very time taking and hectic process. Vegetable and fruit markets have become an integral part of our life hence in such places the environment must be made hassle free and more importantly, the billing should be less laborious and efficient without wasting time. In order to overcome the existing problems associated with the barcode and RFID tags, we proposed an automatic billing system that detects the fruits and vegetables and then displays the final Bill. The main objective of this project is to detect the fruits, display the fruits detected and then to bill these items. To achieve this, we have used two different algorithms, 1) Fine tuned Convolutional Neural Network that we built from base model. 2) To increase accuracy for real time object detection and for the bounding boxes to be displayed, we used state of the art YOLO based on pytorch as YOLO predicts the bounding boxes and detects the object faster than other detection algorithms and is more reliable.

随着机器学习、深度学习、人工智能等技术的飞速发展，完善计费系统是减少时间浪费的有效手段。如今，尽管条形码扫描器的速度比以往任何时候都快，但对于水果和蔬菜，仍然需要手动输入到计算机中，这是一个非常耗时和繁忙的过程。蔬菜和水果市场已经成为我们生活中不可或缺的一部分，因此在这样的地方，环境必须是无麻烦的，更重要的是，账单应该不浪费时间，不那么费力和高效。为了克服条形码和RFID标签存在的问题，我们提出了一种自动计费系统，它可以检测水果和蔬菜，然后显示最终的账单。这个项目的主要目标是检测水果，显示检测到的水果，然后对这些项目进行计费。为了实现这一目标，我们使用了两种不同的算法:1)基于基础模型构建的微调卷积神经网络。2)为了提高实时目标检测和显示边界框的准确性，我们使用了基于pytorch的最先进的YOLO，因为YOLO预测边界框并比其他检测算法更快地检测目标，并且更可靠。

{"title":"Intelligent Billing system using Object Detection","authors":"Neeraj Chidella, N. K. Reddy, Nicole Reddy, Maddi Mohan, Joydeep Sengupta","doi":"10.1109/PCEMS55161.2022.9807953","DOIUrl":"https://doi.org/10.1109/PCEMS55161.2022.9807953","url":null,"abstract":"With the rapidly increasing technology and development in machine learning, deep learning and artificial intelligence, improving the billing system is an effective means of reducing wastage of time. Nowadays, even though barcode scanners have become as fast as ever but for fruits and vegetables, it still needs to be entered manually into the computer which is very time taking and hectic process. Vegetable and fruit markets have become an integral part of our life hence in such places the environment must be made hassle free and more importantly, the billing should be less laborious and efficient without wasting time. In order to overcome the existing problems associated with the barcode and RFID tags, we proposed an automatic billing system that detects the fruits and vegetables and then displays the final Bill. The main objective of this project is to detect the fruits, display the fruits detected and then to bill these items. To achieve this, we have used two different algorithms, 1) Fine tuned Convolutional Neural Network that we built from base model. 2) To increase accuracy for real time object detection and for the bounding boxes to be displayed, we used state of the art YOLO based on pytorch as YOLO predicts the bounding boxes and detects the object faster than other detection algorithms and is more reliable.","PeriodicalId":248874,"journal":{"name":"2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129427349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Experimental Design and Implementation of RFID based Clinical Medicine Dispenser 基于RFID的临床药品分配器的实验设计与实现

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

Pub Date : 2022-05-06 DOI: 10.1109/PCEMS55161.2022.9807886

Ridita Garg, Isha Bhatt, K. Eashwer, S. Jindal

Elders depend on their medicines to keep them stable, yet complex prescription timetables can spur confusion like missing dosages, wrong medication, or medications at some wrong time. These slip-ups could lead to a redundant specialist or clinic visits, sickness, and even demise. Consequently, there is a need to plan a Medication Dispenser to assist elders with taking the drug on time. This would dissuade impromptu clinic or specialist visits connected with mistaken medicine use. The objective involves the development of an intelligent device that dispenses the medicines on the advised schedule. This work requires interfacing LCD, motor, and interfacing RFID reader with an 8051 microcontroller.

老年人依靠他们的药物来保持他们的稳定，然而复杂的处方时间表可能会引起混乱，比如错过剂量，错误的药物，或者在错误的时间服药。这些疏忽可能会导致多余的专家或诊所就诊，生病，甚至死亡。因此，有必要计划一个药物分发器，以帮助老年人按时服药。这将劝阻临时诊所或专家访问与错误用药有关。目标包括开发一种智能设备，按照建议的时间表分配药物。这项工作需要将LCD，电机和RFID读取器与8051微控制器连接起来。

引用次数: 0

Identification and Localization of COVID-19 Abnormalities on Chest Radiographs using Ensembled Deep Neural Networks 利用集成深度神经网络识别和定位胸片上的COVID-19异常

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

Pub Date : 2022-05-06 DOI: 10.1109/PCEMS55161.2022.9807972

Manikiran Kommidi, Anudeep Chinta, Tarun Kumar Dachepally, Srilatha Chebrolu

With the hit of the global pandemic COVID-19, the chest X-ray domain has gained prominence. It has been recognised as one of the principal methods to learn the presence of infection and its effect on various internal organs like the lungs. Chest radiographs show abnormalities due to COVID-19 that appear similar to the anomalies caused by other viruses and bacteria, thus making it challenging for technicians to detect. Therefore, it becomes almost inevitable to have a computer vision model that identifies and localizes the COVID-19 virus to help doctors provide an immediate and confident diagnosis. The models in computer vision tasks have seen considerable advancements in deep learning, so the proposed model tried to integrate a few of them to come up with a model for classifying and localising the diagnosis of COVID-19 using chest X-rays. This paper ensembles a few state-of-the-art models in classification and object detection to build a model for chest radiograph diagnosis. The proposed ensembled model is found to achieve the mean Average Precision value of 0.627 on SIIM-FISABIO-RSNA COVID-19 dataset.

随着COVID-19全球大流行的打击，胸部x射线领域得到了重视。它被认为是了解感染的存在及其对各种内脏器官(如肺)影响的主要方法之一。胸部x线片显示COVID-19引起的异常与其他病毒和细菌引起的异常相似，因此对技术人员来说很难检测到。因此，拥有能够识别和定位COVID-19病毒的计算机视觉模型，以帮助医生提供即时和自信的诊断，几乎是不可避免的。计算机视觉任务中的模型在深度学习方面取得了相当大的进步，因此所提出的模型试图整合其中的一些模型，以提出一个模型，用于使用胸部x射线对COVID-19的诊断进行分类和定位。本文综合了几种目前最先进的分类和目标检测模型，建立了胸片诊断模型。在SIIM-FISABIO-RSNA COVID-19数据集上，所提出的集成模型的平均精度为0.627。

{"title":"Identification and Localization of COVID-19 Abnormalities on Chest Radiographs using Ensembled Deep Neural Networks","authors":"Manikiran Kommidi, Anudeep Chinta, Tarun Kumar Dachepally, Srilatha Chebrolu","doi":"10.1109/PCEMS55161.2022.9807972","DOIUrl":"https://doi.org/10.1109/PCEMS55161.2022.9807972","url":null,"abstract":"With the hit of the global pandemic COVID-19, the chest X-ray domain has gained prominence. It has been recognised as one of the principal methods to learn the presence of infection and its effect on various internal organs like the lungs. Chest radiographs show abnormalities due to COVID-19 that appear similar to the anomalies caused by other viruses and bacteria, thus making it challenging for technicians to detect. Therefore, it becomes almost inevitable to have a computer vision model that identifies and localizes the COVID-19 virus to help doctors provide an immediate and confident diagnosis. The models in computer vision tasks have seen considerable advancements in deep learning, so the proposed model tried to integrate a few of them to come up with a model for classifying and localising the diagnosis of COVID-19 using chest X-rays. This paper ensembles a few state-of-the-art models in classification and object detection to build a model for chest radiograph diagnosis. The proposed ensembled model is found to achieve the mean Average Precision value of 0.627 on SIIM-FISABIO-RSNA COVID-19 dataset.","PeriodicalId":248874,"journal":{"name":"2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132897285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

ARMPC - ARIMA based prediction model for Adaptive Bitrate Scheme in Streaming 基于腋窝- ARIMA的自适应码率流预测模型

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

Pub Date : 2022-05-06 DOI: 10.1109/PCEMS55161.2022.9807874

Sankalp Naik, Osama Khan, Ashay Katre, A. Keskar

With the boom of the internet, online streaming algorithms such as Adaptive bitrate streaming have gained prominence. The Adaptive Bitrate (ABR) scheme uses the Model Predictive Control (MPC) to determine the best possible bitrate for the given network conditions. Though this method works well, the major disadvantage is its heavy reliance on the throughput prediction error which makes it difficult to perform well in congested network conditions. Other methods such as DeepMPC have also been explored in this paper which use the Deep Learning algorithms to predict the bandwidth. These work better than the trivial harmonic predictor but demand high computational power. This paper proposes ARMPC which uses the Auto-Regressive Integrated Moving Average Technique (ARIMA) to predict the future bandwidth. Using trace-driven experiments, we have shown both mathematically and practically that the ARMPC can provide us with improvements in both the prediction and the computational points of view.

随着互联网的蓬勃发展，像自适应比特率流这样的在线流媒体算法已经得到了重视。自适应比特率(ABR)方案使用模型预测控制(MPC)来确定给定网络条件下可能的最佳比特率。该方法虽然效果良好，但其主要缺点是严重依赖吞吐量预测误差，难以在拥塞网络条件下发挥良好的性能。本文还探讨了使用深度学习算法预测带宽的其他方法，如DeepMPC。这些方法比普通的谐波预测器效果更好，但需要较高的计算能力。本文提出了利用自回归综合移动平均技术(ARIMA)预测未来带宽的胳肢窝算法。通过跟踪驱动的实验，我们已经在数学上和实践上证明了腋窝可以在预测和计算方面为我们提供改进。

引用次数: 2

Smart Helmet For Coal Miners 煤矿工人的智能头盔

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

Pub Date : 2022-05-06 DOI: 10.1109/PCEMS55161.2022.9807899

Rohan Paul, Arpan Karar, Abhirup Datta, S. Jindal

A traditional model of the smart helmet has been produced to assist miners operating in the mining industry. Many risky incidents commonly occur in the mining sector, many of which result in life-threatening injuries or death. A miner’s helmet is one of the most regularly used safety equipment for mine workers hence it must be loaded with some more advanced features. With the use of different sensors, the smart helmet will be able to identify catastrophic situations such as harmful gases like Carbon-Monoxide, CH4, LPG, and natural gases. Whether the miner is wearing his helmet or not is detected by an infrared sensor. Each sensor has a critical value that, if exceeded, causes the buzzer to activate and the LEDs to illuminate, signaling the miners and supervisors. The GPS module fitted in the miners’ helmets allows the mining officials to readily track their locations. Furthermore, a Panic Button has been implemented, which, when pressed, sends an emergency signal via mail to higher authorities outside the mines. A mobile application has also been created to display all of the data supplied wirelessly from the sensors. As a result, the proposed smart helmet protects miners from any upcoming accidents.

智能头盔的传统模型已经生产出来，以帮助矿工在采矿业中操作。采矿部门经常发生许多危险事故，其中许多事故造成危及生命的伤害或死亡。矿工头盔是矿工最常用的安全设备之一，因此它必须装载一些更先进的功能。通过使用不同的传感器，智能头盔将能够识别灾难性的情况，如一氧化碳、甲烷、液化石油气和天然气等有害气体。矿工是否戴头盔由红外传感器检测。每个传感器都有一个临界值，如果超过临界值，蜂鸣器就会启动，led灯就会亮起，向矿工和监工发出信号。安装在矿工头盔上的GPS模块可以让采矿官员随时追踪他们的位置。此外，还设置了一个紧急按钮，按下该按钮后，将通过邮件向矿井外的上级发出紧急信号。一个移动应用程序也被创建出来，用来显示传感器无线提供的所有数据。因此，拟议中的智能头盔可以保护矿工免受任何即将发生的事故。

{"title":"Smart Helmet For Coal Miners","authors":"Rohan Paul, Arpan Karar, Abhirup Datta, S. Jindal","doi":"10.1109/PCEMS55161.2022.9807899","DOIUrl":"https://doi.org/10.1109/PCEMS55161.2022.9807899","url":null,"abstract":"A traditional model of the smart helmet has been produced to assist miners operating in the mining industry. Many risky incidents commonly occur in the mining sector, many of which result in life-threatening injuries or death. A miner’s helmet is one of the most regularly used safety equipment for mine workers hence it must be loaded with some more advanced features. With the use of different sensors, the smart helmet will be able to identify catastrophic situations such as harmful gases like Carbon-Monoxide, CH4, LPG, and natural gases. Whether the miner is wearing his helmet or not is detected by an infrared sensor. Each sensor has a critical value that, if exceeded, causes the buzzer to activate and the LEDs to illuminate, signaling the miners and supervisors. The GPS module fitted in the miners’ helmets allows the mining officials to readily track their locations. Furthermore, a Panic Button has been implemented, which, when pressed, sends an emergency signal via mail to higher authorities outside the mines. A mobile application has also been created to display all of the data supplied wirelessly from the sensors. As a result, the proposed smart helmet protects miners from any upcoming accidents.","PeriodicalId":248874,"journal":{"name":"2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130266543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Relay Selection in SWIPT-enabled Cooperative Networks 支持swift的合作网络中的中继选择

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

Pub Date : 2022-05-06 DOI: 10.1109/PCEMS55161.2022.9807973

Vaijayanti Panse, T. Jain, A. Kothari

The radio frequency energy harvesting (RF-EH) technique provides a potential way to power the battery-constrained wireless devices in the future generation wireless networks. In this paper, we investigate a dual-hop decode-and-forward (DF) cooperative network with RF-EH using non-linear hybrid power-time-splitting (PTS) based model. In the proposed system, the best relay is obtained by considering the instantaneous signal-to-noise ratios (SNRs) of source (S) to relay (R) links using three selection schemes, namely, absolute SNR-based selection, normalized SNR-based selection and random selection. Considering the DF protocol at R, we evaluate the outage and throughput performances of the system over independent and identically distributed Rayleigh fading channels. The derived results are validated through Monte-Carlo simulations.

射频能量收集(RF-EH)技术为下一代无线网络中电池受限的无线设备供电提供了一种潜在的方法。本文利用非线性混合功率-时间分裂(PTS)模型研究了一种具有RF-EH的双跳译码转发(DF)合作网络。在该系统中，通过考虑源链路(S)与中继链路(R)的瞬时信噪比(SNRs)，采用绝对信噪比选择、归一化信噪比选择和随机选择三种选择方案获得最佳中继。考虑R点的DF协议，我们评估了系统在独立和同分布瑞利衰落信道上的中断和吞吐量性能。通过蒙特卡罗仿真验证了所得结果。

引用次数: 1

Metasurface based microstrip patch antenna at 11GHz frequency for enhanced gain and directivity 基于超表面的11GHz频率微带贴片天线，增强增益和指向性

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

Pub Date : 2022-05-06 DOI: 10.1109/PCEMS55161.2022.9807891

Pallavi S. Kadam, P. Manikanta, N. Rao

This paper presents a metamaterial based microstrip patch antenna with fractal geometry and defected ground to give improved Return loss, Directivity, Gain and Bandwidth.Two rectangular shapes have been cut from two edges of the ground.RT 5880LZ dielectric has been used as substrate along with a Meta-surface (MS) layer separated by air gap. The MS unit cell consists of two L-shaped patches on two corners and a C-shaped patch at the centre.Fractal Patch with rectangular notches on three sides has been used along with microstrip feedline to feed the antenna.This antenna gives a good impedance matching in the frequency range of 10.6 GHz - 11.3 GHz.Maximum return loss of 21.31 dB has been achieved and a gain of 6 dBi has been obtained at the operating frequency of 10.95 GHz. This efficient antenna model can be implemented for satellite communication in X-band.

本文提出了一种基于分形几何和缺陷接地的超材料微带贴片天线，以改善回波损耗、指向性、增益和带宽。从地面的两边剪出两个长方形。采用RT 5880LZ介质作为衬底，并采用气隙分隔元表面(MS)层。MS细胞由两个角上的两个l形斑块和中心的一个c形斑块组成。采用三面矩形缺口分形贴片与微带馈线配合馈线进行天线馈电。该天线在10.6 GHz - 11.3 GHz频率范围内阻抗匹配良好。在10.95 GHz工作频率下，最大回波损耗为21.31 dB，增益为6 dBi。这种高效的天线模型可用于x波段的卫星通信。

引用次数: 1

Human Following Robot using Kinect in Embedded Platform 基于Kinect的嵌入式人机跟踪机器人

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

Pub Date : 2022-05-06 DOI: 10.1109/PCEMS55161.2022.9807846

Sachin N. Kapgate, Pravin Sahu, M. Das, Deep Gupta

This paper presents an embedded robotic system using Kinect sensor technology that is utilized to detect an individual target and track its movement in the surrounding. The developed system integrates both the features of computer vision and embedded robotics simultaneously. In this paper, a skeleton-based tracking algorithm is used because tracking plays an important role in localization and mapping. This system uses a gesture-based recognition that relies on Microsoft Kinect XBox 360 instead of a standard touch-based control system that can be utilized as a stand-alone system or subsystem to integrate into a larger system. The Kinect sensor captures the 3-dimensional information of the surroundings and recognizes the human body by retrieving the depth information that does not require wearing any kind of intrusive sensors. Firstly, this robotic system follows an individual by detection of torso point that is required for steering and maintaining a fixed safe distance for localization and mapping and provides a robust and reliable system. The proposed system can be utilized in a wide variety of applications such as work assistants, luggage carrying carts, etc.

本文介绍了一种使用Kinect传感器技术的嵌入式机器人系统，该系统用于检测单个目标并跟踪其在周围环境中的运动。所开发的系统同时集成了计算机视觉和嵌入式机器人的特点。由于跟踪在定位和映射中起着重要的作用，本文采用了基于骨架的跟踪算法。该系统使用基于手势的识别技术，依赖于微软Kinect XBox 360，而不是标准的基于触摸的控制系统，可以作为一个独立的系统或子系统集成到一个更大的系统中。Kinect传感器捕捉周围环境的三维信息，并通过检索深度信息来识别人体，而不需要佩戴任何侵入式传感器。首先，该机器人系统通过检测个体转向所需的躯干点来跟踪个体，并保持固定的安全距离进行定位和绘图，提供了一个鲁棒可靠的系统。所提出的系统可用于各种各样的应用，如工作助理、行李搬运车等。

{"title":"Human Following Robot using Kinect in Embedded Platform","authors":"Sachin N. Kapgate, Pravin Sahu, M. Das, Deep Gupta","doi":"10.1109/PCEMS55161.2022.9807846","DOIUrl":"https://doi.org/10.1109/PCEMS55161.2022.9807846","url":null,"abstract":"This paper presents an embedded robotic system using Kinect sensor technology that is utilized to detect an individual target and track its movement in the surrounding. The developed system integrates both the features of computer vision and embedded robotics simultaneously. In this paper, a skeleton-based tracking algorithm is used because tracking plays an important role in localization and mapping. This system uses a gesture-based recognition that relies on Microsoft Kinect XBox 360 instead of a standard touch-based control system that can be utilized as a stand-alone system or subsystem to integrate into a larger system. The Kinect sensor captures the 3-dimensional information of the surroundings and recognizes the human body by retrieving the depth information that does not require wearing any kind of intrusive sensors. Firstly, this robotic system follows an individual by detection of torso point that is required for steering and maintaining a fixed safe distance for localization and mapping and provides a robust and reliable system. The proposed system can be utilized in a wide variety of applications such as work assistants, luggage carrying carts, etc.","PeriodicalId":248874,"journal":{"name":"2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128669735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

An Acoustic Analysis of Speech for Emotion Recognition using Deep Learning 基于深度学习的情感识别语音声学分析

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

Pub Date : 2022-05-06 DOI: 10.1109/PCEMS55161.2022.9808012

Aman Verma, Raghav Agrawal, Priyanka Singh, N. Ansari

Speech emotion recognition has shown several advancements as a result of advancements in Deep Learning algorithms. These algorithms can easily extract the features from the data and learn to recognize patterns from them. Although these algorithms can successfully recognize emotions, their efficiency is often argued. The main objective of this paper is to efficiently classify the emotional state of a person from speech signals using traditional machine learning and deep learning techniques and to present a comparative analysis. We have considered eight different types of emotions, and have analyzed them in the following two ways: First, by considering the male and female emotions combinedly (gender-neutral) where they are classified into eight classes, and second, separately for the male and female emotions (gender-based) for a total of 16 classes. We have performed experimentation and have tested several architectures like K-Nearest Neighbor (KNN), Multilayer Perceptron (MLP), One Dimensional Convolutional Neural Network + Long Short-Term Memory (ID CNN+LSTM) by efficiently tuning the hyperparameters to classify the emotional states. Best results are obtained with the ID CNN + LSTM model. We have obtained an accuracy of 87.4% for gender-neutral cases and 82.78% for gender-based cases. This model outperforms existing techniques.

由于深度学习算法的进步，语音情感识别已经取得了一些进展。这些算法可以很容易地从数据中提取特征并从中学习识别模式。虽然这些算法可以成功地识别情绪，但它们的效率经常受到争议。本文的主要目的是利用传统的机器学习和深度学习技术有效地从语音信号中分类人的情绪状态，并进行比较分析。我们考虑了8种不同类型的情绪，并通过以下两种方式进行了分析:首先，将男性和女性情绪合并考虑(性别中立)，将其分为8类;其次，将男性和女性情绪分开考虑(性别为基础)，共分为16类。我们已经进行了实验，并通过有效地调整超参数来对情绪状态进行分类，测试了k -最近邻(KNN)，多层感知器(MLP)，一维卷积神经网络+长短期记忆(ID CNN+LSTM)等几种架构。ID CNN + LSTM模型效果最好。我们在性别中立病例和基于性别的病例中获得了87.4%和82.78%的准确率。这个模型优于现有的技术。

{"title":"An Acoustic Analysis of Speech for Emotion Recognition using Deep Learning","authors":"Aman Verma, Raghav Agrawal, Priyanka Singh, N. Ansari","doi":"10.1109/PCEMS55161.2022.9808012","DOIUrl":"https://doi.org/10.1109/PCEMS55161.2022.9808012","url":null,"abstract":"Speech emotion recognition has shown several advancements as a result of advancements in Deep Learning algorithms. These algorithms can easily extract the features from the data and learn to recognize patterns from them. Although these algorithms can successfully recognize emotions, their efficiency is often argued. The main objective of this paper is to efficiently classify the emotional state of a person from speech signals using traditional machine learning and deep learning techniques and to present a comparative analysis. We have considered eight different types of emotions, and have analyzed them in the following two ways: First, by considering the male and female emotions combinedly (gender-neutral) where they are classified into eight classes, and second, separately for the male and female emotions (gender-based) for a total of 16 classes. We have performed experimentation and have tested several architectures like K-Nearest Neighbor (KNN), Multilayer Perceptron (MLP), One Dimensional Convolutional Neural Network + Long Short-Term Memory (ID CNN+LSTM) by efficiently tuning the hyperparameters to classify the emotional states. Best results are obtained with the ID CNN + LSTM model. We have obtained an accuracy of 87.4% for gender-neutral cases and 82.78% for gender-based cases. This model outperforms existing techniques.","PeriodicalId":248874,"journal":{"name":"2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125932509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2022 1st International Conference on the Paradigm Shifts in Communication, Embedded Systems, Machine Learning and Signal Processing (PCEMS)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀