首页 > 最新文献

2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)最新文献

英文 中文
Improved American Sign Language Recognition and Correction Using Inception Network, MediaPipe and PyEnchant 使用Inception网络,MediaPipe和PyEnchant改进美国手语识别和纠正
D. Agrawal, Harshvardhan Dave, Abhishek P. Shete, Spandana Pulimamidi, Snigdha Bhagat, Punitkumar Bhavsar
Sign language recognition through image processing presents challenges related to the requirement of real time applicability and high accuracy. Though previous work adopting methodologies from deep convolutional neural network architectures have shown to achieve good performance, they lack a consummate solution in terms of accuracy due to consideration of word based recognition. Recent development of Inception Network based architectures have shown promising classification accuracy with relatively less computational demand. Hence in this paper we propose a methodology that adopts Inception Network for the task of Sign Language Recognition. We considered the American Sign Language Recognition and Correction model. The correction and suggestion tools are implemented in the model to rectify any incorrect sign detection. The results from our approach achieves accuracy in the order of 99 percent.
基于图像处理的手语识别技术对实时性和准确性的要求提出了挑战。虽然以前的工作采用深度卷积神经网络架构的方法已经显示出良好的性能,但由于考虑到基于词的识别,它们在准确性方面缺乏完善的解决方案。基于Inception网络的体系结构的最新发展显示出有希望的分类精度和相对较少的计算需求。因此,本文提出了一种采用盗梦网络来完成手语识别任务的方法。我们考虑了美国手语识别和纠正模型。在模型中实现了纠正和建议工具,以纠正任何不正确的符号检测。我们的方法的结果达到了99%的精度。
{"title":"Improved American Sign Language Recognition and Correction Using Inception Network, MediaPipe and PyEnchant","authors":"D. Agrawal, Harshvardhan Dave, Abhishek P. Shete, Spandana Pulimamidi, Snigdha Bhagat, Punitkumar Bhavsar","doi":"10.1109/PCEMS58491.2023.10136115","DOIUrl":"https://doi.org/10.1109/PCEMS58491.2023.10136115","url":null,"abstract":"Sign language recognition through image processing presents challenges related to the requirement of real time applicability and high accuracy. Though previous work adopting methodologies from deep convolutional neural network architectures have shown to achieve good performance, they lack a consummate solution in terms of accuracy due to consideration of word based recognition. Recent development of Inception Network based architectures have shown promising classification accuracy with relatively less computational demand. Hence in this paper we propose a methodology that adopts Inception Network for the task of Sign Language Recognition. We considered the American Sign Language Recognition and Correction model. The correction and suggestion tools are implemented in the model to rectify any incorrect sign detection. The results from our approach achieves accuracy in the order of 99 percent.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126488257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recent Advancements in Applications of Artificial Intelligence and Machine Learning for 5G Technology: A Review 人工智能和机器学习在5G技术中的应用进展综述
Alekya Nyalapelli, Shubham Sharma, Pranjal Phadnis, Maithili Patil, A. Tandle
As the fifth generation (5G) of wireless communication rolls out worldwide, conceptualized use cases and disruptive industry solutions are being deployed to offer smooth, frictionless, and secure connectivity. The landscape of Artificial Intelligence (AI) and Machine Learning (ML) can be seen as potential drivers in the automation and optimization of network performances and management complexities. The shifting network behaviors and complicated modern applications present diverse network performance traffic, which can be exploited by service providers to deal with network demands and provide superior user experiences. The existing research can be divided into the following 5G research areas, which include network traffic, resource allocation, network slicing, mobility management, physical layer security, etc., to name a few. The primary objective of this paper is to provide a comprehensive perspective on the expanding diversity of viable ML-assisted solutions for tackling various 5G network-level issues. The paper concludes with an indepth investigation of the challenges and unexplored directions of future research pertaining to making 5G applications more reliable for future use cases.
随着第五代(5G)无线通信在全球范围内的推广,概念化的用例和颠覆性的行业解决方案正在被部署,以提供流畅、无摩擦和安全的连接。人工智能(AI)和机器学习(ML)的前景可以被视为网络性能和管理复杂性自动化和优化的潜在驱动因素。网络行为的变化和复杂的现代应用带来了多样化的网络性能流量,服务提供商可以利用这些流量来满足网络需求并提供卓越的用户体验。现有的研究可以分为以下几个5G研究领域,包括网络流量、资源分配、网络切片、移动性管理、物理层安全等。本文的主要目标是为解决各种5G网络级问题的可行ml辅助解决方案的不断扩大的多样性提供全面的视角。本文最后深入调查了未来研究的挑战和未探索的方向,这些挑战和方向与使5G应用在未来用例中更加可靠有关。
{"title":"Recent Advancements in Applications of Artificial Intelligence and Machine Learning for 5G Technology: A Review","authors":"Alekya Nyalapelli, Shubham Sharma, Pranjal Phadnis, Maithili Patil, A. Tandle","doi":"10.1109/PCEMS58491.2023.10136039","DOIUrl":"https://doi.org/10.1109/PCEMS58491.2023.10136039","url":null,"abstract":"As the fifth generation (5G) of wireless communication rolls out worldwide, conceptualized use cases and disruptive industry solutions are being deployed to offer smooth, frictionless, and secure connectivity. The landscape of Artificial Intelligence (AI) and Machine Learning (ML) can be seen as potential drivers in the automation and optimization of network performances and management complexities. The shifting network behaviors and complicated modern applications present diverse network performance traffic, which can be exploited by service providers to deal with network demands and provide superior user experiences. The existing research can be divided into the following 5G research areas, which include network traffic, resource allocation, network slicing, mobility management, physical layer security, etc., to name a few. The primary objective of this paper is to provide a comprehensive perspective on the expanding diversity of viable ML-assisted solutions for tackling various 5G network-level issues. The paper concludes with an indepth investigation of the challenges and unexplored directions of future research pertaining to making 5G applications more reliable for future use cases.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129885007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A TinyML Approach for Quantification of BOD and COD in Water 一种定量测定水中BOD和COD的简易方法
Sanket Soni, A. Khurshid, Anushree Mrugank Minase, Ashlesha Bonkinpelliwar
Water quality prediction is a crucial process before any consumption of water. Prediction and modeling methods are used for pollutants in water to deal with water pollution control. This work involves the use of a random forest learning algorithm to quantitate BOD and COD using parameter tuning to establish the importance of input variables. It uses minimal sensed quantitative parameters such as Temperature, pH, DO, and Conductivity along with categorical parameters. The trained model shows excellent efficiency compared to other models and is validated using the laboratory test results with a maximum error of 10%. It is computationally low-cost, requires minimal parameters, and is pruned to integrate and implement in an IoT hardware system, reducing the cost of expensive sensors.
水质预测是用水前的重要环节。对水体中的污染物采用预测和建模的方法进行水污染控制。这项工作涉及使用随机森林学习算法来量化BOD和COD,使用参数调整来确定输入变量的重要性。它使用最小的感测定量参数,如温度,pH值,DO和电导率以及分类参数。与其他模型相比,训练后的模型具有良好的效率,并使用实验室测试结果进行了验证,最大误差为10%。它计算成本低,需要最小的参数,并且可以在物联网硬件系统中集成和实施,从而降低昂贵传感器的成本。
{"title":"A TinyML Approach for Quantification of BOD and COD in Water","authors":"Sanket Soni, A. Khurshid, Anushree Mrugank Minase, Ashlesha Bonkinpelliwar","doi":"10.1109/PCEMS58491.2023.10136050","DOIUrl":"https://doi.org/10.1109/PCEMS58491.2023.10136050","url":null,"abstract":"Water quality prediction is a crucial process before any consumption of water. Prediction and modeling methods are used for pollutants in water to deal with water pollution control. This work involves the use of a random forest learning algorithm to quantitate BOD and COD using parameter tuning to establish the importance of input variables. It uses minimal sensed quantitative parameters such as Temperature, pH, DO, and Conductivity along with categorical parameters. The trained model shows excellent efficiency compared to other models and is validated using the laboratory test results with a maximum error of 10%. It is computationally low-cost, requires minimal parameters, and is pruned to integrate and implement in an IoT hardware system, reducing the cost of expensive sensors.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"2010 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125631361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How Atmospheric Attenuation affects the UAV Communication Network? 大气衰减如何影响无人机通信网络?
Praveen Pawar, Naveen Pawar, Aditya Trivedi
The benefits of 5G millimetre wave (mmWave) technology include high data throughput, very low latency, stable connectivity, and massive capacity. Furthermore, unmanned aerial vehicles (UAVs) are being used as flying base stations to provide a viable option for reliable and cost-effective wireless communication. The goal of this research is to integrate a UAVassisted wireless network with a 5G mmWave communication system and assess coverage performance. The signal between the UAV and the user is heavily distorted by tropical atmospheric parameters such as rain, trees, and fog. This paper investigated the effect of attenuation on user coverage performance in the mmWave frequency bands of 28 GHz and 60 GHz (recommended by the International Telecommunication Union (ITU) for use in 5G cellular services). First, various factors such as rain rate, frequency, and foliage depth are used to calculate the attenuation of rain and foliage. Next, the suggested system model is used to analyse the likelihood of ground users’ downlink coverage in terms of total attenuation, channel gain, channel noise, and interference. However, the impacts of rain and foliage attenuation in various tropical locales cannot be avoided, even given the small cell size of the system.
5G毫米波(mmWave)技术的优势包括高数据吞吐量、极低延迟、稳定的连接和巨大的容量。此外,无人驾驶飞行器(uav)正被用作飞行基站,为可靠和具有成本效益的无线通信提供可行的选择。这项研究的目标是将无人机辅助无线网络与5G毫米波通信系统集成,并评估覆盖性能。无人机和用户之间的信号被热带大气参数如雨、树木和雾严重扭曲。本文研究了28 GHz和60 GHz毫米波频段(国际电信联盟(ITU)推荐用于5G蜂窝业务)衰减对用户覆盖性能的影响。首先,利用降雨率、频率、叶深等因素计算雨叶衰减;接下来,使用建议的系统模型来分析地面用户的下行链路覆盖的可能性,包括总衰减、信道增益、信道噪声和干扰。然而,即使考虑到系统的小单元尺寸,雨和树叶衰减对热带地区的影响也是不可避免的。
{"title":"How Atmospheric Attenuation affects the UAV Communication Network?","authors":"Praveen Pawar, Naveen Pawar, Aditya Trivedi","doi":"10.1109/PCEMS58491.2023.10136103","DOIUrl":"https://doi.org/10.1109/PCEMS58491.2023.10136103","url":null,"abstract":"The benefits of 5G millimetre wave (mmWave) technology include high data throughput, very low latency, stable connectivity, and massive capacity. Furthermore, unmanned aerial vehicles (UAVs) are being used as flying base stations to provide a viable option for reliable and cost-effective wireless communication. The goal of this research is to integrate a UAVassisted wireless network with a 5G mmWave communication system and assess coverage performance. The signal between the UAV and the user is heavily distorted by tropical atmospheric parameters such as rain, trees, and fog. This paper investigated the effect of attenuation on user coverage performance in the mmWave frequency bands of 28 GHz and 60 GHz (recommended by the International Telecommunication Union (ITU) for use in 5G cellular services). First, various factors such as rain rate, frequency, and foliage depth are used to calculate the attenuation of rain and foliage. Next, the suggested system model is used to analyse the likelihood of ground users’ downlink coverage in terms of total attenuation, channel gain, channel noise, and interference. However, the impacts of rain and foliage attenuation in various tropical locales cannot be avoided, even given the small cell size of the system.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"528 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129356474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative Study of Data Driven Methods for State of Charge Estimation of Li-ion Battery 锂离子电池电量状态估计数据驱动方法的比较研究
A. Sreekumar, R. Lekshmi
The battery storage system is an inevitable component in an electric vehicle. A precise state of charge is critically notable to evaluate the level of battery aging and ensure electric vehicle reliability and security. Recently, data driven methods have procured much popularity in many research fields. Data-driven methods are found to be a promising approach to provide a high accuracy solution to battery state of charge estimation problem. Research works focus on the state of charge estimation under a fixed operating temperature. Apparently, the maximum deliverable charge of a battery degrades with charge-discharge cycle and temperature. Thus, it is important to consider the operating temperature as one of the input features. This paper identifies the best data driven model for state of charge estimation among linear regression, random forest, CatBoost and XGBoost models. The data driven models are developed via training, validating, and testing stages deploying Li-ion (LG 18650HG2) battery data set under -10°C, 0°C, 10°C and 25°C. The best model is identified using performance indices. The results show the superior performance of XGBoost model under all temperatures and best performance at 25°C with mean absolute error of 0.68%, mean square error of 0.01% and root mean square error of 1.10%, mean absolute percentage error of 1.78%, and R2 value of 99.86%, compared to linear regression, random forest, CatBoost models.
电池储能系统是电动汽车不可缺少的组成部分。准确的充电状态对于评估电池老化程度、保证电动汽车的可靠性和安全性至关重要。近年来,数据驱动方法在许多研究领域得到了广泛的应用。数据驱动方法是一种很有前途的方法,可以为电池状态估计问题提供高精度的解决方案。研究工作主要集中在固定工作温度下的电荷状态估计。显然,电池的最大可交付电量随着充放电周期和温度的变化而降低。因此,将工作温度作为输入特性之一考虑是很重要的。本文从线性回归模型、随机森林模型、CatBoost模型和XGBoost模型中找出了电荷状态估计的最佳数据驱动模型。数据驱动模型通过训练,验证和测试阶段开发,部署锂离子(LG 18650HG2)电池数据集在-10°C, 0°C, 10°C和25°C。使用性能指标确定最佳模型。结果表明,与线性回归、随机森林、CatBoost模型相比,XGBoost模型在所有温度下均具有优越的性能,在25°C时性能最佳,平均绝对误差为0.68%,均方误差为0.01%,均方根误差为1.10%,平均绝对百分比误差为1.78%,R2值为99.86%。
{"title":"Comparative Study of Data Driven Methods for State of Charge Estimation of Li-ion Battery","authors":"A. Sreekumar, R. Lekshmi","doi":"10.1109/PCEMS58491.2023.10136079","DOIUrl":"https://doi.org/10.1109/PCEMS58491.2023.10136079","url":null,"abstract":"The battery storage system is an inevitable component in an electric vehicle. A precise state of charge is critically notable to evaluate the level of battery aging and ensure electric vehicle reliability and security. Recently, data driven methods have procured much popularity in many research fields. Data-driven methods are found to be a promising approach to provide a high accuracy solution to battery state of charge estimation problem. Research works focus on the state of charge estimation under a fixed operating temperature. Apparently, the maximum deliverable charge of a battery degrades with charge-discharge cycle and temperature. Thus, it is important to consider the operating temperature as one of the input features. This paper identifies the best data driven model for state of charge estimation among linear regression, random forest, CatBoost and XGBoost models. The data driven models are developed via training, validating, and testing stages deploying Li-ion (LG 18650HG2) battery data set under -10°C, 0°C, 10°C and 25°C. The best model is identified using performance indices. The results show the superior performance of XGBoost model under all temperatures and best performance at 25°C with mean absolute error of 0.68%, mean square error of 0.01% and root mean square error of 1.10%, mean absolute percentage error of 1.78%, and R2 value of 99.86%, compared to linear regression, random forest, CatBoost models.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127801823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Performance Analysis of MultiUser MIMO Indoor Visible Light Communication Systems 多用户MIMO室内可见光通信系统性能分析
P. Patel, Ajay Shanmukh Goteti
Visible light communication (VLC) is emerging as a solution for wireless communication systems to overcome the crowded radio spectrum. VLC makes use of a large, uncontrolled free spectrum. A light emitting diode is utilized as the transmitter, while Avalanche photodiodes or PIN photodiodes are employed as the receiver. In this paper, a basic visible light communication model is studied, then two indoor visible light communication models in which the sources are positioned on the roof’s ceiling and the receivers are placed five meters away from the sources towards the ground are designed. optiSystems software was used to design visible light communication models, which include a multiple input and multiple output (MIMO) model. The Line Of Sight (LOS) channel was used as a channel between the transmitter and receiver, along with on off keying (OOK) as a modulation technique. The system’s bit error rates (BER), quality factor, and eye diagram are evaluated at different data rates.
可见光通信(VLC)正在成为无线通信系统克服无线电频谱拥挤的一种解决方案。VLC利用了一个大的、不受控制的自由频谱。采用发光二极管作为发射端,雪崩光电二极管或PIN光电二极管作为接收端。本文首先研究了一个基本的可见光通信模型,然后设计了两种室内可见光通信模型,其中光源位于屋顶天花板上,接收器放置在距离光源5米的地方。利用optiSystems软件设计可见光通信模型,其中包括多输入多输出(MIMO)模型。视距(LOS)通道被用作发射器和接收器之间的通道,同时开关键控(OOK)作为调制技术。在不同的数据速率下对系统的误码率(BER)、质量因子和眼图进行了评估。
{"title":"Performance Analysis of MultiUser MIMO Indoor Visible Light Communication Systems","authors":"P. Patel, Ajay Shanmukh Goteti","doi":"10.1109/PCEMS58491.2023.10136075","DOIUrl":"https://doi.org/10.1109/PCEMS58491.2023.10136075","url":null,"abstract":"Visible light communication (VLC) is emerging as a solution for wireless communication systems to overcome the crowded radio spectrum. VLC makes use of a large, uncontrolled free spectrum. A light emitting diode is utilized as the transmitter, while Avalanche photodiodes or PIN photodiodes are employed as the receiver. In this paper, a basic visible light communication model is studied, then two indoor visible light communication models in which the sources are positioned on the roof’s ceiling and the receivers are placed five meters away from the sources towards the ground are designed. optiSystems software was used to design visible light communication models, which include a multiple input and multiple output (MIMO) model. The Line Of Sight (LOS) channel was used as a channel between the transmitter and receiver, along with on off keying (OOK) as a modulation technique. The system’s bit error rates (BER), quality factor, and eye diagram are evaluated at different data rates.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127887803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross Platform Mobile Application for solving Calculus 求解微积分的跨平台移动应用程序
Aveg Ajay Ganorkar, Anurag Khandelwal, Meeti Khendelwal, P. Selokar
Mathematics is widely used in engineering and education domains. The ubiquitous use of smartphones makes it easy for a user to digitally look for a math solution. Manually typing a mathematical expression is a cumbersome process; many a time, even basic mathematical expressions with a superscript and subscript get difficult to interpret. As a result, this paper describes a cross-platform mobile scanner application integrated with optical character recognition (OCR) to extract the math expression from a handwritten or printed image, as well as the final solution to the problem. The mathematical equation scanner proposed is an efficient and time effective solution for solving polynomial and calculus equations.
数学广泛应用于工程和教育领域。智能手机的普遍使用使得用户很容易通过数字方式寻找数学解决方案。手动输入数学表达式是一个繁琐的过程;很多时候,即使是带有上标和下标的基本数学表达式也很难解释。因此,本文介绍了一种集成光学字符识别(OCR)的跨平台移动扫描应用程序,用于从手写或印刷图像中提取数学表达式,以及该问题的最终解决方案。所提出的数学方程扫描器是求解多项式方程和微积分方程的一种高效、省时的方法。
{"title":"Cross Platform Mobile Application for solving Calculus","authors":"Aveg Ajay Ganorkar, Anurag Khandelwal, Meeti Khendelwal, P. Selokar","doi":"10.1109/PCEMS58491.2023.10136082","DOIUrl":"https://doi.org/10.1109/PCEMS58491.2023.10136082","url":null,"abstract":"Mathematics is widely used in engineering and education domains. The ubiquitous use of smartphones makes it easy for a user to digitally look for a math solution. Manually typing a mathematical expression is a cumbersome process; many a time, even basic mathematical expressions with a superscript and subscript get difficult to interpret. As a result, this paper describes a cross-platform mobile scanner application integrated with optical character recognition (OCR) to extract the math expression from a handwritten or printed image, as well as the final solution to the problem. The mathematical equation scanner proposed is an efficient and time effective solution for solving polynomial and calculus equations.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132508301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computer-aided tissue characterization for detection of thyroid cancer using multi-wavelength photoacoustic imaging. 使用多波长光声成像检测甲状腺癌的计算机辅助组织表征。
R. Tholkappian, S. Sinha, B. Chinni, N. Rao, V. Dogra
Photoacoustic Imaging(PAI) is an emerging soft tissue imaging system that can be potentially used for the detection of thyroid cancer. Computer-Aided diagnosis tools help further enhance the detection process by assisting the radiologist in the elucidation of medical data. This study aimed to classify the malignant and non-malignant thyroid tissue using different machine learning algorithms applied to the multi-wavelength PA data obtained, generated by the excised thyroid specimens from actual thyroid cancer patients. An exhaustive comparative analysis among the performances of three machine learning algorithms, random forest, support vector machine, and artificial neural network was performed for classifying benign vs malignant thyroid as well as non-malignant vs malignant thyroid. While the random forest algorithm efficiently classified benign vs malignant thyroid with the highest accuracy than the other two algorithms, the support vector machine outperformed the other two algorithms in classifying non-malignant vs malignant with the highest specificity, the area under the receiver operating characteristics, and accuracy. This study shows that multiwavelength PA data can be used with suitable machine algorithms for efficient thyroid cancer detection.
光声成像(PAI)是一种新兴的软组织成像系统,可以潜在地用于甲状腺癌的检测。计算机辅助诊断工具通过协助放射科医生阐明医疗数据,进一步提高了检测过程。本研究旨在使用不同的机器学习算法对实际甲状腺癌患者切除甲状腺标本所获得的多波长PA数据进行分类。对随机森林、支持向量机、人工神经网络三种机器学习算法在甲状腺良性与恶性、非恶性与恶性分类中的性能进行了详尽的比较分析。随机森林算法对良性甲状腺和恶性甲状腺的分类效率最高,准确率高于其他两种算法,而支持向量机在非恶性甲状腺分类方面的特异性、接受者操作特征下的面积和准确率均高于其他两种算法。本研究表明,多波长PA数据可以与合适的机器算法一起用于有效的甲状腺癌检测。
{"title":"Computer-aided tissue characterization for detection of thyroid cancer using multi-wavelength photoacoustic imaging.","authors":"R. Tholkappian, S. Sinha, B. Chinni, N. Rao, V. Dogra","doi":"10.1109/PCEMS58491.2023.10136084","DOIUrl":"https://doi.org/10.1109/PCEMS58491.2023.10136084","url":null,"abstract":"Photoacoustic Imaging(PAI) is an emerging soft tissue imaging system that can be potentially used for the detection of thyroid cancer. Computer-Aided diagnosis tools help further enhance the detection process by assisting the radiologist in the elucidation of medical data. This study aimed to classify the malignant and non-malignant thyroid tissue using different machine learning algorithms applied to the multi-wavelength PA data obtained, generated by the excised thyroid specimens from actual thyroid cancer patients. An exhaustive comparative analysis among the performances of three machine learning algorithms, random forest, support vector machine, and artificial neural network was performed for classifying benign vs malignant thyroid as well as non-malignant vs malignant thyroid. While the random forest algorithm efficiently classified benign vs malignant thyroid with the highest accuracy than the other two algorithms, the support vector machine outperformed the other two algorithms in classifying non-malignant vs malignant with the highest specificity, the area under the receiver operating characteristics, and accuracy. This study shows that multiwavelength PA data can be used with suitable machine algorithms for efficient thyroid cancer detection.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131929873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classifying Human Activities using CNN and ConvLSTM in Video Sequences 利用CNN和ConvLSTM对视频序列中的人类活动进行分类
Reema Gera, Kalyan Ram Ambati, Pallavi G. Chakole, Naveen Cheggoju, V. Kamble, V. Satpute
Video surveillance plays an important role to analyze any anomaly activity in the given premises. However, cameras can only capture the video information but cannot determine the type of activity on its own. Therefore, such systems require regular human intervention and monitoring. This requires a lot of time and manual efforts. This calls for the need of automatic human activity recognition (HAR) system. This is possible using latest technologies like computer vision and deep learning based systems. Recognizing human activities in videos is a challenging task in computer vision. The main function of intelligent video systems is to automatically identify and tag the actions performed by people in video sequences accurately. The objective of this research is to develop a model that can accurately recognize and classify human activities from video footage. The information captured by the cameras i.e., videos can be used to determine the type of activity using deep learning based networks. Such a network should be capable of classifying the videos using the available spatial and temporal information. In this paper, a framework is proposed where the data is pre-processed initially to reject redundant information. This data is fed then into deep network to predict the event. In this paper, for HAR, two different network models are presented based on the size of the sequence of frames. One network takes in just the most significant frame and the other uses a longer sequence of frames for predicting the behavior as a time domain parameter.
视频监控对于分析给定场所的异常活动起着重要的作用。然而,摄像机只能捕捉视频信息,而不能自行确定活动的类型。因此,这种系统需要定期的人为干预和监测。这需要大量的时间和手工工作。这就需要人类活动自动识别(HAR)系统。使用计算机视觉和基于深度学习的系统等最新技术,这是可能的。在计算机视觉中,识别视频中的人类活动是一项具有挑战性的任务。智能视频系统的主要功能是对视频序列中人的动作进行准确的自动识别和标记。本研究的目的是开发一种能够从视频片段中准确识别和分类人类活动的模型。摄像头捕捉到的信息,比如视频,可以通过基于深度学习的网络来确定活动的类型。这种网络应该能够利用现有的空间和时间信息对视频进行分类。本文提出了一种对数据进行初步预处理以剔除冗余信息的框架。然后将这些数据输入深度网络来预测事件。在本文中,基于帧序列的大小提出了两种不同的HAR网络模型。一个网络只接收最重要的帧,另一个使用更长的帧序列作为时域参数来预测行为。
{"title":"Classifying Human Activities using CNN and ConvLSTM in Video Sequences","authors":"Reema Gera, Kalyan Ram Ambati, Pallavi G. Chakole, Naveen Cheggoju, V. Kamble, V. Satpute","doi":"10.1109/PCEMS58491.2023.10136043","DOIUrl":"https://doi.org/10.1109/PCEMS58491.2023.10136043","url":null,"abstract":"Video surveillance plays an important role to analyze any anomaly activity in the given premises. However, cameras can only capture the video information but cannot determine the type of activity on its own. Therefore, such systems require regular human intervention and monitoring. This requires a lot of time and manual efforts. This calls for the need of automatic human activity recognition (HAR) system. This is possible using latest technologies like computer vision and deep learning based systems. Recognizing human activities in videos is a challenging task in computer vision. The main function of intelligent video systems is to automatically identify and tag the actions performed by people in video sequences accurately. The objective of this research is to develop a model that can accurately recognize and classify human activities from video footage. The information captured by the cameras i.e., videos can be used to determine the type of activity using deep learning based networks. Such a network should be capable of classifying the videos using the available spatial and temporal information. In this paper, a framework is proposed where the data is pre-processed initially to reject redundant information. This data is fed then into deep network to predict the event. In this paper, for HAR, two different network models are presented based on the size of the sequence of frames. One network takes in just the most significant frame and the other uses a longer sequence of frames for predicting the behavior as a time domain parameter.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132628439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gait-Face Based Human Recognition From Distant Video 基于步态人脸的远距离视频人体识别
Gayatri Mudliar, Koushiki Nath, Yogesh Saini, Praveen Kumar
This paper presents a non-intrusive video-based identification system with a single camera view placed at 90° angle to the subject. The aim is to efficiently extract biometric features from a low-resolution video (acquired from a common CCTV camera) for recognizing individuals. The proposed method uses gait cues and face profile features for identification of individuals. A fusion rule is applied to these feature sets to obtain a new set of attributes. Thus, three different recognition models are developed using the gait, face, and fused feature sets. An ensemble technique is defined over the three classification models based on these sets of cues to identify an individual. This approach is experimentally validated on the CASIA-B dataset that achieves 99.33% identification accuracy.
本文提出了一种非侵入式的基于视频的识别系统,该系统采用单摄像机视角,与被摄对象呈90°角。目的是有效地从低分辨率视频(从普通闭路电视摄像机获取)中提取生物特征,用于识别个体。该方法使用步态线索和面部轮廓特征来识别个体。对这些特征集应用融合规则,得到新的属性集。因此,利用步态、面部和融合特征集开发了三种不同的识别模型。在三种分类模型的基础上,定义了一种基于这些线索集的集成技术来识别个体。该方法在CASIA-B数据集上进行了实验验证,识别准确率达到99.33%。
{"title":"Gait-Face Based Human Recognition From Distant Video","authors":"Gayatri Mudliar, Koushiki Nath, Yogesh Saini, Praveen Kumar","doi":"10.1109/PCEMS58491.2023.10136028","DOIUrl":"https://doi.org/10.1109/PCEMS58491.2023.10136028","url":null,"abstract":"This paper presents a non-intrusive video-based identification system with a single camera view placed at 90° angle to the subject. The aim is to efficiently extract biometric features from a low-resolution video (acquired from a common CCTV camera) for recognizing individuals. The proposed method uses gait cues and face profile features for identification of individuals. A fusion rule is applied to these feature sets to obtain a new set of attributes. Thus, three different recognition models are developed using the gait, face, and fused feature sets. An ensemble technique is defined over the three classification models based on these sets of cues to identify an individual. This approach is experimentally validated on the CASIA-B dataset that achieves 99.33% identification accuracy.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114301237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1