首页 > 最新文献

2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)最新文献

英文 中文
Hybrid Facial Expression Analysis Model using Quantum Distance-based Classifier and Classical Support Vector Machine 基于量子距离分类器和经典支持向量机的混合面部表情分析模型
Pub Date : 2023-05-04 DOI: 10.1109/ESDC56251.2023.10149860
K. Rengasamy, Piyush Joshi, Vvs Raveendra
Rapid advancements in image and video processing technologies are poised to create remarkable impacts on a wide range of industries. A significant challenge in these processing technologies resides in identifying the features fed for image classification algorithms. Though all classification algorithms could identify, extract and classify the features of a given image, their accuracy is directly proportional to the number of sample points taken from the image using a sampling technique. As the accuracy improves with a substantial number of sample points, the time consumed to process them looms large. These challenges beseech enormous computing power. Quantum computers avowed exceptional computing power is expected to bridge the growing demands. To address these challenges effectively, we have chosen a specific problem, Facial Expression Analysis, to explore in-depth and arrive at a purposeful approach to deliver the desired outcome. The purpose of this paper is two-pronged. Perform a comparative study of accuracy and performance of classical and quantum image processing algorithms in classical and quantum computers, respectively. Secondly, devise a novel hybrid model using a quantum distance-based classifier augmented with a classical linear support vector machine to overcome the limitations observed. Sample image features derived from the quantum classifier were used to train the linear classifier. The results were observed to be better relative to results from the classical distance-based classifier. Holistically, the novel hybrid model is observed as a promising solution for all image classification problems. Our future work will focus on sophisticated usage of a linear classification algorithm in quantum computing.
图像和视频处理技术的快速发展将对广泛的行业产生显著的影响。这些处理技术的一个重大挑战在于识别为图像分类算法提供的特征。尽管所有的分类算法都可以识别、提取和分类给定图像的特征,但它们的精度与使用采样技术从图像中提取的样本点数量成正比。随着样本点数量的增加,精度得到提高,处理它们所消耗的时间也随之增加。这些挑战需要巨大的计算能力。量子计算机宣称其卓越的计算能力有望满足日益增长的需求。为了有效地应对这些挑战,我们选择了一个特定的问题——面部表情分析,来深入探索,并得出一个有目的的方法来实现预期的结果。本文的目的有两个方面。分别在经典计算机和量子计算机上对经典和量子图像处理算法的精度和性能进行比较研究。其次,利用基于量子距离的分类器与经典线性支持向量机的增强,设计了一种新的混合模型,以克服所观察到的局限性。利用量子分类器得到的样本图像特征来训练线性分类器。结果被观察到相对于经典的基于距离的分类器的结果更好。总的来说,这种新的混合模型被认为是解决所有图像分类问题的一种很有前途的方法。我们未来的工作将集中在量子计算中线性分类算法的复杂使用上。
{"title":"Hybrid Facial Expression Analysis Model using Quantum Distance-based Classifier and Classical Support Vector Machine","authors":"K. Rengasamy, Piyush Joshi, Vvs Raveendra","doi":"10.1109/ESDC56251.2023.10149860","DOIUrl":"https://doi.org/10.1109/ESDC56251.2023.10149860","url":null,"abstract":"Rapid advancements in image and video processing technologies are poised to create remarkable impacts on a wide range of industries. A significant challenge in these processing technologies resides in identifying the features fed for image classification algorithms. Though all classification algorithms could identify, extract and classify the features of a given image, their accuracy is directly proportional to the number of sample points taken from the image using a sampling technique. As the accuracy improves with a substantial number of sample points, the time consumed to process them looms large. These challenges beseech enormous computing power. Quantum computers avowed exceptional computing power is expected to bridge the growing demands. To address these challenges effectively, we have chosen a specific problem, Facial Expression Analysis, to explore in-depth and arrive at a purposeful approach to deliver the desired outcome. The purpose of this paper is two-pronged. Perform a comparative study of accuracy and performance of classical and quantum image processing algorithms in classical and quantum computers, respectively. Secondly, devise a novel hybrid model using a quantum distance-based classifier augmented with a classical linear support vector machine to overcome the limitations observed. Sample image features derived from the quantum classifier were used to train the linear classifier. The results were observed to be better relative to results from the classical distance-based classifier. Holistically, the novel hybrid model is observed as a promising solution for all image classification problems. Our future work will focus on sophisticated usage of a linear classification algorithm in quantum computing.","PeriodicalId":354855,"journal":{"name":"2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)","volume":"127 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121737178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Traffic Congestion and Emergency Vehicle Responsive Traffic Signal Control in Resource Constrained Environment 资源约束环境下交通拥挤与应急车辆响应交通信号控制
Pub Date : 2023-05-04 DOI: 10.1109/ESDC56251.2023.10149873
Sagar Bapodara, Shyam Mesvani, Manish Chaturvedi, Pruthvish Rajput
With the increase in the number of vehicles on the road, traffic congestion has become a major problem in metropolitan areas. Generally, the traffic flow through a junction is controlled using static traffic lights which are unable to adapt to the real-time traffic condition at a junction and do not prioritize the movement of certain types of vehicles. Emergency vehicles (e.g. ambulance, fire, police, etc.) play a crucial role in all life-threatening situations, and ensuring their movement through a congested junction with minimal time delay is essential.In this paper, we propose an adaptive and efficient traffic signal control system for less-lane disciplined heterogeneous (mixed) traffic that can be easily integrated with the existing static traffic lights in a resource-constrained environment. A sound sensor-based emergency vehicle detection system is designed that accurately detects and classifies emergency vehicles by identifying their unique siren sound. The traffic camera data are processed in real-time to compute the PCU counts at every approach of a junction and to detect emergency vehicles that do not generate siren sounds. The experiment results show 100% accuracy in emergency vehicle detection, more than 95% accuracy in the emergency vehicle classification, and 65% accuracy in vehicle classification and PCU count. We also design a queuing theory-based cost function that considers the prevailing traffic condition and the presence of priority vehicle(s) at a junction. The cost function can be used to adapt the green phase of different approaches at a junction to improve the vehicle flow through the junction while minimizing the delay for the emergency vehicles.
随着道路上车辆数量的增加,交通拥堵已成为大都市地区的一个主要问题。一般情况下,通过路口的交通流量是由静态交通灯控制的,它不能适应路口的实时交通状况,也不能优先考虑某些类型的车辆的移动。紧急车辆(如救护车、消防车、警车等)在所有危及生命的情况下都发挥着至关重要的作用,确保它们在最短时间内通过拥挤的交叉路口是至关重要的。在本文中,我们提出了一种自适应和高效的交通信号控制系统,用于较少车道的异构(混合)交通,该系统可以在资源受限的环境中轻松地与现有的静态交通信号灯集成。设计了一种基于声音传感器的应急车辆检测系统,通过识别应急车辆独特的警报器声音,对应急车辆进行准确的检测和分类。交通摄像头的数据被实时处理,以计算每个路口的PCU计数,并检测没有发出警报器声音的紧急车辆。实验结果表明,该方法对应急车辆检测的准确率达到100%,对应急车辆分类的准确率达到95%以上,对车辆分类和PCU计数的准确率达到65%以上。我们还设计了一个基于排队理论的成本函数,该函数考虑了当前的交通状况和路口优先车辆的存在。成本函数可用于调整交叉口不同路径的绿灯相位,以改善交叉口的车流,同时最大限度地减少应急车辆的延误。
{"title":"Traffic Congestion and Emergency Vehicle Responsive Traffic Signal Control in Resource Constrained Environment","authors":"Sagar Bapodara, Shyam Mesvani, Manish Chaturvedi, Pruthvish Rajput","doi":"10.1109/ESDC56251.2023.10149873","DOIUrl":"https://doi.org/10.1109/ESDC56251.2023.10149873","url":null,"abstract":"With the increase in the number of vehicles on the road, traffic congestion has become a major problem in metropolitan areas. Generally, the traffic flow through a junction is controlled using static traffic lights which are unable to adapt to the real-time traffic condition at a junction and do not prioritize the movement of certain types of vehicles. Emergency vehicles (e.g. ambulance, fire, police, etc.) play a crucial role in all life-threatening situations, and ensuring their movement through a congested junction with minimal time delay is essential.In this paper, we propose an adaptive and efficient traffic signal control system for less-lane disciplined heterogeneous (mixed) traffic that can be easily integrated with the existing static traffic lights in a resource-constrained environment. A sound sensor-based emergency vehicle detection system is designed that accurately detects and classifies emergency vehicles by identifying their unique siren sound. The traffic camera data are processed in real-time to compute the PCU counts at every approach of a junction and to detect emergency vehicles that do not generate siren sounds. The experiment results show 100% accuracy in emergency vehicle detection, more than 95% accuracy in the emergency vehicle classification, and 65% accuracy in vehicle classification and PCU count. We also design a queuing theory-based cost function that considers the prevailing traffic condition and the presence of priority vehicle(s) at a junction. The cost function can be used to adapt the green phase of different approaches at a junction to improve the vehicle flow through the junction while minimizing the delay for the emergency vehicles.","PeriodicalId":354855,"journal":{"name":"2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132604786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Semantic Representations and Discriminative Features in Unsupervised Domain Adaptation 无监督域自适应中语义表征和判别特征的学习
Pub Date : 2023-05-04 DOI: 10.1109/ESDC56251.2023.10149872
Rushendra Sidibomma, R. Sanodiya
In domain adaptation, the goal is to train a neural network on the source domain and obtain a good accuracy on the target domain. In such a scenario, it is important to transfer the knowledge from the labelled source domain to the unlabelled target domain due to the expensive cost of manual labelling. Following the trail of works in the recent time, feature level alignment seems to be the most promising direction in unsupervised domain adaptation. In most of the recent works using this feature alignment, the semantic information present in the labelled source domain has not been exploited. Among the works that have tried to learn this semantic representations, the discriminative features have not been taken into consideration which results in lower accuracy on target domain. In this paper, we present a novel approach, joint discriminative and semantic transfer network (JDSTN) that not only aligns the semantic representations of source and target domain, but also enhances the discriminative features and thereby improving the accuracy significantly. This is achieved by using pseudo-labels to align the feature centroids of source and target domains while introducing losses that promote the learning of discriminative features.
在域自适应中,目标是在源域训练神经网络,在目标域获得较好的精度。在这种情况下,由于手工标记的成本昂贵,将知识从标记的源领域转移到未标记的目标领域是很重要的。从近年来的工作轨迹来看,特征级对齐似乎是无监督域自适应中最有前途的方向。在最近使用这种特征对齐的大多数工作中,标记源域中存在的语义信息尚未被利用。在尝试学习这种语义表示的工作中,没有考虑到区别特征,导致目标域上的准确率较低。在本文中,我们提出了一种新的方法——联合判别和语义转移网络(JDSTN),该方法不仅对齐了源域和目标域的语义表示,而且增强了判别特征,从而显著提高了准确率。这是通过使用伪标签来对齐源域和目标域的特征质心,同时引入促进判别特征学习的损失来实现的。
{"title":"Learning Semantic Representations and Discriminative Features in Unsupervised Domain Adaptation","authors":"Rushendra Sidibomma, R. Sanodiya","doi":"10.1109/ESDC56251.2023.10149872","DOIUrl":"https://doi.org/10.1109/ESDC56251.2023.10149872","url":null,"abstract":"In domain adaptation, the goal is to train a neural network on the source domain and obtain a good accuracy on the target domain. In such a scenario, it is important to transfer the knowledge from the labelled source domain to the unlabelled target domain due to the expensive cost of manual labelling. Following the trail of works in the recent time, feature level alignment seems to be the most promising direction in unsupervised domain adaptation. In most of the recent works using this feature alignment, the semantic information present in the labelled source domain has not been exploited. Among the works that have tried to learn this semantic representations, the discriminative features have not been taken into consideration which results in lower accuracy on target domain. In this paper, we present a novel approach, joint discriminative and semantic transfer network (JDSTN) that not only aligns the semantic representations of source and target domain, but also enhances the discriminative features and thereby improving the accuracy significantly. This is achieved by using pseudo-labels to align the feature centroids of source and target domains while introducing losses that promote the learning of discriminative features.","PeriodicalId":354855,"journal":{"name":"2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122687631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VLSI Architecture of Generalized Pooling for Hardware Acceleration of Convolutional Neural Networks 卷积神经网络硬件加速的广义池化VLSI架构
Pub Date : 2023-05-04 DOI: 10.1109/ESDC56251.2023.10149878
Akash Ther, Binit Kumar Pandit, Ayan Banerjee
Convolutional Neural Networks (CNNs) handle a massive variety of datasets with great accuracy for various image processing and computer vision applications. However, it comes at the cost of the requirement of large hardware resources, which are computational and energy extensive. There is a need for efficient hardware and algorithmic optimization for real-time CNN inference while deploying the CNN model. This paper proposes a novel VLSI architecture of generalized pooling operation for hardware acceleration of CNN inference in real-time. Generalized pooling operation adaptively downsamples the huge parameter set generated by the convolutional layer by generating weights as per the input features. It is capable of accommodating varying feature maps and preserves significant features, unlike other counterparts maximum and average pooling. In order to efficiently compute the output of the generalised pooling operation, the proposed hardware design makes use of the Newton-Raphson reciprocal approximation for division operations, a low number of comparators, and a high degree of parallelism. The proposed architecture is developed and tested for performance evaluation on Xilinx Vivado 2018.3, and the target device chosen is Zynq UltraScale + MPSoC ZCU104 Evaluation board.
卷积神经网络(cnn)以极高的精度处理各种图像处理和计算机视觉应用的大量数据集。然而,它的代价是需要大量的硬件资源,这些资源需要大量的计算和能源。在部署CNN模型时,需要对实时CNN推理进行有效的硬件和算法优化。本文提出了一种基于广义池化运算的VLSI架构,用于CNN推理的实时硬件加速。广义池化操作根据输入特征生成权值,自适应地对卷积层生成的庞大参数集进行下采样。它能够适应不同的特征映射,并保留重要的特征,不像其他对等的最大和平均池化。为了有效地计算广义池化操作的输出,所提出的硬件设计利用牛顿-拉夫森互反近似进行除法操作,较少的比较器数量和高度的并行性。在Xilinx Vivado 2018.3平台上开发并测试了所提出的架构,并选择了Zynq UltraScale + MPSoC ZCU104评估板作为目标器件。
{"title":"VLSI Architecture of Generalized Pooling for Hardware Acceleration of Convolutional Neural Networks","authors":"Akash Ther, Binit Kumar Pandit, Ayan Banerjee","doi":"10.1109/ESDC56251.2023.10149878","DOIUrl":"https://doi.org/10.1109/ESDC56251.2023.10149878","url":null,"abstract":"Convolutional Neural Networks (CNNs) handle a massive variety of datasets with great accuracy for various image processing and computer vision applications. However, it comes at the cost of the requirement of large hardware resources, which are computational and energy extensive. There is a need for efficient hardware and algorithmic optimization for real-time CNN inference while deploying the CNN model. This paper proposes a novel VLSI architecture of generalized pooling operation for hardware acceleration of CNN inference in real-time. Generalized pooling operation adaptively downsamples the huge parameter set generated by the convolutional layer by generating weights as per the input features. It is capable of accommodating varying feature maps and preserves significant features, unlike other counterparts maximum and average pooling. In order to efficiently compute the output of the generalised pooling operation, the proposed hardware design makes use of the Newton-Raphson reciprocal approximation for division operations, a low number of comparators, and a high degree of parallelism. The proposed architecture is developed and tested for performance evaluation on Xilinx Vivado 2018.3, and the target device chosen is Zynq UltraScale + MPSoC ZCU104 Evaluation board.","PeriodicalId":354855,"journal":{"name":"2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126892281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ML-based techniques for prediction of Ocean currents for underwater vehicles 基于ml的水下航行器洋流预测技术
Pub Date : 2023-05-04 DOI: 10.1109/ESDC56251.2023.10149859
Shaik Shakeera, V. Bala Naga Jyothi, H. Venkataraman
Dynamic ocean current in real-time plays a significant role for the precise navigation of underwater vehicles. Estimation and prediction of ocean currents with traditional methods such as Navier–Stokes equations, which are computationally very complex and also need huge historical ocean data for developing numerical models. Hence, in this paper Machine Learning, based on less complex and easily deployable regression methods is exercised to identify the best prediction model for ocean currents. Further, all the regression methods performed were compared with the R2 score, Mean Absolute Error (MAE) and Mean Square Error (MSE). Among all methods, the Decision tree regression-based ML method performed best with 84% accuracy with minimal error. Qualitative performance is studied using visualization of data correlation, heat maps are also generated and compared.
实时动态洋流对水下航行器的精确导航具有重要意义。利用传统方法(如Navier-Stokes方程)估算和预测洋流,这些方法计算非常复杂,并且需要大量的历史海洋数据来建立数值模型。因此,在本文中,机器学习是基于不太复杂和易于部署的回归方法来识别洋流的最佳预测模型。进一步,将所有回归方法与R2评分、平均绝对误差(MAE)和均方误差(MSE)进行比较。在所有方法中,基于决策树回归的ML方法表现最好,准确率为84%,误差最小。利用数据关联可视化研究定性性能,生成热图并进行比较。
{"title":"ML-based techniques for prediction of Ocean currents for underwater vehicles","authors":"Shaik Shakeera, V. Bala Naga Jyothi, H. Venkataraman","doi":"10.1109/ESDC56251.2023.10149859","DOIUrl":"https://doi.org/10.1109/ESDC56251.2023.10149859","url":null,"abstract":"Dynamic ocean current in real-time plays a significant role for the precise navigation of underwater vehicles. Estimation and prediction of ocean currents with traditional methods such as Navier–Stokes equations, which are computationally very complex and also need huge historical ocean data for developing numerical models. Hence, in this paper Machine Learning, based on less complex and easily deployable regression methods is exercised to identify the best prediction model for ocean currents. Further, all the regression methods performed were compared with the R2 score, Mean Absolute Error (MAE) and Mean Square Error (MSE). Among all methods, the Decision tree regression-based ML method performed best with 84% accuracy with minimal error. Qualitative performance is studied using visualization of data correlation, heat maps are also generated and compared.","PeriodicalId":354855,"journal":{"name":"2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123758469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review Paper on Contour Estimation Techniques in High-Resolution Automotive Radars 高分辨率汽车雷达轮廓估计技术综述
Pub Date : 2023-05-04 DOI: 10.1109/ESDC56251.2023.10149852
R. Mathew, Eesh Shekhar Gharat, Siddharth Hooda
By 2024, the market for automated driving systems is anticipated to reach $20 billion, growing at a 25.7 percent annual pace between 2016 and 2024. Contour estimation through High-Resolution radars is one of the key functionalities of these growing industries and many techniques have been used for it. KNN-DBSCAN, Generalized Hough Transform, and brute force approach are some of the techniques studied. Size of encountered Radar cross-section (RCS), dependency on heuristics, accuracy, and computational expensive are some of the parameters against which comparison of the various techniques is done. Although these parameters are studied in-depth, there are parameters like interference and contamination of RADAR that have not been studied extensively in the literature.
到2024年,自动驾驶系统市场预计将达到200亿美元,2016年至2024年期间的年增长率为25.7%。通过高分辨率雷达进行轮廓估计是这些新兴行业的关键功能之一,许多技术已用于此。研究了KNN-DBSCAN、广义霍夫变换和蛮力方法。遇到的雷达横截面(RCS)的大小、对启发式的依赖、准确性和计算成本是对各种技术进行比较的一些参数。虽然对这些参数进行了深入的研究,但雷达的干扰、污染等参数在文献中还没有得到广泛的研究。
{"title":"A Review Paper on Contour Estimation Techniques in High-Resolution Automotive Radars","authors":"R. Mathew, Eesh Shekhar Gharat, Siddharth Hooda","doi":"10.1109/ESDC56251.2023.10149852","DOIUrl":"https://doi.org/10.1109/ESDC56251.2023.10149852","url":null,"abstract":"By 2024, the market for automated driving systems is anticipated to reach $20 billion, growing at a 25.7 percent annual pace between 2016 and 2024. Contour estimation through High-Resolution radars is one of the key functionalities of these growing industries and many techniques have been used for it. KNN-DBSCAN, Generalized Hough Transform, and brute force approach are some of the techniques studied. Size of encountered Radar cross-section (RCS), dependency on heuristics, accuracy, and computational expensive are some of the parameters against which comparison of the various techniques is done. Although these parameters are studied in-depth, there are parameters like interference and contamination of RADAR that have not been studied extensively in the literature.","PeriodicalId":354855,"journal":{"name":"2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124696570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiclass Altitude and Rotation Invariant Object Detection Using Slicing Objectness Loss (SOL) For Images with Complex Background 基于分层目标损失(SOL)的复杂背景图像多类高度和旋转不变目标检测
Pub Date : 2023-05-04 DOI: 10.1109/ESDC56251.2023.10149850
Indrajit Kar, S. Mukhopadhyay
The difficulty of detecting and monitoring diverse types of solar panels from various elevations and complex backgrounds has not been investigated. The varying orientation and complex backgrounds lead to false positives and erroneous classifications. However, by employing our technique, we could precisely distinguish between three diverse types of solar panels in various challenging backgrounds. We propose a slicing objectness loss specifically to address model learning issues, false positives, and oriented object localization on complex backgrounds. We based our work on popular existing neural network architectures and designed them to adapt to areal object detection for complex backgrounds. To our knowledge in the multi-solar panel, detection has not been conducted previously due to a lack of data availability. We have designed the neural network to process the full-size image and extract multi-Altitude local and global features thus our models are all altitude invariant and can detect shapes of arbitrary orientation. To demonstrate Slicing loss function produces superior results, we present a comparison study between pre and post-application SOL. We show clear evidence that our slicing approach and slicing objectness loss (SOL) has a significant effect on multi-solar panel detection WPV (water heater Photovoltaic), FPV (farm type photovoltaic), and SPV (Single Photovoltaic). We have also shown our approach and custom loss works for other complex multi-object detection e.g., identifying enclosures, water, and fuel tanks from different altitudes.
对不同高度和复杂背景的不同类型太阳能电池板的检测和监测难度尚未进行研究。不同的定位和复杂的背景导致误报和错误分类。然而,通过使用我们的技术,我们可以在不同的挑战性背景下精确区分三种不同类型的太阳能电池板。我们提出了一种切片对象损失,专门用于解决复杂背景下的模型学习问题、误报问题和定向对象定位问题。我们的工作基于流行的现有神经网络架构,并设计它们以适应复杂背景的区域目标检测。据我们对多太阳能电池板的了解,由于缺乏数据可用性,以前没有进行过探测。我们设计了神经网络来处理全尺寸图像并提取多高度的局部和全局特征,因此我们的模型都是高度不变的,可以检测任意方向的形状。为了证明切片损失函数产生了更好的结果,我们对应用前和应用后的SOL进行了比较研究。我们展示了明确的证据,表明我们的切片方法和切片物体损失(SOL)对多太阳能电池板检测WPV(热水器光伏),FPV(农场光伏)和SPV(单光伏)有显著影响。我们还展示了我们的方法和自定义丢失工作,用于其他复杂的多目标检测,例如从不同高度识别外壳,水和油箱。
{"title":"Multiclass Altitude and Rotation Invariant Object Detection Using Slicing Objectness Loss (SOL) For Images with Complex Background","authors":"Indrajit Kar, S. Mukhopadhyay","doi":"10.1109/ESDC56251.2023.10149850","DOIUrl":"https://doi.org/10.1109/ESDC56251.2023.10149850","url":null,"abstract":"The difficulty of detecting and monitoring diverse types of solar panels from various elevations and complex backgrounds has not been investigated. The varying orientation and complex backgrounds lead to false positives and erroneous classifications. However, by employing our technique, we could precisely distinguish between three diverse types of solar panels in various challenging backgrounds. We propose a slicing objectness loss specifically to address model learning issues, false positives, and oriented object localization on complex backgrounds. We based our work on popular existing neural network architectures and designed them to adapt to areal object detection for complex backgrounds. To our knowledge in the multi-solar panel, detection has not been conducted previously due to a lack of data availability. We have designed the neural network to process the full-size image and extract multi-Altitude local and global features thus our models are all altitude invariant and can detect shapes of arbitrary orientation. To demonstrate Slicing loss function produces superior results, we present a comparison study between pre and post-application SOL. We show clear evidence that our slicing approach and slicing objectness loss (SOL) has a significant effect on multi-solar panel detection WPV (water heater Photovoltaic), FPV (farm type photovoltaic), and SPV (Single Photovoltaic). We have also shown our approach and custom loss works for other complex multi-object detection e.g., identifying enclosures, water, and fuel tanks from different altitudes.","PeriodicalId":354855,"journal":{"name":"2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)","volume":"55 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128354221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Battery Modelling and Performance Analysis of Time-Varying Load Using Simulink 基于Simulink的时变负载电池建模与性能分析
Pub Date : 2023-05-04 DOI: 10.1109/ESDC56251.2023.10149874
Mahesh Naggarapu, Shaik Shakeera, H. Venkataraman
In recent years, each area has emerged with autonomous capabilities such as vehicular navigation, aquaculture, and industrial appliances. Especially, in the field of vehicular transportation, power is the main constraint for autonomous operations. The battery is the main source of energy or power storage. Especially, rechargeable batteries are potentially utilized as energy storage systems due to their high energy density. However, battery modelling is an indispensable tool for designing a real-time battery management system (BMS) that estimates the run-life time of autonomous battery power systems. In this paper, an easy-to-use battery Simulink model with time-varying dynamic load has been designed as a tool for all autonomous vehicles to estimate the State of Charge (SOC). This proposed model comprises a Controlled-Voltage source in series with internal resistance and time-varying resistive load. The proposed model differs from the ideal model by 0.01 (1%) and mirrors the general behaviour of the ideal model. The performance analysis of both the ideal and proposed model is evaluated by Root Mean Square Error (RMSE) which must be less than 0.4 to accept the battery model. However, the proposed model achieved the RMSE value of 0.1 and estimates SOC which is widely acceptable.
近年来,每个领域都出现了自主能力,如车辆导航、水产养殖和工业设备。特别是在车辆运输领域,动力是自动驾驶的主要制约因素。电池是能量或电能的主要来源。特别是,可充电电池由于其高能量密度,有可能被用作储能系统。然而,电池建模是设计实时电池管理系统(BMS)不可缺少的工具,该系统可以估计自主电池动力系统的运行寿命。本文设计了一个易于使用的具有时变动态负载的电池Simulink模型,作为所有自动驾驶汽车估计充电状态(SOC)的工具。该模型由一个具有内阻和时变电阻负载的串联控压源组成。所提出的模型与理想模型相差0.01(1%),反映了理想模型的一般行为。对理想模型和建议模型的性能分析均采用均方根误差(RMSE)进行评估,RMSE必须小于0.4才能接受电池模型。然而,该模型实现了0.1的RMSE值,并估计了广泛接受的SOC。
{"title":"Battery Modelling and Performance Analysis of Time-Varying Load Using Simulink","authors":"Mahesh Naggarapu, Shaik Shakeera, H. Venkataraman","doi":"10.1109/ESDC56251.2023.10149874","DOIUrl":"https://doi.org/10.1109/ESDC56251.2023.10149874","url":null,"abstract":"In recent years, each area has emerged with autonomous capabilities such as vehicular navigation, aquaculture, and industrial appliances. Especially, in the field of vehicular transportation, power is the main constraint for autonomous operations. The battery is the main source of energy or power storage. Especially, rechargeable batteries are potentially utilized as energy storage systems due to their high energy density. However, battery modelling is an indispensable tool for designing a real-time battery management system (BMS) that estimates the run-life time of autonomous battery power systems. In this paper, an easy-to-use battery Simulink model with time-varying dynamic load has been designed as a tool for all autonomous vehicles to estimate the State of Charge (SOC). This proposed model comprises a Controlled-Voltage source in series with internal resistance and time-varying resistive load. The proposed model differs from the ideal model by 0.01 (1%) and mirrors the general behaviour of the ideal model. The performance analysis of both the ideal and proposed model is evaluated by Root Mean Square Error (RMSE) which must be less than 0.4 to accept the battery model. However, the proposed model achieved the RMSE value of 0.1 and estimates SOC which is widely acceptable.","PeriodicalId":354855,"journal":{"name":"2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)","volume":"347 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125775125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
UAV and UGV Assisted Path Planning for Sensor Data Collection in Precision Agriculture 无人机和UGV辅助的精准农业传感器数据采集路径规划
Pub Date : 2023-05-04 DOI: 10.1109/ESDC56251.2023.10149861
Himanshu Singh, Moirangthem Biken Singh, Harsh Pratik, A. Pratap
In recent years, automated (Intelligent) decision support systems have become prevalent in various smart city applications such as healthcare, transportation, energy management, and environmental monitoring. Internet of Things (IoT) and Unmanned Aerial Vehicle (UAV)-based smart sensing and actuation devices in agricultural events can change the sector from static and manual to dynamic and intelligent, resulting in increased production with minimal human efforts. In addition, soil measurements that are time-consuming can be collected using Unmanned Ground Vehicles (UGVs). However, to collect data efficiently from wireless sensors in agricultural fields, UAV and UGV need to follow an optimal path. Thus, in this paper, we formulate utility maximization problem using UAV and UGV by simultaneously minimizing energy consumption and maximizing data collection. To solve the formulated problem, we propose a modified Greedy Randomised Adaptive Search Procedure (GRASP) algorithm to predict an efficient path for UAV and UGV to collect data from the agricultural field. Moreover, the efficacy of the proposed algorithm is showcased theoretically and experimentally on real-world data and compared with other state-of-the-art methods.
近年来,自动化(智能)决策支持系统在各种智能城市应用中变得普遍,例如医疗保健、交通、能源管理和环境监测。农业活动中基于物联网(IoT)和无人机(UAV)的智能传感和驱动设备可以将该部门从静态和手动转变为动态和智能,从而以最少的人力增加产量。此外,耗时的土壤测量可以使用无人地面车辆(ugv)进行收集。然而,为了有效地从农业领域的无线传感器中收集数据,无人机和UGV需要遵循最优路径。因此,在本文中,我们通过同时最小化能耗和最大化数据收集来制定无人机和UGV的效用最大化问题。为了解决这个问题,我们提出了一种改进的贪婪随机自适应搜索程序(GRASP)算法来预测无人机和UGV从农田收集数据的有效路径。此外,该算法的有效性在理论和实验上都得到了验证,并与其他最新方法进行了比较。
{"title":"UAV and UGV Assisted Path Planning for Sensor Data Collection in Precision Agriculture","authors":"Himanshu Singh, Moirangthem Biken Singh, Harsh Pratik, A. Pratap","doi":"10.1109/ESDC56251.2023.10149861","DOIUrl":"https://doi.org/10.1109/ESDC56251.2023.10149861","url":null,"abstract":"In recent years, automated (Intelligent) decision support systems have become prevalent in various smart city applications such as healthcare, transportation, energy management, and environmental monitoring. Internet of Things (IoT) and Unmanned Aerial Vehicle (UAV)-based smart sensing and actuation devices in agricultural events can change the sector from static and manual to dynamic and intelligent, resulting in increased production with minimal human efforts. In addition, soil measurements that are time-consuming can be collected using Unmanned Ground Vehicles (UGVs). However, to collect data efficiently from wireless sensors in agricultural fields, UAV and UGV need to follow an optimal path. Thus, in this paper, we formulate utility maximization problem using UAV and UGV by simultaneously minimizing energy consumption and maximizing data collection. To solve the formulated problem, we propose a modified Greedy Randomised Adaptive Search Procedure (GRASP) algorithm to predict an efficient path for UAV and UGV to collect data from the agricultural field. Moreover, the efficacy of the proposed algorithm is showcased theoretically and experimentally on real-world data and compared with other state-of-the-art methods.","PeriodicalId":354855,"journal":{"name":"2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121815969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Pruning and Quantization: A Light Weight Multi-Sensor Pothole Detection System 修剪和量化的影响:一种轻型多传感器凹坑检测系统
Pub Date : 2023-05-04 DOI: 10.1109/ESDC56251.2023.10149868
Jaswanth Nidamanuri, Trisanu Bhar, H. Venkataraman
Pothole Detection has been a part of Advanced Driver Assistant Systems (ADAS) for a long time. To detect potholes, many techniques have been used. Deep learning-based methods have been particularly successful in this regard. However, the localization of the potholes accurately may not be possible only by having one modality of information. This work explores the multi-sensor information fusion (from Accelerometer and Gyroscope) to detect the potholes. Notably, most of the existing works proposed to make use of models such as Convolution Neural Networks, and other Attention models like Long Short-Term Memory (LSTM)’s, Gated Recurrent Units (GRUs), and Transformers. Despite having such proven architectures for complex and non-linear learning representations with attention units, still, the challenge of real-time deployments with optimized computing devices remains unaddressed. With the proposed approach, efficient deployments are possible on edge devices embedded in the vehicle providing a reliable ADAS solution for improved driver safety. The investigations and ablation study from the proposal focus on two-fold addressing the trade-off between model size and test accuracy. Significantly, the proposed hybrid architecture, the INN-former with quantization, achieved a size reduction by 16.12%, not compromising much with the maximum test accuracy reported at 96.12%. Similarly, pruning achieves a 1.115% size reduction with a minimal difference in test accuracy of 85.43% for the INN-former, and a 3.76% decrease in size while only a 4.95% decrease in test accuracy reported as 95.43% with the Attention model making use of GRU/ LSTM. Importantly, the proposed work discusses the design parameters for lightweight architectures investigating the pruning and quantization techniques that are not compromising the generalization capability of the models, which is highly essential for real-time deployments and validation.
洼坑检测作为高级驾驶辅助系统(ADAS)的一部分已经存在很长时间了。为了探测坑洼,使用了许多技术。基于深度学习的方法在这方面特别成功。然而,仅靠一种信息模式可能无法准确定位坑穴。本研究探索了多传感器信息融合(加速度计和陀螺仪)来检测坑穴。值得注意的是,大多数现有的工作都建议使用卷积神经网络等模型,以及其他注意模型,如长短期记忆(LSTM)、门控循环单元(gru)和变压器。尽管对于具有注意力单元的复杂和非线性学习表示有这样成熟的体系结构,但是使用优化计算设备进行实时部署的挑战仍然没有得到解决。通过提出的方法,可以在嵌入车辆的边缘设备上进行高效部署,提供可靠的ADAS解决方案,以提高驾驶员的安全性。该提案的调查和消融研究集中在两方面解决模型尺寸和测试精度之间的权衡。值得注意的是,所提出的混合架构,即带有量化的INN-former,实现了16.12%的尺寸减小,而最大测试精度为96.12%。同样,对于使用GRU/ LSTM的注意力模型,修剪实现了1.115%的尺寸减少,测试精度的最小差异为85.43%,而尺寸减少了3.76%,而测试精度仅下降了4.95%,为95.43%。重要的是,提出的工作讨论了轻量级架构的设计参数,研究了不会损害模型泛化能力的修剪和量化技术,这对实时部署和验证至关重要。
{"title":"Impact of Pruning and Quantization: A Light Weight Multi-Sensor Pothole Detection System","authors":"Jaswanth Nidamanuri, Trisanu Bhar, H. Venkataraman","doi":"10.1109/ESDC56251.2023.10149868","DOIUrl":"https://doi.org/10.1109/ESDC56251.2023.10149868","url":null,"abstract":"Pothole Detection has been a part of Advanced Driver Assistant Systems (ADAS) for a long time. To detect potholes, many techniques have been used. Deep learning-based methods have been particularly successful in this regard. However, the localization of the potholes accurately may not be possible only by having one modality of information. This work explores the multi-sensor information fusion (from Accelerometer and Gyroscope) to detect the potholes. Notably, most of the existing works proposed to make use of models such as Convolution Neural Networks, and other Attention models like Long Short-Term Memory (LSTM)’s, Gated Recurrent Units (GRUs), and Transformers. Despite having such proven architectures for complex and non-linear learning representations with attention units, still, the challenge of real-time deployments with optimized computing devices remains unaddressed. With the proposed approach, efficient deployments are possible on edge devices embedded in the vehicle providing a reliable ADAS solution for improved driver safety. The investigations and ablation study from the proposal focus on two-fold addressing the trade-off between model size and test accuracy. Significantly, the proposed hybrid architecture, the INN-former with quantization, achieved a size reduction by 16.12%, not compromising much with the maximum test accuracy reported at 96.12%. Similarly, pruning achieves a 1.115% size reduction with a minimal difference in test accuracy of 85.43% for the INN-former, and a 3.76% decrease in size while only a 4.95% decrease in test accuracy reported as 95.43% with the Attention model making use of GRU/ LSTM. Importantly, the proposed work discusses the design parameters for lightweight architectures investigating the pruning and quantization techniques that are not compromising the generalization capability of the models, which is highly essential for real-time deployments and validation.","PeriodicalId":354855,"journal":{"name":"2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)","volume":"167 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114837362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 11th International Symposium on Electronic Systems Devices and Computing (ESDC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1