首页 > 最新文献

2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)最新文献

英文 中文
Applications of Human Activity Recognition in Different Fields: A Review 人体活动识别在不同领域的应用综述
Yash Daga, S. Meena
Human activity recognition is a field where not enough work has been done yet, but its applications can be helpful in numerous domains. In this paper, we have discussed different methods by which implementation of human activity recognition can be carried away and compared those methods in terms of advantages, performance, accuracy, techniques, datasets, and limitations. We have also discussed numerous domains where this can be helpful such as healthcare, security, and augmented reality, along with the challenges and the type of method used.
人类活动识别是一个尚未完成足够工作的领域,但它的应用可以在许多领域提供帮助。在本文中,我们讨论了实现人类活动识别的不同方法,并从优势、性能、准确性、技术、数据集和局限性等方面对这些方法进行了比较。我们还讨论了许多可以提供帮助的领域,如医疗保健、安全和增强现实,以及所面临的挑战和使用的方法类型。
{"title":"Applications of Human Activity Recognition in Different Fields: A Review","authors":"Yash Daga, S. Meena","doi":"10.1109/UPCON56432.2022.9986408","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986408","url":null,"abstract":"Human activity recognition is a field where not enough work has been done yet, but its applications can be helpful in numerous domains. In this paper, we have discussed different methods by which implementation of human activity recognition can be carried away and compared those methods in terms of advantages, performance, accuracy, techniques, datasets, and limitations. We have also discussed numerous domains where this can be helpful such as healthcare, security, and augmented reality, along with the challenges and the type of method used.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122666617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Branch Line Coupler Based on Substrate Integrated Waveguide Technology 基于衬底集成波导技术的分支线耦合器
Brij Kumar Bharti, A. Yadav
This paper presents a 3-dB Quadrature Branch Line Coupler (BLC) using substrate integrated waveguide (SIW) technology. The proposed design is a combination of equal width SIW lines. The structure is having fixed-width lines therefore, can be incorporated into the SIW based microwave circuits. The proposed coupler is designed using derived design equations and simulated at 15 GHz, simulation results show that the reflection coefficients of all the ports are better than 28 dB around the operating frequency, and a good isolation characteristic is achieved which is below 25 dB between the two input ports. At the operating band, two output signals have a phase difference value of $90^{circ}pm 1.5^{circ}$. The insertion loss at centre frequency is 0.84 dB. The proposed structure can be used in butler matrix, mono-pulse comparator circuits, and antenna feeding networks.
提出了一种采用衬底集成波导技术的3db正交支路耦合器(BLC)。所提出的设计是等宽SIW线的组合。因此,该结构具有固定宽度的线路,可以集成到基于SIW的微波电路中。利用推导出的设计方程对该耦合器进行了设计,并在15ghz下进行了仿真,仿真结果表明,在工作频率附近,所有端口的反射系数都优于28 dB,并且实现了良好的隔离特性,两个输入端口之间的隔离小于25 dB。在工作频带,两个输出信号的相位差值为$90^{circ}pm 1.5^{circ}$。中心频率处的插入损耗为0.84 dB。该结构可用于管家矩阵、单脉冲比较器电路和天线馈电网络。
{"title":"Branch Line Coupler Based on Substrate Integrated Waveguide Technology","authors":"Brij Kumar Bharti, A. Yadav","doi":"10.1109/UPCON56432.2022.9986404","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986404","url":null,"abstract":"This paper presents a 3-dB Quadrature Branch Line Coupler (BLC) using substrate integrated waveguide (SIW) technology. The proposed design is a combination of equal width SIW lines. The structure is having fixed-width lines therefore, can be incorporated into the SIW based microwave circuits. The proposed coupler is designed using derived design equations and simulated at 15 GHz, simulation results show that the reflection coefficients of all the ports are better than 28 dB around the operating frequency, and a good isolation characteristic is achieved which is below 25 dB between the two input ports. At the operating band, two output signals have a phase difference value of $90^{circ}pm 1.5^{circ}$. The insertion loss at centre frequency is 0.84 dB. The proposed structure can be used in butler matrix, mono-pulse comparator circuits, and antenna feeding networks.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"450 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125071999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Energy Efficient Hardware Implementation of 2-D Convolution for Convolutional Neural Network 卷积神经网络二维卷积的节能硬件实现
S. K. Sharma, Anu Gupta, K. Raju
Over the last year, Deep neural networks (DNN) have been significantly accepted for computer vision applications because of high classification accuracy and versatility. Convolutional Neural Network (CNN) is one of the most popular architectures of DNN which is widely adopted for image, speech and video recognition. Extensive computation and large memory requirement of CNN s poses the bottleneck on its application. Field Programmable Gate Arrays (FPGAs) are considered to be suitable hardware platforms for deployment of CNNs with low power requirements. This paper focus on the design and implementation of hardware accelerator to perform the convolution product (matrix-matrix multiplication. We have used two optimization techniques to achieve energy efficiency. First, dataflow of the convolution phase is rescheduled to reduce the undesired on-chip memory accesses. Further, efficiency is enhanced by reducing the internal parallelism of structure as much as possible. Our architecture is implemented on the Xilinx ZCU104 evaluation board. The implemented design attains 98.1 GOPS/Joule and 32.77 GOPS/Joule for 8-bit and 16-bit data width respectively.
在过去的一年中,深度神经网络(DNN)因其高分类精度和多功能性而被广泛应用于计算机视觉应用。卷积神经网络(CNN)是深度神经网络中最流行的架构之一,广泛应用于图像、语音和视频识别。CNN的计算量大、内存需求大是其应用的瓶颈。现场可编程门阵列(fpga)被认为是部署低功耗cnn的合适硬件平台。本文重点研究了实现卷积积(矩阵-矩阵乘法)的硬件加速器的设计与实现。我们使用了两种优化技术来实现能源效率。首先,对卷积阶段的数据流进行重新调度,以减少不必要的片上存储器访问。此外,通过尽可能减少结构的内部平行度来提高效率。我们的架构是在赛灵思ZCU104评估板上实现的。实现的设计在8位和16位数据宽度下分别达到98.1 GOPS/焦耳和32.77 GOPS/焦耳。
{"title":"Energy Efficient Hardware Implementation of 2-D Convolution for Convolutional Neural Network","authors":"S. K. Sharma, Anu Gupta, K. Raju","doi":"10.1109/UPCON56432.2022.9986483","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986483","url":null,"abstract":"Over the last year, Deep neural networks (DNN) have been significantly accepted for computer vision applications because of high classification accuracy and versatility. Convolutional Neural Network (CNN) is one of the most popular architectures of DNN which is widely adopted for image, speech and video recognition. Extensive computation and large memory requirement of CNN s poses the bottleneck on its application. Field Programmable Gate Arrays (FPGAs) are considered to be suitable hardware platforms for deployment of CNNs with low power requirements. This paper focus on the design and implementation of hardware accelerator to perform the convolution product (matrix-matrix multiplication. We have used two optimization techniques to achieve energy efficiency. First, dataflow of the convolution phase is rescheduled to reduce the undesired on-chip memory accesses. Further, efficiency is enhanced by reducing the internal parallelism of structure as much as possible. Our architecture is implemented on the Xilinx ZCU104 evaluation board. The implemented design attains 98.1 GOPS/Joule and 32.77 GOPS/Joule for 8-bit and 16-bit data width respectively.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123390029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stock Market Prediction During COVID Using Stacked LSTM 基于堆叠LSTM的COVID期间股市预测
Ananya Singh, Swati Jain
In the field of computation, the art of predicting the stock market has always been a tough nut to crack for researchers. This is because stock prices are highly influential values. The prices depend on many factors, ranging from physical to physiological, rational and irrational, from geopolitical stability to the sentiments of the investors – all play a crucial role. Investors anticipate market conditions in the future for a successful investment. Hence considering the past stock prices as an embodiment of the factors mentioned above, we propose a stacked long-short-term-memory (LSTM) model to predict the closing index of stock prices during this highly uncertain pandemic period using root mean square error (RSME) as the performance indicator. The model is optimized to improve the prediction accuracy in order to achieve high performance stock forecasting. The dataset considered is from NIFTY 50 scaling across four sectors, namely – auto, bank, healthcare and metal from a duration of 30th January 2020 to 31st March 2022. This paper aims to consider the historical data to analyze future patterns and insights.
在计算领域,预测股票市场的艺术一直是研究人员难以攻克的难题。这是因为股价是极具影响力的价值。价格取决于许多因素,从生理因素到生理因素,从理性因素到非理性因素,从地缘政治稳定到投资者情绪,这些因素都起着至关重要的作用。投资者为成功投资而预测未来的市场状况。因此,考虑到过去的股票价格是上述因素的体现,我们提出了一个堆叠长短期记忆(LSTM)模型,以均方根误差(RSME)作为绩效指标来预测这一高度不确定的大流行期间的股票价格收盘指数。对模型进行了优化,提高了预测精度,实现了高效的股票预测。所考虑的数据集来自NIFTY 50在2020年1月30日至2022年3月31日期间跨越四个行业,即汽车、银行、医疗保健和金属。本文旨在考虑历史数据来分析未来的模式和见解。
{"title":"Stock Market Prediction During COVID Using Stacked LSTM","authors":"Ananya Singh, Swati Jain","doi":"10.1109/UPCON56432.2022.9986445","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986445","url":null,"abstract":"In the field of computation, the art of predicting the stock market has always been a tough nut to crack for researchers. This is because stock prices are highly influential values. The prices depend on many factors, ranging from physical to physiological, rational and irrational, from geopolitical stability to the sentiments of the investors – all play a crucial role. Investors anticipate market conditions in the future for a successful investment. Hence considering the past stock prices as an embodiment of the factors mentioned above, we propose a stacked long-short-term-memory (LSTM) model to predict the closing index of stock prices during this highly uncertain pandemic period using root mean square error (RSME) as the performance indicator. The model is optimized to improve the prediction accuracy in order to achieve high performance stock forecasting. The dataset considered is from NIFTY 50 scaling across four sectors, namely – auto, bank, healthcare and metal from a duration of 30th January 2020 to 31st March 2022. This paper aims to consider the historical data to analyze future patterns and insights.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125501014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic S-box with Reversible Gates for both Classical and Quantum Computer 经典和量子计算机的可逆门动态s盒
Anindita Sarkar, S. R. Chatterjee, M. Chakraborty
Reversible logic gates are universally adopted to replace the classical gates in various circuits to avail the advantages of power reduction and speed maximization. The application of reversible gates includes various fields like Quantum computing, Nano technology, low power CMOS, optical computing etc. Reversible gates are the key components for the quantum circuits. Here reversible gates are used to design a dynamic light weight S-box. The S-box is the key circuit component for cryptographic algorithm. The combination of Controlled NOT(CNOT), Peres and Selim Al Mamun (SAM) gates are used to achieve the dynamic nature of the S-Box. The simulation analysis is made to evaluate the cryptographic properties such as Avalanche Criteria (AC), Strict Avalanche Criteria (SAC), Bit Independence Criteria (BIC) and Nonlinearity to measure the strength and reliability of the proposed S-box. The comparative performance investigation is done with the Rijndael S-box used in popular AES algorithm as it was widely accepted by NISTIR and still shows that the proposed S-Box outperform with respect to the existing S-box. Use of input dependant reversible logic gate combination enhances the performance of the S-box in both classical and quantum computer.
在各种电路中,普遍采用可逆逻辑门来代替传统的门,以达到降低功耗和提高速度的目的。可逆门的应用领域包括量子计算、纳米技术、低功耗CMOS、光学计算等。可逆门是量子电路的关键器件。本文采用可逆栅极设计动态轻量化s盒。s盒是密码算法的关键电路元件。采用可控非门(CNOT)、Peres门和Selim Al Mamun门(SAM)的组合来实现S-Box的动态特性。通过仿真分析,对雪崩准则(AC)、严格雪崩准则(SAC)、位无关准则(BIC)和非线性等密码学特性进行了评价,以衡量所提出的s盒的强度和可靠性。比较性能调查是与流行的AES算法中使用的Rijndael S-box进行的,因为它被nistr广泛接受,并且仍然表明所提出的S-box优于现有的S-box。利用输入相关的可逆逻辑门组合增强了s盒在经典计算机和量子计算机中的性能。
{"title":"Dynamic S-box with Reversible Gates for both Classical and Quantum Computer","authors":"Anindita Sarkar, S. R. Chatterjee, M. Chakraborty","doi":"10.1109/UPCON56432.2022.9986426","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986426","url":null,"abstract":"Reversible logic gates are universally adopted to replace the classical gates in various circuits to avail the advantages of power reduction and speed maximization. The application of reversible gates includes various fields like Quantum computing, Nano technology, low power CMOS, optical computing etc. Reversible gates are the key components for the quantum circuits. Here reversible gates are used to design a dynamic light weight S-box. The S-box is the key circuit component for cryptographic algorithm. The combination of Controlled NOT(CNOT), Peres and Selim Al Mamun (SAM) gates are used to achieve the dynamic nature of the S-Box. The simulation analysis is made to evaluate the cryptographic properties such as Avalanche Criteria (AC), Strict Avalanche Criteria (SAC), Bit Independence Criteria (BIC) and Nonlinearity to measure the strength and reliability of the proposed S-box. The comparative performance investigation is done with the Rijndael S-box used in popular AES algorithm as it was widely accepted by NISTIR and still shows that the proposed S-Box outperform with respect to the existing S-box. Use of input dependant reversible logic gate combination enhances the performance of the S-box in both classical and quantum computer.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125571280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple Speech Mode Transformation using Adversarial Network 基于对抗网络的多语音模式转换
Kumud Tripathi, Jatin Kumar
The objective of Multiple Speech Mode Transformation (MSMT) is to transform speech from one form to another on the basis of their mode characteristics. In this work, we have explored three different modes of speech (conversation, extempore, and read modes) for their inter-conversion while preserving the speaker identity and the linguistic content. To accomplish this we used a variant of Star Generative Adversarial Network (StarGAN) named as StarGAN-VC. For training, our model does not require parallel occurrences of the sentences and with relatively lesser number of training example we were able to generate high quality transformed outputs. On conducting objective and subjective evaluations, it is deduced that the transformed speech mode outputs are highly comparable to the target speech mode.
多语音模式转换(MSMT)的目标是根据语音的模式特征将语音从一种形式转换为另一种形式。在这项工作中,我们探讨了三种不同的语言模式(对话、即兴和阅读模式)在保留说话者身份和语言内容的情况下的相互转换。为了实现这一点,我们使用了星生成对抗网络(StarGAN)的一个变体,名为StarGAN- vc。对于训练,我们的模型不需要句子的并行出现,并且使用相对较少的训练示例,我们能够生成高质量的转换输出。通过客观和主观评价,推导出转换后的语音模式输出与目标语音模式具有较高的可比性。
{"title":"Multiple Speech Mode Transformation using Adversarial Network","authors":"Kumud Tripathi, Jatin Kumar","doi":"10.1109/UPCON56432.2022.9986477","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986477","url":null,"abstract":"The objective of Multiple Speech Mode Transformation (MSMT) is to transform speech from one form to another on the basis of their mode characteristics. In this work, we have explored three different modes of speech (conversation, extempore, and read modes) for their inter-conversion while preserving the speaker identity and the linguistic content. To accomplish this we used a variant of Star Generative Adversarial Network (StarGAN) named as StarGAN-VC. For training, our model does not require parallel occurrences of the sentences and with relatively lesser number of training example we were able to generate high quality transformed outputs. On conducting objective and subjective evaluations, it is deduced that the transformed speech mode outputs are highly comparable to the target speech mode.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"171 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127097551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Order Simplification of LTI Systems using Enhanced Pole Clustering Technique 基于增强极点聚类技术的LTI系统阶简化
Shekhar Gehlaut, Mirnal Singh Rawat, D. Kumar
This paper proposes a novel, enhanced pole clustering (EPC) method for simplifying complex linear-time invariant (LTI) systems. In the presented technique, the grey wolf optimizer (GWO) is adopted to derive the numerator polynomial of the reduced-order model (ROM) by minimizing the integral squared error (ISE). The proposed technique assures the stability of the resulting ROM. A comparison of different performance indices is conducted to establish the effectiveness of the proposed model order reduction (MOR) method.
本文提出了一种新的、增强的极点聚类(EPC)方法来简化复杂线性时不变系统。该方法采用灰狼优化器(GWO),通过最小化积分平方误差(ISE)来推导降阶模型(ROM)的分子多项式。本文通过对不同性能指标的比较,验证了模型降阶(MOR)方法的有效性。
{"title":"Order Simplification of LTI Systems using Enhanced Pole Clustering Technique","authors":"Shekhar Gehlaut, Mirnal Singh Rawat, D. Kumar","doi":"10.1109/UPCON56432.2022.9986463","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986463","url":null,"abstract":"This paper proposes a novel, enhanced pole clustering (EPC) method for simplifying complex linear-time invariant (LTI) systems. In the presented technique, the grey wolf optimizer (GWO) is adopted to derive the numerator polynomial of the reduced-order model (ROM) by minimizing the integral squared error (ISE). The proposed technique assures the stability of the resulting ROM. A comparison of different performance indices is conducted to establish the effectiveness of the proposed model order reduction (MOR) method.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128204363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extended Template Matching method for Region of Interest Extraction in Cephalometric Landmarks Annotation 基于扩展模板匹配的头颅特征点标注感兴趣区域提取方法
R. S, S. S, Rakshitha R, B. Poornima
Finding areas in the image where the subsequent processing of the features concentrates is known as Region of Interest (ROI) extraction. Utilizing ROI helps speed up processing by excluding irrelevant image regions. ROI extraction in biomedical landmark annotation problems is challenging as radiograph images have varying contrast and intensity levels. Cephalometric landmark annotation is a domain where ROI extraction plays a vital role in traditional machine learning and deep learning solutions. This work proposes a simple and feasible extension to the template matching method to extract the ROI from the cephalometric images. The exact ROI patch is located based on a combined metric calculated using the Normalized correlation coefficient measure and the distance measure. The algorithm is tested on publicly available cephalometric landmark annotation dataset. The experimental results show that the ROIs are extracted with an accuracy of 99.69%. Additionally, a reported average distance between the ROI patch center and the ground truth landmark is 3.96 mm. This demonstrates that the method can practically be used as an initial estimator, significantly improving the accuracy of landmark localization.
在图像中寻找特征后续处理集中的区域称为感兴趣区域(ROI)提取。利用ROI可以通过排除不相关的图像区域来加快处理速度。由于x射线图像具有不同的对比度和强度水平,因此在生物医学地标标注问题中ROI提取具有挑战性。在传统的机器学习和深度学习解决方案中,头部测量地标标注是ROI提取的重要领域。本文提出了一种简单可行的模板匹配方法的扩展,用于从头颅图像中提取感兴趣区域。根据归一化相关系数度量和距离度量计算的组合度量来定位精确的ROI补丁。该算法在公开可用的头颅测量地标标注数据集上进行了测试。实验结果表明,该方法提取roi的准确率为99.69%。此外,据报道,ROI补丁中心与地面真实地标之间的平均距离为3.96 mm。结果表明,该方法可以作为初始估计量,显著提高了地标定位的精度。
{"title":"Extended Template Matching method for Region of Interest Extraction in Cephalometric Landmarks Annotation","authors":"R. S, S. S, Rakshitha R, B. Poornima","doi":"10.1109/UPCON56432.2022.9986436","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986436","url":null,"abstract":"Finding areas in the image where the subsequent processing of the features concentrates is known as Region of Interest (ROI) extraction. Utilizing ROI helps speed up processing by excluding irrelevant image regions. ROI extraction in biomedical landmark annotation problems is challenging as radiograph images have varying contrast and intensity levels. Cephalometric landmark annotation is a domain where ROI extraction plays a vital role in traditional machine learning and deep learning solutions. This work proposes a simple and feasible extension to the template matching method to extract the ROI from the cephalometric images. The exact ROI patch is located based on a combined metric calculated using the Normalized correlation coefficient measure and the distance measure. The algorithm is tested on publicly available cephalometric landmark annotation dataset. The experimental results show that the ROIs are extracted with an accuracy of 99.69%. Additionally, a reported average distance between the ROI patch center and the ground truth landmark is 3.96 mm. This demonstrates that the method can practically be used as an initial estimator, significantly improving the accuracy of landmark localization.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129704017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PUC Optimal Switching Strategies for Renewable Applications in Single Phase Inverter 可再生电源在单相逆变器中的PUC最优开关策略
S. D, P. K, Sivamani D, N. A, N. S, R. R
Nowadays, a lot of different sectors and researchers employ multilevel inverters for high power medium voltage applications. Because of this, hybrid topologies with fewer components have become increasingly popular. In this paper, the authors present an architecture for Hybrid Packed U-Cells that combines fewer switches with or without voltage balancing (H-PUCs). Eight switches are necessary for the proposed H-PUC for it to be able to offer voltage output at levels 7 and 15. H-PUC makes use of a combination of high voltage low frequency (HVLF) and low voltage high frequency (LVHF), which helps to reduce the amount of power that is wasted while simultaneously improving efficiency. A simulation and validation of the topology for the 7-level asymmetry is carried out on a microcontroller with the model number C2000. To evaluate the functionality of the proposed inverter, a simulation is run in the MATLAB programming environment. The H-PUC inverter topology that has been presented is one that is capable of being implemented in applications involving the integration of renewable energy.
目前,许多不同的部门和研究人员都采用多电平逆变器进行大功率中压应用。正因为如此,具有更少组件的混合拓扑变得越来越流行。在本文中,作者提出了一种混合封装U-Cells架构,该架构结合了具有或不具有电压平衡(H-PUCs)的更少开关。为了能够提供7级和15级的电压输出,所提出的H-PUC需要8个开关。H-PUC利用高压低频(HVLF)和低压高频(LVHF)的组合,有助于减少浪费的功率,同时提高效率。在型号为C2000的微控制器上进行了7级不对称拓扑的仿真和验证。为了评估该逆变器的功能,在MATLAB编程环境下进行了仿真。所提出的H-PUC逆变器拓扑是一种能够在涉及可再生能源集成的应用中实现的拓扑。
{"title":"PUC Optimal Switching Strategies for Renewable Applications in Single Phase Inverter","authors":"S. D, P. K, Sivamani D, N. A, N. S, R. R","doi":"10.1109/UPCON56432.2022.9986405","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986405","url":null,"abstract":"Nowadays, a lot of different sectors and researchers employ multilevel inverters for high power medium voltage applications. Because of this, hybrid topologies with fewer components have become increasingly popular. In this paper, the authors present an architecture for Hybrid Packed U-Cells that combines fewer switches with or without voltage balancing (H-PUCs). Eight switches are necessary for the proposed H-PUC for it to be able to offer voltage output at levels 7 and 15. H-PUC makes use of a combination of high voltage low frequency (HVLF) and low voltage high frequency (LVHF), which helps to reduce the amount of power that is wasted while simultaneously improving efficiency. A simulation and validation of the topology for the 7-level asymmetry is carried out on a microcontroller with the model number C2000. To evaluate the functionality of the proposed inverter, a simulation is run in the MATLAB programming environment. The H-PUC inverter topology that has been presented is one that is capable of being implemented in applications involving the integration of renewable energy.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"380 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124737323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Meme Detection For Sentiment Analysis and Human Robot Interactions Using Multiple Modes 情感分析和多模式人机交互的模因检测
Memes are a way of communicating concepts across social media. However, while most memes are intended to be funny, some can turn into offensive as well when text and images are combined together. Recently many successful studies related to sentiment analysis of both image and text have been performed. Such technology, when developed successfully, can be useful for effective Human-Robot-Interactions, specially with humanoid and collaborative robots. In this research, we intend to first develop such technology with available data set using given classes only, since getting labelled data in the robotics domain, specially in robot grasping domain is difficult. In subsequent research, we may extend the same technology for intelligent robot grasping. However, the majority of the research uses either text or images for the sentiment analysis. Since the content and image in memes are sometimes unrelated, detecting hateful memes is a more challenging problem, so the present work considers both as features and uses a multimodal approach for sentiment analysis which could also be useful for Human-Robot-Interactions. Being constrained however with the available data sets, in the present investigation, our focus is on developing multimodal and sequential approaches for classifying these memes into different required classes, more specifically, here two classes: offensive and non-offensive. The fusion approach has been used within multiple modes to take features of both image and text through different models and then it has been used for the classification. While in the sequential approach, the image captioning model which is trained on the MS COCO dataset, with Optical Character Recognition (OCR), is used and classified with the help of the FastText classifier. Both approaches are used on two datasets, one is the MultiOFF dataset, and the other is the Facebook Hateful Meme dataset. Results on both datasets are found to be promising for both approaches.
表情包是在社交媒体上交流概念的一种方式。然而,虽然大多数表情包都是为了搞笑,但当文字和图片结合在一起时,有些表情包也会变得令人反感。近年来,在图像和文本情感分析方面进行了许多成功的研究。如果开发成功,这种技术可以用于有效的人机交互,特别是与类人机器人和协作机器人。在本研究中,我们打算首先使用仅使用给定类的可用数据集开发这种技术,因为在机器人领域,特别是在机器人抓取领域获得标记数据是困难的。在后续的研究中,我们可以将相同的技术扩展到智能机器人抓取。然而,大多数研究使用文本或图像进行情感分析。由于模因中的内容和图像有时是不相关的,因此检测仇恨模因是一个更具挑战性的问题,因此本研究将两者都视为特征,并使用多模态方法进行情感分析,这也可能对人机交互有用。然而,由于现有数据集的限制,在目前的调查中,我们的重点是开发多模态和顺序方法,将这些模因分类为不同的所需类别,更具体地说,这里分为两类:攻击性和非攻击性。该方法在多种模式下使用,通过不同的模型提取图像和文本的特征,然后将其用于分类。在序列方法中,使用在MS COCO数据集上训练的带有光学字符识别(OCR)的图像字幕模型,并在FastText分类器的帮助下进行分类。这两种方法都用于两个数据集,一个是MultiOFF数据集,另一个是Facebook可恶的Meme数据集。两种方法在两个数据集上的结果都是有希望的。
{"title":"Meme Detection For Sentiment Analysis and Human Robot Interactions Using Multiple Modes","authors":"","doi":"10.1109/UPCON56432.2022.9986453","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986453","url":null,"abstract":"Memes are a way of communicating concepts across social media. However, while most memes are intended to be funny, some can turn into offensive as well when text and images are combined together. Recently many successful studies related to sentiment analysis of both image and text have been performed. Such technology, when developed successfully, can be useful for effective Human-Robot-Interactions, specially with humanoid and collaborative robots. In this research, we intend to first develop such technology with available data set using given classes only, since getting labelled data in the robotics domain, specially in robot grasping domain is difficult. In subsequent research, we may extend the same technology for intelligent robot grasping. However, the majority of the research uses either text or images for the sentiment analysis. Since the content and image in memes are sometimes unrelated, detecting hateful memes is a more challenging problem, so the present work considers both as features and uses a multimodal approach for sentiment analysis which could also be useful for Human-Robot-Interactions. Being constrained however with the available data sets, in the present investigation, our focus is on developing multimodal and sequential approaches for classifying these memes into different required classes, more specifically, here two classes: offensive and non-offensive. The fusion approach has been used within multiple modes to take features of both image and text through different models and then it has been used for the classification. While in the sequential approach, the image captioning model which is trained on the MS COCO dataset, with Optical Character Recognition (OCR), is used and classified with the help of the FastText classifier. Both approaches are used on two datasets, one is the MultiOFF dataset, and the other is the Facebook Hateful Meme dataset. Results on both datasets are found to be promising for both approaches.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124089888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1