首页 > 最新文献

2023 IEEE 8th International Conference for Convergence in Technology (I2CT)最新文献

英文 中文
Face Gesture Based Virtual Mouse Using Mediapipe 使用Mediapipe的基于面部手势的虚拟鼠标
Pub Date : 2023-04-07 DOI: 10.1109/I2CT57861.2023.10126453
Akula Kumar Raja, Chidakash Sugandhi, Gorantla Nymish, Nama Sai Havish, Manazhy Rashmi
A disabled person’s life is always dependent on someone else who needs aid with mobility or any other task. Individuals with disabilities may face challenges when using computers. The most common way of interacting with computers is using a mouse and keyboard. It is difficult for people with physical disabilities to use them. Facial movements are one of the best possible actions by physically disabled individuals, By recognizing and responding to these movements, it is possible for them to operate the computer using only their facial expressions. Face recognition is a contemporary approach to interaction between humans and computers (HCI) i.e., The proposed system can easily control the computers by using face gesture recognition. It can be a viable replacement for traditional HCI tools in the future. This research outlines the techniques utilized in the design, implementation, and evaluation of the experiments conducted and presents the results obtained.
残疾人的生活总是依赖于其他需要帮助行动或其他任务的人。残障人士在使用电脑时可能会面临挑战。与电脑交互最常见的方式是使用鼠标和键盘。身体残疾的人很难使用它们。面部动作是身体残疾的人最好的动作之一,通过识别和反应这些动作,他们可以只用面部表情来操作计算机。人脸识别是人类与计算机(HCI)之间交互的一种现代方法,即所提出的系统可以通过使用人脸手势识别轻松控制计算机。在未来,它可以成为传统HCI工具的可行替代品。本研究概述了在设计、实施和评估所进行的实验中使用的技术,并提出了所获得的结果。
{"title":"Face Gesture Based Virtual Mouse Using Mediapipe","authors":"Akula Kumar Raja, Chidakash Sugandhi, Gorantla Nymish, Nama Sai Havish, Manazhy Rashmi","doi":"10.1109/I2CT57861.2023.10126453","DOIUrl":"https://doi.org/10.1109/I2CT57861.2023.10126453","url":null,"abstract":"A disabled person’s life is always dependent on someone else who needs aid with mobility or any other task. Individuals with disabilities may face challenges when using computers. The most common way of interacting with computers is using a mouse and keyboard. It is difficult for people with physical disabilities to use them. Facial movements are one of the best possible actions by physically disabled individuals, By recognizing and responding to these movements, it is possible for them to operate the computer using only their facial expressions. Face recognition is a contemporary approach to interaction between humans and computers (HCI) i.e., The proposed system can easily control the computers by using face gesture recognition. It can be a viable replacement for traditional HCI tools in the future. This research outlines the techniques utilized in the design, implementation, and evaluation of the experiments conducted and presents the results obtained.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117249915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Strategies for Improving Object Detection in Real-Time Projects that use Deep Learning Technology 在使用深度学习技术的实时项目中改进目标检测的策略
Pub Date : 2023-04-07 DOI: 10.1109/I2CT57861.2023.10126449
Niloofar Abed, Ramu Murugan
The popularity and prevalence of devices equipped with object detection technology and controllable via the Internet of Things (IoT) have increased, especially in the post-Corona era. The development of neural networks and artificial intelligence by combining them with IoT systems has achieved acceptable satisfaction among users in adverse conditions by reducing the need for manpower and increasing productivity. Therefore, the scope of using such mechanisms has expanded in most fields, from self-driving vehicles to agricultural crops. Beginners will be confronted with a massive amount of complex information as a result of the design and application of such technologies in interdisciplinary fields. Due to the popularity of using the You Only Look Once (YOLO) object detection algorithm, this article provided a guideline as a traffic light subject classification and, offers suggested solutions and exclusive approches to increase the accuracy of object detection in real-time projects with a practical application attitude for the enthusiasts and developers particularly in object detection scenarios by employing YOLO.
配备目标检测技术并通过物联网(IoT)控制的设备的普及程度和流行程度有所增加,特别是在后冠状病毒时代。神经网络和人工智能的发展与物联网系统相结合,减少了对人力的需求,提高了生产率,在不利的条件下,用户获得了可接受的满意度。因此,从自动驾驶汽车到农作物,这种机制的使用范围在大多数领域都得到了扩展。由于这些技术在跨学科领域的设计和应用,初学者将面临大量复杂的信息。由于YOLO (You Only Look Once)目标检测算法的普及,本文作为红绿灯主题分类的指南,以实际应用的态度,为使用YOLO进行目标检测的爱好者和开发者提供了提高实时项目中目标检测精度的建议解决方案和独家途径。
{"title":"Strategies for Improving Object Detection in Real-Time Projects that use Deep Learning Technology","authors":"Niloofar Abed, Ramu Murugan","doi":"10.1109/I2CT57861.2023.10126449","DOIUrl":"https://doi.org/10.1109/I2CT57861.2023.10126449","url":null,"abstract":"The popularity and prevalence of devices equipped with object detection technology and controllable via the Internet of Things (IoT) have increased, especially in the post-Corona era. The development of neural networks and artificial intelligence by combining them with IoT systems has achieved acceptable satisfaction among users in adverse conditions by reducing the need for manpower and increasing productivity. Therefore, the scope of using such mechanisms has expanded in most fields, from self-driving vehicles to agricultural crops. Beginners will be confronted with a massive amount of complex information as a result of the design and application of such technologies in interdisciplinary fields. Due to the popularity of using the You Only Look Once (YOLO) object detection algorithm, this article provided a guideline as a traffic light subject classification and, offers suggested solutions and exclusive approches to increase the accuracy of object detection in real-time projects with a practical application attitude for the enthusiasts and developers particularly in object detection scenarios by employing YOLO.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115886182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Relative Study of Intelligent Control Techniques to Maintain Variable Pitch-Angle of the Wind Turbine 风电机组变俯仰角智能控制技术的相关研究
Pub Date : 2023-04-07 DOI: 10.1109/I2CT57861.2023.10126335
V. Khatavkar, Snehal Andhale, Panchshila Pillewar, Utkarsh Alset
The wind turbine requires a robust and time-responsive system to control the pitch–angle (Pit–Ang) of the mechanical actuator. If the response of speed is very efficient then, the controller can act according to the prescribed logic and its mechanical mechanism can work faster with its response time. In this paper, real-time Data from IMD (Indian Meteorological Department) is used for the relative study of the model of wind turbine created in MATLAB / Simulink® environment using Fuzzy Logic Toolbox™. The principle of wind turbine used here is to supply a controlled input to the system and after synthesis, these rules in the form of signal are transferred to the plant which has a drive train and pitch actuator. The output responses of the proposed controller are compared amongst proportional– integral–derivative (PID), fuzzy, and adaptive fuzzy–PID Controllers. The simulation results seen between the adaptive fuzzy– based PID controller surpasses the expected results by Tr = 95.454%, Ts = 61.409% and negligible overshoot as compared to open–looped and conventional responsive controller.
风力发电机需要一个鲁棒和时间响应系统来控制机械驱动器的俯仰角(Pit-Ang)。如果速度响应非常有效,则控制器可以按照规定的逻辑行动,其机械机构可以根据其响应时间更快地工作。本文利用IMD (Indian Meteorological Department)的实时数据,利用Fuzzy Logic Toolbox™对MATLAB / Simulink®环境下创建的风力机模型进行了相关研究。这里使用的风力发电机的原理是为系统提供一个受控输入,经过综合,这些规则以信号的形式传递给具有传动系和俯仰执行器的装置。比较了比例-积分-导数(PID)、模糊和自适应模糊PID控制器的输出响应。仿真结果表明,与开环和常规响应控制器相比,基于自适应模糊的PID控制器的Tr = 95.454%, Ts = 61.409%优于预期结果,超调可以忽略不计。
{"title":"Relative Study of Intelligent Control Techniques to Maintain Variable Pitch-Angle of the Wind Turbine","authors":"V. Khatavkar, Snehal Andhale, Panchshila Pillewar, Utkarsh Alset","doi":"10.1109/I2CT57861.2023.10126335","DOIUrl":"https://doi.org/10.1109/I2CT57861.2023.10126335","url":null,"abstract":"The wind turbine requires a robust and time-responsive system to control the pitch–angle (Pit–Ang) of the mechanical actuator. If the response of speed is very efficient then, the controller can act according to the prescribed logic and its mechanical mechanism can work faster with its response time. In this paper, real-time Data from IMD (Indian Meteorological Department) is used for the relative study of the model of wind turbine created in MATLAB / Simulink® environment using Fuzzy Logic Toolbox™. The principle of wind turbine used here is to supply a controlled input to the system and after synthesis, these rules in the form of signal are transferred to the plant which has a drive train and pitch actuator. The output responses of the proposed controller are compared amongst proportional– integral–derivative (PID), fuzzy, and adaptive fuzzy–PID Controllers. The simulation results seen between the adaptive fuzzy– based PID controller surpasses the expected results by Tr = 95.454%, Ts = 61.409% and negligible overshoot as compared to open–looped and conventional responsive controller.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":"28 9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115634459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance Analysis of Machine Learning Algorithms to Predict Cardiovascular Disease 预测心血管疾病的机器学习算法性能分析
Pub Date : 2023-04-07 DOI: 10.1109/I2CT57861.2023.10126428
Hridya V Ramesh, Rahul Krishnan Pathinarupothi
Globally the rate of heart disease has increased drastically due to unhealthy eating habits and reduced physical activities. It has become one of the significant causes of death worldwide. As per the reports of the world health organization(WHO), 31% of all deaths worldwide are caused by cardiovascular diseases. This demands the development of a system capable of early detection of cardiovascular diseases at an affordable cost. With this as the objective, multiple machine learning algorithms have been selected to evaluate their performance in the early detection of cardiovascular diseases. This work utilizes available data sets of an individual’s vital parameters, demographic data, and exercise parameters for predicting cardiovascular diseases. An extensive evaluation is performed to identify the best-suited supervised machine learning classifier that could predict cardiovascular diseases using the available datasets. This research work details the nine different classification algorithms utilized for this analysis. For each algorithm, the F1-score, precision, recall, accuracy, and Area Under the Receiver Operating Characteristics (AUROC) values for each model have been determined and compared with the rest of the algorithms. The results show that random forest and gradient boosting models outperform others and demonstrate an F1-Score of 0.88 and an AUROC value of 0.92, respectively. This showcases that doctors could utilize this technique for the early identification of cardiovascular diseases. This will provide the opportunity to offer adequate medical treatments early, thus saving lives.
在全球范围内,由于不健康的饮食习惯和体育活动的减少,心脏病的发病率急剧上升。它已成为世界范围内死亡的主要原因之一。根据世界卫生组织(WHO)的报告,全世界31%的死亡是由心血管疾病引起的。这就要求开发一种能够以负担得起的成本及早发现心血管疾病的系统。以此为目标,我们选择了多种机器学习算法来评估它们在心血管疾病早期检测中的表现。这项工作利用个人重要参数、人口统计数据和运动参数的可用数据集来预测心血管疾病。进行了广泛的评估,以确定最适合的监督机器学习分类器,该分类器可以使用可用的数据集预测心血管疾病。这项研究工作详细介绍了用于此分析的九种不同的分类算法。对于每种算法,确定了每种模型的f1评分、精度、召回率、准确度和接收者操作特征下面积(Area Under the Receiver Operating Characteristics, AUROC)值,并与其他算法进行了比较。结果表明,随机森林模型和梯度增强模型的F1-Score为0.88,AUROC值为0.92,优于其他模型。这表明医生可以利用这项技术来早期识别心血管疾病。这将提供机会及早提供适当的医疗,从而挽救生命。
{"title":"Performance Analysis of Machine Learning Algorithms to Predict Cardiovascular Disease","authors":"Hridya V Ramesh, Rahul Krishnan Pathinarupothi","doi":"10.1109/I2CT57861.2023.10126428","DOIUrl":"https://doi.org/10.1109/I2CT57861.2023.10126428","url":null,"abstract":"Globally the rate of heart disease has increased drastically due to unhealthy eating habits and reduced physical activities. It has become one of the significant causes of death worldwide. As per the reports of the world health organization(WHO), 31% of all deaths worldwide are caused by cardiovascular diseases. This demands the development of a system capable of early detection of cardiovascular diseases at an affordable cost. With this as the objective, multiple machine learning algorithms have been selected to evaluate their performance in the early detection of cardiovascular diseases. This work utilizes available data sets of an individual’s vital parameters, demographic data, and exercise parameters for predicting cardiovascular diseases. An extensive evaluation is performed to identify the best-suited supervised machine learning classifier that could predict cardiovascular diseases using the available datasets. This research work details the nine different classification algorithms utilized for this analysis. For each algorithm, the F1-score, precision, recall, accuracy, and Area Under the Receiver Operating Characteristics (AUROC) values for each model have been determined and compared with the rest of the algorithms. The results show that random forest and gradient boosting models outperform others and demonstrate an F1-Score of 0.88 and an AUROC value of 0.92, respectively. This showcases that doctors could utilize this technique for the early identification of cardiovascular diseases. This will provide the opportunity to offer adequate medical treatments early, thus saving lives.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116332368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forest Fire Detection using Convolutional Neural Network Model 基于卷积神经网络模型的森林火灾探测
Pub Date : 2023-04-07 DOI: 10.1109/I2CT57861.2023.10126370
Shubham Sah, S. Prakash, S. Meena
Everyone recalls the destruction brought on by the Australian forest fires in 2019. Even though there isn’t much we can do to battle forest fires on our own, we can always rely on technology. By this we are trying to predict the accuracy of these models on forest fire data set. We are trying to detect forest fire in dense forest; our data set is very diverse and consist of images having forest fires, smokes, non-smoke and fire images. We have found out that Sensor detection and real-time geological data analysis are two methods for detecting forest fires. However, using image classification, for which Deep learning is the most efficient solution, is one of the best methods for detecting fire. In addition, these algorithms can be integrated with drones using deep learning techniques so that images can be taken frequently from the sky with ease, smoke can be detected in dense forests, and the authorities can be notified to take immediate action. The convolutional neural network algorithm for fire detection was the sole focus of our paper. The value of various epochs is used to evaluate these results.
每个人都记得2019年澳大利亚森林大火造成的破坏。尽管我们自己在扑灭森林大火方面无能为力,但我们总是可以依靠科技。通过这种方法,我们试图预测这些模型在森林火灾数据集上的准确性。我们试图在茂密的森林中发现森林火灾;我们的数据集非常多样化,包括森林火灾、烟雾、非烟雾和火灾图像。我们发现传感器探测和实时地质数据分析是森林火灾探测的两种方法。然而,使用图像分类是检测火灾的最佳方法之一,深度学习是其中最有效的解决方案。此外,这些算法可以通过深度学习技术与无人机集成,从而可以轻松地从空中频繁拍摄图像,可以在茂密的森林中检测到烟雾,并可以通知当局立即采取行动。卷积神经网络火灾探测算法是本文唯一的研究重点。用不同时期的值来评价这些结果。
{"title":"Forest Fire Detection using Convolutional Neural Network Model","authors":"Shubham Sah, S. Prakash, S. Meena","doi":"10.1109/I2CT57861.2023.10126370","DOIUrl":"https://doi.org/10.1109/I2CT57861.2023.10126370","url":null,"abstract":"Everyone recalls the destruction brought on by the Australian forest fires in 2019. Even though there isn’t much we can do to battle forest fires on our own, we can always rely on technology. By this we are trying to predict the accuracy of these models on forest fire data set. We are trying to detect forest fire in dense forest; our data set is very diverse and consist of images having forest fires, smokes, non-smoke and fire images. We have found out that Sensor detection and real-time geological data analysis are two methods for detecting forest fires. However, using image classification, for which Deep learning is the most efficient solution, is one of the best methods for detecting fire. In addition, these algorithms can be integrated with drones using deep learning techniques so that images can be taken frequently from the sky with ease, smoke can be detected in dense forests, and the authorities can be notified to take immediate action. The convolutional neural network algorithm for fire detection was the sole focus of our paper. The value of various epochs is used to evaluate these results.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115472819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of an Efficient Bioinspired Model for Optimizing Robotic Arm Movements via Ensemble Learning Operations 基于集成学习操作优化机械臂运动的高效仿生模型设计
Pub Date : 2023-04-07 DOI: 10.1109/I2CT57861.2023.10126406
Prachi V. Karlekar, Swapna Choudhary, Atul Deshmukh, Harish Banote
Robotic arm movements are highly dependent on design and deployment of sensors & actuation devices & their duty cycles. Optimizing current-level duty cycles for these devices can reduce the power consumption, and maximize the efficiency of control for different device operations. Existing duty cycle control models for robotic arms are highly complex, or have lower efficiency levels. To overcome these issues, this text proposes design of an efficient bioinspired model for optimizing robotic arm movements via ensemble learning operations. The arm is built using Arduino controller along with stepper motors, which assist in controlled movements for different arm operations. The proposed model uses Mayfly Optimization (MO) in order to identify duty cycles of different arm components for different movement types. The MO Model uses delay, energy and jitter parameters in order to estimate a fitness function that is optimized in order to identify arm movement sets. These movement sets are classified into performance-aware movements via a combination of Naïve Bayes (NB), k Nearest Neighbours (kNN), Support Vector Machine (SVM), Logistic Regression (LR), and Multilayer Perceptron (MLP) classifiers. Due to which the model is able to reduce the delay needed for control the arms by 8.3%, reduce the energy needed for control operations by 2.9%, and reduce the control jitter by 4.5% under real-time scenarios.
机械臂的运动高度依赖于传感器和驱动装置的设计和部署及其占空比。优化这些器件的电流级占空比可以降低功耗,并最大限度地提高不同器件操作的控制效率。现有的机械臂占空比控制模型过于复杂,或者效率较低。为了克服这些问题,本文提出了一个有效的生物启发模型的设计,通过集成学习操作来优化机械臂运动。该手臂使用Arduino控制器和步进电机构建,有助于控制不同手臂操作的运动。该模型采用蜉蝣优化(Mayfly Optimization, MO)来识别不同运动类型下不同手臂部件的占空比。MO模型使用延迟、能量和抖动参数来估计适应度函数,并对适应度函数进行优化,以识别手臂运动集。通过Naïve贝叶斯(NB)、k近邻(kNN)、支持向量机(SVM)、逻辑回归(LR)和多层感知器(MLP)分类器的组合,这些运动集被分类为性能感知运动。因此,在实时场景下,该模型能够将控制臂所需的延迟降低8.3%,将控制操作所需的能量降低2.9%,将控制抖动降低4.5%。
{"title":"Design of an Efficient Bioinspired Model for Optimizing Robotic Arm Movements via Ensemble Learning Operations","authors":"Prachi V. Karlekar, Swapna Choudhary, Atul Deshmukh, Harish Banote","doi":"10.1109/I2CT57861.2023.10126406","DOIUrl":"https://doi.org/10.1109/I2CT57861.2023.10126406","url":null,"abstract":"Robotic arm movements are highly dependent on design and deployment of sensors & actuation devices & their duty cycles. Optimizing current-level duty cycles for these devices can reduce the power consumption, and maximize the efficiency of control for different device operations. Existing duty cycle control models for robotic arms are highly complex, or have lower efficiency levels. To overcome these issues, this text proposes design of an efficient bioinspired model for optimizing robotic arm movements via ensemble learning operations. The arm is built using Arduino controller along with stepper motors, which assist in controlled movements for different arm operations. The proposed model uses Mayfly Optimization (MO) in order to identify duty cycles of different arm components for different movement types. The MO Model uses delay, energy and jitter parameters in order to estimate a fitness function that is optimized in order to identify arm movement sets. These movement sets are classified into performance-aware movements via a combination of Naïve Bayes (NB), k Nearest Neighbours (kNN), Support Vector Machine (SVM), Logistic Regression (LR), and Multilayer Perceptron (MLP) classifiers. Due to which the model is able to reduce the delay needed for control the arms by 8.3%, reduce the energy needed for control operations by 2.9%, and reduce the control jitter by 4.5% under real-time scenarios.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116748604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid Machine Learning Model for Lie-Detection 用于测谎的混合机器学习模型
Pub Date : 2023-04-07 DOI: 10.1109/I2CT57861.2023.10126460
Rupali J Dhabarde, D. V. Kodawade, Sheetal Zalte
A technique for recognizing a person from his photograph is facial recognition. Due to its extensive range of applications in several fields, it has drawn the attention of numerous researchers in the field of computer vision in recent years (Cyber security, crime cases, and biometrics). This technology's operation is based on the extraction of features from an input picture using methods like PCA, ICA, LDA etc. After comparing them with others from another image to verify or assert an individual's identification. Via this work, we applied amalgamation of CNN and SVM techniques to two face datasets that will be split into two groups in a machine learning-based methodology. We assessed different machine learning-based lie detectors using our amassed dataset. Our findings demonstrate that combined CNN with SVM task achieved accuracy up to 58%.
从照片中识别一个人的技术是面部识别。由于其在多个领域的广泛应用,近年来引起了计算机视觉领域(网络安全、犯罪案件和生物识别)众多研究人员的关注。该技术的操作是基于使用PCA, ICA, LDA等方法从输入图像中提取特征。将其与另一图像中的其他图像进行比较,以验证或断言个人身份。通过这项工作,我们将CNN和SVM技术的合并应用于两个人脸数据集,这些数据集将在基于机器学习的方法中分成两组。我们使用我们积累的数据集评估了不同的基于机器学习的测谎仪。我们的研究结果表明,CNN与SVM任务相结合的准确率高达58%。
{"title":"Hybrid Machine Learning Model for Lie-Detection","authors":"Rupali J Dhabarde, D. V. Kodawade, Sheetal Zalte","doi":"10.1109/I2CT57861.2023.10126460","DOIUrl":"https://doi.org/10.1109/I2CT57861.2023.10126460","url":null,"abstract":"A technique for recognizing a person from his photograph is facial recognition. Due to its extensive range of applications in several fields, it has drawn the attention of numerous researchers in the field of computer vision in recent years (Cyber security, crime cases, and biometrics). This technology's operation is based on the extraction of features from an input picture using methods like PCA, ICA, LDA etc. After comparing them with others from another image to verify or assert an individual's identification. Via this work, we applied amalgamation of CNN and SVM techniques to two face datasets that will be split into two groups in a machine learning-based methodology. We assessed different machine learning-based lie detectors using our amassed dataset. Our findings demonstrate that combined CNN with SVM task achieved accuracy up to 58%.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115580918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Mango Fruit Disease Severity Assessment with CNN and SVM-Based Classification 基于CNN和svm分类增强芒果果实病害严重程度评估
Pub Date : 2023-04-07 DOI: 10.1109/I2CT57861.2023.10126397
D. Banerjee, V. Kukreja, S. Hariharan, Vishal Jain
The mango leaf powdery mildew disease poses a serious threat to mango production society globally by significantly lowering yield and quality. For timely intervention and efficient management, early disease detection and classification are important. In this research and education area, a novel hybrid approach utilizes Convolutional Neural Networks (CNN) and Support Vector Machines to identify the mango leaf powdery mildew disease based on four severity levels (SVM). Three phases make up the proposed approach: data structure, CNN-selected attributes, and SVM classification. We collect and preprocess images of mango leaves during the data organization step, and in the CNN - attributes selection phase, we apply a CNN model for feature extraction and selection. For the mango leaf powdery mildew dataset, we improve the CNN model to find the most relevant features for the classification task. The SVM - classification step includes training an SVM model on the obtained features and refining the hyperparameters via k-fold cross-validation. The proposed CNN and SVM hybrid multi-classification model for mango leaf powdery mildew disease achieved an overall accuracy of 89.29%. A dataset of 2559 images with 4 severity levels was utilized. The model works well overall, as a macro-average F1-score of 90.10, the weighted average F1-score's minimal value of 53.85%. The model is less successful in predicting instances for classes with smaller support proportions, as shown by the micro-average F1-score, which is 89.29% and is lower overall than the macro-average F1-score.
芒果叶白粉病严重影响芒果产量和品质,对全球芒果生产社会造成严重威胁。为了及时干预和有效管理,疾病的早期发现和分类是重要的。在这个研究和教育领域,一种新的混合方法利用卷积神经网络(CNN)和支持向量机(SVM)来识别芒果叶白粉病,基于四个严重程度(SVM)。该方法由三个阶段组成:数据结构、cnn选择属性和SVM分类。在数据组织阶段,我们采集芒果叶图像并进行预处理,在CNN -属性选择阶段,我们采用CNN模型进行特征提取和选择。对于芒果叶白粉病数据集,我们改进CNN模型,为分类任务找到最相关的特征。支持向量机分类步骤包括在得到的特征上训练支持向量机模型,并通过k-fold交叉验证来细化超参数。提出的芒果叶片白粉病CNN和SVM混合多分类模型总体准确率达到89.29%。使用了一个包含2559张图像的数据集,其中包含4个严重级别。模型总体效果良好,宏观平均f1得分为90.10,加权平均f1得分最小值为53.85%。该模型在预测支持比例较小的类的实例时不太成功,微观平均f1得分为89.29%,总体上低于宏观平均f1得分。
{"title":"Enhancing Mango Fruit Disease Severity Assessment with CNN and SVM-Based Classification","authors":"D. Banerjee, V. Kukreja, S. Hariharan, Vishal Jain","doi":"10.1109/I2CT57861.2023.10126397","DOIUrl":"https://doi.org/10.1109/I2CT57861.2023.10126397","url":null,"abstract":"The mango leaf powdery mildew disease poses a serious threat to mango production society globally by significantly lowering yield and quality. For timely intervention and efficient management, early disease detection and classification are important. In this research and education area, a novel hybrid approach utilizes Convolutional Neural Networks (CNN) and Support Vector Machines to identify the mango leaf powdery mildew disease based on four severity levels (SVM). Three phases make up the proposed approach: data structure, CNN-selected attributes, and SVM classification. We collect and preprocess images of mango leaves during the data organization step, and in the CNN - attributes selection phase, we apply a CNN model for feature extraction and selection. For the mango leaf powdery mildew dataset, we improve the CNN model to find the most relevant features for the classification task. The SVM - classification step includes training an SVM model on the obtained features and refining the hyperparameters via k-fold cross-validation. The proposed CNN and SVM hybrid multi-classification model for mango leaf powdery mildew disease achieved an overall accuracy of 89.29%. A dataset of 2559 images with 4 severity levels was utilized. The model works well overall, as a macro-average F1-score of 90.10, the weighted average F1-score's minimal value of 53.85%. The model is less successful in predicting instances for classes with smaller support proportions, as shown by the micro-average F1-score, which is 89.29% and is lower overall than the macro-average F1-score.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":"164 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123135445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Optimized Recognition Of CAPTCHA Through Attention Models 通过注意模型优化验证码识别
Pub Date : 2023-04-07 DOI: 10.1109/I2CT57861.2023.10126193
Raghavendra A Hallyal, S. C, P. Desai, M. M
Information retrieval from the CAPTCHA is a crucial part, this CAPTCHA always contains some unwanted information along with required information, so attention technique comes in handy to select such useful information discarding the unwanted part. The attention concept has become a very important part in the field of deep learning which uses Natural Language Processing(NLP) and Computer Vision(CV). Attention mechanism is rigorously used in OCR based applications which requires generating of selected information rather than every information available. Our work includes implementation of general, global and local Attention mechanisms used with two different models which were transfer learning model and the parameter search model. As OCR with attention technique is computationally costly it is required to optimize the entire process so we suggest optimized retrieval of information from CAPTCHA using parameter search algorithm. This retrieval includes using weights that reduced the training time from 4.03 minutes to 3.33 minutes and the number of training images which were used for training were reduced than before. We obtained the highest accuracy of 87.34% for general attention with parameter search model and local attention model with parameter search model proved to have less computation and less training time than the general attention with parameter search model.
从CAPTCHA中检索信息是一个关键部分,CAPTCHA中总是包含一些不需要的信息和需要的信息,因此注意技术可以方便地选择有用的信息,丢弃不需要的部分。在自然语言处理(NLP)和计算机视觉(CV)相结合的深度学习领域中,注意力概念已经成为一个非常重要的组成部分。注意机制在基于OCR的应用中得到了严格的应用,该应用要求生成选择的信息而不是所有可用的信息。我们的工作包括使用迁移学习模型和参数搜索模型两种不同的模型实现一般、全局和局部注意机制。由于注意OCR技术计算量大,需要对整个过程进行优化,因此我们建议使用参数搜索算法对CAPTCHA信息进行优化检索。该检索包括使用权值将训练时间从4.03分钟减少到3.33分钟,并且用于训练的训练图像数量比以前减少。我们用参数搜索模型对一般注意模型获得了最高的准确率(87.34%),用参数搜索模型对局部注意模型的计算量和训练时间比用参数搜索模型少。
{"title":"Optimized Recognition Of CAPTCHA Through Attention Models","authors":"Raghavendra A Hallyal, S. C, P. Desai, M. M","doi":"10.1109/I2CT57861.2023.10126193","DOIUrl":"https://doi.org/10.1109/I2CT57861.2023.10126193","url":null,"abstract":"Information retrieval from the CAPTCHA is a crucial part, this CAPTCHA always contains some unwanted information along with required information, so attention technique comes in handy to select such useful information discarding the unwanted part. The attention concept has become a very important part in the field of deep learning which uses Natural Language Processing(NLP) and Computer Vision(CV). Attention mechanism is rigorously used in OCR based applications which requires generating of selected information rather than every information available. Our work includes implementation of general, global and local Attention mechanisms used with two different models which were transfer learning model and the parameter search model. As OCR with attention technique is computationally costly it is required to optimize the entire process so we suggest optimized retrieval of information from CAPTCHA using parameter search algorithm. This retrieval includes using weights that reduced the training time from 4.03 minutes to 3.33 minutes and the number of training images which were used for training were reduced than before. We obtained the highest accuracy of 87.34% for general attention with parameter search model and local attention model with parameter search model proved to have less computation and less training time than the general attention with parameter search model.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123330931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weighted Pooling RoBERTa for Effective Text Emotion Detection 有效文本情感检测的加权池RoBERTa
Pub Date : 2023-04-07 DOI: 10.1109/I2CT57861.2023.10126396
Meenu Mathew, J. Prakash
Textual emotion detection is a classification problem that assigns different emotions to a given text input. It reveals the writer’s mental state. Its diversity and uncertainty make it a challenging task. The existing methods in machine learning can be used for emotion detection; however, it fails in processing very long passages. In this work, we employ weighted pooling pretrained RoBERTa model for effective textual emotion detection. To perform experiments, we use two data sets, ISEAR and tweets, with 7516 and 21048 records, respectively. Precision, recall, F1-score, and classification accuracy are used to assess the models. Experimental results indicate that the weighted pooling RoBERTa model outperforms the deep learning models on both datasets with significant improvement in accuracy.
文本情感检测是一个分类问题,它将不同的情感分配给给定的文本输入。它揭示了作者的精神状态。它的多样性和不确定性使它成为一项具有挑战性的任务。现有的机器学习方法可以用于情绪检测;然而,它不能处理很长的段落。在这项工作中,我们使用加权池预训练RoBERTa模型进行有效的文本情感检测。为了进行实验,我们使用两个数据集,ISEAR和tweets,分别有7516条和21048条记录。精密度、召回率、f1分数和分类精度被用来评估模型。实验结果表明,加权池化RoBERTa模型在两个数据集上都优于深度学习模型,准确率显著提高。
{"title":"Weighted Pooling RoBERTa for Effective Text Emotion Detection","authors":"Meenu Mathew, J. Prakash","doi":"10.1109/I2CT57861.2023.10126396","DOIUrl":"https://doi.org/10.1109/I2CT57861.2023.10126396","url":null,"abstract":"Textual emotion detection is a classification problem that assigns different emotions to a given text input. It reveals the writer’s mental state. Its diversity and uncertainty make it a challenging task. The existing methods in machine learning can be used for emotion detection; however, it fails in processing very long passages. In this work, we employ weighted pooling pretrained RoBERTa model for effective textual emotion detection. To perform experiments, we use two data sets, ISEAR and tweets, with 7516 and 21048 records, respectively. Precision, recall, F1-score, and classification accuracy are used to assess the models. Experimental results indicate that the weighted pooling RoBERTa model outperforms the deep learning models on both datasets with significant improvement in accuracy.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123853003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 IEEE 8th International Conference for Convergence in Technology (I2CT)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1