首页 > 最新文献

2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)最新文献

英文 中文
Intelligent Papilledema Detector (IPD) 智能乳头水肿检测器(IPD)
Pub Date : 2023-02-01 DOI: 10.1109/AICAPS57044.2023.10074229
Priya Thiagarajan, S. M
Background: Timely diagnosis of papilledema is essential to avoid vision loss and progress of life threatening conditions. Expert ophthalmologists or neurologists are not available in Emergency departments and in rural healthcare centers for timely detection. An intelligent, non-invasive detection system to aid healthcare professionals to detect papilledema and triage neurological patients is essential for early diagnosis for saving vision & even livesMethodology: Retinal fundus images are used to identify papilledema. After suitable preprocessing of the data, trained Convolutional Neural Networks can be used to classify the images to detect papilledema. Our proposed model uses EfficientNet-B3 to accurately and efficiently detect papilledema using an image dataset.Results: Accuracy of 98.54% is achieved with the EfficientNet-B3 model. Other performance metrics are also significantly higher than existing literature.Conclusion: The Intelligent Papilledema Detector will be very helpful in emergency departments and rural healthcare centers to aid with early detection of papilledema. The results obtained are very encouraging, though training with more data from various sources will help improve the practical usability of the system. Emerging trends of using smartphones with a lens assembly to capture also can be taken up as further work.
背景:及时诊断乳头水肿对于避免视力丧失和危及生命的疾病发展至关重要。在急诊科和农村保健中心,没有专业的眼科医生或神经科医生可以及时发现。一个智能的、非侵入性的检测系统可以帮助医疗保健专业人员检测乳头状水肿,并对神经系统患者进行分类,这对于早期诊断至关重要,可以挽救视力甚至生命。方法:视网膜眼底图像用于识别乳头状水肿。在对数据进行适当的预处理后,可以使用训练好的卷积神经网络对图像进行分类,以检测乳头水肿。我们提出的模型使用effentnet - b3使用图像数据集准确有效地检测乳头水肿。结果:使用effentnet - b3模型,准确率达到98.54%。其他性能指标也明显高于现有文献。结论:智能乳头水肿检测仪对急诊科和农村卫生院早期发现乳头水肿有一定的帮助。获得的结果非常令人鼓舞,尽管使用来自各种来源的更多数据进行训练将有助于提高系统的实际可用性。使用带有镜头组件的智能手机进行拍摄的新兴趋势也可以作为进一步的工作。
{"title":"Intelligent Papilledema Detector (IPD)","authors":"Priya Thiagarajan, S. M","doi":"10.1109/AICAPS57044.2023.10074229","DOIUrl":"https://doi.org/10.1109/AICAPS57044.2023.10074229","url":null,"abstract":"Background: Timely diagnosis of papilledema is essential to avoid vision loss and progress of life threatening conditions. Expert ophthalmologists or neurologists are not available in Emergency departments and in rural healthcare centers for timely detection. An intelligent, non-invasive detection system to aid healthcare professionals to detect papilledema and triage neurological patients is essential for early diagnosis for saving vision & even livesMethodology: Retinal fundus images are used to identify papilledema. After suitable preprocessing of the data, trained Convolutional Neural Networks can be used to classify the images to detect papilledema. Our proposed model uses EfficientNet-B3 to accurately and efficiently detect papilledema using an image dataset.Results: Accuracy of 98.54% is achieved with the EfficientNet-B3 model. Other performance metrics are also significantly higher than existing literature.Conclusion: The Intelligent Papilledema Detector will be very helpful in emergency departments and rural healthcare centers to aid with early detection of papilledema. The results obtained are very encouraging, though training with more data from various sources will help improve the practical usability of the system. Emerging trends of using smartphones with a lens assembly to capture also can be taken up as further work.","PeriodicalId":146698,"journal":{"name":"2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128121967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of Autism and Dyslexia Using Machine Learning and Clinical Data Balancing 使用机器学习和临床数据平衡预测自闭症和阅读障碍
Pub Date : 2023-02-01 DOI: 10.1109/AICAPS57044.2023.10074161
S. Shilaskar, S. Bhatlawande, Shivpriya Deshmukh, Harshal Dhande
Autism spectrum disorder (ASD) and dyslexia are expanding more swiftly than ever nowadays. Finding the characteristics of dyslexia and autism through screening tests is costly and time-consuming. Thanks to breakthroughs in artificial intelligence, computers, and machine learning, autism and dyslexia may be predicted at a very young age (ML). Even though several studies have been carried out using quite a few different approaches, none of them has shown a clear justification for how to predict autism and dyslexia traits across age groups. This study attempts to build a suitable prediction model enabled by ML technology to predict ASD and dyslexia for people of any age. This work seeks to examine the possible use of Random Forest, SVM with linear kernel, SVM with polynomial kernel, SVM with rbf kernel, SVM with sigmoid kernel, XGBoost, Decision Tree, Logistic Regression, Naïve Bayes, and KNN to forecast and assess ASD and dyslexia difficulties in children, adolescents and adults. Using real data set collected from individuals with and without autistic traits, the proposed model and the AQ-10 screening tool were assessed. The data for dyslexia is made up of 3644 cases with 197 properties, 196 of which are independent variables and one is a dependent variable. The data for autism consists of 704 cases with 22 characteristics, 21 independent variables, and 1 dependent variable with binary values (YES or NO). The results of the research showed that, in terms of accuracy, precision, F1 score, and recall, the recommended prediction model gave better results for the data set.
如今,自闭症谱系障碍(ASD)和阅读障碍的发展速度比以往任何时候都要快。通过筛选测试来发现阅读障碍和自闭症的特征既昂贵又耗时。由于人工智能、计算机和机器学习的突破,自闭症和阅读障碍可能在很小的时候就被预测出来。尽管已经进行了几项研究,使用了几种不同的方法,但没有一项研究能够明确地证明如何预测不同年龄段的自闭症和阅读障碍特征。本研究试图通过ML技术建立一个适合任何年龄段人群的ASD和阅读障碍预测模型。本研究旨在探讨随机森林、线性核支持向量机、多项式核支持向量机、rbf核支持向量机、s型核支持向量机、XGBoost、决策树、Logistic回归、Naïve贝叶斯和KNN在儿童、青少年和成人中预测和评估ASD和阅读障碍的可能性。使用从具有和不具有自闭症特征的个体中收集的真实数据集,对所提出的模型和AQ-10筛选工具进行评估。阅读障碍的数据由3644个案例组成,有197个属性,其中196个是自变量,1个是因变量。自闭症数据包括704例,22个特征,21个自变量和1个二元值(YES或NO)的因变量。研究结果表明,在准确率、精密度、F1分数和召回率方面,推荐的预测模型对数据集给出了更好的结果。
{"title":"Prediction of Autism and Dyslexia Using Machine Learning and Clinical Data Balancing","authors":"S. Shilaskar, S. Bhatlawande, Shivpriya Deshmukh, Harshal Dhande","doi":"10.1109/AICAPS57044.2023.10074161","DOIUrl":"https://doi.org/10.1109/AICAPS57044.2023.10074161","url":null,"abstract":"Autism spectrum disorder (ASD) and dyslexia are expanding more swiftly than ever nowadays. Finding the characteristics of dyslexia and autism through screening tests is costly and time-consuming. Thanks to breakthroughs in artificial intelligence, computers, and machine learning, autism and dyslexia may be predicted at a very young age (ML). Even though several studies have been carried out using quite a few different approaches, none of them has shown a clear justification for how to predict autism and dyslexia traits across age groups. This study attempts to build a suitable prediction model enabled by ML technology to predict ASD and dyslexia for people of any age. This work seeks to examine the possible use of Random Forest, SVM with linear kernel, SVM with polynomial kernel, SVM with rbf kernel, SVM with sigmoid kernel, XGBoost, Decision Tree, Logistic Regression, Naïve Bayes, and KNN to forecast and assess ASD and dyslexia difficulties in children, adolescents and adults. Using real data set collected from individuals with and without autistic traits, the proposed model and the AQ-10 screening tool were assessed. The data for dyslexia is made up of 3644 cases with 197 properties, 196 of which are independent variables and one is a dependent variable. The data for autism consists of 704 cases with 22 characteristics, 21 independent variables, and 1 dependent variable with binary values (YES or NO). The results of the research showed that, in terms of accuracy, precision, F1 score, and recall, the recommended prediction model gave better results for the data set.","PeriodicalId":146698,"journal":{"name":"2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)","volume":"152 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133684969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic feedback captions for eye-tracker based online assessment 基于眼动仪的在线评估自动反馈字幕
Pub Date : 2023-02-01 DOI: 10.1109/AICAPS57044.2023.10074484
Ujjwal Kumar, A. J, Chandrika K R
In the era of Automation, there has to be a way to improvise online assessment. This paper has demonstrated a methodology to enhance the online assessment with the help of an Eye Tracker. That has the ability to categorize the behavior of the participant with the state of the art proposed within the paper. Compared with the traditional assessment, here with the help of this model, an automated feedback sentence is returned along with the score. This adds an advantage to the assessor for fair score awarding. The model has utilized an open-source eye tracker tool to capture eye movements during the assessment. Thereafter a customized classification model is used to provide the relevant keywords, out of which the sentence will be generated using a Deep Learning model.
在自动化时代,必须有一种方法来即兴进行在线评估。本文演示了一种在眼动仪的帮助下增强在线评估的方法。它具有将参与者的行为与论文中提出的最新技术进行分类的能力。与传统的评估相比,在此模型的帮助下,自动反馈语句与分数一起返回。这为评审员公平评分增加了一个优势。该模型利用了一个开源的眼动仪工具来捕捉评估过程中的眼球运动。然后使用一个定制的分类模型来提供相关的关键词,然后使用深度学习模型生成句子。
{"title":"Automatic feedback captions for eye-tracker based online assessment","authors":"Ujjwal Kumar, A. J, Chandrika K R","doi":"10.1109/AICAPS57044.2023.10074484","DOIUrl":"https://doi.org/10.1109/AICAPS57044.2023.10074484","url":null,"abstract":"In the era of Automation, there has to be a way to improvise online assessment. This paper has demonstrated a methodology to enhance the online assessment with the help of an Eye Tracker. That has the ability to categorize the behavior of the participant with the state of the art proposed within the paper. Compared with the traditional assessment, here with the help of this model, an automated feedback sentence is returned along with the score. This adds an advantage to the assessor for fair score awarding. The model has utilized an open-source eye tracker tool to capture eye movements during the assessment. Thereafter a customized classification model is used to provide the relevant keywords, out of which the sentence will be generated using a Deep Learning model.","PeriodicalId":146698,"journal":{"name":"2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)","volume":"196 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131880630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Immersive User Experience Quality of StudoBot Telepresence Robots with Reinforcement Learning 强化学习增强StudoBot远程呈现机器人的沉浸式用户体验质量
Pub Date : 2023-02-01 DOI: 10.1109/AICAPS57044.2023.10074544
K. N. Rajanikanth, Mohammed Rehab Sait, Sumukh R Kashi
The pandemic situation (Covid 19) brought new challenges in the education sector while simultaneously presenting unique opportunities for technology enabled services. The use of Mobile Robotic Telepresence systems in educational sector is promising as it provides means to significantly enhance the involvement and benefits to stakeholders involved in such interactions. An immersive user interaction with such a system depends on many aspects which are both static and dynamic. We approach the dynamic aspect of such interactions recognizing that the video and audio aspects of such a system will require fine tuning and adaptation. Closely related is the aspect of maintaining the necessary quality of network connection. Considering each of these aspects a reinforcement learning mechanism is incorporated to improve the overall user experience with such a system. A working system is built and experiments performed to demonstrate the effectiveness of the approach. Reward generation matrix, a crucial piece of data gathering from the environment, takes about 45 minutes, offline training time is less than a second, while the robot is able to cover the workspace in slightly less than a minute. The system is not limited to educational sector alone and provides a foundational framework to extend the concepts and principles to adjacent markets.
新冠肺炎疫情给教育部门带来了新的挑战,同时也为技术服务提供了独特的机遇。在教育部门使用移动机器人远程呈现系统是有希望的,因为它提供了大大提高参与这种互动的利益相关者的参与和利益的手段。与这种系统的沉浸式用户交互取决于许多静态和动态方面。我们接近这种互动的动态方面,认识到这样一个系统的视频和音频方面将需要微调和适应。密切相关的是保持网络连接的必要质量方面。考虑到这些方面,一个强化学习机制被纳入其中,以改善这样一个系统的整体用户体验。建立了一个工作系统,并进行了实验,以证明该方法的有效性。奖励生成矩阵是从环境中收集的关键数据,大约需要45分钟,离线训练时间不到一秒,而机器人能够在不到一分钟的时间内覆盖工作空间。该制度不仅限于教育部门,而且为将概念和原则扩展到邻近市场提供了基础框架。
{"title":"Enhancing Immersive User Experience Quality of StudoBot Telepresence Robots with Reinforcement Learning","authors":"K. N. Rajanikanth, Mohammed Rehab Sait, Sumukh R Kashi","doi":"10.1109/AICAPS57044.2023.10074544","DOIUrl":"https://doi.org/10.1109/AICAPS57044.2023.10074544","url":null,"abstract":"The pandemic situation (Covid 19) brought new challenges in the education sector while simultaneously presenting unique opportunities for technology enabled services. The use of Mobile Robotic Telepresence systems in educational sector is promising as it provides means to significantly enhance the involvement and benefits to stakeholders involved in such interactions. An immersive user interaction with such a system depends on many aspects which are both static and dynamic. We approach the dynamic aspect of such interactions recognizing that the video and audio aspects of such a system will require fine tuning and adaptation. Closely related is the aspect of maintaining the necessary quality of network connection. Considering each of these aspects a reinforcement learning mechanism is incorporated to improve the overall user experience with such a system. A working system is built and experiments performed to demonstrate the effectiveness of the approach. Reward generation matrix, a crucial piece of data gathering from the environment, takes about 45 minutes, offline training time is less than a second, while the robot is able to cover the workspace in slightly less than a minute. The system is not limited to educational sector alone and provides a foundational framework to extend the concepts and principles to adjacent markets.","PeriodicalId":146698,"journal":{"name":"2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122021019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical Modelling of Massive MIMO Channel at FR2 Frequency Bands for B5G Networks B5G网络FR2频段大规模MIMO信道的统计建模
Pub Date : 2023-02-01 DOI: 10.1109/AICAPS57044.2023.10074158
Sharini D L, Ravilla Dilli, K. M
The rapid growth of extreme data traffic and over occupied networks are the current challenges in the wireless communication networks. However, next generation networks tackle them by utilizing vast frequency bands. These are FR2/FR1 frequency bands that play an important role in next-generation technologies, namely B5G or 6G. mmWave being one of those technologies, also utilize such frequency bands. Nevertheless, massive antennas and propagation characteristics of the channel model describes its behavior and some crucial aspects that need to be considered when operating on FR2 frequency bands for 5G NR networks. This paper analyzes the behavior of a mmWave massive MIMO channel at FR1 and FR2 frequency bands under for specific atmospheric conditions. The channel behavior was simulated using NYUSIM software and the performance metrics include path loss, received power, and path loss exponent of the channel power delay profile.
极端数据流量的快速增长和网络的过度占用是当前无线通信网络面临的挑战。然而,下一代网络通过利用广阔的频带来解决这些问题。这些是FR2/FR1频段,在下一代技术,即B5G或6G中发挥重要作用。毫米波是这些技术之一,也利用这样的频段。尽管如此,信道模型的大量天线和传播特性描述了其行为以及在5G NR网络的FR2频段上运行时需要考虑的一些关键方面。本文分析了在特定大气条件下,毫米波海量MIMO信道在FR1和FR2频段的性能。使用NYUSIM软件对信道行为进行了模拟,性能指标包括信道功率延迟配置文件的路径损耗、接收功率和路径损耗指数。
{"title":"Statistical Modelling of Massive MIMO Channel at FR2 Frequency Bands for B5G Networks","authors":"Sharini D L, Ravilla Dilli, K. M","doi":"10.1109/AICAPS57044.2023.10074158","DOIUrl":"https://doi.org/10.1109/AICAPS57044.2023.10074158","url":null,"abstract":"The rapid growth of extreme data traffic and over occupied networks are the current challenges in the wireless communication networks. However, next generation networks tackle them by utilizing vast frequency bands. These are FR2/FR1 frequency bands that play an important role in next-generation technologies, namely B5G or 6G. mmWave being one of those technologies, also utilize such frequency bands. Nevertheless, massive antennas and propagation characteristics of the channel model describes its behavior and some crucial aspects that need to be considered when operating on FR2 frequency bands for 5G NR networks. This paper analyzes the behavior of a mmWave massive MIMO channel at FR1 and FR2 frequency bands under for specific atmospheric conditions. The channel behavior was simulated using NYUSIM software and the performance metrics include path loss, received power, and path loss exponent of the channel power delay profile.","PeriodicalId":146698,"journal":{"name":"2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)","volume":"403 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126698195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Semi-Supervised GAN Architecture for Video Classification 一种用于视频分类的半监督GAN结构
Pub Date : 2023-02-01 DOI: 10.1109/AICAPS57044.2023.10074051
P. Ghadekar, Dhruva Khanwelkar, Nirvisha Soni, Harsh More, Juhi Rajani, Chirag Vaswani
In recent years, several supervised deep-learning architectures have achieved state-of-the art accuracies in video-classification. However, they demand a considerable amount of annotated data which can be both cost and resource intensive. This study proposes a Semi-Supervised GAN architecture to efficiently perform classification on video datasets with a small percentage of labelled data. While the Generative Adversarial Network (GAN) architecture is known for its generative ability, we harness the discriminative property of this network instead for the classification of videos. The proposed model leverages the features extracted from the unlabelled data to classify the labelled videos. Results show that the proposed approach achieves 46% accuracy with just 5% labelled videos, reaching up to 62% when 50% of the videos are labelled. These results are a significant improvement over a standard supervised approach and show a promising aspect in the field of Semi-Supervised Learning domain.
近年来,一些有监督的深度学习架构已经在视频分类中达到了最先进的精度。然而,它们需要大量带注释的数据,这可能是成本和资源密集型的。本研究提出了一种半监督GAN架构,以有效地对具有小比例标记数据的视频数据集进行分类。虽然生成对抗网络(GAN)架构以其生成能力而闻名,但我们利用该网络的判别特性来对视频进行分类。该模型利用从未标记数据中提取的特征对标记的视频进行分类。结果表明,该方法在标记5%的视频时达到46%的准确率,在标记50%的视频时达到62%的准确率。这些结果是对标准监督方法的重大改进,显示了半监督学习领域的一个有前途的方面。
{"title":"A Semi-Supervised GAN Architecture for Video Classification","authors":"P. Ghadekar, Dhruva Khanwelkar, Nirvisha Soni, Harsh More, Juhi Rajani, Chirag Vaswani","doi":"10.1109/AICAPS57044.2023.10074051","DOIUrl":"https://doi.org/10.1109/AICAPS57044.2023.10074051","url":null,"abstract":"In recent years, several supervised deep-learning architectures have achieved state-of-the art accuracies in video-classification. However, they demand a considerable amount of annotated data which can be both cost and resource intensive. This study proposes a Semi-Supervised GAN architecture to efficiently perform classification on video datasets with a small percentage of labelled data. While the Generative Adversarial Network (GAN) architecture is known for its generative ability, we harness the discriminative property of this network instead for the classification of videos. The proposed model leverages the features extracted from the unlabelled data to classify the labelled videos. Results show that the proposed approach achieves 46% accuracy with just 5% labelled videos, reaching up to 62% when 50% of the videos are labelled. These results are a significant improvement over a standard supervised approach and show a promising aspect in the field of Semi-Supervised Learning domain.","PeriodicalId":146698,"journal":{"name":"2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)","volume":"117 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116198640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Labeled Hands in Wild 野外的手
Pub Date : 2023-02-01 DOI: 10.1109/AICAPS57044.2023.10074180
Aashish Kalra, Aishwarya Salunke, Pooja Majali, Preeti Bhandiwad, Kavita Chachadi, S. Kamath, Sandeep Jana, Rajas Joshi
Hand pose estimation has been playing a major role in many applications such as in Augmented/Virtual reality that is the human-computer interaction and gesture recognition. Among the existing hand datasets, some of them are synthetically generated which do not provide information about the background considering the various lighting conditions where the hand skin tone information would be lost.Hence, the proposed Labelled Hand Dataset in the Wild provides this additional information and also solves a major problem ie occlusion. Since manually annotating a large dataset is a tedious task,hence we propose a novel approach to automate the generation of large dataset using the triangulation method which is also known as multiview Annotation.In this approach two best frames are labelled with the 2D points(21 keypoints) which are then triangulated in 3D space using multiview geometry with the use of fiducial markers.These triangulated points in a 3D space are reprojected onto all other images of a particular pose and this process is repeated for all the other poses thus automating the generation of large labeled dataset in wild.
手部姿态估计已经在许多应用中发挥了重要作用,例如增强/虚拟现实,即人机交互和手势识别。在现有的手部数据集中,有些数据集是综合生成的,考虑到各种光照条件会丢失手部肤色信息,因此无法提供背景信息。因此,在野外提出的标记手数据集提供了这些额外的信息,也解决了一个主要的问题,即遮挡。由于手动标注大型数据集是一项繁琐的任务,因此我们提出了一种新的方法来自动生成大型数据集的三角剖分方法,也称为多视图标注。在这种方法中,两个最佳帧被标记为2D点(21个关键点),然后使用基准标记在3D空间中使用多视图几何对其进行三角化。3D空间中的这些三角点被重新投影到特定姿势的所有其他图像上,并且对所有其他姿势重复此过程,从而自动生成大型标记数据集。
{"title":"Labeled Hands in Wild","authors":"Aashish Kalra, Aishwarya Salunke, Pooja Majali, Preeti Bhandiwad, Kavita Chachadi, S. Kamath, Sandeep Jana, Rajas Joshi","doi":"10.1109/AICAPS57044.2023.10074180","DOIUrl":"https://doi.org/10.1109/AICAPS57044.2023.10074180","url":null,"abstract":"Hand pose estimation has been playing a major role in many applications such as in Augmented/Virtual reality that is the human-computer interaction and gesture recognition. Among the existing hand datasets, some of them are synthetically generated which do not provide information about the background considering the various lighting conditions where the hand skin tone information would be lost.Hence, the proposed Labelled Hand Dataset in the Wild provides this additional information and also solves a major problem ie occlusion. Since manually annotating a large dataset is a tedious task,hence we propose a novel approach to automate the generation of large dataset using the triangulation method which is also known as multiview Annotation.In this approach two best frames are labelled with the 2D points(21 keypoints) which are then triangulated in 3D space using multiview geometry with the use of fiducial markers.These triangulated points in a 3D space are reprojected onto all other images of a particular pose and this process is repeated for all the other poses thus automating the generation of large labeled dataset in wild.","PeriodicalId":146698,"journal":{"name":"2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130011886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ChainHire: A Privacy-Preserving and Transparent Job Search Portal Using an Enterprise-Level Permissioned Blockchain Framework ChainHire:一个使用企业级许可区块链框架的隐私保护和透明的求职门户
Pub Date : 2023-02-01 DOI: 10.1109/AICAPS57044.2023.10074582
Satyajit Ghosh, Rakibul Islam, A. Jaman, Aratrika Bose, Abhishek Roy
Finding a suitable employment position is a crucial and difficult task. Nowadays, Job seekers may instantly apply for multiple job openings using job search platforms. Personal information has to be exchanged as part of every job application. Sharing personal information across unsafe channels, however, increases the risk of data theft. Apart from that, attempts to tamper with hiring data are unfortunately common. Previously, traditional databases were only used for job search platforms, which do not provide sufficient transparency or protection against tampering. The incorporation of blockchain technology in job search portals can address these issues and provide a more transparent and secure process. Our proposed solution uses Hyperledger Fabric (HLF), an open-source blockchain frame-work, to create a secure and transparent job search platform. In this platform, both recruiters and job seekers can participate in a permissioned network, where smart contracts are used to ensure a transparent and privacy-friendly hiring process. To demonstrate the feasibility of this solution, we have implemented and deployed a prototype using Amazon Managed Blockchain Service. To understand the optimal configuration for our system, we tested the performance of our network using the Hyperledger Caliper tool. Although further research is necessary to fully understand the capabilities and limitations of using blockchain technology in job search portals.
找到一份合适的工作是一项关键而艰巨的任务。如今,求职者可以在求职平台上立即申请多个职位空缺。每一份工作申请都必须交换个人信息。然而,通过不安全的渠道共享个人信息会增加数据被盗的风险。除此之外,不幸的是,试图篡改招聘数据的行为很常见。以前,传统的数据库只用于求职平台,这没有提供足够的透明度或防止篡改的保护。将区块链技术纳入求职门户可以解决这些问题,并提供更透明和安全的流程。我们提出的解决方案使用开源区块链框架Hyperledger Fabric (HLF)来创建一个安全透明的求职平台。在这个平台上,招聘人员和求职者都可以参与到一个允许的网络中,在这个网络中,智能合约被用来确保透明和隐私友好的招聘过程。为了证明该解决方案的可行性,我们使用Amazon Managed Blockchain Service实现并部署了一个原型。为了了解系统的最佳配置,我们使用Hyperledger Caliper工具测试了网络的性能。虽然需要进一步的研究来充分了解在求职门户中使用区块链技术的能力和局限性。
{"title":"ChainHire: A Privacy-Preserving and Transparent Job Search Portal Using an Enterprise-Level Permissioned Blockchain Framework","authors":"Satyajit Ghosh, Rakibul Islam, A. Jaman, Aratrika Bose, Abhishek Roy","doi":"10.1109/AICAPS57044.2023.10074582","DOIUrl":"https://doi.org/10.1109/AICAPS57044.2023.10074582","url":null,"abstract":"Finding a suitable employment position is a crucial and difficult task. Nowadays, Job seekers may instantly apply for multiple job openings using job search platforms. Personal information has to be exchanged as part of every job application. Sharing personal information across unsafe channels, however, increases the risk of data theft. Apart from that, attempts to tamper with hiring data are unfortunately common. Previously, traditional databases were only used for job search platforms, which do not provide sufficient transparency or protection against tampering. The incorporation of blockchain technology in job search portals can address these issues and provide a more transparent and secure process. Our proposed solution uses Hyperledger Fabric (HLF), an open-source blockchain frame-work, to create a secure and transparent job search platform. In this platform, both recruiters and job seekers can participate in a permissioned network, where smart contracts are used to ensure a transparent and privacy-friendly hiring process. To demonstrate the feasibility of this solution, we have implemented and deployed a prototype using Amazon Managed Blockchain Service. To understand the optimal configuration for our system, we tested the performance of our network using the Hyperledger Caliper tool. Although further research is necessary to fully understand the capabilities and limitations of using blockchain technology in job search portals.","PeriodicalId":146698,"journal":{"name":"2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130062128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI Powered Screening Aid for Dyslexic Children in Tamil 泰米尔语阅读障碍儿童的人工智能筛查援助
Pub Date : 2023-02-01 DOI: 10.1109/AICAPS57044.2023.10074214
K. Banumathi, G. Sudhasadasivam, B. Banurekha, N. Shahana, K. Vaishnavi
Dyslexia is a learning disorder characterized with a difficulty in learning due to impairment of the left hemisphere of the brain associated with processing. In India, the incidence of dyslexia is believed to be around 15%. Literacy is the foundation of all learning and hence, identifying dyslexia at an early age is critical. The appropriate age to identify dyslexia is between 5 and 8 years since early detection can nurture corrective measures. Identifying dyslexia at this age will also prevent future school drop outs. The major challenge in detecting dyslexia among children at an early age includes lack of awareness among school teachers and parents. The absence of simple standardized screening and assessment tools in regional languages makes the task more difficult. A Smartphone application to screen dyslexia offers the advantages of universal use and standardization. However, primary school educators feel the need to have screening apps in regional languages (like Tamil), as most of the communication in local population is using regional language. In this research work, a Smartphone based screening application for dyslexia is developed in Tamil. The app consists of questions that cover the areas of general qualities, health & personality, unique talents, speech & hearing skills, visual acuity& reading skills, writing & motor skills, mathematical ability and memory & cognitive skills. The app was created utilizing the expertise of special educators and following regional practices. The app generates visualizations and provides scoring on the severity level of dyslexia for the user. The report can be downloaded and printed. Then the dataset is subjected to training with five different machine learning algorithms and results are compared by their report of classification and error rates.
阅读障碍是一种学习障碍,其特征是由于与处理相关的大脑左半球受损而导致学习困难。在印度,阅读障碍的发病率约为15%。识字是所有学习的基础,因此,早期识别阅读障碍是至关重要的。识别阅读障碍的合适年龄是在5到8岁之间,因为早期发现可以培养纠正措施。在这个年龄段发现阅读障碍也将防止未来的辍学。在早期发现儿童阅读障碍的主要挑战包括学校教师和家长缺乏意识。由于缺乏以区域语言编写的简单的标准化筛选和评估工具,使得这项任务更加困难。一个智能手机应用程序,以筛选阅读障碍提供通用和标准化的优势。然而,小学教育工作者认为有必要使用地区语言(如泰米尔语)筛选应用程序,因为当地人口的大多数交流都使用地区语言。在这项研究工作中,开发了基于智能手机的泰米尔语阅读障碍筛查应用程序。该应用程序包含的问题涵盖了一般素质、健康与个性、独特才能、言语和听力技能、视觉灵敏度和阅读技能、写作和运动技能、数学能力以及记忆和认知技能等领域。该应用程序是利用特殊教育工作者的专业知识并遵循地区惯例创建的。该应用程序生成可视化,并为用户提供阅读障碍严重程度的评分。报告可下载及列印。然后用五种不同的机器学习算法对数据集进行训练,并通过它们的分类和错误率报告对结果进行比较。
{"title":"AI Powered Screening Aid for Dyslexic Children in Tamil","authors":"K. Banumathi, G. Sudhasadasivam, B. Banurekha, N. Shahana, K. Vaishnavi","doi":"10.1109/AICAPS57044.2023.10074214","DOIUrl":"https://doi.org/10.1109/AICAPS57044.2023.10074214","url":null,"abstract":"Dyslexia is a learning disorder characterized with a difficulty in learning due to impairment of the left hemisphere of the brain associated with processing. In India, the incidence of dyslexia is believed to be around 15%. Literacy is the foundation of all learning and hence, identifying dyslexia at an early age is critical. The appropriate age to identify dyslexia is between 5 and 8 years since early detection can nurture corrective measures. Identifying dyslexia at this age will also prevent future school drop outs. The major challenge in detecting dyslexia among children at an early age includes lack of awareness among school teachers and parents. The absence of simple standardized screening and assessment tools in regional languages makes the task more difficult. A Smartphone application to screen dyslexia offers the advantages of universal use and standardization. However, primary school educators feel the need to have screening apps in regional languages (like Tamil), as most of the communication in local population is using regional language. In this research work, a Smartphone based screening application for dyslexia is developed in Tamil. The app consists of questions that cover the areas of general qualities, health & personality, unique talents, speech & hearing skills, visual acuity& reading skills, writing & motor skills, mathematical ability and memory & cognitive skills. The app was created utilizing the expertise of special educators and following regional practices. The app generates visualizations and provides scoring on the severity level of dyslexia for the user. The report can be downloaded and printed. Then the dataset is subjected to training with five different machine learning algorithms and results are compared by their report of classification and error rates.","PeriodicalId":146698,"journal":{"name":"2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122471483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simplified Micropayment Mechanism to Eliminate the Risk of Double Payment in E-Commerce 简化小额支付机制消除电子商务中重复支付的风险
Pub Date : 2023-02-01 DOI: 10.1109/AICAPS57044.2023.10074490
Narendra Kumar Chahar, Krishan Pal Singh, M. Hussain
In recent years, the emergence of the Internet and E-commerce has steered significant growth in digital transactions. Businesses today need mobile wallets, credit and debit cards, and e-cash to digitize payments. Digital payment systems are in transition and promise amazing advancements, but they also pose many risks and as the number of online transactions is increasing tremendously, we need a security system that follows all security norms. In this paper, we study digital transaction systems and evaluate various components of E-commerce plat-forms to address the security of these services. We evaluate the attributes that affect the security of digital payment processes and identify several barriers that hinder their performance to propose a simplified payment mechanism for micro-payments that eliminates the double payment problem.
近年来,互联网和电子商务的出现带动了数字交易的显著增长。今天的企业需要移动钱包、信用卡和借记卡以及电子现金来实现数字化支付。数字支付系统正处于转型阶段,有望取得惊人的进步,但它们也带来了许多风险,随着在线交易数量的急剧增加,我们需要一个遵循所有安全规范的安全系统。在本文中,我们研究了数字交易系统,并评估了电子商务平台的各个组成部分,以解决这些服务的安全性。我们评估了影响数字支付过程安全性的属性,并确定了阻碍其性能的几个障碍,从而提出了一种简化的微支付支付机制,消除了双重支付问题。
{"title":"Simplified Micropayment Mechanism to Eliminate the Risk of Double Payment in E-Commerce","authors":"Narendra Kumar Chahar, Krishan Pal Singh, M. Hussain","doi":"10.1109/AICAPS57044.2023.10074490","DOIUrl":"https://doi.org/10.1109/AICAPS57044.2023.10074490","url":null,"abstract":"In recent years, the emergence of the Internet and E-commerce has steered significant growth in digital transactions. Businesses today need mobile wallets, credit and debit cards, and e-cash to digitize payments. Digital payment systems are in transition and promise amazing advancements, but they also pose many risks and as the number of online transactions is increasing tremendously, we need a security system that follows all security norms. In this paper, we study digital transaction systems and evaluate various components of E-commerce plat-forms to address the security of these services. We evaluate the attributes that affect the security of digital payment processes and identify several barriers that hinder their performance to propose a simplified payment mechanism for micro-payments that eliminates the double payment problem.","PeriodicalId":146698,"journal":{"name":"2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121446423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1