IAES International Journal of Artificial Intelligence (IJ-AI)最新文献

英文中文

Design of a novel deep network model for spinal cord injury prediction 设计用于脊髓损伤预测的新型深度网络模型

IAES International Journal of Artificial Intelligence (IJ-AI)

Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp2131-2142

P. R. S. S. Venkatapathi Raju, Valayapathy Asanambigai, Suresh Babu Mudunuri

Degenerative cervical myelopathy must be diagnosed with magnetic resonance imaging (MRI) which predicts spinal cord injury (SCI). The growing volume of medical imaging data can be managed by deep learning models, which provide a preliminary interpretation of images taken in basic care settings. Our main goal was to create a deep-learning approach that could identify SCI using MRI data. This work concentrates on modeling a novel 2D-convolutional neural networks (2D-CNN) approach for predicting SCI. For holdouts, training, and validation, various datasets of patients were created. Two experts assigned labels to the images. The holdout dataset was used to evaluate the performance of our deep convolutional neural network (DCNN) over the image data from the available dataset. The dataset is acquired from the online resource for training and validation purpose. With the available dataset, the anticipated model attains 94% AUC, 0.1 p-value, and 92.2% accuracy. The anticipated model might make cervical spine MRI scan interpretation more accurate and reliable.

退行性颈椎病必须通过磁共振成像（MRI）进行诊断，而磁共振成像可预测脊髓损伤（SCI）。深度学习模型可以管理不断增长的医学影像数据量，并对基础护理环境中拍摄的图像进行初步解读。我们的主要目标是创建一种能利用核磁共振成像数据识别 SCI 的深度学习方法。这项工作的重点是为预测 SCI 的新型二维卷积神经网络（2D-CNN）建模。为进行保留、训练和验证，创建了各种患者数据集。两位专家为图像分配标签。保留数据集用于评估我们的深度卷积神经网络（DCNN）在可用数据集中的图像数据上的性能。该数据集是从在线资源中获取的，用于训练和验证目的。利用现有数据集，预期模型的 AUC 为 94%，P 值为 0.1，准确率为 92.2%。预期模型可使颈椎磁共振成像扫描判读更准确、更可靠。

引用次数: 0

Determining community happiness index with transformers and attention-based deep learning 利用变换器和基于注意力的深度学习确定社区幸福指数

IAES International Journal of Artificial Intelligence (IJ-AI)

Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1753-1761

Hilman Singgih Wicaksana, Retno Kusumaningrum, R. Gernowo

In the current digital era, evaluating the quality of people's lives and their happiness index is closely related to their expressions and opinions on Twitter social media. Measuring population welfare goes beyond monetary aspects, focusing more on subjective well-being, and sentiment analysis helps evaluate people's perceptions of happiness aspects. Aspect-based sentiment analysis (ABSA) effectively identifies sentiments on predetermined aspects. The previous study has used Word-to-Vector (Word2Vec) and long short-term memory (LSTM) methods with or without attention mechanism (AM) to solve ABSA cases. However, the problem with the previous study is that Word2Vec has the disadvantage of being unable to handle the context of words in a sentence. Therefore, this study will address the problem with bidirectional encoder representations from transformers (BERT), which has the advantage of performing bidirectional training. Bayesian optimization as a hyperparameter tuning technique is used to find the best combination of parameters during the training process. Here we show that BERT-LSTM-AM outperforms the Word2Vec-LSTM-AM model in predicting aspect and sentiment. Furthermore, we found that BERT is the best state-of-the-art embedding technique for representing words in a sentence. Our results demonstrate how BERT as an embedding technique can significantly improve the model performance over Word2Vec.

在当前的数字时代，评价人们的生活质量和幸福指数与他们在推特社交媒体上的表达和意见密切相关。衡量民众福利的标准已超越了货币层面，而是更加关注主观幸福感，而情感分析有助于评估人们对幸福感的看法。基于方面的情感分析（ABSA）能有效识别预先设定方面的情感。以往的研究采用了有或无注意力机制（AM）的词到矢量（Word2Vec）和长短期记忆（LSTM）方法来解决 ABSA 案例。然而，以往研究的问题在于 Word2Vec 的缺点是无法处理句子中单词的上下文。因此，本研究将利用来自变换器的双向编码器表征（BERT）来解决这一问题，它的优点是可以进行双向训练。在训练过程中，贝叶斯优化作为一种超参数调整技术被用来寻找最佳参数组合。我们在此表明，BERT-LSTM-AM 在预测方面和情感方面优于 Word2Vec-LSTM-AM。此外，我们还发现 BERT 是最先进的嵌入技术，可用于表示句子中的单词。我们的研究结果表明，与 Word2Vec 相比，BERT 作为一种嵌入技术可以显著提高模型的性能。

{"title":"Determining community happiness index with transformers and attention-based deep learning","authors":"Hilman Singgih Wicaksana, Retno Kusumaningrum, R. Gernowo","doi":"10.11591/ijai.v13.i2.pp1753-1761","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp1753-1761","url":null,"abstract":"In the current digital era, evaluating the quality of people's lives and their happiness index is closely related to their expressions and opinions on Twitter social media. Measuring population welfare goes beyond monetary aspects, focusing more on subjective well-being, and sentiment analysis helps evaluate people's perceptions of happiness aspects. Aspect-based sentiment analysis (ABSA) effectively identifies sentiments on predetermined aspects. The previous study has used Word-to-Vector (Word2Vec) and long short-term memory (LSTM) methods with or without attention mechanism (AM) to solve ABSA cases. However, the problem with the previous study is that Word2Vec has the disadvantage of being unable to handle the context of words in a sentence. Therefore, this study will address the problem with bidirectional encoder representations from transformers (BERT), which has the advantage of performing bidirectional training. Bayesian optimization as a hyperparameter tuning technique is used to find the best combination of parameters during the training process. Here we show that BERT-LSTM-AM outperforms the Word2Vec-LSTM-AM model in predicting aspect and sentiment. Furthermore, we found that BERT is the best state-of-the-art embedding technique for representing words in a sentence. Our results demonstrate how BERT as an embedding technique can significantly improve the model performance over Word2Vec.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":"4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141229084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Scaling effectivity in manifold methodologies to detect driver’s fatigueness and drowsiness state 检测驾驶员疲劳和昏昏欲睡状态的多种方法的缩放效应

IAES International Journal of Artificial Intelligence (IJ-AI)

Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1227-1240

Gowrishankar Shiva Shankara Chari, Jyothi Arcot Prashant

The state of fatigueness and drowsiness relates to the stressed physical and mental condition of a driver that reduces the ability of a driver to drive safely leading to fatal consequences of road accidents. With a rising concerns about the road safety, the premium and modern vehicles are coming up with a sophisticated technology to detect and rise alarm during the positive case of fatigueness and drowsiness. Irrespective of availability of archives of literatures towards solving this problem, it is quite unclear about the strength and weakness of varied methodologies. Therefore, this paper presents a crisp and insightful discussion about the recent methodologies associated with detecting driver's attention, fatigueness, drowsiness along with highlights of commercial devices to realize various limiting factors and constraints associated with them. The paper contributes to introduce a well-structured flow of research trend to realize various patterns of current trend adopted towards solving varied problems and significant research gaps have been identified in this process. The outcome of this paper presents that still there is an open scope of an improvement towards accomplishing the agenda towards safer driving.

疲劳和嗜睡状态与驾驶员紧张的身体和精神状态有关，这种状态会降低驾驶员的安全驾驶能力，从而导致道路交通事故的致命后果。随着人们对道路安全的日益关注，高级和现代车辆都配备了先进的技术，可在疲劳和瞌睡的积极情况下检测并发出警报。尽管为解决这一问题已有大量文献，但对各种方法的优缺点还很不清楚。因此，本文对最近与检测驾驶员注意力、疲劳和嗜睡相关的方法进行了深入浅出的讨论，并重点介绍了商业设备，以实现与之相关的各种限制因素和制约因素。本文介绍了结构合理的研究趋势流程，以实现当前为解决各种问题而采用的各种模式，并在此过程中发现了重大的研究差距。本文的研究结果表明，在实现更安全驾驶的议程方面仍有改进的余地。

{"title":"Scaling effectivity in manifold methodologies to detect driver’s fatigueness and drowsiness state","authors":"Gowrishankar Shiva Shankara Chari, Jyothi Arcot Prashant","doi":"10.11591/ijai.v13.i2.pp1227-1240","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp1227-1240","url":null,"abstract":"The state of fatigueness and drowsiness relates to the stressed physical and mental condition of a driver that reduces the ability of a driver to drive safely leading to fatal consequences of road accidents. With a rising concerns about the road safety, the premium and modern vehicles are coming up with a sophisticated technology to detect and rise alarm during the positive case of fatigueness and drowsiness. Irrespective of availability of archives of literatures towards solving this problem, it is quite unclear about the strength and weakness of varied methodologies. Therefore, this paper presents a crisp and insightful discussion about the recent methodologies associated with detecting driver's attention, fatigueness, drowsiness along with highlights of commercial devices to realize various limiting factors and constraints associated with them. The paper contributes to introduce a well-structured flow of research trend to realize various patterns of current trend adopted towards solving varied problems and significant research gaps have been identified in this process. The outcome of this paper presents that still there is an open scope of an improvement towards accomplishing the agenda towards safer driving.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":"59 36","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141231939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

FinTech forecasting using an evolving connectionist system for lenders and borrowers: ecosystem behavior 利用不断发展的出借人和借款人关联系统预测金融科技：生态系统行为

IAES International Journal of Artificial Intelligence (IJ-AI)

Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp2386-2394

A. Al-Khowarizmi, Michael J. Watts, Syahril Efendi, Anton Abdulbasah Kamil

Financial Technology (FinTech) which is included in the development of digitalization in the financial sector in the industrial era 4.0. Fintech can make any transactions anywhere with the pillars of Peet-to-Peer (P2P) lending, merchants and crowdfunding. In the P2P Lending pillar, there are borrowers and lenders who are digitized in Fintech devices. Fintech in Indonesia is controlled by a state agency called the Otoritas Jasa Keuangan or Financial Services Authority (OJK). In the movement of P2P Lending, there are borrowers and lenders who can be said to be investors where these activities are reported to the OJK. This data can be forecasted using a neural network approach such as ECoS, which is a method capable of forecasting with learning that develops in the hidden layer. In this research article, we present results on forecasting borrowers with a Mean Absolute Percentage Error (MAPE) of 0.148% and forecasting lenders with an accuracy measurement with MAPE of 0.209% with a learning rate 1 = 0.6 and a learning rate 2 = 0.3. So, this forecasting model can be said as an optimization in FinTech activities on the behavior of borrowers and lenders.

金融科技（FinTech）是工业 4.0 时代金融领域数字化发展的一部分。金融科技通过点对点借贷（P2P）、商户和众筹等支柱，可以在任何地方进行任何交易。在P2P借贷支柱中，借方和贷方都在金融科技设备中实现了数字化。印尼的金融科技由一个名为 "Otoritas Jasa Keuangan "或 "金融服务管理局（OJK）"的国家机构控制。在 P2P 借贷活动中，借款人和贷款人可以说是投资者，这些活动都要向 OJK 报告。可以使用 ECoS 等神经网络方法对这些数据进行预测。在这篇研究文章中，我们介绍了在学习率 1 = 0.6 和学习率 2 = 0.3 的情况下，预测借款人的平均绝对百分比误差（MAPE）为 0.148%，预测贷款人的准确度测量（MAPE 为 0.209%）的结果。因此，该预测模型可以说是金融科技活动中对借款人和贷款人行为的优化。

{"title":"FinTech forecasting using an evolving connectionist system for lenders and borrowers: ecosystem behavior","authors":"A. Al-Khowarizmi, Michael J. Watts, Syahril Efendi, Anton Abdulbasah Kamil","doi":"10.11591/ijai.v13.i2.pp2386-2394","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp2386-2394","url":null,"abstract":"Financial Technology (FinTech) which is included in the development of digitalization in the financial sector in the industrial era 4.0. Fintech can make any transactions anywhere with the pillars of Peet-to-Peer (P2P) lending, merchants and crowdfunding. In the P2P Lending pillar, there are borrowers and lenders who are digitized in Fintech devices. Fintech in Indonesia is controlled by a state agency called the Otoritas Jasa Keuangan or Financial Services Authority (OJK). In the movement of P2P Lending, there are borrowers and lenders who can be said to be investors where these activities are reported to the OJK. This data can be forecasted using a neural network approach such as ECoS, which is a method capable of forecasting with learning that develops in the hidden layer. In this research article, we present results on forecasting borrowers with a Mean Absolute Percentage Error (MAPE) of 0.148% and forecasting lenders with an accuracy measurement with MAPE of 0.209% with a learning rate 1 = 0.6 and a learning rate 2 = 0.3. So, this forecasting model can be said as an optimization in FinTech activities on the behavior of borrowers and lenders.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":"9 14","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141228757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing aerial image registration: outlier filtering through feature classification 增强航空图像配准：通过特征分类过滤离群点

IAES International Journal of Artificial Intelligence (IJ-AI)

Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1900-1912

Hayder Mosa Merza, Ihab Sbeity, M. Dbouk, Z. Ibrahim

In the context of feature-based image registration, the crucial task of outlier removal plays a pivotal role in achieving precise registration accuracy. This research introduces an innovative binary classifier founded on an adaptive approach for effectively identifying and eliminating outliers. The methodology begins with the utilization of the scale invariant feature transform (SIFT) to extract features from two images, initially matched using the Euclidian distance metrics. Subsequently, a classification procedure is executed to segregate the feature points into two categories: genuine matches (inliers) and spurious matches (outliers), which is accomplished through the brute-force matcher (BFM) technique. To enhance this process further, a novel classifier rooted in the random forest algorithm is introduced. This classifier is trained and tested using a comprehensive dataset curated for this study. The newly proposed classifier plays a pivotal role in attenuating the influence of outliers, ultimately leading to refined image registration process characterized by enhanced accuracy. The effectiveness of this outlier removal approach is assessed through a meticulous analysis of positional and classification accuracy. Additionally, we offer comparative insights by evaluating the performance of selected algorithm on our dataset.

在基于特征的图像配准中，去除离群值是实现精确配准的关键任务。本研究介绍了一种创新的二元分类器，它建立在有效识别和消除异常值的自适应方法之上。该方法首先利用尺度不变特征变换（SIFT）从两幅图像中提取特征，最初使用欧几里得距离指标进行匹配。随后，执行分类程序，将特征点分为两类：真正匹配（离群值）和虚假匹配（离群值）。为了进一步加强这一过程，我们引入了一种植根于随机森林算法的新型分类器。该分类器使用为本研究策划的综合数据集进行训练和测试。新提出的分类器在减弱离群值的影响方面发挥了关键作用，最终实现了以提高准确性为特征的精细图像配准过程。通过对定位和分类准确性的细致分析，我们评估了这种异常值去除方法的有效性。此外，我们还通过评估所选算法在数据集上的性能，提供了比较见解。

{"title":"Enhancing aerial image registration: outlier filtering through feature classification","authors":"Hayder Mosa Merza, Ihab Sbeity, M. Dbouk, Z. Ibrahim","doi":"10.11591/ijai.v13.i2.pp1900-1912","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp1900-1912","url":null,"abstract":"In the context of feature-based image registration, the crucial task of outlier removal plays a pivotal role in achieving precise registration accuracy. This research introduces an innovative binary classifier founded on an adaptive approach for effectively identifying and eliminating outliers. The methodology begins with the utilization of the scale invariant feature transform (SIFT) to extract features from two images, initially matched using the Euclidian distance metrics. Subsequently, a classification procedure is executed to segregate the feature points into two categories: genuine matches (inliers) and spurious matches (outliers), which is accomplished through the brute-force matcher (BFM) technique. To enhance this process further, a novel classifier rooted in the random forest algorithm is introduced. This classifier is trained and tested using a comprehensive dataset curated for this study. The newly proposed classifier plays a pivotal role in attenuating the influence of outliers, ultimately leading to refined image registration process characterized by enhanced accuracy. The effectiveness of this outlier removal approach is assessed through a meticulous analysis of positional and classification accuracy. Additionally, we offer comparative insights by evaluating the performance of selected algorithm on our dataset.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":"95 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141234548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A survey on planet leaf disease identification and classification by various machine-learning technique 利用各种机器学习技术识别和分类行星叶病的研究

IAES International Journal of Artificial Intelligence (IJ-AI)

Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1187-1194

Premakumari Pujar, Ashutosh Kumar, Vineet Kumar

An overview of methods for identifying plants diseases is given in this article. Each sample is categorized by being divided into various groups. The approach of classification involves identifying healthy and diseased leaves based on morphological traits including texture, color, shape, and pattern, among others. Sorting and categorizing plants can be challenging, especially when doing so across a large area, due to the closeness of their visual qualities. There are several methods based on computer vision and image processing. Selecting the right categorization method can be difficult because the outcomes rely on the data you supply. There are several applications for the categorization of plant leaf diseases in fields like agriculture and biological research. This article gives a summary of several approaches currently in use for identifying and categorizing leaf diseases, as well as their benefits and drawbacks. These approaches include preprocessing methods, feature extraction and selection methods, datasets employed, classifiers, and performance metrics

本文概述了识别植物病害的方法。每个样本都会被分为不同的组别。分类方法包括根据纹理、颜色、形状和图案等形态特征识别健康叶片和病叶。由于植物的视觉特质非常接近，因此对植物进行分类和归类具有一定的挑战性，尤其是在对大面积植物进行分类和归类时。目前有几种基于计算机视觉和图像处理的方法。选择正确的分类方法可能很困难，因为结果取决于您提供的数据。植物叶片病害分类在农业和生物研究等领域有多种应用。本文总结了目前用于识别和分类叶病的几种方法，以及它们的优点和缺点。这些方法包括预处理方法、特征提取和选择方法、使用的数据集、分类器和性能指标。

引用次数: 0

Effect of dataset distribution on automatic road extraction in very high-resolution orthophoto using DeepLab V3+ 数据集分布对使用 DeepLab V3+ 在超高分辨率正射影像图中自动提取道路的影响

IAES International Journal of Artificial Intelligence (IJ-AI)

Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1650-1657

S. Sussi, E. Husni, Arthur Siburian, Rahadian Yusuf, Agung Budi Harto, D. Suwardhi

Road extraction is one of the stages in the map-making process, which has been done manually, takes a long time, and costs a lot. Deep Learning is used to speed up the road extraction process by performing binary semantic segmentation on the image. We propose DeepLab V3+ to produce road extraction from very high-resolution orthophoto for Indonesia study area, which poses many challenges, such as road obstruction by trees, clouds, building shadows, dense traffic, and similarities to rivers and rice fields. We compared the distribution of datasets to obtain the optimal performance of the DeepLab V3+ model in relation to the dataset. The results showed that dataset ratio of 75:10:15 resulted in mean Intersection Over Union (mIoU) of 0.92 and Dice Loss of 0.042. Visually, the results of road extraction are more accurate when compared to the results obtained from different distributions of the dataset.

道路提取是地图绘制过程中的一个阶段，一直以来都由人工完成，耗时长、成本高。深度学习可通过对图像进行二元语义分割来加快道路提取过程。我们建议使用 DeepLab V3+ 从印尼研究地区的高分辨率正射影像图中提取道路，该地区面临许多挑战，如道路被树木、云层、建筑阴影、密集的交通以及与河流和稻田的相似性所遮挡。我们比较了数据集的分布，以获得 DeepLab V3+ 模型与数据集相关的最佳性能。结果表明，数据集比例为 75:10:15，平均交叉重叠率（mIoU）为 0.92，骰子损失（Dice Loss）为 0.042。从视觉上看，与数据集不同分布的结果相比，道路提取的结果更为准确。

引用次数: 0

Face mask classification using convolutional neural networks with facial image regions and super resolution 利用具有面部图像区域和超分辨率的卷积神经网络进行人脸面具分类

IAES International Journal of Artificial Intelligence (IJ-AI)

Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp2423-2432

N. Wattanakitrungroj, W. Wettayaprasit, Peemakarn Rujirapong, Sasiporn Tongman

Face mask classification is relevant to public health and safety, so an approach for face mask classification using Multi-Task Cascaded Convolutional Networks (MTCNN) for face detection on image data, ResNet152 architecture for feature extraction, and super-resolution method, BSRGAN, for enhanced image quality was proposed. The classification model was trained by a fully connected layer of neural networks. The goal is to classify each facial image into three classes: the image with a mask, without a mask, or with an incorrectly worn mask. The performance of each classification model on two real-world datasets was evaluated by Accuracy, Precision, Recall, and F1 score for different sets of input patterns which were features extracted from the facial image regions including their combinations. Using multiple image regions, i.e. face, nose, and mouth, as resources for preparing input features showed the improved classification performance compared to using single image regions. In addition, the super-resolution technique applied to medium or large-sized images can improve the performance of the face mask classification model. Our findings may further guide the development for greater effective models and techniques on face mask classification contributing to practical scenarios.

人脸面具分类与公共卫生和安全息息相关，因此提出了一种人脸面具分类方法，使用多任务级联卷积网络（MTCNN）进行图像数据的人脸检测，使用 ResNet152 架构进行特征提取，并使用超分辨率方法 BSRGAN 提高图像质量。分类模型通过全连接神经网络层进行训练。目标是将每张面部图像分为三类：带面具的图像、不带面具的图像或错误佩戴面具的图像。每个分类模型在两个真实世界数据集上的性能都是通过准确率、精确率、召回率和 F1 分数来评估的，针对的是不同的输入模式集，这些输入模式是从面部图像区域（包括其组合）中提取的特征。与使用单一图像区域相比，使用多个图像区域（即脸部、鼻子和嘴巴）作为准备输入特征的资源提高了分类性能。此外，将超分辨率技术应用于中型或大型图像也能提高人脸面具分类模型的性能。我们的研究结果可进一步指导开发更有效的人脸面具分类模型和技术，为实际应用做出贡献。

{"title":"Face mask classification using convolutional neural networks with facial image regions and super resolution","authors":"N. Wattanakitrungroj, W. Wettayaprasit, Peemakarn Rujirapong, Sasiporn Tongman","doi":"10.11591/ijai.v13.i2.pp2423-2432","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp2423-2432","url":null,"abstract":"Face mask classification is relevant to public health and safety, so an approach for face mask classification using Multi-Task Cascaded Convolutional Networks (MTCNN) for face detection on image data, ResNet152 architecture for feature extraction, and super-resolution method, BSRGAN, for enhanced image quality was proposed. The classification model was trained by a fully connected layer of neural networks. The goal is to classify each facial image into three classes: the image with a mask, without a mask, or with an incorrectly worn mask. The performance of each classification model on two real-world datasets was evaluated by Accuracy, Precision, Recall, and F1 score for different sets of input patterns which were features extracted from the facial image regions including their combinations. Using multiple image regions, i.e. face, nose, and mouth, as resources for preparing input features showed the improved classification performance compared to using single image regions. In addition, the super-resolution technique applied to medium or large-sized images can improve the performance of the face mask classification model. Our findings may further guide the development for greater effective models and techniques on face mask classification contributing to practical scenarios.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":"78 11","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141231130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing service excellence: analyzing natural language question answering with advanced cosine similarity 提升卓越服务：利用高级余弦相似性分析自然语言问题解答

IAES International Journal of Artificial Intelligence (IJ-AI)

Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1773-1781

R. Arifudin, Subhan Subhan, Yahya Nur Ifriza

Information related to student services in higher education must be produced and disseminated in various forms. Covid-19 pandemic, student services with a remote model related to this question and answer become very important. To carry out this automation process, the advanced cosine similarity method is used to check the similarity of the questions to the database and statistics to calculate the similarity value of each word. The proposed paper proceeds with three phases. The first stage to solve this problem is the data processed in question; the professional next step is word insertion. It converts alphanumeric words to vector format. Each word is a vector that represents a point in space with a certain dimension. The recommended advanced cosine similarity data still must be analyzed into a statistical approach. We will measure accuracy to get results so that optimal results and answers are obtained, research procedures are carried out based on literature study, initial data collection and observation, system development, system testing, system analysis, and system evaluation. This research implemented in universities with student chat automation applications providing an accuracy 83.90% given by natural language question answering system (NLQAS) so that it can improve excellent service in universities.

必须以各种形式制作和传播与高等教育中学生服务有关的信息。在 Covid-19 大流行的情况下，与学生服务相关的远程问答模式变得非常重要。为了实现这一自动化过程，我们采用了先进的余弦相似度方法来检查问题与数据库的相似度，并统计计算每个词的相似度值。本文拟分三个阶段进行。解决这一问题的第一阶段是问题数据处理；专业的下一步是单词插入。它将字母数字单词转换为矢量格式。每个词都是一个向量，代表空间中具有一定维度的点。推荐的高级余弦相似性数据仍然必须进行统计分析。我们将通过文献研究、初始数据收集和观察、系统开发、系统测试、系统分析和系统评估等研究程序来衡量准确性，以获得最佳结果和答案。这项研究在高校学生聊天自动化应用中实施，自然语言问题解答系统（NLQAS）的准确率达到 83.90%，从而提高了高校的优质服务水平。

{"title":"Enhancing service excellence: analyzing natural language question answering with advanced cosine similarity","authors":"R. Arifudin, Subhan Subhan, Yahya Nur Ifriza","doi":"10.11591/ijai.v13.i2.pp1773-1781","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp1773-1781","url":null,"abstract":"Information related to student services in higher education must be produced and disseminated in various forms. Covid-19 pandemic, student services with a remote model related to this question and answer become very important. To carry out this automation process, the advanced cosine similarity method is used to check the similarity of the questions to the database and statistics to calculate the similarity value of each word. The proposed paper proceeds with three phases. The first stage to solve this problem is the data processed in question; the professional next step is word insertion. It converts alphanumeric words to vector format. Each word is a vector that represents a point in space with a certain dimension. The recommended advanced cosine similarity data still must be analyzed into a statistical approach. We will measure accuracy to get results so that optimal results and answers are obtained, research procedures are carried out based on literature study, initial data collection and observation, system development, system testing, system analysis, and system evaluation. This research implemented in universities with student chat automation applications providing an accuracy 83.90% given by natural language question answering system (NLQAS) so that it can improve excellent service in universities.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":"54 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141231354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Structure tensor-based Gaussian kernel edge-adaptive depth map refinement with triangular point view in images 基于结构张量的高斯核边缘自适应深度图细化与图像中的三角点视图

IAES International Journal of Artificial Intelligence (IJ-AI)

Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1945-1953

H. Shalma, P. Selvaraj

Image reconstruction is the process of restoring the image resolution. In 3D image reconstruction, the objects in different viewpoints are processed with the triangular point view (TPV) method to estimate object geometry structure for 3D model. This work proposes a depth refinement methodology in preserving the geometric structure of objects using the structure tensor method with a Gaussian filter by transforming a series of 2D input images into a 3D model. The computation of depth map errors can be found by comparing the masked area/patch with the distribution of the original image's greyscale levels using the error pixel-based patch extraction algorithm. The presence of errors in the depth estimation could seriously deteriorate the quality of the 3D effect. The depth maps were iteratively refined based on histogram bins number to improve the accuracy of initial depth maps reconstructed from rigid objects. The existing datasets such as the dataset tanks and unit (DTU) and Middlebury datasets, were used to build the model out of the object scene structure. The results of this work have demonstrated that the proposed patch analysis outperformed the existing state of the art models depth refinement methods in terms of accuracy.

图像重建是恢复图像分辨率的过程。在三维图像重建中，不同视角下的物体通过三角点视图（TPV）方法进行处理，从而估算出三维模型的物体几何结构。这项工作提出了一种深度细化方法，通过将一系列二维输入图像转换为三维模型，使用带有高斯滤波器的结构张量法来保留物体的几何结构。利用基于误差像素的补丁提取算法，通过比较遮蔽区域/补丁与原始图像灰度级分布，可以发现深度图误差的计算。深度估算中的误差会严重影响三维效果的质量。深度图是根据直方图分段数进行迭代改进的，以提高从刚性物体重建的初始深度图的准确性。利用现有的数据集，如坦克和部队数据集（DTU）和米德尔伯里数据集，来建立物体场景结构模型。这项工作的结果表明，所提出的补丁分析法在精确度方面优于现有的先进模型深度细化方法。

{"title":"Structure tensor-based Gaussian kernel edge-adaptive depth map refinement with triangular point view in images","authors":"H. Shalma, P. Selvaraj","doi":"10.11591/ijai.v13.i2.pp1945-1953","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp1945-1953","url":null,"abstract":"Image reconstruction is the process of restoring the image resolution. In 3D image reconstruction, the objects in different viewpoints are processed with the triangular point view (TPV) method to estimate object geometry structure for 3D model. This work proposes a depth refinement methodology in preserving the geometric structure of objects using the structure tensor method with a Gaussian filter by transforming a series of 2D input images into a 3D model. The computation of depth map errors can be found by comparing the masked area/patch with the distribution of the original image's greyscale levels using the error pixel-based patch extraction algorithm. The presence of errors in the depth estimation could seriously deteriorate the quality of the 3D effect. The depth maps were iteratively refined based on histogram bins number to improve the accuracy of initial depth maps reconstructed from rigid objects. The existing datasets such as the dataset tanks and unit (DTU) and Middlebury datasets, were used to build the model out of the object scene structure. The results of this work have demonstrated that the proposed patch analysis outperformed the existing state of the art models depth refinement methods in terms of accuracy.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":"19 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141233854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

IAES International Journal of Artificial Intelligence (IJ-AI)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀