首页 > 最新文献

International Journal of Advances in Intelligent Informatics最新文献

英文 中文
Temperament detection based on Twitter data: classical machine learning versus deep learning 基于Twitter数据的气质检测:经典机器学习vs深度学习
Pub Date : 2022-03-31 DOI: 10.26555/ijain.v8i1.692
Annisa Ulizulfa, R. Kusumaningrum, K. Khadijah, Rismiyati Rismiyati
Deep learning has shown promising results in various text-based classification tasks. However, deep learning performance is affected by the number of data, i.e., when the number of data is small, deep learning algorithms do not perform well, and vice versa. Classical machine learning algorithms commonly work well for a few data, and their performance reaches an optimal value and does not increase with the increase in sample data. Therefore, this study aimed to compare the performance of classical machine learning and deep learning methods to detect temperament based on Indonesian Twitter. In this study, the proposed Indonesian Linguistic Inquiry and Word Count were employed to analyze the context of Twitter. The classical machine learning methods implemented were support vector machine and K-nearest neighbor, whereas the deep learning method employed was a convolutional neural network (CNN) with three different architectures. Both learning methods were implemented using multiclass classification and one versus all (OVA) multiclass classification. The highest average f-measure was 58.73%, obtained by CNN OVA with a pool size of 3, a dropout value of 0.7, and a learning rate value of 0.0007.
深度学习在各种基于文本的分类任务中显示出有希望的结果。然而,深度学习的性能受到数据量的影响,即当数据量较少时,深度学习算法的性能不佳,反之亦然。经典的机器学习算法通常只适用于少数数据,其性能达到最优值,不会随着样本数据的增加而增加。因此,本研究旨在比较经典机器学习和深度学习方法在基于印尼Twitter的气质检测中的性能。本研究采用印尼语语言调查和字数统计来分析Twitter的语境。实现的经典机器学习方法是支持向量机和k近邻,而深度学习方法采用了具有三种不同架构的卷积神经网络(CNN)。两种学习方法均采用多类分类和一对全(OVA)多类分类实现。CNN OVA在池大小为3,dropout值为0.7,学习率值为0.0007的情况下,获得了最高的平均f值58.73%。
{"title":"Temperament detection based on Twitter data: classical machine learning versus deep learning","authors":"Annisa Ulizulfa, R. Kusumaningrum, K. Khadijah, Rismiyati Rismiyati","doi":"10.26555/ijain.v8i1.692","DOIUrl":"https://doi.org/10.26555/ijain.v8i1.692","url":null,"abstract":"Deep learning has shown promising results in various text-based classification tasks. However, deep learning performance is affected by the number of data, i.e., when the number of data is small, deep learning algorithms do not perform well, and vice versa. Classical machine learning algorithms commonly work well for a few data, and their performance reaches an optimal value and does not increase with the increase in sample data. Therefore, this study aimed to compare the performance of classical machine learning and deep learning methods to detect temperament based on Indonesian Twitter. In this study, the proposed Indonesian Linguistic Inquiry and Word Count were employed to analyze the context of Twitter. The classical machine learning methods implemented were support vector machine and K-nearest neighbor, whereas the deep learning method employed was a convolutional neural network (CNN) with three different architectures. Both learning methods were implemented using multiclass classification and one versus all (OVA) multiclass classification. The highest average f-measure was 58.73%, obtained by CNN OVA with a pool size of 3, a dropout value of 0.7, and a learning rate value of 0.0007.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81587249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A data mining approach for classification of traffic violations types 一种交通违规类型分类的数据挖掘方法
Pub Date : 2021-11-30 DOI: 10.26555/ijain.v7i3.708
Norhidayah Othman, Cik Feresa Mohd Foozy, Aida Mustapha, S. Mostafa, Shamala Palaniappan, Shafiza Ariffin Kashinath
Traffic summons, also known as traffic tickets, is a notice issued by a law enforcement official to a motorist, who is a person who drives a car, lorry, or bus, and a person who rides a motorcycle. This study is set to perform a comparative experiment to compare the performance of three classification algorithms (Naive Bayes, Gradient Boosted Trees, and Deep Learning algorithm) in classifying the traffic violation types. The performance of all the three classification models developed in this work is measured and compared. The results show that the Gradient Boosted Trees and Deep Learning algorithm have the best value in accuracy and recall but low precision. Naïve Bayes, on the other hand, has high recall since it is a picky classifier that only performs well in a dataset that is high in precision. This paper’s results could serve as baseline results for investigations related to the classification of traffic violation types. It is also helpful for authorities to strategize and plan ways to reduce traffic violations among road users by studying the most common traffic violation types in an area, whether a citation, a warning, or an ESERO (Electronic Safety Equipment Repair Order).
交通传票,也被称为交通罚单,是由执法人员发给驾驶汽车、卡车或公共汽车的人和骑摩托车的人的通知。本研究拟进行对比实验,比较三种分类算法(朴素贝叶斯、梯度提升树和深度学习算法)对交通违章类型的分类性能。本文对所开发的三种分类模型的性能进行了测量和比较。结果表明,梯度增强树和深度学习算法在准确率和查全率方面具有较好的价值,但精度较低。Naïve另一方面,贝叶斯具有高召回率,因为它是一个挑剔的分类器,只在高精度的数据集中表现良好。本文的研究结果可作为交通违法类型分类调查的基准结果。通过研究一个地区最常见的交通违规类型,无论是传票、警告还是ESERO(电子安全设备维修令),这也有助于当局制定战略和规划减少道路使用者交通违规行为的方法。
{"title":"A data mining approach for classification of traffic violations types","authors":"Norhidayah Othman, Cik Feresa Mohd Foozy, Aida Mustapha, S. Mostafa, Shamala Palaniappan, Shafiza Ariffin Kashinath","doi":"10.26555/ijain.v7i3.708","DOIUrl":"https://doi.org/10.26555/ijain.v7i3.708","url":null,"abstract":"Traffic summons, also known as traffic tickets, is a notice issued by a law enforcement official to a motorist, who is a person who drives a car, lorry, or bus, and a person who rides a motorcycle. This study is set to perform a comparative experiment to compare the performance of three classification algorithms (Naive Bayes, Gradient Boosted Trees, and Deep Learning algorithm) in classifying the traffic violation types. The performance of all the three classification models developed in this work is measured and compared. The results show that the Gradient Boosted Trees and Deep Learning algorithm have the best value in accuracy and recall but low precision. Naïve Bayes, on the other hand, has high recall since it is a picky classifier that only performs well in a dataset that is high in precision. This paper’s results could serve as baseline results for investigations related to the classification of traffic violation types. It is also helpful for authorities to strategize and plan ways to reduce traffic violations among road users by studying the most common traffic violation types in an area, whether a citation, a warning, or an ESERO (Electronic Safety Equipment Repair Order).","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90835286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Comparative analysis of classification techniques for leaves and land cover texture 叶片与土地覆盖纹理分类技术的比较分析
Pub Date : 2021-11-30 DOI: 10.26555/ijain.v7i3.706
Azri Azrul Azmer, Norlida Hassan, Shihab Hamad Khaleefah, S. Mostafa, A. A. Ramli
The texture is the object’s appearance with different surfaces and sizes. It is mainly helpful for different applications, including object recognition, fingerprinting, and surface analysis. The goal of this research is to investigate the best classification models among the Naive Bayes (NB), Random Forest (DF), and k-Nearest Neighbor (k-NN) algorithms in performing texture classification. The algorithms classify the leaves and urban land cover of texture using several evaluation criteria. This research project aims to prove that the accuracy can be used on data of texture that have turned in a group of different types of data target based on the texture’s characteristic and find out which classification algorithm has better performance when analyzing texture patterns. The test results show that the NB algorithm has the best overall accuracy of 78.67% for the leaves dataset and 93.60% overall accuracy for the urban land cover dataset.
纹理是物体具有不同表面和大小的外观。它主要用于不同的应用,包括物体识别、指纹识别和表面分析。本研究的目的是探讨朴素贝叶斯(NB)、随机森林(DF)和k-近邻(k-NN)算法在纹理分类中的最佳分类模型。该算法使用多个评价标准对树叶和城市土地覆盖的纹理进行分类。本研究项目旨在根据纹理的特征,对一组不同类型的数据目标中的纹理数据进行精度验证,并在分析纹理模式时找出哪种分类算法的性能更好。测试结果表明,NB算法对树叶数据集的总体精度为78.67%,对城市土地覆盖数据集的总体精度为93.60%。
{"title":"Comparative analysis of classification techniques for leaves and land cover texture","authors":"Azri Azrul Azmer, Norlida Hassan, Shihab Hamad Khaleefah, S. Mostafa, A. A. Ramli","doi":"10.26555/ijain.v7i3.706","DOIUrl":"https://doi.org/10.26555/ijain.v7i3.706","url":null,"abstract":"The texture is the object’s appearance with different surfaces and sizes. It is mainly helpful for different applications, including object recognition, fingerprinting, and surface analysis. The goal of this research is to investigate the best classification models among the Naive Bayes (NB), Random Forest (DF), and k-Nearest Neighbor (k-NN) algorithms in performing texture classification. The algorithms classify the leaves and urban land cover of texture using several evaluation criteria. This research project aims to prove that the accuracy can be used on data of texture that have turned in a group of different types of data target based on the texture’s characteristic and find out which classification algorithm has better performance when analyzing texture patterns. The test results show that the NB algorithm has the best overall accuracy of 78.67% for the leaves dataset and 93.60% overall accuracy for the urban land cover dataset.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"47 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77938154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Sentiment analysis of Indonesian hotel reviews: from classical machine learning to deep learning 印尼酒店评论的情感分析:从经典机器学习到深度学习
Pub Date : 2021-11-30 DOI: 10.26555/ijain.v7i3.737
R. Kusumaningrum, Iffa Zainan Nisa, Rizka Putri Nawangsari, A. Wibowo
Currently, there are a large number of hotel reviews on the Internet that need to be evaluated to turn the data into practicable information. Deep learning has excellent capabilities for recognizing this type of data. With the advances in deep learning paradigms, many algorithms have been developed that can be used in sentiment analysis tasks. In this study, we aim to compare the performance of classical machine learning algorithms—logistic regression (LR), naïve Bayes (NB), and support vector machine (SVM) using the Word2Vec model in conjunction with deep learning algorithms such as a convolutional neural network (CNN) to classify hotel reviews on the Traveloka website into positive or negative classes. Both learning methods apply hyperparameter tuning to determine the parameters that produce the best model. Furthermore, the Word2Vec model parameters use the skip-gram model, hierarchical softmax evaluation, and the value of 100 vector dimensions. The highest average accuracy obtained was 98.08% by using the CNN with a dropout of 0.2, Tanh as convolution activation, softmax as output activation, and Adam as the optimizer. The findings from the study demonstrate that the integration of the Word2Vec model and the CNN model obtains significantly better accuracy than other classical machine learning methods.
目前,互联网上有大量的酒店评论,需要对这些评论进行评估,将数据转化为实用的信息。深度学习在识别这类数据方面具有出色的能力。随着深度学习范式的进步,已经开发出许多可用于情感分析任务的算法。在这项研究中,我们的目标是比较经典机器学习算法——逻辑回归(LR)、naïve贝叶斯(NB)和支持向量机(SVM)的性能,使用Word2Vec模型和卷积神经网络(CNN)等深度学习算法,将Traveloka网站上的酒店评论分为积极和消极两类。两种学习方法都采用超参数调优来确定产生最佳模型的参数。Word2Vec模型参数采用skip-gram模型、分层softmax评价和100个向量维值。使用dropout为0.2的CNN, Tanh作为卷积激活,softmax作为输出激活,Adam作为优化器,获得的最高平均准确率为98.08%。研究结果表明,Word2Vec模型与CNN模型的集成比其他经典机器学习方法获得了明显更好的精度。
{"title":"Sentiment analysis of Indonesian hotel reviews: from classical machine learning to deep learning","authors":"R. Kusumaningrum, Iffa Zainan Nisa, Rizka Putri Nawangsari, A. Wibowo","doi":"10.26555/ijain.v7i3.737","DOIUrl":"https://doi.org/10.26555/ijain.v7i3.737","url":null,"abstract":"Currently, there are a large number of hotel reviews on the Internet that need to be evaluated to turn the data into practicable information. Deep learning has excellent capabilities for recognizing this type of data. With the advances in deep learning paradigms, many algorithms have been developed that can be used in sentiment analysis tasks. In this study, we aim to compare the performance of classical machine learning algorithms—logistic regression (LR), naïve Bayes (NB), and support vector machine (SVM) using the Word2Vec model in conjunction with deep learning algorithms such as a convolutional neural network (CNN) to classify hotel reviews on the Traveloka website into positive or negative classes. Both learning methods apply hyperparameter tuning to determine the parameters that produce the best model. Furthermore, the Word2Vec model parameters use the skip-gram model, hierarchical softmax evaluation, and the value of 100 vector dimensions. The highest average accuracy obtained was 98.08% by using the CNN with a dropout of 0.2, Tanh as convolution activation, softmax as output activation, and Adam as the optimizer. The findings from the study demonstrate that the integration of the Word2Vec model and the CNN model obtains significantly better accuracy than other classical machine learning methods.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"6 3-4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72569660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Prediction of silicon content in the hot metal using Bayesian networks and probabilistic reasoning 用贝叶斯网络和概率推理预测铁水中硅的含量
Pub Date : 2021-11-30 DOI: 10.26555/ijain.v7i3.771
W. Cardoso, R. Felice
The blast furnace is the principal method of producing cast iron. In the production of cast iron, the control of silicon is vital because this impurity is harmful to almost all steels. Artificial neural networks with Bayesian regularization are more robust than traditional back-propagation networks and can reduce or eliminate the need for tedious cross-validation. Bayesian regularization is a mathematical process that converts a nonlinear regression into a "well-posed" statistical problem in the manner of ridge regression. The main objective of this work was to develop an artificial neural network to predict silicon content in hot metal by varying the number of neurons in the hidden layer by 10, 20, 25, 30, 40, 50, 75, and 100 neurons. The results show that all neural networks converged and presented reliable results, neural networks with 20, 25, and 30 neurons showed the best overall results. However, In short, Bayesian neural networks can be used in practice because the actual values correlate excellently with the values calculated by the neural network.
高炉是生产铸铁的主要方法。在铸铁生产中,硅的控制是至关重要的,因为这种杂质几乎对所有钢都有害。具有贝叶斯正则化的人工神经网络比传统的反向传播网络具有更强的鲁棒性,并且可以减少或消除繁琐的交叉验证。贝叶斯正则化是以脊回归的方式将非线性回归转化为“适定的”统计问题的数学过程。这项工作的主要目标是开发一个人工神经网络,通过改变隐藏层中10、20、25、30、40、50、75和100个神经元的数量来预测热金属中的硅含量。结果表明,所有神经网络均能收敛并呈现可靠的结果,其中20、25和30个神经元的神经网络整体效果最好。简而言之,贝叶斯神经网络可以在实践中使用,因为实际值与神经网络计算的值具有很好的相关性。
{"title":"Prediction of silicon content in the hot metal using Bayesian networks and probabilistic reasoning","authors":"W. Cardoso, R. Felice","doi":"10.26555/ijain.v7i3.771","DOIUrl":"https://doi.org/10.26555/ijain.v7i3.771","url":null,"abstract":"The blast furnace is the principal method of producing cast iron. In the production of cast iron, the control of silicon is vital because this impurity is harmful to almost all steels. Artificial neural networks with Bayesian regularization are more robust than traditional back-propagation networks and can reduce or eliminate the need for tedious cross-validation. Bayesian regularization is a mathematical process that converts a nonlinear regression into a \"well-posed\" statistical problem in the manner of ridge regression. The main objective of this work was to develop an artificial neural network to predict silicon content in hot metal by varying the number of neurons in the hidden layer by 10, 20, 25, 30, 40, 50, 75, and 100 neurons. The results show that all neural networks converged and presented reliable results, neural networks with 20, 25, and 30 neurons showed the best overall results. However, In short, Bayesian neural networks can be used in practice because the actual values correlate excellently with the values calculated by the neural network.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"3 4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75375252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Cable fault classification in ADSL copper access network using machine learning 基于机器学习的ADSL铜接入网电缆故障分类
Pub Date : 2021-11-30 DOI: 10.26555/ijain.v7i3.488
Nurul Bashirah Ghazali, Dang Fillatina Hashim, F. Che Seman, K. Isa, K. N. Ramli, Z. Abidin, S. Mustam, Mohammed Al Haek
Asymmetrical Digital Subscriber Line (ADSL) is the technology widely deployed worldwide, but its performance may be limited with respect to its intrinsic. The nature of the copper cable causes it to be more susceptible to signal degradation and faulty line. Common ADSL line faults are short-wired fault, open-wired fault, bridge taps, and uneven pair. However, ADSL technology is still one of the most established networks, and users in the suburban area still depend on the technology to access the internet service. This paper discussed and compared a machine learning algorithm based on Decision Trees (J48), K-Nearest Neighbor, Multi-level Perceptron, Naïve Bayes, Random Forest, and Sequential Minimal Optimization (SMO) for ADSL line impairment that affects the line operation performance concerning their percentage of accuracy. Resulting from classifications done using algorithms as mentioned above, the random forest algorithm gives the highest overall accuracy for the ADSL line impairment dataset. The best algorithm for classifying DSL line impairment is chosen based on the highest accuracy percentage. The accomplishment classification of fault type in the ADSL copper access network project may benefit the telecommunication network provider by remotely assessing the network condition rather than on-site.
非对称数字用户线路(ADSL)是一项在世界范围内广泛应用的技术,但其性能可能受到其固有特性的限制。铜电缆的性质使其更容易受到信号退化和线路故障的影响。常见的ADSL线路故障有短线故障、开线故障、桥接分接和不均匀对等。然而,ADSL技术仍然是最成熟的网络之一,郊区用户仍然依赖该技术访问互联网服务。本文讨论并比较了一种基于决策树(J48)、k近邻、多层感知器、Naïve贝叶斯、随机森林和顺序最小优化(SMO)的机器学习算法,用于影响线路操作性能的ADSL线路损伤,涉及其准确度百分比。由于使用上述算法进行分类,随机森林算法为ADSL线路损伤数据集提供了最高的总体精度。以最高的准确率为基础,选择最佳的DSL线路损伤分类算法。在ADSL铜接入网工程中,故障类型的完成分类可以通过远程评估网络状况而不是现场评估,从而使电信网络提供商受益。
{"title":"Cable fault classification in ADSL copper access network using machine learning","authors":"Nurul Bashirah Ghazali, Dang Fillatina Hashim, F. Che Seman, K. Isa, K. N. Ramli, Z. Abidin, S. Mustam, Mohammed Al Haek","doi":"10.26555/ijain.v7i3.488","DOIUrl":"https://doi.org/10.26555/ijain.v7i3.488","url":null,"abstract":"Asymmetrical Digital Subscriber Line (ADSL) is the technology widely deployed worldwide, but its performance may be limited with respect to its intrinsic. The nature of the copper cable causes it to be more susceptible to signal degradation and faulty line. Common ADSL line faults are short-wired fault, open-wired fault, bridge taps, and uneven pair. However, ADSL technology is still one of the most established networks, and users in the suburban area still depend on the technology to access the internet service. This paper discussed and compared a machine learning algorithm based on Decision Trees (J48), K-Nearest Neighbor, Multi-level Perceptron, Naïve Bayes, Random Forest, and Sequential Minimal Optimization (SMO) for ADSL line impairment that affects the line operation performance concerning their percentage of accuracy. Resulting from classifications done using algorithms as mentioned above, the random forest algorithm gives the highest overall accuracy for the ADSL line impairment dataset. The best algorithm for classifying DSL line impairment is chosen based on the highest accuracy percentage. The accomplishment classification of fault type in the ADSL copper access network project may benefit the telecommunication network provider by remotely assessing the network condition rather than on-site.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"103 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72439806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Adjusting cyber insurance premiums based on frequency in a communication network 根据通信网络的频率调整网络保险费
Pub Date : 2021-11-30 DOI: 10.26555/ijain.v7i3.698
S. Indratno, Y. Antonio, S. Saputro
This study compares cyber insurance premiums with and without a communication network effect frequency. As a cybersecurity factor, the frequency in a communication network influences the speed of cyberattack transmission. It means that a network or a high activity node is more vulnerable than a network with low activity. Traditionally, cyber insurance pricing considers historical data to set premiums or rates. Conversely, the network security level can evaluate using the Monte Carlo simulation based on the epidemic model. This simulation requires spreading parameters, such as infection rate, recovery rate, and self-infection rate. Our idea is to modify the infection rate as a function of the frequency in a communication network. The node-based model uses probability distributions for the communication mechanism to generate the data. It adopts the co-purchase network formation in market basket analysis for building weighted edges and nodes. Simulations are used to compare the initial and modified infection rates. This paper considered prism and Petersen graph topology as case studies. The relative difference is a metric to compare the significance of premium adjustment. The results show that the premium for a node with a low level in a communication network can reach 28.28% lower than the initial premium. The premium can reach 20.99% lower than the initial network premium for a network. Based on these results, insurance companies can adjust cyber insurance premiums based on computer usage to offer a more appropriate price.
本研究比较了有无通讯网络效应频率的网路保险保费。作为一个网络安全因素,通信网络中的频率影响着网络攻击的传播速度。这意味着网络或高活动节点比低活动网络更容易受到攻击。传统上,网络保险定价考虑历史数据来设定保费或费率。反过来,网络安全水平可以使用基于流行病模型的蒙特卡罗模拟来评估。该模拟需要传播参数,如感染率、恢复率和自感染率。我们的想法是将感染率修改为通信网络频率的函数。基于节点的模型使用概率分布作为通信机制来生成数据。该算法采用购物篮分析中共同购买网络的形成来构建加权边和节点。模拟用于比较初始感染率和修改后的感染率。本文以棱镜图拓扑和Petersen图拓扑为例进行了研究。相对差是比较保费调整重要性的指标。结果表明,通信网络中低电平节点的溢价可达初始溢价的28.28%。对于一个网络,可以比初始网络保险费低20.99%。根据这些结果,保险公司可以根据电脑使用情况调整网络保险费,以提供更合适的价格。
{"title":"Adjusting cyber insurance premiums based on frequency in a communication network","authors":"S. Indratno, Y. Antonio, S. Saputro","doi":"10.26555/ijain.v7i3.698","DOIUrl":"https://doi.org/10.26555/ijain.v7i3.698","url":null,"abstract":"This study compares cyber insurance premiums with and without a communication network effect frequency. As a cybersecurity factor, the frequency in a communication network influences the speed of cyberattack transmission. It means that a network or a high activity node is more vulnerable than a network with low activity. Traditionally, cyber insurance pricing considers historical data to set premiums or rates. Conversely, the network security level can evaluate using the Monte Carlo simulation based on the epidemic model. This simulation requires spreading parameters, such as infection rate, recovery rate, and self-infection rate. Our idea is to modify the infection rate as a function of the frequency in a communication network. The node-based model uses probability distributions for the communication mechanism to generate the data. It adopts the co-purchase network formation in market basket analysis for building weighted edges and nodes. Simulations are used to compare the initial and modified infection rates. This paper considered prism and Petersen graph topology as case studies. The relative difference is a metric to compare the significance of premium adjustment. The results show that the premium for a node with a low level in a communication network can reach 28.28% lower than the initial premium. The premium can reach 20.99% lower than the initial network premium for a network. Based on these results, insurance companies can adjust cyber insurance premiums based on computer usage to offer a more appropriate price.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"34 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75921750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Korean popular culture analytics in social media streaming: evidence from YouTube channels in Thailand 社交媒体流媒体中的韩国流行文化分析:来自泰国YouTube频道的证据
Pub Date : 2021-11-30 DOI: 10.26555/ijain.v7i3.769
Wirapong Chansanam, Kulthida Tuamsuk, Kanyarat Kwiecien, Sam Oh
This research aimed to study and analyze the influence and impact of Korean popular culture (K-pop) on Thai society. In this study, we used Social Network Analysis (SNA) to analyze streaming data obtained from a variety of YouTube channels belonging to YouTubers across the world, text analytics to analyze demographic characteristics, YouTuber's presentation techniques, as well as subscriber behavior, and multiple correlations analysis to analyze the relationship between factors affecting YouTube Channels in Thailand. The findings revealed that five Thai YouTube Channels were influencing Thai society. Furthermore, there were robust positive correlations between the number of dislikes and the number of comments (0.79), and the number of likes and comments (0.65). Additionally, there was a positive correlation between the number of views and the number of dislikes and one between the number of likes and dislikes. Future research can supplement the present findings with other social media sources to yield an even more diverse and comprehensive analysis. These analytics can be applied to various situations, including corporate marketing strategies, political campaigns, or disease/symptom analysis in medicine. This research extends to social computing by revealing intelligent trends in social networks.
本研究旨在研究和分析韩国流行文化(K-pop)对泰国社会的影响和影响。在这项研究中,我们使用社交网络分析(SNA)来分析来自世界各地YouTube频道的各种YouTube频道的流媒体数据,文本分析来分析人口统计学特征,YouTube的演示技术,以及订阅者行为,以及多重相关性分析来分析影响泰国YouTube频道的因素之间的关系。调查结果显示,五个泰国YouTube频道正在影响泰国社会。此外,不喜欢的数量和评论的数量(0.79)以及喜欢和评论的数量(0.65)之间存在显著的正相关。此外,观看次数与不喜欢次数呈正相关,喜欢次数与不喜欢次数呈正相关。未来的研究可以用其他社交媒体来源补充目前的研究结果,以产生更多样化和全面的分析。这些分析可以应用于各种情况,包括企业营销策略、政治活动或医学上的疾病/症状分析。这项研究通过揭示社会网络中的智能趋势扩展到社会计算。
{"title":"Korean popular culture analytics in social media streaming: evidence from YouTube channels in Thailand","authors":"Wirapong Chansanam, Kulthida Tuamsuk, Kanyarat Kwiecien, Sam Oh","doi":"10.26555/ijain.v7i3.769","DOIUrl":"https://doi.org/10.26555/ijain.v7i3.769","url":null,"abstract":"This research aimed to study and analyze the influence and impact of Korean popular culture (K-pop) on Thai society. In this study, we used Social Network Analysis (SNA) to analyze streaming data obtained from a variety of YouTube channels belonging to YouTubers across the world, text analytics to analyze demographic characteristics, YouTuber's presentation techniques, as well as subscriber behavior, and multiple correlations analysis to analyze the relationship between factors affecting YouTube Channels in Thailand. The findings revealed that five Thai YouTube Channels were influencing Thai society. Furthermore, there were robust positive correlations between the number of dislikes and the number of comments (0.79), and the number of likes and comments (0.65). Additionally, there was a positive correlation between the number of views and the number of dislikes and one between the number of likes and dislikes. Future research can supplement the present findings with other social media sources to yield an even more diverse and comprehensive analysis. These analytics can be applied to various situations, including corporate marketing strategies, political campaigns, or disease/symptom analysis in medicine. This research extends to social computing by revealing intelligent trends in social networks.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88551899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Development of marker detection method for estimating angle and distance of underwater remotely operated vehicle to buoyant boat 水下遥控航行器与浮力船的角度和距离估计标记检测方法的研制
Pub Date : 2021-11-30 DOI: 10.26555/ijain.v7i3.455
Muhammad Qomaruz Zaman, R. Mardiyanto
The paper proposes a Marker Detection Method for Estimating the Angle and Distance of Underwater Remotely Operated Vehicle (ROV) to Buoyant Boat. To keep the ROV aligned with the boat, a marker and visual recognition system are designed. The marker is placed facing down under the boat and a method is developed to recognize the angle and distance of the marker from a facing up camera on the ROV. By considering space, payload, heat dissipation, and buoyancy in a micro class ROV, there are limited options for computing power that can be utilized. This challenge demands a lightweight visual recognition technique for small computers. The proposed method consists of two steps. The marker designing step explains how the marker is constructed of simple components. The marker recognizing step is based on image processing that uses threshold and blob filtering. They are blob size and blob circularity filters which are used to eliminate unwanted information. The real-time orientation and distance estimation by using one camera are the superiority of this method. The proposed method has been tested by using an 11x11 cm2 marker size. The detection rate of the marker is 90% and can be detected up to 120 cm from the camera. The marker can be tilted up to 50° and still has an 80% detection rate. The method can estimate marker rotation angle accurately with a 1.75° average error. The method can estimate the distance between the marker and camera with a -0.62 cm average error. The blob filter is also proven to be superior to a regular dilating and eroding method.
提出了一种估计水下遥控机器人与浮力船的角度和距离的标记检测方法。为了使ROV与船保持一致,设计了一个标记和视觉识别系统。标记面朝下放置在船下,并开发了一种方法来识别标记的角度和距离,从ROV上的一个面朝上的摄像头。考虑到微型ROV的空间、有效载荷、散热和浮力,可利用的计算能力选择有限。这个挑战需要一种适用于小型计算机的轻量级视觉识别技术。该方法分为两个步骤。标记设计步骤解释了如何用简单组件构造标记。标记识别步骤基于使用阈值和斑点滤波的图像处理。它们是用于消除不需要信息的斑点大小和斑点圆形过滤器。该方法的优点是单摄像机实时定位和距离估计。所提出的方法已通过使用11x11cm2标记尺寸进行了测试。该标记的检出率为90%,可在距离摄像机120厘米处检测到。该标记可以倾斜50°,仍然有80%的检出率。该方法可以准确估计标记旋转角度,平均误差为1.75°。该方法可以估计出标记点与相机之间的距离,平均误差为-0.62 cm。斑点过滤器也被证明优于常规的扩张和侵蚀方法。
{"title":"Development of marker detection method for estimating angle and distance of underwater remotely operated vehicle to buoyant boat","authors":"Muhammad Qomaruz Zaman, R. Mardiyanto","doi":"10.26555/ijain.v7i3.455","DOIUrl":"https://doi.org/10.26555/ijain.v7i3.455","url":null,"abstract":"The paper proposes a Marker Detection Method for Estimating the Angle and Distance of Underwater Remotely Operated Vehicle (ROV) to Buoyant Boat. To keep the ROV aligned with the boat, a marker and visual recognition system are designed. The marker is placed facing down under the boat and a method is developed to recognize the angle and distance of the marker from a facing up camera on the ROV. By considering space, payload, heat dissipation, and buoyancy in a micro class ROV, there are limited options for computing power that can be utilized. This challenge demands a lightweight visual recognition technique for small computers. The proposed method consists of two steps. The marker designing step explains how the marker is constructed of simple components. The marker recognizing step is based on image processing that uses threshold and blob filtering. They are blob size and blob circularity filters which are used to eliminate unwanted information. The real-time orientation and distance estimation by using one camera are the superiority of this method. The proposed method has been tested by using an 11x11 cm2 marker size. The detection rate of the marker is 90% and can be detected up to 120 cm from the camera. The marker can be tilted up to 50° and still has an 80% detection rate. The method can estimate marker rotation angle accurately with a 1.75° average error. The method can estimate the distance between the marker and camera with a -0.62 cm average error. The blob filter is also proven to be superior to a regular dilating and eroding method.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85405815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An improved K-Nearest neighbour with grasshopper optimization algorithm for imputation of missing data 一种改进的k近邻蝗虫优化算法用于缺失数据的补全
Pub Date : 2021-11-30 DOI: 10.26555/ijain.v7i3.696
Nadzurah Zainal Abidin, Amelia Ritahani Ismail
K-nearest neighbors (KNN) has been extensively used as imputation algorithm to substitute missing data with plausible values. One of the successes of KNN imputation is the ability to measure the missing data simulated from its nearest neighbors robustly. However, despite the favorable points, KNN still imposes undesirable circumstances. KNN suffers from high time complexity, choosing the right k, and different functions. Thus, this paper proposes a novel method for imputation of missing data, named KNNGOA, which optimized the KNN imputation technique based on the grasshopper optimization algorithm. Our GOA is designed to find the best value of k and optimize the imputed value from KNN that maximizes the imputation accuracy. Experimental evaluation for different types of datasets collected from UCI, with various rates of missing values ranging from 10%, 30%, and 50%. Our proposed algorithm has achieved promising results from the experiment conducted, which outperformed other methods, especially in terms of accuracy.
k近邻(KNN)算法被广泛用于用可信值替代缺失数据。KNN imputation的一个成功之处在于它能够鲁棒地测量最近邻居模拟的缺失数据。然而,尽管有这些优点,KNN仍然施加了不利的情况。KNN存在时间复杂度高、选择正确的k和不同的函数等问题。因此,本文提出了一种新的缺失数据补全方法KNNGOA,该方法对基于grasshopper优化算法的KNN补全技术进行了优化。我们的GOA旨在找到k的最佳值,并从KNN中优化输入值,使输入精度最大化。对UCI收集的不同类型数据集进行实验评估,缺失率从10%、30%到50%不等。通过实验,我们提出的算法取得了令人满意的结果,特别是在准确率方面优于其他方法。
{"title":"An improved K-Nearest neighbour with grasshopper optimization algorithm for imputation of missing data","authors":"Nadzurah Zainal Abidin, Amelia Ritahani Ismail","doi":"10.26555/ijain.v7i3.696","DOIUrl":"https://doi.org/10.26555/ijain.v7i3.696","url":null,"abstract":"K-nearest neighbors (KNN) has been extensively used as imputation algorithm to substitute missing data with plausible values. One of the successes of KNN imputation is the ability to measure the missing data simulated from its nearest neighbors robustly. However, despite the favorable points, KNN still imposes undesirable circumstances. KNN suffers from high time complexity, choosing the right k, and different functions. Thus, this paper proposes a novel method for imputation of missing data, named KNNGOA, which optimized the KNN imputation technique based on the grasshopper optimization algorithm. Our GOA is designed to find the best value of k and optimize the imputed value from KNN that maximizes the imputation accuracy. Experimental evaluation for different types of datasets collected from UCI, with various rates of missing values ranging from 10%, 30%, and 50%. Our proposed algorithm has achieved promising results from the experiment conducted, which outperformed other methods, especially in terms of accuracy.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"128 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73621283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
International Journal of Advances in Intelligent Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1