Ilkom Jurnal Ilmiah最新文献_第4页

Fuzzy C-Means with Borda Algorithm in Cluster Determination System for Food Prone Areas in Aceh Utara 基于Borda算法的模糊C-均值在亚齐-乌塔拉食物易发区聚类判定系统中的应用

Ilkom Jurnal Ilmiah

Pub Date : 2023-04-07 DOI: 10.33096/ilkom.v15i1.1481.21-31

Mutammimul Ula, M. Ula, Desvina Yulisda, S. Susanti

In this research, the clustering of food prone areas in Aceh Utama is based on the Index Ketahanan Pangan (IKP) indicators compiled by Badan Ketahanan Pangan (BKP) using Fuzzy C-Means (FCM) and Borda algorithms. The fuzzy C-Means algorithm was used to classify food-prone areas with three clusters: very prone, moderately prone, and prone. The Borda algorithm was used to choose the most prone area from very prone clusters, which are considered urgently to be followed up by decision-makers. Based on the research results, it was found that in the aspect of food availability, four sub-districts are moderately prone, 10 are prone, and 13 are very prone. Regarding food affordability, it found that 12 sub-districts are moderately prone, seven are prone, and eight are very prone. Regarding food utilization, one sub-district is moderately prone, three are prone, and 23 are very prone. The results of voting using the Borda algorithm in very prone clusters are obtained Sawang District from the aspect of food availability, Syamtalira Aron District from the aspect of food affordability, and Lapang District from the aspect of food utilization. The clustering system is built based on the web using the PHP programming language.

在这项研究中，亚齐乌塔马的食物易发地区的聚类是基于Badan Ketahanan Pangan（BKP）使用模糊C均值（FCM）和Borda算法编制的指数Ketahanan-Pangan（IKP）指标。使用模糊C均值算法将食物易发区域分为三类：非常易发、中等易发和易发。Borda算法用于从非常容易发生的聚类中选择最容易发生的区域，决策者迫切需要跟进这些聚类。根据研究结果发现，在食物可得性方面，有4个分区中度易发，10个易发，13个非常易发。在食物负担能力方面，调查发现，12个分区中等倾向，7个倾向，8个非常倾向。在食物利用方面，一个街道是中度易发区，三个是易发区和23个是非常易发区。在非常容易发生的集群中，使用Borda算法的投票结果是从食物供应方面获得的Sawang区，从食物可负担性方面获得的Syamtalira Aron区，以及从食物利用方面获得的Lapang区。集群系统是使用PHP编程语言基于web构建的。

{"title":"Fuzzy C-Means with Borda Algorithm in Cluster Determination System for Food Prone Areas in Aceh Utara","authors":"Mutammimul Ula, M. Ula, Desvina Yulisda, S. Susanti","doi":"10.33096/ilkom.v15i1.1481.21-31","DOIUrl":"https://doi.org/10.33096/ilkom.v15i1.1481.21-31","url":null,"abstract":"In this research, the clustering of food prone areas in Aceh Utama is based on the Index Ketahanan Pangan (IKP) indicators compiled by Badan Ketahanan Pangan (BKP) using Fuzzy C-Means (FCM) and Borda algorithms. The fuzzy C-Means algorithm was used to classify food-prone areas with three clusters: very prone, moderately prone, and prone. The Borda algorithm was used to choose the most prone area from very prone clusters, which are considered urgently to be followed up by decision-makers. Based on the research results, it was found that in the aspect of food availability, four sub-districts are moderately prone, 10 are prone, and 13 are very prone. Regarding food affordability, it found that 12 sub-districts are moderately prone, seven are prone, and eight are very prone. Regarding food utilization, one sub-district is moderately prone, three are prone, and 23 are very prone. The results of voting using the Borda algorithm in very prone clusters are obtained Sawang District from the aspect of food availability, Syamtalira Aron District from the aspect of food affordability, and Lapang District from the aspect of food utilization. The clustering system is built based on the web using the PHP programming language.","PeriodicalId":33690,"journal":{"name":"Ilkom Jurnal Ilmiah","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43555589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sentiment Analysis and Classification of Forest Fires in Indonesia 印尼森林火灾的情绪分析与分类

Ilkom Jurnal Ilmiah

Pub Date : 2023-04-07 DOI: 10.33096/ilkom.v15i1.1337.175-185

I. Irawanto, Cynthia Widodo, Atin Hasanah, Prema Adhitya Dharma Kusumah, Kusirini Kusrini, Kusnawi Kusnawi

so the preprocessing process must be carried out as was done in research [5], which retrieved Twitter data on the theme of COVID 2019. Moreover, weighting must be applied to tasks that must be completed prior to classification. This study uses VADER or commonly known as the lexicon. [6] It uses a lexicon that combines lexical dictionary features as a polarity assessment. Sentiment scores of 5 additional criteria, namely exclamation marks, large alphabet, level of word order, polarity shift due to the term "but," and using the tri-gram feature to study negation [7]. Once the text has been labeled, we will classify it using the sentiment analysis. The Nave Bayes technique, Random Forest, and SVM are some reliable classifications that have been demonstrated in numerous research (Support Vector Machine). A popular algorithm that is frequently employed by researchers is Naïve Bayes. The following researchers have used the Naive Bayes method for sentiment analysis research: : [8] analyzing the online store JD.ID, [9] regarding awareness of procedures to prevent COVID 2019. Random Forest is rarely implemented in research on sentiment analysis, although it has recently been investigated to gauge its accuracy. It is used by a number of researchers, including [10], who achieves an accuracy of about 0.829. Moreover, the SVM (Support Vector Machine) approach, whose accuracy is 85%, is also being investigated in sentiment analysis study by [11]. Hence, researchers want to compare the values of the 3 methods namely Naïve Bayes, Random Forest and SVM (Support Vector Machine) to find out the difference in accuracy of the three when using the same data. As for the accuracy will be calculated using the calculation on the confusion matrix. In addition, the researcher also wants to compare the results of classifying sentiment statements which are divided into positive, negative and neutral sentiments.

因此，必须像研究[5]中所做的那样进行预处理过程，该研究检索了2019冠状病毒病主题的推特数据。此外，必须对分类前必须完成的任务进行加权。这项研究使用VADER或俗称的词典。[6] 它使用一个结合了词典特征的词典作为极性评估。五个附加标准的情感得分，即感叹号、大字母表、语序水平、“但是”引起的极性变化，以及使用三元图特征研究否定[7]。一旦文本被标记，我们将使用情感分析对其进行分类。Nave Bayes技术、随机森林和SVM是一些可靠的分类，已经在许多研究（支持向量机）中得到了证明。研究人员经常使用的一种流行算法是朴素贝叶斯。以下研究人员使用朴素贝叶斯方法进行情绪分析研究：[8]分析在线商店JD.ID，[9]关于预防2019冠状病毒病的程序意识。随机森林很少用于情绪分析的研究，尽管最近已经对其准确性进行了调查。包括[10]在内的许多研究人员都在使用它，他们的准确度约为0.829。此外，[11]还在情感分析研究中研究了SVM（支持向量机）方法，其准确率为85%。因此，研究人员希望比较三种方法的值，即Naïve Bayes、随机森林和SVM（支持向量机），以找出在使用相同数据时三种方法准确性的差异。至于精度，将使用对混淆矩阵的计算来计算。此外，研究人员还想比较情绪陈述的分类结果，情绪陈述分为积极情绪、消极情绪和中性情绪。

{"title":"Sentiment Analysis and Classification of Forest Fires in Indonesia","authors":"I. Irawanto, Cynthia Widodo, Atin Hasanah, Prema Adhitya Dharma Kusumah, Kusirini Kusrini, Kusnawi Kusnawi","doi":"10.33096/ilkom.v15i1.1337.175-185","DOIUrl":"https://doi.org/10.33096/ilkom.v15i1.1337.175-185","url":null,"abstract":"so the preprocessing process must be carried out as was done in research [5], which retrieved Twitter data on the theme of COVID 2019. Moreover, weighting must be applied to tasks that must be completed prior to classification. This study uses VADER or commonly known as the lexicon. [6] It uses a lexicon that combines lexical dictionary features as a polarity assessment. Sentiment scores of 5 additional criteria, namely exclamation marks, large alphabet, level of word order, polarity shift due to the term \"but,\" and using the tri-gram feature to study negation [7]. Once the text has been labeled, we will classify it using the sentiment analysis. The Nave Bayes technique, Random Forest, and SVM are some reliable classifications that have been demonstrated in numerous research (Support Vector Machine). A popular algorithm that is frequently employed by researchers is Naïve Bayes. The following researchers have used the Naive Bayes method for sentiment analysis research: : [8] analyzing the online store JD.ID, [9] regarding awareness of procedures to prevent COVID 2019. Random Forest is rarely implemented in research on sentiment analysis, although it has recently been investigated to gauge its accuracy. It is used by a number of researchers, including [10], who achieves an accuracy of about 0.829. Moreover, the SVM (Support Vector Machine) approach, whose accuracy is 85%, is also being investigated in sentiment analysis study by [11]. Hence, researchers want to compare the values of the 3 methods namely Naïve Bayes, Random Forest and SVM (Support Vector Machine) to find out the difference in accuracy of the three when using the same data. As for the accuracy will be calculated using the calculation on the confusion matrix. In addition, the researcher also wants to compare the results of classifying sentiment statements which are divided into positive, negative and neutral sentiments.","PeriodicalId":33690,"journal":{"name":"Ilkom Jurnal Ilmiah","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42018401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

CNN Ensemble Learning Method for Transfer learning: A Review 迁移学习的CNN集成学习方法综述

Ilkom Jurnal Ilmiah

Pub Date : 2023-04-07 DOI: 10.33096/ilkom.v15i1.1541.45-63

Yudha Islami Sulistya, Elsi Titasari Br Bangun, Dyah Aruming Tyas

This study provides a review of CNN's ensemble learning method for transfer learning by highlighting sections such as review studies, datasets, pre-trained models, transfer learning, ensemble learning, and performance. The results indicate that the trend of ensemble learning, transfer learning ensemble, and transfer learning is growing every year. In 2022, there will be 35 papers reviewed related to this topic in this study. Some datasets contain apparent information starting from the dataset name, total data points, dataset splitting, target dataset availability, and type classification. ResNet-50, VGG-16, InceptionV3, and VGG-19 are used in most papers as pre-trained models and transfer learning processes. 50 (90.1%) papers use ensemble learning, and 5 (9.1%) do without ensemble learning. The reviewed paper summarizes several performance measurements, including accuracy, precision, recall, f1-score, sensitivity, specificity, training accuracy, validation accuracy, test accuracy, training losses, validation losses, test losses, training time, and AUC, DSC. In the last section, 49 papers produce the best model performance using the proposed model, and 6 other papers use DenseNet, DeQueezeNet, Extended Yager Model, InceptionV3, and ResNet-152.

本研究通过重点介绍综述研究、数据集、预训练模型、迁移学习、集成学习和表现等部分，对CNN的迁移学习集成学习方法进行了综述。研究结果表明，整体学习、迁移学习、整体学习和迁移学习呈逐年增长的趋势。2022年，本研究将对35篇与该主题相关的论文进行综述。一些数据集包含从数据集名称、总数据点、数据集拆分、目标数据集可用性和类型分类开始的明显信息。ResNet-50、VGG-16、InceptionV3和VGG-19在大多数论文中被用作预训练模型和迁移学习过程。50篇（90.1%）论文使用了集成学习，5篇（9.1%）论文没有使用集成学习。综述的论文总结了几种性能测量，包括准确性、精密度、召回率、f1分数、敏感性、特异性、训练准确性、验证准确性、测试准确性、训练损失、验证损失、测试损失、训练时间以及AUC、DSC。在最后一节中，49篇论文使用所提出的模型产生了最佳的模型性能，另外6篇论文使用DenseNet、DeQueezeNet、Extended Yager模型、InceptionV3和ResNet-152。

{"title":"CNN Ensemble Learning Method for Transfer learning: A Review","authors":"Yudha Islami Sulistya, Elsi Titasari Br Bangun, Dyah Aruming Tyas","doi":"10.33096/ilkom.v15i1.1541.45-63","DOIUrl":"https://doi.org/10.33096/ilkom.v15i1.1541.45-63","url":null,"abstract":"This study provides a review of CNN's ensemble learning method for transfer learning by highlighting sections such as review studies, datasets, pre-trained models, transfer learning, ensemble learning, and performance. The results indicate that the trend of ensemble learning, transfer learning ensemble, and transfer learning is growing every year. In 2022, there will be 35 papers reviewed related to this topic in this study. Some datasets contain apparent information starting from the dataset name, total data points, dataset splitting, target dataset availability, and type classification. ResNet-50, VGG-16, InceptionV3, and VGG-19 are used in most papers as pre-trained models and transfer learning processes. 50 (90.1%) papers use ensemble learning, and 5 (9.1%) do without ensemble learning. The reviewed paper summarizes several performance measurements, including accuracy, precision, recall, f1-score, sensitivity, specificity, training accuracy, validation accuracy, test accuracy, training losses, validation losses, test losses, training time, and AUC, DSC. In the last section, 49 papers produce the best model performance using the proposed model, and 6 other papers use DenseNet, DeQueezeNet, Extended Yager Model, InceptionV3, and ResNet-152.","PeriodicalId":33690,"journal":{"name":"Ilkom Jurnal Ilmiah","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49541906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The K-Nearest Neighbor Algorithm using Forward Selection and Backward Elimination in Predicting the Student’s Satisfaction Level of University Ichsan Gorontalo toward Online Lectures during the COVID-19 Pandemic 基于前向选择和后向消除的k近邻算法预测新冠肺炎大流行期间高伦塔洛大学学生在线课程满意度

Ilkom Jurnal Ilmiah

Pub Date : 2023-04-07 DOI: 10.33096/ilkom.v15i1.1381.118-123

Andi Bode, Z. Y. Lamasigi, Ivo Colanus Rally Drajana

Academic services are actions taken by state and private universities to provide convenience for student’s academic activities. During the current covid-19 pandemic, every university remains active in academic activities. This study aimed to apply the K-Nearest Neighbor algorithm in predicting the level of student satisfaction with online lectures at University Ichsan Gorontalo. Our main aim was to obtain quantitative information to measure student satisfaction with online lectures during the pandemic, which should be taken into account when making decisions. K-Nearest Neighbor is a non-parametric Algorithm that can be used for classification and regression, but K-Nearest Neighbor are better if feature selection is applied in selecting features that are not relevant to the model. Feature Selection used in this research is Forward Selection and Backward Elimination. Seeing the results of experiments that have been carried out with the application of the K-nearest Neighbor algorithm and the selection feature, the results of the forecasting can be used for consideration or policy in decision making. The highest level of accuracy in the K-Nearest Neighbor algorithm model used Forward Selection with an accuracy rate of 98.00%. Thus, the experimental results showed that feature selection, namely forward selection, was a better model in the relevant selection variables compared to backward elimination.

学术服务是公立和私立大学为学生的学术活动提供便利而采取的行动。在当前新冠肺炎大流行期间，每一所大学都积极开展学术活动。本研究旨在应用K-最近邻算法预测学生对Ichsan Gorontalo大学在线讲座的满意度。我们的主要目标是获得量化信息，以衡量学生在疫情期间对在线讲座的满意度，在做出决定时应将其考虑在内。K-近邻是一种非参数算法，可用于分类和回归，但如果在选择与模型无关的特征时应用特征选择，则K-近邻更好。本研究中使用的特征选择是正向选择和反向消除。从应用K近邻算法和选择特征进行的实验结果来看，预测结果可以用于决策中的考虑或策略。K-最近邻算法模型的最高准确度使用了前向选择，准确率为98.00%。因此，实验结果表明，与后向消除相比，特征选择，即前向选择在相关选择变量中是一个更好的模型。

{"title":"The K-Nearest Neighbor Algorithm using Forward Selection and Backward Elimination in Predicting the Student’s Satisfaction Level of University Ichsan Gorontalo toward Online Lectures during the COVID-19 Pandemic","authors":"Andi Bode, Z. Y. Lamasigi, Ivo Colanus Rally Drajana","doi":"10.33096/ilkom.v15i1.1381.118-123","DOIUrl":"https://doi.org/10.33096/ilkom.v15i1.1381.118-123","url":null,"abstract":"Academic services are actions taken by state and private universities to provide convenience for student’s academic activities. During the current covid-19 pandemic, every university remains active in academic activities. This study aimed to apply the K-Nearest Neighbor algorithm in predicting the level of student satisfaction with online lectures at University Ichsan Gorontalo. Our main aim was to obtain quantitative information to measure student satisfaction with online lectures during the pandemic, which should be taken into account when making decisions. K-Nearest Neighbor is a non-parametric Algorithm that can be used for classification and regression, but K-Nearest Neighbor are better if feature selection is applied in selecting features that are not relevant to the model. Feature Selection used in this research is Forward Selection and Backward Elimination. Seeing the results of experiments that have been carried out with the application of the K-nearest Neighbor algorithm and the selection feature, the results of the forecasting can be used for consideration or policy in decision making. The highest level of accuracy in the K-Nearest Neighbor algorithm model used Forward Selection with an accuracy rate of 98.00%. Thus, the experimental results showed that feature selection, namely forward selection, was a better model in the relevant selection variables compared to backward elimination.","PeriodicalId":33690,"journal":{"name":"Ilkom Jurnal Ilmiah","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48581955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Analysis of the Dynamic Source Routing Protocol on the Performance of File Transfer Protocol and Video Conference Services in the Mobile AdHoc Network Simulation 动态源路由协议对移动AdHoc网络仿真中文件传输协议和视频会议服务性能的影响分析

Ilkom Jurnal Ilmiah

Pub Date : 2023-04-07 DOI: 10.33096/ilkom.v15i1.1526.165-174

T. A. Cahyanto, Rizky Dwi Antoko, Taufiq Timur Warisaji, S. Santosa, Rodianto Rodianto

Current technological advancements make it easier for users to do their work effectively and efficiently, including the use of wireless networks to exchange data via File Transfer Protocol (FTP) and video conferencing services (VCS). A Mobile AdHoc Network (MANET) is a wireless network technology that applies a dynamic set of nodes. Data transmission on the MANET does not require the use of devices such as base stations. Because each node on the MANET can act as a router in determining the direction of the data sent, the number of nodes in the MANET will influence the quality of the data sent. Using the OPNET Modeler simulator, this paper shows how to assess the quality of FTP and VCS based on delay, jitter, and packet loss parameters. The simulation scenario employs five, fifteen, and thirty nodes with low, medium, and high traffic loads, using the Dynamic Source Routing (DSR) protocol. According to the measurement results, the FTP service with the bad category is the packet loss parameter in high traffic loads, which has the highest packet loss value of 56.6 percent with 15 nodes. In contrast, good results for VCS are only produced on the delay parameter. The jitter increases with the number of nodes, and it is 5 in this case. In all scenarios, the packet loss parameter yields poor results, with the highest packet loss value approaching 100%.

当前的技术进步使用户更容易有效地完成工作，包括使用无线网络通过文件传输协议（FTP）和视频会议服务（VCS）交换数据。移动自组织网络（MANET）是一种应用动态节点集的无线网络技术。MANET上的数据传输不需要使用诸如基站之类的设备。由于MANET上的每个节点都可以作为路由器来确定发送数据的方向，因此MANET中的节点数量将影响发送数据的质量。使用OPNETModeler模拟器，本文展示了如何根据延迟、抖动和数据包丢失参数来评估FTP和VCS的质量。模拟场景使用动态源路由（DSR）协议，使用具有低、中、高流量负载的五个、十五个和三十个节点。根据测量结果，不良类别的FTP服务是高流量负载下的丢包参数，其丢包率最高，为56.6%，有15个节点。相反，VCS的良好结果仅在延迟参数上产生。抖动随着节点数量的增加而增加，在这种情况下为5。在所有情况下，数据包丢失参数都会产生较差的结果，最高数据包丢失值接近100%。

{"title":"Analysis of the Dynamic Source Routing Protocol on the Performance of File Transfer Protocol and Video Conference Services in the Mobile AdHoc Network Simulation","authors":"T. A. Cahyanto, Rizky Dwi Antoko, Taufiq Timur Warisaji, S. Santosa, Rodianto Rodianto","doi":"10.33096/ilkom.v15i1.1526.165-174","DOIUrl":"https://doi.org/10.33096/ilkom.v15i1.1526.165-174","url":null,"abstract":"Current technological advancements make it easier for users to do their work effectively and efficiently, including the use of wireless networks to exchange data via File Transfer Protocol (FTP) and video conferencing services (VCS). A Mobile AdHoc Network (MANET) is a wireless network technology that applies a dynamic set of nodes. Data transmission on the MANET does not require the use of devices such as base stations. Because each node on the MANET can act as a router in determining the direction of the data sent, the number of nodes in the MANET will influence the quality of the data sent. Using the OPNET Modeler simulator, this paper shows how to assess the quality of FTP and VCS based on delay, jitter, and packet loss parameters. The simulation scenario employs five, fifteen, and thirty nodes with low, medium, and high traffic loads, using the Dynamic Source Routing (DSR) protocol. According to the measurement results, the FTP service with the bad category is the packet loss parameter in high traffic loads, which has the highest packet loss value of 56.6 percent with 15 nodes. In contrast, good results for VCS are only produced on the delay parameter. The jitter increases with the number of nodes, and it is 5 in this case. In all scenarios, the packet loss parameter yields poor results, with the highest packet loss value approaching 100%.","PeriodicalId":33690,"journal":{"name":"Ilkom Jurnal Ilmiah","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45941129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Analysis of the Ensemble Method Classifier's Performance on Handwritten Arabic Characters Dataset 集成方法分类器在手写阿拉伯字符数据集上的性能分析

Ilkom Jurnal Ilmiah

Pub Date : 2023-04-07 DOI: 10.33096/ilkom.v15i1.1357.186-192

Abdul Rachman Manga’, A. N. Handayani, H. Herwanto, R. A. Asmara, Yudha Islami Sulistya, Kasmira Kasmira

Arabic character handwriting is one of the patterns and characteristics of each person's writing. This characteristic makes Arabic writing more challenging if the letter recognition process is based on a dataset of Arabic scripts. This Arabic script has been presented in a dataset totaling 16800, each representing a class of hijaiyah letters starting from alif to yes, consisting of 600 data for each class. The accuracy of the data used can be increased using the ensemble method. By using multiple algorithms at simultaneously, the ensemble technique can raise the level or result of a score in machine learning. This study's primary goal is to evaluate the ensemble method classifier's performance on datasets of handwritten Arabic characters. The classifier uses the ensemble method by applying the proposed soft voting to provide a multiclass classification of three machine learning algorithms, namely, SVM, Random Forest, and Decision Tree for classification. This research process produces an accuracy value for the voting classifier of 0.988 and several

阿拉伯文字笔迹是每个人的书写模式和特征之一。如果字母识别过程基于阿拉伯语脚本数据集，那么这个特征使阿拉伯语书写更具挑战性。这个阿拉伯语脚本已经在一个总计16800个的数据集中呈现，每个数据集代表从alif到yes的一类hijaiyah字母，每个类包含600个数据。使用集成方法可以提高所用数据的准确性。通过同时使用多种算法，集成技术可以提高机器学习得分的水平或结果。本研究的主要目的是评估集成方法分类器在手写阿拉伯字符数据集上的性能。该分类器采用集成方法，应用所提出的软投票，提供SVM、Random Forest和Decision Tree三种机器学习算法的多类分类。这个研究过程产生的投票分类器的准确率值为0.988和几个

引用次数: 0

The Implementation of GLCM and ANN Methods to Identify Dragon Fruit Maturity Level GLCM和ANN方法在龙果成熟度识别中的应用

Ilkom Jurnal Ilmiah

Pub Date : 2023-04-07 DOI: 10.33096/ilkom.v15i1.1504.64-71

Muhammad Faisal, Maryam Hasan, Kartika Candra Pelangi

The identification of the maturity level of dragon fruit in this study was divided into two groups of ripeness: the unripe and the ripe. This study aims to classify the maturity level based on dragon fruit images using the feature extraction method, the gray level co-occurrence matrix (GLCM). This research method consists of converting RGB data to grayscale, image normalization, detection of dragon fruit maturity, feature extraction, and identification. Data collection from real data totaled 60 images used in this study consisting of 40 training data and 20 testing data which are RGB image data in JPG format. Each data consists of 2 maturity categories. Training data consists of 20 images of 99% ripe dragon fruit and 20 images of 85%. Meanwhile, the testing data consisted of 10 of 99% ripe dragon fruit images and 10 of 85% ripe dragon fruit images. The image data is processed into a grayscale image which then detects the ripeness of the dragon fruit. After the maturity of the dragon fruit is obtained, segmentation is carried out on the location of the dragon fruit found. Then the feature calculation is performed using the Gray Level Co-Occurrence Matrix (GLCM). The Artificial Neural Network (ANN) algorithm is used for the identification process. The final test results show that the proposed method has been able to detect dragon fruit maturity level with an accuracy of = 9/10* 100% = 90%, calculated using the confusion matrix. Thus, implementing the Gray Level Co-Occurrence Matrix

本研究火龙果的成熟度鉴定分为两组:未成熟和成熟。本研究旨在利用灰度共生矩阵(GLCM)特征提取方法对火龙果图像进行成熟度分类。本研究方法包括RGB数据灰度转换、图像归一化、火龙果成熟度检测、特征提取和识别。本研究实际数据采集的数据共60张，其中训练数据40张，测试数据20张，均为JPG格式的RGB图像数据。每个数据由2个成熟度类别组成。训练数据由20张成熟度99%的火龙果图像和20张成熟度85%的火龙果图像组成。同时，测试数据为10张99%成熟火龙果图像和10张85%成熟火龙果图像。图像数据被处理成灰度图像，然后检测火龙果的成熟度。获得火龙果成熟后，对发现火龙果的位置进行分割。然后利用灰度共生矩阵(GLCM)进行特征计算。识别过程采用人工神经网络(ANN)算法。最终的测试结果表明，该方法能够检测火龙果的成熟度等级，准确率为= 9/10* 100% = 90%，使用混淆矩阵计算。从而实现灰度共生矩阵

{"title":"The Implementation of GLCM and ANN Methods to Identify Dragon Fruit Maturity Level","authors":"Muhammad Faisal, Maryam Hasan, Kartika Candra Pelangi","doi":"10.33096/ilkom.v15i1.1504.64-71","DOIUrl":"https://doi.org/10.33096/ilkom.v15i1.1504.64-71","url":null,"abstract":"The identification of the maturity level of dragon fruit in this study was divided into two groups of ripeness: the unripe and the ripe. This study aims to classify the maturity level based on dragon fruit images using the feature extraction method, the gray level co-occurrence matrix (GLCM). This research method consists of converting RGB data to grayscale, image normalization, detection of dragon fruit maturity, feature extraction, and identification. Data collection from real data totaled 60 images used in this study consisting of 40 training data and 20 testing data which are RGB image data in JPG format. Each data consists of 2 maturity categories. Training data consists of 20 images of 99% ripe dragon fruit and 20 images of 85%. Meanwhile, the testing data consisted of 10 of 99% ripe dragon fruit images and 10 of 85% ripe dragon fruit images. The image data is processed into a grayscale image which then detects the ripeness of the dragon fruit. After the maturity of the dragon fruit is obtained, segmentation is carried out on the location of the dragon fruit found. Then the feature calculation is performed using the Gray Level Co-Occurrence Matrix (GLCM). The Artificial Neural Network (ANN) algorithm is used for the identification process. The final test results show that the proposed method has been able to detect dragon fruit maturity level with an accuracy of = 9/10* 100% = 90%, calculated using the confusion matrix. Thus, implementing the Gray Level Co-Occurrence Matrix","PeriodicalId":33690,"journal":{"name":"Ilkom Jurnal Ilmiah","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45910403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Implementation of RFM Method and K-Means Algorithm for Customer Segmentation in E-Commerce with Streamlit 流媒体电子商务中客户细分的RFM方法和K-Means算法的实现

Ilkom Jurnal Ilmiah

Pub Date : 2023-04-07 DOI: 10.33096/ilkom.v15i1.1524.32-44

F. Alzami, Fikri Diva Sambasri, Mira Nabila, Rama Aria Megantara, Ahmad Akrom, R. A. Pramunendar, D. P. Prabowo, Puri Sulistiyawati

E-commerce is selling and buying goods through an online or online system. One of the business models in which consumers sell products to other consumers is the Customer to Customer (C2C) business model. One thing that needs to be considered in the business model is knowing the level of customer loyalty. By knowing the level of customer loyalty, the company can provide several different treatments to its customers to maintain good relationships with customers and increase product purchase revenue. In this study, the author wants to segment customers on data in E-commerce companies in Brazil using the K-Means clustering algorithm using the RFM (Recency, Frequency, Monetary) feature and display it in the form of a dashboard using the Streamlit framework. Several stages of research must be carried out. Firstly, taking data from the open public data site (Kaggle), then merging the data to select some data that needs to be used, understanding data by displaying it in graphic form, and conducting data selection to select features/attributes. The step follows the proposed method, performs data preprocessing, creates a model to get the cluster, and finally displays it as a dashboard using Streamlit. Based on the results of the research that has been done, the number of clusters is 4 clusters with the evaluation value of the model using the silhouette score is 0.470.

电子商务是通过在线或在线系统销售和购买商品。消费者向其他消费者销售产品的商业模式之一是客户对客户（C2C）商业模式。在商业模式中需要考虑的一件事是了解客户忠诚度的水平。通过了解客户忠诚度的水平，公司可以为客户提供几种不同的待遇，以保持与客户的良好关系并增加产品购买收入。在这项研究中，作者希望使用K-Means聚类算法，使用RFM（Recency，Frequency，Monetary）功能，对巴西电子商务公司的客户数据进行细分，并使用Streamlight框架以仪表板的形式显示。必须进行几个阶段的研究。首先，从开放的公共数据站点（Kaggle）获取数据，然后合并数据以选择一些需要使用的数据，通过以图形形式显示数据来理解数据，并进行数据选择以选择特征/属性。该步骤遵循所提出的方法，执行数据预处理，创建一个模型以获得集群，并最终使用Streamlight将其显示为仪表板。根据已经完成的研究结果，聚类数量为4个聚类，使用剪影得分的模型的评估值为0.470。

{"title":"Implementation of RFM Method and K-Means Algorithm for Customer Segmentation in E-Commerce with Streamlit","authors":"F. Alzami, Fikri Diva Sambasri, Mira Nabila, Rama Aria Megantara, Ahmad Akrom, R. A. Pramunendar, D. P. Prabowo, Puri Sulistiyawati","doi":"10.33096/ilkom.v15i1.1524.32-44","DOIUrl":"https://doi.org/10.33096/ilkom.v15i1.1524.32-44","url":null,"abstract":"E-commerce is selling and buying goods through an online or online system. One of the business models in which consumers sell products to other consumers is the Customer to Customer (C2C) business model. One thing that needs to be considered in the business model is knowing the level of customer loyalty. By knowing the level of customer loyalty, the company can provide several different treatments to its customers to maintain good relationships with customers and increase product purchase revenue. In this study, the author wants to segment customers on data in E-commerce companies in Brazil using the K-Means clustering algorithm using the RFM (Recency, Frequency, Monetary) feature and display it in the form of a dashboard using the Streamlit framework. Several stages of research must be carried out. Firstly, taking data from the open public data site (Kaggle), then merging the data to select some data that needs to be used, understanding data by displaying it in graphic form, and conducting data selection to select features/attributes. The step follows the proposed method, performs data preprocessing, creates a model to get the cluster, and finally displays it as a dashboard using Streamlit. Based on the results of the research that has been done, the number of clusters is 4 clusters with the evaluation value of the model using the silhouette score is 0.470.","PeriodicalId":33690,"journal":{"name":"Ilkom Jurnal Ilmiah","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49395471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Identification of the Freshness Level of Tuna based on Discrete Cosine Transform on Feature Extraction of Gray Level Co-Occurrence Matrix using K-Nearest Neighbor 基于离散余弦变换的金枪鱼新鲜度识别基于K-近邻的灰度共生矩阵特征提取

Ilkom Jurnal Ilmiah

Pub Date : 2023-04-07 DOI: 10.33096/ilkom.v15i1.1426.153-164

Z. Y. Lamasigi, Serwin Serwin, Yusrianto Malago

Gorontalo Province is one of the provinces that have fishery potential and has a large sea area that can be managed to support the economy and development of the province. Gorontalo is also one of the tuna-producing provinces in Indonesia, where tuna is also one of the mainstay fisheries commodities. This study aimed to combine transformation and texture feature extraction methods to improve the identification of the freshness level of tuna. This research used Discrete Cosine Transform as transformation detection and Gray Level Co-Occurrence Matrix as texture feature extraction. To find out the value of the proximity of the training data and image testing of tuna fish, the K-Nearest Neighbor classification method was employed. Then, the Confusion Matrix was used to calculate the accuracy level of the K-Nearest Neighbor classification. This research was carried out with 4 stages of testing, namely at angles of 0  , 45  , 90 

戈伦塔洛省是具有渔业潜力的省份之一，拥有可管理的大片海域，以支持该省的经济和发展。戈伦塔洛也是印度尼西亚金枪鱼生产省之一，金枪鱼也是印尼主要的渔业商品之一。本研究旨在将变换和纹理特征提取方法相结合，提高金枪鱼新鲜度的识别水平。本研究采用离散余弦变换作为变换检测，灰度共生矩阵作为纹理特征提取。为了找出训练数据和金枪鱼图像测试的接近度的值，采用了K近邻分类方法。然后，使用混淆矩阵来计算K近邻分类的准确度水平。这项研究分4个测试阶段进行，即角度为0 , 45 , 90

引用次数: 0

Abstractive Text Summarization using Pre-Trained Language Model "Text-to-Text Transfer Transformer (T5)" 基于预训练语言模型“文本到文本转换转换器(T5)”的抽象文本摘要

Ilkom Jurnal Ilmiah

Pub Date : 2023-04-07 DOI: 10.33096/ilkom.v15i1.1532.124-131

Qurrota A’yuna Itsnaini, Mardhiya Hayaty, Andriyan Dwi Putra, N. Jabari

Automatic Text Summarization (ATS) is one of the utilizations of technological sophistication in terms of text processing assisting humans in producing a summary or key points of a document in large quantities. We use Indonesian language as objects because there are few resources in NLP research using Indonesian language. This paper utilized PLTMs (Pre-Trained Language Models) from the transformer architecture, namely T5 (Text-to-Text Transfer Transformer) which has been completed previously with a larger dataset. Evaluation in this study was measured through comparison of the ROUGE (Recall-Oriented Understudy for Gisting Evaluation) calculation results between the reference summary and the model summary. The experiments with the pre-trained t5-base model with fine tuning parameters of 220M for the Indonesian news dataset yielded relatively high ROUGE values, namely ROUGE-1 = 0.68, ROUGE-2 = 0.61, and ROUGE-L = 0.65. The evaluation value worked well, but the resulting model has not achieved satisfactory results because in terms of abstraction, the model did not work optimally. We also found several errors in the reference summary in the dataset used.

自动文本摘要(Automatic Text Summarization, ATS)是在文本处理方面利用复杂的技术来帮助人类大量生成文档的摘要或要点的一种方法。我们之所以选择印尼语作为研究对象，是因为目前使用印尼语的自然语言处理研究资源很少。本文利用了来自转换器架构的pltm(预训练语言模型)，即T5(文本到文本传输转换器)，该转换器之前已经完成了一个更大的数据集。本研究的评价是通过比较参考总结和模型总结的ROUGE (Recall-Oriented Understudy for Gisting Evaluation)计算结果来衡量的。对印尼新闻数据集使用预训练的t5基模型和微调参数为220M的实验得到了较高的ROUGE值，即ROUGE-1 = 0.68, ROUGE-2 = 0.61, ROUGE- l = 0.65。评价值工作得很好，但由于在抽象方面，模型没有达到最优的效果，所以最终的模型并没有取得令人满意的结果。我们还在使用的数据集中发现了参考摘要中的几个错误。

{"title":"Abstractive Text Summarization using Pre-Trained Language Model \"Text-to-Text Transfer Transformer (T5)\"","authors":"Qurrota A’yuna Itsnaini, Mardhiya Hayaty, Andriyan Dwi Putra, N. Jabari","doi":"10.33096/ilkom.v15i1.1532.124-131","DOIUrl":"https://doi.org/10.33096/ilkom.v15i1.1532.124-131","url":null,"abstract":"Automatic Text Summarization (ATS) is one of the utilizations of technological sophistication in terms of text processing assisting humans in producing a summary or key points of a document in large quantities. We use Indonesian language as objects because there are few resources in NLP research using Indonesian language. This paper utilized PLTMs (Pre-Trained Language Models) from the transformer architecture, namely T5 (Text-to-Text Transfer Transformer) which has been completed previously with a larger dataset. Evaluation in this study was measured through comparison of the ROUGE (Recall-Oriented Understudy for Gisting Evaluation) calculation results between the reference summary and the model summary. The experiments with the pre-trained t5-base model with fine tuning parameters of 220M for the Indonesian news dataset yielded relatively high ROUGE values, namely ROUGE-1 = 0.68, ROUGE-2 = 0.61, and ROUGE-L = 0.65. The evaluation value worked well, but the resulting model has not achieved satisfactory results because in terms of abstraction, the model did not work optimally. We also found several errors in the reference summary in the dataset used.","PeriodicalId":33690,"journal":{"name":"Ilkom Jurnal Ilmiah","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46911410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0