In recent years, many crimes use technology to generate someone's face which has a bad effect on that person. Generative adversarial network is a method to generate fake images using discriminators and generators. Conventional GAN involved binary cross entropy loss for discriminator training to classify original image from dataset and fake image that generated from generator. However, use of binary cross entropy loss cannot provided gradient information to generator in creating a good fake image. When generator creates a fake image, discriminator only gives a little feedback (gradient information) to generator update its model. It causes generator take a long time to update the model. To solve this problem, there is an LSGAN that used a loss function (least squared loss). Discriminator can provide astrong gradient signal to generator update the model even though image was far from decision boundary. In making fake images, researchers used Least Squares GAN (LSGAN) with discriminator-1 loss value is 0.0061, discriminator-2 loss value is 0.0036, and generator loss value is 0.575. With the small loss value of the three important components, discriminator accuracy value in terms of classification reaches 95% for original image and 99% for fake image. In classified original image and fake image in this studyusing a supervised contrastive loss classification model with an accuracy value of 99.93%.
{"title":"Implementation of Generative Adversarial Network to Generate Fake Face Image","authors":"Jasman Pardede, Anisa Putri Setyaningrum","doi":"10.15575/join.v8i1.790","DOIUrl":"https://doi.org/10.15575/join.v8i1.790","url":null,"abstract":"In recent years, many crimes use technology to generate someone's face which has a bad effect on that person. Generative adversarial network is a method to generate fake images using discriminators and generators. Conventional GAN involved binary cross entropy loss for discriminator training to classify original image from dataset and fake image that generated from generator. However, use of binary cross entropy loss cannot provided gradient information to generator in creating a good fake image. When generator creates a fake image, discriminator only gives a little feedback (gradient information) to generator update its model. It causes generator take a long time to update the model. To solve this problem, there is an LSGAN that used a loss function (least squared loss). Discriminator can provide astrong gradient signal to generator update the model even though image was far from decision boundary. In making fake images, researchers used Least Squares GAN (LSGAN) with discriminator-1 loss value is 0.0061, discriminator-2 loss value is 0.0036, and generator loss value is 0.575. With the small loss value of the three important components, discriminator accuracy value in terms of classification reaches 95% for original image and 99% for fake image. In classified original image and fake image in this studyusing a supervised contrastive loss classification model with an accuracy value of 99.93%.","PeriodicalId":32019,"journal":{"name":"JOIN Jurnal Online Informatika","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80622150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Umar Aditiawarman, Mega Lumbia, T. Mantoro, A. Ibrahim
Social media especially Twitter has been used by corporation or organization as an effective tool to interact and communicate with the consumers. Holywings is one of the popular restaurants in Indonesia that use social media as a tool to promote and disseminate information regarding their products and services. However, one of their promotional items has gone viral and invited public protests which turned into a trending topic on Twitter for a couple of weeks. Holywings allegedly improperly promoted their products by using the most honorable names, “Muhammad” and “Maria”. Social network analysis of Twitter data is conducted to identify and examine information circulating among the users, which leads to wider public attention and law enforcement. In this study, we focused on the conversation about Holywings on Twitter from 24 June to 31 July 2022. The analysis was carried out using Python to retrieve data and Gephi software to visualize the interactions and the intensity of the network group in viewing the spread of information. The findings reveal the centrality account that caused the news to go viral are the CNN Indonesia (@CNNIndonesia) news media account and Haris Pertama (@knpiharis), with a centrality of 0.161 and 0.282, respectively. There are also 121 groups involved in the conversation with modularity of 0.821.
社交媒体,尤其是Twitter,已经被企业或组织用作与消费者互动和沟通的有效工具。Holywings是印度尼西亚最受欢迎的餐厅之一,他们使用社交媒体作为宣传和传播其产品和服务信息的工具。然而,他们的一件促销品却在网上疯传,引发了公众的抗议,并成为推特上的热门话题,持续了几周。据称Holywings不正当地使用最尊贵的名字“穆罕默德”和“玛丽亚”来推广他们的产品。对Twitter数据进行社交网络分析,以识别和检查用户之间传播的信息,从而引起更广泛的公众关注和执法。在这项研究中,我们关注的是2022年6月24日至7月31日Twitter上关于霍利维恩的对话。分析使用Python检索数据,使用Gephi软件可视化网络群体在查看信息传播时的相互作用和强度。研究结果显示,导致新闻病毒式传播的中心性账户是CNN印度尼西亚(@CNNIndonesia)新闻媒体账户和Haris Pertama (@knpiharis),中心性分别为0.161和0.282。有121个群组参与对话,模块性为0.821。
{"title":"Social Network Analysis: Identification of Communication and Information Dissemination (Case Study of Holywings)","authors":"Umar Aditiawarman, Mega Lumbia, T. Mantoro, A. Ibrahim","doi":"10.15575/join.v8i1.911","DOIUrl":"https://doi.org/10.15575/join.v8i1.911","url":null,"abstract":"Social media especially Twitter has been used by corporation or organization as an effective tool to interact and communicate with the consumers. Holywings is one of the popular restaurants in Indonesia that use social media as a tool to promote and disseminate information regarding their products and services. However, one of their promotional items has gone viral and invited public protests which turned into a trending topic on Twitter for a couple of weeks. Holywings allegedly improperly promoted their products by using the most honorable names, “Muhammad” and “Maria”. Social network analysis of Twitter data is conducted to identify and examine information circulating among the users, which leads to wider public attention and law enforcement. In this study, we focused on the conversation about Holywings on Twitter from 24 June to 31 July 2022. The analysis was carried out using Python to retrieve data and Gephi software to visualize the interactions and the intensity of the network group in viewing the spread of information. The findings reveal the centrality account that caused the news to go viral are the CNN Indonesia (@CNNIndonesia) news media account and Haris Pertama (@knpiharis), with a centrality of 0.161 and 0.282, respectively. There are also 121 groups involved in the conversation with modularity of 0.821.","PeriodicalId":32019,"journal":{"name":"JOIN Jurnal Online Informatika","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83123242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Traditional methods of detecting cardiac illness are often problematic in the medical field. The doctor must next study and interpret the findings of the patient's medical record received from the electrocardiogram and echocardiogram. These tasks often take a long time and require patience. The use of computational technology in medicine, especially the study of cardiac disease, is not new. Scientists are continuously striving for the most reliable method of diagnosing a patient's cardiac illness, particularly when an integrated system is constructed. The study attempted to propose an alternative for identifying cardiac illness using a supervised learning technique, namely the multi-layer perceptron (MLP). The study started with the collection of patient medical record data, which yielded up to 534 data points, followed by pre-processing and transformation to provide up to 324 data points suitable to be employed by learning algorithms. The last step is to create a heart disease classification model with distinct activation functions using MLP. The degree of classification accuracy, k-fold cross-validation, and bootstrap are all used to test the model. According to the findings of the study, MLP with the Tanh activation function is a more accurate prediction model than logistics and Relu. The classification accuracy level (CA) for MLP with Tanh and k-fold cross-validation is 0.788 in a data-sharing situation, while it is 0.672 with Bootstrap. MLP using the Tanh activation function is the best model based on the CA level and the AUC value, with values of 0.832 (k-fold cross-validation) and 0.857 (bootstrap).
{"title":"Data Mining for Heart Disease Prediction Based on Echocardiogram and Electrocardiogram Data","authors":"Tb Ai Munandar","doi":"10.15575/join.v8i1.1027","DOIUrl":"https://doi.org/10.15575/join.v8i1.1027","url":null,"abstract":"Traditional methods of detecting cardiac illness are often problematic in the medical field. The doctor must next study and interpret the findings of the patient's medical record received from the electrocardiogram and echocardiogram. These tasks often take a long time and require patience. The use of computational technology in medicine, especially the study of cardiac disease, is not new. Scientists are continuously striving for the most reliable method of diagnosing a patient's cardiac illness, particularly when an integrated system is constructed. The study attempted to propose an alternative for identifying cardiac illness using a supervised learning technique, namely the multi-layer perceptron (MLP). The study started with the collection of patient medical record data, which yielded up to 534 data points, followed by pre-processing and transformation to provide up to 324 data points suitable to be employed by learning algorithms. The last step is to create a heart disease classification model with distinct activation functions using MLP. The degree of classification accuracy, k-fold cross-validation, and bootstrap are all used to test the model. According to the findings of the study, MLP with the Tanh activation function is a more accurate prediction model than logistics and Relu. The classification accuracy level (CA) for MLP with Tanh and k-fold cross-validation is 0.788 in a data-sharing situation, while it is 0.672 with Bootstrap. MLP using the Tanh activation function is the best model based on the CA level and the AUC value, with values of 0.832 (k-fold cross-validation) and 0.857 (bootstrap).","PeriodicalId":32019,"journal":{"name":"JOIN Jurnal Online Informatika","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78749825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Patrol Information System for the Prevention of Forest Land Fires (SIPP Karhutla) in Indonesia is a tool for assisting patrol activities for controlling forest and land fires in Indonesia. The addition of Karhutla SIPP users causes the need for system scalability testing. This study aims to perform non-functional testing that focuses on scalability testing. The steps in scalability testing include creating schemas, conducting tests, and analyzing results. There are five schemes with a total sample of 700 samples. Testing was carried out using the JMeter automation testing tool assisted by Blazemeter in creating scripts. The scalability test parameter has three parameters: average CPU usage, memory usage, and network usage. The test results show that the CPU capacity used can handle up to 700 users, while with a memory capacity of 8GB it can handle up to 420 users. All users is the user menu that has the highest value for each test parameter The average value of CPU usage is 44.8%, the average memory usage is 69.48% and the average network usage is 2.8 Mb/s. In minimizing server performance, the tile cache map method can be applied to the system and can increase the memory capacity used.
{"title":"Scalability Testing of Land Forest Fire Patrol Information Systems","authors":"Ahmad Khusaeri, I. S. Sitanggang, H. Rahmawan","doi":"10.15575/join.v8i1.977","DOIUrl":"https://doi.org/10.15575/join.v8i1.977","url":null,"abstract":"The Patrol Information System for the Prevention of Forest Land Fires (SIPP Karhutla) in Indonesia is a tool for assisting patrol activities for controlling forest and land fires in Indonesia. The addition of Karhutla SIPP users causes the need for system scalability testing. This study aims to perform non-functional testing that focuses on scalability testing. The steps in scalability testing include creating schemas, conducting tests, and analyzing results. There are five schemes with a total sample of 700 samples. Testing was carried out using the JMeter automation testing tool assisted by Blazemeter in creating scripts. The scalability test parameter has three parameters: average CPU usage, memory usage, and network usage. The test results show that the CPU capacity used can handle up to 700 users, while with a memory capacity of 8GB it can handle up to 420 users. All users is the user menu that has the highest value for each test parameter The average value of CPU usage is 44.8%, the average memory usage is 69.48% and the average network usage is 2.8 Mb/s. In minimizing server performance, the tile cache map method can be applied to the system and can increase the memory capacity used.","PeriodicalId":32019,"journal":{"name":"JOIN Jurnal Online Informatika","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84060062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
There are so many cat races in the world. Ignorance in recognizing cat breeds will be dangerous if the cat being kept is affected by a disease, which allows mishandling of the cat being kept. In addition, many cat breeds have different foods from one race to another. The problem is that a cat caretaker cannot easily recognize the cat breed. Therefore, technology needs to help a cat caretaker to treat cats appropriately. In this study, we proposed a Machine Learning approach to recognize cat breeds. This study aims to identify the cat breed from the cat images then deployed on an Android smartphone. It was tested with data from cat images of 13 races. The classification method applied in this study uses the Convolutional Neural Network (CNN) algorithm using transfer learning. The base models tested are MobilenetV2, VGG16, and InceptionV3. The results tested using several models and through several experimental scenarios produced the best classification model with an accuracy of 82% with MobilenetV2. The model with the best accuracy is then embedded in an application with the Android operating system. Then the application is named Catbreednet.
{"title":"Catbreedsnet: An Android Application for Cat Breed Classification Using Convolutional Neural Networks","authors":"Anugrah Tri Ramadhan, Abas Setiawan","doi":"10.15575/join.v8i1.1007","DOIUrl":"https://doi.org/10.15575/join.v8i1.1007","url":null,"abstract":"There are so many cat races in the world. Ignorance in recognizing cat breeds will be dangerous if the cat being kept is affected by a disease, which allows mishandling of the cat being kept. In addition, many cat breeds have different foods from one race to another. The problem is that a cat caretaker cannot easily recognize the cat breed. Therefore, technology needs to help a cat caretaker to treat cats appropriately. In this study, we proposed a Machine Learning approach to recognize cat breeds. This study aims to identify the cat breed from the cat images then deployed on an Android smartphone. It was tested with data from cat images of 13 races. The classification method applied in this study uses the Convolutional Neural Network (CNN) algorithm using transfer learning. The base models tested are MobilenetV2, VGG16, and InceptionV3. The results tested using several models and through several experimental scenarios produced the best classification model with an accuracy of 82% with MobilenetV2. The model with the best accuracy is then embedded in an application with the Android operating system. Then the application is named Catbreednet.","PeriodicalId":32019,"journal":{"name":"JOIN Jurnal Online Informatika","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83112131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
B. Subaeki, Y. A. Gerhana, Meta Barokatul Karomah Rusyana, K. Manaf
Pornography is a severe problem in Indonesia, apart from drugs. This can be seen based on data from the Ministry of Communication and Informatics in 2021 which found 1.1 million pornographic content online. The increasing number of access to pornographic content sites on the internet can prove this. Several studies have been conducted to produce preventive formulas. However, this research flow has not been effective in solving the problem. This is because the results of the identification value in the output image obtained are not quite right. This study proposes a procedure for identifying pornographic content in digital images as an alternative approach for the early stages of a destructive content access prevention system. The formulation uses the YCbCr color space to analyze human skin on image objects that represent exposed body parts and the classification process with the Neuro Fuzzy approach. The performance of this formula was tested on 100 digital images of random categories of human objects (usually covered, skimpy, and naked) taken from the internet. The test results are at a relatively good level of accuracy, with a weight of 70% for the entire test data.
{"title":"Digital Image Processing Using YCbCr Colour Space and Neuro Fuzzy to Identify Pornography","authors":"B. Subaeki, Y. A. Gerhana, Meta Barokatul Karomah Rusyana, K. Manaf","doi":"10.15575/join.v8i1.1070","DOIUrl":"https://doi.org/10.15575/join.v8i1.1070","url":null,"abstract":"Pornography is a severe problem in Indonesia, apart from drugs. This can be seen based on data from the Ministry of Communication and Informatics in 2021 which found 1.1 million pornographic content online. The increasing number of access to pornographic content sites on the internet can prove this. Several studies have been conducted to produce preventive formulas. However, this research flow has not been effective in solving the problem. This is because the results of the identification value in the output image obtained are not quite right. This study proposes a procedure for identifying pornographic content in digital images as an alternative approach for the early stages of a destructive content access prevention system. The formulation uses the YCbCr color space to analyze human skin on image objects that represent exposed body parts and the classification process with the Neuro Fuzzy approach. The performance of this formula was tested on 100 digital images of random categories of human objects (usually covered, skimpy, and naked) taken from the internet. The test results are at a relatively good level of accuracy, with a weight of 70% for the entire test data.","PeriodicalId":32019,"journal":{"name":"JOIN Jurnal Online Informatika","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75386353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sumatra and Borneo are areas consisting of rainforests with a high vulnerability to fire. Both areas are in the tropics which experience rainy and dry seasons annually. The long dry season such as in 2019 triggered forest and land fires in Borneo and Sumatra, causing haze disasters in the exposed areas. This indicates that climate variables play a role in burning forests and land in Borneo and Sumatra, but how climate affects the fires in both areas is still questionable. This study investigates the climate variables: temperature, humidity, precipitation, and wind speed in relation to the fire’s characteristics in Borneo and Sumatra. We use the Random Forest model to determine the characteristics of forest fires in Sumatra and Borneo based on the climate variables and carbon emission levels. According to the model, the fire event in Sumatra is slightly better predicted than in Borneo, indicating a climate-fire dependence is more prominent in Sumatra. Nevertheless, a maximum temperature variable is seemingly an important indicator for forest and land fire in both domains as it gives the largest contribution to the carbon emission.
{"title":"Comparative Analysis of Machine Learning-based Forest Fire Characteristics in Sumatra and Borneo","authors":"Ayu Shabrina, Intan Nuni Wahyuni, A. Latifah","doi":"10.15575/join.v8i1.1035","DOIUrl":"https://doi.org/10.15575/join.v8i1.1035","url":null,"abstract":"Sumatra and Borneo are areas consisting of rainforests with a high vulnerability to fire. Both areas are in the tropics which experience rainy and dry seasons annually. The long dry season such as in 2019 triggered forest and land fires in Borneo and Sumatra, causing haze disasters in the exposed areas. This indicates that climate variables play a role in burning forests and land in Borneo and Sumatra, but how climate affects the fires in both areas is still questionable. This study investigates the climate variables: temperature, humidity, precipitation, and wind speed in relation to the fire’s characteristics in Borneo and Sumatra. We use the Random Forest model to determine the characteristics of forest fires in Sumatra and Borneo based on the climate variables and carbon emission levels. According to the model, the fire event in Sumatra is slightly better predicted than in Borneo, indicating a climate-fire dependence is more prominent in Sumatra. Nevertheless, a maximum temperature variable is seemingly an important indicator for forest and land fire in both domains as it gives the largest contribution to the carbon emission.","PeriodicalId":32019,"journal":{"name":"JOIN Jurnal Online Informatika","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77108494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This research paper proposes a novel method, Exponential Moving Average Long Short-Term Memory (EMA LSTM), for multi-step vector output prediction of time series data using deep learning. The method combines the LSTM with the exponential moving average (EMA) technique to reduce noise in the data and improve the accuracy of prediction. The research compares the performance of EMA LSTM to other commonly used deep learning models, including LSTM, GRU, RNN, and CNN, and evaluates the results using statistical tests. The dataset used in this study contains daily stock market prices for several years, with inputs of 60, 90, and 120 previous days, and predictions for the next 20 and 30 days. The results show that the EMA LSTM method outperforms other models in terms of accuracy, with lower RMSE and MAPE values. This study has important implications for real-world applications, such as stock market forecasting and climate prediction, and highlights the importance of careful preprocessing of the data to improve the performance of deep learning models.
{"title":"Multi-Step Vector Output Prediction of Time Series Using EMA LSTM","authors":"Mohammad Diqi, Ahmad Sahal, Farida Nur Aini","doi":"10.15575/join.v8i1.1037","DOIUrl":"https://doi.org/10.15575/join.v8i1.1037","url":null,"abstract":"This research paper proposes a novel method, Exponential Moving Average Long Short-Term Memory (EMA LSTM), for multi-step vector output prediction of time series data using deep learning. The method combines the LSTM with the exponential moving average (EMA) technique to reduce noise in the data and improve the accuracy of prediction. The research compares the performance of EMA LSTM to other commonly used deep learning models, including LSTM, GRU, RNN, and CNN, and evaluates the results using statistical tests. The dataset used in this study contains daily stock market prices for several years, with inputs of 60, 90, and 120 previous days, and predictions for the next 20 and 30 days. The results show that the EMA LSTM method outperforms other models in terms of accuracy, with lower RMSE and MAPE values. This study has important implications for real-world applications, such as stock market forecasting and climate prediction, and highlights the importance of careful preprocessing of the data to improve the performance of deep learning models.","PeriodicalId":32019,"journal":{"name":"JOIN Jurnal Online Informatika","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85098274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Stunting is a disease caused by malnutrition in children, which results in slow growth. Generally, stunting is characterized by a lack of weight and height in young children. This study aims to classify stunting in children aged 0-60 months using the Decision Tree C4.5 method based on z-score calculations with a sample size of 224 records, consisting of 4 attributes and 1 label, namely Gender, Age, Weight, Height, and Nutritional Status. The results of the study obtained a C4.5 decision tree where the Age variable influenced the classification of stunting with the highest Gain Ratio of 0.185016337. Meanwhile, the evaluation of the model using the Confusion matrix resulted in the highest accuracy of 61.82% and AUC of 0.584.
{"title":"Classification of Stunting in Children Using the C4.5 Algorithm","authors":"Muhajir Yunus, M. K. Biddinika, Abdul Fadlil","doi":"10.15575/join.v8i1.1062","DOIUrl":"https://doi.org/10.15575/join.v8i1.1062","url":null,"abstract":"Stunting is a disease caused by malnutrition in children, which results in slow growth. Generally, stunting is characterized by a lack of weight and height in young children. This study aims to classify stunting in children aged 0-60 months using the Decision Tree C4.5 method based on z-score calculations with a sample size of 224 records, consisting of 4 attributes and 1 label, namely Gender, Age, Weight, Height, and Nutritional Status. The results of the study obtained a C4.5 decision tree where the Age variable influenced the classification of stunting with the highest Gain Ratio of 0.185016337. Meanwhile, the evaluation of the model using the Confusion matrix resulted in the highest accuracy of 61.82% and AUC of 0.584.","PeriodicalId":32019,"journal":{"name":"JOIN Jurnal Online Informatika","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78047250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Sumanto, A. Buono, K. Priandana, Bib Paruhum Silalahi, Elisabeth Sri Hendrastuti
Plant diseases significantly threaten agricultural productivity, necessitating accurate identification and classification of plant lesions for improved crop quality. Citrus plants, belonging to the Rutaceae family, are highly susceptible to diseases such as citrus canker, black spot, and the devastating Huanglongbing (HLB) disease. HLB, caused by gram-negative proteobacteria strains, severely impacts citrus orchards globally, resulting in economic losses. Early detection and classification of HLB-infected plants are crucial for effective disease management. Traditional approaches rely on expert knowledge and time-consuming laboratory tests, hindering rapid detection. This study explores an alternative method using the BEMD algorithm for texture feature extraction and SVM classification to improve HLB diagnosis. The BEMD algorithm decomposes citrus leaf images into Intrinsic Mode Functions (IMFs) and a residue component. Classification experiments were conducted using SVM on IMF 1, IMF 2, and residue features. The residue component provided the most outstanding level of classification accuracy, reaching 77% for two classes, 72% for three types, and 61% for four classes. In two categories, IMF 1 performed at a 72% accuracy rate, and in four other areas, it performed at a 51% accuracy rate, making it competitive. IMF 2 demonstrated lower accuracy, ranging from 43% for three classes to 57% for two categories. The findings highlight the significance of the image residue component, outperforming IMF features in HLB classification accuracy. The BEMD algorithm coupled with SVM classification presents a promising approach for accurate HLB diagnosis, surpassing the performance of previous studies using GLCM-SVM techniques.
{"title":"Texture Analysis of Citrus Leaf Images Using BEMD for Huanglongbing Disease Diagnosis","authors":"S. Sumanto, A. Buono, K. Priandana, Bib Paruhum Silalahi, Elisabeth Sri Hendrastuti","doi":"10.15575/join.v8i1.1075","DOIUrl":"https://doi.org/10.15575/join.v8i1.1075","url":null,"abstract":"Plant diseases significantly threaten agricultural productivity, necessitating accurate identification and classification of plant lesions for improved crop quality. Citrus plants, belonging to the Rutaceae family, are highly susceptible to diseases such as citrus canker, black spot, and the devastating Huanglongbing (HLB) disease. HLB, caused by gram-negative proteobacteria strains, severely impacts citrus orchards globally, resulting in economic losses. Early detection and classification of HLB-infected plants are crucial for effective disease management. Traditional approaches rely on expert knowledge and time-consuming laboratory tests, hindering rapid detection. This study explores an alternative method using the BEMD algorithm for texture feature extraction and SVM classification to improve HLB diagnosis. The BEMD algorithm decomposes citrus leaf images into Intrinsic Mode Functions (IMFs) and a residue component. Classification experiments were conducted using SVM on IMF 1, IMF 2, and residue features. The residue component provided the most outstanding level of classification accuracy, reaching 77% for two classes, 72% for three types, and 61% for four classes. In two categories, IMF 1 performed at a 72% accuracy rate, and in four other areas, it performed at a 51% accuracy rate, making it competitive. IMF 2 demonstrated lower accuracy, ranging from 43% for three classes to 57% for two categories. The findings highlight the significance of the image residue component, outperforming IMF features in HLB classification accuracy. The BEMD algorithm coupled with SVM classification presents a promising approach for accurate HLB diagnosis, surpassing the performance of previous studies using GLCM-SVM techniques. \u0000 ","PeriodicalId":32019,"journal":{"name":"JOIN Jurnal Online Informatika","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74415286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}