首页 > 最新文献

JUITA : Jurnal Informatika最新文献

英文 中文
Number of Cyber Attacks Predicted With Deep Learning Based LSTM Model 利用基于深度学习的 LSTM 模型预测的网络攻击数量
Pub Date : 2024-05-20 DOI: 10.30595/juita.v12i1.20210
Joko Siswanto, Irwan Sembiring, Adi Setiawan, Iwan Setyawan
The increasing number of cyber attacks will result in various damages to the functioning of technological infrastructure. A prediction model for the number of cyber attacks based on the type of attack, handling actions and severity using time-series data has never been done. A deep learning-based LSTM prediction model is proposed to predict the number of cyberattacks in a time series on 3 evaluated data sets MSLE, MSE, MAE, RMSE, and MAPE, and displays the predicted relationships between prediction variables. Cyber attack dataset obtained from kaggle.com. The best prediction model is epoch 20, batch size 16, and neuron 32 with the lowest evaluation value on MSLE of 0.094, MSE of 9.067, MAE of 2.440, RMSE of 3.010, and MAPE of 10.507 (very good model because the value is less than 15) compared other variations. There is a negative correlation for INTRUSION-MALWARE, BLOCKED-IGNORED, IGNORED-LOGGED, and LOW-MEDIUM. The predicted results for the next 12 months will increase starting from the second month at the same time. The resulting predictions can be used as a basis for policy and strategy decisions by stakeholders in dealing with fluctuations in cyber attacks that occur.
越来越多的网络攻击将对技术基础设施的运行造成各种损害。利用时间序列数据,根据攻击类型、处理行动和严重程度来预测网络攻击数量的模型还从未有过。本文提出了一种基于深度学习的 LSTM 预测模型,在 MSLE、MSE、MAE、RMSE 和 MAPE 3 个评估数据集上预测时间序列中的网络攻击数量,并显示预测变量之间的预测关系。网络攻击数据集来自 kaggle.com。最佳预测模型为 epoch 20、batch size 16 和 neuron 32,与其他变量相比,其 MSLE 最小,为 0.094;MSE 最低,为 9.067;MAE 最低,为 2.440;RMSE 最低,为 3.010;MAPE 最低,为 10.507(非常好的模型,因为其值小于 15)。入侵-恶意软件、封锁-忽略、忽略-标记和低级-中级呈负相关。从第二个月开始,未来 12 个月的预测结果将同时增加。预测结果可作为利益相关者应对网络攻击波动的政策和战略决策的依据。
{"title":"Number of Cyber Attacks Predicted With Deep Learning Based LSTM Model","authors":"Joko Siswanto, Irwan Sembiring, Adi Setiawan, Iwan Setyawan","doi":"10.30595/juita.v12i1.20210","DOIUrl":"https://doi.org/10.30595/juita.v12i1.20210","url":null,"abstract":"The increasing number of cyber attacks will result in various damages to the functioning of technological infrastructure. A prediction model for the number of cyber attacks based on the type of attack, handling actions and severity using time-series data has never been done. A deep learning-based LSTM prediction model is proposed to predict the number of cyberattacks in a time series on 3 evaluated data sets MSLE, MSE, MAE, RMSE, and MAPE, and displays the predicted relationships between prediction variables. Cyber attack dataset obtained from kaggle.com. The best prediction model is epoch 20, batch size 16, and neuron 32 with the lowest evaluation value on MSLE of 0.094, MSE of 9.067, MAE of 2.440, RMSE of 3.010, and MAPE of 10.507 (very good model because the value is less than 15) compared other variations. There is a negative correlation for INTRUSION-MALWARE, BLOCKED-IGNORED, IGNORED-LOGGED, and LOW-MEDIUM. The predicted results for the next 12 months will increase starting from the second month at the same time. The resulting predictions can be used as a basis for policy and strategy decisions by stakeholders in dealing with fluctuations in cyber attacks that occur.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"30 24","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141120848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BCBimax Biclustering Algorithm with Mixed-Type Data 混合类型数据的 BCBimax 双聚类算法
Pub Date : 2024-05-20 DOI: 10.30595/juita.v12i1.21519
Hanifa Izzati, Indahwati Indahwati, Anik Djuraidah
The application of biclustering analysis to mixed data is still relatively new. Initially, biclustering analysis was primarily used on gene expression data that has an interval scale. In this research, we will transform ordinal categorical variables into interval scales using the Method of Successive Interval (MSI). The BCBimax algorithm will be applied in this study with several binarization experiments that produce the smallest Mean Square Residual (MSR) at the predetermined column and row thresholds. Next, a row and column threshold test will be carried out to find the optimal bicluster threshold. The existence of different interests in the variables for international market potential and the number of Indonesian export destination countries is the reason for the need for identification regarding the mapping of destination countries based on international trade potential. The study's results with the median threshold of all data found that the optimal MSR is at the threshold of row 7 and column 2. The number of biclusters formed is 9 which covers 74.7% of countries. Most countries in the bicluster come from the European Continent and a few countries from the African Continent are included in the bicluster.
双聚类分析在混合数据中的应用还相对较新。最初,双聚类分析主要用于区间尺度的基因表达数据。在本研究中,我们将使用连续区间法(MSI)将序数分类变量转换为区间尺度。本研究将采用 BCBimax 算法,进行多次二值化实验,在预定的列和行阈值下产生最小的均方残差(MSR)。接下来,将进行行和列阈值测试,以找到最佳双簇阈值。国际市场潜力变量和印尼出口目的国数量存在不同的利益,因此需要根据国际贸易潜力对目的国的映射进行识别。使用所有数据的中值阈值进行研究的结果发现,最佳 MSR 位于第 7 行第 2 列的阈值处。形成的双集群数量为 9 个,覆盖了 74.7% 的国家。双集群中的大多数国家来自欧洲大陆,少数几个国家来自非洲大陆。
{"title":"BCBimax Biclustering Algorithm with Mixed-Type Data","authors":"Hanifa Izzati, Indahwati Indahwati, Anik Djuraidah","doi":"10.30595/juita.v12i1.21519","DOIUrl":"https://doi.org/10.30595/juita.v12i1.21519","url":null,"abstract":"The application of biclustering analysis to mixed data is still relatively new. Initially, biclustering analysis was primarily used on gene expression data that has an interval scale. In this research, we will transform ordinal categorical variables into interval scales using the Method of Successive Interval (MSI). The BCBimax algorithm will be applied in this study with several binarization experiments that produce the smallest Mean Square Residual (MSR) at the predetermined column and row thresholds. Next, a row and column threshold test will be carried out to find the optimal bicluster threshold. The existence of different interests in the variables for international market potential and the number of Indonesian export destination countries is the reason for the need for identification regarding the mapping of destination countries based on international trade potential. The study's results with the median threshold of all data found that the optimal MSR is at the threshold of row 7 and column 2. The number of biclusters formed is 9 which covers 74.7% of countries. Most countries in the bicluster come from the European Continent and a few countries from the African Continent are included in the bicluster.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"98 29","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Durrotalk Chatbot Accuracy Utilizing a Hybrid Model Based on Recurrent Neural Network (RNN) Algorithm and Decision Tree 利用基于递归神经网络 (RNN) 算法和决策树的混合模型提高 Durrotalk 聊天机器人的准确性
Pub Date : 2024-05-20 DOI: 10.30595/juita.v12i1.20868
Dede Rizki Darmawan, R. Arifudin
DurroTalk, a chatbot for new student admissions at Pondok Pesantren Durrotu Ahlissunnah Waljamaah, Semarang, integrates a hybrid model with Recurrent Neural Network (RNN) and Decision Tree. RNN, the base model, employs Natural Language Processing (NLP) to understand sentence structure and context, overcoming vanishing gradient through LSTM layers. The Decision Tree normalizes words, addressing slang and synonyms. The hybrid model boosts chatbot accuracy by 9%, reaching 77% from the initial 68%. This research signifies progress in integrating artificial intelligence into traditional education, showcasing a chatbot adept at handling non-standard language. Decision Tree integration enhances overall performance, making the chatbot proficient in understanding user inputs and generating contextually relevant responses. This study exemplifies the potential of AI, particularly chatbot technology, in modernizing educational processes at traditional institutions.
DurroTalk 是一款用于三宝垄 Pondok Pesantren Durrotu Ahlissunnah Waljamaah 大学新生入学的聊天机器人,它集成了循环神经网络(RNN)和决策树的混合模型。循环神经网络(RNN)是基础模型,它利用自然语言处理(NLP)来理解句子结构和上下文,通过 LSTM 层克服梯度消失问题。决策树对单词进行规范化处理,解决俚语和同义词问题。混合模型将聊天机器人的准确率提高了 9%,从最初的 68% 提高到 77%。这项研究标志着将人工智能融入传统教育的进展,展示了一个善于处理非标准语言的聊天机器人。决策树集成提高了整体性能,使聊天机器人能够熟练地理解用户输入并生成与上下文相关的回复。这项研究充分体现了人工智能,尤其是聊天机器人技术在传统机构教育流程现代化方面的潜力。
{"title":"Enhancing Durrotalk Chatbot Accuracy Utilizing a Hybrid Model Based on Recurrent Neural Network (RNN) Algorithm and Decision Tree","authors":"Dede Rizki Darmawan, R. Arifudin","doi":"10.30595/juita.v12i1.20868","DOIUrl":"https://doi.org/10.30595/juita.v12i1.20868","url":null,"abstract":"DurroTalk, a chatbot for new student admissions at Pondok Pesantren Durrotu Ahlissunnah Waljamaah, Semarang, integrates a hybrid model with Recurrent Neural Network (RNN) and Decision Tree. RNN, the base model, employs Natural Language Processing (NLP) to understand sentence structure and context, overcoming vanishing gradient through LSTM layers. The Decision Tree normalizes words, addressing slang and synonyms. The hybrid model boosts chatbot accuracy by 9%, reaching 77% from the initial 68%. This research signifies progress in integrating artificial intelligence into traditional education, showcasing a chatbot adept at handling non-standard language. Decision Tree integration enhances overall performance, making the chatbot proficient in understanding user inputs and generating contextually relevant responses. This study exemplifies the potential of AI, particularly chatbot technology, in modernizing educational processes at traditional institutions.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"25 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Face Gender Classification using Combination of LPQ-Self PCA 使用 LPQ-Self PCA 组合进行人脸性别分类
Pub Date : 2024-05-20 DOI: 10.30595/juita.v12i1.21137
Tio Dharmawan, Danu Adi Nugroho, Muhammad Arief Hidayat
The age factor had a significant impact on human faces, potentially influencing the performance of existing gender classification systems. This research proposed a new method that combined local descriptors such as Local Binary Patterns (LBP) and Local Phase Quantization (LPQ) with Self-Principal Component Analysis (Self-PCA) as a feature extraction technique. The use of Self-PCA was chosen for its ability to address the age factor in human facial images, while also leveraging local descriptors to capture features from these images. The primary focus was to compare the performance of Self-PCA with LPQ+Self-PCA, along with the additional comparison of LBP+Self-PCA, in the task of gender classification using facial images. Euclidean distance served as the classifier, and the evaluation was conducted using the FG-Net and ORL datasets. The combination of LPQ+Self-PCA showed an improvement in accuracy by 57.85% compared to the combination of LBP+Self-PCA, which provided an accuracy of 56.47%. Meanwhile, using Self-PCA alone gave an accuracy of 55.37% on the FG-Net. In contrast, on the ORL dataset, both combinations gave the same accuracy result as Self-PCA, which was 90.14%, for images without blurring.
年龄因素对人脸有重大影响,可能会影响现有性别分类系统的性能。这项研究提出了一种新方法,将局部二进制模式(LBP)和局部相位量化(LPQ)等局部描述符与自主成分分析(Self-PCA)相结合,作为特征提取技术。选择使用 Self-PCA 是因为它能够解决人类面部图像中的年龄因素,同时还能利用局部描述符来捕捉这些图像中的特征。主要重点是比较 Self-PCA 与 LPQ+Self-PCA 的性能,以及 LBP+Self-PCA 在使用面部图像进行性别分类时的性能。欧氏距离作为分类器,使用 FG-Net 和 ORL 数据集进行评估。与 LBP+Self-PCA 相比,LPQ+Self-PCA 组合的准确率提高了 57.85%,LBP+Self-PCA 组合的准确率为 56.47%。同时,在 FG-Net 数据集上,单独使用 Self-PCA 的准确率为 55.37%。相比之下,在 ORL 数据集上,对于没有模糊的图像,两种组合的准确率与 Self-PCA 相同,均为 90.14%。
{"title":"Face Gender Classification using Combination of LPQ-Self PCA","authors":"Tio Dharmawan, Danu Adi Nugroho, Muhammad Arief Hidayat","doi":"10.30595/juita.v12i1.21137","DOIUrl":"https://doi.org/10.30595/juita.v12i1.21137","url":null,"abstract":"The age factor had a significant impact on human faces, potentially influencing the performance of existing gender classification systems. This research proposed a new method that combined local descriptors such as Local Binary Patterns (LBP) and Local Phase Quantization (LPQ) with Self-Principal Component Analysis (Self-PCA) as a feature extraction technique. The use of Self-PCA was chosen for its ability to address the age factor in human facial images, while also leveraging local descriptors to capture features from these images. The primary focus was to compare the performance of Self-PCA with LPQ+Self-PCA, along with the additional comparison of LBP+Self-PCA, in the task of gender classification using facial images. Euclidean distance served as the classifier, and the evaluation was conducted using the FG-Net and ORL datasets. The combination of LPQ+Self-PCA showed an improvement in accuracy by 57.85% compared to the combination of LBP+Self-PCA, which provided an accuracy of 56.47%. Meanwhile, using Self-PCA alone gave an accuracy of 55.37% on the FG-Net. In contrast, on the ORL dataset, both combinations gave the same accuracy result as Self-PCA, which was 90.14%, for images without blurring.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"70 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141123260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital Twin and Blockchain Extension in Smart Buildings Platform as Cyber-Physical Systems 数字孪生和区块链在作为网络物理系统的智能建筑平台中的扩展
Pub Date : 2023-11-17 DOI: 10.30595/juita.v11i2.18704
Oktafian Sultan Hakim, M. A. Zainuddin, S. Sukaridhoto, Agus Prayudi
Cyber-physical systems is integrated computation with the physical world. CPS increasing in a wide range of applications, from smart homes to smart buildings. Digital twins are promising way to solve challenges with combination of CPS, 3D technology, and IoT. The system provides users with immersive interfaces to control and interact with devices within the smart building environment. Blockchain was chosen to secure user data using cryptographic algorithms and ensure data protection against manipulation, spying, and theft. Average load testing data for digital twin platform implemented in smart buildings range from 1 to 11 floors. The results reveal a gradual increase in average test times as the buildings' size and complexity grow, with the following values: 5.663s for 1 floor until 11 floors 7.294s. The data obtained from of the blockchain test using Hyperledger Besu provide essential insights into the system's performance with several bandwidth that used in the system. Average time for each test trial ranged from 1.066 seconds to 2.006 seconds, showing slight variations based on the bandwidth used. However, transactions per second (TPS) values were relatively fast, ranging from 1.066 tps to 0.499 tps with positive aspect of the retention rate for all trials was 100% success.
网络物理系统是计算与物理世界的集成。从智能家居到智能楼宇,CPS 的应用日益广泛。数字孪生是结合 CPS、3D 技术和物联网来解决各种挑战的有前途的方法。该系统为用户提供身临其境的界面,以控制智能建筑环境中的设备并与之互动。选择区块链的目的是利用加密算法保护用户数据,确保数据不被篡改、刺探和窃取。在智能建筑中实施的数字孪生平台的平均负载测试数据从 1 层到 11 层不等。结果显示,随着楼宇规模和复杂程度的增加,平均测试时间也逐渐增加,具体数值如下:1 层楼为 5.663 秒,11 层楼为 7.294 秒。使用 Hyperledger Besu 进行的区块链测试所获得的数据提供了有关系统性能的重要信息,其中包括系统中使用的几种带宽。每次测试的平均时间从 1.066 秒到 2.006 秒不等,根据使用的带宽略有不同。不过,每秒交易(TPS)值相对较快,从 1.066 tps 到 0.499 tps 不等,积极的一面是所有试验的保留率均为 100%。
{"title":"Digital Twin and Blockchain Extension in Smart Buildings Platform as Cyber-Physical Systems","authors":"Oktafian Sultan Hakim, M. A. Zainuddin, S. Sukaridhoto, Agus Prayudi","doi":"10.30595/juita.v11i2.18704","DOIUrl":"https://doi.org/10.30595/juita.v11i2.18704","url":null,"abstract":"Cyber-physical systems is integrated computation with the physical world. CPS increasing in a wide range of applications, from smart homes to smart buildings. Digital twins are promising way to solve challenges with combination of CPS, 3D technology, and IoT. The system provides users with immersive interfaces to control and interact with devices within the smart building environment. Blockchain was chosen to secure user data using cryptographic algorithms and ensure data protection against manipulation, spying, and theft. Average load testing data for digital twin platform implemented in smart buildings range from 1 to 11 floors. The results reveal a gradual increase in average test times as the buildings' size and complexity grow, with the following values: 5.663s for 1 floor until 11 floors 7.294s. The data obtained from of the blockchain test using Hyperledger Besu provide essential insights into the system's performance with several bandwidth that used in the system. Average time for each test trial ranged from 1.066 seconds to 2.006 seconds, showing slight variations based on the bandwidth used. However, transactions per second (TPS) values were relatively fast, ranging from 1.066 tps to 0.499 tps with positive aspect of the retention rate for all trials was 100% success.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"63 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139264051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of Data Mining Classification Algorithms for Stroke Disease Prediction Using the SMOTE Upsampling Method 使用 SMOTE 提升取样法预测中风疾病的数据挖掘分类算法比较
Pub Date : 2023-11-17 DOI: 10.30595/juita.v11i2.17348
Ronald Sebastian, Christina Juliane
Stroke is a circulation disorder in the brain that can cause symptoms and signs related to the affected part of the brain and is the leading cause of death and disability in Indonesia. Everyone is at risk of experiencing a stroke, and it is important to recognize and manage risk factors. Data Mining techniques can help in the extraction and prediction of information, as well as finding hidden patterns in stroke medical data. The dataset used in this research comes from Kaggle and is imbalanced, so the SMOTE Upsampling technique is used to address this imbalance issue. The results of the study conclude that the use of SMOTE technique in the C4.5, NB, and KNN algorithms can increase precision, recall, and AUC. The C4.5 algorithm and SMOTE technique as the best performing algorithm were selected for testing new data, and the results show that the model created can predict stroke risk more accurately than the C4.5 model without SMOTE. However, it should be noted that based on the author's interview with one of the medical practitioners, the model cannot be directly used in medical practice because the observations in the medical field to determine factors related to stroke are highly complex. Thus, a new understanding revealed that predicting stroke in a practical setting is highly complex. While data mining can be used as a predictive tool in the initial stage for predictions in the general population, it is strongly recommended to undergo direct examination by doctors in a hospital to obtain more accurate and comprehensive medical evaluations.
中风是一种脑循环障碍,可引起与受影响的脑部相关的症状和体征,是导致印度尼西亚人死亡和残疾的主要原因。每个人都有中风的风险,因此识别和管理风险因素非常重要。数据挖掘技术可以帮助提取和预测信息,并发现中风医疗数据中隐藏的模式。本研究使用的数据集来自 Kaggle,具有不平衡性,因此使用了 SMOTE 升采样技术来解决这一不平衡性问题。研究结果表明,在 C4.5、NB 和 KNN 算法中使用 SMOTE 技术可以提高精确度、召回率和 AUC。在测试新数据时,选择了 C4.5 算法和 SMOTE 技术作为性能最好的算法,结果表明所创建的模型比不使用 SMOTE 的 C4.5 模型能更准确地预测中风风险。但需要注意的是,根据笔者对其中一位医学从业者的访谈,该模型不能直接用于医疗实践,因为医学领域判断中风相关因素的观察非常复杂。因此,新的认识表明,在实际环境中预测中风是非常复杂的。虽然数据挖掘在初始阶段可作为预测工具用于普通人群的预测,但强烈建议在医院接受医生的直接检查,以获得更准确、更全面的医疗评估。
{"title":"Comparison of Data Mining Classification Algorithms for Stroke Disease Prediction Using the SMOTE Upsampling Method","authors":"Ronald Sebastian, Christina Juliane","doi":"10.30595/juita.v11i2.17348","DOIUrl":"https://doi.org/10.30595/juita.v11i2.17348","url":null,"abstract":"Stroke is a circulation disorder in the brain that can cause symptoms and signs related to the affected part of the brain and is the leading cause of death and disability in Indonesia. Everyone is at risk of experiencing a stroke, and it is important to recognize and manage risk factors. Data Mining techniques can help in the extraction and prediction of information, as well as finding hidden patterns in stroke medical data. The dataset used in this research comes from Kaggle and is imbalanced, so the SMOTE Upsampling technique is used to address this imbalance issue. The results of the study conclude that the use of SMOTE technique in the C4.5, NB, and KNN algorithms can increase precision, recall, and AUC. The C4.5 algorithm and SMOTE technique as the best performing algorithm were selected for testing new data, and the results show that the model created can predict stroke risk more accurately than the C4.5 model without SMOTE. However, it should be noted that based on the author's interview with one of the medical practitioners, the model cannot be directly used in medical practice because the observations in the medical field to determine factors related to stroke are highly complex. Thus, a new understanding revealed that predicting stroke in a practical setting is highly complex. While data mining can be used as a predictive tool in the initial stage for predictions in the general population, it is strongly recommended to undergo direct examination by doctors in a hospital to obtain more accurate and comprehensive medical evaluations.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"12 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139264441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of the Minkowski Distance Similarity Method in Case-Based Reasoning for Stroke Diagnosis 明考斯基距离相似法在基于病例推理的脑卒中诊断中的应用
Pub Date : 2023-11-17 DOI: 10.30595/juita.v11i2.18582
Angelina Rumuy, R. Delima, Kuncoro Probo Saputra, J. Purwadi
A Stroke is a cerebrovascular disease characterized by impaired brain function due to damage or death of brain tissue caused by reduced or blocked blood and oxygen flow to the brain. Expert systems can be used as learning aids for medical students to diagnose stroke. Medical records of stroke cases can be reused as a reference for diagnosing stroke when there are new cases, known as the case-based reasoning (CBR) method. This study implements the Minkowski distance similarity method in CBR to calculate the similarity value between cases, where each similar case has the same solution. This study uses the Minkowski distance similarity method in CBR to obtain the most optimal value of r and the most appropriate threshold value in the expert system for stroke diagnosis. The diagnosis process is carried out by inputting the patient's condition, symptoms, and risk factors. Then the system will calculate the similarity value and take the case with the highest similarity value as the solution, providing that the similarity value must be greater than or equal to the threshold value. Based on system testing, the best accuracy value was achieved by applying a threshold value of 75 with an r value of 3 or 4, with an accuracy rate of 88,89%, a recall value of 88%, and a precision of 100%.
中风是一种脑血管疾病,其特点是由于流向大脑的血液和氧气减少或受阻,导致脑组织受损或死亡,从而使大脑功能受损。专家系统可用作医科学生诊断中风的学习辅助工具。当有新病例出现时,中风病例的医疗记录可作为诊断中风的参考资料重新使用,这就是所谓的基于病例的推理(CBR)方法。本研究在 CBR 中采用明考斯基距离相似性方法计算病例之间的相似值,每个相似病例都有相同的解决方案。本研究利用 CBR 中的明考斯基距离相似性方法,在脑卒中诊断专家系统中获得最优的 r 值和最合适的阈值。诊断过程是通过输入病人的病情、症状和危险因素来进行的。然后,系统将计算相似度值,并将相似度值最高的病例作为解决方案,前提是相似度值必须大于或等于阈值。根据系统测试,阈值为 75、r 值为 3 或 4 的准确率最高,准确率为 88.89%,召回值为 88%,精确度为 100%。
{"title":"Application of the Minkowski Distance Similarity Method in Case-Based Reasoning for Stroke Diagnosis","authors":"Angelina Rumuy, R. Delima, Kuncoro Probo Saputra, J. Purwadi","doi":"10.30595/juita.v11i2.18582","DOIUrl":"https://doi.org/10.30595/juita.v11i2.18582","url":null,"abstract":"A Stroke is a cerebrovascular disease characterized by impaired brain function due to damage or death of brain tissue caused by reduced or blocked blood and oxygen flow to the brain. Expert systems can be used as learning aids for medical students to diagnose stroke. Medical records of stroke cases can be reused as a reference for diagnosing stroke when there are new cases, known as the case-based reasoning (CBR) method. This study implements the Minkowski distance similarity method in CBR to calculate the similarity value between cases, where each similar case has the same solution. This study uses the Minkowski distance similarity method in CBR to obtain the most optimal value of r and the most appropriate threshold value in the expert system for stroke diagnosis. The diagnosis process is carried out by inputting the patient's condition, symptoms, and risk factors. Then the system will calculate the similarity value and take the case with the highest similarity value as the solution, providing that the similarity value must be greater than or equal to the threshold value. Based on system testing, the best accuracy value was achieved by applying a threshold value of 75 with an r value of 3 or 4, with an accuracy rate of 88,89%, a recall value of 88%, and a precision of 100%.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"26 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139264590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sentiment Analysis of Student Comment on the College Performance Evaluation Questionnaire Using Naïve Bayes and IndoBERT 使用 Naïve Bayes 和 IndoBERT 对学生对大学成绩评估问卷的评论进行情感分析
Pub Date : 2023-11-17 DOI: 10.30595/juita.v11i2.17336
Wiga Maaulana Baihaqi, Arif Munandar
The development of the Internet has played a significant role in various aspects of life and has generated vast amounts of data, including student comments about universities. The challenge in analyzing comment data is the large number of students providing feedback, which makes manual analysis impractical. The purpose of this study is to analyze the performance evaluation of universities by students in terms of positive and negative sentiments, with the aim of assessing the level of student satisfaction with all elements and areas of university operations. This research utilized the Naïve Bayes algorithm and the IndoBERT model to build a classification model based on questionnaire data, starting from the data collection process, data preprocessing, feature extraction, modeling, and evaluation. The results of the IndoBERT model demonstrated the best performance, with an accuracy of 85%. The IndoBERT model effectively recognizes sentiments in text, distinguishing between positive and negative comments regarding university performance.
互联网的发展在生活的各个方面都发挥了重要作用,并产生了大量数据,包括学生对大学的评论。分析评论数据的挑战在于提供反馈的学生人数众多,人工分析不切实际。本研究旨在从正面和负面情绪两个方面分析学生对大学的绩效评价,目的是评估学生对大学运营的所有要素和领域的满意程度。本研究利用奈伊夫贝叶斯算法和 IndoBERT 模型,从数据收集过程、数据预处理、特征提取、建模和评价等方面入手,建立了基于问卷数据的分类模型。IndoBERT 模型的结果表明性能最佳,准确率达到 85%。IndoBERT 模型能有效识别文本中的情绪,区分有关大学表现的正面和负面评论。
{"title":"Sentiment Analysis of Student Comment on the College Performance Evaluation Questionnaire Using Naïve Bayes and IndoBERT","authors":"Wiga Maaulana Baihaqi, Arif Munandar","doi":"10.30595/juita.v11i2.17336","DOIUrl":"https://doi.org/10.30595/juita.v11i2.17336","url":null,"abstract":"The development of the Internet has played a significant role in various aspects of life and has generated vast amounts of data, including student comments about universities. The challenge in analyzing comment data is the large number of students providing feedback, which makes manual analysis impractical. The purpose of this study is to analyze the performance evaluation of universities by students in terms of positive and negative sentiments, with the aim of assessing the level of student satisfaction with all elements and areas of university operations. This research utilized the Naïve Bayes algorithm and the IndoBERT model to build a classification model based on questionnaire data, starting from the data collection process, data preprocessing, feature extraction, modeling, and evaluation. The results of the IndoBERT model demonstrated the best performance, with an accuracy of 85%. The IndoBERT model effectively recognizes sentiments in text, distinguishing between positive and negative comments regarding university performance.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"54 10","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139264071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leaderboard Application as A Ranking Media for Internet Users 作为互联网用户排名媒体的排行榜应用程序
Pub Date : 2023-11-17 DOI: 10.30595/juita.v11i2.17094
Hariadi Yutanto, Gaguk Suprianto, Yusuf Effendi
The technology of utilizing hotspot networks has developed quite rapidly. In its development, internet technology uses a more flexible Mikrotik hotspot because it provides convenience for administrators and users. The object of this study is the hotspot network of Hayam Wuruk University (UHW) Perbanas.  The goal is to develop a leaderboard design as a medium for monitoring internet use through the UHW Perbanas hotspot.  Its application is through the integration of mikrotik with the web service API as a ranking of internet users against three categories of activities, namely downloads, uploads and internet usage times on each day and month.  Each of these categories has 20 users.  The test method uses a black box.  Hasil testing states  that the system is successfully operating, so that it can be implemented in the context of decision making by the management of  UHW Perbanas.
利用热点网络的技术发展相当迅速。在其发展过程中,互联网技术使用了更为灵活的 Mikrotik 热点,因为它为管理员和用户提供了便利。本研究的对象是 Hayam Wuruk 大学(UHW)Perbanas 分校的热点网络。 其目的是开发一种排行榜设计,作为通过 UHW Perbanas 热点监控互联网使用情况的媒介。 其应用是通过 mikrotik 与网络服务 API 的集成,根据三类活动(即下载、上传和每天和每月的互联网使用时间)对互联网用户进行排名。 每个类别有 20 个用户。 测试方法使用黑盒。 Hasil 测试表明系统运行成功,因此可以在 UHW Perbanas 管理层的决策中实施。
{"title":"Leaderboard Application as A Ranking Media for Internet Users","authors":"Hariadi Yutanto, Gaguk Suprianto, Yusuf Effendi","doi":"10.30595/juita.v11i2.17094","DOIUrl":"https://doi.org/10.30595/juita.v11i2.17094","url":null,"abstract":"The technology of utilizing hotspot networks has developed quite rapidly. In its development, internet technology uses a more flexible Mikrotik hotspot because it provides convenience for administrators and users. The object of this study is the hotspot network of Hayam Wuruk University (UHW) Perbanas.  The goal is to develop a leaderboard design as a medium for monitoring internet use through the UHW Perbanas hotspot.  Its application is through the integration of mikrotik with the web service API as a ranking of internet users against three categories of activities, namely downloads, uploads and internet usage times on each day and month.  Each of these categories has 20 users.  The test method uses a black box.  Hasil testing states  that the system is successfully operating, so that it can be implemented in the context of decision making by the management of  UHW Perbanas.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"28 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139264672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiplayer Game Guessing Sunda’s Proverb Using Socket.Io And Node.Js 使用 Socket.Io 和 Node.Js 进行多人游戏,猜测巽他谚语
Pub Date : 2023-11-17 DOI: 10.30595/juita.v11i2.16828
Leni Fitriani, Dewi Tresnawati, M. Iqbal, Ismail Safei Pamungkas
Game development is currently quite rapid. Now games can be played by various groups, because many games now contain not just games, but there are also games with educational content. The educational game that will be made in this study is a website-based Sundanese proverb game, this type of game will be multiplayer so that players can compete with other players. The purpose of this research is to make a Sundanese proverb educational multiplayer game that can be played simultaneously with many players, so that it can introduce the regional language, namely Sundanese, to the wider community. The technology used in making this game is Socket.IO and Node.JS, using these technologies can make end users interact in real time. In making this game using the Game Development Life Cycle (GDLC) methodology with the stages of initialization, pre-production, production, testing, beta and release. The results obtained in this research are website-based Sundanese proverb educational games that can be used without taking up much space on the device.
目前,游戏发展相当迅速。现在的游戏可以供不同的群体玩,因为现在的许多游戏不仅包含游戏,还包含有教育内容的游戏。本研究将制作的教育游戏是一个基于网站的巽他语谚语游戏,这种类型的游戏将是多人游戏,以便玩家可以与其他玩家竞争。本研究的目的是制作一个多人同时玩的巽他语谚语教育游戏,以便向更广泛的社区介绍巽他语这一地区语言。制作这款游戏所使用的技术是 Socket.IO 和 Node.JS,使用这些技术可以让终端用户进行实时互动。在制作这款游戏的过程中,使用了游戏开发生命周期(GDLC)方法,包括初始化、预制作、制作、测试、测试版和发布等阶段。本研究的成果是基于网站的巽他谚语教育游戏,可以在不占用设备空间的情况下使用。
{"title":"Multiplayer Game Guessing Sunda’s Proverb Using Socket.Io And Node.Js","authors":"Leni Fitriani, Dewi Tresnawati, M. Iqbal, Ismail Safei Pamungkas","doi":"10.30595/juita.v11i2.16828","DOIUrl":"https://doi.org/10.30595/juita.v11i2.16828","url":null,"abstract":"Game development is currently quite rapid. Now games can be played by various groups, because many games now contain not just games, but there are also games with educational content. The educational game that will be made in this study is a website-based Sundanese proverb game, this type of game will be multiplayer so that players can compete with other players. The purpose of this research is to make a Sundanese proverb educational multiplayer game that can be played simultaneously with many players, so that it can introduce the regional language, namely Sundanese, to the wider community. The technology used in making this game is Socket.IO and Node.JS, using these technologies can make end users interact in real time. In making this game using the Game Development Life Cycle (GDLC) methodology with the stages of initialization, pre-production, production, testing, beta and release. The results obtained in this research are website-based Sundanese proverb educational games that can be used without taking up much space on the device.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"65 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139262596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JUITA : Jurnal Informatika
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1