IJCCS Indonesian Journal of Computing and Cybernetics Systems最新文献_第4页

Improving Cardiovascular Disease Prediction by Integrating Imputation, Imbalance Resampling, and Feature Selection Techniques into Machine Learning Model 利用机器学习模型整合插值、不平衡重采样和特征选择技术改善心血管疾病预测

IJCCS Indonesian Journal of Computing and Cybernetics Systems

Pub Date : 2023-02-23 DOI: 10.22146/ijccs.80214

Fadlan Hamid Alfebi, M. D. Anasanti

Cardiovascular disease (CVD) is the leading cause of death worldwide. Primary prevention is by early prediction of the disease onset. Using laboratory data from the National Health and Nutrition Examination Survey (NHANES) in 2017-2020 timeframe (N= 7.974), we tested the ability of machine learning (ML) algorithms to classify individuals at risk. The ML models were evaluated based on their classification performances after comparing four imputation, three imbalance resampling, and three feature selection techniques.Due to its popularity, we utilized decision tree (DT) as the baseline. Integration of multiple imputation by chained equation (MICE) and synthetic minority oversampling with Tomek link down-sampling (SMOTETomek) into the model improved the area under the curve-receiver operating characteristics (AUC-ROC) from 57% to 83%. Applying simultaneous perturbation feature selection and ranking (spFSR) reduced the feature predictors from 144 to 30 features and the computational time by 22%. The best techniques were applied to six ML models, resulting in Xtreme gradient boosting (XGBoost) achieving the highest accuracy of 93% and AUC-ROC of 89%.The accuracy of our ML model in predicting CVD outperforms those from previous studies. We also highlight the important causes of CVD, which might be investigated further for potential effects on electronic health records.

心血管疾病(CVD)是世界范围内死亡的主要原因。初级预防是通过对疾病发病的早期预测。使用2017-2020年国家健康和营养检查调查(NHANES)的实验室数据(N= 7.974)，我们测试了机器学习(ML)算法对风险个体进行分类的能力。通过比较四种输入、三种不平衡重采样和三种特征选择技术，对ML模型的分类性能进行了评价。由于它的流行，我们使用决策树(DT)作为基线。将链式方程(MICE)多次插值和Tomek链路下采样(SMOTETomek)合成少数过采样集成到模型中，将曲线下接收者工作特征(AUC-ROC)面积从57%提高到83%。同时应用摄动特征选择和排序(spFSR)将特征预测器从144个特征减少到30个特征，计算时间减少22%。将最佳技术应用于6个ML模型，Xtreme梯度增强(XGBoost)的准确率最高，达到93%，AUC-ROC为89%。我们的ML模型在预测心血管疾病方面的准确性优于以往的研究。我们还强调了心血管疾病的重要原因，这可能会进一步研究对电子健康记录的潜在影响。

{"title":"Improving Cardiovascular Disease Prediction by Integrating Imputation, Imbalance Resampling, and Feature Selection Techniques into Machine Learning Model","authors":"Fadlan Hamid Alfebi, M. D. Anasanti","doi":"10.22146/ijccs.80214","DOIUrl":"https://doi.org/10.22146/ijccs.80214","url":null,"abstract":"Cardiovascular disease (CVD) is the leading cause of death worldwide. Primary prevention is by early prediction of the disease onset. Using laboratory data from the National Health and Nutrition Examination Survey (NHANES) in 2017-2020 timeframe (N= 7.974), we tested the ability of machine learning (ML) algorithms to classify individuals at risk. The ML models were evaluated based on their classification performances after comparing four imputation, three imbalance resampling, and three feature selection techniques.Due to its popularity, we utilized decision tree (DT) as the baseline. Integration of multiple imputation by chained equation (MICE) and synthetic minority oversampling with Tomek link down-sampling (SMOTETomek) into the model improved the area under the curve-receiver operating characteristics (AUC-ROC) from 57% to 83%. Applying simultaneous perturbation feature selection and ranking (spFSR) reduced the feature predictors from 144 to 30 features and the computational time by 22%. The best techniques were applied to six ML models, resulting in Xtreme gradient boosting (XGBoost) achieving the highest accuracy of 93% and AUC-ROC of 89%.The accuracy of our ML model in predicting CVD outperforms those from previous studies. We also highlight the important causes of CVD, which might be investigated further for potential effects on electronic health records. ","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42287060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Implementation of Ensemble Methods on Classification of CDK2 Inhibitor as Anti-Cancer Agent CDK2抑制剂抗癌分类的集成方法实现

IJCCS Indonesian Journal of Computing and Cybernetics Systems

Pub Date : 2023-02-23 DOI: 10.22146/ijccs.78537

I. Kurniawan, Melani Anggraini, A. Aditsania, E. B. Setiawan

Cancer is known as the second leading cause of death worldwide. About 7-10 million cases of death by cancer occur every year. The recent treatment to heal the cancer is chemotherapy. However, chemotherapy treatment is known to have side effects and cell resistance issues to certain drugs. Therefore, it is required to develop a new drug that can reduce the side effects and provide a better treatment effect. In general, anti-cancer drugs are developed by targeting Cyclin-Dependent Kinase 2 (CDK2) enzyme. Conventional drug design is not effective and efficient for obtaining new drug candidates because of no information about the biological activity before it is synthesized. In this study, we aim to develop a model to predict the activity of CDK2 inhibitors by using ensemble methods, i.e., XGBoost, Random Forest, and AdaBoost. The study was conducted by calculating several fingerprints, i.e., Estate, Extended, Maccs, and Pubchem, as feature variables. Based on the results, we found that Random Forest with Pubchem fingerprint gives the best result with the value of Matthews Correlation Coefficient (MCC) and Area Under the ROC Curve (AUC) values are 0.979 and 0.999, respectively. From this study, we contributed to revealing the potency of the ensemble with fingerprint in bioactivity prediction, especially CDK2 inhibitors as anti-cancer agents.

癌症是全球第二大死亡原因。每年约有700万至1000万人死于癌症。最近治疗癌症的方法是化疗。然而，众所周知，化疗会对某些药物产生副作用和细胞耐药性问题。因此，需要开发一种能够减少副作用并提供更好治疗效果的新药。一般来说，抗癌药物是通过靶向细胞周期蛋白依赖性激酶2（CDK2）酶开发的。常规药物设计对于获得新的候选药物是无效和高效的，因为在合成之前没有关于生物活性的信息。在本研究中，我们的目标是通过使用集成方法，即XGBoost、Random Forest和AdaBoost，开发一个预测CDK2抑制剂活性的模型。该研究通过计算几个指纹作为特征变量进行，即Estate、Extended、Maccs和Pubchem。基于这些结果，我们发现具有Pubchem指纹的随机森林给出了最好的结果，Matthews相关系数（MCC）和ROC曲线下面积（AUC）分别为0.979和0.999。从这项研究中，我们有助于揭示指纹组合在生物活性预测中的效力，尤其是作为抗癌剂的CDK2抑制剂。

{"title":"Implementation of Ensemble Methods on Classification of CDK2 Inhibitor as Anti-Cancer Agent","authors":"I. Kurniawan, Melani Anggraini, A. Aditsania, E. B. Setiawan","doi":"10.22146/ijccs.78537","DOIUrl":"https://doi.org/10.22146/ijccs.78537","url":null,"abstract":"Cancer is known as the second leading cause of death worldwide. About 7-10 million cases of death by cancer occur every year. The recent treatment to heal the cancer is chemotherapy. However, chemotherapy treatment is known to have side effects and cell resistance issues to certain drugs. Therefore, it is required to develop a new drug that can reduce the side effects and provide a better treatment effect. In general, anti-cancer drugs are developed by targeting Cyclin-Dependent Kinase 2 (CDK2) enzyme. Conventional drug design is not effective and efficient for obtaining new drug candidates because of no information about the biological activity before it is synthesized. In this study, we aim to develop a model to predict the activity of CDK2 inhibitors by using ensemble methods, i.e., XGBoost, Random Forest, and AdaBoost. The study was conducted by calculating several fingerprints, i.e., Estate, Extended, Maccs, and Pubchem, as feature variables. Based on the results, we found that Random Forest with Pubchem fingerprint gives the best result with the value of Matthews Correlation Coefficient (MCC) and Area Under the ROC Curve (AUC) values are 0.979 and 0.999, respectively. From this study, we contributed to revealing the potency of the ensemble with fingerprint in bioactivity prediction, especially CDK2 inhibitors as anti-cancer agents.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43421425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unsupervised Text Style Transfer for Authorship Obfuscation in Bahasa Indonesia 印尼巴哈萨语作者困惑的无监督文本风格转换

IJCCS Indonesian Journal of Computing and Cybernetics Systems

Pub Date : 2023-02-23 DOI: 10.22146/ijccs.79623

Yunita Sari, Fadhlan Pasyah Al Faridzi

Authorship attribution is an NLP task to identify the author of a text based on stylometric analysis. On the other hand, authorship obfuscation aims to protect against authorship attribution by modifying a text’s style. The main challenge in authorship obfuscation is how to keep the content of the text despite the text modification. In this research, we are applying text style transfer methods for modifying the writing style while preserving the content of the input text. We implemented two unsupervised text style transfer: dictionary-based and back translation methods to change the formality level of the text. Experiment results shows that the back-translation method outperformed the dictionary-based method. The authorship attribution performance decreased up to 16.15% and 23.66% on F1-score for 3 and 10 authors respectively using back-translation. While for dictionary-based method the F1-score dropped up to 1.99% and 11.56% for 3 and 10 authors respectively. Evaluation on sensibleness and soundness factors show that the back-translation method can preserve the semantic of the obfuscated texts. Moreover, the modified texts are well-formed and inconspicuous.

作者归属是一项基于文体分析来识别文本作者的NLP任务。另一方面，作者身份混淆旨在通过修改文本的样式来防止作者身份归属。作者身份混淆的主要挑战是如何在修改文本的情况下保持文本的内容。在本研究中，我们应用文本风格转移方法来修改写作风格，同时保留输入文本的内容。我们实现了两种无监督的文本风格转换:基于字典的和反向翻译的方法来改变文本的正式程度。实验结果表明，反翻译方法优于基于字典的方法。使用反译的3位作者和10位作者在f1分上分别下降了16.15%和23.66%。而在基于词典的方法中，3位作者和10位作者的f1得分分别下降到1.99%和11.56%。通过对敏感性和合理性因素的评价，表明反翻译方法可以保留混淆文本的语义。此外，修改后的文本格式良好，不显眼。

{"title":"Unsupervised Text Style Transfer for Authorship Obfuscation in Bahasa Indonesia","authors":"Yunita Sari, Fadhlan Pasyah Al Faridzi","doi":"10.22146/ijccs.79623","DOIUrl":"https://doi.org/10.22146/ijccs.79623","url":null,"abstract":"Authorship attribution is an NLP task to identify the author of a text based on stylometric analysis. On the other hand, authorship obfuscation aims to protect against authorship attribution by modifying a text’s style. The main challenge in authorship obfuscation is how to keep the content of the text despite the text modification. In this research, we are applying text style transfer methods for modifying the writing style while preserving the content of the input text. We implemented two unsupervised text style transfer: dictionary-based and back translation methods to change the formality level of the text. Experiment results shows that the back-translation method outperformed the dictionary-based method. The authorship attribution performance decreased up to 16.15% and 23.66% on F1-score for 3 and 10 authors respectively using back-translation. While for dictionary-based method the F1-score dropped up to 1.99% and 11.56% for 3 and 10 authors respectively. Evaluation on sensibleness and soundness factors show that the back-translation method can preserve the semantic of the obfuscated texts. Moreover, the modified texts are well-formed and inconspicuous. ","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44597010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comparison of CNN Models With Transfer Learning in the Classification of Insect Pests CNN模型与迁移学习在害虫分类中的比较

IJCCS Indonesian Journal of Computing and Cybernetics Systems

Pub Date : 2023-02-23 DOI: 10.22146/ijccs.80956

Angga Prima Syahputra, Alda Cendekia Siregar, Rachmat Wahid Saleh Insani

Insect pests are an important problem to overcome in agriculture. The purpose of this research is to classify insect pests with the IP-102 dataset using several CNN pre-trained models and choose which model is best for classifying insect pest data. The method used is the transfer learning method with a fine-tuning approach. Transfer learning was chosen because this technique can use the features and weights that have been obtained during the previous training process. Thus, computation time can be reduced and accuracy can be increased. The models used include Xception, MobileNetV3L, MobileNetV2, DenseNet-201, and InceptionV3. Fine-tuning and freeze layer techniques are also used to improve the quality of the resulting model, making it more accurate and better suited to the problem at hand. This study uses 75,222 image data with 102 classes. The results of this study are the DenseNet-201 model with fine-tuning produces an accuracy value of 70%, MobileNetV2 66%, MobileNetV3L 68%, InceptionV3 67%, Xception 69%. The conclusion of this study is that the transfer learning method with the fine-tuning approach produces the highest accuracy value of 70% in the DenseNet-201 model.

害虫是农业中需要克服的一个重要问题。本研究的目的是使用几种CNN预训练模型，使用IP-102数据集对害虫进行分类，并选择哪种模型最适合对害虫数据进行分类。所使用的方法是带有微调方法的迁移学习方法。之所以选择迁移学习，是因为这种技术可以使用在之前的训练过程中获得的特征和权重。因此，可以减少计算时间并且可以提高精度。使用的模型包括Xception、MobileNetV3L、MobileNetw2、DenseNet-201和InceptionV3。微调和冻结层技术也用于提高最终模型的质量，使其更准确，更适合当前的问题。本研究使用了102个类别的75222个图像数据。本研究的结果是，经过微调的DenseNet-201模型产生的准确度值为70%，MobileNetV2为66%，MobileNet V3L为68%，InceptionV3为67%，Xception为69%。本研究的结论是，在DenseNet-201模型中，采用微调方法的迁移学习方法产生了70%的最高精度值。

{"title":"Comparison of CNN Models With Transfer Learning in the Classification of Insect Pests","authors":"Angga Prima Syahputra, Alda Cendekia Siregar, Rachmat Wahid Saleh Insani","doi":"10.22146/ijccs.80956","DOIUrl":"https://doi.org/10.22146/ijccs.80956","url":null,"abstract":"Insect pests are an important problem to overcome in agriculture. The purpose of this research is to classify insect pests with the IP-102 dataset using several CNN pre-trained models and choose which model is best for classifying insect pest data. The method used is the transfer learning method with a fine-tuning approach. Transfer learning was chosen because this technique can use the features and weights that have been obtained during the previous training process. Thus, computation time can be reduced and accuracy can be increased. The models used include Xception, MobileNetV3L, MobileNetV2, DenseNet-201, and InceptionV3. Fine-tuning and freeze layer techniques are also used to improve the quality of the resulting model, making it more accurate and better suited to the problem at hand. This study uses 75,222 image data with 102 classes. The results of this study are the DenseNet-201 model with fine-tuning produces an accuracy value of 70%, MobileNetV2 66%, MobileNetV3L 68%, InceptionV3 67%, Xception 69%. The conclusion of this study is that the transfer learning method with the fine-tuning approach produces the highest accuracy value of 70% in the DenseNet-201 model.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44358665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Aspect-Based Sentiment Analysis in Bromo Tengger Semeru National Park Indonesia Based on Google Maps User Reviews 基于谷歌地图用户评论的印尼Bromo Tengger sememeru国家公园面向方面的情感分析

IJCCS Indonesian Journal of Computing and Cybernetics Systems

Pub Date : 2023-02-23 DOI: 10.22146/ijccs.77354

Cynthia As Bahri, Lya Hulliyyatus Suadaa

Technology can influence and shape a person's behavior patterns when planning tours, traveling, and after traveling. Visitors' reviews can be used as evaluation material to improve the quality of tourist destinations and become a determining factor for other tourists to visit or revisit the destinations. The process of utilizing these reviews can be done by assessing the aspects of tourist destinations based on reviews from visitors. This study aims to conduct an aspect-based sentiment analysis on one of the tourist destinations in Indonesia, namely Bromo Tengger Semeru National Park, based on reviews of Google Maps users. The aspects consist of attractions, facilities, access, and price. The sentiment classification model used is a machine learning model consisting of SVM, Complement Naïve Bayes, Logistic Regression, and transfer learning from pre-trained BERT, IndoBERT, and mBERT. Based on the experimental results, transfer learning from the IndoBERT model achieved the best performance with accuracy and F1-Score of 91.48% and 71.56%, respectively. In addition, among the machine learning models used, the SVM model gives the best results with an accuracy of 89.16% and an F1-Score of 62.23%.

技术可以影响和塑造一个人在计划旅行、旅行和旅行后的行为模式。游客的评价可以作为提高旅游目的地质量的评估材料，并成为其他游客访问或重新访问目的地的决定因素。利用这些评论的过程可以通过根据游客的评论评估旅游目的地的各个方面来完成。本研究旨在根据谷歌地图用户的评论，对印度尼西亚的一个旅游目的地，即Bromo Tengger Semeru国家公园进行基于方面的情绪分析。这些方面包括景点、设施、通道和价格。所使用的情绪分类模型是一个机器学习模型，由SVM、互补朴素贝叶斯、逻辑回归和来自预训练的BERT、IndoBERT和mBERT的迁移学习组成。基于实验结果，IndoBERT模型的迁移学习取得了最佳性能，准确率和F1得分分别为91.48%和71.56%。此外，在所使用的机器学习模型中，SVM模型给出了最好的结果，准确率为89.16%，F1得分为62.23%。

{"title":"Aspect-Based Sentiment Analysis in Bromo Tengger Semeru National Park Indonesia Based on Google Maps User Reviews","authors":"Cynthia As Bahri, Lya Hulliyyatus Suadaa","doi":"10.22146/ijccs.77354","DOIUrl":"https://doi.org/10.22146/ijccs.77354","url":null,"abstract":"Technology can influence and shape a person's behavior patterns when planning tours, traveling, and after traveling. Visitors' reviews can be used as evaluation material to improve the quality of tourist destinations and become a determining factor for other tourists to visit or revisit the destinations. The process of utilizing these reviews can be done by assessing the aspects of tourist destinations based on reviews from visitors. This study aims to conduct an aspect-based sentiment analysis on one of the tourist destinations in Indonesia, namely Bromo Tengger Semeru National Park, based on reviews of Google Maps users. The aspects consist of attractions, facilities, access, and price. The sentiment classification model used is a machine learning model consisting of SVM, Complement Naïve Bayes, Logistic Regression, and transfer learning from pre-trained BERT, IndoBERT, and mBERT. Based on the experimental results, transfer learning from the IndoBERT model achieved the best performance with accuracy and F1-Score of 91.48% and 71.56%, respectively. In addition, among the machine learning models used, the SVM model gives the best results with an accuracy of 89.16% and an F1-Score of 62.23%.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45330560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Fishku Apps: Fishes Freshness Detection Using CNN With MobilenetV2 Fishku应用程序：使用CNN和MobiletV2检测鱼类新鲜度

IJCCS Indonesian Journal of Computing and Cybernetics Systems

Pub Date : 2023-02-23 DOI: 10.22146/ijccs.80049

Muthia Farah Hanifa, Anugrah Tri Ramadhan, Ni’Matul Husna, Nabila Apriliana Widiyono, Rhamdan Syahrul Mubarak, Adisti Anjani Putri, Sigit Priyanta

Marine fish are one of the most promising economic commodities for the Indonesian economy. Marine fish will decrease in protein content along with the decreasing level of freshness of the fish that will be consumed. There are still many people who do not know about the classification of fresh and unfresh fish, so we need a system that can classify which fish are fresh and which are not. Previous studies have succeeded in classifying tuna using a convolutional neural network (CNN) algorithm with an accuracy of 90%. In the preprocessing stage of this research, segmentation is carried out, which aims to separate the object to be studied and the background image, then feature extraction is carried out using a color moment, which aims to get the value of the object to be studied. This research was conducted to increase the accuracy value in the freshness classification of tuna and also to add some fish for freshness detection, such as mackerel and milkfish, using the MobilenetV2. The results were able to produce accuracy of 97%, 94%, and 93% for each fish. The freshness detection method in this study has been implemented in the Fishku mobile-based application.

海鱼是印尼经济最有前景的经济商品之一。海鱼的蛋白质含量会随着食用鱼类新鲜度的降低而降低。仍然有很多人不知道新鲜鱼和未解冻鱼的分类，所以我们需要一个系统来分类哪些鱼是新鲜的，哪些不是。先前的研究已经成功地使用卷积神经网络（CNN）算法对金枪鱼进行了分类，准确率为90%。在本研究的预处理阶段，进行了分割，目的是将待研究对象与背景图像分离，然后使用颜色矩进行特征提取，目的是获得待研究对象的值。本研究旨在提高金枪鱼新鲜度分类的准确性，并使用MobilenetV2添加一些鱼类进行新鲜度检测，如鲭鱼和乳鱼。结果显示，每条鱼的准确率分别为97%、94%和93%。本研究中的新鲜度检测方法已在Fishku移动应用程序中实现。

{"title":"Fishku Apps: Fishes Freshness Detection Using CNN With MobilenetV2","authors":"Muthia Farah Hanifa, Anugrah Tri Ramadhan, Ni’Matul Husna, Nabila Apriliana Widiyono, Rhamdan Syahrul Mubarak, Adisti Anjani Putri, Sigit Priyanta","doi":"10.22146/ijccs.80049","DOIUrl":"https://doi.org/10.22146/ijccs.80049","url":null,"abstract":"Marine fish are one of the most promising economic commodities for the Indonesian economy. Marine fish will decrease in protein content along with the decreasing level of freshness of the fish that will be consumed. There are still many people who do not know about the classification of fresh and unfresh fish, so we need a system that can classify which fish are fresh and which are not. Previous studies have succeeded in classifying tuna using a convolutional neural network (CNN) algorithm with an accuracy of 90%. In the preprocessing stage of this research, segmentation is carried out, which aims to separate the object to be studied and the background image, then feature extraction is carried out using a color moment, which aims to get the value of the object to be studied. This research was conducted to increase the accuracy value in the freshness classification of tuna and also to add some fish for freshness detection, such as mackerel and milkfish, using the MobilenetV2. The results were able to produce accuracy of 97%, 94%, and 93% for each fish. The freshness detection method in this study has been implemented in the Fishku mobile-based application.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42546742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Progressive Content Generation Based on Cyclic Graph for Generate Dungeon 基于循环图的生成地下城渐进式内容生成

IJCCS Indonesian Journal of Computing and Cybernetics Systems

Pub Date : 2023-02-23 DOI: 10.22146/ijccs.81178

Muhammad Anshar, R. Sumiharto, Moh Edi Wibowo

Dungeon is level in game consisting collection of rooms and doors with obstacles inside. To make good level, takes a lot of time. With Procedural Content Generation (PCG), dungeons can be created automatically. One of the approaches in PCG to create levels is progressive. Progressive approach produces timeline as representation of the interactions in the game. Timeline representation that is in the form of one straight line is good for endless runner, but for dungeon, the levels are linear. In this research, the timeline is changed to cyclic graph. Cyclic graph is formed using graph grammar algorithm. This research aims to build dungeon that has not linear and minimal dead ends. To eliminate linearity in dungeons, branching in dungeons needs to be formed. The steps carried out in this research are designing graph grammar rules, generating population of graphs, evaluating graphs with fitness values, and building dungeons. Four functions are used to determine the fitness value: shortest vertices, average duration, replayability, and variation. Dungeons produced with progressive approach manage to minimize linearity in dungeons. Dungeon formation is very dependent on the rule grammar that forms it. With the evaluation process, linear dungeons resulting from grammar rules can be minimized.

地下城是游戏中的关卡，由房间和门组成，里面有障碍物。要做出好的关卡，需要花费大量的时间。使用程序内容生成(PCG)，可以自动创建地下城。PCG中创造关卡的方法之一便是渐进式。渐进式方法将时间轴作为游戏互动的代表。以直线形式呈现的时间轴表现对于无尽奔跑者来说很好，但对于地下城来说，关卡则是线性的。在本研究中，将时间轴改为循环图。利用图语法算法生成循环图。这项研究的目的是建造一个没有线性和最小死角的地下城。为了消除地下城中的线性，需要在地下城中形成分支。本研究的步骤是设计图的语法规则，生成图的总体，用适应度值评估图，以及构建地下城。四个函数用于确定适应度值:最短顶点、平均持续时间、重玩性和变化。采用渐进式方法制作的《地下城》设法将地下城中的线性最小化。地下城的形成非常依赖于形成它的规则语法。通过评估过程，可以最小化由语法规则产生的线性地牢。

{"title":"Progressive Content Generation Based on Cyclic Graph for Generate Dungeon","authors":"Muhammad Anshar, R. Sumiharto, Moh Edi Wibowo","doi":"10.22146/ijccs.81178","DOIUrl":"https://doi.org/10.22146/ijccs.81178","url":null,"abstract":"Dungeon is level in game consisting collection of rooms and doors with obstacles inside. To make good level, takes a lot of time. With Procedural Content Generation (PCG), dungeons can be created automatically. One of the approaches in PCG to create levels is progressive. Progressive approach produces timeline as representation of the interactions in the game. Timeline representation that is in the form of one straight line is good for endless runner, but for dungeon, the levels are linear. In this research, the timeline is changed to cyclic graph. Cyclic graph is formed using graph grammar algorithm. This research aims to build dungeon that has not linear and minimal dead ends. To eliminate linearity in dungeons, branching in dungeons needs to be formed. The steps carried out in this research are designing graph grammar rules, generating population of graphs, evaluating graphs with fitness values, and building dungeons. Four functions are used to determine the fitness value: shortest vertices, average duration, replayability, and variation. Dungeons produced with progressive approach manage to minimize linearity in dungeons. Dungeon formation is very dependent on the rule grammar that forms it. With the evaluation process, linear dungeons resulting from grammar rules can be minimized.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43259877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Improved LSTM Method of Predicting Cryptocurrency Price Using Short-Term Data 基于短期数据预测加密货币价格的改进LSTM方法

IJCCS Indonesian Journal of Computing and Cybernetics Systems

Pub Date : 2023-02-23 DOI: 10.22146/ijccs.80776

Risna Sari, Kusrini Kusrini, Tonny Hidayat, T. Orphanoudakis

As cryptocurrencies develop, it cannot be denied that crypto prices are volatile. One of the influencing factors is the increasing volume of transactions which attracts the interest of researchers to conduct research in developing coin price predictions from cryptocurrencies. The method, algorithm and amount of data affect the prediction results. In this study, prediction modelling will be carried out using the LSTM method and short-term data. This study will conduct two experiments using the simple LSTM method and utilising multivariate time series with LSTM. The smallest predicted value is obtained using an 80/20 data allocation distribution scenario, input layer LSTM = 360, Epoch = 500, a Solana coin with RMSE = 0.111, R2 = 0.9962. It can be interpreted that short-term data can be used in making predictive models. Still, special attention needs to be paid to the characteristics of the dataset used and the modelling methodology, and it is hoped that the results of this study can be used in further research.

随着加密货币的发展，不可否认的是，加密价格是不稳定的。其中一个影响因素是交易量的增加，这吸引了研究人员对开发加密货币的硬币价格预测进行研究的兴趣。方法、算法和数据量影响预测结果。在本研究中，将使用LSTM方法和短期数据进行预测建模。本研究将使用简单LSTM方法和使用LSTM的多元时间序列进行两个实验。使用80/20的数据分配分布场景，输入层LSTM = 360, Epoch = 500，一个索拉纳硬币，RMSE = 0.111, R2 = 0.9962，得到预测值最小。可以这样解释，短期数据可以用来建立预测模型。但是，需要特别注意所使用的数据集的特点和建模方法，并希望本研究的结果可以用于进一步的研究。

引用次数: 2

SVEAuAdIR model of COVID-19 Transmission COVID-19传播的SVEAuAdIR模型

IJCCS Indonesian Journal of Computing and Cybernetics Systems

Pub Date : 2023-02-23 DOI: 10.22146/ijccs.73334

Anindhita Nisitasari, N. Rokhman

The COVID-19 pandemic that has occurred has received worldwide attention due to the rapid rate of transmission of the outbreak and the large number of deaths that occurred. The aim of this study is to build the SVEAuAdIR model , determine the transmission of COVID-19 in Indonesia by forecast the spread of the disease, and determine the effect of vaccination by looking at the basic reproduction number of SVEAuAdIR model. The results obtained from MAPE on the model are 12%. So it can be said that the SVEAuAdIR model is good for prediction models for the spread of COVID-19. The situation where there are no more individuals infected with COVID-19 is called COVID-19 disease free, thus it is predicted that Indonesia will be free of COVID-19 on October 7, 2021. The target of the Indonesian Ministry of Health is that by the end of 2021 the spread of COVID-19 can be stopped . However, on October 7, 2021, judging from the actual data during this research, there were still new cases of COVID-19. On that day there were 1393 new cases infected with COVID-19. Thus, showing that Indonesia's target of being free of COVID-19 disease by the end of 2021 has not been achieved. The number of the SVEAuAdIR model is in the range of values , which means that the spread of disease is close to disease-free. Based on the results of the value of the SVEAuAdIR model, this study concluded that vaccination could reduce the spread of COVID-19 compared to those who did not vaccinate

新冠肺炎疫情因其传播速度快和死亡人数多而受到全世界的关注。本研究的目的是建立SVEAuAdIR模型，通过预测疾病的传播来确定新冠肺炎在印度尼西亚的传播，并通过观察SVEAuAdIR模型的基本繁殖数来确定疫苗接种的效果。从模型的MAPE获得的结果是12%。因此可以说，SVEAuAdIR模型对于新冠肺炎传播的预测模型是很好的。没有更多人感染新冠肺炎的情况被称为无新冠肺炎疾病，因此预测印度尼西亚将于2021年10月7日无新冠肺炎。印度尼西亚卫生部的目标是到2021年底可以阻止新冠肺炎的传播。然而，2021年10月7日，从本次研究期间的实际数据来看，新冠肺炎仍有新增病例。当日新增感染新冠肺炎病例1393例。因此，这表明印度尼西亚到2021年底摆脱新冠肺炎疾病的目标尚未实现。SVEAuAdIR模型的数量在值的范围内，这意味着疾病的传播接近无病。根据SVEAuAdIR模型的价值结果，本研究得出结论，与未接种疫苗的人相比，接种疫苗可以减少新冠肺炎的传播

{"title":"SVEAuAdIR model of COVID-19 Transmission","authors":"Anindhita Nisitasari, N. Rokhman","doi":"10.22146/ijccs.73334","DOIUrl":"https://doi.org/10.22146/ijccs.73334","url":null,"abstract":"The COVID-19 pandemic that has occurred has received worldwide attention due to the rapid rate of transmission of the outbreak and the large number of deaths that occurred. The aim of this study is to build the SVEAuAdIR model , determine the transmission of COVID-19 in Indonesia by forecast the spread of the disease, and determine the effect of vaccination by looking at the basic reproduction number of SVEAuAdIR model. The results obtained from MAPE on the model are 12%. So it can be said that the SVEAuAdIR model is good for prediction models for the spread of COVID-19. The situation where there are no more individuals infected with COVID-19 is called COVID-19 disease free, thus it is predicted that Indonesia will be free of COVID-19 on October 7, 2021. The target of the Indonesian Ministry of Health is that by the end of 2021 the spread of COVID-19 can be stopped . However, on October 7, 2021, judging from the actual data during this research, there were still new cases of COVID-19. On that day there were 1393 new cases infected with COVID-19. Thus, showing that Indonesia's target of being free of COVID-19 disease by the end of 2021 has not been achieved. The number of the SVEAuAdIR model is in the range of values , which means that the spread of disease is close to disease-free. Based on the results of the value of the SVEAuAdIR model, this study concluded that vaccination could reduce the spread of COVID-19 compared to those who did not vaccinate","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42191223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Information Retrieval for Early Detection of Disease Using Semantic Similarity 基于语义相似度的疾病早期检测信息检索

IJCCS Indonesian Journal of Computing and Cybernetics Systems

Pub Date : 2023-02-23 DOI: 10.22146/ijccs.80077

Aszani Aszani, Hayyu Ilham Wicaksono, Uffi Nadzima, Lukman Heryawan

The growth of medical records continues to increase and needs to be used to improve doctors' performance in diagnosing a disease. A retrieval method returns proposed information to provide diagnostic recommendations based on symptoms from medical record datasets by applying the TF-IDF and cosine similarity methods. The challenge in this study was that the symptoms in the medical record dataset were dirty data obtained from patients who were not familiar with biological terms. Therefore, the symptoms were matched in the medical record data with the symptom terms used in the system and from the results, data augmentation was carried out to increase the amount of data up to about 3 times more. In the TF-IDF the highest accuracy with is only , while after augmentation of the test data, the accuracy becomes . The highest accuracy results with the same value using the cosine similarity method is and with the augmented test data accuracy increasing to . From this study it was concluded that a system with sufficient and relevant input of symptoms would provide a more accurate disease prediction. Prediction results using the TF-IDF method with are more accurate than predictions using the cosine similarity method.

医疗记录的增长持续增加，需要用来提高医生诊断疾病的表现。检索方法通过应用TF-IDF和余弦相似性方法，返回建议的信息，以基于来自医疗记录数据集的症状提供诊断建议。这项研究的挑战是，病历数据集中的症状是从不熟悉生物学术语的患者那里获得的肮脏数据。因此，将病历数据中的症状与系统中使用的症状术语相匹配，并根据结果进行数据扩充，将数据量增加约3倍。在TF-IDF中，最高精度仅为，而在增加测试数据后，精度变为。使用余弦相似性方法得到的具有相同值的最高精度结果是，并且随着增强测试数据精度增加到。根据这项研究得出的结论是，一个具有足够和相关症状输入的系统将提供更准确的疾病预测。使用TF-IDF方法的预测结果比使用余弦相似性方法的预测更准确。

{"title":"Information Retrieval for Early Detection of Disease Using Semantic Similarity","authors":"Aszani Aszani, Hayyu Ilham Wicaksono, Uffi Nadzima, Lukman Heryawan","doi":"10.22146/ijccs.80077","DOIUrl":"https://doi.org/10.22146/ijccs.80077","url":null,"abstract":" The growth of medical records continues to increase and needs to be used to improve doctors' performance in diagnosing a disease. A retrieval method returns proposed information to provide diagnostic recommendations based on symptoms from medical record datasets by applying the TF-IDF and cosine similarity methods. The challenge in this study was that the symptoms in the medical record dataset were dirty data obtained from patients who were not familiar with biological terms. Therefore, the symptoms were matched in the medical record data with the symptom terms used in the system and from the results, data augmentation was carried out to increase the amount of data up to about 3 times more. In the TF-IDF the highest accuracy with is only , while after augmentation of the test data, the accuracy becomes . The highest accuracy results with the same value using the cosine similarity method is and with the augmented test data accuracy increasing to . From this study it was concluded that a system with sufficient and relevant input of symptoms would provide a more accurate disease prediction. Prediction results using the TF-IDF method with are more accurate than predictions using the cosine similarity method.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45079187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0