Cardiovascular disease (CVD) is the leading cause of death worldwide. Primary prevention is by early prediction of the disease onset. Using laboratory data from the National Health and Nutrition Examination Survey (NHANES) in 2017-2020 timeframe (N= 7.974), we tested the ability of machine learning (ML) algorithms to classify individuals at risk. The ML models were evaluated based on their classification performances after comparing four imputation, three imbalance resampling, and three feature selection techniques.Due to its popularity, we utilized decision tree (DT) as the baseline. Integration of multiple imputation by chained equation (MICE) and synthetic minority oversampling with Tomek link down-sampling (SMOTETomek) into the model improved the area under the curve-receiver operating characteristics (AUC-ROC) from 57% to 83%. Applying simultaneous perturbation feature selection and ranking (spFSR) reduced the feature predictors from 144 to 30 features and the computational time by 22%. The best techniques were applied to six ML models, resulting in Xtreme gradient boosting (XGBoost) achieving the highest accuracy of 93% and AUC-ROC of 89%.The accuracy of our ML model in predicting CVD outperforms those from previous studies. We also highlight the important causes of CVD, which might be investigated further for potential effects on electronic health records.
{"title":"Improving Cardiovascular Disease Prediction by Integrating Imputation, Imbalance Resampling, and Feature Selection Techniques into Machine Learning Model","authors":"Fadlan Hamid Alfebi, M. D. Anasanti","doi":"10.22146/ijccs.80214","DOIUrl":"https://doi.org/10.22146/ijccs.80214","url":null,"abstract":"Cardiovascular disease (CVD) is the leading cause of death worldwide. Primary prevention is by early prediction of the disease onset. Using laboratory data from the National Health and Nutrition Examination Survey (NHANES) in 2017-2020 timeframe (N= 7.974), we tested the ability of machine learning (ML) algorithms to classify individuals at risk. The ML models were evaluated based on their classification performances after comparing four imputation, three imbalance resampling, and three feature selection techniques.Due to its popularity, we utilized decision tree (DT) as the baseline. Integration of multiple imputation by chained equation (MICE) and synthetic minority oversampling with Tomek link down-sampling (SMOTETomek) into the model improved the area under the curve-receiver operating characteristics (AUC-ROC) from 57% to 83%. Applying simultaneous perturbation feature selection and ranking (spFSR) reduced the feature predictors from 144 to 30 features and the computational time by 22%. The best techniques were applied to six ML models, resulting in Xtreme gradient boosting (XGBoost) achieving the highest accuracy of 93% and AUC-ROC of 89%.The accuracy of our ML model in predicting CVD outperforms those from previous studies. We also highlight the important causes of CVD, which might be investigated further for potential effects on electronic health records. ","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42287060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
I. Kurniawan, Melani Anggraini, A. Aditsania, E. B. Setiawan
Cancer is known as the second leading cause of death worldwide. About 7-10 million cases of death by cancer occur every year. The recent treatment to heal the cancer is chemotherapy. However, chemotherapy treatment is known to have side effects and cell resistance issues to certain drugs. Therefore, it is required to develop a new drug that can reduce the side effects and provide a better treatment effect. In general, anti-cancer drugs are developed by targeting Cyclin-Dependent Kinase 2 (CDK2) enzyme. Conventional drug design is not effective and efficient for obtaining new drug candidates because of no information about the biological activity before it is synthesized. In this study, we aim to develop a model to predict the activity of CDK2 inhibitors by using ensemble methods, i.e., XGBoost, Random Forest, and AdaBoost. The study was conducted by calculating several fingerprints, i.e., Estate, Extended, Maccs, and Pubchem, as feature variables. Based on the results, we found that Random Forest with Pubchem fingerprint gives the best result with the value of Matthews Correlation Coefficient (MCC) and Area Under the ROC Curve (AUC) values are 0.979 and 0.999, respectively. From this study, we contributed to revealing the potency of the ensemble with fingerprint in bioactivity prediction, especially CDK2 inhibitors as anti-cancer agents.
{"title":"Implementation of Ensemble Methods on Classification of CDK2 Inhibitor as Anti-Cancer Agent","authors":"I. Kurniawan, Melani Anggraini, A. Aditsania, E. B. Setiawan","doi":"10.22146/ijccs.78537","DOIUrl":"https://doi.org/10.22146/ijccs.78537","url":null,"abstract":"Cancer is known as the second leading cause of death worldwide. About 7-10 million cases of death by cancer occur every year. The recent treatment to heal the cancer is chemotherapy. However, chemotherapy treatment is known to have side effects and cell resistance issues to certain drugs. Therefore, it is required to develop a new drug that can reduce the side effects and provide a better treatment effect. In general, anti-cancer drugs are developed by targeting Cyclin-Dependent Kinase 2 (CDK2) enzyme. Conventional drug design is not effective and efficient for obtaining new drug candidates because of no information about the biological activity before it is synthesized. In this study, we aim to develop a model to predict the activity of CDK2 inhibitors by using ensemble methods, i.e., XGBoost, Random Forest, and AdaBoost. The study was conducted by calculating several fingerprints, i.e., Estate, Extended, Maccs, and Pubchem, as feature variables. Based on the results, we found that Random Forest with Pubchem fingerprint gives the best result with the value of Matthews Correlation Coefficient (MCC) and Area Under the ROC Curve (AUC) values are 0.979 and 0.999, respectively. From this study, we contributed to revealing the potency of the ensemble with fingerprint in bioactivity prediction, especially CDK2 inhibitors as anti-cancer agents.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43421425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Authorship attribution is an NLP task to identify the author of a text based on stylometric analysis. On the other hand, authorship obfuscation aims to protect against authorship attribution by modifying a text’s style. The main challenge in authorship obfuscation is how to keep the content of the text despite the text modification. In this research, we are applying text style transfer methods for modifying the writing style while preserving the content of the input text. We implemented two unsupervised text style transfer: dictionary-based and back translation methods to change the formality level of the text. Experiment results shows that the back-translation method outperformed the dictionary-based method. The authorship attribution performance decreased up to 16.15% and 23.66% on F1-score for 3 and 10 authors respectively using back-translation. While for dictionary-based method the F1-score dropped up to 1.99% and 11.56% for 3 and 10 authors respectively. Evaluation on sensibleness and soundness factors show that the back-translation method can preserve the semantic of the obfuscated texts. Moreover, the modified texts are well-formed and inconspicuous.
{"title":"Unsupervised Text Style Transfer for Authorship Obfuscation in Bahasa Indonesia","authors":"Yunita Sari, Fadhlan Pasyah Al Faridzi","doi":"10.22146/ijccs.79623","DOIUrl":"https://doi.org/10.22146/ijccs.79623","url":null,"abstract":"Authorship attribution is an NLP task to identify the author of a text based on stylometric analysis. On the other hand, authorship obfuscation aims to protect against authorship attribution by modifying a text’s style. The main challenge in authorship obfuscation is how to keep the content of the text despite the text modification. In this research, we are applying text style transfer methods for modifying the writing style while preserving the content of the input text. We implemented two unsupervised text style transfer: dictionary-based and back translation methods to change the formality level of the text. Experiment results shows that the back-translation method outperformed the dictionary-based method. The authorship attribution performance decreased up to 16.15% and 23.66% on F1-score for 3 and 10 authors respectively using back-translation. While for dictionary-based method the F1-score dropped up to 1.99% and 11.56% for 3 and 10 authors respectively. Evaluation on sensibleness and soundness factors show that the back-translation method can preserve the semantic of the obfuscated texts. Moreover, the modified texts are well-formed and inconspicuous. ","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44597010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Angga Prima Syahputra, Alda Cendekia Siregar, Rachmat Wahid Saleh Insani
Insect pests are an important problem to overcome in agriculture. The purpose of this research is to classify insect pests with the IP-102 dataset using several CNN pre-trained models and choose which model is best for classifying insect pest data. The method used is the transfer learning method with a fine-tuning approach. Transfer learning was chosen because this technique can use the features and weights that have been obtained during the previous training process. Thus, computation time can be reduced and accuracy can be increased. The models used include Xception, MobileNetV3L, MobileNetV2, DenseNet-201, and InceptionV3. Fine-tuning and freeze layer techniques are also used to improve the quality of the resulting model, making it more accurate and better suited to the problem at hand. This study uses 75,222 image data with 102 classes. The results of this study are the DenseNet-201 model with fine-tuning produces an accuracy value of 70%, MobileNetV2 66%, MobileNetV3L 68%, InceptionV3 67%, Xception 69%. The conclusion of this study is that the transfer learning method with the fine-tuning approach produces the highest accuracy value of 70% in the DenseNet-201 model.
{"title":"Comparison of CNN Models With Transfer Learning in the Classification of Insect Pests","authors":"Angga Prima Syahputra, Alda Cendekia Siregar, Rachmat Wahid Saleh Insani","doi":"10.22146/ijccs.80956","DOIUrl":"https://doi.org/10.22146/ijccs.80956","url":null,"abstract":"Insect pests are an important problem to overcome in agriculture. The purpose of this research is to classify insect pests with the IP-102 dataset using several CNN pre-trained models and choose which model is best for classifying insect pest data. The method used is the transfer learning method with a fine-tuning approach. Transfer learning was chosen because this technique can use the features and weights that have been obtained during the previous training process. Thus, computation time can be reduced and accuracy can be increased. The models used include Xception, MobileNetV3L, MobileNetV2, DenseNet-201, and InceptionV3. Fine-tuning and freeze layer techniques are also used to improve the quality of the resulting model, making it more accurate and better suited to the problem at hand. This study uses 75,222 image data with 102 classes. The results of this study are the DenseNet-201 model with fine-tuning produces an accuracy value of 70%, MobileNetV2 66%, MobileNetV3L 68%, InceptionV3 67%, Xception 69%. The conclusion of this study is that the transfer learning method with the fine-tuning approach produces the highest accuracy value of 70% in the DenseNet-201 model.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44358665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Technology can influence and shape a person's behavior patterns when planning tours, traveling, and after traveling. Visitors' reviews can be used as evaluation material to improve the quality of tourist destinations and become a determining factor for other tourists to visit or revisit the destinations. The process of utilizing these reviews can be done by assessing the aspects of tourist destinations based on reviews from visitors. This study aims to conduct an aspect-based sentiment analysis on one of the tourist destinations in Indonesia, namely Bromo Tengger Semeru National Park, based on reviews of Google Maps users. The aspects consist of attractions, facilities, access, and price. The sentiment classification model used is a machine learning model consisting of SVM, Complement Naïve Bayes, Logistic Regression, and transfer learning from pre-trained BERT, IndoBERT, and mBERT. Based on the experimental results, transfer learning from the IndoBERT model achieved the best performance with accuracy and F1-Score of 91.48% and 71.56%, respectively. In addition, among the machine learning models used, the SVM model gives the best results with an accuracy of 89.16% and an F1-Score of 62.23%.
{"title":"Aspect-Based Sentiment Analysis in Bromo Tengger Semeru National Park Indonesia Based on Google Maps User Reviews","authors":"Cynthia As Bahri, Lya Hulliyyatus Suadaa","doi":"10.22146/ijccs.77354","DOIUrl":"https://doi.org/10.22146/ijccs.77354","url":null,"abstract":"Technology can influence and shape a person's behavior patterns when planning tours, traveling, and after traveling. Visitors' reviews can be used as evaluation material to improve the quality of tourist destinations and become a determining factor for other tourists to visit or revisit the destinations. The process of utilizing these reviews can be done by assessing the aspects of tourist destinations based on reviews from visitors. This study aims to conduct an aspect-based sentiment analysis on one of the tourist destinations in Indonesia, namely Bromo Tengger Semeru National Park, based on reviews of Google Maps users. The aspects consist of attractions, facilities, access, and price. The sentiment classification model used is a machine learning model consisting of SVM, Complement Naïve Bayes, Logistic Regression, and transfer learning from pre-trained BERT, IndoBERT, and mBERT. Based on the experimental results, transfer learning from the IndoBERT model achieved the best performance with accuracy and F1-Score of 91.48% and 71.56%, respectively. In addition, among the machine learning models used, the SVM model gives the best results with an accuracy of 89.16% and an F1-Score of 62.23%.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45330560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Marine fish are one of the most promising economic commodities for the Indonesian economy. Marine fish will decrease in protein content along with the decreasing level of freshness of the fish that will be consumed. There are still many people who do not know about the classification of fresh and unfresh fish, so we need a system that can classify which fish are fresh and which are not. Previous studies have succeeded in classifying tuna using a convolutional neural network (CNN) algorithm with an accuracy of 90%. In the preprocessing stage of this research, segmentation is carried out, which aims to separate the object to be studied and the background image, then feature extraction is carried out using a color moment, which aims to get the value of the object to be studied. This research was conducted to increase the accuracy value in the freshness classification of tuna and also to add some fish for freshness detection, such as mackerel and milkfish, using the MobilenetV2. The results were able to produce accuracy of 97%, 94%, and 93% for each fish. The freshness detection method in this study has been implemented in the Fishku mobile-based application.
{"title":"Fishku Apps: Fishes Freshness Detection Using CNN With MobilenetV2","authors":"Muthia Farah Hanifa, Anugrah Tri Ramadhan, Ni’Matul Husna, Nabila Apriliana Widiyono, Rhamdan Syahrul Mubarak, Adisti Anjani Putri, Sigit Priyanta","doi":"10.22146/ijccs.80049","DOIUrl":"https://doi.org/10.22146/ijccs.80049","url":null,"abstract":"Marine fish are one of the most promising economic commodities for the Indonesian economy. Marine fish will decrease in protein content along with the decreasing level of freshness of the fish that will be consumed. There are still many people who do not know about the classification of fresh and unfresh fish, so we need a system that can classify which fish are fresh and which are not. Previous studies have succeeded in classifying tuna using a convolutional neural network (CNN) algorithm with an accuracy of 90%. In the preprocessing stage of this research, segmentation is carried out, which aims to separate the object to be studied and the background image, then feature extraction is carried out using a color moment, which aims to get the value of the object to be studied. This research was conducted to increase the accuracy value in the freshness classification of tuna and also to add some fish for freshness detection, such as mackerel and milkfish, using the MobilenetV2. The results were able to produce accuracy of 97%, 94%, and 93% for each fish. The freshness detection method in this study has been implemented in the Fishku mobile-based application.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42546742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dungeon is level in game consisting collection of rooms and doors with obstacles inside. To make good level, takes a lot of time. With Procedural Content Generation (PCG), dungeons can be created automatically. One of the approaches in PCG to create levels is progressive. Progressive approach produces timeline as representation of the interactions in the game. Timeline representation that is in the form of one straight line is good for endless runner, but for dungeon, the levels are linear. In this research, the timeline is changed to cyclic graph. Cyclic graph is formed using graph grammar algorithm. This research aims to build dungeon that has not linear and minimal dead ends. To eliminate linearity in dungeons, branching in dungeons needs to be formed. The steps carried out in this research are designing graph grammar rules, generating population of graphs, evaluating graphs with fitness values, and building dungeons. Four functions are used to determine the fitness value: shortest vertices, average duration, replayability, and variation. Dungeons produced with progressive approach manage to minimize linearity in dungeons. Dungeon formation is very dependent on the rule grammar that forms it. With the evaluation process, linear dungeons resulting from grammar rules can be minimized.
{"title":"Progressive Content Generation Based on Cyclic Graph for Generate Dungeon","authors":"Muhammad Anshar, R. Sumiharto, Moh Edi Wibowo","doi":"10.22146/ijccs.81178","DOIUrl":"https://doi.org/10.22146/ijccs.81178","url":null,"abstract":"Dungeon is level in game consisting collection of rooms and doors with obstacles inside. To make good level, takes a lot of time. With Procedural Content Generation (PCG), dungeons can be created automatically. One of the approaches in PCG to create levels is progressive. Progressive approach produces timeline as representation of the interactions in the game. Timeline representation that is in the form of one straight line is good for endless runner, but for dungeon, the levels are linear. In this research, the timeline is changed to cyclic graph. Cyclic graph is formed using graph grammar algorithm. This research aims to build dungeon that has not linear and minimal dead ends. To eliminate linearity in dungeons, branching in dungeons needs to be formed. The steps carried out in this research are designing graph grammar rules, generating population of graphs, evaluating graphs with fitness values, and building dungeons. Four functions are used to determine the fitness value: shortest vertices, average duration, replayability, and variation. Dungeons produced with progressive approach manage to minimize linearity in dungeons. Dungeon formation is very dependent on the rule grammar that forms it. With the evaluation process, linear dungeons resulting from grammar rules can be minimized.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43259877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Risna Sari, Kusrini Kusrini, Tonny Hidayat, T. Orphanoudakis
As cryptocurrencies develop, it cannot be denied that crypto prices are volatile. One of the influencing factors is the increasing volume of transactions which attracts the interest of researchers to conduct research in developing coin price predictions from cryptocurrencies. The method, algorithm and amount of data affect the prediction results. In this study, prediction modelling will be carried out using the LSTM method and short-term data. This study will conduct two experiments using the simple LSTM method and utilising multivariate time series with LSTM. The smallest predicted value is obtained using an 80/20 data allocation distribution scenario, input layer LSTM = 360, Epoch = 500, a Solana coin with RMSE = 0.111, R2 = 0.9962. It can be interpreted that short-term data can be used in making predictive models. Still, special attention needs to be paid to the characteristics of the dataset used and the modelling methodology, and it is hoped that the results of this study can be used in further research.
{"title":"Improved LSTM Method of Predicting Cryptocurrency Price Using Short-Term Data","authors":"Risna Sari, Kusrini Kusrini, Tonny Hidayat, T. Orphanoudakis","doi":"10.22146/ijccs.80776","DOIUrl":"https://doi.org/10.22146/ijccs.80776","url":null,"abstract":"As cryptocurrencies develop, it cannot be denied that crypto prices are volatile. One of the influencing factors is the increasing volume of transactions which attracts the interest of researchers to conduct research in developing coin price predictions from cryptocurrencies. The method, algorithm and amount of data affect the prediction results. In this study, prediction modelling will be carried out using the LSTM method and short-term data. This study will conduct two experiments using the simple LSTM method and utilising multivariate time series with LSTM. The smallest predicted value is obtained using an 80/20 data allocation distribution scenario, input layer LSTM = 360, Epoch = 500, a Solana coin with RMSE = 0.111, R2 = 0.9962. It can be interpreted that short-term data can be used in making predictive models. Still, special attention needs to be paid to the characteristics of the dataset used and the modelling methodology, and it is hoped that the results of this study can be used in further research.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43083069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The COVID-19 pandemic that has occurred has received worldwide attention due to the rapid rate of transmission of the outbreak and the large number of deaths that occurred. The aim of this study is to build the SVEAuAdIR model , determine the transmission of COVID-19 in Indonesia by forecast the spread of the disease, and determine the effect of vaccination by looking at the basic reproduction number of SVEAuAdIR model. The results obtained from MAPE on the model are 12%. So it can be said that the SVEAuAdIR model is good for prediction models for the spread of COVID-19. The situation where there are no more individuals infected with COVID-19 is called COVID-19 disease free, thus it is predicted that Indonesia will be free of COVID-19 on October 7, 2021. The target of the Indonesian Ministry of Health is that by the end of 2021 the spread of COVID-19 can be stopped . However, on October 7, 2021, judging from the actual data during this research, there were still new cases of COVID-19. On that day there were 1393 new cases infected with COVID-19. Thus, showing that Indonesia's target of being free of COVID-19 disease by the end of 2021 has not been achieved. The number of the SVEAuAdIR model is in the range of values , which means that the spread of disease is close to disease-free. Based on the results of the value of the SVEAuAdIR model, this study concluded that vaccination could reduce the spread of COVID-19 compared to those who did not vaccinate
{"title":"SVEAuAdIR model of COVID-19 Transmission","authors":"Anindhita Nisitasari, N. Rokhman","doi":"10.22146/ijccs.73334","DOIUrl":"https://doi.org/10.22146/ijccs.73334","url":null,"abstract":"The COVID-19 pandemic that has occurred has received worldwide attention due to the rapid rate of transmission of the outbreak and the large number of deaths that occurred. The aim of this study is to build the SVEAuAdIR model , determine the transmission of COVID-19 in Indonesia by forecast the spread of the disease, and determine the effect of vaccination by looking at the basic reproduction number of SVEAuAdIR model. The results obtained from MAPE on the model are 12%. So it can be said that the SVEAuAdIR model is good for prediction models for the spread of COVID-19. The situation where there are no more individuals infected with COVID-19 is called COVID-19 disease free, thus it is predicted that Indonesia will be free of COVID-19 on October 7, 2021. The target of the Indonesian Ministry of Health is that by the end of 2021 the spread of COVID-19 can be stopped . However, on October 7, 2021, judging from the actual data during this research, there were still new cases of COVID-19. On that day there were 1393 new cases infected with COVID-19. Thus, showing that Indonesia's target of being free of COVID-19 disease by the end of 2021 has not been achieved. The number of the SVEAuAdIR model is in the range of values , which means that the spread of disease is close to disease-free. Based on the results of the value of the SVEAuAdIR model, this study concluded that vaccination could reduce the spread of COVID-19 compared to those who did not vaccinate","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42191223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Aszani Aszani, Hayyu Ilham Wicaksono, Uffi Nadzima, Lukman Heryawan
The growth of medical records continues to increase and needs to be used to improve doctors' performance in diagnosing a disease. A retrieval method returns proposed information to provide diagnostic recommendations based on symptoms from medical record datasets by applying the TF-IDF and cosine similarity methods. The challenge in this study was that the symptoms in the medical record dataset were dirty data obtained from patients who were not familiar with biological terms. Therefore, the symptoms were matched in the medical record data with the symptom terms used in the system and from the results, data augmentation was carried out to increase the amount of data up to about 3 times more. In the TF-IDF the highest accuracy with is only , while after augmentation of the test data, the accuracy becomes . The highest accuracy results with the same value using the cosine similarity method is and with the augmented test data accuracy increasing to . From this study it was concluded that a system with sufficient and relevant input of symptoms would provide a more accurate disease prediction. Prediction results using the TF-IDF method with are more accurate than predictions using the cosine similarity method.
{"title":"Information Retrieval for Early Detection of Disease Using Semantic Similarity","authors":"Aszani Aszani, Hayyu Ilham Wicaksono, Uffi Nadzima, Lukman Heryawan","doi":"10.22146/ijccs.80077","DOIUrl":"https://doi.org/10.22146/ijccs.80077","url":null,"abstract":" The growth of medical records continues to increase and needs to be used to improve doctors' performance in diagnosing a disease. A retrieval method returns proposed information to provide diagnostic recommendations based on symptoms from medical record datasets by applying the TF-IDF and cosine similarity methods. The challenge in this study was that the symptoms in the medical record dataset were dirty data obtained from patients who were not familiar with biological terms. Therefore, the symptoms were matched in the medical record data with the symptom terms used in the system and from the results, data augmentation was carried out to increase the amount of data up to about 3 times more. In the TF-IDF the highest accuracy with is only , while after augmentation of the test data, the accuracy becomes . The highest accuracy results with the same value using the cosine similarity method is and with the augmented test data accuracy increasing to . From this study it was concluded that a system with sufficient and relevant input of symptoms would provide a more accurate disease prediction. Prediction results using the TF-IDF method with are more accurate than predictions using the cosine similarity method.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45079187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}