首页 > 最新文献

Jurnal Teknologi dan Sistem Komputer最新文献

英文 中文
An authentication alternative using histogram shifting steganography method 使用直方图移位隐写方法的认证替代方案
Pub Date : 2021-01-01 DOI: 10.14710/jtsiskom.2021.13931
Irsandy Maulana Satya Viddin, Antonius Cahya Prihandoko, D. M. Firmansyah
This study aims to develop an authentication alternative by applying the Histogram shifting steganography method. The media used for authentication is image media. Histogram shifting utilizes the histogram of an image to insert a secret message. The developed authentication has implemented the Histogram shifting to insert user credentials into the carrier image. Users can use the steganographic image to log into their accounts. The method extracts the credentials from the image during the login. PSNR test of the steganographic images produces an average value of 52.52 dB. The extraction capability test shows that the method can extract all test images correctly. In addition, this authentication method is also more resistant to attacks common to password authentication.
本研究旨在应用直方图移位隐写法开发一种认证替代方案。用于身份验证的媒体是映像媒体。直方图移位利用图像的直方图插入秘密信息。所开发的身份验证实现了直方图移位,将用户凭据插入到载波图像中。用户可以使用隐写图像登录他们的账户。该方法在登录期间从映像提取凭据。对隐写图像进行PSNR测试,得到的均值为52.52 dB。提取能力测试表明,该方法能够正确提取所有测试图像。此外,这种认证方式也更能抵御密码认证常见的攻击。
{"title":"An authentication alternative using histogram shifting steganography method","authors":"Irsandy Maulana Satya Viddin, Antonius Cahya Prihandoko, D. M. Firmansyah","doi":"10.14710/jtsiskom.2021.13931","DOIUrl":"https://doi.org/10.14710/jtsiskom.2021.13931","url":null,"abstract":"This study aims to develop an authentication alternative by applying the Histogram shifting steganography method. The media used for authentication is image media. Histogram shifting utilizes the histogram of an image to insert a secret message. The developed authentication has implemented the Histogram shifting to insert user credentials into the carrier image. Users can use the steganographic image to log into their accounts. The method extracts the credentials from the image during the login. PSNR test of the steganographic images produces an average value of 52.52 dB. The extraction capability test shows that the method can extract all test images correctly. In addition, this authentication method is also more resistant to attacks common to password authentication.","PeriodicalId":56231,"journal":{"name":"Jurnal Teknologi dan Sistem Komputer","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67019419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erratum: Optimization of k value and lag parameter of k-nearest neighbor algorithm on the prediction of hotel occupancy rates 勘误:优化k-最近邻算法的k值和滞后参数对酒店入住率的预测
Pub Date : 2020-12-09 DOI: 10.14710/jtsiskom.2021.14007
Agus Subhan Akbar, R. H. Kusumodestoni
This correct the article "Optimasi nilai k dan parameter lag algoritme k-nearest neighbor pada prediksi tingkat hunian hotel (Optimization of k value and lag parameter of k-nearest neighbor algorithm on the prediction of hotel occupancy rates)" in vol. 8, no. 3, pp. 246-254, Jul. 2020; https://doi.org/10.14710/jtsiskom.2020.13648In the original published article, the placement of Figure 8 and Figure 9 less appropriate, which causes the manuscript hard to read. In addition, Table 2 through Table 6 need to be repositioned. These placing errors have been corrected online.The publisher apologizes for these errors. 
本文对《酒店入住率预测的k值和k近邻算法的滞后参数优化》(第八卷第8期)一文中的“Optimasi nilai k- dan参数滞后算法k-nearest neighbor pada prediksi tingkat huunian hotel”进行了修正。3, pp. 246-254, july 2020;https://doi.org/10.14710/jtsiskom.2020.13648In原发表的文章,图8和图9的位置不太合适,这导致稿件难以阅读。此外,表2到表6需要重新定位。这些放置错误已在网上更正。出版商对这些错误表示歉意。
{"title":"Erratum: Optimization of k value and lag parameter of k-nearest neighbor algorithm on the prediction of hotel occupancy rates","authors":"Agus Subhan Akbar, R. H. Kusumodestoni","doi":"10.14710/jtsiskom.2021.14007","DOIUrl":"https://doi.org/10.14710/jtsiskom.2021.14007","url":null,"abstract":"This correct the article \"Optimasi nilai k dan parameter lag algoritme k-nearest neighbor pada prediksi tingkat hunian hotel (Optimization of k value and lag parameter of k-nearest neighbor algorithm on the prediction of hotel occupancy rates)\" in vol. 8, no. 3, pp. 246-254, Jul. 2020; https://doi.org/10.14710/jtsiskom.2020.13648In the original published article, the placement of Figure 8 and Figure 9 less appropriate, which causes the manuscript hard to read. In addition, Table 2 through Table 6 need to be repositioned. These placing errors have been corrected online.The publisher apologizes for these errors. ","PeriodicalId":56231,"journal":{"name":"Jurnal Teknologi dan Sistem Komputer","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47927731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison analysis of Euclidean and Gower distance measures on k-medoids cluster k-medoids簇上欧氏距离测度与Gower距离测度的比较分析
Pub Date : 2020-12-08 DOI: 10.14710/JTSISKOM.2020.13747
Agil Aditya, B. Sari, T. N. Padilah
K-medoids is a clustering method that uses the distance method to find and classify data that have similarities and inequalities between data. This shows that the determination of the distance measurement method is important because it affects the performance of the k-medoids cluster results. From several studies, it is stated that the Euclidean and Gower methods can be used as measurement methods in clustering with numerical data. This study aims to compare the performance of the k-medoids clustering results on a numerical dataset using the Euclidean and Gower methods. The method used is the Knowledge Discovery in Database (KDD) method. In this study, seven numerical datasets were used and the evaluation of clustering results used silhouette, Dunn, and connectivity values. The Euclidean distance method is superior to the two values of silhouette evaluation and connectivity, it shows that Euclidean has a good data grouping structure, while the Gower is superior to the Dunn value, which shows that the Gower has good cluster separation compared to Euclidean. This study shows that the Euclidean method is superior to the Gower method in the application of the k-medoids algorithm with a numeric dataset.
K-medoids是一种聚类方法,它使用距离方法来发现和分类数据之间具有相似性和不对称性的数据。这表明距离测量方法的确定很重要,因为它会影响k- medioid聚类结果的性能。一些研究表明,欧几里得方法和高尔方法可以作为数值数据聚类的测量方法。本研究旨在比较欧几里得方法和高尔方法在数值数据集上k- medioids聚类结果的性能。使用的方法是数据库中的知识发现(KDD)方法。在本研究中,使用了7个数值数据集,并使用剪影值、Dunn值和连通性值对聚类结果进行了评估。欧几里得距离法优于轮廓评价和连通性两个值,说明欧几里得具有良好的数据分组结构,而高尔值优于邓恩值,说明高尔值相对于欧几里得具有良好的聚类分离性。研究表明,在数值数据集上应用k-medoids算法时,欧几里得方法优于高尔方法。
{"title":"Comparison analysis of Euclidean and Gower distance measures on k-medoids cluster","authors":"Agil Aditya, B. Sari, T. N. Padilah","doi":"10.14710/JTSISKOM.2020.13747","DOIUrl":"https://doi.org/10.14710/JTSISKOM.2020.13747","url":null,"abstract":"K-medoids is a clustering method that uses the distance method to find and classify data that have similarities and inequalities between data. This shows that the determination of the distance measurement method is important because it affects the performance of the k-medoids cluster results. From several studies, it is stated that the Euclidean and Gower methods can be used as measurement methods in clustering with numerical data. This study aims to compare the performance of the k-medoids clustering results on a numerical dataset using the Euclidean and Gower methods. The method used is the Knowledge Discovery in Database (KDD) method. In this study, seven numerical datasets were used and the evaluation of clustering results used silhouette, Dunn, and connectivity values. The Euclidean distance method is superior to the two values of silhouette evaluation and connectivity, it shows that Euclidean has a good data grouping structure, while the Gower is superior to the Dunn value, which shows that the Gower has good cluster separation compared to Euclidean. This study shows that the Euclidean method is superior to the Gower method in the application of the k-medoids algorithm with a numeric dataset.","PeriodicalId":56231,"journal":{"name":"Jurnal Teknologi dan Sistem Komputer","volume":"9 1","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2020-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48411754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Landslide monitoring system based on water adsorption rate utilizing humidity, accelerometer, and temperature sensors 基于水分吸附率的滑坡监测系统,利用湿度、加速度计和温度传感器
Pub Date : 2020-10-31 DOI: 10.14710/jtsiskom.2020.13591
F. Budiman, E. Susanto, D. Perdana, Husneni Mukhtar, Yulius Anggoro Pamungkas, Yakobus Yulyanto Kevin
This study examines the application of a landslide disaster monitoring system based on soil activity information that utilizes humidity, temperature, and accelerometer sensors. An artificial highland was built as the research object, and the landslide process was triggered by supplying the system with continuous artificial rainfall. The soil activities were observed through its slope movement, temperature, and moisture content, utilizing an accelerometer, temperature, and humidity sensors both in dry and wet conditions. The system could well observe the soil activities, and the obtained data could be accessed in real-time and online mode on a website. The time delay in sending the data to the server was 2 seconds. Moreover, the characteristics of soil porosity and its relevance to soil saturation level due to water pressure were studied as well. Kinetic study showed that the water adsorption to soil followed the intraparticle diffusion model with a coefficient of determination R 2 0.99043. The system prototype should be used to build the information center of disaster mitigation, particularly in Indonesia.
本研究考察了基于土壤活动信息的滑坡灾害监测系统的应用,该系统利用了湿度、温度和加速度传感器。以人工高地为研究对象,通过向该系统持续提供人工降雨触发滑坡过程。在干燥和潮湿的条件下,利用加速度计、温度和湿度传感器,通过其斜坡运动、温度和水分含量来观察土壤活动。该系统能很好地观测土壤活动,所获得的数据可在网站上实时在线获取。向服务器发送数据的时间延迟为2秒。此外,还研究了水压作用下土壤孔隙度特征及其与土壤饱和水平的关系。动力学研究表明,水对土壤的吸附符合颗粒内扩散模型,决定系数为R 2 0.99043。该系统原型应用于建设减灾信息中心,特别是在印度尼西亚。
{"title":"Landslide monitoring system based on water adsorption rate utilizing humidity, accelerometer, and temperature sensors","authors":"F. Budiman, E. Susanto, D. Perdana, Husneni Mukhtar, Yulius Anggoro Pamungkas, Yakobus Yulyanto Kevin","doi":"10.14710/jtsiskom.2020.13591","DOIUrl":"https://doi.org/10.14710/jtsiskom.2020.13591","url":null,"abstract":"This study examines the application of a landslide disaster monitoring system based on soil activity information that utilizes humidity, temperature, and accelerometer sensors. An artificial highland was built as the research object, and the landslide process was triggered by supplying the system with continuous artificial rainfall. The soil activities were observed through its slope movement, temperature, and moisture content, utilizing an accelerometer, temperature, and humidity sensors both in dry and wet conditions. The system could well observe the soil activities, and the obtained data could be accessed in real-time and online mode on a website. The time delay in sending the data to the server was 2 seconds. Moreover, the characteristics of soil porosity and its relevance to soil saturation level due to water pressure were studied as well. Kinetic study showed that the water adsorption to soil followed the intraparticle diffusion model with a coefficient of determination R 2 0.99043. The system prototype should be used to build the information center of disaster mitigation, particularly in Indonesia.","PeriodicalId":56231,"journal":{"name":"Jurnal Teknologi dan Sistem Komputer","volume":"8 1","pages":"255-262"},"PeriodicalIF":0.0,"publicationDate":"2020-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42472514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator 提出了一种基于Myers-Briggs型指标的血型分类数据不平衡处理方法
Pub Date : 2020-10-31 DOI: 10.14710/JTSISKOM.2020.13625
Ahmad Taufiq Akbar, Rochmat Husaini, Bagus Muhammad Akbar, S. Saifullah
Blood type still leads to an assumption about its relation to some personality aspects. This study observes preprocessing methods for improving the classification accuracy of MBTI data to determine blood type. The training and testing data use 250 data from the MBTI questionnaire answers given by 250 respondents. The classification uses the k-Nearest Neighbor (k-NN) algorithm. Without preprocessing, k-NN results in about 32 % accuracy, so it needs some preprocessing to handle data imbalance before the classification. The proposed preprocessing consists of two-stage, the first stage is the unsupervised resample, and the second is the supervised resample. For the validation, it uses ten cross-validations. The result of k-Nearest Neighbor classification after using these proposed preprocessing stages has finally increased the accuracy, F-score, and recall significantly.
血型仍然导致对其与某些个性方面的关系的假设。本研究观察了提高MBTI数据分类精度的预处理方法,以确定血型。培训和测试数据使用了250名受访者提供的MBTI问卷答案中的250个数据。该分类使用k最近邻(k-NN)算法。在没有预处理的情况下,k-NN的准确率约为32%,因此在分类之前需要进行一些预处理来处理数据的不平衡。所提出的预处理由两个阶段组成,第一阶段是无监督重采样,第二阶段是有监督重采样。对于验证,它使用了十个交叉验证。在使用这些提出的预处理阶段后,k-最近邻分类的结果最终显著提高了准确性、F-得分和召回率。
{"title":"A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator","authors":"Ahmad Taufiq Akbar, Rochmat Husaini, Bagus Muhammad Akbar, S. Saifullah","doi":"10.14710/JTSISKOM.2020.13625","DOIUrl":"https://doi.org/10.14710/JTSISKOM.2020.13625","url":null,"abstract":"Blood type still leads to an assumption about its relation to some personality aspects. This study observes preprocessing methods for improving the classification accuracy of MBTI data to determine blood type. The training and testing data use 250 data from the MBTI questionnaire answers given by 250 respondents. The classification uses the k-Nearest Neighbor (k-NN) algorithm. Without preprocessing, k-NN results in about 32 % accuracy, so it needs some preprocessing to handle data imbalance before the classification. The proposed preprocessing consists of two-stage, the first stage is the unsupervised resample, and the second is the supervised resample. For the validation, it uses ten cross-validations. The result of k-Nearest Neighbor classification after using these proposed preprocessing stages has finally increased the accuracy, F-score, and recall significantly.","PeriodicalId":56231,"journal":{"name":"Jurnal Teknologi dan Sistem Komputer","volume":"8 1","pages":"276-283"},"PeriodicalIF":0.0,"publicationDate":"2020-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48422531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Comparative analysis of classification algorithms for critical land prediction in agricultural cultivation areas 农用地临界土地预测分类算法的比较分析
Pub Date : 2020-10-31 DOI: 10.14710/jtsiskom.2020.13668
Deden Istiawan
Currently, the identification of critical land, that has been physically, chemically, and biologically damaged, uses a geographic information system. However, it requires a high cost to get the high resolution of satellite images. In this study, a comparison framework is proposed to determine the performance of the classification algorithms, namely C.45, ID3, Random Forest, k-Nearest Neighbor, and Naive Bayes. This research aims to find out the best algorithm for the classification of critical land in agricultural cultivation areas. The results show that the highest accuracy Random Forest algorithm was 93.10 % in predicting critical land, and the naive Bayes has the lowest performance, with 89.32 % of accuracy in predicting critical land.
目前,对已受到物理、化学和生物破坏的关键土地的识别使用地理信息系统。然而,要获得高分辨率的卫星图像需要很高的成本。在本研究中,提出了一个比较框架来确定分类算法的性能,即C.45, ID3,随机森林,k-近邻和朴素贝叶斯。本研究的目的是找出农业种植区关键土地分类的最佳算法。结果表明,随机森林算法预测临界土地的准确率最高,为93.10%,朴素贝叶斯算法预测临界土地的准确率最低,为89.32%。
{"title":"Comparative analysis of classification algorithms for critical land prediction in agricultural cultivation areas","authors":"Deden Istiawan","doi":"10.14710/jtsiskom.2020.13668","DOIUrl":"https://doi.org/10.14710/jtsiskom.2020.13668","url":null,"abstract":"Currently, the identification of critical land, that has been physically, chemically, and biologically damaged, uses a geographic information system. However, it requires a high cost to get the high resolution of satellite images. In this study, a comparison framework is proposed to determine the performance of the classification algorithms, namely C.45, ID3, Random Forest, k-Nearest Neighbor, and Naive Bayes. This research aims to find out the best algorithm for the classification of critical land in agricultural cultivation areas. The results show that the highest accuracy Random Forest algorithm was 93.10 % in predicting critical land, and the naive Bayes has the lowest performance, with 89.32 % of accuracy in predicting critical land.","PeriodicalId":56231,"journal":{"name":"Jurnal Teknologi dan Sistem Komputer","volume":"8 1","pages":"270-275"},"PeriodicalIF":0.0,"publicationDate":"2020-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47103051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Watermelon ripeness detector using near infrared spectroscopy 西瓜熟度检测器的近红外光谱
Pub Date : 2020-10-31 DOI: 10.14710/jtsiskom.2020.13744
Edwin R. Arboleda, Kimberly M. Parazo, C. Pareja
This study aimed to design and develop a watermelon ripeness detector using Near-Infrared Spectroscopy (NIRS). The research problem being solved in this study is developing a prototype wherein the watermelon ripeness can be detected without the need to open it. This detector will save customers from buying unripe watermelon and the farmers from harvesting an unripe watermelon. The researchers attempted to use the NIRS technique in determining the ripeness level of watermelon as it is widely used in the agricultural sector with high-speed analysis. The project was composed of Raspberry Pi Zero W as the microprocessor unit connected to input and output devices, such as the NIR spectral sensor and the OLED display. It was programmed by Python 3 IDLE. The detector scanned a total of 200 watermelon samples. These samples were grouped as 60 % for the training dataset, 20 % for testing, and another 20 % for evaluation. The data sets were collected and are subjected to the Support Vector Machine (SVM) algorithm. Overall, experimental results showed that the detector could correctly classify both unripe and ripe watermelons with 92.5 % accuracy.
本研究旨在利用近红外光谱(NIRS)技术设计和研制西瓜成熟度检测器。本研究要解决的研究问题是开发一种无需打开西瓜就能检测西瓜成熟度的样机。这种检测器可以避免消费者购买未成熟的西瓜,也可以避免农民收割未成熟的西瓜。由于近红外光谱技术在农业部门的高速分析中得到了广泛应用,因此研究人员试图利用近红外光谱技术来确定西瓜的成熟度。该项目由树莓派Zero W作为微处理器单元,连接近红外光谱传感器、OLED显示屏等输入输出设备。它是由Python 3 IDLE编程的。探测器一共扫描了200个西瓜样本。这些样本分为60%用于训练数据集,20%用于测试,另外20%用于评估。收集数据集,并进行支持向量机(SVM)算法。实验结果表明,该检测器对未熟西瓜和熟西瓜的分类准确率均为92.5%。
{"title":"Watermelon ripeness detector using near infrared spectroscopy","authors":"Edwin R. Arboleda, Kimberly M. Parazo, C. Pareja","doi":"10.14710/jtsiskom.2020.13744","DOIUrl":"https://doi.org/10.14710/jtsiskom.2020.13744","url":null,"abstract":"This study aimed to design and develop a watermelon ripeness detector using Near-Infrared Spectroscopy (NIRS). The research problem being solved in this study is developing a prototype wherein the watermelon ripeness can be detected without the need to open it. This detector will save customers from buying unripe watermelon and the farmers from harvesting an unripe watermelon. The researchers attempted to use the NIRS technique in determining the ripeness level of watermelon as it is widely used in the agricultural sector with high-speed analysis. The project was composed of Raspberry Pi Zero W as the microprocessor unit connected to input and output devices, such as the NIR spectral sensor and the OLED display. It was programmed by Python 3 IDLE. The detector scanned a total of 200 watermelon samples. These samples were grouped as 60 % for the training dataset, 20 % for testing, and another 20 % for evaluation. The data sets were collected and are subjected to the Support Vector Machine (SVM) algorithm. Overall, experimental results showed that the detector could correctly classify both unripe and ripe watermelons with 92.5 % accuracy.","PeriodicalId":56231,"journal":{"name":"Jurnal Teknologi dan Sistem Komputer","volume":"8 1","pages":"317-322"},"PeriodicalIF":0.0,"publicationDate":"2020-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48832363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Tree-based homogeneous ensemble model with feature selection for diabetic retinopathy prediction 基于树的特征选择齐次集成模型在糖尿病视网膜病变预测中的应用
Pub Date : 2020-10-31 DOI: 10.14710/jtsiskom.2020.13669
Tamunopriye Ene Dagogo-George, H. A. Mojeed, A. Balogun, M. Mabayoje, S. A. Salihu
Diabetic Retinopathy (DR) is a condition that emerges from prolonged diabetes, causing severe damages to the eyes. Early diagnosis of this disease is highly imperative as late diagnosis may be fatal. Existing studies employed machine learning approaches with Support Vector Machines (SVM) having the highest performance on most analyses and Decision Trees (DT) having the lowest. However, SVM has been known to suffer from parameter and kernel selection problems, which undermine its predictive capability. Hence, this study presents homogenous ensemble classification methods with DT as the base classifier to optimize predictive performance. Boosting and Bagging ensemble methods with feature selection were employed, and experiments were carried out using Python Scikit Learn libraries on DR datasets extracted from UCI Machine Learning repository. Experimental results showed that Bagged and Boosted DT were better than SVM. Specifically, Bagged DT performed best with accuracy 65.38 %, f-score 0.664, and AUC 0.731, followed by Boosted DT with accuracy 65.42 %, f-score 0.655, and AUC 0.724 when compared to SVM (accuracy 65.16 %, f-score 0.652, and AUC 0.721). These results indicate that DT's predictive performance can be optimized by employing the homogeneous ensemble methods to outperform SVM in predicting DR.
糖尿病视网膜病变(DR)是一种由长期糖尿病引起的疾病,会对眼睛造成严重损伤。这种疾病的早期诊断是非常必要的,因为晚期诊断可能是致命的。现有的研究采用了机器学习方法,其中支持向量机(SVM)在大多数分析中具有最高的性能,而决策树(DT)具有最低的性能。然而,众所周知,支持向量机存在参数和核选择问题,这削弱了其预测能力。因此,本研究提出了以DT为基础分类器的同质集成分类方法,以优化预测性能。采用了具有特征选择的Boosting和Bagging集成方法,并使用Python Scikit Learn库在从UCI机器学习库中提取的DR数据集上进行了实验。实验结果表明,Bagged和Boosted DT均优于SVM。具体而言,与SVM相比,Bagged DT表现最好,准确率为65.38%,f-score为0.664,AUC为0.731,其次是Boosted DT,准确率分别为65.42%、0.655和0.724(准确率为65%、0.652和0.721)。这些结果表明,通过采用齐次集成方法可以优化DT的预测性能,使其在预测DR方面优于SVM。
{"title":"Tree-based homogeneous ensemble model with feature selection for diabetic retinopathy prediction","authors":"Tamunopriye Ene Dagogo-George, H. A. Mojeed, A. Balogun, M. Mabayoje, S. A. Salihu","doi":"10.14710/jtsiskom.2020.13669","DOIUrl":"https://doi.org/10.14710/jtsiskom.2020.13669","url":null,"abstract":"Diabetic Retinopathy (DR) is a condition that emerges from prolonged diabetes, causing severe damages to the eyes. Early diagnosis of this disease is highly imperative as late diagnosis may be fatal. Existing studies employed machine learning approaches with Support Vector Machines (SVM) having the highest performance on most analyses and Decision Trees (DT) having the lowest. However, SVM has been known to suffer from parameter and kernel selection problems, which undermine its predictive capability. Hence, this study presents homogenous ensemble classification methods with DT as the base classifier to optimize predictive performance. Boosting and Bagging ensemble methods with feature selection were employed, and experiments were carried out using Python Scikit Learn libraries on DR datasets extracted from UCI Machine Learning repository. Experimental results showed that Bagged and Boosted DT were better than SVM. Specifically, Bagged DT performed best with accuracy 65.38 %, f-score 0.664, and AUC 0.731, followed by Boosted DT with accuracy 65.42 %, f-score 0.655, and AUC 0.724 when compared to SVM (accuracy 65.16 %, f-score 0.652, and AUC 0.721). These results indicate that DT's predictive performance can be optimized by employing the homogeneous ensemble methods to outperform SVM in predicting DR.","PeriodicalId":56231,"journal":{"name":"Jurnal Teknologi dan Sistem Komputer","volume":"8 1","pages":"297-303"},"PeriodicalIF":0.0,"publicationDate":"2020-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46421531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Preprocessing kNN algorithm classification using K-means and distance matrix with students’ academic performance dataset 利用K-means和距离矩阵对学生学习成绩数据集进行kNN算法分类预处理
Pub Date : 2020-10-21 DOI: 10.14710/jtsiskom.2020.13874
Sugriyono Sugriyono, M. U. Siregar
The existence of outliers in the dataset can cause low accuracy in a classification process. Outliers in the dataset can be removed from a preprocessing stage of classification algorithms. Clustering can be used as an outlier detection method. This study applies K-means and a distance matrix to detect outliers and remove them from datasets with class labels. This research used a dataset of students’ academic performance totaling 6847 instances, having 18 attributes and 3 class labels. Preprocessing applies the K-means method to get centroid in each class. The distance matrix is used to evaluate the distance of instance to the centroid. Outliers, which are a different class, will be removed from the dataset. This preprocessing improves the classification accuracy of the kNN algorithm. Data without preprocessing has 72.28 % accuracy, preprocessed data using K-means with Euclidean has 98.42 % accuracy (an increase of 26.14 %), while the K-means with Manhattan has 97.76 % accuracy (an increase of 25.48 %).
数据集中异常值的存在会导致分类过程的准确率较低。数据集中的异常值可以从分类算法的预处理阶段去除。聚类可以作为一种异常点检测方法。本研究应用K-means和距离矩阵来检测异常值,并将其从带有类标签的数据集中移除。本研究使用的学生学习成绩数据集共有6847个实例,有18个属性和3个类标签。预处理采用K-means方法得到每一类的质心。距离矩阵用于计算实例到质心的距离。异常值是一个不同的类别,将从数据集中删除。这种预处理提高了kNN算法的分类精度。未经预处理的数据准确率为72.28%,使用欧氏K-means预处理的数据准确率为98.42%(提高26.14%),而使用曼哈顿K-means的数据准确率为97.76%(提高25.48%)。
{"title":"Preprocessing kNN algorithm classification using K-means and distance matrix with students’ academic performance dataset","authors":"Sugriyono Sugriyono, M. U. Siregar","doi":"10.14710/jtsiskom.2020.13874","DOIUrl":"https://doi.org/10.14710/jtsiskom.2020.13874","url":null,"abstract":"The existence of outliers in the dataset can cause low accuracy in a classification process. Outliers in the dataset can be removed from a preprocessing stage of classification algorithms. Clustering can be used as an outlier detection method. This study applies K-means and a distance matrix to detect outliers and remove them from datasets with class labels. This research used a dataset of students’ academic performance totaling 6847 instances, having 18 attributes and 3 class labels. Preprocessing applies the K-means method to get centroid in each class. The distance matrix is used to evaluate the distance of instance to the centroid. Outliers, which are a different class, will be removed from the dataset. This preprocessing improves the classification accuracy of the kNN algorithm. Data without preprocessing has 72.28 % accuracy, preprocessed data using K-means with Euclidean has 98.42 % accuracy (an increase of 26.14 %), while the K-means with Manhattan has 97.76 % accuracy (an increase of 25.48 %).","PeriodicalId":56231,"journal":{"name":"Jurnal Teknologi dan Sistem Komputer","volume":"465 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67018442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Sand temperature and moisture monitoring system for turtle nests using Arduino Uno 基于Arduino Uno的海龟巢沙温湿度监测系统
Pub Date : 2020-10-13 DOI: 10.14710/jtsiskom.2020.13725
Hendi Santoso, T. Hestirianoto, I. Jaya
This study aims to develop a turtle nests real-time monitoring system using the Arduino Uno to measure the temperature and moisture of sand used conveniently for certain applications. Sand temperature measurement uses a DS18B20 waterproof sensor, sand moisture uses SKU:SEN0193, and air temperature and humidity using DHT22. The micro SD card module is used to store data from sensor calculations in real-time and continuously. The measuring instrument was designed to be portable and easy to use. The material used is polypropylene that has dimensions of 11x6x18 cm3. Using the regression linear analysis, there was no significant difference in temperature measurements using the DS18B20 sensor and analog thermometer and sand humidity using an SKU:SEN0193 sensor and analog humidity measuring instrument.
本研究旨在利用Arduino Uno开发一个海龟巢实时监测系统,以测量某些应用中方便使用的沙子的温度和湿度。测砂温度采用DS18B20防水传感器,测砂湿度采用SKU:SEN0193,测空气温湿度采用DHT22。微型SD卡模块用于实时连续存储传感器计算数据。该测量仪器的设计便于携带和使用。使用的材料是聚丙烯,尺寸为11x6x18cm3。通过回归线性分析,使用DS18B20传感器和模拟温度计测量的温度和使用SKU:SEN0193传感器和模拟湿度测量仪测量的沙子湿度没有显著差异。
{"title":"Sand temperature and moisture monitoring system for turtle nests using Arduino Uno","authors":"Hendi Santoso, T. Hestirianoto, I. Jaya","doi":"10.14710/jtsiskom.2020.13725","DOIUrl":"https://doi.org/10.14710/jtsiskom.2020.13725","url":null,"abstract":"This study aims to develop a turtle nests real-time monitoring system using the Arduino Uno to measure the temperature and moisture of sand used conveniently for certain applications. Sand temperature measurement uses a DS18B20 waterproof sensor, sand moisture uses SKU:SEN0193, and air temperature and humidity using DHT22. The micro SD card module is used to store data from sensor calculations in real-time and continuously. The measuring instrument was designed to be portable and easy to use. The material used is polypropylene that has dimensions of 11x6x18 cm3. Using the regression linear analysis, there was no significant difference in temperature measurements using the DS18B20 sensor and analog thermometer and sand humidity using an SKU:SEN0193 sensor and analog humidity measuring instrument.","PeriodicalId":56231,"journal":{"name":"Jurnal Teknologi dan Sistem Komputer","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45558041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Jurnal Teknologi dan Sistem Komputer
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1