{"title":"Evaluation of Bicluster Analysis Results in Capture Fisheries Using the BCBimax Algorithm","authors":"Cynthia Wulandari, I. Sumertajaya, M. Aidi","doi":"10.30595/juita.v11i1.15457","DOIUrl":null,"url":null,"abstract":"Biclustering is a simultaneous clustering technique by finding sub-matrixes that have the same similarity between rows and columns. One of the biclustering algorithms that is relatively fast and can be used as a reference for the comparison of several algorithms is the BCBimax algorithm. The BCBimax algorithm works by finding a sub-matrix containing element 1 of the formed binary data matrix. The selection of thresholds in the binarization process and the minimum combination of rows and columns are essential in finding the optimal bicluster. Capture fisheries have an important role in supporting sustainable growth in Indonesia, so information on the potential of fish species that have similarities in several provinces is needed in optimally mapping the potential. The BCBimax algorithm found 11 optimal biclusters in grouping capture fisheries data. The median of each variable is used as a threshold in the binarization process, and the minimum combination of row 2 and maximum column 2 is chosen to find the optimal bicluster result. The optimal average value of Mean Square Residual bicluster obtained is 0.405403 with the similarity of bicluster results (Liu and Wang index) which is different for each bicluster combination produced. All the bicluster results grouped the provinces and types of fish that had the same potential simultaneously.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JUITA : Jurnal Informatika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30595/juita.v11i1.15457","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Biclustering is a simultaneous clustering technique by finding sub-matrixes that have the same similarity between rows and columns. One of the biclustering algorithms that is relatively fast and can be used as a reference for the comparison of several algorithms is the BCBimax algorithm. The BCBimax algorithm works by finding a sub-matrix containing element 1 of the formed binary data matrix. The selection of thresholds in the binarization process and the minimum combination of rows and columns are essential in finding the optimal bicluster. Capture fisheries have an important role in supporting sustainable growth in Indonesia, so information on the potential of fish species that have similarities in several provinces is needed in optimally mapping the potential. The BCBimax algorithm found 11 optimal biclusters in grouping capture fisheries data. The median of each variable is used as a threshold in the binarization process, and the minimum combination of row 2 and maximum column 2 is chosen to find the optimal bicluster result. The optimal average value of Mean Square Residual bicluster obtained is 0.405403 with the similarity of bicluster results (Liu and Wang index) which is different for each bicluster combination produced. All the bicluster results grouped the provinces and types of fish that had the same potential simultaneously.
双聚类是一种同时聚类技术,通过寻找行和列之间具有相同相似性的子矩阵。BCBimax算法是比较快的一种双聚类算法,可以作为几种算法比较的参考。BCBimax算法的工作原理是找到包含所形成的二进制数据矩阵的元素1的子矩阵。二值化过程中阈值的选择以及行和列的最小组合对于找到最佳双聚类至关重要。捕捞渔业在支持印度尼西亚的可持续增长方面发挥着重要作用,因此需要关于几个省份具有相似性的鱼类品种潜力的信息,以最佳方式绘制潜力图。BCBimax算法在捕捞渔业数据分组中找到了11个最优双聚类。在二值化过程中,使用每个变量的中位数作为阈值,选择第2行和第2列的最小组合来找到最优的双聚类结果。得到的均方残差双聚类的最优平均值为0.405403,双聚类结果的相似度(Liu and Wang指数)对产生的每个双聚类组合有所不同。所有双聚类结果将同时具有相同潜力的省份和鱼类类型分组。