BCBimax Biclustering Algorithm with Mixed-Type Data

Hanifa Izzati, Indahwati Indahwati, Anik Djuraidah
{"title":"BCBimax Biclustering Algorithm with Mixed-Type Data","authors":"Hanifa Izzati, Indahwati Indahwati, Anik Djuraidah","doi":"10.30595/juita.v12i1.21519","DOIUrl":null,"url":null,"abstract":"The application of biclustering analysis to mixed data is still relatively new. Initially, biclustering analysis was primarily used on gene expression data that has an interval scale. In this research, we will transform ordinal categorical variables into interval scales using the Method of Successive Interval (MSI). The BCBimax algorithm will be applied in this study with several binarization experiments that produce the smallest Mean Square Residual (MSR) at the predetermined column and row thresholds. Next, a row and column threshold test will be carried out to find the optimal bicluster threshold. The existence of different interests in the variables for international market potential and the number of Indonesian export destination countries is the reason for the need for identification regarding the mapping of destination countries based on international trade potential. The study's results with the median threshold of all data found that the optimal MSR is at the threshold of row 7 and column 2. The number of biclusters formed is 9 which covers 74.7% of countries. Most countries in the bicluster come from the European Continent and a few countries from the African Continent are included in the bicluster.","PeriodicalId":151254,"journal":{"name":"JUITA : Jurnal Informatika","volume":"98 29","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JUITA : Jurnal Informatika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30595/juita.v12i1.21519","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The application of biclustering analysis to mixed data is still relatively new. Initially, biclustering analysis was primarily used on gene expression data that has an interval scale. In this research, we will transform ordinal categorical variables into interval scales using the Method of Successive Interval (MSI). The BCBimax algorithm will be applied in this study with several binarization experiments that produce the smallest Mean Square Residual (MSR) at the predetermined column and row thresholds. Next, a row and column threshold test will be carried out to find the optimal bicluster threshold. The existence of different interests in the variables for international market potential and the number of Indonesian export destination countries is the reason for the need for identification regarding the mapping of destination countries based on international trade potential. The study's results with the median threshold of all data found that the optimal MSR is at the threshold of row 7 and column 2. The number of biclusters formed is 9 which covers 74.7% of countries. Most countries in the bicluster come from the European Continent and a few countries from the African Continent are included in the bicluster.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
混合类型数据的 BCBimax 双聚类算法
双聚类分析在混合数据中的应用还相对较新。最初,双聚类分析主要用于区间尺度的基因表达数据。在本研究中,我们将使用连续区间法(MSI)将序数分类变量转换为区间尺度。本研究将采用 BCBimax 算法,进行多次二值化实验,在预定的列和行阈值下产生最小的均方残差(MSR)。接下来,将进行行和列阈值测试,以找到最佳双簇阈值。国际市场潜力变量和印尼出口目的国数量存在不同的利益,因此需要根据国际贸易潜力对目的国的映射进行识别。使用所有数据的中值阈值进行研究的结果发现,最佳 MSR 位于第 7 行第 2 列的阈值处。形成的双集群数量为 9 个,覆盖了 74.7% 的国家。双集群中的大多数国家来自欧洲大陆,少数几个国家来自非洲大陆。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Enhancing Information Technology Adoption Potential in MSMEs: a Conceptual Model Based on TOE Framework Improving Stroke Detection with Hybrid Sampling and Cascade Generalization Comparative Study of Predictive Classification Models on Data with Severely Imbalanced Predictors Image Classification of Room Tidiness Using VGGNet with Data Augmentation Number of Cyber Attacks Predicted With Deep Learning Based LSTM Model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1