用图优化方法求解聚类问题

I. Konnov, O. Kashina, E. I. Gilmanova
{"title":"用图优化方法求解聚类问题","authors":"I. Konnov, O. Kashina, E. I. Gilmanova","doi":"10.26907/2541-7746.2019.3.423-437","DOIUrl":null,"url":null,"abstract":"The rapid growth in the volume of processed information that takes place nowadays deter-mines the urgency of the development of methods for reducing the dimension of computational problems. One of the approaches to reducing the dimensionality of data is their clustering, i.e., uniting into maximally homogeneous groups. At the same time, it is desirable that rep-resentatives of different clusters should be as much as possible unlike each other. Along with the dimension reduction, clustering procedures have an independent value. For example, we know the market segmentation problem in economics, the feature typologization problem in sociology, faces diagnostics in geology, etc. Despite the large number of known clusterization methods, the development and study of new ones remain relevant. The reason is that there is no algorithm that would surpass all the rest by all criteria (speed, insensitivity to clusters’ size and shape, number of input parameters, etc.). In this paper, we propose a clustering algorithm based on the notions of the graph theory (namely, the maximum flow (the minimum cut) theorem) and compare the results obtained by it and by four other algorithms that belong to various classes of clusterization techniques.","PeriodicalId":41863,"journal":{"name":"Uchenye Zapiski Kazanskogo Universiteta-Seriya Fiziko-Matematicheskie Nauki","volume":"3 1","pages":""},"PeriodicalIF":0.1000,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Solution of Clusterization Problem by Graph Optimization Methods\",\"authors\":\"I. Konnov, O. Kashina, E. I. Gilmanova\",\"doi\":\"10.26907/2541-7746.2019.3.423-437\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The rapid growth in the volume of processed information that takes place nowadays deter-mines the urgency of the development of methods for reducing the dimension of computational problems. One of the approaches to reducing the dimensionality of data is their clustering, i.e., uniting into maximally homogeneous groups. At the same time, it is desirable that rep-resentatives of different clusters should be as much as possible unlike each other. Along with the dimension reduction, clustering procedures have an independent value. For example, we know the market segmentation problem in economics, the feature typologization problem in sociology, faces diagnostics in geology, etc. Despite the large number of known clusterization methods, the development and study of new ones remain relevant. The reason is that there is no algorithm that would surpass all the rest by all criteria (speed, insensitivity to clusters’ size and shape, number of input parameters, etc.). In this paper, we propose a clustering algorithm based on the notions of the graph theory (namely, the maximum flow (the minimum cut) theorem) and compare the results obtained by it and by four other algorithms that belong to various classes of clusterization techniques.\",\"PeriodicalId\":41863,\"journal\":{\"name\":\"Uchenye Zapiski Kazanskogo Universiteta-Seriya Fiziko-Matematicheskie Nauki\",\"volume\":\"3 1\",\"pages\":\"\"},\"PeriodicalIF\":0.1000,\"publicationDate\":\"2019-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Uchenye Zapiski Kazanskogo Universiteta-Seriya Fiziko-Matematicheskie Nauki\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.26907/2541-7746.2019.3.423-437\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Uchenye Zapiski Kazanskogo Universiteta-Seriya Fiziko-Matematicheskie Nauki","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26907/2541-7746.2019.3.423-437","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 1

摘要

如今处理过的信息量迅速增长,这决定了开发降低计算问题维数的方法的紧迫性。降低数据维数的方法之一是它们的聚类,即最大限度地统一为同质组。与此同时,不同集群的代表应该尽可能地彼此不同。随着维数的降维,聚类过程有了一个独立的值。例如,我们知道经济学中的市场分割问题,社会学中的特征类型学问题,地质学中的面孔诊断问题等等。尽管已知的聚类方法数量众多,但新方法的开发和研究仍然具有重要意义。原因在于,没有一种算法能在所有标准(速度、对聚类大小和形状的不敏感性、输入参数的数量等)上都超越其他所有算法。在本文中,我们提出了一种基于图论概念(即最大流量(最小切割)定理)的聚类算法,并将其与属于不同类聚类技术的其他四种算法得到的结果进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Solution of Clusterization Problem by Graph Optimization Methods
The rapid growth in the volume of processed information that takes place nowadays deter-mines the urgency of the development of methods for reducing the dimension of computational problems. One of the approaches to reducing the dimensionality of data is their clustering, i.e., uniting into maximally homogeneous groups. At the same time, it is desirable that rep-resentatives of different clusters should be as much as possible unlike each other. Along with the dimension reduction, clustering procedures have an independent value. For example, we know the market segmentation problem in economics, the feature typologization problem in sociology, faces diagnostics in geology, etc. Despite the large number of known clusterization methods, the development and study of new ones remain relevant. The reason is that there is no algorithm that would surpass all the rest by all criteria (speed, insensitivity to clusters’ size and shape, number of input parameters, etc.). In this paper, we propose a clustering algorithm based on the notions of the graph theory (namely, the maximum flow (the minimum cut) theorem) and compare the results obtained by it and by four other algorithms that belong to various classes of clusterization techniques.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
0.60
自引率
0.00%
发文量
0
审稿时长
17 weeks
期刊最新文献
The Impact of Coagulation and Division of Drops on the Parameters of the Gas-Drop Turbulent Jet Determination of Hydrodynamic Forces Acting on the Body in an Unsteady Viscous Flow Using Characteristics of the Flow on the Control Surface Modeling of Fluid Inflow towards Multistage Hydraulic Fractures of Infinite Permeability Using Stream Tubes About the Causes of the Bearing Capacity Loss of a Composite Beam under Three-Point Bending Modeling of Fiberglass Degradation Process under Stresses and Alkaline Environment
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1