结合多重聚类和网络分析发现基因表达数据

Sleiman Alhajj, A. Alhajj, S. Özyer
{"title":"结合多重聚类和网络分析发现基因表达数据","authors":"Sleiman Alhajj, A. Alhajj, S. Özyer","doi":"10.1145/3487351.3490961","DOIUrl":null,"url":null,"abstract":"Clustering is a challenging research task which could benefit a wide range of practical applications, including bioinformatics. It targets success by optimizing a number of objectives, a characteristic mostly ignored by clustering approaches. This paper describes a synthetic clustering algorithm which first applies multi-objective based approach to produce the alternative clustering solutions. Then the best clusters from each solution are selected and combined into a seed for a compact and effective solution which is expected to be better than all the individual solutions because it combines the best of each. This way, the developed algorithm may be classified as a fuzzy clustering approach because each object may belong to more than one cluster in the synthesized solution with a degree of membership in each cluster. Another interesting aspect of the algorithm is that it identifies the outliers. Further, a network is built from the relationships of the objects within the various clusters. The network is analyzed to reveal interesting discoveries not clearly reflected in the clustering outcome. The validity and applicability of the presented methodology has been assessed using synthetic and real data from the cancer.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Combining multiple clustering and network analysis for discoveries in gene expression data\",\"authors\":\"Sleiman Alhajj, A. Alhajj, S. Özyer\",\"doi\":\"10.1145/3487351.3490961\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering is a challenging research task which could benefit a wide range of practical applications, including bioinformatics. It targets success by optimizing a number of objectives, a characteristic mostly ignored by clustering approaches. This paper describes a synthetic clustering algorithm which first applies multi-objective based approach to produce the alternative clustering solutions. Then the best clusters from each solution are selected and combined into a seed for a compact and effective solution which is expected to be better than all the individual solutions because it combines the best of each. This way, the developed algorithm may be classified as a fuzzy clustering approach because each object may belong to more than one cluster in the synthesized solution with a degree of membership in each cluster. Another interesting aspect of the algorithm is that it identifies the outliers. Further, a network is built from the relationships of the objects within the various clusters. The network is analyzed to reveal interesting discoveries not clearly reflected in the clustering outcome. The validity and applicability of the presented methodology has been assessed using synthetic and real data from the cancer.\",\"PeriodicalId\":320904,\"journal\":{\"name\":\"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining\",\"volume\":\"107 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3487351.3490961\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3487351.3490961","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

聚类是一项具有挑战性的研究任务,它可以有益于广泛的实际应用,包括生物信息学。它通过优化许多目标来实现成功,这是聚类方法通常忽略的一个特征。本文介绍了一种综合聚类算法,该算法首先采用基于多目标的方法产生备选聚类解。然后从每个解决方案中选择最好的集群并组合成一个种子,形成一个紧凑而有效的解决方案,该解决方案由于结合了每个解决方案的最佳方案而被期望优于所有单个解决方案。这样,所开发的算法可以归类为模糊聚类方法,因为每个对象在每个聚类中都具有一定的隶属度,因此在综合解中可能属于多个聚类。该算法的另一个有趣的方面是它可以识别异常值。此外,网络是根据各种集群内对象的关系构建的。对网络进行分析,以揭示在聚类结果中未明确反映的有趣发现。所提出的方法的有效性和适用性已经用合成的和真实的癌症数据进行了评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Combining multiple clustering and network analysis for discoveries in gene expression data
Clustering is a challenging research task which could benefit a wide range of practical applications, including bioinformatics. It targets success by optimizing a number of objectives, a characteristic mostly ignored by clustering approaches. This paper describes a synthetic clustering algorithm which first applies multi-objective based approach to produce the alternative clustering solutions. Then the best clusters from each solution are selected and combined into a seed for a compact and effective solution which is expected to be better than all the individual solutions because it combines the best of each. This way, the developed algorithm may be classified as a fuzzy clustering approach because each object may belong to more than one cluster in the synthesized solution with a degree of membership in each cluster. Another interesting aspect of the algorithm is that it identifies the outliers. Further, a network is built from the relationships of the objects within the various clusters. The network is analyzed to reveal interesting discoveries not clearly reflected in the clustering outcome. The validity and applicability of the presented methodology has been assessed using synthetic and real data from the cancer.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Predicting COVID-19 with AI techniques: current research and future directions Predictions of drug metabolism pathways through CYP 3A4 enzyme by analysing drug-target interactions network graph An insight into network structure measures and number of driver nodes Temporal dynamics of posts and user engagement of influencers on Facebook and Instagram Vibe check: social resonance learning for enhanced recommendation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1