基因表达数据模糊分类与分析的集成框架

M. Khabbaz, K. Kianmehr, Mohammed Al-Shalalfa, R. Alhajj
{"title":"基因表达数据模糊分类与分析的集成框架","authors":"M. Khabbaz, K. Kianmehr, Mohammed Al-Shalalfa, R. Alhajj","doi":"10.4018/978-1-60566-717-1.CH009","DOIUrl":null,"url":null,"abstract":"This chapter takes advantage of using fuzzy classifier rules to capture the correlations between genes. The main motivation to conduct this study is that a fuzzy classifier rule is essentially an “if-then” rule that contains linguistic terms to represent the feature values. This representation of a rule that demonstrates the correlations among the genes is very simple to understand and interpret for domain experts. In this proposed gene selection procedure, instead of measuring the effectiveness of every single gene for building the classifier model, the authors incorporate the impotence of a gene correlation with other existing genes in the process of gene selection. That is, a gene is rejected if it is not in a significant correlation with other genes in the dataset. Furthermore, in order to improve the reliability of this approach, the process is repeated several times in these experiments, and the genes reported as the result are the genes selected in most experiments. This chapter reports test results on five datasets and analyzes the achieved results from biological perspective. DOI: 10.4018/978-1-60566-717-1.ch009","PeriodicalId":399104,"journal":{"name":"Strategic Advancements in Utilizing Data Mining and Warehousing Technologies","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"An Integrated Framework for Fuzzy Classification and Analysis of Gene Expression Data\",\"authors\":\"M. Khabbaz, K. Kianmehr, Mohammed Al-Shalalfa, R. Alhajj\",\"doi\":\"10.4018/978-1-60566-717-1.CH009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This chapter takes advantage of using fuzzy classifier rules to capture the correlations between genes. The main motivation to conduct this study is that a fuzzy classifier rule is essentially an “if-then” rule that contains linguistic terms to represent the feature values. This representation of a rule that demonstrates the correlations among the genes is very simple to understand and interpret for domain experts. In this proposed gene selection procedure, instead of measuring the effectiveness of every single gene for building the classifier model, the authors incorporate the impotence of a gene correlation with other existing genes in the process of gene selection. That is, a gene is rejected if it is not in a significant correlation with other genes in the dataset. Furthermore, in order to improve the reliability of this approach, the process is repeated several times in these experiments, and the genes reported as the result are the genes selected in most experiments. This chapter reports test results on five datasets and analyzes the achieved results from biological perspective. DOI: 10.4018/978-1-60566-717-1.ch009\",\"PeriodicalId\":399104,\"journal\":{\"name\":\"Strategic Advancements in Utilizing Data Mining and Warehousing Technologies\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Strategic Advancements in Utilizing Data Mining and Warehousing Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/978-1-60566-717-1.CH009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Strategic Advancements in Utilizing Data Mining and Warehousing Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/978-1-60566-717-1.CH009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

本章利用模糊分类器规则来捕获基因之间的相关性。进行这项研究的主要动机是模糊分类器规则本质上是一个“if-then”规则,它包含语言术语来表示特征值。对于领域专家来说,这种表示基因之间相关性的规则非常容易理解和解释。在这个基因选择过程中,作者在基因选择过程中考虑了基因与其他现有基因相关的无能性,而不是衡量每个基因的有效性来构建分类器模型。也就是说,如果一个基因与数据集中的其他基因没有显著的相关性,它就会被拒绝。此外,为了提高该方法的可靠性,该过程在这些实验中重复多次,并且作为结果报告的基因是大多数实验中选择的基因。本章报告了五个数据集的测试结果,并从生物学角度分析了所获得的结果。DOI: 10.4018 / 978 - 1 - 60566 - 717 - 1. - ch009
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
An Integrated Framework for Fuzzy Classification and Analysis of Gene Expression Data
This chapter takes advantage of using fuzzy classifier rules to capture the correlations between genes. The main motivation to conduct this study is that a fuzzy classifier rule is essentially an “if-then” rule that contains linguistic terms to represent the feature values. This representation of a rule that demonstrates the correlations among the genes is very simple to understand and interpret for domain experts. In this proposed gene selection procedure, instead of measuring the effectiveness of every single gene for building the classifier model, the authors incorporate the impotence of a gene correlation with other existing genes in the process of gene selection. That is, a gene is rejected if it is not in a significant correlation with other genes in the dataset. Furthermore, in order to improve the reliability of this approach, the process is repeated several times in these experiments, and the genes reported as the result are the genes selected in most experiments. This chapter reports test results on five datasets and analyzes the achieved results from biological perspective. DOI: 10.4018/978-1-60566-717-1.ch009
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Power of Sampling and Stacking for the PAKDD-2007 Cross-Selling Problem Bagging Probit Models for Unbalanced Classification Seismological Data Warehousing and Mining Selecting Salient Features and Samples Simultaneously to Enhance Cross-Selling Model Performance An Integrated Framework for Fuzzy Classification and Analysis of Gene Expression Data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1