Shijia Hao, Hui Li, Xiaoping Zhang, Mei Chen, Ming-yi Zhu
{"title":"Optimizing Correlation Measure Based Exploratory Analysis","authors":"Shijia Hao, Hui Li, Xiaoping Zhang, Mei Chen, Ming-yi Zhu","doi":"10.1109/ITME.2016.0149","DOIUrl":null,"url":null,"abstract":"Exploratory data analysis refers to the existing data to explore under the assumption of less as far as possible, through drawing, tabulation, calculation methods to explore characteristics of data structure and regularity of a kind of analysis method. However, exploratory data by calculation method is a very general method to find the key of data. In this paper, we introduce a correlation measure for exploratory analysis based on maximal information coefficient. First, we briefly introduce the traditional data analysis methods and features, expound the necessity of exploring analysis and content. Then, correlation measurement which used commonly are expounded, summarized their characteristics and models. Therefore, we propose a weighted measure based on Maximal Information Coefficient to improve effectiveness of exploratory analysis. Then we get eigenvalues of the Maximal Information Coefficient and Pearson correlation coefficient in linear and nonlinear function of plus noise. Finally, explore analysis display visualization of the results by test dataset, emphasis direction of further research.","PeriodicalId":184905,"journal":{"name":"2016 8th International Conference on Information Technology in Medicine and Education (ITME)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 8th International Conference on Information Technology in Medicine and Education (ITME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITME.2016.0149","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Exploratory data analysis refers to the existing data to explore under the assumption of less as far as possible, through drawing, tabulation, calculation methods to explore characteristics of data structure and regularity of a kind of analysis method. However, exploratory data by calculation method is a very general method to find the key of data. In this paper, we introduce a correlation measure for exploratory analysis based on maximal information coefficient. First, we briefly introduce the traditional data analysis methods and features, expound the necessity of exploring analysis and content. Then, correlation measurement which used commonly are expounded, summarized their characteristics and models. Therefore, we propose a weighted measure based on Maximal Information Coefficient to improve effectiveness of exploratory analysis. Then we get eigenvalues of the Maximal Information Coefficient and Pearson correlation coefficient in linear and nonlinear function of plus noise. Finally, explore analysis display visualization of the results by test dataset, emphasis direction of further research.