J. Anzola, Luz Andrea Rodríguez Rojas, G. T. Bermúdez
{"title":"基于PCA和k-means的IEEE explore数字图书馆数据挖掘","authors":"J. Anzola, Luz Andrea Rodríguez Rojas, G. T. Bermúdez","doi":"10.1145/2925995.2926007","DOIUrl":null,"url":null,"abstract":"An important feature in data analysis is the exploration and data representation. This article describes the Principal Components Analysis techniques (PCA) and clusters analysis with k-means, in order to represent a set of two-dimensional spatial data and group similar data to find relationships between the two techniques. Data is extracted from IEEE Xplore digital library, which lacks processing tools and information display since it doesn't permit analysis and identification of trends and patterns in a query. At the end of the article, is discussed as a technique of data analysis unsupervised allows grouping and organizing of data by proximity based on the variance, finding similar keywords between groups and major components, allowing temporary and evolutionary view of a set of keywords, which can later be interpreted as topics and areas of exploration and research.","PeriodicalId":159180,"journal":{"name":"Proceedings of the The 11th International Knowledge Management in Organizations Conference on The changing face of Knowledge Management Impacting Society","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Exploring data by PCA and k-means for IEEE Xplore digital library\",\"authors\":\"J. Anzola, Luz Andrea Rodríguez Rojas, G. T. Bermúdez\",\"doi\":\"10.1145/2925995.2926007\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An important feature in data analysis is the exploration and data representation. This article describes the Principal Components Analysis techniques (PCA) and clusters analysis with k-means, in order to represent a set of two-dimensional spatial data and group similar data to find relationships between the two techniques. Data is extracted from IEEE Xplore digital library, which lacks processing tools and information display since it doesn't permit analysis and identification of trends and patterns in a query. At the end of the article, is discussed as a technique of data analysis unsupervised allows grouping and organizing of data by proximity based on the variance, finding similar keywords between groups and major components, allowing temporary and evolutionary view of a set of keywords, which can later be interpreted as topics and areas of exploration and research.\",\"PeriodicalId\":159180,\"journal\":{\"name\":\"Proceedings of the The 11th International Knowledge Management in Organizations Conference on The changing face of Knowledge Management Impacting Society\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the The 11th International Knowledge Management in Organizations Conference on The changing face of Knowledge Management Impacting Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2925995.2926007\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the The 11th International Knowledge Management in Organizations Conference on The changing face of Knowledge Management Impacting Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2925995.2926007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploring data by PCA and k-means for IEEE Xplore digital library
An important feature in data analysis is the exploration and data representation. This article describes the Principal Components Analysis techniques (PCA) and clusters analysis with k-means, in order to represent a set of two-dimensional spatial data and group similar data to find relationships between the two techniques. Data is extracted from IEEE Xplore digital library, which lacks processing tools and information display since it doesn't permit analysis and identification of trends and patterns in a query. At the end of the article, is discussed as a technique of data analysis unsupervised allows grouping and organizing of data by proximity based on the variance, finding similar keywords between groups and major components, allowing temporary and evolutionary view of a set of keywords, which can later be interpreted as topics and areas of exploration and research.