{"title":"基于特征选择的大型数据库数据处理","authors":"Nikat Parveen, A. M","doi":"10.1109/ICCCT2.2017.7972294","DOIUrl":null,"url":null,"abstract":"Big data is a term for huge amount of data sets that are becoming more complex for data processing applications and are inadequate to deal with them. Big data need a set of techniques and technologies which can easily handle the complex data set and can be easily processed. Feature selection techniques have become an apparent need in many applications to identify the required information from a large set of data. Existing systems processes the same data repeatedly each time when a user request is submitted even for a small task. In this work, a feature selection technique using spark streaming is proposed which can get live stream of input data and process it in batches to extract the required feature from the given input stream data. The proposed method will use the feature selection technique to extract the required data from the large dataset based on the user requirements. The algorithm proposed will also help to increase the throughput of the system.","PeriodicalId":445567,"journal":{"name":"2017 2nd International Conference on Computing and Communications Technologies (ICCCT)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Data processing for large database using feature selection\",\"authors\":\"Nikat Parveen, A. M\",\"doi\":\"10.1109/ICCCT2.2017.7972294\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Big data is a term for huge amount of data sets that are becoming more complex for data processing applications and are inadequate to deal with them. Big data need a set of techniques and technologies which can easily handle the complex data set and can be easily processed. Feature selection techniques have become an apparent need in many applications to identify the required information from a large set of data. Existing systems processes the same data repeatedly each time when a user request is submitted even for a small task. In this work, a feature selection technique using spark streaming is proposed which can get live stream of input data and process it in batches to extract the required feature from the given input stream data. The proposed method will use the feature selection technique to extract the required data from the large dataset based on the user requirements. The algorithm proposed will also help to increase the throughput of the system.\",\"PeriodicalId\":445567,\"journal\":{\"name\":\"2017 2nd International Conference on Computing and Communications Technologies (ICCCT)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 2nd International Conference on Computing and Communications Technologies (ICCCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCT2.2017.7972294\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 2nd International Conference on Computing and Communications Technologies (ICCCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCT2.2017.7972294","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data processing for large database using feature selection
Big data is a term for huge amount of data sets that are becoming more complex for data processing applications and are inadequate to deal with them. Big data need a set of techniques and technologies which can easily handle the complex data set and can be easily processed. Feature selection techniques have become an apparent need in many applications to identify the required information from a large set of data. Existing systems processes the same data repeatedly each time when a user request is submitted even for a small task. In this work, a feature selection technique using spark streaming is proposed which can get live stream of input data and process it in batches to extract the required feature from the given input stream data. The proposed method will use the feature selection technique to extract the required data from the large dataset based on the user requirements. The algorithm proposed will also help to increase the throughput of the system.