基于特征选择的大型数据库数据处理

Nikat Parveen, A. M
{"title":"基于特征选择的大型数据库数据处理","authors":"Nikat Parveen, A. M","doi":"10.1109/ICCCT2.2017.7972294","DOIUrl":null,"url":null,"abstract":"Big data is a term for huge amount of data sets that are becoming more complex for data processing applications and are inadequate to deal with them. Big data need a set of techniques and technologies which can easily handle the complex data set and can be easily processed. Feature selection techniques have become an apparent need in many applications to identify the required information from a large set of data. Existing systems processes the same data repeatedly each time when a user request is submitted even for a small task. In this work, a feature selection technique using spark streaming is proposed which can get live stream of input data and process it in batches to extract the required feature from the given input stream data. The proposed method will use the feature selection technique to extract the required data from the large dataset based on the user requirements. The algorithm proposed will also help to increase the throughput of the system.","PeriodicalId":445567,"journal":{"name":"2017 2nd International Conference on Computing and Communications Technologies (ICCCT)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Data processing for large database using feature selection\",\"authors\":\"Nikat Parveen, A. M\",\"doi\":\"10.1109/ICCCT2.2017.7972294\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Big data is a term for huge amount of data sets that are becoming more complex for data processing applications and are inadequate to deal with them. Big data need a set of techniques and technologies which can easily handle the complex data set and can be easily processed. Feature selection techniques have become an apparent need in many applications to identify the required information from a large set of data. Existing systems processes the same data repeatedly each time when a user request is submitted even for a small task. In this work, a feature selection technique using spark streaming is proposed which can get live stream of input data and process it in batches to extract the required feature from the given input stream data. The proposed method will use the feature selection technique to extract the required data from the large dataset based on the user requirements. The algorithm proposed will also help to increase the throughput of the system.\",\"PeriodicalId\":445567,\"journal\":{\"name\":\"2017 2nd International Conference on Computing and Communications Technologies (ICCCT)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 2nd International Conference on Computing and Communications Technologies (ICCCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCT2.2017.7972294\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 2nd International Conference on Computing and Communications Technologies (ICCCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCT2.2017.7972294","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

大数据是一个术语,指的是大量的数据集,这些数据集对于数据处理应用程序来说变得越来越复杂,而且不足以处理它们。大数据需要一套能够轻松处理复杂数据集、易于处理的技术和技术。在许多应用中,特征选择技术已经成为从大量数据中识别所需信息的明显需求。现有系统在每次提交用户请求时都重复处理相同的数据,即使是一个小任务。本文提出了一种基于火花流的特征选择技术,该技术可以实时获取输入数据流,并对其进行批量处理,从给定的输入流数据中提取出需要的特征。该方法将基于用户需求,利用特征选择技术从大型数据集中提取所需数据。该算法还有助于提高系统的吞吐量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Data processing for large database using feature selection
Big data is a term for huge amount of data sets that are becoming more complex for data processing applications and are inadequate to deal with them. Big data need a set of techniques and technologies which can easily handle the complex data set and can be easily processed. Feature selection techniques have become an apparent need in many applications to identify the required information from a large set of data. Existing systems processes the same data repeatedly each time when a user request is submitted even for a small task. In this work, a feature selection technique using spark streaming is proposed which can get live stream of input data and process it in batches to extract the required feature from the given input stream data. The proposed method will use the feature selection technique to extract the required data from the large dataset based on the user requirements. The algorithm proposed will also help to increase the throughput of the system.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Smart waste management using Internet-of-Things (IoT) HOT GLASS - human face, object and textual recognition for visually challenged Preserving data and key privacy in Data Aggregation for Wireless Sensor Networks FPGA implementation of artificial Neural Network for forest fire detection in wireless Sensor Network Rival Check Cross Correlator for locating strategic defense base using supervised learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1