Authors: C. Lee, Incheon Paik
Venue: 2017 IEEE 8th International Conference on Awareness Science and Technology (iCAST)
Published: 2017-11-01
DOI: 10.1109/ICAWST.2017.8256469 (https://doi.org/10.1109/ICAWST.2017.8256469)
Stock market analysis from Twitter and news based on streaming big data infrastructure
With the rapid development of the web, social media services and the Internet of Things (IoT) produce a huge volume of data every second. This data is not only large but also grows quickly and is difficult to analyze. Most traditional big data frameworks cannot process such data in real time, so many companies and researchers have started to develop new big data frameworks. Apache Spark, Apache Flink, and Apache Storm have been introduced for real-time data processing, and these frameworks have made it more efficient to analyze streaming data. Stock market analysis is an active application domain for big streaming data. In this paper, we build a real-time processing system that analyzes tweets to find correlations with the stock market. We describe the system configuration and its performance. With 77% accuracy in classifying Twitter data, we obtained 80% separation between increases and decreases in stock value.
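The abstract does not say how the relationship between classified tweets and stock movement is measured. As one possible reading, the sketch below sums per-tweet labels by day and reports the fraction of days on which the sign of the net label matches the sign of the price change. The function names, the +1/-1 label encoding, the daily granularity, and the toy data are all assumptions made for illustration; none of them come from the paper.

```python
from collections import defaultdict


def daily_net_sentiment(labeled_tweets):
    """Sum tweet labels per day.

    labeled_tweets: iterable of (day, label) pairs, where label is +1 or -1.
    The label encoding is an assumption for this sketch.
    """
    totals = defaultdict(int)
    for day, label in labeled_tweets:
        totals[day] += label
    return dict(totals)


def direction_agreement(net_sentiment, price_change):
    """Fraction of days where net sentiment and price change share a sign.

    Days missing from price_change, or with a zero value on either side,
    are skipped. Returns NaN if no day can be compared.
    """
    matches = 0
    counted = 0
    for day, sentiment in net_sentiment.items():
        change = price_change.get(day)
        if change is None or sentiment == 0 or change == 0:
            continue
        counted += 1
        if (sentiment > 0) == (change > 0):
            matches += 1
    return matches / counted if counted else float("nan")


# Made-up toy data, not taken from the paper.
tweets = [
    ("d1", 1), ("d1", 1), ("d1", -1),
    ("d2", -1), ("d2", -1),
    ("d3", 1),
    ("d4", -1), ("d4", 1), ("d4", 1),
]
changes = {"d1": 0.8, "d2": -1.2, "d3": -0.3, "d4": 0.5}

net = daily_net_sentiment(tweets)
agreement = direction_agreement(net, changes)
print(agreement)  # 0.75 for this toy data
```

In a streaming setting, the same aggregation would typically run over time windows inside the processing framework instead of over an in-memory list.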