StreamWeb: Real-Time Web Monitoring with Stream Computing

T. Suzumura, Tomoaki Oiki
{"title":"StreamWeb: Real-Time Web Monitoring with Stream Computing","authors":"T. Suzumura, Tomoaki Oiki","doi":"10.1109/ICWS.2011.16","DOIUrl":null,"url":null,"abstract":"A new trend involves Web services such as Twitter beginning to publish streaming Web APIs that enable partners and end users to retrieve streaming data. By combining such push-based Web services and existing pull-based Web services, it is now possible for us to understand the current status or trends of the world in a more real-time way, such as real-time tracking of infectious disease, real-time crime prediction, or real-time marketing, and so various innovative business services are possible. For a system architecture to implement such services, the services are normally built from the scratch, and the performance and scalability depend upon the engineers' skills. In this paper we propose a real-time Web monitoring system called gStreamWebh on top of a stream computing system called System S developed by IBM Research. The Stream Web system allows developers to easily describe their analytical algorithms for a variety of kinds of Web streaming data without worrying about the performance and scalability, and provides real-time and scalable Web monitoring for massive amounts of data. As an experimental proof-of-concept application, we built an application that monitors a list of keywords in the Twitter streaming data, and that displays any messages including the specified keywords onto a map of the physical location (from Google) where the message was posted. Our system can handle nearly 30 thousand Twitter messages per second on a system with 8 computing nodes. This prototype application confirms that we can build real-time Web monitoring systems while satisfying the needs for high software productivity and for system scalability.","PeriodicalId":118512,"journal":{"name":"2011 IEEE International Conference on Web Services","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Web Services","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWS.2011.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

Abstract

A new trend involves Web services such as Twitter beginning to publish streaming Web APIs that enable partners and end users to retrieve streaming data. By combining such push-based Web services and existing pull-based Web services, it is now possible for us to understand the current status or trends of the world in a more real-time way, such as real-time tracking of infectious disease, real-time crime prediction, or real-time marketing, and so various innovative business services are possible. For a system architecture to implement such services, the services are normally built from the scratch, and the performance and scalability depend upon the engineers' skills. In this paper we propose a real-time Web monitoring system called gStreamWebh on top of a stream computing system called System S developed by IBM Research. The Stream Web system allows developers to easily describe their analytical algorithms for a variety of kinds of Web streaming data without worrying about the performance and scalability, and provides real-time and scalable Web monitoring for massive amounts of data. As an experimental proof-of-concept application, we built an application that monitors a list of keywords in the Twitter streaming data, and that displays any messages including the specified keywords onto a map of the physical location (from Google) where the message was posted. Our system can handle nearly 30 thousand Twitter messages per second on a system with 8 computing nodes. This prototype application confirms that we can build real-time Web monitoring systems while satisfying the needs for high software productivity and for system scalability.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
流网络:实时网络监控与流计算
一个新的趋势是,像Twitter这样的Web服务开始发布流Web api,使合作伙伴和最终用户能够检索流数据。通过将这种基于推送的Web服务与现有的基于拉动的Web服务相结合,我们现在可以更实时地了解世界的现状或趋势,例如实时跟踪传染病、实时犯罪预测或实时营销,从而可以实现各种创新的业务服务。对于实现此类服务的系统体系结构,服务通常是从头构建的,性能和可伸缩性取决于工程师的技能。本文在IBM研究院开发的流计算系统system S的基础上,提出了一个名为“gStreamWeb”的实时Web监控系统。流Web系统允许开发人员轻松地描述他们对各种Web流数据的分析算法,而不必担心性能和可伸缩性,并为大量数据提供实时和可伸缩的Web监控。作为一个实验性的概念验证应用程序,我们构建了一个应用程序,它监视Twitter流数据中的关键字列表,并将包含指定关键字的任何消息显示在发布消息的物理位置(来自Google)的地图上。我们的系统可以在一个有8个计算节点的系统上每秒处理近3万条Twitter消息。这个原型应用程序证实了我们可以构建实时Web监控系统,同时满足对高软件生产力和系统可伸缩性的需求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Selection of Composable Web Services Driven by User Requirements Progressive Reliability Forecasting of Service-Oriented Software Opportunistic Composition of Sequentially-Connected Services in Mobile Computing Environments Improving Web API Discovery by Leveraging Social Information CLAM: Cross-Layer Management of Adaptation Decisions for Service-Based Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1