
2015 Third IEEE Workshop on Hot Topics in Web Systems and Technologies (HotWeb): Latest Papers

Fog Computing: Platform and Applications
Shanhe Yi, Zijiang Hao, Zhengrui Qin, Qun A. Li
Despite the broad utilization of cloud computing, some applications and services still cannot benefit from this popular computing paradigm because of inherent problems such as unacceptable latency and the lack of mobility support and location awareness. As a result, fog computing has emerged as a promising infrastructure for providing elastic resources at the edge of the network. In this paper, we discuss current definitions of fog computing and similar concepts, and propose a more comprehensive definition. We also analyze the goals and challenges of a fog computing platform, and present a platform design with several exemplar applications. Finally, we implement and evaluate a prototype fog computing platform.
DOI: 10.1109/HotWeb.2015.22 (https://doi.org/10.1109/HotWeb.2015.22). Published 2015-11-12.
Citations: 623
An Intersection Cache Based on Frequent Itemset Mining in Large Scale Search Engines
Wanwan Zhou, Ruixuan Li, Xinhua Dong, Zhiyong Xu, Weijun Xiao
Caching is an effective optimization in large-scale web search engines: it reduces the underlying I/O burden on storage systems as far as possible by exploiting cache locality. Result caches and posting list caches are the commonly used approaches. However, they do not perform well with long queries, and the policies used in intersection caches are inefficient and adapt poorly to different applications. In this paper, we analyze the characteristics of query term intersections in typical search engines and present a novel three-level cache architecture, called TLMCA, which combines the intersection cache, result cache, and posting list cache in memory. In TLMCA, we introduce an intersection cache data selection policy based on Top-N frequent itemset mining, and design an intersection cache data replacement policy based on incremental frequent itemset mining. The experimental results demonstrate that the proposed intersection cache selection and replacement policies used in TLMCA can improve retrieval performance by up to 27% compared to the two-level cache.
DOI: 10.1109/HotWeb.2015.17 (https://doi.org/10.1109/HotWeb.2015.17). Published 2015-11-12.
Citations: 6
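The selection idea in the TLMCA abstract above (choose which term intersections to cache by mining frequent itemsets from the query stream) can be illustrated with a very small sketch. This is not the authors' policy: it assumes itemsets of size two only, a plain in-memory query log, and hypothetical names (`top_n_term_pairs`, `query_log`) that do not come from the paper.

```python
from collections import Counter
from itertools import combinations

def top_n_term_pairs(queries, n):
    """Return the n term pairs that co-occur most often across the query log."""
    counts = Counter()
    for query in queries:
        # Deduplicate and sort terms so each unordered pair has one canonical form.
        terms = sorted(set(query.lower().split()))
        counts.update(combinations(terms, 2))
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [pair for pair, _ in ranked[:n]]

# Hypothetical query log used only for illustration.
query_log = [
    "cheap flights new york",
    "new york hotels",
    "new york weather",
    "cheap hotels",
]
selected = top_n_term_pairs(query_log, 2)
print(selected)
```

The pairs returned here would be the candidates whose posting-list intersections are worth precomputing and caching; a real system would also have to weigh the size of each intersection against the cache budget.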
Self-Generating a Labor Force for Crowdsourcing: Is Worker Confidence a Predictor of Quality?
Julian Jarrett, Larissa Ferreira da Silva, Laerte Mello, Sadallo Andere, Gustavo Cruz, M. Brian Blake
When leveraging the crowd to perform complex tasks, it is imperative to identify the most effective worker for a particular job. Demographic profiles provided by workers, skill self-assessments by workers, and past performance as captured by employers all represent viable data points available within labor markets. Employers often question the validity of a worker's self-assessment of skills and expertise level when selecting workers in the context of other information. More specifically, employers would like to answer the question, "Is worker confidence a predictor of quality?" In this paper, we discuss the state of the art in recommending crowd workers based on assessment information. A major contribution of our work is an architecture, platform, and push/pull process for categorizing and recommending workers based on available self-assessment information. We present a study exploring the validity of skills input by workers in light of their actual performance and other metrics captured by employers. A further contribution of this approach is the extrapolation of a body of workers to describe the nature of the community more broadly. Through experimentation within the language-processing domain, we demonstrate a new capability of deriving trends that might help future employers select appropriate workers.
DOI: 10.1109/HotWeb.2015.9 (https://doi.org/10.1109/HotWeb.2015.9). Published 2015-11-12.
Citations: 11
Performance Comparison of Web Servers with Different Architectures: A Case Study Using High Concurrency Workload
Qingwen Fan, Qingyang Wang
Responsiveness is increasingly important for web servers to interact seamlessly with end users and enhance user experience. In this paper, we study how different server architectures (asynchronous and thread-based) affect the responsiveness of web servers under high-concurrency workloads. Through extensive measurements with a standard web server benchmark (Apache Bench), we show that web servers with an asynchronous architecture can achieve much better tail latency than the thread-based version because they handle high-concurrency workloads more robustly. Our fine-grained timeline analysis shows that a thread-based server is fragile under high-concurrency workloads because its queue for concurrent requests is limited in size (e.g., by the thread pool size), causing queue overflow and requests with very long response times due to TCP retransmissions. On the other hand, if we configure a thread-based server with a large thread pool to avoid queue overflow, the maximum achievable throughput can be significantly lower than that of the asynchronous version because of multi-threading overhead. Our initial results suggest that an asynchronous architecture should be considered when constructing highly responsive and robust web applications that involve hundreds of servers in cloud data centers.
DOI: 10.1109/HotWeb.2015.11 (https://doi.org/10.1109/HotWeb.2015.11). Published 2015-11-12.
Citations: 18
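The asynchronous architecture contrasted with thread pools in the abstract above can be illustrated with a minimal sketch. It assumes Python's standard `asyncio` event loop stands in for an asynchronous server, and `asyncio.sleep` stands in for non-blocking I/O; the names `handle` and `serve` and the request count are hypothetical. It does not reproduce the paper's benchmark or its results, and only shows that a single thread can keep many requests in flight without a per-request thread or a bounded worker queue.

```python
import asyncio

async def handle(request_id, state):
    """Simulate one request: track how many are in flight, then wait on 'I/O'."""
    state["active"] += 1
    state["peak"] = max(state["peak"], state["active"])
    await asyncio.sleep(0.01)  # placeholder for a non-blocking read/write
    state["active"] -= 1
    return request_id

async def serve(n_requests):
    """Start all requests concurrently on one event loop (one OS thread)."""
    state = {"active": 0, "peak": 0}
    results = await asyncio.gather(*(handle(i, state) for i in range(n_requests)))
    return len(results), state["peak"]

completed, peak = asyncio.run(serve(100))
print(f"completed={completed}, peak in-flight requests={peak}")
```

A thread-based design would need either one thread per in-flight request or a queue in front of a fixed pool, which is where the queue overflow and multi-threading overhead described in the abstract come from.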
Fog Computing Based Ultraviolet Radiation Measurement via Smartphones
Bo Mei, Wei Cheng, Xiuzhen Cheng
Ultraviolet (UV) radiation has a great impact on human health. Today, the public gets information about UV radiance mainly through weather forecasts. However, weather forecasts provide only a rough, averaged prediction for a large region. Since the CMOS sensors in mobile phone cameras are very sensitive to UV, mobile phones have the potential to be ideal equipment for measuring UV radiance. This paper introduces a method that measures UV radiance using only mobile phone cameras. In addition, by utilizing fog computing, results can be gathered and amended locally through a fog server to provide relatively accurate UV measurements. We first discuss the advantages of using mobile phones for UV measurement and then lay out the theoretical foundations in detail. Next, we provide a procedure that can be implemented on mobile phones, and describe an Android app called UV Meter that was developed based on this procedure. Finally, we verify the approach under different weather conditions. The results show that the procedure is valid and can easily be implemented on mobile phones for everyday UV measurement.
DOI: 10.1109/HotWeb.2015.16 (https://doi.org/10.1109/HotWeb.2015.16). Published 2015-11-12.
Citations: 22
Re-Examining the Complexity of Popular Websites
Ran Tian, R. Rejaie
During the past decade, the Web has become increasingly popular and thus more important for the delivery of content and services over the Internet. At the same time, the number of requested objects, their size, and the delivery mechanisms of popular websites have become more complex. This in turn has various implications, including an impact on page loading time that directly affects the experience of visiting users. Therefore, it is important to capture and characterize the complexity of popular web pages. An earlier study by Butkiewicz et al. characterized the complexity of 1700 popular pages in 2011. In this study, we adopt the methodology proposed by Butkiewicz et al., develop the required tools, and conduct a detailed measurement study to re-assess the complexity of 2000 popular web pages and present any observed trends in their complexity characteristics over the past four years. Our results show that the number of requested objects and contacted servers for each website has significantly increased, and that a growing number of contacted servers are associated with third parties. Despite these changes, the page loading time remains largely unchanged and is primarily affected by the same key parameters. Overall, our results shed useful light on trends in website complexity and motivate a range of issues to be explored.
DOI: 10.1109/HotWeb.2015.23 (https://doi.org/10.1109/HotWeb.2015.23). Published 2015-11-12.
Citations: 1
Inferring Web API Descriptions from Usage Data
Philippe Suter, Erik Wittern
We describe a set of techniques to infer structured descriptions of web APIs from usage examples. Using trained classifiers, we identify fixed and variable segments in paths, and tag parameters according to their types. We implemented our techniques and evaluated their precision on 10 APIs for which we obtained: 1) descriptions, manually written by the API maintainers, and 2) server logs of the API usage. Our experiments show that our system is able to reconstruct the structure of both simple and complex web API descriptions, outperforming an existing tool with similar goals. Finally, we assess the impact of noise in the input data on the results of our method.
DOI: 10.1109/HotWeb.2015.19 (https://doi.org/10.1109/HotWeb.2015.19). Published 2015-11-12.
Citations: 16
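To make the notion of fixed and variable path segments in the abstract above concrete, here is a deliberately simple sketch. The paper uses trained classifiers; this example swaps in a much cruder heuristic (a segment is treated as variable if more than one value is observed at that position), and the function name `infer_template` and the sample paths are hypothetical.

```python
def infer_template(paths):
    """Collapse equal-depth request paths into one template with {paramN} placeholders."""
    split_paths = [path.strip("/").split("/") for path in paths]
    if len({len(segments) for segments in split_paths}) != 1:
        raise ValueError("this sketch only handles paths of equal depth")
    template = []
    # zip(*...) groups the values seen at each segment position across all paths.
    for position, values in enumerate(zip(*split_paths)):
        if len(set(values)) == 1:
            template.append(values[0])              # same everywhere: fixed segment
        else:
            template.append("{param%d}" % position)  # varies: treat as a parameter
    return "/" + "/".join(template)

# Hypothetical paths as they might appear in a server log.
logged_paths = ["/users/17/orders", "/users/42/orders", "/users/8/orders"]
template = infer_template(logged_paths)
print(template)
```

This heuristic fails when a fixed segment legitimately takes several values (for example `/v1/` and `/v2/`) or when a parameter happens to be constant in the sample, which is one reason a learned classifier over richer features is attractive.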
Towards Automatic and Lightweight Detection and Classification of Malicious Web Contents
Aziz Mohaisen
Malicious webpages are today one of the most prevalent threats in the Internet security landscape. To understand this problem, there have been several efforts to analyze, classify, and label malicious webpages, ranging from simple static techniques to more elaborate dynamic techniques. Building on such efforts, this paper summarizes our work on the design and evaluation of a system that applies machine learning techniques to network metadata to identify malicious webpages and classify them into broader classes of vulnerabilities. The system uses easy-to-interpret features, uniquely acquired dynamic network artifacts, and known labels for webpages rendered in a sandboxed environment. We report on the success (and failure) of our system, and on the way forward by suggesting open directions for practical malicious web content classification.
DOI: 10.1109/HotWeb.2015.20 (https://doi.org/10.1109/HotWeb.2015.20). Published 2015-11-12.
Citations: 17
Opportunities and Challenges for Caching and Prefetching on Mobile Devices
P. Cao
Because demand for mobile data will outgrow the capacity of wireless networks for a long time, mobile applications should employ aggressive data caching and preloading to reduce the end users' data cost. This paper examines the algorithms that individual mobile applications can use for caching and preloading decisions, and proposes mechanisms for mobile operating systems to support such application-controlled caching and preloading. The paper ends with a call for data and tools to enable future research in the area.
DOI: 10.1109/HotWeb.2015.18 (https://doi.org/10.1109/HotWeb.2015.18). Published 2015-11-12.
Citations: 3
Programming Support for an Integrated Multi-Party Computation and MapReduce Infrastructure
Nikolaj Volgushev, A. Lapets, Azer Bestavros
We describe and present a prototype of a distributed computational infrastructure and associated high-level programming language that allow multiple parties to leverage their own computational resources capable of supporting MapReduce [1] operations in combination with multi-party computation (MPC). Our architecture allows a programmer to author and compile a protocol using a uniform collection of standard constructs, even when that protocol involves computations that take place locally within each participant's MapReduce cluster as well as across all the participants using an MPC protocol. The high-level programming language provided to the user is accompanied by static analysis algorithms that allow the programmer to reason about the efficiency of the protocol before compiling and running it. We present two example applications demonstrating how such an infrastructure can be employed.
DOI: 10.1109/HOTWEB.2015.21 (https://doi.org/10.1109/HOTWEB.2015.21). Published 2015-11-12.
Citations: 4
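The split described in the abstract above, where some computation happens locally inside each participant's cluster and the rest happens across participants under MPC, can be sketched in a few lines. This is only an illustration under stated assumptions: the local step is an ordinary map followed by a summing reduce, the cross-party step is additive secret sharing (a common MPC building block, not necessarily the protocol the authors use), everything runs in one process, and all names (`local_total`, `make_shares`, `secure_sum`, `MODULUS`) are hypothetical rather than taken from the paper's language.

```python
import secrets

MODULUS = 2**61 - 1  # assumed modulus; all share arithmetic is done mod this value

def local_total(records):
    """Local step: map each record to its amount, then reduce by summing."""
    return sum(map(lambda record: record["amount"], records))

def make_shares(value, n_parties):
    """Split value into n_parties random shares that add up to value mod MODULUS."""
    shares = [secrets.randbelow(MODULUS) for _ in range(n_parties - 1)]
    shares.append((value - sum(shares)) % MODULUS)
    return shares

def secure_sum(local_values):
    """Cross-party step: each party only ever sees one share of every other input."""
    n_parties = len(local_values)
    shares_by_owner = [make_shares(value, n_parties) for value in local_values]
    # Party j adds up the j-th share from every owner and publishes that partial sum.
    partials = [
        sum(owner_shares[j] for owner_shares in shares_by_owner) % MODULUS
        for j in range(n_parties)
    ]
    return sum(partials) % MODULUS

# Hypothetical private datasets, one per participant.
datasets = [
    [{"amount": 10}, {"amount": 20}],
    [{"amount": 12}],
    [{"amount": 5}, {"amount": 3}],
]
total = secure_sum([local_total(records) for records in datasets])
print(total)
```

The point of the combined design is that the expensive bulk work stays in the local step, while only small aggregates enter the MPC step, whose cost grows with the number of parties and values shared.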