首页 > 最新文献

2009 Fifth IEEE International Conference on e-Science最新文献

英文 中文
Towards a Mobile Workbench for Researchers 面向研究人员的移动工作台
Pub Date : 2009-12-09 DOI: 10.1109/e-Science.2009.26
Andreas V. Hense, Florian Quadt, Matthias Römer
Mobile devices as the iPhone and Android-based smartphones can be turned into useful tools for researchers in the field. The concept of a scholarly or scientific workbench can be extended by a mobile usage scenario, where the researcher takes photographs, records sounds, and performs basic metadata editing. We present the prototype and the architecture of a mobile workbench that uploads geo-tagged photographs taken by the camera of the smartphone into a digital repository. We discuss the idea of a research workbench as an integrated but loosely-coupled set of services; we show where in our solution the functionalities of a research workbench are contained or where other services can be added. Finally, we focus on the software running on the mobile phone, which is in itself a set of loosely-coupled applications.
移动设备,如iPhone和基于android的智能手机,可以成为该领域研究人员的有用工具。学术或科学工作台的概念可以通过移动使用场景进行扩展,在移动使用场景中,研究人员可以拍摄照片、记录声音并执行基本的元数据编辑。我们展示了一个移动工作台的原型和架构,它可以将智能手机相机拍摄的地理标记照片上传到数字存储库中。我们讨论了研究工作台作为一组集成但松散耦合的服务的概念;我们展示了解决方案中包含研究工作台功能的位置,或者可以添加其他服务的位置。最后,我们将重点放在运行在手机上的软件上,它本身就是一组松耦合的应用程序。
{"title":"Towards a Mobile Workbench for Researchers","authors":"Andreas V. Hense, Florian Quadt, Matthias Römer","doi":"10.1109/e-Science.2009.26","DOIUrl":"https://doi.org/10.1109/e-Science.2009.26","url":null,"abstract":"Mobile devices as the iPhone and Android-based smartphones can be turned into useful tools for researchers in the field. The concept of a scholarly or scientific workbench can be extended by a mobile usage scenario, where the researcher takes photographs, records sounds, and performs basic metadata editing. We present the prototype and the architecture of a mobile workbench that uploads geo-tagged photographs taken by the camera of the smartphone into a digital repository. We discuss the idea of a research workbench as an integrated but loosely-coupled set of services; we show where in our solution the functionalities of a research workbench are contained or where other services can be added. Finally, we focus on the software running on the mobile phone, which is in itself a set of loosely-coupled applications.","PeriodicalId":325840,"journal":{"name":"2009 Fifth IEEE International Conference on e-Science","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128975265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Robust Asynchronous Optimization for Volunteer Computing Grids 志愿计算网格的鲁棒异步优化
Pub Date : 2009-12-09 DOI: 10.1109/E-SCIENCE.2009.44
Travis Desell, M. Magdon-Ismail, B. Szymanski, Carlos A. Varela, H. Newberg, N. Cole
Volunteer computing grids offer significant computing power at relatively low cost to researchers, while at the same time generating public interest in different scientific projects. However, in order to be used effectively, their heterogeneity, volatility and restrictive computing models must be overcome. As these computing grids are open, incorrect or malicious results must also be handled. This paper examines extending the BOINC volunteer computing framework to allow for asynchronous global optimization as applied to scientific computing problems. The asynchronous optimization method used is resilient to faults and the heterogeneous nature of volunteer computing grids, while allowing scalability to tens of thousands of hosts. A work verification strategy that does not require the validation of every result is presented. This is shown to be able to effectively reduce the need for verification done to less than 30% of the reported results, without degrading the performance of the asynchronous search methods. An asynchronous version of particle swarm optimization (APSO) is presented and com- pared to previously used asynchronous genetic search (AGS) using the MilkyWay@Home BOINC computing project. Both search methods are shown to scale to MilkyWay@Home’s current user base, over 75,000 heterogeneous and volatile hosts, something not possible for traditional optimization methods. APSO is shown to provide faster convergence to optimal results while being less sensitive to its search parameters. The verification strategy presented is shown to be effective for both AGS and APSO.
志愿者计算网格以相对较低的成本为研究人员提供了重要的计算能力,同时也引起了公众对不同科学项目的兴趣。然而,为了有效地利用它们,必须克服它们的异构性、波动性和限制性计算模型。由于这些计算网格是开放的,因此还必须处理不正确或恶意的结果。本文研究了扩展BOINC志愿计算框架,以允许应用于科学计算问题的异步全局优化。所使用的异步优化方法对故障和志愿计算网格的异构特性具有弹性,同时允许数万台主机的可伸缩性。提出了一种不需要对每个结果进行验证的工作验证策略。事实证明,这能够有效地将需要完成的验证减少到报告结果的30%以下,而不会降低异步搜索方法的性能。提出了一种异步版本的粒子群优化算法(APSO),并利用MilkyWay@Home BOINC计算项目与先前使用的异步遗传搜索算法(AGS)进行了比较。这两种搜索方法都可以扩展到MilkyWay@Home当前的用户群,超过75,000个异构和不稳定的主机,这对于传统的优化方法来说是不可能的。结果表明,APSO可以更快地收敛到最优结果,同时对其搜索参数不太敏感。所提出的验证策略对AGS和APSO均有效。
{"title":"Robust Asynchronous Optimization for Volunteer Computing Grids","authors":"Travis Desell, M. Magdon-Ismail, B. Szymanski, Carlos A. Varela, H. Newberg, N. Cole","doi":"10.1109/E-SCIENCE.2009.44","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2009.44","url":null,"abstract":"Volunteer computing grids offer significant computing power at relatively low cost to researchers, while at the same time generating public interest in different scientific projects. However, in order to be used effectively, their heterogeneity, volatility and restrictive computing models must be overcome. As these computing grids are open, incorrect or malicious results must also be handled. This paper examines extending the BOINC volunteer computing framework to allow for asynchronous global optimization as applied to scientific computing problems. The asynchronous optimization method used is resilient to faults and the heterogeneous nature of volunteer computing grids, while allowing scalability to tens of thousands of hosts. A work verification strategy that does not require the validation of every result is presented. This is shown to be able to effectively reduce the need for verification done to less than 30% of the reported results, without degrading the performance of the asynchronous search methods. An asynchronous version of particle swarm optimization (APSO) is presented and com- pared to previously used asynchronous genetic search (AGS) using the MilkyWay@Home BOINC computing project. Both search methods are shown to scale to MilkyWay@Home’s current user base, over 75,000 heterogeneous and volatile hosts, something not possible for traditional optimization methods. APSO is shown to provide faster convergence to optimal results while being less sensitive to its search parameters. The verification strategy presented is shown to be effective for both AGS and APSO.","PeriodicalId":325840,"journal":{"name":"2009 Fifth IEEE International Conference on e-Science","volume":"144 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132233464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Stream Monitoring in Large-Scale Distributed Concealed Environments 大规模分布式隐蔽环境中的流监测
Pub Date : 2009-12-09 DOI: 10.1109/e-Science.2009.30
M. Lassnig, T. Fahringer, V. Garonne, A. Molfetas, M. Branco
We present a probabilistic tracing method that captures both user and system behaviour for large-scale distributed applications. Our method extends the notion of data stream monitoring to work within what we define as concealed environments. We detail the conceptual design and implementation of our method. Additionally, we evaluate the scalability of the tracing method in a real petabyte-scale distributed data management system. Finally, we demonstrate the usefulness of the collected trace data in three scenarios. First, we use collected trace data to examine the arrival of user events and find self-similar processes. Second, we examine the behaviour and performance of mass storage systems in a grid under concurrent requests. Third, we develop a model for prediction of user event arrivals based on historical data. Our results suggest that a probabilistic tracing method is scalable, straightforward to integrate with existing applications, and provides useful insight into the behaviour of very large-scale applications.
我们提出了一种概率跟踪方法,可以捕获大规模分布式应用程序的用户和系统行为。我们的方法扩展了数据流监控的概念,使其在我们定义的隐藏环境中工作。详细介绍了该方法的概念设计和实现。此外,我们还评估了跟踪方法在真实的pb级分布式数据管理系统中的可扩展性。最后,我们将在三种场景中演示所收集的跟踪数据的有用性。首先,我们使用收集的跟踪数据来检查用户事件的到达并找到自相似的过程。其次,我们研究了并发请求下网格中大容量存储系统的行为和性能。第三,我们开发了一个基于历史数据的用户事件到达预测模型。我们的结果表明,概率跟踪方法是可扩展的,可以直接与现有应用程序集成,并提供对非常大规模应用程序行为的有用见解。
{"title":"Stream Monitoring in Large-Scale Distributed Concealed Environments","authors":"M. Lassnig, T. Fahringer, V. Garonne, A. Molfetas, M. Branco","doi":"10.1109/e-Science.2009.30","DOIUrl":"https://doi.org/10.1109/e-Science.2009.30","url":null,"abstract":"We present a probabilistic tracing method that captures both user and system behaviour for large-scale distributed applications. Our method extends the notion of data stream monitoring to work within what we define as concealed environments. We detail the conceptual design and implementation of our method. Additionally, we evaluate the scalability of the tracing method in a real petabyte-scale distributed data management system. Finally, we demonstrate the usefulness of the collected trace data in three scenarios. First, we use collected trace data to examine the arrival of user events and find self-similar processes. Second, we examine the behaviour and performance of mass storage systems in a grid under concurrent requests. Third, we develop a model for prediction of user event arrivals based on historical data. Our results suggest that a probabilistic tracing method is scalable, straightforward to integrate with existing applications, and provides useful insight into the behaviour of very large-scale applications.","PeriodicalId":325840,"journal":{"name":"2009 Fifth IEEE International Conference on e-Science","volume":"28 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132468892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Comparing METS and OAI-ORE for Encapsulating Scientific Data Products: A Protein Crystallography Case Study 比较METS和OAI-ORE封装科学数据产品:蛋白质晶体学案例研究
Pub Date : 2009-12-09 DOI: 10.1109/e-Science.2009.29
C. Brooking, S. R. Shouldice, Gautier Robin, B. Kobe, Jennifer L. Martin, J. Hunter
This paper describes the set of eResearch services developed by the eResearch Lab within the University of Queensland (UQ) for the Structural Genomics (SG) Group at UQ. The aim of these services is to enable collaborative teams of protein crystallographers in the SG group to track their experiments and to manage the plethora and diversity of data that they generate through distributed high-throughput approaches and complex scientific workflows. More specifically we describe: the secure Web-based laboratory information management system (TIMTAM) and the X-ray diffraction image archive (DIMER) used to monitor experiments and record data captured prior to structure determination and the publication of a new crystal structure in public repositories such as the Protein Data bank (PDB). We also describe the services that we have developed to relate the different products generated at each stage in the protein crystallography pipeline through OAI-ORE compound objects. We conclude by comparing the OAI-ORE approach for publishing and sharing related scientific outcomes with the METS-based approach employed by other scientific laboratories.
本文介绍了由昆士兰大学(UQ)内的erresearch实验室为昆士兰大学结构基因组学(SG)小组开发的erresearch服务。这些服务的目的是使SG小组的蛋白质晶体学家协作团队能够跟踪他们的实验,并管理他们通过分布式高通量方法和复杂的科学工作流程生成的过多和多样性的数据。更具体地说,我们描述了:安全的基于web的实验室信息管理系统(TIMTAM)和x射线衍射图像档案(DIMER),用于监测实验和记录在结构确定之前捕获的数据,并在诸如蛋白质数据库(PDB)等公共存储库中发布新的晶体结构。我们还描述了我们开发的服务,这些服务通过OAI-ORE复合对象将蛋白质晶体学管道中每个阶段生成的不同产品联系起来。最后,我们将OAI-ORE发表和分享相关科学成果的方法与其他科学实验室采用的基于mets的方法进行了比较。
{"title":"Comparing METS and OAI-ORE for Encapsulating Scientific Data Products: A Protein Crystallography Case Study","authors":"C. Brooking, S. R. Shouldice, Gautier Robin, B. Kobe, Jennifer L. Martin, J. Hunter","doi":"10.1109/e-Science.2009.29","DOIUrl":"https://doi.org/10.1109/e-Science.2009.29","url":null,"abstract":"This paper describes the set of eResearch services developed by the eResearch Lab within the University of Queensland (UQ) for the Structural Genomics (SG) Group at UQ. The aim of these services is to enable collaborative teams of protein crystallographers in the SG group to track their experiments and to manage the plethora and diversity of data that they generate through distributed high-throughput approaches and complex scientific workflows. More specifically we describe: the secure Web-based laboratory information management system (TIMTAM) and the X-ray diffraction image archive (DIMER) used to monitor experiments and record data captured prior to structure determination and the publication of a new crystal structure in public repositories such as the Protein Data bank (PDB). We also describe the services that we have developed to relate the different products generated at each stage in the protein crystallography pipeline through OAI-ORE compound objects. We conclude by comparing the OAI-ORE approach for publishing and sharing related scientific outcomes with the METS-based approach employed by other scientific laboratories.","PeriodicalId":325840,"journal":{"name":"2009 Fifth IEEE International Conference on e-Science","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116669430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Fluxdata.org: Publication and Curation of Shared Scientific Climate and Earth Sciences Data Fluxdata.org:气候和地球科学共享数据的出版和管理
Pub Date : 2009-12-09 DOI: 10.1109/E-SCIENCE.2009.25
M. Humphrey, D. Agarwal, C. Ingen
Many of today’s large-scale scientific projects attempt to collect data from a diverse set of sources. The traditional campaign-style approach to “synthesis” efforts gathers data through a single concentrated effort, and the data contributors know in advance exactly who will use their data and why. At even moderate scales, the cost and time required to find, gather, collate, normalize, and customize data in order to build a synthesis dataset can quickly outweigh the value of the resulting dataset. By explicitly identifying and addressing the different requirements for each data role (author, publisher, curator, and consumer), our data management architecture for large-scale shared scientific data enables the creation of such synthesis datasets that continue to grow and evolve with new data, data annotations, participants, and use rules. We show the effectiveness of our approach in the context of the FLUXNET Synthesis Dataset, one of the largest ongoing biogeophysical experiments.
当今许多大型科学项目都试图从不同的来源收集数据。传统的运动式“综合”方法通过单一的集中努力收集数据,数据贡献者事先确切地知道谁将使用他们的数据以及为什么使用。即使在中等规模下,为了构建合成数据集而查找、收集、整理、规范化和自定义数据所需的成本和时间也可能很快超过最终数据集的价值。通过明确地识别和解决每个数据角色(作者、发布者、管理者和消费者)的不同需求,我们的大规模共享科学数据的数据管理体系结构可以创建这样的综合数据集,这些数据集可以随着新的数据、数据注释、参与者和使用规则而不断增长和发展。我们在FLUXNET合成数据集(目前最大的生物地球物理实验之一)的背景下展示了我们方法的有效性。
{"title":"Fluxdata.org: Publication and Curation of Shared Scientific Climate and Earth Sciences Data","authors":"M. Humphrey, D. Agarwal, C. Ingen","doi":"10.1109/E-SCIENCE.2009.25","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2009.25","url":null,"abstract":"Many of today’s large-scale scientific projects attempt to collect data from a diverse set of sources. The traditional campaign-style approach to “synthesis” efforts gathers data through a single concentrated effort, and the data contributors know in advance exactly who will use their data and why. At even moderate scales, the cost and time required to find, gather, collate, normalize, and customize data in order to build a synthesis dataset can quickly outweigh the value of the resulting dataset. By explicitly identifying and addressing the different requirements for each data role (author, publisher, curator, and consumer), our data management architecture for large-scale shared scientific data enables the creation of such synthesis datasets that continue to grow and evolve with new data, data annotations, participants, and use rules. We show the effectiveness of our approach in the context of the FLUXNET Synthesis Dataset, one of the largest ongoing biogeophysical experiments.","PeriodicalId":325840,"journal":{"name":"2009 Fifth IEEE International Conference on e-Science","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128467958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Publish My Data: A Composition of Services from ANDS and ARCS 发布我的数据:来自ANDS和arc的服务组合
Pub Date : 2009-12-01 DOI: 10.1109/E-SCIENCE.2009.31
A. Burton, A. Treloar
This paper first provides an overview of the e-Research environment in Australia, before introducing the two organizations (the Australian National Data Service – ANDS, and the Australian Research Collaboration Services - ARCS) whose services will be focus of the paper. Next the paper describes the problem that needs to be solved before talking about the requirements for management of data and metadata. The paper then discusses the main ARCS and ANDS services that will be combined to provide Publish My Data. The paper provides an overview of how this combined service will work for data that is already stored and data not yet stored, before concluding.
本文首先概述了澳大利亚的电子研究环境,然后介绍了两个组织(澳大利亚国家数据服务- ANDS和澳大利亚研究合作服务- ARCS),这两个组织的服务将是本文的重点。接下来,本文描述了在讨论数据和元数据管理需求之前需要解决的问题。然后,本文讨论了主要的ARCS和ANDS服务,它们将结合起来提供“发布我的数据”。在总结之前,本文概述了这个组合服务将如何为已经存储和尚未存储的数据工作。
{"title":"Publish My Data: A Composition of Services from ANDS and ARCS","authors":"A. Burton, A. Treloar","doi":"10.1109/E-SCIENCE.2009.31","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2009.31","url":null,"abstract":"This paper first provides an overview of the e-Research environment in Australia, before introducing the two organizations (the Australian National Data Service – ANDS, and the Australian Research Collaboration Services - ARCS) whose services will be focus of the paper. Next the paper describes the problem that needs to be solved before talking about the requirements for management of data and metadata. The paper then discusses the main ARCS and ANDS services that will be combined to provide Publish My Data. The paper provides an overview of how this combined service will work for data that is already stored and data not yet stored, before concluding.","PeriodicalId":325840,"journal":{"name":"2009 Fifth IEEE International Conference on e-Science","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130522871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Rapid Chemistry Portals through Engaging Researchers 通过参与研究人员快速化学门户
Pub Date : 2009-12-01 DOI: 10.1109/e-Science.2009.47
J. Koetsier, Andrew Turner, Patricia Richardson, Jano van Hemert
In this study, we apply a methodology for rapid development of portlets for scientific computing to the domain of computational chemistry. We report results in terms of the portals delivered, the changes made to our methodology and the experience gained in terms of interaction with domain-specialists. Our major contributions are: several web portals for teaching and research in computational chemistry; a successful transition to having our development tool used by the domain specialist as opposed by us, the developers; and an updated version of our methodology and technology for rapid development of portlets for computational science, which is free for anyone to pick up and use.
在本研究中,我们将科学计算portlet的快速开发方法应用于计算化学领域。我们根据交付的门户、对我们的方法所做的更改以及根据与领域专家的交互获得的经验来报告结果。我们的主要贡献是:计算化学教学和研究的几个门户网站;一个成功的过渡,让我们的开发工具由领域专家使用,而不是由我们开发人员使用;我们的方法和技术的更新版本,用于快速开发用于计算科学的portlet,任何人都可以免费获取和使用。
{"title":"Rapid Chemistry Portals through Engaging Researchers","authors":"J. Koetsier, Andrew Turner, Patricia Richardson, Jano van Hemert","doi":"10.1109/e-Science.2009.47","DOIUrl":"https://doi.org/10.1109/e-Science.2009.47","url":null,"abstract":"In this study, we apply a methodology for rapid development of portlets for scientific computing to the domain of computational chemistry. We report results in terms of the portals delivered, the changes made to our methodology and the experience gained in terms of interaction with domain-specialists. Our major contributions are: several web portals for teaching and research in computational chemistry; a successful transition to having our development tool used by the domain specialist as opposed by us, the developers; and an updated version of our methodology and technology for rapid development of portlets for computational science, which is free for anyone to pick up and use.","PeriodicalId":325840,"journal":{"name":"2009 Fifth IEEE International Conference on e-Science","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123546158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
An Autonomic Approach to Integrated HPC Grid and Cloud Usage 集成HPC网格和云使用的自主方法
Pub Date : 2009-12-01 DOI: 10.1109/E-SCIENCE.2009.58
Hyunjoo Kim, Y. E. Khamra, S. Jha, M. Parashar
Clouds are rapidly joining high-performance Grids as viable computational platforms for scientific exploration and discovery, and it is clear that production computational infrastructures will integrate both these paradigms in the near future. As a result, understanding usage modes that are meaningful in such a hybrid infrastructure is critical. For example, there are interesting application workflows that can benefit from such hybrid usage modes to, per- haps, reduce times to solutions, reduce costs (in terms of currency or resource allocation), or handle unexpected runtime situations (e.g., unexpected delays in scheduling queues or unexpected failures). The primary goal of this paper is to experimentally investigate, from an applications perspective, how autonomics can enable interesting usage modes and scenarios for integrating HPC Grid and Clouds. Specifically, we used a reservoir characterization application workflow, based on Ensemble Kalman Filters (EnKF) for history matching, and the CometCloud autonomic Cloud engine on a hybrid platform consisting of the TeraGrid and Amazon EC2, to investigate 3 usage modes (or autonomic objectives) – acceleration, conservation and resilience.
云正在迅速加入高性能网格,成为科学探索和发现的可行计算平台,很明显,在不久的将来,生产计算基础设施将集成这两种范式。因此,理解在这种混合基础设施中有意义的使用模式至关重要。例如,有一些有趣的应用程序工作流可以从这种混合使用模式中受益,从而可能减少解决方案的时间、降低成本(就货币或资源分配而言),或者处理意外的运行时情况(例如,调度队列中的意外延迟或意外故障)。本文的主要目标是从应用的角度来实验研究,自治系统如何能够为集成高性能计算网格和云提供有趣的使用模式和场景。具体来说,我们使用了一个基于集成卡尔曼滤波器(EnKF)进行历史匹配的油藏描述应用流程,以及由TeraGrid和Amazon EC2组成的混合平台上的CometCloud自主云引擎,来研究3种使用模式(或自主目标)——加速、保护和弹性。
{"title":"An Autonomic Approach to Integrated HPC Grid and Cloud Usage","authors":"Hyunjoo Kim, Y. E. Khamra, S. Jha, M. Parashar","doi":"10.1109/E-SCIENCE.2009.58","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2009.58","url":null,"abstract":"Clouds are rapidly joining high-performance Grids as viable computational platforms for scientific exploration and discovery, and it is clear that production computational infrastructures will integrate both these paradigms in the near future. As a result, understanding usage modes that are meaningful in such a hybrid infrastructure is critical. For example, there are interesting application workflows that can benefit from such hybrid usage modes to, per- haps, reduce times to solutions, reduce costs (in terms of currency or resource allocation), or handle unexpected runtime situations (e.g., unexpected delays in scheduling queues or unexpected failures). The primary goal of this paper is to experimentally investigate, from an applications perspective, how autonomics can enable interesting usage modes and scenarios for integrating HPC Grid and Clouds. Specifically, we used a reservoir characterization application workflow, based on Ensemble Kalman Filters (EnKF) for history matching, and the CometCloud autonomic Cloud engine on a hybrid platform consisting of the TeraGrid and Amazon EC2, to investigate 3 usage modes (or autonomic objectives) – acceleration, conservation and resilience.","PeriodicalId":325840,"journal":{"name":"2009 Fifth IEEE International Conference on e-Science","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126446167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 50
A Topological Framework for the Interactive Exploration of Large Scale Turbulent Combustion 大尺度湍流燃烧相互作用探索的拓扑框架
Pub Date : 2009-12-01 DOI: 10.1109/E-SCIENCE.2009.42
P. Bremer, G. Weber, Julien Tierny, Valerio Pascucci, M. Day, J. Bell
The advent of highly accurate, large scale volumetric simulations has made data analysis and visualization techniques an integral part of the modern scientific process. To develop new insights from raw data, scientists need the ability to define features of interest in a flexible manner and to understand how changes in the feature definition impact the subsequent analysis of the data. Therefore, simply exploring the raw data is not sufficient. This paper presents a new topological framework for the analysis of large scale, time-varying, turbulent combustion simulations. It allows the scientists to interactively explore the complete parameter space of fuel consumption thresholds for an entire time-dependent combustion simulation. By computing augmented merge trees and their corresponding data segmentations, the system allows the user complete flexibility to segment, select, and track burning cells through time thanks to a linked view interface. We developed this technique in the context of low-swirl turbulent pre-mixed same simulation analysis, where the topological abstractions enable an efficient tracking through time of the burning cells and provide new qualitative and quantitative insights into the dynamics of the combustion process.
高精度、大规模体积模拟的出现使得数据分析和可视化技术成为现代科学过程中不可或缺的一部分。为了从原始数据中获得新的见解,科学家需要能够以灵活的方式定义感兴趣的特征,并了解特征定义的变化如何影响随后的数据分析。因此,仅仅探索原始数据是不够的。本文提出了一种新的拓扑框架,用于分析大尺度、时变湍流燃烧模拟。它使科学家能够交互式地探索整个时间相关燃烧模拟的油耗阈值的完整参数空间。通过计算增强合并树及其相应的数据分割,该系统允许用户完全灵活地分割、选择和跟踪燃烧细胞,这要归功于一个链接视图界面。我们在低涡流湍流预混合相同模拟分析的背景下开发了这项技术,其中拓扑抽象能够有效地跟踪燃烧细胞的时间,并为燃烧过程的动力学提供新的定性和定量见解。
{"title":"A Topological Framework for the Interactive Exploration of Large Scale Turbulent Combustion","authors":"P. Bremer, G. Weber, Julien Tierny, Valerio Pascucci, M. Day, J. Bell","doi":"10.1109/E-SCIENCE.2009.42","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2009.42","url":null,"abstract":"The advent of highly accurate, large scale volumetric simulations has made data analysis and visualization techniques an integral part of the modern scientific process. To develop new insights from raw data, scientists need the ability to define features of interest in a flexible manner and to understand how changes in the feature definition impact the subsequent analysis of the data. Therefore, simply exploring the raw data is not sufficient. This paper presents a new topological framework for the analysis of large scale, time-varying, turbulent combustion simulations. It allows the scientists to interactively explore the complete parameter space of fuel consumption thresholds for an entire time-dependent combustion simulation. By computing augmented merge trees and their corresponding data segmentations, the system allows the user complete flexibility to segment, select, and track burning cells through time thanks to a linked view interface. We developed this technique in the context of low-swirl turbulent pre-mixed same simulation analysis, where the topological abstractions enable an efficient tracking through time of the burning cells and provide new qualitative and quantitative insights into the dynamics of the combustion process.","PeriodicalId":325840,"journal":{"name":"2009 Fifth IEEE International Conference on e-Science","volume":"190 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121083813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
期刊
2009 Fifth IEEE International Conference on e-Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1