首页 > 最新文献

2012 IEEE 8th International Conference on E-Science最新文献

英文 中文
Mining hidden mixture context with ADIOS-P to improve predictive pre-fetcher accuracy 利用ADIOS-P挖掘隐藏混合上下文,提高预测预取精度
Pub Date : 2012-10-01 DOI: 10.1109/eScience.2012.6404418
J. Choi, H. Abbasi, D. Pugmire, N. Podhorszki, S. Klasky, Cristian Capdevila, M. Parashar, M. Wolf, J. Qiu, G. Fox
Predictive pre-fetcher, which predicts future data access events and loads the data before users requests, has been widely studied, especially in file systems or web contents servers, to reduce data load latency. Especially in scientific data visualization, pre-fetching can reduce the IO waiting time. In order to increase the accuracy, we apply a data mining technique to extract hidden information. More specifically, we apply a data mining technique for discovering the hidden contexts in data access patterns and make prediction based on the inferred context to boost the accuracy. In particular, we performed Probabilistic Latent Semantic Analysis (PLSA), a mixture model based algorithm popular in the text mining area, to mine hidden contexts from the collected user access patterns and, then, we run a predictor within the discovered context. We further improve PLSA by applying the Deterministic Annealing (DA) method to overcome the local optimum problem. In this paper we demonstrate how we can apply PLSA and DA optimization to mine hidden contexts from users data access patterns and improve predictive pre-fetcher performance.
预测性预取(Predictive pre-fetcher)是一种预测未来的数据访问事件并在用户请求之前加载数据的方法,在文件系统或web内容服务器中得到了广泛的研究,以减少数据加载延迟。特别是在科学数据可视化中,预取可以减少IO等待时间。为了提高准确率,我们采用了数据挖掘技术来提取隐藏信息。更具体地说,我们应用数据挖掘技术来发现数据访问模式中隐藏的上下文,并根据推断的上下文进行预测,以提高预测的准确性。特别是,我们执行了概率潜在语义分析(PLSA),这是一种在文本挖掘领域流行的基于混合模型的算法,从收集的用户访问模式中挖掘隐藏上下文,然后,我们在发现的上下文中运行预测器。我们利用确定性退火(DA)方法克服了局部最优问题,进一步改进了PLSA。在本文中,我们演示了如何应用PLSA和DA优化来挖掘用户数据访问模式中的隐藏上下文,并提高预测预取器的性能。
{"title":"Mining hidden mixture context with ADIOS-P to improve predictive pre-fetcher accuracy","authors":"J. Choi, H. Abbasi, D. Pugmire, N. Podhorszki, S. Klasky, Cristian Capdevila, M. Parashar, M. Wolf, J. Qiu, G. Fox","doi":"10.1109/eScience.2012.6404418","DOIUrl":"https://doi.org/10.1109/eScience.2012.6404418","url":null,"abstract":"Predictive pre-fetcher, which predicts future data access events and loads the data before users requests, has been widely studied, especially in file systems or web contents servers, to reduce data load latency. Especially in scientific data visualization, pre-fetching can reduce the IO waiting time. In order to increase the accuracy, we apply a data mining technique to extract hidden information. More specifically, we apply a data mining technique for discovering the hidden contexts in data access patterns and make prediction based on the inferred context to boost the accuracy. In particular, we performed Probabilistic Latent Semantic Analysis (PLSA), a mixture model based algorithm popular in the text mining area, to mine hidden contexts from the collected user access patterns and, then, we run a predictor within the discovered context. We further improve PLSA by applying the Deterministic Annealing (DA) method to overcome the local optimum problem. In this paper we demonstrate how we can apply PLSA and DA optimization to mine hidden contexts from users data access patterns and improve predictive pre-fetcher performance.","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89001899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Flashes in a star stream: Automated classification of astronomical transient events 星流中的闪光:天文瞬变事件的自动分类
Pub Date : 2012-09-08 DOI: 10.1109/eScience.2012.6404437
S. Djorgovski, A. Mahabal, C. Donalek, M. Graham, A. Drake, B. Moghaddam, M. Turmon
An automated, rapid classification of transient events detected in the modern synoptic sky surveys is essential for their scientific utility and effective follow-up using scarce resources. This presents some unusual challenges: the data are sparse, heterogeneous and incomplete; evolving in time; and most of the relevant information comes not from the data stream itself, but from a variety of archival data and contextual information (spatial, temporal, and multi-wavelength). We are exploring a variety of novel techniques, mostly Bayesian, to respond to these challenges, using the ongoing CRTS sky survey as a testbed. The current surveys are already overwhelming our ability to effectively follow all of the potentially interesting events, and these challenges will grow by orders of magnitude over the next decade as the more ambitious sky surveys get under way. While we focus on an application in a specific domain (astrophysics), these challenges are more broadly relevant for event or anomaly detection and knowledge discovery in massive data streams.
对现代天气巡天中探测到的瞬变事件进行自动、快速分类,对于其科学应用和利用稀缺资源进行有效跟踪至关重要。这就提出了一些不同寻常的挑战:数据稀疏、异构且不完整;演化的:在时间上演化的;而且大多数相关信息不是来自数据流本身,而是来自各种档案数据和上下文信息(空间、时间和多波长)。我们正在探索各种新技术,主要是贝叶斯,以应对这些挑战,使用正在进行的CRTS天空调查作为测试平台。目前的调查已经压倒了我们有效跟踪所有潜在有趣事件的能力,随着更雄心勃勃的天空调查的进行,这些挑战将在未来十年中以数量级增长。当我们专注于特定领域(天体物理学)的应用程序时,这些挑战更广泛地与大规模数据流中的事件或异常检测和知识发现相关。
{"title":"Flashes in a star stream: Automated classification of astronomical transient events","authors":"S. Djorgovski, A. Mahabal, C. Donalek, M. Graham, A. Drake, B. Moghaddam, M. Turmon","doi":"10.1109/eScience.2012.6404437","DOIUrl":"https://doi.org/10.1109/eScience.2012.6404437","url":null,"abstract":"An automated, rapid classification of transient events detected in the modern synoptic sky surveys is essential for their scientific utility and effective follow-up using scarce resources. This presents some unusual challenges: the data are sparse, heterogeneous and incomplete; evolving in time; and most of the relevant information comes not from the data stream itself, but from a variety of archival data and contextual information (spatial, temporal, and multi-wavelength). We are exploring a variety of novel techniques, mostly Bayesian, to respond to these challenges, using the ongoing CRTS sky survey as a testbed. The current surveys are already overwhelming our ability to effectively follow all of the potentially interesting events, and these challenges will grow by orders of magnitude over the next decade as the more ambitious sky surveys get under way. While we focus on an application in a specific domain (astrophysics), these challenges are more broadly relevant for event or anomaly detection and knowledge discovery in massive data streams.","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88801844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
P∗: A model of pilot-abstractions P *:一个有导抽象的模型
Pub Date : 2012-07-27 DOI: 10.1109/eScience.2012.6404423
André Luckow, M. Santcroos, André Merzky, Ole Weidner, P. Mantha, S. Jha
Pilot-Jobs support effective distributed resource utilization, and are arguably one of the most widely-used distributed computing abstractions - as measured by the number and types of applications that use them, as well as the number of production distributed cyberinfrastructures that support them. In spite of broad uptake, there does not exist a well-defined, unifying conceptual model of Pilot-Jobs which can be used to define, compare and contrast different implementations. Often Pilot-Job implementations are strongly coupled to the distributed cyber-infrastructure they were originally designed for. These factors present a barrier to extensibility and interoperability. This paper is an attempt to (i) provide a minimal but complete model (P*) of Pilot-Jobs, (ii) establish the generality of the P* Model by mapping various existing and well known Pilot-Job frameworks such as Condor and DIANE to P*, (iii) derive an interoperable and extensible API for the P* Model (Pilot-API), (iv) validate the implementation of the Pilot-API by concurrently using multiple distinct Pilot-Job frameworks on distinct production distributed cyberinfrastructures, and (v) apply the P* Model to Pilot-Data.
试点任务支持有效的分布式资源利用,并且可以说是最广泛使用的分布式计算抽象之一——通过使用它们的应用程序的数量和类型以及支持它们的生产分布式网络基础设施的数量来衡量。尽管试点工作被广泛采用,但并不存在一个定义良好、统一的试点工作概念模型,该模型可用于定义、比较和对比不同的实施。通常,试点作业实现与它们最初设计的分布式网络基础设施是强耦合的。这些因素对可扩展性和互操作性构成了障碍。本文试图(i)提供一个最小但完整的Pilot-Job模型(P*), (ii)通过将各种现有的和众所周知的Pilot-Job框架(如Condor和DIANE)映射到P*来建立P*模型的通用性,(iii)为P*模型(Pilot-API)派生一个可互操作和可扩展的API, (iv)通过在不同的生产分布式网络基础设施上同时使用多个不同的Pilot-Job框架来验证Pilot-API的实现。(v)将P*模型应用于试点数据。
{"title":"P∗: A model of pilot-abstractions","authors":"André Luckow, M. Santcroos, André Merzky, Ole Weidner, P. Mantha, S. Jha","doi":"10.1109/eScience.2012.6404423","DOIUrl":"https://doi.org/10.1109/eScience.2012.6404423","url":null,"abstract":"Pilot-Jobs support effective distributed resource utilization, and are arguably one of the most widely-used distributed computing abstractions - as measured by the number and types of applications that use them, as well as the number of production distributed cyberinfrastructures that support them. In spite of broad uptake, there does not exist a well-defined, unifying conceptual model of Pilot-Jobs which can be used to define, compare and contrast different implementations. Often Pilot-Job implementations are strongly coupled to the distributed cyber-infrastructure they were originally designed for. These factors present a barrier to extensibility and interoperability. This paper is an attempt to (i) provide a minimal but complete model (P*) of Pilot-Jobs, (ii) establish the generality of the P* Model by mapping various existing and well known Pilot-Job frameworks such as Condor and DIANE to P*, (iii) derive an interoperable and extensible API for the P* Model (Pilot-API), (iv) validate the implementation of the Pilot-API by concurrently using multiple distinct Pilot-Job frameworks on distinct production distributed cyberinfrastructures, and (v) apply the P* Model to Pilot-Data.","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89978938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 56
Reverse Engineering Europe's PSI Re-use Rules -- Towards an Integrated Conceptual Framework for PSI Re-use 逆向工程欧洲的PSI再利用规则——面向PSI再利用的集成概念框架
Pub Date : 2010-12-07 DOI: 10.1109/ESCIENCEW.2010.29
M. D. Vries
Despite various studies evincing the huge potential locked up in public sector information (PSI), this potential is far from being fully exploited. To a large extent, this failure is caused by the immensely complex legal labyrinth surrounding PSI re-use. This complexity works in two ways: public sector bodies do not comply with the regulatory framework and reusers do not avail themselves of the legal instruments offered, resulting in an unexploited economic potential. What makes the legal framework so complex is the transcending nature of PSI re-use, as it blends four areas of law – freedom of information law, ICT law, intellectual property law and competition law – that, throughout the years, have been regulated at a European, national and even sect oral level, but in isolation. The fundamental impact that ICT developments have on our society, subsequently also rocking the legal rules and underlying principles and axioms, makes the picture even more complicated. In this article, these legal frameworks are reverse engineered, demonstrating their interaction, culminating in a conceptual framework that allows public sector bodies and re-users (and courts where necessary) to apply and rely on the rules.
尽管各种研究表明,公共部门信息(PSI)蕴含着巨大的潜力,但这种潜力远未得到充分利用。在很大程度上,这种失败是由围绕PSI再利用的极其复杂的法律迷宫造成的。这种复杂性以两种方式起作用:公共部门机构不遵守管理框架,再使用者不利用所提供的法律文书,导致未开发的经济潜力。使法律框架如此复杂的是PSI再利用的超越性,因为它融合了四个法律领域——信息自由法、信息和通信技术法、知识产权法和竞争法——这些法律多年来一直在欧洲、国家甚至宗派一级进行监管,但都是孤立的。信息和通信技术的发展对我们的社会产生了根本性的影响,随后也动摇了法律规则和基本原则和公理,使情况更加复杂。在本文中,对这些法律框架进行了反向工程,展示了它们之间的相互作用,最终形成了一个概念框架,允许公共部门机构和重用者(以及必要时的法院)应用和依赖这些规则。
{"title":"Reverse Engineering Europe's PSI Re-use Rules -- Towards an Integrated Conceptual Framework for PSI Re-use","authors":"M. D. Vries","doi":"10.1109/ESCIENCEW.2010.29","DOIUrl":"https://doi.org/10.1109/ESCIENCEW.2010.29","url":null,"abstract":"Despite various studies evincing the huge potential locked up in public sector information (PSI), this potential is far from being fully exploited. To a large extent, this failure is caused by the immensely complex legal labyrinth surrounding PSI re-use. This complexity works in two ways: public sector bodies do not comply with the regulatory framework and reusers do not avail themselves of the legal instruments offered, resulting in an unexploited economic potential. What makes the legal framework so complex is the transcending nature of PSI re-use, as it blends four areas of law – freedom of information law, ICT law, intellectual property law and competition law – that, throughout the years, have been regulated at a European, national and even sect oral level, but in isolation. The fundamental impact that ICT developments have on our society, subsequently also rocking the legal rules and underlying principles and axioms, makes the picture even more complicated. In this article, these legal frameworks are reverse engineered, demonstrating their interaction, culminating in a conceptual framework that allows public sector bodies and re-users (and courts where necessary) to apply and rely on the rules.","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2010-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78067932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Educating the Humanities for e-Science 为电子科学教育人文学科
Pub Date : 2006-12-04 DOI: 10.1109/E-SCIENCE.2006.56
S. Strömqvist
The first part of the present paper discusses why the Humanities is lagging behind in terms of making use of e-science and what might be done to remedy that situation. The diversity of ontologies in the Humanities, hampering consensus over metadata, is one problem. Another problem is the lack of education in e-science tailored to the needs of researchers in the Humanities and the lack of efforts to try to integrate elements of e-science with the standard repertoire of research and education in the Humanities. Drawing on experiences from the project European Cultural Heritage Online (ECHO) and from the on-going project Distributed Access Management of Language Resources (DAM-LR), the Centre for Languages and literature at Lund University is trying to implement new elements of e-science at the local Faculty of Humanities. The second part of the paper briefly describes the process as well as some of its added values.
本文的第一部分讨论了人文学科在利用电子科学方面落后的原因,以及如何补救这种情况。人文学科本体的多样性阻碍了对元数据的共识,这是一个问题。另一个问题是缺乏针对人文学科研究人员需求的电子科学教育,以及缺乏将电子科学元素与人文学科研究和教育的标准曲目相结合的努力。根据欧洲文化遗产在线项目(ECHO)和正在进行的语言资源分布式访问管理项目(DAM-LR)的经验,隆德大学语言和文学中心正试图在当地人文学院实施电子科学的新元素。论文的第二部分简要介绍了这一过程以及它的一些附加价值。
{"title":"Educating the Humanities for e-Science","authors":"S. Strömqvist","doi":"10.1109/E-SCIENCE.2006.56","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2006.56","url":null,"abstract":"The first part of the present paper discusses why the Humanities is lagging behind in terms of making use of e-science and what might be done to remedy that situation. The diversity of ontologies in the Humanities, hampering consensus over metadata, is one problem. Another problem is the lack of education in e-science tailored to the needs of researchers in the Humanities and the lack of efforts to try to integrate elements of e-science with the standard repertoire of research and education in the Humanities. Drawing on experiences from the project European Cultural Heritage Online (ECHO) and from the on-going project Distributed Access Management of Language Resources (DAM-LR), the Centre for Languages and literature at Lund University is trying to implement new elements of e-science at the local Faculty of Humanities. The second part of the paper briefly describes the process as well as some of its added values.","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73438186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Grid Environment for Data Integration of Scientific Databases 面向科学数据库数据集成的网格环境
Pub Date : 2005-12-05 DOI: 10.1109/E-SCIENCE.2005.5
H. Matsuda
Effective integration of heterogeneous data sources has been studied as the most pressing challenge in various fields; such as, high energy physics, astronomy, and life sciences. In this talk, we present a data integration system by using Globus Toolkit with OGSA-DAI. For associating related data among many databases, we have introduced metadata based on their domain ontologies. Using the system one can make a database access flow for describing a set of queries as a workflow, and can query across the databases without aware of their locations and schemas
异构数据源的有效集成已成为各领域研究中最紧迫的挑战;比如高能物理、天文学和生命科学。在这个演讲中,我们提出了一个使用Globus Toolkit和OGSA-DAI的数据集成系统。为了在多个数据库之间关联相关数据,我们引入了基于它们的域本体的元数据。使用该系统,可以创建一个数据库访问流,将一组查询描述为工作流,并且可以在不知道数据库位置和模式的情况下跨数据库进行查询
{"title":"A Grid Environment for Data Integration of Scientific Databases","authors":"H. Matsuda","doi":"10.1109/E-SCIENCE.2005.5","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2005.5","url":null,"abstract":"Effective integration of heterogeneous data sources has been studied as the most pressing challenge in various fields; such as, high energy physics, astronomy, and life sciences. In this talk, we present a data integration system by using Globus Toolkit with OGSA-DAI. For associating related data among many databases, we have introduced metadata based on their domain ontologies. Using the system one can make a database access flow for describing a set of queries as a workflow, and can query across the databases without aware of their locations and schemas","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2005-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72894344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Service-Oriented Science: Scaling the Application and Impact of eResearch 面向服务的科学:扩展研究的应用和影响
Pub Date : 2005-12-05 DOI: 10.1109/E-SCIENCE.2005.75
Ian T. Foster
The importance of service-oriented architecture for science is widely recognized. Increasingly, scientific communities are making information tools accessible as services that clients can access over the network, without knowledge of their internal workings. In this way, tools formerly accessible only to the specialist can be made available to all. Equally importantly, new value-added services can be constructed that integrate other services to automate useful tasks. The value of such service-oriented science has been demonstrated in disciplines as diverse as astronomy, biology, and fusion science. The mechanisms required to achieve these goals are provided, in part, by grid infrastructure. I review the mechanisms that have been developed to date for grid infrastructure and experience gained implementing these mechanisms, for example within the open source Globus Toolkit version 4. I present a range of dynamic service deployment scenarios, in which for example the TeraGrid and Open Science Grid are used to host services for science communities. I discuss how these scenarios demonstrate the potential for scaling service-oriented science
面向服务的体系结构对科学的重要性已得到广泛认可。科学界越来越多地将信息工具作为服务提供给客户,使其可以通过网络访问,而无需了解其内部工作原理。通过这种方式,以前只有专家才能使用的工具可以向所有人开放。同样重要的是,可以构建新的增值服务来集成其他服务以自动执行有用的任务。这种以服务为导向的科学的价值已经在天文学、生物学和融合科学等多种学科中得到了证明。实现这些目标所需的机制部分由网格基础设施提供。我回顾了迄今为止为网格基础设施开发的机制以及实现这些机制所获得的经验,例如在开源的Globus Toolkit版本4中。我提出了一系列动态服务部署场景,例如使用TeraGrid和Open Science Grid为科学社区提供服务。我将讨论这些场景如何展示扩展面向服务的科学的潜力
{"title":"Service-Oriented Science: Scaling the Application and Impact of eResearch","authors":"Ian T. Foster","doi":"10.1109/E-SCIENCE.2005.75","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2005.75","url":null,"abstract":"The importance of service-oriented architecture for science is widely recognized. Increasingly, scientific communities are making information tools accessible as services that clients can access over the network, without knowledge of their internal workings. In this way, tools formerly accessible only to the specialist can be made available to all. Equally importantly, new value-added services can be constructed that integrate other services to automate useful tasks. The value of such service-oriented science has been demonstrated in disciplines as diverse as astronomy, biology, and fusion science. The mechanisms required to achieve these goals are provided, in part, by grid infrastructure. I review the mechanisms that have been developed to date for grid infrastructure and experience gained implementing these mechanisms, for example within the open source Globus Toolkit version 4. I present a range of dynamic service deployment scenarios, in which for example the TeraGrid and Open Science Grid are used to host services for science communities. I discuss how these scenarios demonstrate the potential for scaling service-oriented science","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2005-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86490895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Reprocessing D0 Data with SAMGrid 用SAMGrid重新处理D0数据
Pub Date : 2005-12-05 DOI: 10.1109/E-SCIENCE.2005.70
F. Villeneuve-Séguier
The DOslash experiment studies proton-antiproton collisions at the Tevatron collider based at Fermilab. Reprocessing, managing and distributing the large amount of real data coming from the detector as well as generating sufficient Monte Carlo data are some of the challenges faced by the DOslash collaboration. SAMGrid combines the SAM data handling system with the necessary job and information management allowing us to use the distributed computing resources in the various worldwide computing centers. This is one of the first large scale grid applications in high energy physics (in particular as we are using real data). After successful Monte Carlo production and a limited data reprocessing in the winter of 2003/04, the next milestone will be the reprocessing of the full current data set by this autumn/winter. It consists of ~500 TB of data, encompassing one billion events
DOslash实验在费米实验室的Tevatron对撞机上研究质子-反质子碰撞。重新处理、管理和分发来自探测器的大量真实数据以及生成足够的蒙特卡罗数据是DOslash合作面临的一些挑战。SAMGrid将SAM数据处理系统与必要的作业和信息管理相结合,使我们能够在各种全球计算中心中使用分布式计算资源。这是在高能物理领域的第一个大规模网格应用之一(特别是当我们使用真实数据时)。在2003/04年冬季成功的蒙特卡罗生产和有限的数据再处理之后,下一个里程碑将是在今年秋冬之前对全部当前数据集进行再处理。它包含约500tb的数据,包含10亿个事件
{"title":"Reprocessing D0 Data with SAMGrid","authors":"F. Villeneuve-Séguier","doi":"10.1109/E-SCIENCE.2005.70","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2005.70","url":null,"abstract":"The DOslash experiment studies proton-antiproton collisions at the Tevatron collider based at Fermilab. Reprocessing, managing and distributing the large amount of real data coming from the detector as well as generating sufficient Monte Carlo data are some of the challenges faced by the DOslash collaboration. SAMGrid combines the SAM data handling system with the necessary job and information management allowing us to use the distributed computing resources in the various worldwide computing centers. This is one of the first large scale grid applications in high energy physics (in particular as we are using real data). After successful Monte Carlo production and a limited data reprocessing in the winter of 2003/04, the next milestone will be the reprocessing of the full current data set by this autumn/winter. It consists of ~500 TB of data, encompassing one billion events","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2005-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86820111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Experiences with GRIA - Industrial Applications on a Web Services Grid 使用GRIA的经验——Web服务网格上的工业应用程序
Pub Date : 2005-12-05 DOI: 10.1109/E-SCIENCE.2005.38
M. Surridge, Steve Taylor, D. D. Roure, E. Zaluska
The GRIA project set out to make the grid usable by industry. The GRIA middleware is based on Web services, and designed to meet the needs of industry for security and business-to-business (B2B) service procurement and operation. It provides well-defined B2B models for accounting and QoS agreement, and proxy-free delegation to support account management and service federation. The GRIA v3 software is now being used by industry. By taking a business-oriented approach independent of the evolving Open Grid Services Architecture proposals from the Global Grid Forum, GRIA has demonstrated the need for a wider understanding of virtual organizations (VOs). Traditional academic VOs are persistent, resourceful and have logically centralized, membership-oriented management structures. In contrast, the GRIA experience has been that business VOs are likely to be project-focused and have distributed process-oriented management structures
GRIA项目旨在使电网为工业所用。GRIA中间件基于Web服务,旨在满足行业对安全性和企业对企业(B2B)服务采购和操作的需求。它为记帐和QoS协议提供了良好定义的B2B模型,并为支持帐户管理和服务联合提供了无代理委托。GRIA v3软件现在正在工业中使用。通过采用面向业务的方法,独立于全球网格论坛上不断发展的开放网格服务体系结构提案,GRIA证明了对虚拟组织(VOs)有更广泛理解的必要性。传统的学术vo是持久的、资源丰富的,并且具有逻辑上集中的、面向成员的管理结构。相比之下,GRIA的经验是,业务VOs很可能是以项目为中心的,并且具有分布式的面向过程的管理结构
{"title":"Experiences with GRIA - Industrial Applications on a Web Services Grid","authors":"M. Surridge, Steve Taylor, D. D. Roure, E. Zaluska","doi":"10.1109/E-SCIENCE.2005.38","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2005.38","url":null,"abstract":"The GRIA project set out to make the grid usable by industry. The GRIA middleware is based on Web services, and designed to meet the needs of industry for security and business-to-business (B2B) service procurement and operation. It provides well-defined B2B models for accounting and QoS agreement, and proxy-free delegation to support account management and service federation. The GRIA v3 software is now being used by industry. By taking a business-oriented approach independent of the evolving Open Grid Services Architecture proposals from the Global Grid Forum, GRIA has demonstrated the need for a wider understanding of virtual organizations (VOs). Traditional academic VOs are persistent, resourceful and have logically centralized, membership-oriented management structures. In contrast, the GRIA experience has been that business VOs are likely to be project-focused and have distributed process-oriented management structures","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2005-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73098054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 85
Putting Semantics into e-Science and Grids 将语义学应用于电子科学与网格
Pub Date : 2005-12-05 DOI: 10.1109/E-SCIENCE.2005.68
C. Goble
What is the semantic grid? How can e-Science benefit from the technologies of the semantic grid? Can we build a semantic Web for e-Science? Would that differ from a semantic grid? Given our past experiences with scientists, grid developers and semantic Web researchers, what are the prospects, and pitfalls, of putting semantics into e-Science applications and grid infrastructure?
什么是语义网格?e-Science如何从语义网格技术中获益?我们能为电子科学建立一个语义网吗?这和语义网格有什么不同吗?鉴于我们过去与科学家、网格开发人员和语义Web研究人员的经验,将语义放入电子科学应用程序和网格基础设施的前景和陷阱是什么?
{"title":"Putting Semantics into e-Science and Grids","authors":"C. Goble","doi":"10.1109/E-SCIENCE.2005.68","DOIUrl":"https://doi.org/10.1109/E-SCIENCE.2005.68","url":null,"abstract":"What is the semantic grid? How can e-Science benefit from the technologies of the semantic grid? Can we build a semantic Web for e-Science? Would that differ from a semantic grid? Given our past experiences with scientists, grid developers and semantic Web researchers, what are the prospects, and pitfalls, of putting semantics into e-Science applications and grid infrastructure?","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2005-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90304424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
期刊
2012 IEEE 8th International Conference on E-Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1