首页 > 最新文献

2013 IEEE 9th International Conference on e-Science最新文献

英文 中文
Biomedical Research Data Cloud Services with Duckling Collaboration LiBrary (CLB) 基于Duckling协作库(CLB)的生物医学研究数据云服务
Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.17
Kejun Dong, Ji Li, Kai Nan, Wilfred W. Li
Rapid advances in scientific research have led to unprecedented data deluge and significant challenges in data interoperability, certification and collaboration. The Collaboration LiBrary (CLB) is designed to manage and collate millions of data files by setting up a unified, robust, and scalable data repository, especially in support of experimental data collaboration and timeline-based data life cycle management. It has recently been released as a component of Duckling, an open-source collaboration environment toolkit developed by the Chinese Academy of Sciences (CAS) and widely adopted in many disciplines. In this paper, we present newly developed components for data synchronization and snapshots in an updated architecture for CLB. We have also extended CLB with new data cloud service modules (CLB+) that enables data mapping and synchronization from the cloud to user workspace. CLB+ is implemented as CLB plugins that provide interfaces with biomedical research cloud services from a computer aided drug discovery (CADD) workflow for ensemble-based virtual screening. The flexible plug in architecture of CLB makes it easy to develop a prototype biomedical research data cloud environment. Many other e-science applications may leverage or expand CLB functionalities in data life cycle management in a similar fashion.
科学研究的快速发展导致了前所未有的数据泛滥,并在数据互操作性、认证和协作方面带来了重大挑战。协作库(CLB)旨在通过建立统一、健壮和可扩展的数据存储库来管理和整理数百万个数据文件,特别是支持实验性数据协作和基于时间轴的数据生命周期管理。它最近作为Duckling的一个组件发布,Duckling是一个由中国科学院(CAS)开发的开源协作环境工具包,在许多学科中被广泛采用。在本文中,我们介绍了在更新的CLB体系结构中用于数据同步和快照的新开发组件。我们还使用新的数据云服务模块(CLB+)扩展了CLB,这些模块支持从云到用户工作空间的数据映射和同步。CLB+是作为CLB插件实现的,CLB插件提供来自计算机辅助药物发现(CADD)工作流的生物医学研究云服务接口,用于基于集成的虚拟筛选。CLB灵活的插入式架构使生物医学研究数据云环境原型的开发变得非常容易。许多其他电子科学应用程序可能以类似的方式利用或扩展数据生命周期管理中的CLB功能。
{"title":"Biomedical Research Data Cloud Services with Duckling Collaboration LiBrary (CLB)","authors":"Kejun Dong, Ji Li, Kai Nan, Wilfred W. Li","doi":"10.1109/eScience.2013.17","DOIUrl":"https://doi.org/10.1109/eScience.2013.17","url":null,"abstract":"Rapid advances in scientific research have led to unprecedented data deluge and significant challenges in data interoperability, certification and collaboration. The Collaboration LiBrary (CLB) is designed to manage and collate millions of data files by setting up a unified, robust, and scalable data repository, especially in support of experimental data collaboration and timeline-based data life cycle management. It has recently been released as a component of Duckling, an open-source collaboration environment toolkit developed by the Chinese Academy of Sciences (CAS) and widely adopted in many disciplines. In this paper, we present newly developed components for data synchronization and snapshots in an updated architecture for CLB. We have also extended CLB with new data cloud service modules (CLB+) that enables data mapping and synchronization from the cloud to user workspace. CLB+ is implemented as CLB plugins that provide interfaces with biomedical research cloud services from a computer aided drug discovery (CADD) workflow for ensemble-based virtual screening. The flexible plug in architecture of CLB makes it easy to develop a prototype biomedical research data cloud environment. Many other e-science applications may leverage or expand CLB functionalities in data life cycle management in a similar fashion.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128222321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Policy Derived Access Rights in the Social Cloud 社交云中的策略派生访问权限
Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.27
Ferry Hendrikx, K. Bubendorfer
Social clouds are a relatively new paradigm that allow users of an underlying social network to share their resources with their "friends", using previously established relationships. However, this sharing has a number of issues, including granularity of friendships, resource costs and maintenance. In this paper we argue that sharing decisions should be based on relationship information augmented by supplementary metadata derived from multiple sources. Users should be able to leverage the information available on their non-uniform friend relationships when making decisions, allowing them to confidently share their resources with those that would normally be outside of their immediate social circle. We introduce Graft, our Generalised Recommendation Architecture, that provides us with a mechanism to support this new approach.
社交云是一种相对较新的范例,它允许底层社交网络的用户使用先前建立的关系与他们的“朋友”共享资源。然而,这种共享存在许多问题,包括友谊的粒度、资源成本和维护。在本文中,我们认为共享决策应该基于由来自多个来源的补充元数据增强的关系信息。用户应该能够在做决定时利用他们的非统一朋友关系提供的信息,允许他们自信地与那些通常不在他们直接社交圈之外的人分享他们的资源。我们引入了Graft,我们的通用推荐架构,它为我们提供了一种支持这种新方法的机制。
{"title":"Policy Derived Access Rights in the Social Cloud","authors":"Ferry Hendrikx, K. Bubendorfer","doi":"10.1109/eScience.2013.27","DOIUrl":"https://doi.org/10.1109/eScience.2013.27","url":null,"abstract":"Social clouds are a relatively new paradigm that allow users of an underlying social network to share their resources with their \"friends\", using previously established relationships. However, this sharing has a number of issues, including granularity of friendships, resource costs and maintenance. In this paper we argue that sharing decisions should be based on relationship information augmented by supplementary metadata derived from multiple sources. Users should be able to leverage the information available on their non-uniform friend relationships when making decisions, allowing them to confidently share their resources with those that would normally be outside of their immediate social circle. We introduce Graft, our Generalised Recommendation Architecture, that provides us with a mechanism to support this new approach.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115142134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
ADIOS Visualization Schema: A First Step Towards Improving Interdisciplinary Collaboration in High Performance Computing ADIOS可视化架构:迈向高性能计算领域跨学科协作的第一步
Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.24
Roselyne B. Tchoua, J. Choi, S. Klasky, Qing Liu, Jeremy S. Logan, K. Moreland, Jingqing Mu, M. Parashar, N. Podhorszki, D. Pugmire, M. Wolf
Scientific communities have benefitted from a significant increase of available computing and storage resources in the last few decades. For science projects that have access to leadership scale computing resources, the capacity to produce data has been growing exponentially. Teams working on such projects must now include, in addition to the traditional application scientists, experts in various disciplines including applied mathematicians for development of algorithms, visualization specialists for large data, and I/O specialists. Sharing of knowledge and data is becoming a requirement for scientific discovery, providing useful mechanisms to facilitate this sharing is a key challenge for e-Science. Our hypothesis is that in order to decrease the time to solution for application scientists we need to lower the barrier of entry into related computing fields. We aim at improving users' experience when interacting with a vast software ecosystem and/or huge amount of data, while maintaining focus on their primary research field. In this context we present our approach to bridge the gap between the application scientists and the visualization experts through a visualization schema as a first step and proof of concept for a new way to look at interdisciplinary collaboration among scientists dealing with big data. The key to our approach is recognizing that our users are scientists who mostly work as islands. They tend to work in very specialized environment but occasionally have to collaborate with other researchers in order to take full advantage of computing innovations and get insight from big data. We present an example of identifying the connecting elements between one of such relationships and offer a liaison schema to facilitate their collaboration.
在过去的几十年里,科学界从可用的计算和存储资源的显著增加中受益。对于那些能够获得领导级计算资源的科学项目来说,产生数据的能力已经呈指数级增长。从事此类项目的团队现在除了传统的应用科学家之外,还必须包括各种学科的专家,包括开发算法的应用数学家、大数据的可视化专家和I/O专家。知识和数据的共享正在成为科学发现的一项要求,提供有用的机制来促进这种共享是电子科学的一项关键挑战。我们的假设是,为了减少应用科学家解决问题的时间,我们需要降低进入相关计算领域的门槛。我们的目标是改善用户在与庞大的软件生态系统和/或大量数据交互时的体验,同时保持对他们主要研究领域的关注。在这种背景下,我们提出了我们的方法,通过可视化模式来弥合应用科学家和可视化专家之间的差距,作为第一步和概念的证明,以一种新的方式来看待处理大数据的科学家之间的跨学科合作。我们的方法的关键是认识到我们的用户是科学家,他们大多像孤岛一样工作。他们往往在非常专业的环境中工作,但偶尔必须与其他研究人员合作,以便充分利用计算创新并从大数据中获得洞察力。我们提供了一个例子来确定其中一个关系之间的连接元素,并提供了一个连接模式来促进它们之间的协作。
{"title":"ADIOS Visualization Schema: A First Step Towards Improving Interdisciplinary Collaboration in High Performance Computing","authors":"Roselyne B. Tchoua, J. Choi, S. Klasky, Qing Liu, Jeremy S. Logan, K. Moreland, Jingqing Mu, M. Parashar, N. Podhorszki, D. Pugmire, M. Wolf","doi":"10.1109/eScience.2013.24","DOIUrl":"https://doi.org/10.1109/eScience.2013.24","url":null,"abstract":"Scientific communities have benefitted from a significant increase of available computing and storage resources in the last few decades. For science projects that have access to leadership scale computing resources, the capacity to produce data has been growing exponentially. Teams working on such projects must now include, in addition to the traditional application scientists, experts in various disciplines including applied mathematicians for development of algorithms, visualization specialists for large data, and I/O specialists. Sharing of knowledge and data is becoming a requirement for scientific discovery, providing useful mechanisms to facilitate this sharing is a key challenge for e-Science. Our hypothesis is that in order to decrease the time to solution for application scientists we need to lower the barrier of entry into related computing fields. We aim at improving users' experience when interacting with a vast software ecosystem and/or huge amount of data, while maintaining focus on their primary research field. In this context we present our approach to bridge the gap between the application scientists and the visualization experts through a visualization schema as a first step and proof of concept for a new way to look at interdisciplinary collaboration among scientists dealing with big data. The key to our approach is recognizing that our users are scientists who mostly work as islands. They tend to work in very specialized environment but occasionally have to collaborate with other researchers in order to take full advantage of computing innovations and get insight from big data. We present an example of identifying the connecting elements between one of such relationships and offer a liaison schema to facilitate their collaboration.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129311341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Continuous Dataflow Update Strategies for Mission-Critical Applications 关键任务应用程序的持续数据流更新策略
Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.35
Charith Wickramaarachchi, Yogesh L. Simmhan
Continuous data flows complement scientific work-flows by allowing composition of real time data ingest and analytics pipelines to process data streams from pervasive sensors and "always-on" scientific instruments. Such data flows are mission-critical applications that cannot suffer downtime, need to operate consistently, and are long running, but may need to be updated to fix bugs or add features. This poses the problem: How do we update the continuous dataflow application with minimal disruption? In this paper, we formalize different types of dataflow update models for continuous dataflow applications, and identify the qualitative and quantitative metrics to be considered when choosing an update strategy. We propose five dataflow update strategies, and analytically characterize their performance trade-offs. We validate one of these consistent, low-latency update strategies using the Floe dataflow engine for an eEngineering application from the Smart Power Grid domain, and show its relative performance benefits against a naïve update strategy.
连续数据流通过允许实时数据摄取和分析管道的组合来处理来自无处不在的传感器和“永远在线”的科学仪器的数据流,从而补充了科学工作流程。这些数据流是任务关键型应用程序,它们不能停机,需要一致地操作,并且长时间运行,但可能需要更新以修复错误或添加功能。这就提出了一个问题:我们如何在最小的中断下更新连续数据流应用程序?在本文中,我们为连续数据流应用程序形式化了不同类型的数据流更新模型,并确定了在选择更新策略时要考虑的定性和定量指标。我们提出了五种数据流更新策略,并分析表征了它们的性能权衡。我们使用来自智能电网领域的eEngineering应用程序的Floe数据流引擎验证这些一致的低延迟更新策略之一,并显示其相对于naïve更新策略的相对性能优势。
{"title":"Continuous Dataflow Update Strategies for Mission-Critical Applications","authors":"Charith Wickramaarachchi, Yogesh L. Simmhan","doi":"10.1109/eScience.2013.35","DOIUrl":"https://doi.org/10.1109/eScience.2013.35","url":null,"abstract":"Continuous data flows complement scientific work-flows by allowing composition of real time data ingest and analytics pipelines to process data streams from pervasive sensors and \"always-on\" scientific instruments. Such data flows are mission-critical applications that cannot suffer downtime, need to operate consistently, and are long running, but may need to be updated to fix bugs or add features. This poses the problem: How do we update the continuous dataflow application with minimal disruption? In this paper, we formalize different types of dataflow update models for continuous dataflow applications, and identify the qualitative and quantitative metrics to be considered when choosing an update strategy. We propose five dataflow update strategies, and analytically characterize their performance trade-offs. We validate one of these consistent, low-latency update strategies using the Floe dataflow engine for an eEngineering application from the Smart Power Grid domain, and show its relative performance benefits against a naïve update strategy.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129694553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A Computational- and Storage-Cloud for Integration of Biodiversity Collections 生物多样性数据集集成的计算与存储云
Pub Date : 2013-10-22 DOI: 10.1109/ESCIENCE.2013.48
Andréa M. Matsunaga, A. Thompson, R. Figueiredo, Charlotte C. Germain-Aubrey, Matthew Collins, R. Beaman, B. MacFadden, G. Riccardi, P. Soltis, L. Page, J. Fortes
A core mission of the Integrated Digitized Biocollections (iDigBio) project is the building and deployment of a cloud computing environment customized to support the digitization workflow and integration of data from all U.S. non-federal biocollections. iDigBio chose to use cloud computing technologies to deliver a cyber infrastructure that is flexible, agile, resilient, and scalable to meet the needs of the biodiversity community. In this context, this paper describes the integration of open source cloud middleware, applications, and third party services using standard formats, protocols, and services. In addition, this paper demonstrates the value of the digitized information from collections in a broader scenario involving multiple disciplines.
集成数字化生物收集(iDigBio)项目的核心任务是构建和部署一个定制的云计算环境,以支持所有美国非联邦生物收集的数字化工作流程和数据集成。iDigBio选择使用云计算技术来提供灵活、敏捷、有弹性和可扩展的网络基础设施,以满足生物多样性社区的需求。在这种情况下,本文描述了使用标准格式、协议和服务的开源云中间件、应用程序和第三方服务的集成。此外,本文还论证了馆藏数字化信息在涉及多学科的更广泛场景中的价值。
{"title":"A Computational- and Storage-Cloud for Integration of Biodiversity Collections","authors":"Andréa M. Matsunaga, A. Thompson, R. Figueiredo, Charlotte C. Germain-Aubrey, Matthew Collins, R. Beaman, B. MacFadden, G. Riccardi, P. Soltis, L. Page, J. Fortes","doi":"10.1109/ESCIENCE.2013.48","DOIUrl":"https://doi.org/10.1109/ESCIENCE.2013.48","url":null,"abstract":"A core mission of the Integrated Digitized Biocollections (iDigBio) project is the building and deployment of a cloud computing environment customized to support the digitization workflow and integration of data from all U.S. non-federal biocollections. iDigBio chose to use cloud computing technologies to deliver a cyber infrastructure that is flexible, agile, resilient, and scalable to meet the needs of the biodiversity community. In this context, this paper describes the integration of open source cloud middleware, applications, and third party services using standard formats, protocols, and services. In addition, this paper demonstrates the value of the digitized information from collections in a broader scenario involving multiple disciplines.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"112 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122664977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Lens: A Faceted Browser for Research Networking Platforms Lens:面向研究网络平台的多面浏览器
Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.54
Richard Whaling, T. Malik, Ian T Foster
Research networking platforms, such as VIVO and Profiles Networking provide an information infrastructure for scholarship, representing information about research and researchers-their scholarly works, research interests, and organizational relationships. These platforms are open information infrastructures for scholarship, consisting of linked open data and open-source software tools for managing and visualizing scholarly information. Being RDF based, faceted browsing is a natural technique for navigating such data, partitioning the scholarly information space into orthogonal conceptual dimensions. However, this technique has so far been explored through limited queries in research networking platforms-not allowing for instance full graph based navigation on RDF data. In this paper we present Lens a client-side user interface for faceted navigation of scholarly RDF data. Lens is based on Exhibit, which is a lightweight structured data-publishing framework, but extends Exhibit for expressive SPARQL-like queries and scales it up for navigating amounts of RDF data. Lens consumes data in VIVO ontology, the de facto schema for researcher networking systems. We show how Lens provides better usability over current faceted browsers for research networking platforms.
研究网络平台,如VIVO和Profiles networking,为学术研究提供了一个信息基础设施,代表了研究和研究人员的信息——他们的学术工作、研究兴趣和组织关系。这些平台是面向学术的开放信息基础设施,由链接的开放数据和开源软件工具组成,用于管理和可视化学术信息。由于基于RDF,分面浏览是导航此类数据的自然技术,它将学术信息空间划分为正交的概念维。然而,到目前为止,这项技术只是通过研究网络平台中的有限查询进行了探索——例如,不允许在RDF数据上进行基于全图的导航。在本文中,我们为Lens提供了一个客户端用户界面,用于学术RDF数据的分面导航。Lens是基于Exhibit的,Exhibit是一个轻量级的结构化数据发布框架,但是它扩展了Exhibit,用于表达类似sparql的查询,并扩展了Exhibit,用于导航大量RDF数据。Lens在VIVO本体中消耗数据,VIVO本体是研究人员网络系统的事实上的模式。我们展示了Lens如何为研究网络平台提供比当前的多面浏览器更好的可用性。
{"title":"Lens: A Faceted Browser for Research Networking Platforms","authors":"Richard Whaling, T. Malik, Ian T Foster","doi":"10.1109/eScience.2013.54","DOIUrl":"https://doi.org/10.1109/eScience.2013.54","url":null,"abstract":"Research networking platforms, such as VIVO and Profiles Networking provide an information infrastructure for scholarship, representing information about research and researchers-their scholarly works, research interests, and organizational relationships. These platforms are open information infrastructures for scholarship, consisting of linked open data and open-source software tools for managing and visualizing scholarly information. Being RDF based, faceted browsing is a natural technique for navigating such data, partitioning the scholarly information space into orthogonal conceptual dimensions. However, this technique has so far been explored through limited queries in research networking platforms-not allowing for instance full graph based navigation on RDF data. In this paper we present Lens a client-side user interface for faceted navigation of scholarly RDF data. Lens is based on Exhibit, which is a lightweight structured data-publishing framework, but extends Exhibit for expressive SPARQL-like queries and scales it up for navigating amounts of RDF data. Lens consumes data in VIVO ontology, the de facto schema for researcher networking systems. We show how Lens provides better usability over current faceted browsers for research networking platforms.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"704 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116963480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
DSN: A Knowledge-Based Scholar Networking Practice Towards Research Community DSN:面向研究社区的基于知识的学者网络实践
Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.20
Juan Zhao, Kejun Dong, Jianjun Yu, Wei Hu
In this paper, we carry out a knowledge-based scholar network practice towards Research community, named Research Social Networking, shortly DSN, by setting up a large knowledge base of scientists. We discuss key technologies in the paper, including scholar disambiguation and relationship extraction with the better performance evaluation than traditional methods. The DSN system has been implemented and integrated with Duckling cloud service, known as Research Online, with more than 60 thousand scientists and 100 thousand papers.
在本文中,我们通过建立一个庞大的科学家知识库,对科研社区进行了基于知识的学者网络实践,称为科研社交网络(Research Social Networking,简称DSN)。本文讨论了学者消歧和关系提取等关键技术,并对其进行了性能评价。DSN系统已实施并与小鸭云服务集成,称为“研究在线”,拥有6万多名科学家,10万多篇论文。
{"title":"DSN: A Knowledge-Based Scholar Networking Practice Towards Research Community","authors":"Juan Zhao, Kejun Dong, Jianjun Yu, Wei Hu","doi":"10.1109/eScience.2013.20","DOIUrl":"https://doi.org/10.1109/eScience.2013.20","url":null,"abstract":"In this paper, we carry out a knowledge-based scholar network practice towards Research community, named Research Social Networking, shortly DSN, by setting up a large knowledge base of scientists. We discuss key technologies in the paper, including scholar disambiguation and relationship extraction with the better performance evaluation than traditional methods. The DSN system has been implemented and integrated with Duckling cloud service, known as Research Online, with more than 60 thousand scientists and 100 thousand papers.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"03 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124473020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Beyond Scientific Workflows: Networked Open Processes 超越科学工作流程:网络化开放流程
Pub Date : 2013-10-22 DOI: 10.1109/ESCIENCE.2013.51
R. Cushing, M. Bubak, A. Belloum, C. D. Laat
The multitude of scientific services and processes being developed brings about challenges for future in silico distributed experiments. Choosing the correct service from an expanding body of processes means that the the task of manually building workflows is becoming untenable. In this paper we propose a framework to tackle the future of scientific collaborative distributed computing. We introduce the notion of Networked Open Processes whereby processes are exposed, published, and linked using semantics in the same way as is done with Linked Open Data. As part of the framework we introduce several novel concepts including Process Object Identifiers, Semantic Function Templates, and TReQL, a SQL-like language for querying networked open process graphs.
正在开发的众多科学服务和流程为未来的计算机分布式实验带来了挑战。从不断扩展的流程体中选择正确的服务意味着手动构建工作流的任务变得站不住脚。在本文中,我们提出了一个框架来解决科学协作分布式计算的未来。我们引入了网络开放流程的概念,在这个概念中,流程使用与关联开放数据相同的语义进行公开、发布和链接。作为框架的一部分,我们介绍了几个新概念,包括进程对象标识符、语义函数模板和TReQL, TReQL是一种类似sql的语言,用于查询联网的开放流程图。
{"title":"Beyond Scientific Workflows: Networked Open Processes","authors":"R. Cushing, M. Bubak, A. Belloum, C. D. Laat","doi":"10.1109/ESCIENCE.2013.51","DOIUrl":"https://doi.org/10.1109/ESCIENCE.2013.51","url":null,"abstract":"The multitude of scientific services and processes being developed brings about challenges for future in silico distributed experiments. Choosing the correct service from an expanding body of processes means that the the task of manually building workflows is becoming untenable. In this paper we propose a framework to tackle the future of scientific collaborative distributed computing. We introduce the notion of Networked Open Processes whereby processes are exposed, published, and linked using semantics in the same way as is done with Linked Open Data. As part of the framework we introduce several novel concepts including Process Object Identifiers, Semantic Function Templates, and TReQL, a SQL-like language for querying networked open process graphs.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128689605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Efficient SDS Simulations on Multi-GPU Nodes of XSEDE High-End Clusters 基于XSEDE高端集群多gpu节点的高效SDS仿真
Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.22
S. Schlachter, Stephen Herbein, M. Taufer, S. Ou, Sandeep Patel, Jeremy S. Logan
Efficiently studying Sodium Dodecyl Sulfate (SDS) molecules' formations in the presence of different molar concentrations on high-end GPU clusters whose nodes share accelerators exposes us to several challenges, including the need to dynamically adapt the job lengths. Neither virtualization nor lightweight OS solutions can easily support generality, portability, and maintainability in concert. Our solution complements rather than rewrites existing workflow and resource managers with a companion module that complements functions of the workflow manager and a wrapper module that extends functions of the resource managers. Results on the Keene land cluster show how, by using our modules, accelerated SDS simulations more efficiently use the cluster's GPUs while leading to relevant scientific observations.
在节点共享加速器的高端GPU集群上,高效地研究十二烷基硫酸钠(SDS)分子在不同摩尔浓度下的形成,给我们带来了几个挑战,包括需要动态调整工作长度。虚拟化和轻量级操作系统解决方案都无法轻松地同时支持通用性、可移植性和可维护性。我们的解决方案是对现有工作流和资源管理器的补充,而不是重写,它有一个配套模块来补充工作流管理器的功能,还有一个包装器模块来扩展资源管理器的功能。Keene陆地集群上的结果表明,通过使用我们的模块,加速SDS模拟如何更有效地利用集群的gpu,同时导致相关的科学观测。
{"title":"Efficient SDS Simulations on Multi-GPU Nodes of XSEDE High-End Clusters","authors":"S. Schlachter, Stephen Herbein, M. Taufer, S. Ou, Sandeep Patel, Jeremy S. Logan","doi":"10.1109/eScience.2013.22","DOIUrl":"https://doi.org/10.1109/eScience.2013.22","url":null,"abstract":"Efficiently studying Sodium Dodecyl Sulfate (SDS) molecules' formations in the presence of different molar concentrations on high-end GPU clusters whose nodes share accelerators exposes us to several challenges, including the need to dynamically adapt the job lengths. Neither virtualization nor lightweight OS solutions can easily support generality, portability, and maintainability in concert. Our solution complements rather than rewrites existing workflow and resource managers with a companion module that complements functions of the workflow manager and a wrapper module that extends functions of the resource managers. Results on the Keene land cluster show how, by using our modules, accelerated SDS simulations more efficiently use the cluster's GPUs while leading to relevant scientific observations.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122567097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visual Rhythm-Based Method for Continuous Plankton Monitoring 基于视觉节奏的浮游生物连续监测方法
Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.18
Damian J. Matuszewski, R. Lopes, R. M. C. Junior
Plankton microorganisms constitute the base of the marine food web and play a great role in global atmospheric carbon dioxide draw down. Moreover, being very sensitive to any environmental changes they allow noticing (and potentially counteracting) them faster than with any other means. As such they not only influence the fishery industry but are also frequently used to analyze changes in exploited coastal areas and the influence of these interferences on local environment and climate. As a consequence, there is a strong need for highly efficient systems allowing long time and large volume observation of plankton communities. The adopted sensors typically provide huge amounts of data that must be processed efficiently. This would provide us with better understanding of their role on global climate as well as help maintain the fragile environmental equilibrium. In this paper a new system for large volume plankton monitoring system is presented. It is based on visual analysis of small particles immersed in a water flux. The image sequences are analyzed with Visual Rhythm-based method which greatly accelerates the processing time and allows higher volume throughput. To assure maximal performance the algorithm was implemented using CUDA for GPGPU. The method was then tested on a large data set and compared with alternative frame-by-frame approach. The results prove that the method can be successfully applied for the large volume plankton monitoring problem, as well as in any other application where targets are to be detected and counted while moving in a unidirectional flux.
浮游微生物是海洋食物网的基础,在全球大气二氧化碳吸收中起着重要作用。此外,它们对任何环境变化都非常敏感,因此可以比任何其他手段更快地注意到(并可能抵消)它们。因此,它们不仅影响渔业,而且经常用于分析已开发沿海地区的变化以及这些干扰对当地环境和气候的影响。因此,迫切需要能够长时间、大规模地观察浮游生物群落的高效系统。所采用的传感器通常提供大量数据,必须进行有效处理。这将使我们更好地了解它们在全球气候中的作用,并有助于维持脆弱的环境平衡。本文介绍了一种新的大体积浮游生物监测系统。它是基于浸入水通量的小颗粒的视觉分析。采用基于视觉节奏的方法对图像序列进行分析,大大加快了处理时间,提高了批量吞吐量。为了保证最大的性能,算法在CUDA的GPGPU上实现。然后在大型数据集上对该方法进行了测试,并与其他逐帧方法进行了比较。结果证明,该方法可以成功地应用于大体积浮游生物监测问题,以及任何其他需要在单向通量中移动的目标检测和计数的应用。
{"title":"Visual Rhythm-Based Method for Continuous Plankton Monitoring","authors":"Damian J. Matuszewski, R. Lopes, R. M. C. Junior","doi":"10.1109/eScience.2013.18","DOIUrl":"https://doi.org/10.1109/eScience.2013.18","url":null,"abstract":"Plankton microorganisms constitute the base of the marine food web and play a great role in global atmospheric carbon dioxide draw down. Moreover, being very sensitive to any environmental changes they allow noticing (and potentially counteracting) them faster than with any other means. As such they not only influence the fishery industry but are also frequently used to analyze changes in exploited coastal areas and the influence of these interferences on local environment and climate. As a consequence, there is a strong need for highly efficient systems allowing long time and large volume observation of plankton communities. The adopted sensors typically provide huge amounts of data that must be processed efficiently. This would provide us with better understanding of their role on global climate as well as help maintain the fragile environmental equilibrium. In this paper a new system for large volume plankton monitoring system is presented. It is based on visual analysis of small particles immersed in a water flux. The image sequences are analyzed with Visual Rhythm-based method which greatly accelerates the processing time and allows higher volume throughput. To assure maximal performance the algorithm was implemented using CUDA for GPGPU. The method was then tested on a large data set and compared with alternative frame-by-frame approach. The results prove that the method can be successfully applied for the large volume plankton monitoring problem, as well as in any other application where targets are to be detected and counted while moving in a unidirectional flux.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126289682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2013 IEEE 9th International Conference on e-Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1