2019 15th International Conference on eScience (eScience)最新文献

英文中文

Workflow Automation in Liquid Chromatography Mass Spectrometry 液相色谱-质谱分析中的工作流程自动化

2019 15th International Conference on eScience (eScience)

Pub Date : 2019-09-01 DOI: 10.1109/eScience.2019.00095

R. Gentz, H. Martín, Edward Baidoo, S. Peisert

We describe the fully automated workflow path developed for the ingest and analysis of liquid chromatography mass spectrometry (LCMS) data. With the help of this computational workflow, we were able to replace two human work days to analyze data with two hours of unsupervised computation time. In addition, this tool also can compute confidence intervals for all its results, based on the noise level present in the data. We leverage only open source tools and libraries in this workflow.

我们描述了为液相色谱质谱(LCMS)数据的摄取和分析开发的全自动工作流程路径。在这个计算工作流程的帮助下，我们能够用两个小时的无监督计算时间取代两个工作日来分析数据。此外，该工具还可以根据数据中存在的噪声水平计算其所有结果的置信区间。我们在这个工作流中只利用开源工具和库。

引用次数: 1

Teaching DevOps and Cloud Based Software Engineering in University Curricula 大学课程中DevOps和基于云的软件工程教学

2019 15th International Conference on eScience (eScience)

Pub Date : 2019-09-01 DOI: 10.1109/eScience.2019.00075

Y. Demchenko, Zhiming Zhao, Jayachander Surbiryala, Spiros Koulouzis, Zeshun Shi, X. Liao, Jelena Gordiyenko

This paper presents recommendations on the design and pilot implementation of the DevOps and Cloud based Software Development curricula for Computer Science and Software Engineering masters. The central part of proposed approach is the Body of Knowledge in the DevOps technologies for Software Engineering (DevOpsSE BoK) that defines a set Knowledge Areas and Knowledge Units required for SE professionals to work efficiently as DevOps engineer or application developer. Defining DevOpsSE-BoK provides a basis for defining required professional competences and skills and allows consistent curricula structuring and profiling. The paper also reports on the experience of the first course run on 2018/2019 academic year at the University of Amsterdam. The paper presents the structure of the course and explains what instructional methodologies have been used for course development, such as project based learning that facilitates the students' team based skills both in mastering Agile development process and skills sharing. The paper provides a short summary of the generally used DevOps definitions, concepts, models and tools, specifically focusing on the cloud based DevOps tools for software development, deployment and operation that allows the main DevOps principle of continuous development and continuous improvement which are critical for modern agile data driven companies.

本文提出了针对计算机科学和软件工程硕士的DevOps和基于云的软件开发课程的设计和试点实施的建议。建议方法的核心部分是软件工程DevOps技术中的知识体系(DevOpsSE BoK)，它定义了一组知识领域和知识单元，这些知识领域和单元是软件工程专业人员作为DevOps工程师或应用程序开发人员有效工作所必需的。定义DevOpsSE-BoK为定义所需的专业能力和技能提供了基础，并允许一致的课程结构和分析。本文还报告了阿姆斯特丹大学2018/2019学年开设的第一门课程的经验。本文介绍了课程的结构，并解释了课程开发中使用的教学方法，例如基于项目的学习，促进了学生在掌握敏捷开发过程和技能共享方面的基于团队的技能。本文简要总结了常用的DevOps定义、概念、模型和工具，特别关注了用于软件开发、部署和操作的基于云的DevOps工具，这些工具支持持续开发和持续改进的主要DevOps原则，这对现代敏捷数据驱动型公司至关重要。

{"title":"Teaching DevOps and Cloud Based Software Engineering in University Curricula","authors":"Y. Demchenko, Zhiming Zhao, Jayachander Surbiryala, Spiros Koulouzis, Zeshun Shi, X. Liao, Jelena Gordiyenko","doi":"10.1109/eScience.2019.00075","DOIUrl":"https://doi.org/10.1109/eScience.2019.00075","url":null,"abstract":"This paper presents recommendations on the design and pilot implementation of the DevOps and Cloud based Software Development curricula for Computer Science and Software Engineering masters. The central part of proposed approach is the Body of Knowledge in the DevOps technologies for Software Engineering (DevOpsSE BoK) that defines a set Knowledge Areas and Knowledge Units required for SE professionals to work efficiently as DevOps engineer or application developer. Defining DevOpsSE-BoK provides a basis for defining required professional competences and skills and allows consistent curricula structuring and profiling. The paper also reports on the experience of the first course run on 2018/2019 academic year at the University of Amsterdam. The paper presents the structure of the course and explains what instructional methodologies have been used for course development, such as project based learning that facilitates the students' team based skills both in mastering Agile development process and skills sharing. The paper provides a short summary of the generally used DevOps definitions, concepts, models and tools, specifically focusing on the cloud based DevOps tools for software development, deployment and operation that allows the main DevOps principle of continuous development and continuous improvement which are critical for modern agile data driven companies.","PeriodicalId":142614,"journal":{"name":"2019 15th International Conference on eScience (eScience)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124418813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 11

AdaptLidarTools: A Full-Waveform Lidar Processing Suite AdaptLidarTools:全波形激光雷达处理套件

2019 15th International Conference on eScience (eScience)

Pub Date : 2019-09-01 DOI: 10.1109/eScience.2019.00048

Ravi Shankar, N. Ilangakoon, A. Orenstein, Floriana Ciaglia, N. Glenn, C. Olschanowsky

AdaptLidarTools is a software package that processes full-waveform lidar data. Full-waveform lidar is an active remote sensing technique in which a laser beam is emitted towards a target and the backscattered energy is recorded as a near continuous waveform. A collection of waveforms from airborne lidar can capture landscape characteristics in three dimensions. Specific to vegetation, the extracted echoes and echo properties from the waveforms can provide scientists structural (height, volume, layers of canopy, among others) and functional (leaf area index, diversity) characteristics. The discrete waveforms can be transformed into georeferenced 2D rasters (images), allowing scientists to correlate field-based observations for validation of the waveform observations and fusing the data with other geospatial information. AdaptLidarTools provides an extensible, open-source framework that processes the waveforms and produces multiple data outputs that can be used in vegetation and terrain analysis. AdaptLidarTools is designed to explore new methods to fit full-waveform lidar signals and to maximize the information in the waveforms for vegetation applications. The toolkit explores first differencing, complementary to Gaussian fitting, for faster processing of full-waveform lidar signals and for handling increasingly large volumes of full-waveform lidar datasets. AdaptLidarTools takes approximately 30 min to derive a raster of a given echo property from a raw waveform file of 1 GB size. The toolkit generates first order echo properties such as position, amplitude, pulse width, and other properties such as rise time, fall time and backscattered cross section. It also generates other properties that current proprietary and open-source tools do not. The derived echo properties are delivered as georeferenced raster files of a given spatial resolution that can be viewed and processed by most remote sensing data processing software.

AdaptLidarTools是一个处理全波形激光雷达数据的软件包。全波形激光雷达是一种主动遥感技术，其中激光束向目标发射，后向散射能量被记录为接近连续的波形。从机载激光雷达收集的波形可以捕捉到三维的景观特征。具体到植被，从波形中提取的回波和回波特性可以为科学家提供结构(高度、体积、冠层等)和功能(叶面积指数、多样性)特征。离散波形可以转换为地理参考的二维光栅(图像)，使科学家能够将基于现场的观测结果关联起来，以验证波形观测结果，并将数据与其他地理空间信息融合。AdaptLidarTools提供了一个可扩展的开源框架，可以处理波形并产生多种数据输出，可用于植被和地形分析。AdaptLidarTools旨在探索适合全波形激光雷达信号的新方法，并最大限度地利用植被应用的波形信息。该工具包探索了第一差分，补充高斯拟合，以更快地处理全波形激光雷达信号，并处理越来越多的全波形激光雷达数据集。AdaptLidarTools从1gb大小的原始波形文件中导出给定回声属性的栅格大约需要30分钟。该工具包可生成一阶回波属性，如位置、幅度、脉冲宽度以及其他属性，如上升时间、下降时间和后向散射截面。它还生成了其他当前专有和开源工具所不具备的特性。导出的回波属性以给定空间分辨率的地理参考光栅文件的形式提供，可由大多数遥感数据处理软件查看和处理。

{"title":"AdaptLidarTools: A Full-Waveform Lidar Processing Suite","authors":"Ravi Shankar, N. Ilangakoon, A. Orenstein, Floriana Ciaglia, N. Glenn, C. Olschanowsky","doi":"10.1109/eScience.2019.00048","DOIUrl":"https://doi.org/10.1109/eScience.2019.00048","url":null,"abstract":"AdaptLidarTools is a software package that processes full-waveform lidar data. Full-waveform lidar is an active remote sensing technique in which a laser beam is emitted towards a target and the backscattered energy is recorded as a near continuous waveform. A collection of waveforms from airborne lidar can capture landscape characteristics in three dimensions. Specific to vegetation, the extracted echoes and echo properties from the waveforms can provide scientists structural (height, volume, layers of canopy, among others) and functional (leaf area index, diversity) characteristics. The discrete waveforms can be transformed into georeferenced 2D rasters (images), allowing scientists to correlate field-based observations for validation of the waveform observations and fusing the data with other geospatial information. AdaptLidarTools provides an extensible, open-source framework that processes the waveforms and produces multiple data outputs that can be used in vegetation and terrain analysis. AdaptLidarTools is designed to explore new methods to fit full-waveform lidar signals and to maximize the information in the waveforms for vegetation applications. The toolkit explores first differencing, complementary to Gaussian fitting, for faster processing of full-waveform lidar signals and for handling increasingly large volumes of full-waveform lidar datasets. AdaptLidarTools takes approximately 30 min to derive a raster of a given echo property from a raw waveform file of 1 GB size. The toolkit generates first order echo properties such as position, amplitude, pulse width, and other properties such as rise time, fall time and backscattered cross section. It also generates other properties that current proprietary and open-source tools do not. The derived echo properties are delivered as georeferenced raster files of a given spatial resolution that can be viewed and processed by most remote sensing data processing software.","PeriodicalId":142614,"journal":{"name":"2019 15th International Conference on eScience (eScience)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116368680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

HUBzero© Goes OneSciencePlace: The Next Community-Driven Steps for Providing Software-as-a-Service HUBzero©Goes OneSciencePlace:提供软件即服务的下一个社区驱动步骤

2019 15th International Conference on eScience (eScience)

Pub Date : 2019-09-01 DOI: 10.1109/eScience.2019.00097

David Benham, S. Gesing

HUBzero© is a well-used science gateway framework actively developed for over a decade. The needs of its community are one of the primary driving forces guiding the team behind HUBzero©. For example, requirements in the community led to the integration of JupyterHub, RStudio and the provision of Docker containers for the tool submission environment. Besides its community driven requirements, the team behind HUBzero© continuously analyzes the existing science gateway landscape, the usage of instances of HUBzero© as well as trends in the usage of computational platforms in general to keep HUBzero© robust, scalable, and sustainable. HUBzero© has begun development of a science gateway platform called Open Science Place (OSP). OSP is a SaaS-based (Softwareas-a-Service) solution to give researchers a way to execute, share, and archive their research in a publically accessible venue. The poster goes into detail for the different aspects of sustainability addressed in OSP such as a community-hosting concept, flexible financing models, interoperability and scalability of tools.

HUBzero©是一个使用良好的科学网关框架，积极发展了十多年。社区的需求是HUBzero©背后团队的主要驱动力之一。例如，社区的需求导致了JupyterHub、RStudio的集成，并为工具提交环境提供了Docker容器。除了社区驱动的需求外，HUBzero©背后的团队还不断分析现有的科学门户景观，HUBzero©实例的使用情况以及计算平台使用的趋势，以保持HUBzero©的健壮性，可扩展性和可持续性。HUBzero©已经开始开发一个名为Open science Place (OSP)的科学门户平台。OSP是一种基于saas(软件领域即服务)的解决方案，为研究人员提供了一种在公共场所执行、共享和存档他们的研究的方法。海报详细介绍了OSP中可持续性的不同方面，如社区托管概念、灵活的融资模式、工具的互操作性和可扩展性。

引用次数: 2

ESiWACE: On European Infrastructure Efforts for Weather and Climate Modeling at Exascale ESiWACE:关于欧洲在百亿亿次天气和气候模拟方面的基础设施努力

2019 15th International Conference on eScience (eScience)

Pub Date : 2019-09-01 DOI: 10.1109/eScience.2019.00065

P. Neumann, J. Biercamp

Kilometer-scale ensemble simulations are expected to significantly boost and impact weather and climate predictions in the future. However, these simulations will only be enabled by exascale compute power and corresponding data capacity. In the following, we discuss a European effort in terms of the e-infrastructure Centre of Excellence in Simulation of Weather and Climate in Europe (ESiWACE). ESiWACE provides infrastructural means to prepare the weather and climate communities for simulations at the exascale. We give an overview of several ESiWACE infrastructure components and discuss their role in reaching the goal of kilometer-scale ensemble predictions. We particularly review the outcomes of the ESiWACE demonstrators, that is community-driven kilometer-scale models that have been developed throughout the last years.

千米尺度的集合模拟预计将显著促进和影响未来的天气和气候预测。然而，这些模拟只能通过百亿亿次的计算能力和相应的数据容量来实现。在下文中，我们将讨论欧洲在欧洲天气和气候模拟卓越电子基础设施中心(ESiWACE)方面的努力。ESiWACE提供了基础设施手段，使天气和气候社区为百亿亿次的模拟做好准备。我们概述了几个ESiWACE基础设施组件，并讨论了它们在实现千米尺度集合预测目标中的作用。我们特别回顾了ESiWACE示范项目的成果，即过去几年开发的社区驱动的公里尺度模型。

引用次数: 2

The Future of Swedish e-Science: SeRC 2.0 瑞典电子科学的未来:SeRC 2.0

2019 15th International Conference on eScience (eScience)

Pub Date : 2019-09-01 DOI: 10.1109/eScience.2019.00053

E. Laure, Olivia Eriksson, Erik Lindahl, D. Henningson

Since 2010, the Swedish e-Science Research Centre (SeRC) is funding and coordinating e-Science activities in a broad spectrum of scientific disciplines. After an initial 5-year phase that produced outstanding results, SeRC is increasingly focusing on fostering interactions between disciplines and has created so-called Multidisciplinary Collaborative Programs (MCPs). In these programs, domain researchers collaborate with e-Science methods and tool developers and e-Infrastructure providers. In this paper we give an overview of the initial phase of SeRC and present the new programs that started operating in 2019.

自2010年以来，瑞典电子科学研究中心(SeRC)在广泛的科学学科领域资助和协调电子科学活动。在最初的5年阶段取得了显著成果之后，SeRC越来越注重促进学科之间的互动，并创建了所谓的多学科合作项目(mcp)。在这些项目中，领域研究人员与电子科学方法和工具开发人员以及电子基础设施提供商合作。在本文中，我们概述了SeRC的初始阶段，并介绍了2019年开始运行的新项目。

引用次数: 0

The AllScale API

2019 15th International Conference on eScience (eScience)

Pub Date : 2019-09-01 DOI: 10.1109/eScience.2019.00064

P. Gschwandtner, Herbert Jordan, Peter Thoman, T. Fahringer

Effectively implementing scientific algorithms in distributed memory parallel applications is a difficult task for domain scientists, as evident by the large number of domain-specific languages and libraries available today attempting to facilitate the process. However, they usually provide a closed set of parallel patterns and are not open for extension without vast modifications to the underlying system. In this work, we present the AllScale API, a programming interface for developing distributed memory parallel applications with the ease of shared memory programming models. The AllScale API is closed for modification but open for extension, allowing new, user-defined parallel patterns and data structures to be implemented based on existing core primitives and therefore fully supported in the AllScale framework. Focusing on high-level functionality directly offered to application developers, we present the design advantages of such an API design, detail some of its specifications and evaluate it using three real-world use cases. Our results show that AllScale decreases the complexity of implementing scientific applications for distributed memory while attaining comparable or higher performance compared to MPI reference implementations.

对于领域科学家来说，在分布式内存并行应用程序中有效地实现科学算法是一项艰巨的任务，目前有大量的领域特定语言和库试图促进这一过程。然而，它们通常提供一组封闭的并行模式，如果不对底层系统进行大量修改，就不能对扩展开放。在这项工作中，我们提出了AllScale API，这是一个编程接口，用于开发具有共享内存编程模型的分布式内存并行应用程序。AllScale API对修改是封闭的，但对扩展是开放的，允许新的、用户定义的并行模式和数据结构基于现有的核心原语实现，因此在AllScale框架中完全支持。重点关注直接提供给应用程序开发人员的高级功能，我们展示了这种API设计的设计优势，详细介绍了它的一些规范，并使用三个实际用例对其进行了评估。我们的结果表明，与MPI参考实现相比，AllScale降低了为分布式内存实现科学应用程序的复杂性，同时获得了相当或更高的性能。

{"title":"The AllScale API","authors":"P. Gschwandtner, Herbert Jordan, Peter Thoman, T. Fahringer","doi":"10.1109/eScience.2019.00064","DOIUrl":"https://doi.org/10.1109/eScience.2019.00064","url":null,"abstract":"Effectively implementing scientific algorithms in distributed memory parallel applications is a difficult task for domain scientists, as evident by the large number of domain-specific languages and libraries available today attempting to facilitate the process. However, they usually provide a closed set of parallel patterns and are not open for extension without vast modifications to the underlying system. In this work, we present the AllScale API, a programming interface for developing distributed memory parallel applications with the ease of shared memory programming models. The AllScale API is closed for modification but open for extension, allowing new, user-defined parallel patterns and data structures to be implemented based on existing core primitives and therefore fully supported in the AllScale framework. Focusing on high-level functionality directly offered to application developers, we present the design advantages of such an API design, detail some of its specifications and evaluate it using three real-world use cases. Our results show that AllScale decreases the complexity of implementing scientific applications for distributed memory while attaining comparable or higher performance compared to MPI reference implementations.","PeriodicalId":142614,"journal":{"name":"2019 15th International Conference on eScience (eScience)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127382113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Describing Datasets in Wikidata 描述维基数据中的数据集

2019 15th International Conference on eScience (eScience)

Pub Date : 2019-09-01 DOI: 10.1109/eScience.2019.00070

Denny Vrandečić

We propose to use Wikidata to provide metadata for datasets when the traditional approach via Schema.org is not feasible. We describe and discuss the proposal, and believe that the process described in this paper can help with increasing findability and accessibility of certain datasets.

当通过Schema.org的传统方法不可行的时候，我们建议使用Wikidata为数据集提供元数据。我们描述并讨论了该提议，并相信本文中描述的过程可以帮助提高某些数据集的可查找性和可访问性。

引用次数: 4

BBBlockchain: Blockchain-Based Participation in Urban Development BBBlockchain:基于区块链参与城市发展

2019 15th International Conference on eScience (eScience)

Pub Date : 2019-09-01 DOI: 10.1109/eScience.2019.00043

R. Muth, Kerstin Eisenhut, J. Rabe, Florian Tschorsch

Urban development processes often suffer from mistrust amongst different stakeholder groups. The lack of transparency within complex and long-term planning processes and the limited scope for co-creation and joint decision-making constitute a persistent problem for successful participation in urban planning. Civic technology has the potential to improve this predicament. With BBBlockchain, we propose a blockchain-based participation platform, which is able to address all layers of participation. In the development of the platform, we focus on two key aspects: How to increase transparency and how to introduce enhanced co-decision-making. To this end, we exploit the immutable nature of blockchains and effectively offer a platform that excludes monopolistic control over information. The decision-making process is governed by smart contracts implementing, for example, timestamping of planning documents, opinion polls, and the management of a participatory budget. Our architecture and prototypes show the operational capabilities of this approach in a series of use cases for urban development.

城市发展过程往往受到不同利益相关者群体之间不信任的影响。在复杂和长期的规划过程中缺乏透明度，共同创造和共同决策的范围有限，这是成功参与城市规划的一个长期问题。市政技术有可能改善这种困境。通过BBBlockchain，我们提出了一个基于区块链的参与平台，它能够解决所有层次的参与。在平台的开发中，我们关注两个关键方面:如何提高透明度和如何引入增强的共同决策。为此，我们利用区块链的不可变特性，有效地提供了一个排除对信息的垄断控制的平台。决策过程由智能合约管理，例如，规划文件的时间戳、民意调查和参与式预算的管理。我们的建筑和原型在城市发展的一系列用例中展示了这种方法的操作能力。

引用次数: 6

The Evolution of Bits and Bottlenecks in a Scientific Workflow Trying to Keep Up with Technology: Accelerating 4D Image Segmentation Applied to NASA Data 试图跟上技术的科学工作流程中的比特和瓶颈的演变:加速应用于NASA数据的4D图像分割

2019 15th International Conference on eScience (eScience)

Pub Date : 2019-09-01 DOI: 10.1109/eScience.2019.00016

S. Sellars, John Graham, D. Mishin, Kyle Marcus, I. Altintas, T. DeFanti, L. Smarr, Camille Crittenden, F. Wuerthwein, Joulien Tatar, P. Nguyen, E. Shearer, S. Sorooshian, F. M. Ralph

In 2016, a team of earth scientists directly engaged a team of computer scientists to identify cyberinfrastructure (CI) approaches that would speed up an earth science workflow. This paper describes the evolution of that workflow as the two teams bridged CI and an image segmentation algorithm to do large scale earth science research. The Pacific Research Platform (PRP) and The Cognitive Hardware and Software Ecosystem Community Infrastructure (CHASE-CI) resources were used to significantly decreased the earth science workflow's wall-clock time from 19.5 days to 53 minutes. The improvement in wall-clock time comes from the use of network appliances, improved image segmentation, deployment of a containerized workflow, and the increase in CI experience and training for the earth scientists. This paper presents a description of the evolving innovations used to improve the workflow, bottlenecks identified within each workflow version, and improvements made within each version of the workflow, over a three-year time period.

2016年，一个地球科学家团队直接与一个计算机科学家团队合作，以确定可以加快地球科学工作流程的网络基础设施(CI)方法。本文描述了这两个团队将CI和图像分割算法结合起来进行大规模地球科学研究的过程。利用太平洋研究平台(PRP)和认知软硬件生态系统社区基础设施(CHASE-CI)资源，将地球科学工作流程的时钟时间从19.5天显著缩短至53分钟。时钟时间的改进来自网络设备的使用、图像分割的改进、容器化工作流的部署，以及CI经验的增加和对地球科学家的培训。本文描述了用于改进工作流的不断发展的创新，每个工作流版本中确定的瓶颈，以及在三年时间内每个工作流版本中所做的改进。

{"title":"The Evolution of Bits and Bottlenecks in a Scientific Workflow Trying to Keep Up with Technology: Accelerating 4D Image Segmentation Applied to NASA Data","authors":"S. Sellars, John Graham, D. Mishin, Kyle Marcus, I. Altintas, T. DeFanti, L. Smarr, Camille Crittenden, F. Wuerthwein, Joulien Tatar, P. Nguyen, E. Shearer, S. Sorooshian, F. M. Ralph","doi":"10.1109/eScience.2019.00016","DOIUrl":"https://doi.org/10.1109/eScience.2019.00016","url":null,"abstract":"In 2016, a team of earth scientists directly engaged a team of computer scientists to identify cyberinfrastructure (CI) approaches that would speed up an earth science workflow. This paper describes the evolution of that workflow as the two teams bridged CI and an image segmentation algorithm to do large scale earth science research. The Pacific Research Platform (PRP) and The Cognitive Hardware and Software Ecosystem Community Infrastructure (CHASE-CI) resources were used to significantly decreased the earth science workflow's wall-clock time from 19.5 days to 53 minutes. The improvement in wall-clock time comes from the use of network appliances, improved image segmentation, deployment of a containerized workflow, and the increase in CI experience and training for the earth scientists. This paper presents a description of the evolving innovations used to improve the workflow, bottlenecks identified within each workflow version, and improvements made within each version of the workflow, over a three-year time period.","PeriodicalId":142614,"journal":{"name":"2019 15th International Conference on eScience (eScience)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130908524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2019 15th International Conference on eScience (eScience)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀