2017 IEEE/ACM 12th International Workshop on Software Engineering for Science (SE4Science): Latest Articles

A Novel Coupling Pattern in Computational Science and Engineering Software

2017 IEEE/ACM 12th International Workshop on Software Engineering for Science (SE4Science)

Pub Date : 2017-05-20 DOI: 10.1109/SE4Science.2017.10

Suraj Kamble, Xiaoyu Jin, Nan Niu, Michelle Simon

Computational science and engineering (CSE) software is written by experts in particular areas. Because of this specialization, existing CSE software may need to integrate other CSE software systems developed by different groups of experts. The coupling problem is one of the challenges of software integration. Here, the coupling we study refers to the issues involved in integrating two legacy codes, not the multiphysics coupling in which distinct codes are combined to model particular phenomena. In this paper, we identify a complex coupling pattern encountered when trying to integrate two CSE software systems. We describe the coupling pattern in detail and show the complexity of resolving this kind of coupling pattern. Our work contributes to the area of CSE software, since few previous studies have addressed the coupling problem in the CSE domain. Our work will further inspire future research on solving the coupling problem during CSE software integration.

Citations: 10

Advancing Open Science with Version Control and Blockchains

2017 IEEE/ACM 12th International Workshop on Software Engineering for Science (SE4Science)

Pub Date : 2017-05-20 DOI: 10.1109/SE4Science.2017.11

Jonathan Bell, Thomas D. Latoza, Foteini Baldimtsi, A. Stavrou

The scientific community is facing a crisis of reproducibility: confidence in scientific results is damaged by concerns regarding the integrity of experimental data and the analyses applied to that data. Experimental integrity can be compromised inadvertently when researchers overlook some important component of their experimental procedure, or intentionally by researchers or malicious third parties who are biased towards ensuring a specific outcome of an experiment. The scientific community has pushed for "open science" to add transparency to the experimental process, asking researchers to publicly register their data sets and experimental procedures. We argue that the software engineering community can leverage its expertise in tracking traceability and provenance of source code and its related artifacts to simplify data management for scientists. Moreover, by leveraging smart contract and blockchain technologies, we believe that it is possible for such a system to guarantee end-to-end integrity of scientific data and results while supporting collaborative research.

Citations: 21

WIP: Generating Sequence Diagrams for Modern Fortran

2017 IEEE/ACM 12th International Workshop on Software Engineering for Science (SE4Science)

Pub Date : 2017-05-01 DOI: 10.1109/SE4Science.2017.13

Anawat Leatongkam, Aziz Nanthaamornphong, D. Rouson

Fortran finds widespread use in scientific and engineering communities that embraced computing early, including weather and climate science and mechanical, nuclear, and aerospace engineering. Over its lifetime, Fortran has evolved to support multiple programming paradigms, including Object-Oriented Programming (OOP). Despite the recently burgeoning ecosystem of tools and libraries supporting modern Fortran, there remains limited support for generating common Object-Oriented Design (OOD) diagrams from Fortran source code. ForUML partially fills this need by reverse engineering Unified Modeling Language (UML) class diagrams from object-oriented (OO) Fortran programs. Class diagrams provide useful information about class structures and inter-relationships, but class diagrams do not convey the temporal information required to understand runtime class behavior and interactions. UML sequence diagrams provide such important algorithmic details. This paper proposes to extend ForUML to extract UML sequence diagrams from Fortran code and to offer this capability via a widely used open-source platform. The paper argues that the proposed capability can raise the level of abstraction at which the computational science community discusses modern Fortran.

Citations: 1

WIP: Live Restructuring of Data Architecture

2017 IEEE/ACM 12th International Workshop on Software Engineering for Science (SE4Science)

Pub Date : 2017-05-01 DOI: 10.1109/SE4Science.2017.14

W. Macey, Dali Wang, P. Thornton, A. Mockus

In large-scale Earth System simulation codes, such as the Accelerated Climate Model for Energy (ACME), complex user derived data types (containing a large number of variables) are designed to represent the interactions of atmosphere, ocean, land, ice, and biosphere to project global climate under a wide variety of conditions. The following is our proposed approach to restructure the data architecture of a land component within the ACME project while the project is undergoing active development. The data architect for the land subsystem defines the new datatype requirements that would greatly simplify the implementation of terrestrial land submodels by converting more than 50 to just eight primary data types. Since the code is developed under community governance, we have to ensure that the restructuring does not interfere with the other development which, with dozens of changes occurring every day, makes it impossible to work on a shared development branch. The active development also occurs on almost five hundred branches, making it extremely difficult to assess potential interactions. To address these challenges we have designed and started an iterative procedure for implementing the data restructuring and estimating both the effort it takes to restructure and the effort it would save once the restructuring is implemented.

Citations: 0

WIP: Prov-SE-O: A Provenance Ontology to Support Scientists in Scientific Experimentation Process

2017 IEEE/ACM 12th International Workshop on Software Engineering for Science (SE4Science)

Pub Date : 1900-01-01 DOI: 10.1109/SE4Science.2017.12

Lenita M. Ambrósio, J. M. David, Regina M. M. Braga, Victor Ströele A. Menezes, Fernanda Campos, M. A. Araújo

The management of provenance information plays a key role in the scientific experimentation domain, since scientists often need to examine and audit the results obtained from experiments. In addition, provenance data are essential to ensure reproducibility and reuse of experiments or artifacts produced by them. In this way, the objective of this work is to present an ontology to support the researchers in the process of scientific experimentation using provenance data. These data assist in the reproduction and reuse of scientific experiments, as well as allow the discovery of implicit knowledge using inference mechanisms.

Citations: 2
