SICS Software-Intensive Cyber-Physical Systems: Latest Articles
Where to begin: on pattern language entry points
IF 2.4 Q1 Computer Science Pub Date : 2019-08-26 DOI: 10.1007/s00450-019-00417-6
Lukas Reinfurt, Michael Falkenthal, Frank Leymann
Pattern languages as tools for solving problems based on interconnected, abstract, and proven solutions can offer valuable help to practitioners. But there is always the question of where to begin when applying a pattern language. Their authors often provide entry points, but these are usually only useful if one starts completely from scratch or from a very specific situation. When confronted with problems at hand, practitioners are often left to find a suitable entry point themselves by reading through the whole pattern language to find applicable patterns. To help with this problem, we present a general approach and its formalization that provides entry points for any kind of situation. Our general three-step approach guides practitioners through Situation Assessment, Treatment Selection, and Treatment Application in order to find and apply a suitable pattern language for their specific problems. We formalize all the parts involved and show that the facts collected during Situation Assessment can be used to find a suitable entry point for a specific situation. We also present an algorithm for finding these entry points.
Citations: 4
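The idea in the abstract above, using facts collected during Situation Assessment to locate an entry point, can be illustrated with a minimal sketch. This is not the authors' algorithm: the `Pattern` class, the subset-matching rule, the ranking, and all fact and pattern names are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pattern:
    name: str
    requires: frozenset  # facts that must hold for the pattern to apply (assumed model)


def find_entry_points(facts, patterns):
    """Return patterns whose required facts are all observed, most specific first."""
    applicable = [p for p in patterns if p.requires <= facts]
    # Rank by number of required facts (descending), then by name for a stable order.
    return sorted(applicable, key=lambda p: (-len(p.requires), p.name))


# Hypothetical facts gathered during a situation assessment.
facts = frozenset({"battery_powered", "intermittent_connectivity"})

# Hypothetical pattern language; names and requirements are made up.
patterns = [
    Pattern("ProtocolBridge", frozenset({"no_ip_support"})),
    Pattern("OfflineBuffer", frozenset({"battery_powered", "intermittent_connectivity"})),
    Pattern("LowPowerMessaging", frozenset({"battery_powered"})),
]

entry_points = [p.name for p in find_entry_points(facts, patterns)]
print(entry_points)
```

Ranking by the number of matched requirements is just one plausible heuristic; the paper's formalization may order or filter candidates differently.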
Requirements for a model-driven cloud-native migration of monolithic web-based applications
IF 2.4 Q1 Computer Science Pub Date : 2019-08-26 DOI: 10.1007/s00450-019-00414-9
Robin Lichtenthäler, Mike Prechtl, Christoph Schwille, Tobias Schwartz, Pascal Cezanne, Guido Wirtz
With the emergence of cloud-native applications, the question arises how existing, often monolithic, applications can be migrated to this new paradigm. The main migration challenges are the decomposition of an application into fine-grained components and the introduction of cloud computing paradigms. With complex existing applications the migration is difficult. A structured and tool-supported approach would facilitate the migration process, which is why this paper proposes a model-driven engineering based approach. As a basis, this paper derives and presents requirements for such an approach from the literature on the one hand and a migration case study on the other. The requirements are specifically targeted at the necessary models, but also consider the overall approach. The necessities for the realization of a model-driven cloud-native migration approach are shown and remaining challenges are discussed.
Citations: 7
A formal approach to build privacy-awareness into clinical workflows
IF 2.4 Q1 Computer Science Pub Date : 2019-08-26 DOI: 10.1007/s00450-019-00418-5
Saliha Irem Besik, Johann-Christoph Freytag
Clinical workflows consist of sets of tasks involving patients and healthcare professionals. In such an environment, maintaining the privacy of patient data is a significant challenge. Healthcare providers have to consider both legislative compliance with tightening privacy regulations and the growing privacy concerns of individuals. Unlike data security, which aims at preventing unauthorized access, privacy focuses on giving individuals the ability to control when, how, and to what extent their data is used for a particular purpose. In this paper, we present our first steps toward transforming existing non-privacy-aware clinical workflows into privacy-aware ones through algorithms based on privacy policies and privacy preferences.
Citations: 4
A lightweight messaging engine for decentralized data processing in the Internet of Things
IF 2.4 Q1 Computer Science Pub Date : 2019-08-26 DOI: 10.1007/s00450-019-00410-z
Daniel Del Gaudio, Pascal Hirmer
Today, Internet of Things applications are available throughout many different domains (manufacturing, health, cities, homes), enabling a high degree of automation to ease people’s lives. For example, automated heating systems in a smart home can lead to reduced costs and increased comfort for the residents. In the IoT, situations can be detected by interpreting data produced by heterogeneous sensors, which typically leads to an invocation of actuators. In such applications, sensor data is usually streamed to a central instance for processing. However, especially in time-critical applications, this is not feasible, since high latency is an issue. To cope with this problem, in this paper, we introduce an approach for decentralized data processing in the IoT. This leads to decreased latency as well as a reduction of costs.
Citations: 7
Quantum humanities: a vision for quantum computing in digital humanities
IF 2.4 Q1 Computer Science Pub Date : 2019-08-26 DOI: 10.1007/s00450-019-00419-4
Johanna Barzen, F. Leymann
Citations: 12
RADON: rational decomposition and orchestration for serverless computing
IF 2.4 Q1 Computer Science Pub Date : 2019-08-26 DOI: 10.1007/s00450-019-00413-w
G. Casale, M. Artač, W.-J. van den Heuvel, A. van Hoorn, P. Jakovits, F. Leymann, M. Long, V. Papanikolaou, D. Presenza, A. Russo, S. N. Srirama, D. A. Tamburri, M. Wurster, L. Zhu
Emerging serverless computing technologies, such as function as a service (FaaS), enable developers to virtualize the internal logic of an application, simplifying the management of cloud-native services and allowing cost savings through billing and scaling at the level of individual functions. Serverless computing is therefore rapidly shifting the attention of software vendors to the challenge of developing cloud applications deployable on FaaS platforms. In this vision paper, we present the research agenda of the RADON project (http://radon-h2020.eu), which aims to develop a model-driven DevOps framework for creating and managing applications based on serverless computing. RADON applications will consist of fine-grained and independent microservices that can efficiently and optimally exploit FaaS and container technologies. Our methodology strives to tackle complexity in designing such applications, including the solution of optimal decomposition, the reuse of serverless functions as well as the abstraction and actuation of event processing chains, while avoiding cloud vendor lock-in through models.
Citations: 42
Freezing and defrosting cloud applications: automated saving and restoring of running applications
IF 2.4 Q1 Computer Science Pub Date : 2019-08-26 DOI: 10.1007/s00450-019-00415-8
Lukas Harzenetter, Uwe Breitenbücher, Kálmán Képes, Frank Leymann
In recent years, several technologies were developed enabling the automated provisioning and decommissioning of cloud applications. To reduce costs, these applications can be terminated and restarted on demand. However, when an application is terminated, its current application state, i.e., all application-specific data, is deleted along with the running application instance. This application state may hold all business-critical information and, hence, must be saved before the application is terminated. One possibility to save application states is to create VM snapshots, although this is not always possible or sufficient. Therefore, we introduce two approaches: (1) a concept to generically terminate applications and save their internal state, and (2) an approach to reinstate the application in the same state again. To evaluate their practical feasibility, we implemented a proof of concept in the OpenTOSCA ecosystem.
Citations: 6
Pattern-based rewrite and refinement of architectures using graph theory
IF 2.4 Q1 Computer Science Pub Date : 2019-08-26 DOI: 10.1007/s00450-019-00416-7
Jasmin Guth, Frank Leymann
With the continuous growth of IT application systems, the complexity of architecture modeling and development increases. Patterns document proven solutions for recurring problems in an abstract and human-readable manner. Within the domain of IT architectures, they should support the architecture modeling and development process. Because patterns are documented as text, they cannot be applied to an architecture automatically: patterns have to be read, understood, adapted to the corresponding use case, and realized manually over and over again. Consequently, architecture modeling that considers the variety of documented patterns becomes even more complex and time-consuming. To tackle these issues, we introduce an approach for the automated application and realization of patterns in architectural graphs using graph transformation techniques. This eases the architecture modeling process, in which proven solutions for recurring problems can be selected as required and applied automatically. We present the concept and its formalization, and validate our approach based on an application example.
Citations: 3
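To make the notion of applying a pattern to an architectural graph by graph transformation more concrete, here is a minimal sketch. It is an illustration only, not the formalization from the paper: the graph encoding (a dict of node types plus a set of directed edges), the function `insert_intermediary`, and all node and type names are assumptions.

```python
def insert_intermediary(nodes, edges, src_type, dst_type, new_type):
    """Rewrite every src_type -> dst_type edge into src -> new node -> dst.

    nodes: dict mapping node id to a type label
    edges: set of (source id, target id) pairs
    Returns a new (nodes, edges) pair; the inputs are left unchanged.
    """
    new_nodes = dict(nodes)
    new_edges = set()
    for src, dst in sorted(edges):
        if nodes[src] == src_type and nodes[dst] == dst_type:
            # Left-hand side of the rule matched: replace the edge by a two-hop path.
            mid = f"{new_type}_{src}_{dst}"
            new_nodes[mid] = new_type
            new_edges.update({(src, mid), (mid, dst)})
        else:
            new_edges.add((src, dst))
    return new_nodes, new_edges


# Hypothetical architecture: a web client calls an order service backed by a database.
nodes = {"web": "client", "orders": "service", "db": "database"}
edges = {("web", "orders"), ("orders", "db")}

new_nodes, new_edges = insert_intermediary(nodes, edges, "client", "service", "queue")
print(sorted(new_edges))
```

A full graph transformation system would match arbitrary subgraphs and check application conditions; this sketch hard-codes a single-edge left-hand side to keep the rewrite step visible.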
Towards task-based parallelization for entity resolution
IF 2.4 Q1 Computer Science Pub Date : 2019-08-26 DOI: 10.1007/s00450-019-00409-6
Leonardo Gazzarri, Melanie Herschel
Entity resolution (ER) refers to the problem of finding which virtual representations in one or more data sources refer to the same real-world entity. A central question in ER is how to find matching entity representations (so-called duplicates) efficiently and in a scalable way. One general technique to address these issues is to leverage parallelization. In particular, almost all work on parallel ER focuses on data parallelism. This paper focuses on task parallelism for ER. This type of parallelism makes it possible to support incremental ER, which computes the solution incrementally by streaming the results of intermediate ER stages as soon as they are computed. This may yield results in a more timely fashion and can also serve service-oriented settings with a limited time or monetary budget. In summary, this paper presents a framework for task-parallelization of ER, supporting in particular ER of large amounts of semi-structured and heterogeneous data. We also discuss a possible implementation of our framework.
Citations: 3
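The task-parallel, incremental style of ER described in the abstract above can be sketched as a small pipeline in which each ER stage runs as its own task and streams results to the next stage. This is a toy illustration under stated assumptions, not the framework from the paper: the two stages, the blocking key (first letter of the name), the matching rule (case-insensitive equality), and the sample records are all hypothetical.

```python
import queue
import threading

DONE = object()  # sentinel marking the end of a stream


def blocking_stage(records, out_q):
    """Stream candidate pairs that share a cheap blocking key (first letter)."""
    blocks = {}
    for rec in records:
        block = blocks.setdefault(rec["name"][0].lower(), [])
        for other in block:
            out_q.put((other, rec))  # emit as soon as the pair is known
        block.append(rec)
    out_q.put(DONE)


def matching_stage(in_q, out_q):
    """Compare candidate pairs as they arrive and stream the matches."""
    while (pair := in_q.get()) is not DONE:
        a, b = pair
        if a["name"].lower() == b["name"].lower():
            out_q.put((a["id"], b["id"]))
    out_q.put(DONE)


def resolve(records):
    """Yield duplicate id pairs incrementally while earlier stages may still run."""
    candidates, matches = queue.Queue(), queue.Queue()
    threading.Thread(target=blocking_stage, args=(records, candidates), daemon=True).start()
    threading.Thread(target=matching_stage, args=(candidates, matches), daemon=True).start()
    while (match := matches.get()) is not DONE:
        yield match


# Made-up records for illustration.
records = [
    {"id": 1, "name": "Acme Corp"},
    {"id": 2, "name": "ACME corp"},
    {"id": 3, "name": "Apex Ltd"},
    {"id": 4, "name": "Globex"},
]
duplicates = list(resolve(records))
print(duplicates)
```

Because `resolve` is a generator, a caller with a limited time budget can stop consuming after the first few matches instead of waiting for the whole pipeline to finish.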
ASAP-DM: a framework for automatic selection of analytic platforms for data mining
IF 2.4 Q1 Computer Science Pub Date : 2019-08-17 DOI: 10.1007/s00450-019-00408-7
Manuel Fritz, Osama Muazzen, Michael Behringer, Holger Schwarz
The plethora of analytic platforms escalates the difficulty of selecting the most appropriate analytic platform that fits the needed data mining task, the dataset, and additional user-defined criteria. Analysts in particular, who focus on the analytics domain, have difficulty keeping up with the latest developments. In this work, we introduce the ASAP-DM framework, which enables analysts to seamlessly use several platforms, whereas programmers can easily add several platforms to the framework. Furthermore, we investigate how to predict a platform based on specific criteria, such as lowest runtime or resource consumption during the execution of a data mining task. We formulate this task as an optimization problem, which can be solved by today’s classification algorithms. We evaluate the proposed framework on several analytic platforms such as Spark, Mahout, and WEKA along with several data mining algorithms for classification, clustering, and association rule discovery. Our experiments show that the automatic selection process can save up to 99.71% of the execution time by automatically choosing a faster platform.
Citations: 0
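Treating platform selection as a classification task, as the abstract above describes, can be sketched with a tiny nearest-neighbour classifier over dataset meta-features. This is only an illustration: the abstract does not say which classifier or features ASAP-DM uses, and the history entries, the placeholder labels `platform_a` and `platform_b`, and the function names below are invented and do not reflect the paper's measurements.

```python
import math

# Hypothetical past runs: (rows, columns) of a dataset -> platform with the lowest runtime.
# All numbers and labels are made up for illustration only.
HISTORY = [
    ((1_000, 10), "platform_a"),
    ((5_000, 20), "platform_a"),
    ((2_000_000, 50), "platform_b"),
    ((10_000_000, 100), "platform_b"),
]


def features(rows, cols):
    """Log-scale the meta-features so that dataset sizes are comparable."""
    return (math.log10(rows), math.log10(cols))


def predict_platform(rows, cols, history=HISTORY):
    """Predict the platform label of the most similar past run (1-nearest neighbour)."""
    target = features(rows, cols)
    nearest = min(history, key=lambda run: math.dist(features(*run[0]), target))
    return nearest[1]


small = predict_platform(3_000, 15)
large = predict_platform(5_000_000, 80)
print(small, large)
```

In practice the training data would come from measured runs of each data mining task on each platform, and the selection criterion (runtime, resource consumption) would determine the label.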