Uncovering Energy-Efficient Practices in Deep Learning Training: Preliminary Steps Towards Green AI
Pub Date: 2023-03-24 · DOI: 10.1109/CAIN58948.2023.00012
Tim Yarally, Luís Cruz, Daniel Feitosa, June Sallou, A. V. Deursen
Modern AI practices all strive towards the same goal: better results. In the context of deep learning, the term "results" often refers to the achieved accuracy on a competitive problem set. In this paper, we adopt an idea from the emerging field of $\color{green}{\text{Green AI}}$: to consider energy consumption as a metric of equal importance to accuracy and to reduce any irrelevant tasks or energy usage. We examine the training stage of the deep learning pipeline from a sustainability perspective, through the study of hyperparameter tuning strategies and of model complexity, two factors that heavily impact the pipeline's overall energy consumption. First, we investigate the effectiveness of grid search, random search and Bayesian optimisation during hyperparameter tuning, and we find that Bayesian optimisation significantly dominates the other strategies. Furthermore, we analyse the architecture of convolutional neural networks with respect to the energy consumption of three prominent layer types: convolutional, linear and ReLU layers. The results show that convolutional layers are the most computationally expensive by a wide margin. Additionally, we observe diminishing returns in accuracy for more energy-hungry models. The overall energy consumption of training can be halved by reducing the network complexity. In conclusion, we highlight innovative and promising energy-efficient practices for training deep learning models. To expand the application of $\color{green}{\text{Green AI}}$, we advocate a shift in the design of deep learning models that considers the trade-off between energy efficiency and accuracy.
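To make the comparison concrete, here is a minimal sketch of how such an experiment can be set up, assuming the Optuna library for the three tuning strategies and CodeCarbon for emissions tracking; the `train_and_evaluate` objective, the trial budget, and all parameter choices are illustrative stand-ins, not the authors' actual setup.

```python
# Sketch: compare the carbon/energy footprint of grid search, random search,
# and Bayesian optimisation (TPE) during hyperparameter tuning.
# Assumes the third-party packages `optuna` and `codecarbon`.
import math

import optuna
from codecarbon import EmissionsTracker


def train_and_evaluate(lr: float) -> float:
    # Stand-in for a real training run; replace with your model's training loop.
    return 1.0 - abs(math.log10(lr) + 3) / 5  # toy score peaking near lr = 1e-3


def objective(trial: optuna.Trial) -> float:
    lr = trial.suggest_float("lr", 1e-5, 1e-1, log=True)
    return train_and_evaluate(lr)


samplers = {
    "grid": optuna.samplers.GridSampler({"lr": [1e-4, 1e-3, 1e-2, 1e-1]}),
    "random": optuna.samplers.RandomSampler(seed=0),
    "bayesian": optuna.samplers.TPESampler(seed=0),  # sequential model-based (Bayesian-style) search
}

for name, sampler in samplers.items():
    tracker = EmissionsTracker(project_name=f"hpo-{name}", log_level="error")
    tracker.start()
    study = optuna.create_study(direction="maximize", sampler=sampler)
    study.optimize(objective, n_trials=4)
    emissions_kg = tracker.stop()  # kg CO2-eq, a proxy for the energy consumed
    print(f"{name}: best score {study.best_value:.3f}, emissions {emissions_kg:.6f} kg")
```

With a fixed trial budget like this, the comparison reduces to how quickly each strategy reaches a given accuracy, which is the trade-off the paper examines.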
{"title":"Uncovering Energy-Efficient Practices in Deep Learning Training: Preliminary Steps Towards Green AI","authors":"Tim Yarally, Luís Cruz, Daniel Feitosa, June Sallou, A. V. Deursen","doi":"10.1109/CAIN58948.2023.00012","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00012","url":null,"abstract":"Modern AI practices all strive towards the same goal: better results. In the context of deep learning, the term \"results\" often refers to the achieved accuracy on a competitive problem set. In this paper, we adopt an idea from the emerging field of $color{green}{text{Green AI}}$ to consider energy consumption as a metric of equal importance to accuracy and to reduce any irrelevant tasks or energy usage. We examine the training stage of the deep learning pipeline from a sustainability perspective, through the study of hyperparameter tuning strategies and the model complexity, two factors vastly impacting the overall pipeline’s energy consumption. First, we investigate the effectiveness of grid search, random search and Bayesian optimisation during hyperparameter tuning, and we find that Bayesian optimisation significantly dominates the other strategies. Furthermore, we analyse the architecture of convolutional neural networks with the energy consumption of three prominent layer types: convolutional, linear and ReLU layers. The results show that convolutional layers are the most computationally expensive by a strong margin. Additionally, we observe diminishing returns in accuracy for more energy-hungry models. The overall energy consumption of training can be halved by reducing the network complexity. In conclusion, we highlight innovative and promising energy-efficient practices for training deep learning models. To expand the application of $color{green}{text{Green AI}}$, we advocate for a shift in the design of deep learning models, by considering the trade-off between energy efficiency and accuracy.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115647931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A Case Study on AI Engineering Practices: Developing an Autonomous Stock Trading System
Pub Date: 2023-03-23 · DOI: 10.1109/CAIN58948.2023.00032
M. Grote, J. Bogner
Today, many systems use artificial intelligence (AI) to solve complex problems. While this often increases system effectiveness, developing a production-ready AI-based system is a difficult task. Thus, solid AI engineering practices are required to ensure the quality of the resulting system and to improve the development process. While several practices have already been proposed for the development of AI-based systems, detailed practical experiences of applying these practices are rare. In this paper, we aim to address this gap by collecting such experiences during a case study, namely the development of an autonomous stock trading system that uses machine learning functionality to invest in stocks. We selected 10 AI engineering practices from the literature and systematically applied them during development, with the goal of collecting evidence about their applicability and effectiveness. Using structured field notes, we documented our experiences. Furthermore, we also used field notes to document challenges that occurred during development and the solutions we applied to overcome them. Afterwards, we analyzed the collected field notes and evaluated how each practice improved the development. Lastly, we compared our evidence with the existing literature. Most applied practices improved our system, albeit to varying extents, and we were able to overcome all major challenges. The qualitative results provide detailed accounts of 10 AI engineering practices, as well as of the challenges and solutions associated with such a project. Our experiences therefore enrich the emerging body of evidence in this field, which may be especially helpful for practitioner teams new to AI engineering.
{"title":"A Case Study on AI Engineering Practices: Developing an Autonomous Stock Trading System","authors":"M. Grote, J. Bogner","doi":"10.1109/CAIN58948.2023.00032","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00032","url":null,"abstract":"Today, many systems use artificial intelligence (AI) to solve complex problems. While this often increases system effectiveness, developing a production-ready AI-based system is a difficult task. Thus, solid AI engineering practices are required to ensure the quality of the resulting system and to improve the development process. While several practices have already been proposed for the development of AI-based systems, detailed practical experiences of applying these practices are rare.In this paper, we aim to address this gap by collecting such experiences during a case study, namely the development of an autonomous stock trading system that uses machine learning functionality to invest in stocks. We selected 10 AI engineering practices from the literature and systematically applied them during development, with the goal to collect evidence about their applicability and effectiveness. Using structured field notes, we documented our experiences. Furthermore, we also used field notes to document challenges that occurred during the development, and the solutions we applied to overcome them. Afterwards, we analyzed the collected field notes, and evaluated how each practice improved the development. Lastly, we compared our evidence with existing literature.Most applied practices improved our system, albeit to varying extent, and we were able to overcome all major challenges. The qualitative results provide detailed accounts about 10 AI engineering practices, as well as challenges and solutions associated with such a project. Our experiences therefore enrich the emerging body of evidence in this field, which may be especially helpful for practitioner teams new to AI engineering.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123071325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Design Patterns for AI-based Systems: A Multivocal Literature Review and Pattern Repository
Pub Date: 2023-03-23 · DOI: 10.1109/CAIN58948.2023.00035
Lukas Heiland, Marius Hauser, J. Bogner
Systems with artificial intelligence components, so-called AI-based systems, have gained considerable attention recently. However, many organizations struggle to achieve production readiness with such systems. As a means to improve certain software quality attributes and to address frequently occurring problems, design patterns represent proven solution blueprints. While new patterns for AI-based systems are emerging, existing patterns have also been adapted to this new context. The goal of this study is to provide an overview of design patterns for AI-based systems, both new and adapted ones. We want to collect and categorize patterns, and make them accessible to researchers and practitioners. To this end, we first performed a multivocal literature review (MLR) to collect design patterns used with AI-based systems. We then integrated the resulting pattern collection into a web-based pattern repository to make the patterns browsable and easy to find. As a result, we selected 51 resources (35 white and 16 gray ones), from which we extracted 70 unique patterns used for AI-based systems. Among these are 34 new patterns and 36 traditional ones that have been adapted to this context. Popular pattern categories include architecture (25 patterns), deployment (16), implementation (9), and security & safety (9). While some patterns with four or more mentions already seem established, the majority of patterns (51) have been mentioned only once or twice. Our results in this emerging field can be used by researchers as a foundation for follow-up studies and by practitioners to discover relevant patterns for informing the design of AI-based systems.
{"title":"Design Patterns for AI-based Systems: A Multivocal Literature Review and Pattern Repository","authors":"Lukas Heiland, Marius Hauser, J. Bogner","doi":"10.1109/CAIN58948.2023.00035","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00035","url":null,"abstract":"Systems with artificial intelligence components, so-called AI-based systems, have gained considerable attention recently. However, many organizations have issues with achieving production readiness with such systems. As a means to improve certain software quality attributes and to address frequently occurring problems, design patterns represent proven solution blueprints. While new patterns for AI-based systems are emerging, existing patterns have also been adapted to this new context.The goal of this study is to provide an overview of design patterns for AI-based systems, both new and adapted ones. We want to collect and categorize patterns, and make them accessible for researchers and practitioners. To this end, we first performed a multivocal literature review (MLR) to collect design patterns used with AI-based systems. We then integrated the created pattern collection into a web-based pattern repository to make the patterns browsable and easy to find.As a result, we selected 51 resources (35 white and 16 gray ones), from which we extracted 70 unique patterns used for AI-based systems. Among these are 34 new patterns and 36 traditional ones that have been adapted to this context. Popular pattern categories include architecture (25 patterns), deployment (16), implementation (9), or security & safety (9). While some patterns with four or more mentions already seem established, the majority of patterns have only been mentioned once or twice (51 patterns). Our results in this emerging field can be used by researchers as a foundation for follow-up studies and by practitioners to discover relevant patterns for informing the design of AI-based systems.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"99 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123177115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform
Pub Date: 2023-03-23 · DOI: 10.1109/CAIN58948.2023.00029
P. Heck, Gerard Schouten
For an AI solution to evolve from a trained machine learning model into a production-ready AI system, many more things need to be considered than just the performance of the machine learning model. A production-ready AI system needs to be trustworthy, i.e. of high quality. But how can this be determined in practice? For traditional software, ISO 25000 and its predecessors have long been used to define and measure quality characteristics. Recently, quality models for AI systems, based on ISO 25000, have been introduced. This paper applies one such quality model to a real-life case study: a deep learning platform for monitoring wildflowers. The paper presents three realistic scenarios sketching what it means to use, extend, and incrementally improve the deep learning platform for wildflower identification and counting. Next, it shows how the quality model can be used as a structured dictionary to define quality requirements for data, model, and software. It remains future work to extend the quality model with metrics, tools, and best practices that aid AI engineering practitioners in implementing trustworthy AI systems.
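As a sketch of what using a quality model as a "structured dictionary" for requirements can look like, the hypothetical example below records one verifiable requirement per component (data, model, software); the characteristics, thresholds, and metrics are invented for illustration and are not taken from the paper or the ISO 25000 series.

```python
# Hypothetical sketch: quality requirements keyed by component and characteristic.
from dataclasses import dataclass


@dataclass
class QualityRequirement:
    component: str       # "data", "model", or "software"
    characteristic: str  # quality characteristic taken from the quality model
    requirement: str     # concrete, verifiable statement
    metric: str          # how the requirement will be measured

requirements = [
    QualityRequirement("data", "completeness",
                       "Each wildflower class has >= 500 labelled images",
                       "images per class"),
    QualityRequirement("model", "functional correctness",
                       "Top-1 identification accuracy >= 0.90 on the held-out test set",
                       "accuracy"),
    QualityRequirement("software", "maintainability",
                       "A new species can be added without retraining unrelated components",
                       "change impact"),
]

for req in requirements:
    print(f"[{req.component}/{req.characteristic}] {req.requirement} (metric: {req.metric})")
```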
{"title":"Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform","authors":"P. Heck, Gerard Schouten","doi":"10.1109/CAIN58948.2023.00029","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00029","url":null,"abstract":"For an AI solution to evolve from a trained machine learning model into a production-ready AI system, many more things need to be considered than just the performance of the machine learning model. A production-ready AI system needs to be trustworthy, i.e. of high quality. But how to determine this in practiceƒ For traditional software, ISO25000 and its predecessors have since long time been used to define and measure quality characteristics. Recently, quality models for AI systems, based on ISO25000, have been introduced. This paper applies one such quality model to a real-life case study: a deep learning platform for monitoring wildflowers. The paper presents three realistic scenarios sketching what it means to respectively use, extend and incrementally improve the deep learning platform for wildflower identification and counting. Next, it is shown how the quality model can be used as a structured dictionary to define quality requirements for data, model and software. Future work remains to extend the quality model with metrics, tools and best practices to aid AI engineering practitioners in implementing trustworthy AI systems.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"42 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131205550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Prevalence of Code Smells in Reinforcement Learning Projects
Pub Date: 2023-03-17 · DOI: 10.1109/CAIN58948.2023.00013
Nicolás Cardozo, Ivana Dusparic, Christian Cabrera
Reinforcement Learning (RL) is increasingly used to learn and adapt application behavior in many domains, including large-scale and safety-critical systems such as autonomous driving. With the advent of plug-and-play RL libraries, its applicability has further increased, enabling users to integrate RL algorithms into their own code. We note, however, that the majority of such code is not developed by RL engineers, which may lead to poor program quality, yielding bugs, suboptimal performance, and maintainability and evolution problems for RL-based projects. In this paper we begin exploring this hypothesis for code that uses RL, analyzing different projects found in the wild to assess their quality from a software engineering perspective. Our study includes 24 popular RL-based Python projects, analyzed with standard software engineering metrics. Our results, aligned with similar analyses of ML code in general, show that popular and widely reused RL repositories contain many code smells (3.95% of the code base on average), significantly affecting the projects' maintainability. The most common code smells detected are long method and long method chain, highlighting problems in the definition and interaction of agents. The detected code smells suggest problems with the separation of responsibilities and with the appropriateness of current abstractions for defining RL algorithms.
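The "long method" smell mentioned above is straightforward to screen for; a minimal sketch using only Python's standard ast module (3.8+ for `end_lineno`) follows. This is an illustration, not the detection tooling used in the study, and the 38-line threshold is an arbitrary assumption.

```python
# Sketch: flag the "long method" code smell by measuring the line span of each
# function definition in a Python source file. Standard library only.
import ast
import sys

LONG_METHOD_THRESHOLD = 38  # assumed cut-off; tune to your team's convention


def long_methods(source: str):
    """Yield (name, lineno, length) for functions spanning many source lines."""
    tree = ast.parse(source)
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            length = (node.end_lineno or node.lineno) - node.lineno + 1
            if length > LONG_METHOD_THRESHOLD:
                yield node.name, node.lineno, length


if __name__ == "__main__":
    # Usage: python smells.py agent.py trainer.py ...
    for path in sys.argv[1:]:
        with open(path, encoding="utf-8") as f:
            for name, lineno, length in long_methods(f.read()):
                print(f"{path}:{lineno}: long method '{name}' ({length} lines)")
```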
{"title":"Prevalence of Code Smells in Reinforcement Learning Projects","authors":"Nicolás Cardozo, Ivana Dusparic, Christian Cabrera","doi":"10.1109/CAIN58948.2023.00013","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00013","url":null,"abstract":"Reinforcement Learning (RL) is being increasingly used to learn and adapt application behavior in many domains, including large-scale and safety critical systems, as for example, autonomous driving. With the advent of plug-n-play RL libraries, its applicability has further increased, enabling integration of RL algorithms by users. We note, however, that the majority of such code is not developed by RL engineers, which as a consequence, may lead to poor program quality yielding bugs, suboptimal performance, maintainability, and evolution problems for RL-based projects. In this paper we begin the exploration of this hypothesis, specific to code utilizing RL, analyzing different projects found in the wild, to assess their quality from a software engineering perspective. Our study includes 24 popular RL-based Python projects, analyzed with standard software engineering metrics. Our results, aligned with similar analyses for ML code in general, show that popular and widely reused RL repositories contain many code smells (3.95% of the code base on average), significantly affecting the projects’ maintainability. The most common code smells detected are long method and long method chain, highlighting problems in the definition and interaction of agents. Detected code smells suggest problems in responsibility separation, and the appropriateness of current abstractions for the definition of RL algorithms.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"291 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114837337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dataflow graphs as complete causal graphs
Pub Date: 2023-03-16 · DOI: 10.1109/CAIN58948.2023.00010
Andrei Paleyes, Siyuan Guo, B. Scholkopf, Neil D. Lawrence
Component-based development is one of the core principles behind modern software engineering practices. An understanding of the causal relationships between the components of a software system can yield significant benefits to developers. Yet modern software design approaches make it difficult to track and discover such relationships at system scale, which leads to growing intellectual debt. In this paper we consider an alternative approach to software design, flow-based programming (FBP), and draw the community's attention to the connection between the dataflow graphs produced by FBP and structural causal models. With expository examples we show how this connection can be leveraged to improve day-to-day tasks in software projects, including fault localisation, business analysis and experimentation.
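An expository sketch of the core idea, using invented component names and assuming the networkx package: if edges encode where data flows, they also encode causal influence, so a fault observed at one component can only originate in that component or its graph ancestors, and fault localisation reduces to an ancestor query.

```python
# Sketch: treat a flow-based program's dataflow graph as a causal graph and use
# it for fault localisation. Component names are hypothetical.
import networkx as nx

# Edges point in the direction data flows between components.
dataflow = nx.DiGraph([
    ("ingest", "validate"),
    ("validate", "features"),
    ("features", "model"),
    ("pricing_rules", "model"),
    ("model", "report"),
])

faulty_component = "model"
# Causal reading: candidate root causes are the faulty node plus everything upstream.
candidates = nx.ancestors(dataflow, faulty_component) | {faulty_component}
print(f"Fault at '{faulty_component}'; inspect: {sorted(candidates)}")
# Components outside this set (here, 'report') can be ruled out for this fault.
```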
{"title":"Dataflow graphs as complete causal graphs","authors":"Andrei Paleyes, Siyuan Guo, B. Scholkopf, Neil D. Lawrence","doi":"10.1109/CAIN58948.2023.00010","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00010","url":null,"abstract":"Component-based development is one of the core principles behind modern software engineering practices. Understanding of causal relationships between components of a software system can yield significant benefits to developers. Yet modern software design approaches make it difficult to track and discover such relationships at system scale, which leads to growing intellectual debt. In this paper we consider an alternative approach to software design, flow-based programming (FBP), and draw the attention of the community to the connection between dataflow graphs produced by FBP and structural causal models. With expository examples we show how this connection can be leveraged to improve day-to-day tasks in software projects, including fault localisation, business analysis and experimentation.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114728744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Automotive Perception Software Development: An Empirical Investigation into Data, Annotation, and Ecosystem Challenges
Pub Date: 2023-03-10 · DOI: 10.1109/CAIN58948.2023.00011
Hans-Martin Heyn, K. M. Habibullah, E. Knauss, Jennifer Horkoff, Markus Borg, Alessia Knauss, Polly Jing Li
Software that contains machine learning algorithms is an integral part of automotive perception, for example, in driving automation systems. The development of such software, specifically the training and validation of the machine learning components, requires large annotated datasets. An industry of data and annotation services has emerged to serve the development of such data-intensive automotive software components. Widespread difficulties in specifying data and annotation needs challenge collaborations between OEMs (Original Equipment Manufacturers) and their suppliers of software components, data, and annotations. This paper investigates why practitioners in the Swedish automotive industry find it difficult to arrive at clear specifications for data and annotations. The results from an interview study show that a lack of effective metrics for data quality aspects, ambiguities in the way of working, unclear definitions of annotation quality, and deficits in the business ecosystems are causes of the difficulty in deriving such specifications. We provide a list of recommendations that can mitigate these challenges when deriving specifications, and we propose future research opportunities to overcome them. Our work contributes to the ongoing research on the accountability of machine learning as applied to complex software systems, especially for high-stakes applications such as automated driving.
{"title":"Automotive Perception Software Development: An Empirical Investigation into Data, Annotation, and Ecosystem Challenges","authors":"Hans-Martin Heyn, K. M. Habibullah, E. Knauss, Jennifer Horkoff, Markus Borg, Alessia Knauss, Polly Jing Li","doi":"10.1109/CAIN58948.2023.00011","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00011","url":null,"abstract":"Software that contains machine learning algorithms is an integral part of automotive perception, for example, in driving automation systems. The development of such software, specifically the training and validation of the machine learning components, requires large annotated datasets. An industry of data and annotation services has emerged to serve the development of such data-intensive automotive software components. Wide-spread difficulties to specify data and annotation needs challenge collaborations between OEMs (Original Equipment Manufacturers) and their suppliers of software components, data, and annotations.This paper investigates the reasons for these difficulties for practitioners in the Swedish automotive industry to arrive at clear specifications for data and annotations. The results from an interview study show that a lack of effective metrics for data quality aspects, ambiguities in the way of working, unclear definitions of annotation quality, and deficits in the business ecosystems are causes for the difficulty in deriving the specifications. We provide a list of recommendations that can mitigate challenges when deriving specifications and we propose future research opportunities to overcome these challenges. Our work contributes towards the on-going research on accountability of machine learning as applied to complex software systems, especially for high-stake applications such as automated driving.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131332223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}