首页 > 最新文献

2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)最新文献

英文 中文
Replay-Driven Continual Learning for the Industrial Internet of Things 工业物联网的重放驱动持续学习
Sagar Sen, Simon Myklebust Nielsen, E. J. Husom, Arda Goknil, Simeon Tverdal, Leonardo Sastoque Pinilla
The Industrial Internet of Things (IIoT) leverages thousands of interconnected sensors and computing devices to monitor and control large and complex industrial processes. Machine learning (ML) applications in IIoT use data acquired from multiple sensors to perform tasks such as predictive maintenance. While remembering useful learning from the past, these applications need to adapt learning for evolving sensor data stemming from changes in industrial processes and environmental conditions. This paper presents a continual learning pipeline to learn from the evolving data while replaying selected parts of the old data. The pipeline is configured to produce ML experiences (e.g., training a baseline neural network model), improve the baseline model with the new data while replaying part of the old data, and infer/predict using a specific model version given a stream of IIoT sensor data. We have evaluated our approach from an AI Engineering perspective using three industrial case studies, i.e., predicting tool wear, remaining useful lifetime, and anomalies from sensor data acquired from CNC machining and broaching operations. Our results show that configuring experiences for replay-driven continual learning allows dynamic maintenance of ML performance on evolving data while minimizing the excessive accumulation of legacy sensor data.
工业物联网(IIoT)利用数千个相互连接的传感器和计算设备来监控和控制大型复杂的工业过程。工业物联网中的机器学习(ML)应用程序使用从多个传感器获取的数据来执行预测性维护等任务。在记住过去有用的学习经验的同时,这些应用需要适应不断变化的传感器数据,这些数据来自工业过程和环境条件的变化。本文提出了一个持续的学习管道,从不断发展的数据中学习,同时重播旧数据的选定部分。该管道被配置为产生机器学习体验(例如,训练基线神经网络模型),在重播部分旧数据的同时使用新数据改进基线模型,并在给定IIoT传感器数据流的情况下使用特定模型版本进行推断/预测。我们通过三个工业案例研究,从人工智能工程的角度评估了我们的方法,即预测刀具磨损、剩余使用寿命以及从CNC加工和拉削操作中获得的传感器数据的异常情况。我们的研究结果表明,为重放驱动的持续学习配置经验,可以在不断发展的数据上动态维护机器学习性能,同时最大限度地减少遗留传感器数据的过度积累。
{"title":"Replay-Driven Continual Learning for the Industrial Internet of Things","authors":"Sagar Sen, Simon Myklebust Nielsen, E. J. Husom, Arda Goknil, Simeon Tverdal, Leonardo Sastoque Pinilla","doi":"10.1109/CAIN58948.2023.00014","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00014","url":null,"abstract":"The Industrial Internet of Things (IIoT) leverages thousands of interconnected sensors and computing devices to monitor and control large and complex industrial processes. Machine learning (ML) applications in IIoT use data acquired from multiple sensors to perform tasks such as predictive maintenance. While remembering useful learning from the past, these applications need to adapt learning for evolving sensor data stemming from changes in industrial processes and environmental conditions. This paper presents a continual learning pipeline to learn from the evolving data while replaying selected parts of the old data. The pipeline is configured to produce ML experiences (e.g., training a baseline neural network model), improve the baseline model with the new data while replaying part of the old data, and infer/predict using a specific model version given a stream of IIoT sensor data. We have evaluated our approach from an AI Engineering perspective using three industrial case studies, i.e., predicting tool wear, remaining useful lifetime, and anomalies from sensor data acquired from CNC machining and broaching operations. Our results show that configuring experiences for replay-driven continual learning allows dynamic maintenance of ML performance on evolving data while minimizing the excessive accumulation of legacy sensor data.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126628048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards Code Generation from BDD Test Case Specifications: A Vision 从BDD测试用例规范走向代码生成:远景
Leon Chemnitz, David Reichenbach, Hani Aldebes, Mariam Naveed, Krishna Narasimhan, M. Mezini
Automatic code generation has recently attracted large attention and is becoming more significant to the software development process. Solutions based on Machine Learning and Artificial Intelligence are being used to increase human and software efficiency in potent and innovative ways. In this paper, we aim to leverage these developments and introduce a novel approach to generating frontend component code for the popular Angular framework. We propose to do this using behavior-driven development test specifications as input to a transformer-based machine learning model; however, we do not provide any proof-of-concept solution in this work. Our approach aims to drastically reduce the development time needed for web applications while potentially increasing software quality and introducing new research ideas toward automatic code generation.
自动代码生成最近引起了广泛的关注,并且在软件开发过程中变得越来越重要。基于机器学习和人工智能的解决方案正被用于以有效和创新的方式提高人和软件的效率。在本文中,我们的目标是利用这些发展,并介绍一种为流行的Angular框架生成前端组件代码的新方法。我们建议使用行为驱动的开发测试规范作为基于变压器的机器学习模型的输入;然而,我们在这项工作中没有提供任何概念验证解决方案。我们的方法旨在大幅减少web应用程序所需的开发时间,同时潜在地提高软件质量,并为自动代码生成引入新的研究思路。
{"title":"Towards Code Generation from BDD Test Case Specifications: A Vision","authors":"Leon Chemnitz, David Reichenbach, Hani Aldebes, Mariam Naveed, Krishna Narasimhan, M. Mezini","doi":"10.1109/CAIN58948.2023.00031","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00031","url":null,"abstract":"Automatic code generation has recently attracted large attention and is becoming more significant to the software development process. Solutions based on Machine Learning and Artificial Intelligence are being used to increase human and software efficiency in potent and innovative ways. In this paper, we aim to leverage these developments and introduce a novel approach to generating frontend component code for the popular Angular framework. We propose to do this using behavior-driven development test specifications as input to a transformer-based machine learning model; however, we do not provide any proof-of-concept solution in this work. Our approach aims to drastically reduce the development time needed for web applications while potentially increasing software quality and introducing new research ideas toward automatic code generation.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130416523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Reproducibility Requires Consolidated Artifacts 再现性需要统一的工件
Iordanis Fostiropoulos, Bowman Brown, L. Itti
Machine learning is facing a ‘reproducibility crisis’ where a significant number of works report failures when attempting to reproduce previously published results. We evaluate the sources of reproducibility failures using a meta-analysis of 142 replication studies from ReScience C and 204 code repositories. We find that missing experiment details such as hyperparameters are potential causes of unreproducibility. We experimentally show the bias of different hyperparameter selection strategies and conclude that consolidated artifacts with a unified framework can help support reproducibility.
机器学习正面临着“可重复性危机”,在试图重现先前发表的结果时,大量工作报告失败。我们通过对来自ReScience C和204个代码库的142个复制研究的荟萃分析来评估可重复性失败的来源。我们发现缺少实验细节,如超参数是不可重复性的潜在原因。我们通过实验证明了不同超参数选择策略的偏差,并得出结论,统一框架下的整合工件有助于支持再现性。
{"title":"Reproducibility Requires Consolidated Artifacts","authors":"Iordanis Fostiropoulos, Bowman Brown, L. Itti","doi":"10.1109/CAIN58948.2023.00025","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00025","url":null,"abstract":"Machine learning is facing a ‘reproducibility crisis’ where a significant number of works report failures when attempting to reproduce previously published results. We evaluate the sources of reproducibility failures using a meta-analysis of 142 replication studies from ReScience C and 204 code repositories. We find that missing experiment details such as hyperparameters are potential causes of unreproducibility. We experimentally show the bias of different hyperparameter selection strategies and conclude that consolidated artifacts with a unified framework can help support reproducibility.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"320 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123692519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Understanding Machine Learning Testing in Practise 在实践中理解机器学习测试
Arumoy Shome, Luís Cruz, A. Deursen
Visualisations drive all aspects of the Machine Learning (ML) Development Cycle but remain a vastly untapped resource by the research community. ML testing is a highly interactive and cognitive process which demands a human-in-the-loop approach. Besides writing tests for the code base, bulk of the evaluation requires application of domain expertise to generate and interpret visualisations. To gain a deeper insight into the process of testing ML systems, we propose to study visualisations of ML pipelines by mining Jupyter notebooks. We propose a two prong approach in conducting the analysis. First, gather general insights and trends using a qualitative study of a smaller sample of notebooks. And then use the knowledge gained from the qualitative study to design an empirical study using a larger sample of notebooks. Computational notebooks provide a rich source of information in three formats—text, code and images. We hope to utilise existing work in image analysis and Natural Language Processing for text and code, to analyse the information present in notebooks. We hope to gain a new perspective into program comprehension and debugging in the context of ML testing.
可视化驱动机器学习(ML)开发周期的各个方面,但仍然是研究界尚未开发的资源。机器学习测试是一个高度互动和认知的过程,需要一个人在循环的方法。除了为代码库编写测试之外,大部分评估还需要应用领域专家来生成和解释可视化。为了更深入地了解机器学习系统的测试过程,我们建议通过挖掘Jupyter笔记本来研究机器学习管道的可视化。我们建议采用双管齐下的方法进行分析。首先,通过对较小的笔记本样本进行定性研究,收集总体见解和趋势。然后利用从定性研究中获得的知识,设计一个使用更大样本的笔记本的实证研究。计算笔记本以三种格式提供了丰富的信息源——文本、代码和图像。我们希望利用现有的图像分析和文本和代码的自然语言处理工作来分析笔记本中的信息。我们希望在机器学习测试的背景下对程序理解和调试获得一个新的视角。
{"title":"Towards Understanding Machine Learning Testing in Practise","authors":"Arumoy Shome, Luís Cruz, A. Deursen","doi":"10.1109/CAIN58948.2023.00028","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00028","url":null,"abstract":"Visualisations drive all aspects of the Machine Learning (ML) Development Cycle but remain a vastly untapped resource by the research community. ML testing is a highly interactive and cognitive process which demands a human-in-the-loop approach. Besides writing tests for the code base, bulk of the evaluation requires application of domain expertise to generate and interpret visualisations. To gain a deeper insight into the process of testing ML systems, we propose to study visualisations of ML pipelines by mining Jupyter notebooks. We propose a two prong approach in conducting the analysis. First, gather general insights and trends using a qualitative study of a smaller sample of notebooks. And then use the knowledge gained from the qualitative study to design an empirical study using a larger sample of notebooks. Computational notebooks provide a rich source of information in three formats—text, code and images. We hope to utilise existing work in image analysis and Natural Language Processing for text and code, to analyse the information present in notebooks. We hope to gain a new perspective into program comprehension and debugging in the context of ML testing.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134599940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tenet: A Flexible Framework for Machine-Learning-based Vulnerability Detection 宗旨:基于机器学习的漏洞检测的灵活框架
Eduard Pinconschi, Sofia Reis, Chi Zhang, Rui Abreu, H. Erdogmus, C. Pasareanu, Limin Jia
Software vulnerability detection (SVD) aims to identify potential security weaknesses in software. SVD systems have been rapidly evolving from those being based on testing, static analysis, and dynamic analysis to those based on machine learning (ML). Many ML-based approaches have been proposed, but challenges remain: training and testing datasets contain duplicates, and building customized end-to-end pipelines for SVD is time-consuming. We present Tenet, a modular framework for building end-to-end, customizable, reusable, and automated pipelines through a plugin-based architecture that supports SVD for several deep learning (DL) and basic ML models. We demonstrate the applicability of Tenet by building practical pipelines performing SVD on real-world vulnerabilities.
软件漏洞检测(SVD)旨在识别软件中潜在的安全漏洞。SVD系统已经从基于测试、静态分析和动态分析的系统迅速发展到基于机器学习(ML)的系统。已经提出了许多基于ml的方法,但挑战仍然存在:训练和测试数据集包含重复,并且为SVD构建定制的端到端管道非常耗时。我们提出了Tenet,这是一个模块化框架,用于通过基于插件的架构构建端到端、可定制、可重用和自动化的管道,该架构支持用于几种深度学习(DL)和基本ML模型的SVD。我们通过构建在现实世界的漏洞上执行SVD的实际管道来演示Tenet的适用性。
{"title":"Tenet: A Flexible Framework for Machine-Learning-based Vulnerability Detection","authors":"Eduard Pinconschi, Sofia Reis, Chi Zhang, Rui Abreu, H. Erdogmus, C. Pasareanu, Limin Jia","doi":"10.1109/CAIN58948.2023.00026","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00026","url":null,"abstract":"Software vulnerability detection (SVD) aims to identify potential security weaknesses in software. SVD systems have been rapidly evolving from those being based on testing, static analysis, and dynamic analysis to those based on machine learning (ML). Many ML-based approaches have been proposed, but challenges remain: training and testing datasets contain duplicates, and building customized end-to-end pipelines for SVD is time-consuming. We present Tenet, a modular framework for building end-to-end, customizable, reusable, and automated pipelines through a plugin-based architecture that supports SVD for several deep learning (DL) and basic ML models. We demonstrate the applicability of Tenet by building practical pipelines performing SVD on real-world vulnerabilities.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124034519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
AI Living Lab: Quality Assurance for AI-based Health systems 人工智能生活实验室:基于人工智能的卫生系统的质量保证
Valentina Lenarduzzi, M. Isomursu
The main goal of this project is to develop an AI Living Lab providing methods and software tools for AI trustworthiness analysis, running digital twins to simulate Digital Health solutions (Hardware and Software) integrated with AI elements in vitro for early-stage validation experiments. In this paper, we present the motivation beyond the need of a AI Living Lab methods for researchers and companies, our idea in practice, and the scheduled roadmap. The insights of the AI Living Lab can enable researchers to understand possible problems on the quality of AI-enabled systems opening new research topics and allows companies to understand how to better address quality issues in their systems.
该项目的主要目标是开发一个人工智能生活实验室,为人工智能可信度分析提供方法和软件工具,运行数字双胞胎,模拟与人工智能元素集成的数字健康解决方案(硬件和软件),用于早期验证实验。在本文中,我们提出了人工智能生活实验室的动机,研究人员和公司的方法,我们的想法在实践中,以及计划的路线图。人工智能生活实验室的见解可以使研究人员了解人工智能系统的质量可能存在的问题,开辟新的研究课题,并使企业了解如何更好地解决其系统中的质量问题。
{"title":"AI Living Lab: Quality Assurance for AI-based Health systems","authors":"Valentina Lenarduzzi, M. Isomursu","doi":"10.1109/CAIN58948.2023.00018","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00018","url":null,"abstract":"The main goal of this project is to develop an AI Living Lab providing methods and software tools for AI trustworthiness analysis, running digital twins to simulate Digital Health solutions (Hardware and Software) integrated with AI elements in vitro for early-stage validation experiments. In this paper, we present the motivation beyond the need of a AI Living Lab methods for researchers and companies, our idea in practice, and the scheduled roadmap. The insights of the AI Living Lab can enable researchers to understand possible problems on the quality of AI-enabled systems opening new research topics and allows companies to understand how to better address quality issues in their systems.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124591816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extensible Modeling Framework for Reliable Machine Learning System Analysis 可靠机器学习系统分析的可扩展建模框架
Jati H. Husen, H. Washizaki, H. Tun, Nobukazu Yoshioka, Y. Fukazawa, Hironori Takeuchi, Hiroshi Tanaka, Kazuki Munakata
Machine learning system analysis requires different approaches for each different task and domain. Selecting a proper set of analytic models can be challenging for a specific problem. This paper discusses the extensibility of the Multi-View Modeling Framework for ML Systems approach using process mapping and extensible metamodel. We conducted a case study to evaluate the feasibility of such extensibility by extending the approach to facilitate an activity-driven analysis for an optical character recognition system. Based on the result of the case study, we found that Multi-View Modeling Framework for ML Systems is likely to be extensible.
机器学习系统分析需要针对不同的任务和领域采用不同的方法。对于特定的问题,选择一组合适的分析模型可能具有挑战性。本文利用过程映射和可扩展元模型讨论了机器学习系统多视图建模框架方法的可扩展性。我们进行了一个案例研究,通过扩展该方法来促进光学字符识别系统的活动驱动分析,以评估这种可扩展性的可行性。基于案例研究的结果,我们发现机器学习系统的多视图建模框架可能是可扩展的。
{"title":"Extensible Modeling Framework for Reliable Machine Learning System Analysis","authors":"Jati H. Husen, H. Washizaki, H. Tun, Nobukazu Yoshioka, Y. Fukazawa, Hironori Takeuchi, Hiroshi Tanaka, Kazuki Munakata","doi":"10.1109/CAIN58948.2023.00022","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00022","url":null,"abstract":"Machine learning system analysis requires different approaches for each different task and domain. Selecting a proper set of analytic models can be challenging for a specific problem. This paper discusses the extensibility of the Multi-View Modeling Framework for ML Systems approach using process mapping and extensible metamodel. We conducted a case study to evaluate the feasibility of such extensibility by extending the approach to facilitate an activity-driven analysis for an optical character recognition system. Based on the result of the case study, we found that Multi-View Modeling Framework for ML Systems is likely to be extensible.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115902975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
safe.trAIn – Engineering and Assurance of a Driverless Regional Train 安全的。列车——无人驾驶区域列车的工程与保障
M. Zeller, M. Rothfelder, C. Klein
Traditional automation technologies alone are not sufficient to enable the fully automated operation of trains. However, Artificial Intelligence (AI) and Machine Learning (ML) offers great potential to realize the mandatory novel functions to replace the tasks of a human train driver, such as obstacle detection on the tracks. The problem, which still remains unresolved, is to find a practical way to link AI/ML techniques with the requirements and approval processes that are applied in the railway domain. The safe.trAIn project aims to lay the foundation for the safe use of AI/ML to achieve the driverless operation of rail vehicles and thus addresses this key technological challenge hindering the adoption of unmanned rail transport. The project goals are to develop guidelines and methods for the reliable engineering and safety assurance of ML in the railway domain. Therefore, the project investigates methods to reliable design ML models and to prove the trustworthiness of AI-based functions taking robustness, uncertainty, and transparency aspects of the AI/ML model into account.
仅靠传统的自动化技术还不足以实现列车的全自动运行。然而,人工智能(AI)和机器学习(ML)提供了巨大的潜力,可以实现强制性的新功能,以取代人类火车司机的任务,例如轨道上的障碍物检测。目前仍未解决的问题是,找到一种实用的方法,将AI/ML技术与铁路领域应用的需求和审批流程联系起来。保险箱里。trAIn项目旨在为安全使用AI/ML实现轨道车辆无人驾驶运营奠定基础,从而解决这一阻碍无人轨道交通采用的关键技术挑战。该项目的目标是为铁路领域机器学习的可靠工程和安全保证制定指导方针和方法。因此,该项目研究了可靠设计ML模型的方法,并在考虑到AI/ML模型的鲁棒性、不确定性和透明度方面的情况下,证明基于AI的函数的可信度。
{"title":"safe.trAIn – Engineering and Assurance of a Driverless Regional Train","authors":"M. Zeller, M. Rothfelder, C. Klein","doi":"10.1109/CAIN58948.2023.00036","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00036","url":null,"abstract":"Traditional automation technologies alone are not sufficient to enable the fully automated operation of trains. However, Artificial Intelligence (AI) and Machine Learning (ML) offers great potential to realize the mandatory novel functions to replace the tasks of a human train driver, such as obstacle detection on the tracks. The problem, which still remains unresolved, is to find a practical way to link AI/ML techniques with the requirements and approval processes that are applied in the railway domain. The safe.trAIn project aims to lay the foundation for the safe use of AI/ML to achieve the driverless operation of rail vehicles and thus addresses this key technological challenge hindering the adoption of unmanned rail transport. The project goals are to develop guidelines and methods for the reliable engineering and safety assurance of ML in the railway domain. Therefore, the project investigates methods to reliable design ML models and to prove the trustworthiness of AI-based functions taking robustness, uncertainty, and transparency aspects of the AI/ML model into account.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114928547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Trustworthy and Robust AI Deployment by Design: A framework to inject best practice support into AI deployment pipelines 可信赖且稳健的AI部署设计:将最佳实践支持注入AI部署管道的框架
András Schmelczer, Joost Visser
Trustworthy and robust deployment of AI applications requires adherence to a range of AI engineering best practices. But, while professionals already have access to frameworks for deploying AI, case studies and developer surveys have found that many deployments do not follow best practices.We hypothesize that the adoption of AI deployment best practices can be improved by finding less complex framework designs that combine ease of use with built-in support for best practices. To investigate this hypothesis, we applied a design science approach to develop a new framework, called GreatAI, and evaluate its ease of use and best practice support.The initial design focusses on the domain of natural language processing (NLP), but with generalisation in mind. To assess applicability and generalisability, we conducted interviews with ten practitioners. We also assessed best practice coverage.We found that our framework helps implement 33 best practices through an accessible interface. These target the transition from prototype to production phase in the AI development lifecycle. Feedback from professional data scientists and software engineers showed that ease of use and functionality are equally important in deciding to adopt deployment technologies, and the proposed framework was rated positively in both dimensions.
值得信赖和强大的人工智能应用程序部署需要遵守一系列人工智能工程最佳实践。但是,尽管专业人士已经可以访问部署人工智能的框架,但案例研究和开发人员调查发现,许多部署并没有遵循最佳实践。我们假设,采用人工智能部署最佳实践可以通过寻找不太复杂的框架设计来改进,这些框架设计结合了易用性和对最佳实践的内置支持。为了调查这一假设,我们应用设计科学方法开发了一个名为GreatAI的新框架,并评估了其易用性和最佳实践支持。最初的设计集中在自然语言处理(NLP)领域,但考虑到泛化。为了评估适用性和普遍性,我们对10位从业者进行了访谈。我们还评估了最佳实践覆盖率。我们发现我们的框架通过一个可访问的接口帮助实现了33个最佳实践。这些目标是在AI开发生命周期中从原型到生产阶段的过渡。来自专业数据科学家和软件工程师的反馈表明,在决定采用部署技术时,易用性和功能性同样重要,并且提议的框架在这两个方面都得到了积极的评价。
{"title":"Trustworthy and Robust AI Deployment by Design: A framework to inject best practice support into AI deployment pipelines","authors":"András Schmelczer, Joost Visser","doi":"10.1109/CAIN58948.2023.00030","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00030","url":null,"abstract":"Trustworthy and robust deployment of AI applications requires adherence to a range of AI engineering best practices. But, while professionals already have access to frameworks for deploying AI, case studies and developer surveys have found that many deployments do not follow best practices.We hypothesize that the adoption of AI deployment best practices can be improved by finding less complex framework designs that combine ease of use with built-in support for best practices. To investigate this hypothesis, we applied a design science approach to develop a new framework, called GreatAI, and evaluate its ease of use and best practice support.The initial design focusses on the domain of natural language processing (NLP), but with generalisation in mind. To assess applicability and generalisability, we conducted interviews with ten practitioners. We also assessed best practice coverage.We found that our framework helps implement 33 best practices through an accessible interface. These target the transition from prototype to production phase in the AI development lifecycle. Feedback from professional data scientists and software engineers showed that ease of use and functionality are equally important in deciding to adopt deployment technologies, and the proposed framework was rated positively in both dimensions.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131869154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatically Resolving Data Source Dependency Hell in Large Scale Data Science Projects 自动解决大规模数据科学项目中的数据源依赖地狱
L. Boué, Pratap Kunireddy, Pavle Subotic
Dependency hell is a well-known pain point in the development of large software projects and machine learning (ML) code bases are not immune from it. In fact, ML applications suffer from an additional form of dependency hell, namely, data source dependency hell. This term refers to the central role played by data and its unique quirks that often lead to unexpected failures of ML models which cannot be explained by code changes. In this paper, we present an automated data source dependency mapping framework that allows MLOps engineers to monitor the whole dependency map of their models in a fast paced engineering environment and thus mitigate ahead of time the consequences of any data source changes. Our system is based on a unified and generic approach, employing techniques from static analysis, from which data sources can be identified on a wide range of source artifacts. Our framework is currently deployed within Microsoft and used by Microsoft MLOps engineers in production.
依赖地狱是大型软件项目开发中一个众所周知的痛点,机器学习(ML)代码库也不能幸免。事实上,ML应用程序还遭受着另一种形式的依赖地狱,即数据源依赖地狱。这个术语指的是数据所扮演的核心角色及其独特的怪癖,这些怪癖经常导致机器学习模型的意外故障,而这些故障无法通过代码更改来解释。在本文中,我们提出了一个自动化的数据源依赖映射框架,该框架允许MLOps工程师在快节奏的工程环境中监视其模型的整个依赖映射,从而提前减轻任何数据源更改的后果。我们的系统基于统一和通用的方法,采用来自静态分析的技术,从中可以在广泛的源工件上识别数据源。我们的框架目前部署在Microsoft内部,并由Microsoft MLOps工程师在生产环境中使用。
{"title":"Automatically Resolving Data Source Dependency Hell in Large Scale Data Science Projects","authors":"L. Boué, Pratap Kunireddy, Pavle Subotic","doi":"10.1109/CAIN58948.2023.00009","DOIUrl":"https://doi.org/10.1109/CAIN58948.2023.00009","url":null,"abstract":"Dependency hell is a well-known pain point in the development of large software projects and machine learning (ML) code bases are not immune from it. In fact, ML applications suffer from an additional form of dependency hell, namely, data source dependency hell. This term refers to the central role played by data and its unique quirks that often lead to unexpected failures of ML models which cannot be explained by code changes. In this paper, we present an automated data source dependency mapping framework that allows MLOps engineers to monitor the whole dependency map of their models in a fast paced engineering environment and thus mitigate ahead of time the consequences of any data source changes. Our system is based on a unified and generic approach, employing techniques from static analysis, from which data sources can be identified on a wide range of source artifacts. Our framework is currently deployed within Microsoft and used by Microsoft MLOps engineers in production.","PeriodicalId":175580,"journal":{"name":"2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)","volume":"85 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133749862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1