Toward Futuristic Autonomous Experimentation—A Surprise-Reacting Sequential Experiment Policy

IF 6.4 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2024-10-11 DOI:10.1109/TASE.2024.3474549

Imtiaz Ahmed;Satish T. S. Bukkapatnam;Bhaskar Botcha;Yu Ding

{"title":"Toward Futuristic Autonomous Experimentation—A Surprise-Reacting Sequential Experiment Policy","authors":"Imtiaz Ahmed;Satish T. S. Bukkapatnam;Bhaskar Botcha;Yu Ding","doi":"10.1109/TASE.2024.3474549","DOIUrl":null,"url":null,"abstract":"An autonomous experimentation platform in manufacturing is supposedly capable of conducting a sequential search for finding suitable manufacturing conditions by itself or even for discovering new materials with minimal human intervention. The core of the intelligent control of such platforms is a policy to decide where to conduct the next experiment based on what has been done thus far. Such policy inevitably trades off between exploitation and exploration. Currently, the prevailing approach is to use various acquisition functions in the Bayesian optimization framework. We discuss whether it is beneficial to trade off exploitation versus exploration by measuring the element and degree of surprise associated with the immediate past observation. We devise a surprise-reacting policy using two existing surprise metrics, known as the Shannon surprise and Bayesian surprise. Our analysis shows that the surprise-reacting policy appears to be better suited for quickly characterizing the overall landscape of a response surface under resource constraints. We do not claim that we have a fully autonomous experimentation system but believe that the surprise-reacting capability benefits the automation of sequential decisions in autonomous experimentation.Note to Practitioners Autonomous systems should be able to go beyond repetitive automatic actions that are generally pre-programmed through a recipe. To decide what to do next on the fly differentiates autonomy from automation. Arguably, autonomy is the highest form of automation. To endow a manufacturing with autonomy, one necessary capability is for it to react properly to the “unexpected,” which are those observations disagreeing with its model’s anticipation. Are these bad measurements, an anomaly, or an indicator of model inadequacy? Should the observations be discarded or should the model be updated using the new observation? If latter, should model be updated gradually overtime or radically altered? Figuratively, upon observing the unexpected, we say that a manufacturing control system is “surprised” and ask the question of how it should react. Our investigation shares our current insights on this question.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"7912-7926"},"PeriodicalIF":6.4000,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10714007/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

An autonomous experimentation platform in manufacturing is supposedly capable of conducting a sequential search for finding suitable manufacturing conditions by itself or even for discovering new materials with minimal human intervention. The core of the intelligent control of such platforms is a policy to decide where to conduct the next experiment based on what has been done thus far. Such policy inevitably trades off between exploitation and exploration. Currently, the prevailing approach is to use various acquisition functions in the Bayesian optimization framework. We discuss whether it is beneficial to trade off exploitation versus exploration by measuring the element and degree of surprise associated with the immediate past observation. We devise a surprise-reacting policy using two existing surprise metrics, known as the Shannon surprise and Bayesian surprise. Our analysis shows that the surprise-reacting policy appears to be better suited for quickly characterizing the overall landscape of a response surface under resource constraints. We do not claim that we have a fully autonomous experimentation system but believe that the surprise-reacting capability benefits the automation of sequential decisions in autonomous experimentation.Note to Practitioners Autonomous systems should be able to go beyond repetitive automatic actions that are generally pre-programmed through a recipe. To decide what to do next on the fly differentiates autonomy from automation. Arguably, autonomy is the highest form of automation. To endow a manufacturing with autonomy, one necessary capability is for it to react properly to the “unexpected,” which are those observations disagreeing with its model’s anticipation. Are these bad measurements, an anomaly, or an indicator of model inadequacy? Should the observations be discarded or should the model be updated using the new observation? If latter, should model be updated gradually overtime or radically altered? Figuratively, upon observing the unexpected, we say that a manufacturing control system is “surprised” and ask the question of how it should react. Our investigation shares our current insights on this question.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

面向未来的自主实验--惊喜反应序列实验政策

制造业中的自主实验平台据称能够自行进行顺序搜索，以找到合适的制造条件，甚至可以在最少的人为干预下发现新材料。这种平台的智能控制的核心是一种策略，根据迄今为止所做的工作来决定在哪里进行下一次实验。这种政策不可避免地要在开发和探索之间进行权衡。目前，流行的方法是在贝叶斯优化框架中使用各种采集函数。我们通过测量与直接过去观察相关的意外元素和程度来讨论在开采与勘探之间进行权衡是否有益。我们设计了一个惊喜反应策略，使用两个现有的惊喜指标，被称为香农惊喜和贝叶斯惊喜。我们的分析表明，意外反应策略似乎更适合于在资源限制下快速描述响应面整体景观。我们并没有声称我们有一个完全自主的实验系统，但相信意外反应能力有利于自主实验中顺序决策的自动化。从业人员注意：自治系统应该能够超越通常通过配方预先编程的重复自动操作。在飞行中决定下一步做什么是自主与自动化的区别。可以说，自主是自动化的最高形式。为了赋予制造自主性，一个必要的能力是让它对“意外”做出适当的反应，即那些与模型预期不一致的观察结果。这些测量结果是错误的，是异常的，还是模型不足的指示？应该丢弃这些观测值还是应该使用新的观测值来更新模型？如果是后者，模型应该逐步更新还是彻底改变？形象地说，在观察到意想不到的情况时，我们说制造控制系统“感到惊讶”，并提出它应该如何反应的问题。我们的调查分享了我们目前对这个问题的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统

CiteScore

12.50

自引率

14.30%

发文量

404

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.