{"title":"Coactive Preference-Guided Multi-Objective Bayesian Optimization: An Application to Policy Learning in Personalized Plasma Medicine","authors":"Ketong Shao;Ankush Chakrabarty;Ali Mesbah;Diego Romeres","doi":"10.1109/LCSYS.2024.3521965","DOIUrl":null,"url":null,"abstract":"The design of advanced learning- and optimization-based controllers requires selecting parameters that balance performance objectives and constraints. Bayesian optimization (BO) has proven effective for resource-efficient calibration of such controllers. Preference-guided BO incorporates user preferences to prioritize areas of interest, but it lacks a mechanism for users to specify desired outcomes directly. This letter introduces a user-centric framework for preference-guided BO, leveraging a novel knowledge-gradient based coactive acquisition function that allows users not only to select preferred outcomes but also also propose alternatives to guide exploration. To enable efficient implementation, we approximate the acquisition function, avoiding costly bilevel optimization. The approach is validated for control policy adaptation in personalized plasma medicine, where it outperforms standard preference-guided BO by effectively integrating user feedback to personalize treatment protocol.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"8 ","pages":"3081-3086"},"PeriodicalIF":2.4000,"publicationDate":"2024-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Control Systems Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10812946/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The design of advanced learning- and optimization-based controllers requires selecting parameters that balance performance objectives and constraints. Bayesian optimization (BO) has proven effective for resource-efficient calibration of such controllers. Preference-guided BO incorporates user preferences to prioritize areas of interest, but it lacks a mechanism for users to specify desired outcomes directly. This letter introduces a user-centric framework for preference-guided BO, leveraging a novel knowledge-gradient based coactive acquisition function that allows users not only to select preferred outcomes but also also propose alternatives to guide exploration. To enable efficient implementation, we approximate the acquisition function, avoiding costly bilevel optimization. The approach is validated for control policy adaptation in personalized plasma medicine, where it outperforms standard preference-guided BO by effectively integrating user feedback to personalize treatment protocol.