{"title":"系外行星主星分类:不完整恒星丰度数据的多目标优化","authors":"Miguel A Zammit, Josef Borg, Kristian Zarb Adami","doi":"10.1093/rasti/rzae020","DOIUrl":null,"url":null,"abstract":"\n The presence of a planetary companion around its host star has been repeatedly linked with stellar properties, affecting the likelihood of sub-stellar object formation and stability in the protoplanetary disc, thus presenting a key challenge in exoplanet science. Furthermore, abundance and stellar parameter datasets tend to be incomplete, which limits the ability to infer distributional characteristics harnessing the entire dataset. This work aims to develop a methodology using machine learning and multi-objective optimisation for reliable imputation for subsequent comparison tests and host star recommendation. It integrates fuzzy clustering for imputation and ML classification of hosts and comparison stars into an evolutionary multi-objective optimisation algorithm. We test several candidates for the classification model, starting with a binary classification for giant planet hosts. Upon confirmation that the XGBoost algorithm provides the best performance, we interpret the performance of both the imputation and classification modules for binary classification. The model is extended to handle multi-label classification for low-mass planets and planet multiplicity. Constraints on the model’s use and feature/sample selection are given, outlining strengths and limitations. We conclude that the careful use of this technique for host star recommendation will be an asset to future missions and the compilation of necessary target lists.","PeriodicalId":500957,"journal":{"name":"RAS Techniques and Instruments","volume":"28 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exoplanet host star classification: Multi-Objective Optimisation of incomplete stellar abundance data\",\"authors\":\"Miguel A Zammit, Josef Borg, Kristian Zarb Adami\",\"doi\":\"10.1093/rasti/rzae020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n The presence of a planetary companion around its host star has been repeatedly linked with stellar properties, affecting the likelihood of sub-stellar object formation and stability in the protoplanetary disc, thus presenting a key challenge in exoplanet science. Furthermore, abundance and stellar parameter datasets tend to be incomplete, which limits the ability to infer distributional characteristics harnessing the entire dataset. This work aims to develop a methodology using machine learning and multi-objective optimisation for reliable imputation for subsequent comparison tests and host star recommendation. It integrates fuzzy clustering for imputation and ML classification of hosts and comparison stars into an evolutionary multi-objective optimisation algorithm. We test several candidates for the classification model, starting with a binary classification for giant planet hosts. Upon confirmation that the XGBoost algorithm provides the best performance, we interpret the performance of both the imputation and classification modules for binary classification. The model is extended to handle multi-label classification for low-mass planets and planet multiplicity. Constraints on the model’s use and feature/sample selection are given, outlining strengths and limitations. We conclude that the careful use of this technique for host star recommendation will be an asset to future missions and the compilation of necessary target lists.\",\"PeriodicalId\":500957,\"journal\":{\"name\":\"RAS Techniques and Instruments\",\"volume\":\"28 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"RAS Techniques and Instruments\",\"FirstCategoryId\":\"0\",\"ListUrlMain\":\"https://doi.org/10.1093/rasti/rzae020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"RAS Techniques and Instruments","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.1093/rasti/rzae020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
宿主恒星周围的行星伴星的存在一再与恒星特性联系在一起,影响着亚恒星天体形成的可能性和原行星盘的稳定性,从而给系外行星科学带来了关键的挑战。此外,丰度和恒星参数数据集往往不完整,这限制了利用整个数据集推断分布特征的能力。这项工作旨在利用机器学习和多目标优化开发一种方法,为后续对比测试和宿主星推荐提供可靠的估算。它将用于估算的模糊聚类以及主机和对比恒星的 ML 分类集成到进化多目标优化算法中。我们测试了几个候选分类模型,首先是巨行星宿主的二元分类。在确认 XGBoost 算法提供了最佳性能之后,我们对双星分类的估算和分类模块的性能进行了解释。我们对模型进行了扩展,以处理低质量行星和行星多度的多标签分类。我们给出了模型使用和特征/样本选择的约束条件,概述了其优势和局限性。我们的结论是,谨慎使用这一技术来推荐主星将是未来任务和编制必要目标列表的宝贵财富。
Exoplanet host star classification: Multi-Objective Optimisation of incomplete stellar abundance data
The presence of a planetary companion around its host star has been repeatedly linked with stellar properties, affecting the likelihood of sub-stellar object formation and stability in the protoplanetary disc, thus presenting a key challenge in exoplanet science. Furthermore, abundance and stellar parameter datasets tend to be incomplete, which limits the ability to infer distributional characteristics harnessing the entire dataset. This work aims to develop a methodology using machine learning and multi-objective optimisation for reliable imputation for subsequent comparison tests and host star recommendation. It integrates fuzzy clustering for imputation and ML classification of hosts and comparison stars into an evolutionary multi-objective optimisation algorithm. We test several candidates for the classification model, starting with a binary classification for giant planet hosts. Upon confirmation that the XGBoost algorithm provides the best performance, we interpret the performance of both the imputation and classification modules for binary classification. The model is extended to handle multi-label classification for low-mass planets and planet multiplicity. Constraints on the model’s use and feature/sample selection are given, outlining strengths and limitations. We conclude that the careful use of this technique for host star recommendation will be an asset to future missions and the compilation of necessary target lists.