Qiuling Tao
(, ), JinXin Yu
(, ), Xiangyu Mu
(, ), Xue Jia
(, ), Rongpei Shi
(, ), Zhifu Yao
(, ), Cuiping Wang
(, ), Haijun Zhang
(, ), Xingjun Liu
(, )
{"title":"Machine learning strategies for small sample size in materials science","authors":"Qiuling Tao \n (, ), JinXin Yu \n (, ), Xiangyu Mu \n (, ), Xue Jia \n (, ), Rongpei Shi \n (, ), Zhifu Yao \n (, ), Cuiping Wang \n (, ), Haijun Zhang \n (, ), Xingjun Liu \n (, )","doi":"10.1007/s40843-024-3204-5","DOIUrl":null,"url":null,"abstract":"<div><p>Machine learning (ML) has been widely used to design and develop new materials owing to its low computational cost and powerful predictive capabilities. In recent years, the shortcomings of ML in materials science have gradually emerged, with a primary concern being the scarcity of data. It is challenging to build reliable and accurate ML models using limited data. Moreover, the small sample size problem will remain long-standing in materials science because of the slow accumulation of material data. Therefore, it is important to review and categorize strategies for small-sample learning for the development of ML in materials science. This review systematically sorts the research progress of small-sample learning strategies in materials science, including ensemble learning, unsupervised learning, active learning, and transfer learning. The directions for future research are proposed, including few-shot learning, and virtual sample generation. More importantly, we emphasize the significance of embedding material domain knowledge into ML and elaborate on the basic idea for implementing this strategy.</p><div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":773,"journal":{"name":"Science China Materials","volume":"68 2","pages":"387 - 405"},"PeriodicalIF":6.8000,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science China Materials","FirstCategoryId":"88","ListUrlMain":"https://link.springer.com/article/10.1007/s40843-024-3204-5","RegionNum":2,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning (ML) has been widely used to design and develop new materials owing to its low computational cost and powerful predictive capabilities. In recent years, the shortcomings of ML in materials science have gradually emerged, with a primary concern being the scarcity of data. It is challenging to build reliable and accurate ML models using limited data. Moreover, the small sample size problem will remain long-standing in materials science because of the slow accumulation of material data. Therefore, it is important to review and categorize strategies for small-sample learning for the development of ML in materials science. This review systematically sorts the research progress of small-sample learning strategies in materials science, including ensemble learning, unsupervised learning, active learning, and transfer learning. The directions for future research are proposed, including few-shot learning, and virtual sample generation. More importantly, we emphasize the significance of embedding material domain knowledge into ML and elaborate on the basic idea for implementing this strategy.
期刊介绍:
Science China Materials (SCM) is a globally peer-reviewed journal that covers all facets of materials science. It is supervised by the Chinese Academy of Sciences and co-sponsored by the Chinese Academy of Sciences and the National Natural Science Foundation of China. The journal is jointly published monthly in both printed and electronic forms by Science China Press and Springer. The aim of SCM is to encourage communication of high-quality, innovative research results at the cutting-edge interface of materials science with chemistry, physics, biology, and engineering. It focuses on breakthroughs from around the world and aims to become a world-leading academic journal for materials science.