{"title":"为包晶太阳能电池数据集选择合适的机器学习模型","authors":"Mohamed M. Salah, Zahraa Ismail, Sameh Abdellatif","doi":"10.1007/s40243-023-00239-2","DOIUrl":null,"url":null,"abstract":"<div><p>Utilizing artificial intelligent based algorithms in solving engineering problems is widely spread nowadays. Herein, this study provides a comprehensive and insightful analysis of the application of machine learning (ML) models to complex datasets in the field of solar cell power conversion efficiency (PCE). Mainly, perovskite solar cells generate three datasets, varying dataset size and complexity. Various popular regression models and hyperparameter tuning techniques are studied to guide researchers and practitioners looking to leverage machine learning methods for their data-driven projects. Specifically, four ML models were investigated; random forest (RF), gradient boosting (GBR), K-nearest neighbors (KNN), and linear regression (LR), while monitoring the ML model accuracy, complexity, computational cost, and time as evaluating parameters. Inputs' importance and contribution were examined for the three datasets, recording a dominating effect for the electron transport layer's (ETL) doping as the main controlling parameter in tuning the cell's overall PCE. For the first dataset, ETL doping recorded 93.6%, as the main contributor to the cell PCE, reducing to 79.0% in the third dataset.</p></div>","PeriodicalId":692,"journal":{"name":"Materials for Renewable and Sustainable Energy","volume":"12 3","pages":"187 - 198"},"PeriodicalIF":3.6000,"publicationDate":"2023-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s40243-023-00239-2.pdf","citationCount":"0","resultStr":"{\"title\":\"Selecting an appropriate machine-learning model for perovskite solar cell datasets\",\"authors\":\"Mohamed M. Salah, Zahraa Ismail, Sameh Abdellatif\",\"doi\":\"10.1007/s40243-023-00239-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Utilizing artificial intelligent based algorithms in solving engineering problems is widely spread nowadays. Herein, this study provides a comprehensive and insightful analysis of the application of machine learning (ML) models to complex datasets in the field of solar cell power conversion efficiency (PCE). Mainly, perovskite solar cells generate three datasets, varying dataset size and complexity. Various popular regression models and hyperparameter tuning techniques are studied to guide researchers and practitioners looking to leverage machine learning methods for their data-driven projects. Specifically, four ML models were investigated; random forest (RF), gradient boosting (GBR), K-nearest neighbors (KNN), and linear regression (LR), while monitoring the ML model accuracy, complexity, computational cost, and time as evaluating parameters. Inputs' importance and contribution were examined for the three datasets, recording a dominating effect for the electron transport layer's (ETL) doping as the main controlling parameter in tuning the cell's overall PCE. For the first dataset, ETL doping recorded 93.6%, as the main contributor to the cell PCE, reducing to 79.0% in the third dataset.</p></div>\",\"PeriodicalId\":692,\"journal\":{\"name\":\"Materials for Renewable and Sustainable Energy\",\"volume\":\"12 3\",\"pages\":\"187 - 198\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2023-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s40243-023-00239-2.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Materials for Renewable and Sustainable Energy\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s40243-023-00239-2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Materials for Renewable and Sustainable Energy","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s40243-023-00239-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
如今,利用基于人工智能的算法解决工程问题已广为流传。在此,本研究对机器学习(ML)模型在太阳能电池功率转换效率(PCE)领域复杂数据集中的应用进行了全面而深入的分析。主要由包晶石太阳能电池产生三个数据集,数据集的大小和复杂程度各不相同。研究了各种流行的回归模型和超参数调整技术,为希望在数据驱动项目中利用机器学习方法的研究人员和从业人员提供指导。具体来说,研究了四种 ML 模型:随机森林 (RF)、梯度提升 (GBR)、K-近邻 (KNN) 和线性回归 (LR),同时监测 ML 模型的准确性、复杂性、计算成本和时间作为评估参数。对三个数据集的输入的重要性和贡献进行了研究,发现电子传输层(ETL)掺杂作为调整电池整体 PCE 的主要控制参数具有主导作用。在第一个数据集中,电子传输层掺杂占 93.6%,是电池 PCE 的主要贡献者,而在第三个数据集中则降至 79.0%。
Selecting an appropriate machine-learning model for perovskite solar cell datasets
Utilizing artificial intelligent based algorithms in solving engineering problems is widely spread nowadays. Herein, this study provides a comprehensive and insightful analysis of the application of machine learning (ML) models to complex datasets in the field of solar cell power conversion efficiency (PCE). Mainly, perovskite solar cells generate three datasets, varying dataset size and complexity. Various popular regression models and hyperparameter tuning techniques are studied to guide researchers and practitioners looking to leverage machine learning methods for their data-driven projects. Specifically, four ML models were investigated; random forest (RF), gradient boosting (GBR), K-nearest neighbors (KNN), and linear regression (LR), while monitoring the ML model accuracy, complexity, computational cost, and time as evaluating parameters. Inputs' importance and contribution were examined for the three datasets, recording a dominating effect for the electron transport layer's (ETL) doping as the main controlling parameter in tuning the cell's overall PCE. For the first dataset, ETL doping recorded 93.6%, as the main contributor to the cell PCE, reducing to 79.0% in the third dataset.
期刊介绍:
Energy is the single most valuable resource for human activity and the basis for all human progress. Materials play a key role in enabling technologies that can offer promising solutions to achieve renewable and sustainable energy pathways for the future.
Materials for Renewable and Sustainable Energy has been established to be the world''s foremost interdisciplinary forum for publication of research on all aspects of the study of materials for the deployment of renewable and sustainable energy technologies. The journal covers experimental and theoretical aspects of materials and prototype devices for sustainable energy conversion, storage, and saving, together with materials needed for renewable fuel production. It publishes reviews, original research articles, rapid communications, and perspectives. All manuscripts are peer-reviewed for scientific quality.
Topics include:
1. MATERIALS for renewable energy storage and conversion: Batteries, Supercapacitors, Fuel cells, Hydrogen storage, and Photovoltaics and solar cells.
2. MATERIALS for renewable and sustainable fuel production: Hydrogen production and fuel generation from renewables (catalysis), Solar-driven reactions to hydrogen and fuels from renewables (photocatalysis), Biofuels, and Carbon dioxide sequestration and conversion.
3. MATERIALS for energy saving: Thermoelectrics, Novel illumination sources for efficient lighting, and Energy saving in buildings.
4. MATERIALS modeling and theoretical aspects.
5. Advanced characterization techniques of MATERIALS
Materials for Renewable and Sustainable Energy is committed to upholding the integrity of the scientific record. As a member of the Committee on Publication Ethics (COPE) the journal will follow the COPE guidelines on how to deal with potential acts of misconduct. Authors should refrain from misrepresenting research results which could damage the trust in the journal and ultimately the entire scientific endeavor. Maintaining integrity of the research and its presentation can be achieved by following the rules of good scientific practice as detailed here: https://www.springer.com/us/editorial-policies