Nafees Ahmad , Mahmoud A.A. Ibrahim , Shaban R.M. Sayed , Syed Shoaib Ahmad Shah , Mudassir Hussain Tahir , Yingping Zou
{"title":"Data-mining and machine learning based search for optimal materials for perovskite and organic solar cells","authors":"Nafees Ahmad , Mahmoud A.A. Ibrahim , Shaban R.M. Sayed , Syed Shoaib Ahmad Shah , Mudassir Hussain Tahir , Yingping Zou","doi":"10.1016/j.solener.2024.113223","DOIUrl":null,"url":null,"abstract":"<div><div>A data mining-based approach is introduced to search the organic compounds for Photovoltaics applications. Organic semiconductors are search from a database of organic compounds having lower reorganization energy for hole transfer. Three polymer donors are selected as a standard structure to search similar materials from database. Energy levels are predicted using machine learning as a screening criterion for the selection of best materials for photovoltaics applications. Fingerprints are used for training the machine learning models. More than 40 machine learning models are tried, random forest has appeared as a best model (r-squared of 0.800 and 0.609 for training and test set, respectively). This machine learning model is used to predict the energy levels of new materials. Synthetic accessibility of selected organic semi-conductors is also predicted, all these semi-conductors are straight-forward to synthesize. Their Synthetic accessibility (SA) score is less than 6.</div></div>","PeriodicalId":428,"journal":{"name":"Solar Energy","volume":"287 ","pages":"Article 113223"},"PeriodicalIF":6.0000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Solar Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0038092X24009186","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
A data mining-based approach is introduced to search the organic compounds for Photovoltaics applications. Organic semiconductors are search from a database of organic compounds having lower reorganization energy for hole transfer. Three polymer donors are selected as a standard structure to search similar materials from database. Energy levels are predicted using machine learning as a screening criterion for the selection of best materials for photovoltaics applications. Fingerprints are used for training the machine learning models. More than 40 machine learning models are tried, random forest has appeared as a best model (r-squared of 0.800 and 0.609 for training and test set, respectively). This machine learning model is used to predict the energy levels of new materials. Synthetic accessibility of selected organic semi-conductors is also predicted, all these semi-conductors are straight-forward to synthesize. Their Synthetic accessibility (SA) score is less than 6.
期刊介绍:
Solar Energy welcomes manuscripts presenting information not previously published in journals on any aspect of solar energy research, development, application, measurement or policy. The term "solar energy" in this context includes the indirect uses such as wind energy and biomass