Harnessing DFT and machine learning for accurate optical gap prediction in conjugated polymers†

IF 5.1 3区 材料科学 Q1 CHEMISTRY, MULTIDISCIPLINARY Nanoscale Pub Date : 2025-02-18 DOI:10.1039/D4NR03702B
Bin Liu, Yunrui Yan and Mingjie Liu
{"title":"Harnessing DFT and machine learning for accurate optical gap prediction in conjugated polymers†","authors":"Bin Liu, Yunrui Yan and Mingjie Liu","doi":"10.1039/D4NR03702B","DOIUrl":null,"url":null,"abstract":"<p >Conjugated polymers (CPs), characterized by alternating <em>σ</em> and π bonds, have attracted significant attention for their diverse structures and adjustable electronic properties. However, predicting the optical band gap (<em>E</em><small><sup>exp</sup></small><small><sub>gap</sub></small>) of CPs remains challenging. This study presents a rational model that integrates density functional theory (DFT) calculation with a data-driven machine learning (ML) approach to predict the experimentally measured <em>E</em><small><sup>exp</sup></small><small><sub>gap</sub></small> of CPs, using 1096 data points. Through alkyl side chain truncation and conjugated backbone extension, the modified oligomers effectively capture the electronic properties of CPs, significantly improving the correlation between the DFT-calculated HOMO–LUMO gap (<em>E</em><small><sup>oligomer</sup></small><small><sub>gap</sub></small>) and <em>E</em><small><sup>exp</sup></small><small><sub>gap</sub></small> (<em>R</em><small><sup>2</sup></small> = 0.51) compared to the unmodified side-chain-containing monomers (<em>R</em><small><sup>2</sup></small> = 0.15). Moreover, we trained six ML models with two categories of features as input: <em>E</em><small><sup>oligomer</sup></small><small><sub>gap</sub></small> to represent the extended backbone and molecular features of unmodified monomers to capture the alkyl-side-chain effect. The best model, XGBoost-2, achieved an <em>R</em><small><sup>2</sup></small> of 0.77 and an MAE of 0.065 eV for predicting <em>E</em><small><sup>exp</sup></small><small><sub>gap</sub></small>, falling within the experimental error margin of ∼0.1 eV. We further validated XGBoost-2 on a dataset of 227 newly synthesized CPs collected from literature without further retraining. Notably, XGBoost-2 exhibits both excellent interpolation for BT-, BTA-, QA-, DPP-, and TPD-based CPs, and exceptional extrapolation for PDI-, NDI-, DTBT-, BBX-, and Y6-based CPs, which are attributed to the integration of DFT methods with rationally designed oligomer structures. For the first time, we demonstrated a novel and effective strategy combining quantum chemistry calculations with ML modeling for accurate and efficient prediction of experimentally measured fundamental properties of CPs. Our study paves the way for the accelerated design and development of high-performance CPs in photoelectronic applications.</p>","PeriodicalId":92,"journal":{"name":"Nanoscale","volume":" 13","pages":" 7865-7876"},"PeriodicalIF":5.1000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/nr/d4nr03702b?page=search","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nanoscale","FirstCategoryId":"88","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/nr/d4nr03702b","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Conjugated polymers (CPs), characterized by alternating σ and π bonds, have attracted significant attention for their diverse structures and adjustable electronic properties. However, predicting the optical band gap (Eexpgap) of CPs remains challenging. This study presents a rational model that integrates density functional theory (DFT) calculation with a data-driven machine learning (ML) approach to predict the experimentally measured Eexpgap of CPs, using 1096 data points. Through alkyl side chain truncation and conjugated backbone extension, the modified oligomers effectively capture the electronic properties of CPs, significantly improving the correlation between the DFT-calculated HOMO–LUMO gap (Eoligomergap) and Eexpgap (R2 = 0.51) compared to the unmodified side-chain-containing monomers (R2 = 0.15). Moreover, we trained six ML models with two categories of features as input: Eoligomergap to represent the extended backbone and molecular features of unmodified monomers to capture the alkyl-side-chain effect. The best model, XGBoost-2, achieved an R2 of 0.77 and an MAE of 0.065 eV for predicting Eexpgap, falling within the experimental error margin of ∼0.1 eV. We further validated XGBoost-2 on a dataset of 227 newly synthesized CPs collected from literature without further retraining. Notably, XGBoost-2 exhibits both excellent interpolation for BT-, BTA-, QA-, DPP-, and TPD-based CPs, and exceptional extrapolation for PDI-, NDI-, DTBT-, BBX-, and Y6-based CPs, which are attributed to the integration of DFT methods with rationally designed oligomer structures. For the first time, we demonstrated a novel and effective strategy combining quantum chemistry calculations with ML modeling for accurate and efficient prediction of experimentally measured fundamental properties of CPs. Our study paves the way for the accelerated design and development of high-performance CPs in photoelectronic applications.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用DFT和机器学习精确预测共轭聚合物的光隙
共轭聚合物(CPs)以σ键和π键交替为特征,以其多样的结构和可调节的电子性能而受到人们的广泛关注。然而,预测CPs的光学带隙(Egapexp)仍然具有挑战性。本研究提出了一个合理的模型,将密度泛函理论(DFT)计算与数据驱动的机器学习(ML)方法相结合,使用1096个数据点预测CPs的实验测量Egapexp。通过烷基侧链截断和共轭主链延伸,修饰后的低聚物有效捕获了CPs的电子性质,与未修饰的含侧链单体相比,dft计算的HOMO-LUMO间隙(Egapoligomer)和Egapexp之间的相关性(R2=0.51)显著提高(R2=0.15)。此外,我们以两类特征作为输入训练了6个ML模型:egapoligomer(代表扩展的主链)和未修饰单体的分子特征(以捕捉烷基侧链效应)。最佳模型XGBoost-2预测Egapexp的R2为0.77,MAE为0.065 eV,实验误差范围在~ 0.1 eV以内。我们在227个新合成的CPs数据集上进一步验证了XGBoost-2,这些数据集来自文献,没有进一步的再训练。值得注意的是,XGBoost-2对基于BT-、BTA-、QA-、DPP-和tpd的CPs具有出色的插值能力,并且对基于PDI-、NDI-、dbt -、BBX-和y6的CPs具有出色的外推能力,这要归功于DFT方法与合理设计的低聚物结构的集成。我们首次展示了一种将量子化学计算与ML建模相结合的新颖有效的策略,用于准确有效地预测实验测量的CPs的基本性质。我们的研究为光电子应用中高性能cp的加速设计和开发铺平了道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Nanoscale
Nanoscale CHEMISTRY, MULTIDISCIPLINARY-NANOSCIENCE & NANOTECHNOLOGY
CiteScore
12.10
自引率
3.00%
发文量
1628
审稿时长
1.6 months
期刊介绍: Nanoscale is a high-impact international journal, publishing high-quality research across nanoscience and nanotechnology. Nanoscale publishes a full mix of research articles on experimental and theoretical work, including reviews, communications, and full papers.Highly interdisciplinary, this journal appeals to scientists, researchers and professionals interested in nanoscience and nanotechnology, quantum materials and quantum technology, including the areas of physics, chemistry, biology, medicine, materials, energy/environment, information technology, detection science, healthcare and drug discovery, and electronics.
期刊最新文献
Bioinspired multi-compartment mesoporous nanoreactors: modular assembly and functional applications. Dehydrogenase-like stacked MoS2 nanozymes for cancer treatment through disrupting tricarboxylic acid cycle Light-responsive Au/TiO2/PDA coating enhances the antibacterial performance of titanium implants. Transfer Learning-Enabled Discovery of MXenes as Efficient Anchoring Materials for Li-S Batteries Synergistic optimization of thermoelectric performance in Ag2Se through MoSe2 decoration.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1