Lei Zhang, Yiru Huang, Leiming Yan, Jinghao Ge, Xiaokang Ma, Zhike Liu, Jiaxue You, Alex K. Y. Jen, Shengzhong Frank Liu
{"title":"通过语言机器学习快速探索文献,设计 Perovskite 太阳能电池材料","authors":"Lei Zhang, Yiru Huang, Leiming Yan, Jinghao Ge, Xiaokang Ma, Zhike Liu, Jiaxue You, Alex K. Y. Jen, Shengzhong Frank Liu","doi":"10.1002/aisy.202300678","DOIUrl":null,"url":null,"abstract":"<p>Making computers automatically extract latent scientific knowledge from literature is highly desired for future materials and chemical research in the artificial intelligence era. Herein, the natural language processing (NLP)-based machine learning technique to build language models and automatically extract hidden information regarding perovskite solar cell (PSC) materials from 29 060 publications is employed. The concept that there are light-absorbing materials, electron-transporting materials, and hole-transporting materials in PSCs is successfully learned by the NLP-based machine learning model without a time-consuming human expert training process. The NLP model highlights a hole-transporting material that receives insufficient attention in the literature, which is then elaborated via density functional theory calculations to provide an atomistic view of the perovskite/hole-transporting layer heterostructures and their optoelectronic properties. Finally, the above results are confirmed by device experiments. The present study demonstrates the viability of NLP as a universal machine learning tool to extract useful information from existing publications.</p>","PeriodicalId":93858,"journal":{"name":"Advanced intelligent systems (Weinheim an der Bergstrasse, Germany)","volume":null,"pages":null},"PeriodicalIF":6.8000,"publicationDate":"2024-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aisy.202300678","citationCount":"0","resultStr":"{\"title\":\"Fast Exploring Literature by Language Machine Learning for Perovskite Solar Cell Materials Design\",\"authors\":\"Lei Zhang, Yiru Huang, Leiming Yan, Jinghao Ge, Xiaokang Ma, Zhike Liu, Jiaxue You, Alex K. Y. Jen, Shengzhong Frank Liu\",\"doi\":\"10.1002/aisy.202300678\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Making computers automatically extract latent scientific knowledge from literature is highly desired for future materials and chemical research in the artificial intelligence era. Herein, the natural language processing (NLP)-based machine learning technique to build language models and automatically extract hidden information regarding perovskite solar cell (PSC) materials from 29 060 publications is employed. The concept that there are light-absorbing materials, electron-transporting materials, and hole-transporting materials in PSCs is successfully learned by the NLP-based machine learning model without a time-consuming human expert training process. The NLP model highlights a hole-transporting material that receives insufficient attention in the literature, which is then elaborated via density functional theory calculations to provide an atomistic view of the perovskite/hole-transporting layer heterostructures and their optoelectronic properties. Finally, the above results are confirmed by device experiments. The present study demonstrates the viability of NLP as a universal machine learning tool to extract useful information from existing publications.</p>\",\"PeriodicalId\":93858,\"journal\":{\"name\":\"Advanced intelligent systems (Weinheim an der Bergstrasse, Germany)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2024-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aisy.202300678\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced intelligent systems (Weinheim an der Bergstrasse, Germany)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/aisy.202300678\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced intelligent systems (Weinheim an der Bergstrasse, Germany)","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/aisy.202300678","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Fast Exploring Literature by Language Machine Learning for Perovskite Solar Cell Materials Design
Making computers automatically extract latent scientific knowledge from literature is highly desired for future materials and chemical research in the artificial intelligence era. Herein, the natural language processing (NLP)-based machine learning technique to build language models and automatically extract hidden information regarding perovskite solar cell (PSC) materials from 29 060 publications is employed. The concept that there are light-absorbing materials, electron-transporting materials, and hole-transporting materials in PSCs is successfully learned by the NLP-based machine learning model without a time-consuming human expert training process. The NLP model highlights a hole-transporting material that receives insufficient attention in the literature, which is then elaborated via density functional theory calculations to provide an atomistic view of the perovskite/hole-transporting layer heterostructures and their optoelectronic properties. Finally, the above results are confirmed by device experiments. The present study demonstrates the viability of NLP as a universal machine learning tool to extract useful information from existing publications.