{"title":"Natural Language Processing Applied on Large Scale Data Extraction from Scientific Papers in Fuel Cells","authors":"Feifan Yang","doi":"10.1145/3508230.3508256","DOIUrl":null,"url":null,"abstract":"Natural language processing (NLP) has a great potential to help scientists automatically extract information from large-scale text datasets. In this paper, we focus on the process of NLP — including text acquisition, text preprocessing, word embedding training, and named entity recognition — applied on 106,181 abstracts of fuel cell papers. Then we evaluate our trained model on its ability of analogy, use the model to analyze the research trend in fuel cell materials and predict new materials. To the best of our knowledge, it is the first time that NLP has been applied in the field of fuel cells. This data-driven technique is demonstrated to have the potential to promote the discoveries of new fuel cell materials.","PeriodicalId":252146,"journal":{"name":"Proceedings of the 2021 5th International Conference on Natural Language Processing and Information Retrieval","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 5th International Conference on Natural Language Processing and Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3508230.3508256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Natural language processing (NLP) has a great potential to help scientists automatically extract information from large-scale text datasets. In this paper, we focus on the process of NLP — including text acquisition, text preprocessing, word embedding training, and named entity recognition — applied on 106,181 abstracts of fuel cell papers. Then we evaluate our trained model on its ability of analogy, use the model to analyze the research trend in fuel cell materials and predict new materials. To the best of our knowledge, it is the first time that NLP has been applied in the field of fuel cells. This data-driven technique is demonstrated to have the potential to promote the discoveries of new fuel cell materials.