Expert system for extracting keywords in educational texts and textbooks based on transformers models
Irene Cid Rico, Jordán Pascual Espada
Expert Systems with Applications, volume 282, Article 127735. Published 2025-07-05 (Epub 2025-04-26).
Journal impact factor: 7.5 (JCR Q1, Computer Science, Artificial Intelligence)
DOI: 10.1016/j.eswa.2025.127735
URL: https://www.sciencedirect.com/science/article/pii/S0957417425013570
Citations: 0
Abstract
Automated keyword extraction is widely used for tasks like classification and summarization, but generic methods often fail to address domain-specific requirements. In education, texts are designed to help students grasp and retain the key concepts they need to complete exercises and answer questions. Despite the variety of existing keyword extraction algorithms, none are specifically adapted to the unique structure and purpose of educational materials like textbooks or lecture notes. Supervised methods have demonstrated their effectiveness in various domains through advanced techniques like contextual embeddings and domain-specific fine-tuning. Our study proposes a novel solution that leverages pretrained transformer models, specifically BERT, and adapts them to the structure of educational materials for effective keyword extraction. Our research demonstrates that by fine-tuning BERT models to the specific characteristics of educational texts, we can achieve more accurate and relevant keyword extraction. YodkW, our adapted model, outperforms traditional algorithms in identifying the key concepts that are essential for educational purposes. Performance is quantified using the F1 score relative to the key-term lists of textbooks. Preliminary results demonstrate that our approach can improve the identification of key concepts pertinent to student understanding and facilitate the automatic generation of test questions.
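As a rough illustration of the evaluation described in the abstract, the sketch below computes an F1 score for a set of extracted keywords against a textbook key-term list. The abstract does not say how matches are counted, so this assumes exact matching after lowercasing and trimming; the function name `keyword_f1` and the sample terms are hypothetical and are not taken from the paper.

```python
def keyword_f1(predicted, gold):
    """F1 between predicted keywords and a gold key-term list.

    Assumption: a prediction counts as correct only if it matches a gold
    term exactly after lowercasing and trimming whitespace.
    """
    pred = {k.strip().lower() for k in predicted if k.strip()}
    ref = {k.strip().lower() for k in gold if k.strip()}
    if not pred or not ref:
        return 0.0

    true_positives = len(pred & ref)
    if true_positives == 0:
        return 0.0

    precision = true_positives / len(pred)  # share of predictions that are gold terms
    recall = true_positives / len(ref)      # share of gold terms that were found
    return 2 * precision * recall / (precision + recall)


# Hypothetical example data, for illustration only.
extracted = ["Photosynthesis", "chlorophyll", "sunlight", "leaf"]
key_terms = ["photosynthesis", "chlorophyll", "stomata"]
score = keyword_f1(extracted, key_terms)
print(f"F1 = {score:.3f}")
```

Here two of the four extracted keywords appear in the three-item key-term list, so precision is 2/4 and recall is 2/3. A stemmed or partial-match comparison would give different numbers and may be closer to what a real evaluation would use.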
About the journal:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.