Expert system for extracting keywords in educational texts and textbooks based on transformers models

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Expert Systems with Applications Pub Date : 2025-07-05 Epub Date: 2025-04-26 DOI:10.1016/j.eswa.2025.127735
Irene Cid Rico, Jordán Pascual Espada
{"title":"Expert system for extracting keywords in educational texts and textbooks based on transformers models","authors":"Irene Cid Rico,&nbsp;Jordán Pascual Espada","doi":"10.1016/j.eswa.2025.127735","DOIUrl":null,"url":null,"abstract":"<div><div>Automated keyword extraction is widely used for tasks like classification and summarization, but generic methods often fail to address domain-specific requirements. In education, texts are designed to help students grasp and retain key concepts needed for exercises and resolve questions. Despite the variety of existing keyword extraction algorithms, none are specifically adapted to the unique structure and purpose of educational materials like textbooks or lecture notes.Supervised methods have demonstrated their effectiveness in various domains through advanced techniques like contextual embeddings and domain-specific fine-tuning, Our study proposes a novel solution leveraging pretrained transformer models, specifically BERT, to adapt to the structure of educational materials for effective keyword extraction. Our research demonstrates that by fine-tuning BERT models to the specific characteristics of educational texts, we can achieve more accurate and relevant keyword extraction. YodkW, our adapted model, outperforms traditional algorithms in identifying the key concepts that are essential for educational purposes. Performance is quantified using the F1 score relative to text books key terms list, Preliminary results demonstrate that our approach can improve the identification of key concepts pertinent to student understanding and facilitate the automatic generation of test questions.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"282 ","pages":"Article 127735"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425013570","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/26 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Automated keyword extraction is widely used for tasks like classification and summarization, but generic methods often fail to address domain-specific requirements. In education, texts are designed to help students grasp and retain key concepts needed for exercises and resolve questions. Despite the variety of existing keyword extraction algorithms, none are specifically adapted to the unique structure and purpose of educational materials like textbooks or lecture notes.Supervised methods have demonstrated their effectiveness in various domains through advanced techniques like contextual embeddings and domain-specific fine-tuning, Our study proposes a novel solution leveraging pretrained transformer models, specifically BERT, to adapt to the structure of educational materials for effective keyword extraction. Our research demonstrates that by fine-tuning BERT models to the specific characteristics of educational texts, we can achieve more accurate and relevant keyword extraction. YodkW, our adapted model, outperforms traditional algorithms in identifying the key concepts that are essential for educational purposes. Performance is quantified using the F1 score relative to text books key terms list, Preliminary results demonstrate that our approach can improve the identification of key concepts pertinent to student understanding and facilitate the automatic generation of test questions.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于变形模型的教材关键词提取专家系统
自动关键字提取广泛用于分类和摘要等任务,但通用方法通常无法满足特定于领域的需求。在教育中,文本的设计是为了帮助学生掌握和记住练习和解决问题所需的关键概念。尽管现有的关键字提取算法多种多样,但没有一种算法专门适用于教科书或课堂讲稿等教育材料的独特结构和目的。监督方法已经通过上下文嵌入和特定领域微调等先进技术在各个领域证明了它们的有效性。我们的研究提出了一种新的解决方案,利用预训练的变压器模型,特别是BERT,来适应有效提取关键字的教育材料结构。研究表明,根据教育文本的具体特征对BERT模型进行微调,可以实现更准确、更相关的关键词提取。YodkW,我们的改编模型,在识别对教育目的至关重要的关键概念方面优于传统算法。初步结果表明,我们的方法可以提高与学生理解相关的关键概念的识别,并促进试题的自动生成。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
期刊最新文献
FairDiff: Masked condition diffusion for fairness-aware recommendation CTGAN-MNLIME: A CTGAN-boosted multidimensional nonlinear LIME method for corporate environmental indicators prediction An explainable machine learning-based scoring function using interpretable features and model explanation approaches for binding affinity prediction Hybrid fuzzy multi-criteria decision-making model for assessing sustainable waste management strategies MPGCF: Multi-objective and popularity-smoothing graph collaborative filtering for long-tail web API recommendation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1