Learning how to transfer: A lifelong domain knowledge distillation framework for continual MRC

IF 4.3 Intelligent Systems with Applications Pub Date : 2025-06-01 Epub Date: 2025-03-08 DOI:10.1016/j.iswa.2025.200497
Songze Li , Zhijing Wu , Runmin Cao , Xiaohan Zhang , Yifan Wang , Hua Xu , Kai Gao
{"title":"Learning how to transfer: A lifelong domain knowledge distillation framework for continual MRC","authors":"Songze Li ,&nbsp;Zhijing Wu ,&nbsp;Runmin Cao ,&nbsp;Xiaohan Zhang ,&nbsp;Yifan Wang ,&nbsp;Hua Xu ,&nbsp;Kai Gao","doi":"10.1016/j.iswa.2025.200497","DOIUrl":null,"url":null,"abstract":"<div><div>Machine Reading Comprehension (MRC) has attracted wide attention in recent years. It can reflect how well a machine understands human language. Benefitting from the increasing large-scale benchmark and pre-trained language models, a lot of MRC models have achieved remarkable success and even exceeded human performance. However, real-world MRC systems need incrementally learn from a continuous data stream across time without accessing the previously seen data, called Continual MRC system. It is a great challenge to learn a new domain incrementally without catastrophically forgetting previous knowledge. In this paper, MK-MRC (an extension of MA-MRC), a continual MRC framework with uncertainty-aware fixed <strong>M</strong>emory and lifelong domain <strong>K</strong>nowledge distillation, is proposed. MK-MRC is a memory replaying based method, in which a fixed-size memory buffer stores a small number of samples in previous domain data along with an uncertainty-aware updating strategy when new domain data arrives. For incremental learning, MK-MRC fully uses the domain adaptation and transfer relationship between memory and new domain data through several domain knowledge distillation strategies.</div><div>Compared with MA-MRC, MK-MRC additionally introduces more strategies to strengthen the ability of continual learning, such as data augmentation and special task-related knowledge distillation. Experimental results show that MK-MRC yields consistent improvement compared with strong baselines and has a substantial incremental learning ability without catastrophically forgetting under four continual span-extractive and multiple-choice MRC settings.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"26 ","pages":"Article 200497"},"PeriodicalIF":4.3000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Systems with Applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667305325000237","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/8 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Machine Reading Comprehension (MRC) has attracted wide attention in recent years. It can reflect how well a machine understands human language. Benefitting from the increasing large-scale benchmark and pre-trained language models, a lot of MRC models have achieved remarkable success and even exceeded human performance. However, real-world MRC systems need incrementally learn from a continuous data stream across time without accessing the previously seen data, called Continual MRC system. It is a great challenge to learn a new domain incrementally without catastrophically forgetting previous knowledge. In this paper, MK-MRC (an extension of MA-MRC), a continual MRC framework with uncertainty-aware fixed Memory and lifelong domain Knowledge distillation, is proposed. MK-MRC is a memory replaying based method, in which a fixed-size memory buffer stores a small number of samples in previous domain data along with an uncertainty-aware updating strategy when new domain data arrives. For incremental learning, MK-MRC fully uses the domain adaptation and transfer relationship between memory and new domain data through several domain knowledge distillation strategies.
Compared with MA-MRC, MK-MRC additionally introduces more strategies to strengthen the ability of continual learning, such as data augmentation and special task-related knowledge distillation. Experimental results show that MK-MRC yields consistent improvement compared with strong baselines and has a substantial incremental learning ability without catastrophically forgetting under four continual span-extractive and multiple-choice MRC settings.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
学习如何转移:持续MRC的终身领域知识蒸馏框架
机器阅读理解(MRC)近年来引起了广泛的关注。它可以反映出机器对人类语言的理解程度。得益于越来越多的大规模基准测试和预训练语言模型,许多MRC模型取得了显著的成功,甚至超过了人类的表现。然而,现实世界的MRC系统需要从连续的数据流中逐步学习,而不需要访问以前看到的数据,称为连续MRC系统。在不灾难性地忘记以前的知识的情况下,增量地学习一个新领域是一个巨大的挑战。本文提出了一种具有不确定性感知的固定记忆和终身领域知识蒸馏的连续MRC框架——MK-MRC (MA-MRC的扩展)。MK-MRC是一种基于记忆重放的方法,该方法使用固定大小的内存缓冲区存储先前域数据中的少量样本,并在新域数据到达时采用不确定性感知更新策略。对于增量学习,MK-MRC通过多种领域知识蒸馏策略,充分利用了记忆与新领域数据之间的领域适应和迁移关系。与MA-MRC相比,MK-MRC还引入了更多的策略来增强持续学习的能力,如数据增强和与特殊任务相关的知识蒸馏。实验结果表明,与强基线相比,MK-MRC在四种连续跨度提取和多项选择的MRC设置下取得了一致的改善,并且在没有灾难性遗忘的情况下具有显著的增量学习能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
5.60
自引率
0.00%
发文量
0
期刊最新文献
Ensemble of stochastic models, machine learning, deep learning and wavelet based techniques: A Whale optimization approach Real-time route optimization in smart cities via Bidirectional A* algorithm Integration of digital twins and physical AI in cyber-physical systems Document modelling using topics based on meaningful-interesting patterns for information filtering EOAC-LLM: An LLM-driven event ontology automatic construction system
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1