Mapping the Learning Curves of Deep Learning Networks

Yanru Jiang, Rick Dale
{"title":"Mapping the Learning Curves of Deep Learning Networks","authors":"Yanru Jiang, Rick Dale","doi":"10.1101/2024.07.01.601491","DOIUrl":null,"url":null,"abstract":"There is an important challenge in systematically interpreting the internal representations of deep neural networks. This study introduces a multi-dimensional quantification and visualization approach which can capture two temporal dimensions of a model learning experience: the \"information processing trajectory\" and the \"developmental trajectory.\" The former represents the influence of incoming signals on an agent's decision-making, while the latter conceptualizes the gradual improvement in an agent's performance throughout its lifespan. Tracking the learning curves of a DNN enables researchers to explicitly identify the model appropriateness of a given task, examine the properties of the underlying input signals, and assess the model's alignment (or lack thereof) with human learning experiences. To illustrate the method, we conducted 750 runs of simulations on two temporal tasks: gesture detection and natural language processing (NLP) classification, showcasing its applicability across a spectrum of deep learning tasks. Based on the quantitative analysis of the learning curves across two distinct datasets, we have identified three insights gained from mapping these curves: nonlinearity, pairwise comparisons, and domain distinctions. We reflect on the theoretical implications of this method for cognitive processing, language models and multimodal representation.","PeriodicalId":501568,"journal":{"name":"bioRxiv - Scientific Communication and Education","volume":"19 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Scientific Communication and Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.07.01.601491","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

There is an important challenge in systematically interpreting the internal representations of deep neural networks. This study introduces a multi-dimensional quantification and visualization approach which can capture two temporal dimensions of a model learning experience: the "information processing trajectory" and the "developmental trajectory." The former represents the influence of incoming signals on an agent's decision-making, while the latter conceptualizes the gradual improvement in an agent's performance throughout its lifespan. Tracking the learning curves of a DNN enables researchers to explicitly identify the model appropriateness of a given task, examine the properties of the underlying input signals, and assess the model's alignment (or lack thereof) with human learning experiences. To illustrate the method, we conducted 750 runs of simulations on two temporal tasks: gesture detection and natural language processing (NLP) classification, showcasing its applicability across a spectrum of deep learning tasks. Based on the quantitative analysis of the learning curves across two distinct datasets, we have identified three insights gained from mapping these curves: nonlinearity, pairwise comparisons, and domain distinctions. We reflect on the theoretical implications of this method for cognitive processing, language models and multimodal representation.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
绘制深度学习网络的学习曲线
系统地解释深度神经网络的内部表征是一项重要挑战。本研究引入了一种多维量化和可视化方法,可捕捉模型学习经历的两个时间维度:"信息处理轨迹 "和 "发展轨迹"。前者代表了传入信号对代理决策的影响,后者则将代理在整个生命周期中性能的逐步提高概念化。通过跟踪 DNN 的学习曲线,研究人员可以明确识别特定任务的模型适当性,检查基本输入信号的属性,并评估模型与人类学习经验的一致性(或缺乏一致性)。为了说明该方法,我们在手势检测和自然语言处理(NLP)分类这两个时间任务上进行了 750 次模拟运行,展示了该方法在一系列深度学习任务中的适用性。基于对两个不同数据集学习曲线的定量分析,我们确定了绘制这些曲线所获得的三点启示:非线性、成对比较和领域区分。我们思考了这种方法对认知处理、语言模型和多模态表征的理论意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Learning on a Limb: An outreach module to engage high school students in orthopaedics Updated science-wide author databases of standardized citation indicators including retraction data Benchmarking open access publication rates for the pharmaceutical industry and research-intensive academic institutions Community-Based Entomological Surveillance and Control of Vector-Borne Diseases: A Scoping Review When crayfish make news, headlines are correct but still misleading
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1