Dataset for Software Engineering Learning Resources

Muddassira Arshad, M. Yousaf, S. M. Sarwar
{"title":"Dataset for Software Engineering Learning Resources","authors":"Muddassira Arshad, M. Yousaf, S. M. Sarwar","doi":"10.59287/icsis.588","DOIUrl":null,"url":null,"abstract":"– In the current digital age, an abundance of digital resources is readily available to learners. Withthe ongoing COVID pandemic and prevalent economic crises, a significant number of learners prefer toengage in self-learning. To develop customized self-learning applications and guide learners to utilizeresources based on their learning preferences, a dataset containing learning resources and their prerequisiterelationships is required. Several learning resource datasets exist for Machine Learning (ML), InformationRetrieval (IR), and Natural Language Processing (NLP). To contribute to this area, we present the SoftwareEngineering Learning Resource Dataset (SELRD), which is a publicly available dataset specificallydesigned for learning Software Engineering (SE). We have extracted the data for SELRD from multiplesources, including edX, my-mooc, and textbooks. The SE learning resources (SELR) are organized basedon topics, and the dataset includes 602 SELRs referring to 302 topics. We have extracted the content fromlectures and books available in presentation files (pptx) and Portable Document Format (PDF) using Pythonlibraries. Additionally, we have computed the expected reading time for each SELR, which would facilitatelearners by guiding them on the time required to read each respective resource. The SELRD comprises 692prerequisite pairs, including 592 positive pairs and 100 negative pairs. This data can be used along withmachine learning algorithms to generate learning paths that would facilitate self-learners. Additionally, theSELRD can also serve as a repository of SE learning resources. In the future, we plan to add best practicesand examples for each SELR, making it even more useful for learners.","PeriodicalId":178836,"journal":{"name":"International Conference on Scientific and Innovative Studies","volume":"279 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Scientific and Innovative Studies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.59287/icsis.588","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

– In the current digital age, an abundance of digital resources is readily available to learners. Withthe ongoing COVID pandemic and prevalent economic crises, a significant number of learners prefer toengage in self-learning. To develop customized self-learning applications and guide learners to utilizeresources based on their learning preferences, a dataset containing learning resources and their prerequisiterelationships is required. Several learning resource datasets exist for Machine Learning (ML), InformationRetrieval (IR), and Natural Language Processing (NLP). To contribute to this area, we present the SoftwareEngineering Learning Resource Dataset (SELRD), which is a publicly available dataset specificallydesigned for learning Software Engineering (SE). We have extracted the data for SELRD from multiplesources, including edX, my-mooc, and textbooks. The SE learning resources (SELR) are organized basedon topics, and the dataset includes 602 SELRs referring to 302 topics. We have extracted the content fromlectures and books available in presentation files (pptx) and Portable Document Format (PDF) using Pythonlibraries. Additionally, we have computed the expected reading time for each SELR, which would facilitatelearners by guiding them on the time required to read each respective resource. The SELRD comprises 692prerequisite pairs, including 592 positive pairs and 100 negative pairs. This data can be used along withmachine learning algorithms to generate learning paths that would facilitate self-learners. Additionally, theSELRD can also serve as a repository of SE learning resources. In the future, we plan to add best practicesand examples for each SELR, making it even more useful for learners.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
软件工程学习资源数据集
-在当今的数字时代,学习者可以随时获得丰富的数字资源。随着COVID大流行和普遍的经济危机的持续,相当多的学习者倾向于进行自学。为了开发定制的自主学习应用程序并指导学习者根据自己的学习偏好利用资源,需要一个包含学习资源及其先决条件关系的数据集。机器学习(ML)、信息检索(IR)和自然语言处理(NLP)有几个学习资源数据集。为了在这一领域做出贡献,我们提出了软件工程学习资源数据集(SELRD),这是一个专门为学习软件工程(SE)设计的公开可用的数据集。我们从多个资源中提取SELRD的数据,包括edX, my-mooc和教科书。SE学习资源(SELR)是基于主题组织的,数据集包括602个SELR,涉及302个主题。我们使用python库从演讲文件(pptx)和可移植文档格式(PDF)中提取了讲座和书籍中的内容。此外,我们还计算了每个SELR的预期阅读时间,这将通过指导学习者阅读每个资源所需的时间来方便他们。SELRD包括692对先决条件对,包括592对正对和100对负对。这些数据可以与机器学习算法一起使用,以生成有利于自学习者的学习路径。此外,selrd还可以作为SE学习资源的存储库。在未来,我们计划为每个SELR添加最佳实践和示例,使其对学习者更有用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Şırnak Üniversitesi Yerleşkesinde Bulunan Mühendislik Fakültesi Bina Çatılarının Örnek Şebekeye Bağlı (On-Grid) Bir Fotovoltaik Sistem Tasarımı A Noise Reduction Method for Semi-Noisy Multiobjective Optimization Problems The Effect of Smart Home Technology on the Quality of Life of the Elderly VERGİ USUL KANUNU’NA GÖRE GİRİŞİM SERMAYESİ FONU VE FON AYIRMA VE AKTARMA ZORUNLULUĞU Preparation and characterization of wood plastic composite produced from waste polystyrene with organic filler
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1