Latent space bias mitigation for predicting at-risk students

Ali Al-Zawqari, Dries Peumans, Gerd Vandersteen
{"title":"Latent space bias mitigation for predicting at-risk students","authors":"Ali Al-Zawqari,&nbsp;Dries Peumans,&nbsp;Gerd Vandersteen","doi":"10.1016/j.caeai.2024.100300","DOIUrl":null,"url":null,"abstract":"<div><div>Researchers have observed the relationship between educational achievements and students' demographic characteristics in physical classroom-based learning. In the context of online education, recent studies were conducted to explore the leading factors of successful online courses. These studies also investigated how demographic features impact student achievement in the online learning environment. This motivates the use of demographic information alongside other features to predict students' academic performance. Since demographic features include protected attributes, such as gender and age, evaluating predictive models must go beyond minimizing the overall error. In this work, we analyze and investigate the use of neural networks to predict underperforming students in online courses. However, our goal is not only to enhance the accuracy but also to evaluate the fairness of the predictive models, a problem concerning the application of machine learning in education. This paper starts by analyzing the available solutions to fairness in predictive models: bias mitigation with pre-processing and in-processing methods. We show that the current evaluation is missing the case of partial awareness of protected features, which is the case when the model is aware of bias on some protected attributes but not all. The in-processing method, specifically the adversarial bias mitigation, shows that debiasing in some protected features exacerbates the bias on other protected features. This observation motivates our proposal of an alternative approach to enhance bias mitigation even in the partial awareness scenario by working with latent space. We implement the proposed solution using denoising autoencoders. The quantitative analysis used three distributions from The Open University Learning Analytics dataset (OULAD). The obtained results show that the latent space-based method offers the best solution as it maintains accuracy while mitigating the bias of the prediction models. These results indicate that in the case of partial awareness, the latent space method is considered superior to the adversarial bias mitigation approach.</div></div>","PeriodicalId":34469,"journal":{"name":"Computers and Education Artificial Intelligence","volume":"7 ","pages":"Article 100300"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Education Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666920X24001036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

Researchers have observed the relationship between educational achievements and students' demographic characteristics in physical classroom-based learning. In the context of online education, recent studies were conducted to explore the leading factors of successful online courses. These studies also investigated how demographic features impact student achievement in the online learning environment. This motivates the use of demographic information alongside other features to predict students' academic performance. Since demographic features include protected attributes, such as gender and age, evaluating predictive models must go beyond minimizing the overall error. In this work, we analyze and investigate the use of neural networks to predict underperforming students in online courses. However, our goal is not only to enhance the accuracy but also to evaluate the fairness of the predictive models, a problem concerning the application of machine learning in education. This paper starts by analyzing the available solutions to fairness in predictive models: bias mitigation with pre-processing and in-processing methods. We show that the current evaluation is missing the case of partial awareness of protected features, which is the case when the model is aware of bias on some protected attributes but not all. The in-processing method, specifically the adversarial bias mitigation, shows that debiasing in some protected features exacerbates the bias on other protected features. This observation motivates our proposal of an alternative approach to enhance bias mitigation even in the partial awareness scenario by working with latent space. We implement the proposed solution using denoising autoencoders. The quantitative analysis used three distributions from The Open University Learning Analytics dataset (OULAD). The obtained results show that the latent space-based method offers the best solution as it maintains accuracy while mitigating the bias of the prediction models. These results indicate that in the case of partial awareness, the latent space method is considered superior to the adversarial bias mitigation approach.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
减少潜空间偏差,预测问题学生
研究人员观察了基于物理课堂学习的教育成就与学生人口特征之间的关系。在在线教育方面,最近开展了一些研究,以探索成功在线课程的主导因素。这些研究还调查了人口特征如何影响学生在在线学习环境中的成绩。这就促使我们使用人口统计学信息和其他特征来预测学生的学业成绩。由于人口统计学特征包括受保护的属性,如性别和年龄,因此评估预测模型必须超越最小化整体误差的范畴。在这项工作中,我们分析并研究了如何利用神经网络来预测在线课程中表现不佳的学生。然而,我们的目标不仅是提高准确性,还要评估预测模型的公平性,这是机器学习在教育领域应用的一个难题。本文首先分析了预测模型公平性的现有解决方案:通过预处理和内处理方法减轻偏差。我们发现,目前的评估缺少对部分受保护特征的认识,即模型能认识到某些受保护属性的偏差,但不能认识到所有属性的偏差。内处理方法,特别是对抗性偏差缓解方法表明,某些受保护特征的去偏差会加剧其他受保护特征的偏差。这一观察结果促使我们提出了一种替代方法,通过使用潜空间来加强偏差缓解,即使在部分感知场景下也是如此。我们使用去噪自动编码器来实现所提出的解决方案。定量分析使用了开放大学学习分析数据集(OULAD)中的三种分布。结果表明,基于潜在空间的方法提供了最佳解决方案,因为它既能保持准确性,又能减轻预测模型的偏差。这些结果表明,在部分认知的情况下,潜空间方法被认为优于对抗性偏差缓解方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
16.80
自引率
0.00%
发文量
66
审稿时长
50 days
期刊最新文献
Enhancing data analysis and programming skills through structured prompt training: The impact of generative AI in engineering education Understanding the practices, perceptions, and (dis)trust of generative AI among instructors: A mixed-methods study in the U.S. higher education Technological self-efficacy and sense of coherence: Key drivers in teachers' AI acceptance and adoption The influence of AI literacy on complex problem-solving skills through systematic thinking skills and intuition thinking skills: An empirical study in Thai gen Z accounting students Psychometrics of an Elo-based large-scale online learning system
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1