{"title":"Doctoral Consortium of WSDM'22: Exploring the Bias of Adversarial Defenses","authors":"Han Xu","doi":"10.1145/3488560.3502215","DOIUrl":null,"url":null,"abstract":"Deep neural networks (DNNs) have achieved extraordinary accomplishments on various machine learning tasks. However, the existence of adversarial attacks still raise great concerns when they are adopted to safety-critical tasks. As countermeasures to protect DNN models against adversarial attacks, there are various defense strategies proposed. However, we find that the robustness (\"safety'') provided by the robust training algorithms usually result unequal performance either among classes or sub-populations across the whole data distribution. For example, the model can achieve extremely low accuracy / robustness on certain groups of data. As a result, the safety of the model is still under great threats. As a summary, our project is about to study the bias problems of robust trained neural networks from different perspectives, which aims to build eventually reliable and safe deep learning models. We propose to present our research works in the Doctoral Consortium in WSDM'22 and gain opportunities to share our contribution to the relate problems.","PeriodicalId":348686,"journal":{"name":"Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining","volume":"150 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3488560.3502215","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
Deep neural networks (DNNs) have achieved extraordinary accomplishments on various machine learning tasks. However, the existence of adversarial attacks still raises great concerns when DNNs are deployed in safety-critical tasks. As countermeasures to protect DNN models against adversarial attacks, various defense strategies have been proposed. However, we find that the robustness ("safety") provided by robust training algorithms usually results in unequal performance, either among classes or among sub-populations across the data distribution. For example, the model can achieve extremely low accuracy or robustness on certain groups of data. As a result, the safety of the model remains under great threat. In summary, our project studies the bias problems of robustly trained neural networks from different perspectives, with the aim of eventually building reliable and safe deep learning models. We propose to present our research in the Doctoral Consortium at WSDM'22 and gain the opportunity to share our contributions to the related problems.
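The class-wise disparity described above can be made concrete by measuring robust accuracy separately for each class. Below is a minimal, illustrative sketch in PyTorch (not the code used in this work): it perturbs inputs with a standard L-infinity PGD attack and reports per-class robust accuracy, which is how the gap between well-protected and poorly-protected classes would surface. The model, data, and attack parameters (eps, alpha, steps) are placeholder assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F


def pgd_attack(model, x, y, eps=8 / 255, alpha=2 / 255, steps=10):
    # Standard L-infinity PGD: start from a random point in the eps-ball,
    # repeatedly ascend the loss, and project back into the ball.
    x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1).detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps).clamp(0, 1)
    return x_adv.detach()


def classwise_robust_accuracy(model, loader, num_classes, **attack_kwargs):
    # Robust accuracy broken down by ground-truth class, so that
    # disparities across classes become visible.
    correct = torch.zeros(num_classes)
    total = torch.zeros(num_classes)
    model.eval()
    for x, y in loader:
        x_adv = pgd_attack(model, x, y, **attack_kwargs)
        pred = model(x_adv).argmax(dim=1)
        for c in range(num_classes):
            mask = y == c
            correct[c] += (pred[mask] == c).sum()
            total[c] += mask.sum()
    return correct / total.clamp(min=1)


if __name__ == "__main__":
    # Toy setup with random "images" and labels, only to exercise the functions;
    # in practice the loader would hold a real test set and a robustly trained model.
    num_classes = 10
    model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, num_classes))
    x = torch.rand(64, 3, 32, 32)
    y = torch.randint(0, num_classes, (64,))
    loader = [(x, y)]
    per_class = classwise_robust_accuracy(model, loader, num_classes)
    print("Per-class robust accuracy:", per_class.tolist())

On a real robustly trained model, the printed per-class accuracies would typically be far from uniform, which is exactly the unequal "safety" the project investigates.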