A Novel Approach to Train Diverse Types of Language Models for Health Mention Classification of Tweets

Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society). Pub Date: 2022-04-13. DOI: 10.48550/arXiv.2204.06337

Pervaiz Iqbal Khan, Imran Razzak, A. Dengel, Sheraz Ahmed

Volume: 157 1. Pages: 136-147. Publication type: Journal Article. Platform: Semantic Scholar.

Citations: 1

Abstract

Health mention classification is the task of deciding whether a text that contains disease words actually reports a health condition. Non-health and figurative uses of disease words make the task harder. Adversarial training, which acts as a form of regularization, has recently gained popularity in many NLP tasks. In this paper, we propose a novel approach to training language models for health mention classification of tweets that uses adversarial training. We generate adversarial examples by using Gaussian noise to perturb the representations that transformer models compute for tweets, at various levels of the model. Further, we employ a contrastive loss as an additional objective function. We evaluate the proposed method on an extended version of the PHM2017 dataset. Results show that the proposed approach significantly improves classifier performance over the baseline methods. Moreover, our analysis shows that adding noise at earlier layers improves model performance, whereas adding noise at intermediate layers degrades it. Finally, adding noise towards the final layers performs better than adding it at the middle layers.
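
The two ingredients named in the abstract, Gaussian perturbation of a representation at a chosen layer and a contrastive term between clean and perturbed representations, can be illustrated with a small dependency-free sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy `forward` encoder (a stack of tanh layers standing in for transformer blocks), the InfoNCE-style form of `contrastive_loss`, and the values of `sigma` and `temperature`. In a full training setup this term would presumably be added to the usual classification loss.

```python
import math
import random


def forward(x, weights, noise_layer=None, sigma=0.1, rng=None):
    """Toy encoder: a stack of tanh layers standing in for transformer blocks.

    If noise_layer == i, zero-mean Gaussian noise is added to the
    representation entering layer i (0 means the input/embedding level).
    """
    rng = rng or random
    h = list(x)
    for i, w in enumerate(weights):
        if noise_layer == i:
            h = [v + rng.gauss(0.0, sigma) for v in h]
        h = [math.tanh(sum(wij * hj for wij, hj in zip(row, h))) for row in w]
    return h


def cosine(a, b):
    """Cosine similarity with a small guard against zero-length vectors."""
    dot = sum(p * q for p, q in zip(a, b))
    na = math.sqrt(sum(p * p for p in a))
    nb = math.sqrt(sum(q * q for q in b))
    return dot / max(na * nb, 1e-12)


def contrastive_loss(clean, noisy, temperature=0.5):
    """InfoNCE-style loss (an assumed form): each clean vector should be most
    similar to its own perturbed copy among all perturbed vectors in the batch."""
    total = 0.0
    for i, c in enumerate(clean):
        logits = [cosine(c, n) / temperature for n in noisy]
        m = max(logits)  # subtract the max for a stable log-sum-exp
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]
    return total / len(clean)


# Hypothetical toy data: random weights and random "tweet" vectors.
rng = random.Random(0)
dim, depth, batch = 8, 4, 5
weights = [[[rng.gauss(0, 0.5) for _ in range(dim)] for _ in range(dim)]
           for _ in range(depth)]
inputs = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(batch)]

clean = [forward(x, weights) for x in inputs]
for layer in range(depth):
    noisy = [forward(x, weights, noise_layer=layer, sigma=0.1, rng=rng)
             for x in inputs]
    loss = contrastive_loss(clean, noisy)
    print(f"noise at layer {layer}: contrastive loss = {loss:.4f}")
```

The loop varies only the layer at which noise is injected, which mirrors the kind of layer-wise comparison the abstract describes; the numbers it prints come from random toy weights and say nothing about the paper's results.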
