{"title":"AACEM:混合代码的情感自动注释和分类","authors":"Asia Samreen , Syed Asif Ali , Hina Shakir","doi":"10.1016/j.simpa.2024.100626","DOIUrl":null,"url":null,"abstract":"<div><p>This paper presents a framework for automatic creation of an emotions-labeled dataset specifically designed for short texts written in a blend of Roman Urdu and English, and addresses the inherent absence of distinct structure in Roman Urdu language. The software development is carried out in two key phases. During the first phase, cleaning and automatic annotation of raw text is performed and in the second phase, classification of emotions along with prediction is carried out. The developed software significantly simplifies the process of dataset creation by employing natural language processing (NLP) techniques, tailored for the mixed-codes.</p></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"19 ","pages":"Article 100626"},"PeriodicalIF":1.3000,"publicationDate":"2024-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2665963824000149/pdfft?md5=2f00d3a4c8d3114d2d8f044a626cb533&pid=1-s2.0-S2665963824000149-main.pdf","citationCount":"0","resultStr":"{\"title\":\"AACEM: Automatic Annotation and Classification of Emotions for mixed-codes\",\"authors\":\"Asia Samreen , Syed Asif Ali , Hina Shakir\",\"doi\":\"10.1016/j.simpa.2024.100626\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This paper presents a framework for automatic creation of an emotions-labeled dataset specifically designed for short texts written in a blend of Roman Urdu and English, and addresses the inherent absence of distinct structure in Roman Urdu language. The software development is carried out in two key phases. During the first phase, cleaning and automatic annotation of raw text is performed and in the second phase, classification of emotions along with prediction is carried out. The developed software significantly simplifies the process of dataset creation by employing natural language processing (NLP) techniques, tailored for the mixed-codes.</p></div>\",\"PeriodicalId\":29771,\"journal\":{\"name\":\"Software Impacts\",\"volume\":\"19 \",\"pages\":\"Article 100626\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-02-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2665963824000149/pdfft?md5=2f00d3a4c8d3114d2d8f044a626cb533&pid=1-s2.0-S2665963824000149-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Software Impacts\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2665963824000149\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Software Impacts","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2665963824000149","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
AACEM: Automatic Annotation and Classification of Emotions for mixed-codes
This paper presents a framework for automatic creation of an emotions-labeled dataset specifically designed for short texts written in a blend of Roman Urdu and English, and addresses the inherent absence of distinct structure in Roman Urdu language. The software development is carried out in two key phases. During the first phase, cleaning and automatic annotation of raw text is performed and in the second phase, classification of emotions along with prediction is carried out. The developed software significantly simplifies the process of dataset creation by employing natural language processing (NLP) techniques, tailored for the mixed-codes.