Christian Nwankwo, H. Wimmer, Lei Chen, Jongyeop Kim
{"title":"Text Classification of Digital Forensic Data","authors":"Christian Nwankwo, H. Wimmer, Lei Chen, Jongyeop Kim","doi":"10.1109/IEMCON51383.2020.9284913","DOIUrl":null,"url":null,"abstract":"This research aims to propose a model to classify text messages that extracted from the smart phone using forensic software and several machine learning algorithms. The data analysis procedure subdivided into physical extraction, relevant partitions, logical extraction, digital forensic analysis, and text classification. In the text classification step, the final result derived by applying sentiment analysis and k-means clustering algorithm under the control of python application. Through this model, we were able to classify most of the messages correctly as either being positive or negative.","PeriodicalId":6871,"journal":{"name":"2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","volume":"21 1","pages":"0661-0667"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCON51383.2020.9284913","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This research aims to propose a model to classify text messages that extracted from the smart phone using forensic software and several machine learning algorithms. The data analysis procedure subdivided into physical extraction, relevant partitions, logical extraction, digital forensic analysis, and text classification. In the text classification step, the final result derived by applying sentiment analysis and k-means clustering algorithm under the control of python application. Through this model, we were able to classify most of the messages correctly as either being positive or negative.