Romi Fadillah Rahmat, Aina Hubby Aziira, Sarah Purnamawati, Yunita Marito Pane, Sharfina Faza, None Al-Khowarizm, Farhad Nadi
{"title":"基于互信息与支持向量机的印尼网络犯罪案件分类","authors":"Romi Fadillah Rahmat, Aina Hubby Aziira, Sarah Purnamawati, Yunita Marito Pane, Sharfina Faza, None Al-Khowarizm, Farhad Nadi","doi":"10.18280/ijsse.130507","DOIUrl":null,"url":null,"abstract":"In Indonesia, the process of identifying and categorizing cyberlaw infringements traditionally involves manual procedures administered by experts, lawyers, or law enforcement personnel. This study introduces a method to enhance the analysis and processing of case chronological data through the application of text mining. Using the Support Vector Machine for classification, alongside feature extraction both with and without Mutual Information, the study aims to automate the classification of cybercrime cases. The preprocessing phase encompasses text cleaning, case folding, stop word removal, stemming, and tokenization and weighting with TF-IDF. The model achieved an accuracy rate of 95.45% during evaluation and 91.42% when tested on 35 data points with 1500 selected features. This performance surpasses the classification accuracy obtained in previous research.","PeriodicalId":37802,"journal":{"name":"International Journal of Safety and Security Engineering","volume":"113 32","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classifying Indonesian Cyber Crime Cases under ITE Law Using a Hybrid of Mutual Information and Support Vector Machine\",\"authors\":\"Romi Fadillah Rahmat, Aina Hubby Aziira, Sarah Purnamawati, Yunita Marito Pane, Sharfina Faza, None Al-Khowarizm, Farhad Nadi\",\"doi\":\"10.18280/ijsse.130507\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In Indonesia, the process of identifying and categorizing cyberlaw infringements traditionally involves manual procedures administered by experts, lawyers, or law enforcement personnel. This study introduces a method to enhance the analysis and processing of case chronological data through the application of text mining. Using the Support Vector Machine for classification, alongside feature extraction both with and without Mutual Information, the study aims to automate the classification of cybercrime cases. The preprocessing phase encompasses text cleaning, case folding, stop word removal, stemming, and tokenization and weighting with TF-IDF. The model achieved an accuracy rate of 95.45% during evaluation and 91.42% when tested on 35 data points with 1500 selected features. This performance surpasses the classification accuracy obtained in previous research.\",\"PeriodicalId\":37802,\"journal\":{\"name\":\"International Journal of Safety and Security Engineering\",\"volume\":\"113 32\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Safety and Security Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18280/ijsse.130507\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Engineering\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Safety and Security Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18280/ijsse.130507","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
Classifying Indonesian Cyber Crime Cases under ITE Law Using a Hybrid of Mutual Information and Support Vector Machine
In Indonesia, the process of identifying and categorizing cyberlaw infringements traditionally involves manual procedures administered by experts, lawyers, or law enforcement personnel. This study introduces a method to enhance the analysis and processing of case chronological data through the application of text mining. Using the Support Vector Machine for classification, alongside feature extraction both with and without Mutual Information, the study aims to automate the classification of cybercrime cases. The preprocessing phase encompasses text cleaning, case folding, stop word removal, stemming, and tokenization and weighting with TF-IDF. The model achieved an accuracy rate of 95.45% during evaluation and 91.42% when tested on 35 data points with 1500 selected features. This performance surpasses the classification accuracy obtained in previous research.
期刊介绍:
The International Journal of Safety and Security Engineering aims to provide a forum for the publication of papers on the most recent developments in the theoretical and practical aspects of these important fields. Safety and Security Engineering, due to its special nature, is an interdisciplinary area of research and applications that brings together in a systematic way many disciplines of engineering, from the traditional to the most technologically advanced. The Journal covers areas such as crisis management; security engineering; natural disasters and emergencies; terrorism; IT security; man-made hazards; risk management; control; protection and mitigation issues. The Journal aims to attract papers in all related fields, in addition to those listed under the List of Topics, as well as case studies describing practical experiences. The study of multifactor risk impact will be given special emphasis. Due to the multitude and variety of topics included, the List is only indicative of the themes of the expected papers. Authors are encouraged to submit papers in all areas of Safety and Security, with particular attention to integrated and interdisciplinary aspects.