Frismanda, Agustinus Bimo Gumelar, Derry Pramono Adi, Eman Setiawan, Agung Widodo, M. T. Sulistyono
{"title":"有毒语音分类的机器学习性能比较:印度尼西亚的在线发薪日贷款诈骗","authors":"Frismanda, Agustinus Bimo Gumelar, Derry Pramono Adi, Eman Setiawan, Agung Widodo, M. T. Sulistyono","doi":"10.1109/iSemantic50169.2020.9234259","DOIUrl":null,"url":null,"abstract":"The recent advancement of Machine Learning (ML) has brought us to many implementations. Online payday loan scam is a phenomenon which interestingly containing toxic speech in conversation. Toxic speech means implying threat toxic speech, offensive language, and hate speech. toxic speech would ultimately trigger such responses, namely loss of work ethic, alienation from the social, even suicidal thought. Despite the unnerving impact of toxic speech, there is still little known research regarding toxic speech, one of them is how to classify toxic speech. This research aims to make a comparison of various ML techniques with the means of classifying toxic speech found in the online payday loan scam phenomenon. For this experiment, we employed Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), Random Forest (RF), and k-Nearest Neighbour (k-NN). All data were taken, filtered, and normalized manually from YouTube. Many reported the incident of online payday loan scam via YouTube in the form of two-way call communication. In total, there are 79 fraud report records converted into *.wav files, followed by the feature extraction process using openSMILE, and are classified using machine learning. We get the MLP result which has an acquisition value of 97.9%, below that received SVM 97.2%.","PeriodicalId":345558,"journal":{"name":"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning Performance Comparison for Toxic Speech Classification : Online Payday Loan Scams in Indonesia\",\"authors\":\"Frismanda, Agustinus Bimo Gumelar, Derry Pramono Adi, Eman Setiawan, Agung Widodo, M. T. Sulistyono\",\"doi\":\"10.1109/iSemantic50169.2020.9234259\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The recent advancement of Machine Learning (ML) has brought us to many implementations. Online payday loan scam is a phenomenon which interestingly containing toxic speech in conversation. Toxic speech means implying threat toxic speech, offensive language, and hate speech. toxic speech would ultimately trigger such responses, namely loss of work ethic, alienation from the social, even suicidal thought. Despite the unnerving impact of toxic speech, there is still little known research regarding toxic speech, one of them is how to classify toxic speech. This research aims to make a comparison of various ML techniques with the means of classifying toxic speech found in the online payday loan scam phenomenon. For this experiment, we employed Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), Random Forest (RF), and k-Nearest Neighbour (k-NN). All data were taken, filtered, and normalized manually from YouTube. Many reported the incident of online payday loan scam via YouTube in the form of two-way call communication. In total, there are 79 fraud report records converted into *.wav files, followed by the feature extraction process using openSMILE, and are classified using machine learning. We get the MLP result which has an acquisition value of 97.9%, below that received SVM 97.2%.\",\"PeriodicalId\":345558,\"journal\":{\"name\":\"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iSemantic50169.2020.9234259\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSemantic50169.2020.9234259","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Machine Learning Performance Comparison for Toxic Speech Classification : Online Payday Loan Scams in Indonesia
The recent advancement of Machine Learning (ML) has brought us to many implementations. Online payday loan scam is a phenomenon which interestingly containing toxic speech in conversation. Toxic speech means implying threat toxic speech, offensive language, and hate speech. toxic speech would ultimately trigger such responses, namely loss of work ethic, alienation from the social, even suicidal thought. Despite the unnerving impact of toxic speech, there is still little known research regarding toxic speech, one of them is how to classify toxic speech. This research aims to make a comparison of various ML techniques with the means of classifying toxic speech found in the online payday loan scam phenomenon. For this experiment, we employed Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), Random Forest (RF), and k-Nearest Neighbour (k-NN). All data were taken, filtered, and normalized manually from YouTube. Many reported the incident of online payday loan scam via YouTube in the form of two-way call communication. In total, there are 79 fraud report records converted into *.wav files, followed by the feature extraction process using openSMILE, and are classified using machine learning. We get the MLP result which has an acquisition value of 97.9%, below that received SVM 97.2%.