利用机器学习确定最佳特征,解决电子票务中的复杂数据集问题

IF 0.8 Q3 MULTIDISCIPLINARY SCIENCES Malaysian Journal of Fundamental and Applied Sciences Pub Date : 2023-12-04 DOI:10.11113/mjfas.v19n6.2921
Siti Zulaikha Mohd Jamaludin, Majid Khan Majahar Ali, Eric Shiung Wong Vun, Mohd. Tahir Ismail, Noor Farizah Ibrahim
{"title":"利用机器学习确定最佳特征,解决电子票务中的复杂数据集问题","authors":"Siti Zulaikha Mohd Jamaludin, Majid Khan Majahar Ali, Eric Shiung Wong Vun, Mohd. Tahir Ismail, Noor Farizah Ibrahim","doi":"10.11113/mjfas.v19n6.2921","DOIUrl":null,"url":null,"abstract":"e-ticketing is one of the common applications used in technical support in Information Technology (IT) and has been used worldwide in any field of company. The benefits of e-ticketing can reduce the human efforts, increase the sufficiency of system and provides the benefits and efficiency to the customers. Also, e-ticketing ticketing system can enhance worker safety, improving productivity, increasing project efficiency and cause the good impact on the performance of the business in terms of profitability. The main objective of this study is defining the model performance in each important feature by analysing the complex dataset by using logistic regression as a Machine Learning (ML) algorithm. In evaluation the performance of classifier, the dataset is injected to Python programming and split into 90% as training set and 10% for the testing set. From the analysis, the study found that only 3 out of 11 independent features in dataset that are relevant chosen to proceed the ML analysis. From the result, the accuracy for sct_short_description, sct_cmdb_ci, and sct_assignment_group is 41.65%, 48.77% and 96.49%, respectively. It showed that the accuracy’s result for the sct_assignment_group resulted that the model is very good accuracy and indicate that the model is well performing. Meanwhile, the value of F1-score is 96.11% in each feature. This result indicates that the model has a good balance of precision and recall in its binary classification predictions. Hence, the study considers the sct_assignment_group as a best features to proceed the analysis. The future study will consider dealing the combination of complexity features by implementing more analysis on ML such as Support Vector Machines and Naïve Bayes.","PeriodicalId":18149,"journal":{"name":"Malaysian Journal of Fundamental and Applied Sciences","volume":"67 4","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Solving Complexity Dataset in e-Ticketing using Machine Learning to Determine Optimum Feature\",\"authors\":\"Siti Zulaikha Mohd Jamaludin, Majid Khan Majahar Ali, Eric Shiung Wong Vun, Mohd. Tahir Ismail, Noor Farizah Ibrahim\",\"doi\":\"10.11113/mjfas.v19n6.2921\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"e-ticketing is one of the common applications used in technical support in Information Technology (IT) and has been used worldwide in any field of company. The benefits of e-ticketing can reduce the human efforts, increase the sufficiency of system and provides the benefits and efficiency to the customers. Also, e-ticketing ticketing system can enhance worker safety, improving productivity, increasing project efficiency and cause the good impact on the performance of the business in terms of profitability. The main objective of this study is defining the model performance in each important feature by analysing the complex dataset by using logistic regression as a Machine Learning (ML) algorithm. In evaluation the performance of classifier, the dataset is injected to Python programming and split into 90% as training set and 10% for the testing set. From the analysis, the study found that only 3 out of 11 independent features in dataset that are relevant chosen to proceed the ML analysis. From the result, the accuracy for sct_short_description, sct_cmdb_ci, and sct_assignment_group is 41.65%, 48.77% and 96.49%, respectively. It showed that the accuracy’s result for the sct_assignment_group resulted that the model is very good accuracy and indicate that the model is well performing. Meanwhile, the value of F1-score is 96.11% in each feature. This result indicates that the model has a good balance of precision and recall in its binary classification predictions. Hence, the study considers the sct_assignment_group as a best features to proceed the analysis. The future study will consider dealing the combination of complexity features by implementing more analysis on ML such as Support Vector Machines and Naïve Bayes.\",\"PeriodicalId\":18149,\"journal\":{\"name\":\"Malaysian Journal of Fundamental and Applied Sciences\",\"volume\":\"67 4\",\"pages\":\"\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2023-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Malaysian Journal of Fundamental and Applied Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.11113/mjfas.v19n6.2921\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Malaysian Journal of Fundamental and Applied Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11113/mjfas.v19n6.2921","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

电子票务是信息技术(IT)技术支持中常用的应用之一,已在世界范围内的各个领域得到广泛应用。电子票务的优势在于减少人力劳动,提高系统的充分性,为客户提供效益和效率。此外,电子票务系统可以提高工人的安全,提高生产力,提高项目效率,并在盈利方面对企业的绩效产生良好的影响。本研究的主要目标是通过使用逻辑回归作为机器学习(ML)算法分析复杂数据集来定义每个重要特征的模型性能。在评估分类器性能时,将数据集注入Python编程,并将其分成90%作为训练集,10%作为测试集。从分析中,研究发现数据集中11个相关的独立特征中只有3个被选择进行ML分析。从结果来看,sct_short_description、sct_cmdb_ci和sct_assignment_group的准确率分别为41.65%、48.77%和96.49%。结果表明,sct_assignment_group的精度结果表明该模型具有很好的精度,表明该模型具有良好的性能。同时,各特征的F1-score值为96.11%。结果表明,该模型在二元分类预测中具有较好的准确率和召回率平衡。因此,本研究认为sct_assignment_group是进行分析的最佳特征。未来的研究将考虑通过对ML(如支持向量机和Naïve贝叶斯)进行更多分析来处理复杂性特征的组合。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Solving Complexity Dataset in e-Ticketing using Machine Learning to Determine Optimum Feature
e-ticketing is one of the common applications used in technical support in Information Technology (IT) and has been used worldwide in any field of company. The benefits of e-ticketing can reduce the human efforts, increase the sufficiency of system and provides the benefits and efficiency to the customers. Also, e-ticketing ticketing system can enhance worker safety, improving productivity, increasing project efficiency and cause the good impact on the performance of the business in terms of profitability. The main objective of this study is defining the model performance in each important feature by analysing the complex dataset by using logistic regression as a Machine Learning (ML) algorithm. In evaluation the performance of classifier, the dataset is injected to Python programming and split into 90% as training set and 10% for the testing set. From the analysis, the study found that only 3 out of 11 independent features in dataset that are relevant chosen to proceed the ML analysis. From the result, the accuracy for sct_short_description, sct_cmdb_ci, and sct_assignment_group is 41.65%, 48.77% and 96.49%, respectively. It showed that the accuracy’s result for the sct_assignment_group resulted that the model is very good accuracy and indicate that the model is well performing. Meanwhile, the value of F1-score is 96.11% in each feature. This result indicates that the model has a good balance of precision and recall in its binary classification predictions. Hence, the study considers the sct_assignment_group as a best features to proceed the analysis. The future study will consider dealing the combination of complexity features by implementing more analysis on ML such as Support Vector Machines and Naïve Bayes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
1.40
自引率
0.00%
发文量
45
期刊最新文献
A Review on Synthesis and Physicochemical Properties-Photocatalytic Activity Relationships of Carbon Quantum Dots Graphitic Carbon Nitride in Reduction of Carbon Dioxide A Multi-Criteria Generalised L-R Intuitionistic Fuzzy TOPSIS with CRITIC for River Water Pollution Classification Phytochemical Screening and Antioxidant Activities of Geniotrigona thoracica Propolis Extracts Derived from Different Locations in Malaysia Two-Dimensional Heavy Metal Migration in Soil with Adsorption and Instantaneous Injection Fuzzy Intuitionistic Alpha-cut Interpolation Rational Bézier Curve Modeling for Shoreline Island Data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1