FL-XGBTC: federated learning inspired with XG-boost tuned classifier for YouTube spam content detection
Authors: Vandana Sharma, Anurag Sinha, Ahmed Alkhayyat, Ankit Agarwal, Peddi Nikitha, Sable Ramkumar, Tripti Rathee, Mopuru Bhargavi, Nitish Kumar
Journal: International Journal of System Assurance Engineering and Management (JCR Q2, Engineering, Multidisciplinary; impact factor 1.6)
Publication date: 2024-09-14
Publication type: Journal Article
DOI: 10.1007/s13198-024-02502-9 (https://doi.org/10.1007/s13198-024-02502-9)
Citations: 0
Abstract
Spam in YouTube comments is a persistent problem, and detecting it is critical to maintaining the quality of the user experience on the platform. In this study, we propose a Federated Learning Inspired XG-Boost Tuned Classifier, FL-XGBTC, for YouTube spam content detection. The proposed model leverages federated learning, which enables a model to be trained collaboratively across multiple devices without sharing raw data. FL-XGBTC is based on XGBoost, a powerful and widely used ensemble learning algorithm for classification tasks. The model was trained on a large and diverse dataset of YouTube comments that includes both spam and non-spam comments. The results show that FL-XGBTC detects spam in YouTube comments with high accuracy, outperforming several baseline models. The model also preserves user privacy, a critical consideration in modern machine-learning applications. Overall, FL-XGBTC offers a promising solution for YouTube spam content detection that combines the benefits of federated learning and ensemble learning. The major contribution of this work is a framework for a distributed federated classifier that performs multiscale classification of YouTube spam comments using ensemble learning.
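The federated idea described in the abstract, training collaboratively without sharing raw data, can be illustrated with a minimal sketch. This is not the FL-XGBTC method: as a stated assumption, the local learner here is a multinomial Naive Bayes classifier instead of XGBoost, chosen because its parameters (token counts) can be summed exactly on a server. The function names `local_update`, `aggregate`, and `predict`, and the toy comments, are hypothetical.

```python
import math
from collections import Counter

def local_update(comments):
    """Runs on one client: reduce private (text, label) pairs to token counts."""
    token_counts = {0: Counter(), 1: Counter()}
    class_counts = Counter()
    for text, label in comments:
        class_counts[label] += 1
        token_counts[label].update(text.lower().split())
    # Only these count summaries leave the client, never the raw comments.
    return token_counts, class_counts

def aggregate(updates):
    """Runs on the server: sum per-client counts into one global model."""
    global_tokens = {0: Counter(), 1: Counter()}
    global_classes = Counter()
    for token_counts, class_counts in updates:
        for label in (0, 1):
            global_tokens[label].update(token_counts[label])
        global_classes.update(class_counts)
    return global_tokens, global_classes

def predict(model, text):
    """Multinomial Naive Bayes with add-one smoothing: 1 = spam, 0 = not spam."""
    token_counts, class_counts = model
    vocab = set(token_counts[0]) | set(token_counts[1])
    total_docs = sum(class_counts.values())
    scores = {}
    for label in (0, 1):
        # Smoothed log prior plus smoothed log likelihood of each token.
        score = math.log((class_counts[label] + 1) / (total_docs + 2))
        denom = sum(token_counts[label].values()) + len(vocab)
        for tok in text.lower().split():
            score += math.log((token_counts[label][tok] + 1) / denom)
        scores[label] = score
    return 1 if scores[1] > scores[0] else 0

# Hypothetical toy data: two clients, each holding its own labelled comments.
client_a = [("subscribe to my channel for free gifts", 1), ("great video thanks", 0)]
client_b = [("free gifts click my channel", 1), ("loved the editing in this video", 0)]

model = aggregate([local_update(client_a), local_update(client_b)])
```

Because the counts are additive, the aggregated model is identical to one trained on the pooled data, which is what makes Naive Bayes a convenient stand-in here. Gradient-boosted trees do not aggregate this simply, so a federated XGBoost setup would need a different exchange protocol than the one sketched above.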
About the journal:
This journal was established to cater to the increased awareness of high-quality research in the seamless integration of heterogeneous technologies to formulate bankable solutions to emergent complex engineering problems.
Assurance engineering can be thought of as providing higher confidence in the reliable and secure implementation of a system's critical characteristic features through a holistic approach that uses a wide variety of cross-disciplinary tools and techniques. Successful realization of sustainable and dependable products, systems and services involves extensive adoption of Reliability, Quality, Safety and Risk-related procedures for achieving high assurance levels of performance; also pivotal are the management issues related to risk and uncertainty that govern the practical constraints encountered in their deployment. It is our intention to provide a platform for the modeling and analysis of large engineering systems, among the other aforementioned allied goals of systems assurance engineering, leading to the enforcement of performance enhancement measures. Achieving a fine balance between theory and practice is the primary focus. The journal only publishes high-quality papers that have passed the rigorous peer review procedure of an archival scientific journal. The aim is an increasing number of submissions, wide circulation and a high impact factor.