数据挖掘最新文献_第2页

Research on KCF Method of Scale Adaptation and Anti-Occlusion 尺度适应与抗遮挡的KCF方法研究

数据挖掘

Pub Date : 2023-01-01 DOI: 10.12677/hjdm.2023.132013

虓堃闫

引用次数: 1

Fast Attribute Reduction Algorithm Based on Maximum Decision Entropy 基于最大决策熵的快速属性约简算法

数据挖掘

Pub Date : 2023-01-01 DOI: 10.12677/hjdm.2023.133022

梅袁

In the era of big data, data

引用次数: 0

Analysis and Mining of 5G Potential Customers Based on Machine Learning 基于机器学习的5G潜在客户分析与挖掘

数据挖掘

Pub Date : 2023-01-01 DOI: 10.12677/hjdm.2023.132017

晓晴洪

With the continuous development and improvement of communication network engineering and new infrastructure technologies, China is gradually realizing the transition from a 4G society to a 5G society. 5G, with its technical advantages of low latency, large bandwidth and wide connectivity, has become an important technical background for the construction of smart cities and digital villages. In order to achieve the conditions for large-scale connectivity of 5G networks required for the construction of smart cities, a higher utilization rate of 5G users is required. Based on this problem, this paper obtains data from a mobile big data platform, builds a classification prediction model based on the prediction problem of potential 5G users, correctly identifies potential 5G users and makes accurate service recommendations to them, improves the 5G utilization rate in China, and promotes the rapid upgrade of the construction of new smart cities. The process of building the prediction model mainly includes data pre-processing, feature engineering, training and evaluation of the model. Firstly, data pre-processing and exploratory analysis were performed, and a series of pre-processing work including data cleaning, removal of unique value attributes, data transformation, etc. were carried out for the data, followed by variable screening of the features in the dataset of this paper through chi-square test, statistical t-test and Pearson correlation coefficient method, and 24 feature variables with high feature importance were screened out. Models were constructed based on the screened feature variables, including Random Forest model, CatBoost model, and LightGBM model, and parameter tuning was performed to find the optimal parameters. The models are built according to the obtained optimal parameters and tested by the test set, and the models are evaluated by accuracy, recall, and AUC value indexes, and the comparison reveals that the LightGBM model is generally better than other models for 5G potential user prediction. In addition, the importance scores of the features are obtained by the above model and ranked in importance. Through the method of this paper to achieve more accurate identification and mining of 5G potential users, operators can accordingly realize accurate marketing for different customers, promote more users to realize the transition from 4G to 5G, and accelerate the sustainable development of China’s 5G market and the construction of smart cities.

{"title":"Analysis and Mining of 5G Potential Customers Based on Machine Learning","authors":"晓晴洪","doi":"10.12677/hjdm.2023.132017","DOIUrl":"https://doi.org/10.12677/hjdm.2023.132017","url":null,"abstract":"With the continuous development and improvement of communication network engineering and new infrastructure technologies, China is gradually realizing the transition from a 4G society to a 5G society. 5G, with its technical advantages of low latency, large bandwidth and wide connectivity, has become an important technical background for the construction of smart cities and digital villages. In order to achieve the conditions for large-scale connectivity of 5G networks required for the construction of smart cities, a higher utilization rate of 5G users is required. Based on this problem, this paper obtains data from a mobile big data platform, builds a classification prediction model based on the prediction problem of potential 5G users, correctly identifies potential 5G users and makes accurate service recommendations to them, improves the 5G utilization rate in China, and promotes the rapid upgrade of the construction of new smart cities. The process of building the prediction model mainly includes data pre-processing, feature engineering, training and evaluation of the model. Firstly, data pre-processing and exploratory analysis were performed, and a series of pre-processing work including data cleaning, removal of unique value attributes, data transformation, etc. were carried out for the data, followed by variable screening of the features in the dataset of this paper through chi-square test, statistical t-test and Pearson correlation coefficient method, and 24 feature variables with high feature importance were screened out. Models were constructed based on the screened feature variables, including Random Forest model, CatBoost model, and LightGBM model, and parameter tuning was performed to find the optimal parameters. The models are built according to the obtained optimal parameters and tested by the test set, and the models are evaluated by accuracy, recall, and AUC value indexes, and the comparison reveals that the LightGBM model is generally better than other models for 5G potential user prediction. In addition, the importance scores of the features are obtained by the above model and ranked in importance. Through the method of this paper to achieve more accurate identification and mining of 5G potential users, operators can accordingly realize accurate marketing for different customers, promote more users to realize the transition from 4G to 5G, and accelerate the sustainable development of China’s 5G market and the construction of smart cities.","PeriodicalId":57348,"journal":{"name":"数据挖掘","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78723199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Researches on Feature Selection Algorithm for Generalized Multi-Granularity Rough Sets 广义多粒度粗糙集特征选择算法研究

数据挖掘

Pub Date : 2023-01-01 DOI: 10.12677/hjdm.2023.133021

晓敏梁

With the rapid development of information technology, a large amount of data has been generated

引用次数: 0

Analysis on Domestic Literature Related to Novel Coronavirus 国内新型冠状病毒相关文献分析

数据挖掘

Pub Date : 2023-01-01 DOI: 10.12677/hjdm.2023.131004

海鹏刘

引用次数: 0

Analysis of Old and New Insurance Contract Guidelines Based on Text Mining 基于文本挖掘的新旧保险合同指南分析

数据挖掘

Pub Date : 2023-01-01 DOI: 10.12677/hjdm.2023.133026

莉婷韩

The new insurance contract standard came into effect on January 1 this year, marking the further improvement of China’s accounting standards system for enterprises and maintaining continuous convergence with IFRS. Therefore, this paper uses the Python programming language to conduct

引用次数: 0

The Impact of Rural Population Aging on Agricultural Development in Guangdong Province 广东省农村人口老龄化对农业发展的影响

数据挖掘

Pub Date : 2023-01-01 DOI: 10.12677/hjdm.2023.131009

泽凯陈

引用次数: 0

Prediction Research on the Scale of Middle School Students in Jilin Province Based on BP Neural Network 基于BP神经网络的吉林省中学生规模预测研究

数据挖掘

Pub Date : 2023-01-01 DOI: 10.12677/hjdm.2023.132010

晓筝李

引用次数: 0

Competitive Intelligence Analysis of Real Es-tate Enterprises 房地产企业竞争情报分析

数据挖掘

Pub Date : 2023-01-01 DOI: 10.12677/hjdm.2023.131001

格竹周

引用次数: 0

Research and Implementation of Abnormal Product Identification Model Based on Stability 基于稳定性的异常产品识别模型的研究与实现

数据挖掘

Pub Date : 2023-01-01 DOI: 10.12677/hjdm.2023.133025

飞燕马

With the development of the economy, online shopping has gained widespread popularity in all aspects. Due to its advantages such as convenience, speed, time and effort saving, and door-to-door delivery, it is increasingly favored by people and has become an indispensable part of daily life. With the improvement of people’s economic ability and consumption level, the demand for on-line shopping experience is also increasing. At the same time, competition among major online retail businesses has become increasingly fierce. In order to attract consumers’ attention and increase product sales, some businesses have started to use “speculation” and “order” methods such as selling, positive reviews, and negative reviews to maliciously promote products, in-fringing on consumers’ rights and interests. To protect consumers’ right to know and choose, this project uses a dataset provided by Inspur Zhuosu Company to analyze the reasons for abnormal products through a combination of quantitative and qualitative data mining analysis. Mathematical modeling and machine learning methods are used to define some abnormal product indicators, and these indicators are used to construct a model for finding and predicting abnormal products. The experimental results indicate that the model has good performance and certain practicality.

{"title":"Research and Implementation of Abnormal Product Identification Model Based on Stability","authors":"飞燕马","doi":"10.12677/hjdm.2023.133025","DOIUrl":"https://doi.org/10.12677/hjdm.2023.133025","url":null,"abstract":"With the development of the economy, online shopping has gained widespread popularity in all aspects. Due to its advantages such as convenience, speed, time and effort saving, and door-to-door delivery, it is increasingly favored by people and has become an indispensable part of daily life. With the improvement of people’s economic ability and consumption level, the demand for on-line shopping experience is also increasing. At the same time, competition among major online retail businesses has become increasingly fierce. In order to attract consumers’ attention and increase product sales, some businesses have started to use “speculation” and “order” methods such as selling, positive reviews, and negative reviews to maliciously promote products, in-fringing on consumers’ rights and interests. To protect consumers’ right to know and choose, this project uses a dataset provided by Inspur Zhuosu Company to analyze the reasons for abnormal products through a combination of quantitative and qualitative data mining analysis. Mathematical modeling and machine learning methods are used to define some abnormal product indicators, and these indicators are used to construct a model for finding and predicting abnormal products. The experimental results indicate that the model has good performance and certain practicality.","PeriodicalId":57348,"journal":{"name":"数据挖掘","volume":"44 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84031610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0