Intelligent Data Analysis using Optimized Support Vector Machine Based Data Mining Approach for Tourism Industry

Ms Promila Sharma, Uma Meena, Girish Sharma
{"title":"Intelligent Data Analysis using Optimized Support Vector Machine Based Data Mining Approach for Tourism Industry","authors":"Ms Promila Sharma, Uma Meena, Girish Sharma","doi":"10.1145/3494566","DOIUrl":null,"url":null,"abstract":"Data analysis involves the deployment of sophisticated approaches from data mining methods, information theory, and artificial intelligence in various fields like tourism, hospitality, and so on for the extraction of knowledge from the gathered and preprocessed data. In tourism, pattern analysis or data analysis using classification is significant for finding the patterns that represent new and potentially useful information or knowledge about the destination and other data. Several data mining techniques are introduced for the classification of data or patterns. However, overfitting, less accuracy, local minima, sensitive to noise are the drawbacks in some existing data mining classification methods. To overcome these challenges, Support vector machine with Red deer optimization (SVM-RDO) based data mining strategy is proposed in this article. Extended Kalman filter (EKF) is utilized in the first phase, i.e., data cleaning to remove the noise and missing values from the input data. Mantaray foraging algorithm (MaFA) is used in the data selection phase, in which the significant data are selected for the further process to reduce the computational complexity. The final phase is the classification, in which SVM-RDO is proposed to access the useful pattern from the selected data. PYTHON is the implementation tool used for the experiment of the proposed model. The experimental analysis is done to show the efficacy of the proposed work. From the experimental results, the proposed SVM-RDO achieved better accuracy, precision, recall, and F1 score than the existing methods for the tourism dataset. Thus, it is showed the effectiveness of the proposed SVM-RDO for pattern analysis.","PeriodicalId":435653,"journal":{"name":"ACM Transactions on Knowledge Discovery from Data (TKDD)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Knowledge Discovery from Data (TKDD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3494566","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Data analysis involves the deployment of sophisticated approaches from data mining methods, information theory, and artificial intelligence in various fields like tourism, hospitality, and so on for the extraction of knowledge from the gathered and preprocessed data. In tourism, pattern analysis or data analysis using classification is significant for finding the patterns that represent new and potentially useful information or knowledge about the destination and other data. Several data mining techniques are introduced for the classification of data or patterns. However, overfitting, less accuracy, local minima, sensitive to noise are the drawbacks in some existing data mining classification methods. To overcome these challenges, Support vector machine with Red deer optimization (SVM-RDO) based data mining strategy is proposed in this article. Extended Kalman filter (EKF) is utilized in the first phase, i.e., data cleaning to remove the noise and missing values from the input data. Mantaray foraging algorithm (MaFA) is used in the data selection phase, in which the significant data are selected for the further process to reduce the computational complexity. The final phase is the classification, in which SVM-RDO is proposed to access the useful pattern from the selected data. PYTHON is the implementation tool used for the experiment of the proposed model. The experimental analysis is done to show the efficacy of the proposed work. From the experimental results, the proposed SVM-RDO achieved better accuracy, precision, recall, and F1 score than the existing methods for the tourism dataset. Thus, it is showed the effectiveness of the proposed SVM-RDO for pattern analysis.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于优化支持向量机的旅游业数据挖掘智能数据分析
数据分析涉及部署数据挖掘方法、信息论和人工智能等各个领域的复杂方法,如旅游、酒店等,以便从收集和预处理的数据中提取知识。在旅游业中,使用分类的模式分析或数据分析对于发现代表关于目的地和其他数据的新的和潜在有用的信息或知识的模式具有重要意义。介绍了几种用于数据或模式分类的数据挖掘技术。然而,现有的一些数据挖掘分类方法存在过拟合、精度不高、局部极小、对噪声敏感等缺点。为了克服这些挑战,本文提出了基于支持向量机与马鹿优化(SVM-RDO)的数据挖掘策略。在第一阶段,即数据清洗中使用扩展卡尔曼滤波(EKF)去除输入数据中的噪声和缺失值。在数据选择阶段采用Mantaray觅食算法(Mantaray foraging algorithm, MaFA),选取有意义的数据进行下一步处理,以降低计算复杂度。最后一个阶段是分类,提出SVM-RDO从选定的数据中访问有用的模式。PYTHON是用于所提议模型实验的实现工具。实验分析表明了所提方法的有效性。实验结果表明,SVM-RDO在旅游数据集上的准确率、精密度、查全率和F1分数均优于现有方法。验证了SVM-RDO模式分析的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
相关文献
Risk factors of ipsilateral breast tumor recurrence in triple-negative or HER2-positive breast cancer patients who achieved pathological complete response after neoadjuvant chemotherapy.
IF 45.3 1区 医学Journal of Clinical OncologyPub Date : 2020-05-25 DOI: 10.1200/jco.2020.38.15_suppl.e12599
Mizuho Tazo, Y. Kojima, A. Yoshida, Sayuka Nakayama, R. Tokui, T. Ogawa, T. Kuwayama, T. Nakayama, H. Yamauchi, K. Tsugawa, Seigo Nakamura, N. Hayashi, M. Ishitobi
Abstract P2-16-27: Risk factors of ipsilateral breast tumor recurrence in primary breast cancer patients who achieved pathological complete response after neoadjuvant chemotherapy
IF 11.2 1区 医学Cancer researchPub Date : 2020-02-15 DOI: 10.1158/1538-7445.sabcs19-p2-16-27
N. Matsuda, N. Hayashi, R. Tokui, T. Nakayama, H. Yamauchi, M. Ishitobi
Risk Factors Predictive of Recurrence and Progression for Patients Who Suffered Initial Recurrence After Transurethral Resection of Stage pT1 Bladder Tumor in Chinese Population: A Retrospective Study
IF 1.6 4区 医学MedicinePub Date : 2016-02-01 DOI: 10.1097/MD.0000000000002625
Zhonghua Shen, Linguo Xie, Tao Chen, Dawei Tian, Xiaoteng Liu, Hao Xu, Yu Zhang, Zhouliang Wu, N. Sha, Chen Xing, Na Ding, Hailong Hu, Chang-li Wu
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Machine Learning-based Short-term Rainfall Prediction from Sky Data Incremental Feature Spaces Learning with Label Scarcity Multi-objective Learning to Overcome Catastrophic Forgetting in Time-series Applications Combining Filtering and Cross-Correlation Efficiently for Streaming Time Series Segment-Wise Time-Varying Dynamic Bayesian Network with Graph Regularization
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1