Network Transmission Flags Data Affinity-based Classification by K-Nearest Neighbor

IF 1.2 Q3 MULTIDISCIPLINARY SCIENCES ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY Pub Date : 2022-04-25 DOI:10.14500/aro.10880
N. Aljojo
{"title":"Network Transmission Flags Data Affinity-based Classification by K-Nearest Neighbor","authors":"N. Aljojo","doi":"10.14500/aro.10880","DOIUrl":null,"url":null,"abstract":"Abstract—This research is concerned with the data generated during a network transmission session to understand how to extract value from the data generated and be able to conduct tasks. Instead of comparing all of the transmission flags for a transmission session at the same time to conduct any analysis, this paper conceptualized the influence of each transmission flag on network-aware applications by comparing the flags one by one on their impact to the application during the transmission session, rather than comparing all of the transmission flags at the same time. The K-nearest neighbor (KNN) type classification was used becauseit is a simple distance-based learning algorithm that remembers earlier training samples and is suitable for taking various flags withtheir effect on application protocols by comparing each new sample with the K-nearest points to make a decision. We used transmission session datasets received from Kaggle for IP flow with 87 features and 3.577.296 instances. We picked 13 features from the datasets and ran them through KNN. RapidMiner was used for the study, and the results of the experiments revealed that the KNN-based model was not only significantly more accurate in categorizing data, but it was also significantly more efficient due to the decreased processing costs.","PeriodicalId":8398,"journal":{"name":"ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2022-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14500/aro.10880","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 1

Abstract

Abstract—This research is concerned with the data generated during a network transmission session to understand how to extract value from the data generated and be able to conduct tasks. Instead of comparing all of the transmission flags for a transmission session at the same time to conduct any analysis, this paper conceptualized the influence of each transmission flag on network-aware applications by comparing the flags one by one on their impact to the application during the transmission session, rather than comparing all of the transmission flags at the same time. The K-nearest neighbor (KNN) type classification was used becauseit is a simple distance-based learning algorithm that remembers earlier training samples and is suitable for taking various flags withtheir effect on application protocols by comparing each new sample with the K-nearest points to make a decision. We used transmission session datasets received from Kaggle for IP flow with 87 features and 3.577.296 instances. We picked 13 features from the datasets and ran them through KNN. RapidMiner was used for the study, and the results of the experiments revealed that the KNN-based model was not only significantly more accurate in categorizing data, but it was also significantly more efficient due to the decreased processing costs.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
网络传输标志基于k近邻的数据亲和力分类
摘要:本研究关注网络传输过程中产生的数据,了解如何从产生的数据中提取价值并能够执行任务。本文不是同时比较一个传输会话的所有传输标志来进行任何分析,而是通过逐个比较每个传输标志在传输会话期间对应用程序的影响来概念化每个传输标志对网络感知应用程序的影响,而不是同时比较所有传输标志。使用k -最近邻(KNN)类型分类是因为它是一种简单的基于距离的学习算法,可以记住早期的训练样本,并且适合通过比较每个新样本与k -最近点来确定各种标志对应用协议的影响。我们使用从Kaggle接收到的传输会话数据集来分析IP流,其中包含87个特征和3.577.296个实例。我们从数据集中挑选了13个特征,并通过KNN运行它们。使用RapidMiner进行研究,实验结果表明,基于knn的模型不仅在数据分类方面更加准确,而且由于处理成本的降低,效率也显著提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY
ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY MULTIDISCIPLINARY SCIENCES-
自引率
33.30%
发文量
33
审稿时长
16 weeks
期刊最新文献
Encryption of Color Images with a New Framework Microstrip Passive Components for Energy Harvesting and 5G Applications Optimizing Emotional Insight through Unimodal and Multimodal Long Short-term Memory Models A Review on Adverse Drug Reaction Detection Techniques Deep Learning-Based Optical Music Recognition for Semantic Representation of Non-overlap and Overlap Music Notes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1