首页 > 最新文献

2020 21st International Arab Conference on Information Technology (ACIT)最新文献

英文 中文
Hybrid SMS Spam Filtering System Using Machine Learning Techniques 使用机器学习技术的混合短信垃圾邮件过滤系统
Pub Date : 2020-11-28 DOI: 10.1109/ACIT50332.2020.9300071
Hind Baaqeel, Rachid Zagrouba
Due to the massive proliferation of Short Message Service (SMS), Spammers got the interest to dig their way into it in the hope to reach more targets. Spam SMS can trick mobile users into giving away their confidential information which can result in severe consequences. The seriousness of this problem has raised the need to develop an accurate Spam filtration solution. Machine learning algorithms have emerged as a great tool to classify data into labels. This description fits our case perfectly as it classifies SMS into two labels: spam or ham. This paper will tackle the SMS spam filtration solutions by introducing a hybrid system using two types of machine learning techniques: supervised & unsupervised machine learning algorithms. The new hybrid system is designed to achieve better spam filtration accuracy and F-measures
由于短消息服务(SMS)的大规模扩散,垃圾邮件发送者有兴趣挖掘他们的方式,希望达到更多的目标。垃圾短信可以欺骗手机用户泄露他们的机密信息,这可能会导致严重的后果。这个问题的严重性提高了开发精确的垃圾邮件过滤解决方案的必要性。机器学习算法已经成为将数据分类为标签的好工具。这种描述完全符合我们的情况,因为它将SMS分为两个标签:垃圾邮件或火腿。本文将通过引入一种混合系统来解决短信垃圾邮件过滤解决方案,该系统使用两种类型的机器学习技术:监督和无监督机器学习算法。新的混合系统旨在实现更好的垃圾邮件过滤精度和f -措施
{"title":"Hybrid SMS Spam Filtering System Using Machine Learning Techniques","authors":"Hind Baaqeel, Rachid Zagrouba","doi":"10.1109/ACIT50332.2020.9300071","DOIUrl":"https://doi.org/10.1109/ACIT50332.2020.9300071","url":null,"abstract":"Due to the massive proliferation of Short Message Service (SMS), Spammers got the interest to dig their way into it in the hope to reach more targets. Spam SMS can trick mobile users into giving away their confidential information which can result in severe consequences. The seriousness of this problem has raised the need to develop an accurate Spam filtration solution. Machine learning algorithms have emerged as a great tool to classify data into labels. This description fits our case perfectly as it classifies SMS into two labels: spam or ham. This paper will tackle the SMS spam filtration solutions by introducing a hybrid system using two types of machine learning techniques: supervised & unsupervised machine learning algorithms. The new hybrid system is designed to achieve better spam filtration accuracy and F-measures","PeriodicalId":193891,"journal":{"name":"2020 21st International Arab Conference on Information Technology (ACIT)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124085492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A Cluster Center Initialization Method using Hyperspace-based Multi-level Thresholding (HMLT): Application to Color Ancient Document Image Denoising 基于多尺度阈值的聚类中心初始化方法在彩色古文档图像去噪中的应用
Pub Date : 2020-11-28 DOI: 10.1109/ACIT50332.2020.9300075
Walid Elhedda, Maroua Mehri, M. Mahjoub
Many iterative supervised clustering algorithms such as K-means and its derivatives depend closely on the initial cluster center positions. In order to overcome the convergence problems inherent in the clustering algorithms (i.e., local optimum), and subsequently to avoid a drop in clustering performance, many researchers continue to propose novel efficient methods able to determine automatically the optimal cluster centers. Therefore, in this paper, we propose a simple and efficient cluster center initialization method, called hyperspace-based multi-level thresholding (HMLT). The proposed HMLT method is based on using a novel multi-level thresholding approach on the multi-dimensional representation of color images (called hyperspace). In order to show the high performance of the HMLT method, experiments have been conducted using a recent clustering method, called the hyperkernel-based intuitionistic fuzzy c-means (HKIFCM), and after initializing the cluster center positions randomly and by means of the HMLT method. The HKIFCM clustering method that its performance tightly depends on the cluster center initialization, is applied for color ancient document image denoising (i.e., separate noise from text and background). Qualitative and quantitative assessments of results are deduced from a number of ancient document images collected from two different datasets.
许多迭代监督聚类算法,如K-means及其导数,密切依赖于初始聚类中心位置。为了克服聚类算法固有的收敛性问题(即局部最优),从而避免聚类性能下降,许多研究人员不断提出能够自动确定最优聚类中心的新颖高效方法。因此,本文提出了一种简单高效的聚类中心初始化方法,称为基于超空间的多级阈值(HMLT)。提出的HMLT方法是基于对彩色图像的多维表示(称为超空间)使用一种新的多层次阈值方法。为了证明HMLT方法的高性能,在随机初始化聚类中心位置后,利用HMLT方法,利用一种最新的聚类方法——基于超核的直觉模糊c-均值(HKIFCM)进行了实验。将性能与聚类中心初始化密切相关的hifcm聚类方法应用于彩色古文档图像去噪(即将噪声与文本和背景分离)。定性和定量评估的结果是从从两个不同的数据集收集的一些古代文献图像推断出来的。
{"title":"A Cluster Center Initialization Method using Hyperspace-based Multi-level Thresholding (HMLT): Application to Color Ancient Document Image Denoising","authors":"Walid Elhedda, Maroua Mehri, M. Mahjoub","doi":"10.1109/ACIT50332.2020.9300075","DOIUrl":"https://doi.org/10.1109/ACIT50332.2020.9300075","url":null,"abstract":"Many iterative supervised clustering algorithms such as K-means and its derivatives depend closely on the initial cluster center positions. In order to overcome the convergence problems inherent in the clustering algorithms (i.e., local optimum), and subsequently to avoid a drop in clustering performance, many researchers continue to propose novel efficient methods able to determine automatically the optimal cluster centers. Therefore, in this paper, we propose a simple and efficient cluster center initialization method, called hyperspace-based multi-level thresholding (HMLT). The proposed HMLT method is based on using a novel multi-level thresholding approach on the multi-dimensional representation of color images (called hyperspace). In order to show the high performance of the HMLT method, experiments have been conducted using a recent clustering method, called the hyperkernel-based intuitionistic fuzzy c-means (HKIFCM), and after initializing the cluster center positions randomly and by means of the HMLT method. The HKIFCM clustering method that its performance tightly depends on the cluster center initialization, is applied for color ancient document image denoising (i.e., separate noise from text and background). Qualitative and quantitative assessments of results are deduced from a number of ancient document images collected from two different datasets.","PeriodicalId":193891,"journal":{"name":"2020 21st International Arab Conference on Information Technology (ACIT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124830460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Longest Common Subsequence based Multistage Collaborative Filtering for Recommender Systems 基于最长公共子序列的多阶段协同过滤推荐系统
Pub Date : 2020-11-28 DOI: 10.1109/ACIT50332.2020.9300068
Dilip Singh Sisodia, Inakollu NehaPriyanka, P. Amulya
The contemporary recommender systems are facing challenges such as noise in user's choice, scalability, cold-start problem, availability of ample choices and handling of sparse data sets. In this paper, a multistage collaborative filtering is proposed to address the issues of noise in user's choice and ample choices availability. The two-stage filtering at first stage, filtering is performed using Pearson coefficient as a similarity measure and in the second stage, the longest common subsequence (LCS) is used to do filtering. The experiments are performed using benchmark 100k movielense datasets. The performance of multistage collaborative filtering is evaluated using accuracy, precision, recall, and f-measure. The results are also compared with single stage filtering and performance of multistage collaborative filtering is significantly improved over the used datasets.
当前的推荐系统面临着用户选择噪声、可扩展性、冷启动问题、充足选择的可用性和稀疏数据集处理等挑战。本文提出了一种多阶段协同过滤方法,以解决用户选择噪声和选择可用性不足的问题。第一阶段采用两阶段滤波,采用Pearson系数作为相似度度量进行滤波,第二阶段采用LCS进行滤波。实验是使用基准100k电影数据集进行的。用准确度、精密度、召回率和f-measure来评价多级协同过滤的性能。结果还与单阶段滤波进行了比较,结果表明,在使用的数据集上,多阶段协同滤波的性能显著提高。
{"title":"Longest Common Subsequence based Multistage Collaborative Filtering for Recommender Systems","authors":"Dilip Singh Sisodia, Inakollu NehaPriyanka, P. Amulya","doi":"10.1109/ACIT50332.2020.9300068","DOIUrl":"https://doi.org/10.1109/ACIT50332.2020.9300068","url":null,"abstract":"The contemporary recommender systems are facing challenges such as noise in user's choice, scalability, cold-start problem, availability of ample choices and handling of sparse data sets. In this paper, a multistage collaborative filtering is proposed to address the issues of noise in user's choice and ample choices availability. The two-stage filtering at first stage, filtering is performed using Pearson coefficient as a similarity measure and in the second stage, the longest common subsequence (LCS) is used to do filtering. The experiments are performed using benchmark 100k movielense datasets. The performance of multistage collaborative filtering is evaluated using accuracy, precision, recall, and f-measure. The results are also compared with single stage filtering and performance of multistage collaborative filtering is significantly improved over the used datasets.","PeriodicalId":193891,"journal":{"name":"2020 21st International Arab Conference on Information Technology (ACIT)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128968678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On Detecting Online Radicalization and Extremism Using Natural Language Processing 利用自然语言处理技术检测网络激进和极端主义
Pub Date : 2020-11-28 DOI: 10.1109/ACIT50332.2020.9300086
Shynar Mussiraliyeva, M. Bolatbek, B. Omarov, Zhanar Medetbek, G. Baispay, R. Ospanov
Due to the activity of terrorist propaganda on the Internet and social networks, as well as given the high dynamics of the emergence of new sites and accounts of extremist orientation, it is important to quickly detect content that demonstrates a tendency to extremism in the prevention of extremist and terrorist activities. This article is intended to explore the possibilities of automatic recognition of extremist content using machine learning from this point of view. This article is devoted to the application of machine learning methods for solving the problem of security, in part-countering terrorism and extremism using information from the Internet.
由于恐怖主义在互联网和社交网络上的宣传活动,以及极端主义取向的新网站和账户的出现的高度动态,在预防极端主义和恐怖活动中,快速检测显示极端主义倾向的内容非常重要。本文旨在从这个角度探讨使用机器学习自动识别极端主义内容的可能性。本文致力于应用机器学习方法解决安全问题,部分是利用互联网信息打击恐怖主义和极端主义。
{"title":"On Detecting Online Radicalization and Extremism Using Natural Language Processing","authors":"Shynar Mussiraliyeva, M. Bolatbek, B. Omarov, Zhanar Medetbek, G. Baispay, R. Ospanov","doi":"10.1109/ACIT50332.2020.9300086","DOIUrl":"https://doi.org/10.1109/ACIT50332.2020.9300086","url":null,"abstract":"Due to the activity of terrorist propaganda on the Internet and social networks, as well as given the high dynamics of the emergence of new sites and accounts of extremist orientation, it is important to quickly detect content that demonstrates a tendency to extremism in the prevention of extremist and terrorist activities. This article is intended to explore the possibilities of automatic recognition of extremist content using machine learning from this point of view. This article is devoted to the application of machine learning methods for solving the problem of security, in part-countering terrorism and extremism using information from the Internet.","PeriodicalId":193891,"journal":{"name":"2020 21st International Arab Conference on Information Technology (ACIT)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127140150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A New Approach in Islamic Learning: Performance Evaluation of Motion Recognition System for Salat Movement 伊斯兰学习的新途径:Salat动作识别系统的性能评价
Pub Date : 2020-11-28 DOI: 10.1109/ACIT50332.2020.9300063
N. Jaafar, N. A. Ismail, Yusman Azimi Yusoff
In Muslim life, there is an important ritual that they need to do in their daily lives, a prayer known as salat. There was evidence that showed performing salat correctly is good for better health. This paper developed a motion recognition system for salat movement using a cooperative multisensor approach based on salat law. Existing work in this related field could recognize a few salat movements; however, they could not cover salat movements based on salat law by using a single camera. This paper presents the motion recognition system's usability study, named SalatLab using a cooperative multisensor approach. In this paper, the user's error rate was evaluated by the SalatLab prototype and tutor-based method to test if the proposed system can bridge the gap in assessing user error. The experiment was conducted to evaluate the user error rate in salat activity by comparing it with the traditional tutor-based methodology. Success scores in recognizing salat movement and user acceptance of the proposed prototype have also been evaluated. The results show a significant difference in error rate and success score assessed by the proposed system and tutor-based. However, the proposed prototype was accepted by the user and received good feedback.
在穆斯林的生活中,有一个重要的仪式是他们在日常生活中需要做的,一个被称为salat的祈祷。有证据表明,正确地锻炼身体有利于健康。采用基于salat定律的多传感器协同方法开发了一种salat运动识别系统。在这一有关领域的现有工作可以认识到一些薪金变动;但是,他们无法使用单一摄像机来拍摄基于salat法的salat运动。本文介绍了运动识别系统的可用性研究,称为SalatLab,采用多传感器合作方法。在本文中,通过SalatLab原型和基于导师的方法来评估用户的错误率,以测试所提出的系统是否可以弥补评估用户错误的差距。通过与传统的基于导师的方法进行比较,实验评估了用户在salat活动中的错误率。在识别salat运动和用户接受提出的原型成功得分也进行了评估。结果表明,该系统与基于导师的系统在错误率和成功率评分方面存在显著差异。然而,提出的原型被用户接受并得到了很好的反馈。
{"title":"A New Approach in Islamic Learning: Performance Evaluation of Motion Recognition System for Salat Movement","authors":"N. Jaafar, N. A. Ismail, Yusman Azimi Yusoff","doi":"10.1109/ACIT50332.2020.9300063","DOIUrl":"https://doi.org/10.1109/ACIT50332.2020.9300063","url":null,"abstract":"In Muslim life, there is an important ritual that they need to do in their daily lives, a prayer known as salat. There was evidence that showed performing salat correctly is good for better health. This paper developed a motion recognition system for salat movement using a cooperative multisensor approach based on salat law. Existing work in this related field could recognize a few salat movements; however, they could not cover salat movements based on salat law by using a single camera. This paper presents the motion recognition system's usability study, named SalatLab using a cooperative multisensor approach. In this paper, the user's error rate was evaluated by the SalatLab prototype and tutor-based method to test if the proposed system can bridge the gap in assessing user error. The experiment was conducted to evaluate the user error rate in salat activity by comparing it with the traditional tutor-based methodology. Success scores in recognizing salat movement and user acceptance of the proposed prototype have also been evaluated. The results show a significant difference in error rate and success score assessed by the proposed system and tutor-based. However, the proposed prototype was accepted by the user and received good feedback.","PeriodicalId":193891,"journal":{"name":"2020 21st International Arab Conference on Information Technology (ACIT)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127279750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Modified Clustering Algorithms for Energy Harvesting Wireless Sensor Networks- A Survey 能量收集无线传感器网络的改进聚类算法综述
Pub Date : 2020-11-28 DOI: 10.1109/ACIT50332.2020.9300078
Deivanai Gurusamy, Sadik Abas
Wireless Sensor Network (WSN) has evolved as a key technology and has brought about a few changes in how the network is being adopted for modern applications. Despite the considerable growth and success, the limitation in the energy of sensor nodes persists. Researches have been rising to mitigate energy consumption by various energy management approaches, and the most efficient among them is clustering. Clustering groups the nodes according to their similarities or other parameters and helps to utilize the energy efficiently. However, it cannot wholly reduce energy consumption, so the WSN now adapts the advancement of energy harvesting (EH) and takes the new shape called EH-WSN. Energy harvesting converts the ambient energy into electrical energy and power the sensor nodes. By exploiting the recharging capabilities of this new technique, the network's lifetime is significantly increased. The performance enhancement is also crucial; hence, the energy harvesting nodes are clustered, and efficient clustering algorithms are being developed. This paper presents the clustering algorithms that have been developed newly and revised for EH-WSN. The parameters considered and that differ from the traditional clustering are analyzed. Further, the comparison among those approaches has facilitated the paper to bring out the challenges and future research directions in the clustering algorithms for EH-WSN.
无线传感器网络(WSN)已经发展成为一项关键技术,并为网络在现代应用中的应用带来了一些变化。尽管取得了长足的发展和成功,但传感器节点能量的限制仍然存在。通过各种能源管理方法来降低能源消耗的研究已经兴起,其中最有效的是聚类。聚类是根据节点的相似度或其他参数对节点进行分组,有助于有效地利用能量。但是,它不能完全降低能耗,因此,现在的WSN适应了能量收集(EH)技术的进步,采用了新的形态EH-WSN。能量收集将环境能量转化为电能,并为传感器节点供电。通过利用这种新技术的充电能力,网络的寿命显着增加。性能提升也至关重要;因此,能量收集节点被聚类,并且高效的聚类算法正在被开发。本文介绍了针对EH-WSN新开发和改进的聚类算法。分析了该方法所考虑的参数和与传统聚类方法的不同之处。通过对这些方法的比较,提出了EH-WSN聚类算法面临的挑战和未来的研究方向。
{"title":"Modified Clustering Algorithms for Energy Harvesting Wireless Sensor Networks- A Survey","authors":"Deivanai Gurusamy, Sadik Abas","doi":"10.1109/ACIT50332.2020.9300078","DOIUrl":"https://doi.org/10.1109/ACIT50332.2020.9300078","url":null,"abstract":"Wireless Sensor Network (WSN) has evolved as a key technology and has brought about a few changes in how the network is being adopted for modern applications. Despite the considerable growth and success, the limitation in the energy of sensor nodes persists. Researches have been rising to mitigate energy consumption by various energy management approaches, and the most efficient among them is clustering. Clustering groups the nodes according to their similarities or other parameters and helps to utilize the energy efficiently. However, it cannot wholly reduce energy consumption, so the WSN now adapts the advancement of energy harvesting (EH) and takes the new shape called EH-WSN. Energy harvesting converts the ambient energy into electrical energy and power the sensor nodes. By exploiting the recharging capabilities of this new technique, the network's lifetime is significantly increased. The performance enhancement is also crucial; hence, the energy harvesting nodes are clustered, and efficient clustering algorithms are being developed. This paper presents the clustering algorithms that have been developed newly and revised for EH-WSN. The parameters considered and that differ from the traditional clustering are analyzed. Further, the comparison among those approaches has facilitated the paper to bring out the challenges and future research directions in the clustering algorithms for EH-WSN.","PeriodicalId":193891,"journal":{"name":"2020 21st International Arab Conference on Information Technology (ACIT)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127583560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Hybrid Model for Anomaly-Based Intrusion Detection in Complex Computer Networks 复杂计算机网络中基于异常的入侵检测混合模型
Pub Date : 2020-11-28 DOI: 10.1109/ACIT50332.2020.9299965
D. Protić, M. Stankovic
Anomaly-based intrusion detection classifiers detect the notion of normality and classify both intrusion and/or misuse as either 'normal' or 'anomaly'. In complex computer networks, the number of the training records is often large which makes the evaluation of the classifiers computationally expensive. In this paper we present a feature selection and instances normalization algorithm that reduces the dimensionality of the dataset size, decrease processing time and increase accuracy of two classifier models, namely weighted k-Nearest Neighbor (wk-NN) and Feedforward Neural Network (FNN). The experiments are conducted on three daily records of the real computer network traffic data derived from the Kyoto 2006+ dataset. The results show high accuracy of both wk-NN and FNN classifiers but variations in mutual decisions on detected anomalies. Variations are determined with the novel hybrid model by performing logical exclusive or operation to the predicted outcomes. Improvement in the anomaly detection ranges from 0.67% to 8.08%.
基于异常的入侵检测分类器检测正常的概念,并将入侵和/或滥用分类为“正常”或“异常”。在复杂的计算机网络中,训练记录的数量往往很大,这使得分类器的评估计算成本很高。在本文中,我们提出了一种特征选择和实例归一化算法,该算法降低了数据集大小的维数,减少了处理时间,提高了两种分类器模型的精度,即加权k-近邻(wk-NN)和前馈神经网络(FNN)。实验采用京都2006+数据集的3个真实计算机网络流量日记录进行。结果表明,wk-NN和FNN分类器的准确率都很高,但在对检测到的异常的相互决策上存在差异。通过对预测结果执行逻辑排他或运算,利用新型混合模型确定变量。异常检测的改进幅度为0.67% ~ 8.08%。
{"title":"A Hybrid Model for Anomaly-Based Intrusion Detection in Complex Computer Networks","authors":"D. Protić, M. Stankovic","doi":"10.1109/ACIT50332.2020.9299965","DOIUrl":"https://doi.org/10.1109/ACIT50332.2020.9299965","url":null,"abstract":"Anomaly-based intrusion detection classifiers detect the notion of normality and classify both intrusion and/or misuse as either 'normal' or 'anomaly'. In complex computer networks, the number of the training records is often large which makes the evaluation of the classifiers computationally expensive. In this paper we present a feature selection and instances normalization algorithm that reduces the dimensionality of the dataset size, decrease processing time and increase accuracy of two classifier models, namely weighted k-Nearest Neighbor (wk-NN) and Feedforward Neural Network (FNN). The experiments are conducted on three daily records of the real computer network traffic data derived from the Kyoto 2006+ dataset. The results show high accuracy of both wk-NN and FNN classifiers but variations in mutual decisions on detected anomalies. Variations are determined with the novel hybrid model by performing logical exclusive or operation to the predicted outcomes. Improvement in the anomaly detection ranges from 0.67% to 8.08%.","PeriodicalId":193891,"journal":{"name":"2020 21st International Arab Conference on Information Technology (ACIT)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126254158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
House Price Prediction using Machine Learning Algorithm - The Case of Karachi City, Pakistan 使用机器学习算法预测房价-以巴基斯坦卡拉奇市为例
Pub Date : 2020-11-28 DOI: 10.1109/ACIT50332.2020.9300074
Maida Ahtesham, N. Bawany, Kiran Fatima
House prices are a significant impression of the economy, and its value ranges are of great concerns for the clients and property dealers. Housing price escalate every year that eventually reinforced the need of strategy or technique that could predict house prices in future. There are certain factors that influence house prices including physical conditions, locations, number of bedrooms and others. Traditionally predictions are made on the basis of these factors. However such prediction methods require an appropriate knowledge and experience regarding this domain. Machine Learning techniques have been a significant source of advanced opportunities to analyze, predict and visualize housing prices. In this paper, Gradient Boosting Model XGBoost is utilized to predict housing prices. Publicly available dataset containing 38,961 records of Karachi city is attained from an Open Real Estate Portal of Pakistan. Lot of work has been done in predicting house prices across many countries, however very limited amount of work has been done for predicting house prices in Pakistan. Our proposed house price prediction model is able to predict 98% accuracy.
房价是经济的一个重要标志,其价值范围是客户和房地产经纪人非常关注的问题。房价每年都在上涨,这最终加强了对预测未来房价的策略或技术的需求。有一些因素会影响房价,包括物理条件、位置、卧室数量等。传统上,预测是基于这些因素做出的。然而,这样的预测方法需要在这个领域有适当的知识和经验。机器学习技术已经成为分析、预测和可视化房价的重要机会来源。本文采用梯度提升模型XGBoost对房价进行预测。从巴基斯坦开放房地产门户网站获得了包含38,961条卡拉奇市记录的公开数据集。在预测许多国家的房价方面已经做了很多工作,但是在预测巴基斯坦的房价方面所做的工作非常有限。我们提出的房价预测模型预测准确率为98%。
{"title":"House Price Prediction using Machine Learning Algorithm - The Case of Karachi City, Pakistan","authors":"Maida Ahtesham, N. Bawany, Kiran Fatima","doi":"10.1109/ACIT50332.2020.9300074","DOIUrl":"https://doi.org/10.1109/ACIT50332.2020.9300074","url":null,"abstract":"House prices are a significant impression of the economy, and its value ranges are of great concerns for the clients and property dealers. Housing price escalate every year that eventually reinforced the need of strategy or technique that could predict house prices in future. There are certain factors that influence house prices including physical conditions, locations, number of bedrooms and others. Traditionally predictions are made on the basis of these factors. However such prediction methods require an appropriate knowledge and experience regarding this domain. Machine Learning techniques have been a significant source of advanced opportunities to analyze, predict and visualize housing prices. In this paper, Gradient Boosting Model XGBoost is utilized to predict housing prices. Publicly available dataset containing 38,961 records of Karachi city is attained from an Open Real Estate Portal of Pakistan. Lot of work has been done in predicting house prices across many countries, however very limited amount of work has been done for predicting house prices in Pakistan. Our proposed house price prediction model is able to predict 98% accuracy.","PeriodicalId":193891,"journal":{"name":"2020 21st International Arab Conference on Information Technology (ACIT)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126842407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Copyright 版权
Pub Date : 2020-11-28 DOI: 10.1109/acit50332.2020.9300076
{"title":"Copyright","authors":"","doi":"10.1109/acit50332.2020.9300076","DOIUrl":"https://doi.org/10.1109/acit50332.2020.9300076","url":null,"abstract":"","PeriodicalId":193891,"journal":{"name":"2020 21st International Arab Conference on Information Technology (ACIT)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121522698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2020 21st International Arab Conference on Information Technology (ACIT)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1