
2017 7th International Conference on Communication Systems and Network Technologies (CSNT): Latest Articles

Holo entropy enabled decision tree classifier for breast cancer diagnosis using wisconsin (prognostic) data set
Shabina Sayed, Shoeb Ahmed, R. Poonia
Breast cancer diagnosis and prognosis are widely studied classification problems that have attracted many researchers in computational intelligence, data mining, and statistics. The objective of these predictions is to handle cases in which cancer has not recurred (censored data) as well as cases in which cancer has recurred at a specific time. The proposed study uses the Breast Cancer Wisconsin (Prognostic) Data Set for training and testing and implements a holo entropy enabled decision tree (HDT). The proposed strategy uses the training data to train the classifier, categorizes each instance of breast cancer as recurrent or non-recurrent, and evaluates the classifier to determine its accuracy. Given the continuing rise in breast cancer cases and deaths, the proposed strategy can serve as an aid in the diagnosis of breast cancer.
DOI: https://doi.org/10.1109/CSNT.2017.8418532
Published: 2017-11-01
Citations: 9
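The abstract for the HDT entry does not say how holo entropy enters the split criterion, so the sketch below only illustrates the ordinary Shannon-entropy information gain that a conventional decision tree uses to choose a threshold when separating recurrent from non-recurrent cases. The feature values and labels are invented toy data, not taken from the Wisconsin (Prognostic) data set, and the function names are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(values, labels, threshold):
    """Entropy reduction from splitting on `value <= threshold`."""
    left = [lab for v, lab in zip(values, labels) if v <= threshold]
    right = [lab for v, lab in zip(values, labels) if v > threshold]
    if not left or not right:
        return 0.0  # a split that leaves one side empty gains nothing
    weighted = (len(left) * entropy(left)
                + len(right) * entropy(right)) / len(labels)
    return entropy(labels) - weighted


# Invented toy feature (assumption): one numeric attribute per patient.
tumor_size = [1.0, 1.5, 2.0, 3.5, 4.0, 5.0]
# "N" = non-recurrent, "R" = recurrent (toy labels).
outcome = ["N", "N", "N", "R", "R", "R"]

best_threshold = max(tumor_size[:-1],
                     key=lambda t: information_gain(tumor_size, outcome, t))
```

On this toy data a threshold of 2.0 separates the two classes perfectly, so the gain equals the full one bit of label entropy. A holo entropy based tree would replace this criterion with its own measure.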
A review and analysis on data mining methods to predict diabetes
Girdhar Gopal Ladha, R. S. Pippal
According to the International Diabetes Federation, one in eleven adults worldwide had diabetes in 2015. Its report also suggests that this may rise to one in ten adults by 2040. Diabetes is dangerous because in most cases it is diagnosed only at an advanced stage. The main aim of this paper is to identify methodologies that can help with early-stage diagnosis. The paper provides a detailed study and analysis of data mining techniques, presents an in-depth analysis to find gaps, and suggests enhancements as future work. It also provides a methodological review, along with the impact of diabetes and directions for identifying its symptoms.
DOI: https://doi.org/10.1109/CSNT.2017.8418562
Published: 2017-11-01
Citations: 1
DFDM — Dynamic frequency division multiplexing
Muralidhar Reddy Challa, Bharath Simha Reddy Eedula, Gnana Pavan Bombothu, Ram Mohan Rao Kanugu
This paper presents the DFDM (Dynamic Frequency Division Multiplexing) system, a dynamic digital multi-carrier transceiver concept. The system aims to exploit the spectrum fragmentation in widely separated spectrum bands that results from guard bands provided to channels and from random spectrum allocations. The system implements dynamic spectrum exploitation of the white spaces in the UHF (Ultra High Frequency) band and also below-noise-floor communication links.
DOI: https://doi.org/10.1109/CSNT.2017.8418506
Published: 2017-11-01
Citations: 0
FPGA implementation and power efficient CORDIC based ADPLL for signal processing and application
Akarshika Singhal, Anjana Goen, Tanutrushna Mohapatra
Most electronic circuit components do not receive the clock at the same time because of various factors in the circuitry. The phase locked loop is a precise and familiar circuit for high-frequency, high-accuracy applications with very short locking time. This paper presents an All Digital Phase Locked Loop (ADPLL) and analyses it for the required applications on the basis of cost, power consumption, and speed of operation. In the given ADPLL, phase detection is realized by generating an analytic signal using the Hilbert transform and then computing the instantaneous phase using the CORDIC algorithm. The loop filter of the ADPLL is a low pass filter used to discard the higher order harmonics. The proposed architecture is implemented in VHDL and synthesized using Xilinx ISE 9.2. Its functionality is verified by simulation in Modelsim SE 6.2C. The ADPLL is designed for a 100 MHz central frequency. The work mainly deals with the power efficiency of the ADPLL.
DOI: https://doi.org/10.1109/CSNT.2017.8418560
Published: 2017-11-01
Citations: 4
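The phase-detection step described in the ADPLL abstract (instantaneous phase of an analytic signal via CORDIC) can be sketched in software. The following is a minimal floating-point illustration of vectoring-mode CORDIC, assuming the in-phase and quadrature samples are already available from a Hilbert transformer. The function name, the iteration count, and the use of floats instead of fixed-point shifts are assumptions for illustration, not details of the paper's VHDL design.

```python
import math


def cordic_phase(i_sample, q_sample, iterations=16):
    """Approximate atan2(q_sample, i_sample) in radians (vectoring mode)."""
    x, y = float(i_sample), float(q_sample)
    if x == 0.0 and y == 0.0:
        return 0.0  # phase is undefined at the origin; return 0 by convention
    angle = 0.0
    # Pre-rotate by 90 degrees so the vector lies in the right half-plane,
    # where the iteration below converges.
    if x < 0.0:
        if y >= 0.0:
            x, y, angle = y, -x, math.pi / 2    # rotated clockwise
        else:
            x, y, angle = -y, x, -math.pi / 2   # rotated counter-clockwise
    for k in range(iterations):
        step = 2.0 ** -k               # a right shift by k bits in hardware
        micro_angle = math.atan(step)  # would come from a small lookup table
        if y > 0.0:
            # Rotate clockwise to push y toward zero.
            x, y = x + y * step, y - x * step
            angle += micro_angle
        else:
            # Rotate counter-clockwise to push y toward zero.
            x, y = x - y * step, y + x * step
            angle -= micro_angle
    return angle
```

Each iteration needs only shifts, adds, and one table lookup in a hardware realization, which is why CORDIC suits FPGA phase detectors. The residual error shrinks roughly by half per iteration.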
Inventory supply & shelfing through data analytics
Komal Verma, R. Pandey, Nidhi Srivastava
Experts from different specialized fields believe that a new era of computer science is emerging. Some time ago, applications ran from the hard drive of an individual PC. Over time, applications came to be stored on an organization's local server and run from there on individual machines. The next stage in this evolution is moving both applications and data to the 'Cloud', where all data is stored on a virtual server and can be accessed from anywhere over the Internet. Microsoft Azure is Microsoft's public cloud computing platform. It offers a wide variety of cloud services, from data storage and analysis to computing and networking. Analysis through machine learning is one of the services that Azure offers. For analysis one can use languages and tools such as R, Python, and Visual Studio. R provides a programming environment widely used for graphical analysis and statistical computing.
DOI: https://doi.org/10.1109/CSNT.2017.8418500
Published: 2017-11-01
Citations: 0
A hybrid approach for human skin detection
Prateeksha Dhantre, R. Prasad, P. Saurabh, B. Verma
Human skin detection aims to locate skin regions in images. Automatic skin detection is considered significantly difficult and complex because skin images differ in content owing to variation in size, style, orientation, and alignment, coupled with differing contrast and background. This paper proposes a skin detection approach using localization, tracking, extraction, enhancement, and recognition. The approach remains sensitive to the color palette and uses an edge detection technique. The color classification box also has a strong impact on the performance of the rule. The proposed approach detects single as well as multiple persons in a picture. Promising results are obtained on a variety of pictures, except for a few in which color distinction is difficult even with the edge detection rule.
DOI: https://doi.org/10.1109/CSNT.2017.8418526
Published: 2017-11-01
Citations: 1
Image splicing detection using HMRF superpixel segmentation
K. Vamsi, Raman Chadha, B. Ramkumar, S. Prasad
As technology advances, digital forgeries are also increasing, aided by new and popular tools that are misused illegally. Moreover, applications are used for morphing or tampering with images. To locate the spliced region of an image, we identify an approach that captures the forged section easily and clearly. The approaches used are superpixel identification, the Discrete Cosine Transform, and the scale-invariant feature transform, along with kurtosis mapping. Passive (blind) forgery detection plays an important part in searching for spliced images without prior information, which improves forged-image retrieval performance and time consumption. In the proposed methodology, the control mechanism for "n" iterations is calculated with the help of a local noise variance estimation algorithm. The approach describes the splicing methodology concisely in order to expose gaps in the detection mechanism; that is, it gives information about the spliced area of a traced image for verification.
DOI: https://doi.org/10.1109/CSNT.2017.8418533
Published: 2017-11-01
Citations: 0
Implementation of SNMP-JSON translator and integrating SNMP agents with JSON based network management system
Kasula Chaithanya Pramodh, Iluri Nikhil, J. Ranjith Singh
This paper focuses on developing an SNMP-JSON translator that automatically translates the data in a MIB file written in ASN.1 to JSON. Further, we integrate the SNMP agents and managers and implement the standard SNMP operations using sockets of the TCP/IP suite in a client-server architecture. The result is an integrated Network Management System that performs better than the existing XML and HTTP based approach.
DOI: https://doi.org/10.1109/CSNT.2017.8418513
Published: 2017-11-01
Citations: 4
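To make the MIB-to-JSON idea in the SNMP-JSON entry concrete, here is a minimal sketch that extracts simple OBJECT-TYPE definitions from a MIB fragment and emits JSON. It is an assumption-laden illustration: the sample fragment is abridged and written only in the style of an SMIv2 module, the regular expression handles just this fixed clause order, and the JSON field names are hypothetical rather than the schema used in the paper. A real translator would need a full ASN.1/SMI parser.

```python
import json
import re

# Abridged, illustrative fragment in the style of an SMIv2 MIB module.
SAMPLE_MIB = """
sysDescr OBJECT-TYPE
    SYNTAX      DisplayString (SIZE (0..255))
    MAX-ACCESS  read-only
    STATUS      current
    DESCRIPTION "A textual description of the entity."
    ::= { system 1 }

sysUpTime OBJECT-TYPE
    SYNTAX      TimeTicks
    MAX-ACCESS  read-only
    STATUS      current
    DESCRIPTION "Time since the management portion
                 was last re-initialized."
    ::= { system 3 }
"""

# Matches only the simple clause order used above (an assumption).
OBJECT_TYPE = re.compile(
    r"(?P<name>\w+)\s+OBJECT-TYPE\s+"
    r"SYNTAX\s+(?P<syntax>.+?)\s+"
    r"MAX-ACCESS\s+(?P<access>[\w-]+)\s+"
    r"STATUS\s+(?P<status>\w+)\s+"
    r'DESCRIPTION\s+"(?P<description>[^"]*)"\s+'
    r"::=\s*\{\s*(?P<parent>[\w-]+)\s+(?P<index>\d+)\s*\}",
    re.DOTALL,
)


def mib_to_json(mib_text):
    """Return a JSON array describing each OBJECT-TYPE found in mib_text."""
    objects = []
    for match in OBJECT_TYPE.finditer(mib_text):
        entry = match.groupdict()
        entry["index"] = int(entry["index"])
        # Collapse line breaks and indentation inside the description.
        entry["description"] = " ".join(entry["description"].split())
        objects.append(entry)
    return json.dumps(objects, indent=2)


json_text = mib_to_json(SAMPLE_MIB)
```

Once the definitions are in JSON, a manager and agent can exchange them over an ordinary TCP socket without an XML parser on either side.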
Email spam classification using neighbor probability based Naïve Bayes algorithm
P. Anitha, C. Rao, S. Babu
Email spam is a kind of electronic spam and has become one of the more difficult problems among internet challenges. Spam mails are mostly sent for commercial purposes, and some contain malware links that lead to phishing websites. The aim of this study is to classify emails into ham and spam with an optimized and efficient classification technique. Ham comprises legitimate emails that users accept. Spam comprises unwanted emails that a user wants to get rid of. This study emphasizes improving the classification of all mails into these two groups with a minimal training requirement and an accuracy of one hundred percent. In this study, the Modified Naïve Bayes (MNB) classifier meets the requirements with a very low percentage of training data and produces more accurate results than the existing Naïve Bayes (NB) or Support Vector Machine (SVM) classifiers.
DOI: https://doi.org/10.1109/CSNT.2017.8418565
Published: 2017-11-01
Citations: 6
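The spam classification abstract does not describe the neighbor-probability modification, so the sketch below shows only the standard multinomial Naïve Bayes baseline with add-one (Laplace) smoothing that such a method would be compared against. The training messages are invented toy examples and the function names are hypothetical.

```python
import math
from collections import Counter

LABELS = ("ham", "spam")


def train(messages):
    """Build per-class word counts from (text, label) pairs."""
    word_counts = {label: Counter() for label in LABELS}
    doc_counts = Counter()
    for text, label in messages:
        doc_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocab = set(word_counts["ham"]) | set(word_counts["spam"])
    return word_counts, doc_counts, vocab


def classify(text, model):
    """Return the label with the highest log posterior."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in LABELS:
        # Assumes every label occurs at least once in the training data.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word not in vocab:
                continue  # ignore words never seen during training
            # Add-one smoothing keeps unseen class/word pairs above zero.
            score += math.log((word_counts[label][word] + 1)
                              / (total_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


# Invented toy training set (assumption, not the paper's corpus).
training = [
    ("win free prize now", "spam"),
    ("free money click now", "spam"),
    ("meeting schedule for monday", "ham"),
    ("lunch on monday with the team", "ham"),
]
model = train(training)
```

With this toy model, `classify("free prize money", model)` returns `"spam"` and `classify("team meeting monday", model)` returns `"ham"`. Results on such a tiny corpus say nothing about accuracy on real mail.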
Hellinger distance based oversampling method to solve multi-class imbalance problem
Amisha Kumari, Urjita Thakar
Classification is a popular technique used to predict group membership for data samples in datasets. Multi-class or multinomial classification is the problem of classifying instances into more than two classes. With emerging technology, the complexity of multi-class data has increased, leading to the class imbalance problem. With an imbalanced dataset, a machine learning algorithm cannot make accurate predictions. Therefore, this paper proposes a Hellinger distance based oversampling method. It balances the datasets so that the minority class can be identified with high accuracy without affecting the accuracy of the majority class. New synthetic data is generated using this method to achieve the balance ratio. Testing was done on five benchmark datasets using two standard classifiers, KNN and C4.5. Evaluation metrics for precision, recall, and F-measure are reported for the two standard classification algorithms. Hellinger distance is observed to reduce the risk of overlapping and skewness in the data. The results show a 20% increase in classification accuracy compared with classification of the imbalanced multi-class dataset.
DOI: https://doi.org/10.1109/CSNT.2017.8418525
Published: 2017-11-01
Citations: 4
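The oversampling abstract does not explain how the Hellinger distance guides synthetic sample generation, so this sketch covers only the distance itself: for two discrete distributions P and Q it is the square root of half the sum of squared differences between the square roots of their probabilities. The histogram counts are invented, and the helper names are hypothetical.

```python
import math


def normalize(counts):
    """Turn non-negative counts into a probability distribution."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must sum to a positive value")
    return [c / total for c in counts]


def hellinger(p, q):
    """Hellinger distance between two discrete distributions, in [0, 1]."""
    if len(p) != len(q):
        raise ValueError("distributions must have the same number of bins")
    squared = sum((math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(p, q))
    return math.sqrt(squared / 2.0)


# Invented histograms of one feature for a minority and a majority class.
minority_hist = normalize([8, 2, 0, 0])
majority_hist = normalize([10, 40, 30, 20])
separation = hellinger(minority_hist, majority_hist)
```

The distance is 0 for identical distributions and 1 for distributions with no overlapping bins. Because it depends on the shape of each distribution rather than on class sizes, it is commonly described as insensitive to class skew.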