首页 > 最新文献

Kafrelsheikh Journal of Information Sciences最新文献

英文 中文
Anemia Diagnosis And Prediction Based On Machine Learning 基于机器学习的贫血诊断和预测
Pub Date : 2023-11-01 DOI: 10.21608/kjis.2023.220945.1014
Sara shehab, Eman Shehab, AbdulRahman Khawaga
The extraordinary developments in the health sector have resulted in the substantial production of data in daily life. To get valuable information out of this data—infor-mation that can be used for analysis, forecasting, making suggestions, and making decisions—it must be processed. Accessible data is converted into useful information using data mining and machine learning approaches. The first challenge for medical practitioners in developing a preventative strategy and successful treatment plan is the timely diagnosis of diseases. Sometimes, this can result in death if accuracy is lacking. In this study, we examine supervised machine learning methods (Decision Tree, Multilayer Perceptron “MLP”, K-nearest neighbors “ KNN”, Logistic Regression, Random Forest, and Support Vector Machine “SVC”) for anemia prediction utilizing CBC (Complete Blood Count) data gathered from pathology labs. The outcomes demonstrate that the Random Forest, Multilayer Perceptron “MLP”, Decision Tree, and Logistic Regression techniques outperform KNN and SVC in terms of accuracy of 99.94%.
卫生领域的飞速发展导致日常生活中产生了大量数据。要从这些数据中获取有价值的信息--可用于分析、预测、建议和决策的信息--就必须对其进行处理。利用数据挖掘和机器学习方法,可以将可获取的数据转化为有用的信息。在制定预防战略和成功的治疗计划时,医疗从业人员面临的首要挑战是及时诊断疾病。如果缺乏准确性,有时会导致死亡。在本研究中,我们利用从病理实验室收集的 CBC(全血细胞计数)数据,研究了用于贫血预测的监督机器学习方法(决策树、多层感知器 "MLP"、K-近邻 "KNN"、逻辑回归、随机森林和支持向量机 "SVC")。结果表明,随机森林、多层感知器 "MLP"、决策树和逻辑回归技术的准确率高于 KNN 和 SVC,达到 99.94%。
{"title":"Anemia Diagnosis And Prediction Based On Machine Learning","authors":"Sara shehab, Eman Shehab, AbdulRahman Khawaga","doi":"10.21608/kjis.2023.220945.1014","DOIUrl":"https://doi.org/10.21608/kjis.2023.220945.1014","url":null,"abstract":"The extraordinary developments in the health sector have resulted in the substantial production of data in daily life. To get valuable information out of this data—infor-mation that can be used for analysis, forecasting, making suggestions, and making decisions—it must be processed. Accessible data is converted into useful information using data mining and machine learning approaches. The first challenge for medical practitioners in developing a preventative strategy and successful treatment plan is the timely diagnosis of diseases. Sometimes, this can result in death if accuracy is lacking. In this study, we examine supervised machine learning methods (Decision Tree, Multilayer Perceptron “MLP”, K-nearest neighbors “ KNN”, Logistic Regression, Random Forest, and Support Vector Machine “SVC”) for anemia prediction utilizing CBC (Complete Blood Count) data gathered from pathology labs. The outcomes demonstrate that the Random Forest, Multilayer Perceptron “MLP”, Decision Tree, and Logistic Regression techniques outperform KNN and SVC in terms of accuracy of 99.94%.","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"59 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139292136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chronic Kidney Disease Classification Using ML Algorithms 利用多重算法进行慢性肾病分类
Pub Date : 2023-11-01 DOI: 10.21608/kjis.2023.220954.1015
Sara shehab, Eman Shehab, aya morsi
Chronic kidney failure is one of the most common diseases that threaten the lives of many people and cause death for many. By using artificial intelligence, we predict the disease and classify people into infected and non-infected people. One of the goals is to reduce non-communicable disease-related premature death by a third by 2030. 10-15% of the world's population may have chronic kidney disease (CKD), which is one of the major causes of non-communicable disease morbidity and mortality. In order to reduce the effects of patient health complications like hypertension, anaemia (low blood count), mineral bone disorder, poor nutritional health, acid base abnormalities, and neurological complications with timely intervention through appropriate medications, early and accurate detection of the stages of CKD is thought to be essential. Several studies on the early identification of CKD have been conducted utilising machine learning approaches. They weren't primarily concerned with predicting the exact stages. In this work classification methods are used like support vector classifier, random forest, logistic regression, and decision tree. The results detect that Linear SVC Support Vector Machine achieved high accuracy and Random Forest and Decision tree (100%) and logistic regression achieved (96.8%). A data set with 24 feature and 401 records are used for testing the algorithms. 20% of data set will be used in testing and 80% for training. The proposed work achieves high accuracy when compared with the previous works.
慢性肾衰竭是最常见的疾病之一,威胁着许多人的生命,并导致许多人死亡。通过使用人工智能,我们可以预测疾病,并将人分为感染者和非感染者。我们的目标之一是到 2030 年将与非传染性疾病相关的过早死亡人数减少三分之一。全球 10-15% 的人口可能患有慢性肾脏病(CKD),这是导致非传染性疾病发病率和死亡率的主要原因之一。为了减少高血压、贫血(低血细胞计数)、矿物质骨骼紊乱、营养不良、酸碱异常和神经系统并发症等并发症对患者健康的影响,通过适当的药物进行及时干预至关重要。利用机器学习方法对早期识别慢性肾功能衰竭进行了多项研究。这些研究主要关注的不是预测准确的阶段。本研究采用了支持向量分类器、随机森林、逻辑回归和决策树等分类方法。结果发现,线性 SVC 支持向量机的准确率很高,随机森林和决策树的准确率为 100%,逻辑回归的准确率为 96.8%。有 24 个特征和 401 条记录的数据集用于测试算法。数据集的 20% 用于测试,80% 用于训练。与之前的工作相比,建议的工作实现了较高的准确率。
{"title":"Chronic Kidney Disease Classification Using ML Algorithms","authors":"Sara shehab, Eman Shehab, aya morsi","doi":"10.21608/kjis.2023.220954.1015","DOIUrl":"https://doi.org/10.21608/kjis.2023.220954.1015","url":null,"abstract":"Chronic kidney failure is one of the most common diseases that threaten the lives of many people and cause death for many. By using artificial intelligence, we predict the disease and classify people into infected and non-infected people. One of the goals is to reduce non-communicable disease-related premature death by a third by 2030. 10-15% of the world's population may have chronic kidney disease (CKD), which is one of the major causes of non-communicable disease morbidity and mortality. In order to reduce the effects of patient health complications like hypertension, anaemia (low blood count), mineral bone disorder, poor nutritional health, acid base abnormalities, and neurological complications with timely intervention through appropriate medications, early and accurate detection of the stages of CKD is thought to be essential. Several studies on the early identification of CKD have been conducted utilising machine learning approaches. They weren't primarily concerned with predicting the exact stages. In this work classification methods are used like support vector classifier, random forest, logistic regression, and decision tree. The results detect that Linear SVC Support Vector Machine achieved high accuracy and Random Forest and Decision tree (100%) and logistic regression achieved (96.8%). A data set with 24 feature and 401 records are used for testing the algorithms. 20% of data set will be used in testing and 80% for training. The proposed work achieves high accuracy when compared with the previous works.","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"167 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139294810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cost-Efficient Method for Detecting and Mitigating DDOS Attacks in SDN Based Networks 在基于 SDN 的网络中检测和缓解 DDOS 攻击的经济高效方法
Pub Date : 2023-11-01 DOI: 10.21608/kjis.2023.251235.1018
Alaa Allakany
Software-defined networks (SDN) provide a centralized administration programming interface for managing the network infrastructure. This new approach replaced traditional networks by establishing a flexible connection between the control and data planes, managing network operations through a centralized controller. As a result, prioritizing the security of the SDN controller becomes imperative in SDN networks. In the recent wave of distributed denial-of-service (DDoS) attacks, attackers have shifted their strategy from directly targeting the SDN controller to concentrating on specific links or area, causing disruptions in connectivity. This attack, known as Link-flooding attack (LFA), represent a novel form of DDoS attack. LFA targets the SDN control channel, which transmits control traffic from the SDN controller to switches, taking advantage of shared links in both control and data traffic paths. This sharing exposes a vulnerability that attackers can exploit to disrupt the control channel, using malicious data traffic to execute LFA. Considering the control channel's responsibility for granting centralized control to the controller over each network switch, it becomes relatively easy for an attacker to compromise all network functions. To handle this problem, in this paper, we develop a novel approach based on SDN designed for security solutions against DDoS and LFA. Our proposed scheme utilizes hop-by-hop network measurement to identify and capture abnormal link performance, enabling effective detection of such attacks. Subsequently, a Machine Learning (ML) model is employed to determine whether the congested links indicate the presence of such attacks. Unlike conventional approaches in the literature that solely rely on automatic ML models, our method begins by measuring congestion in each link. If abnormalities are detected, the ML model is then executed to identify whether it is an attack or not. By adopting this approach, we achieve optimized utilization of controller resources. Our proposed scheme will be implemented as an application at the application layer of the Ryu controller. Through our evaluation, we have demonstrated that this approach can efficiently optimize the process of measuring link performance, optimizing the utilization of SDN controller resources, and detecting DDoS and LFA.
软件定义网络(SDN)为管理网络基础设施提供了一个集中管理编程界面。这种新方法在控制平面和数据平面之间建立了灵活的连接,通过集中式控制器管理网络运行,从而取代了传统网络。因此,SDN 网络必须优先考虑 SDN 控制器的安全性。在最近的分布式拒绝服务(DDoS)攻击浪潮中,攻击者已将策略从直接针对 SDN 控制器转向集中攻击特定链路或区域,从而造成连接中断。这种攻击被称为链路泛洪攻击(LFA),是一种新型的 DDoS 攻击。LFA 针对的是 SDN 控制通道,该通道利用控制和数据流量路径中的共享链路,将控制流量从 SDN 控制器传输到交换机。这种共享暴露了一个漏洞,攻击者可以利用这个漏洞破坏控制通道,使用恶意数据流量执行 LFA。考虑到控制通道负责向控制器授予对每个网络交换机的集中控制,攻击者相对容易入侵所有网络功能。为解决这一问题,我们在本文中开发了一种基于 SDN 的新方法,旨在提供针对 DDoS 和 LFA 的安全解决方案。我们提出的方案利用逐跳网络测量来识别和捕获异常链路性能,从而有效检测此类攻击。随后,采用机器学习(ML)模型来确定拥塞链路是否表明存在此类攻击。与文献中仅依赖自动 ML 模型的传统方法不同,我们的方法首先测量每个链路的拥塞情况。如果检测到异常,则执行 ML 模型来确定是否是攻击。通过采用这种方法,我们实现了控制器资源的优化利用。我们提出的方案将作为 Ryu 控制器应用层的一个应用程序来实施。通过评估,我们证明这种方法可以有效优化测量链路性能、优化 SDN 控制器资源利用以及检测 DDoS 和 LFA 的过程。
{"title":"Cost-Efficient Method for Detecting and Mitigating DDOS Attacks in SDN Based Networks","authors":"Alaa Allakany","doi":"10.21608/kjis.2023.251235.1018","DOIUrl":"https://doi.org/10.21608/kjis.2023.251235.1018","url":null,"abstract":"Software-defined networks (SDN) provide a centralized administration programming interface for managing the network infrastructure. This new approach replaced traditional networks by establishing a flexible connection between the control and data planes, managing network operations through a centralized controller. As a result, prioritizing the security of the SDN controller becomes imperative in SDN networks. In the recent wave of distributed denial-of-service (DDoS) attacks, attackers have shifted their strategy from directly targeting the SDN controller to concentrating on specific links or area, causing disruptions in connectivity. This attack, known as Link-flooding attack (LFA), represent a novel form of DDoS attack. LFA targets the SDN control channel, which transmits control traffic from the SDN controller to switches, taking advantage of shared links in both control and data traffic paths. This sharing exposes a vulnerability that attackers can exploit to disrupt the control channel, using malicious data traffic to execute LFA. Considering the control channel's responsibility for granting centralized control to the controller over each network switch, it becomes relatively easy for an attacker to compromise all network functions. To handle this problem, in this paper, we develop a novel approach based on SDN designed for security solutions against DDoS and LFA. Our proposed scheme utilizes hop-by-hop network measurement to identify and capture abnormal link performance, enabling effective detection of such attacks. Subsequently, a Machine Learning (ML) model is employed to determine whether the congested links indicate the presence of such attacks. Unlike conventional approaches in the literature that solely rely on automatic ML models, our method begins by measuring congestion in each link. If abnormalities are detected, the ML model is then executed to identify whether it is an attack or not. By adopting this approach, we achieve optimized utilization of controller resources. Our proposed scheme will be implemented as an application at the application layer of the Ryu controller. Through our evaluation, we have demonstrated that this approach can efficiently optimize the process of measuring link performance, optimizing the utilization of SDN controller resources, and detecting DDoS and LFA.","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139300986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decision Making in an Information System Via Pawlak’s Rough Approximation 通过帕夫拉克粗略近似法在信息系统中进行决策
Pub Date : 2023-11-01 DOI: 10.21608/kjis.2023.250400.1017
Mahmoud nasef
The original rough set model was based on a special kind of topological structure whose partition resulted from an equivalence relation. We have shown that real-world problems can be dealt with using the modern topological structure induced by Pawlak’s rough approximation. In this research, actual information was collected for some patients in hospitals, health centers, isolation centers and some symptoms were recorded through “ the World Health Organization” website enabled us analyze their data. By establishing an information system in which data can be analyzed using rough topology in order to draw conclusion about the most important symptoms in disease conPirmation.
最初的粗糙集模型基于一种特殊的拓扑结构,其分区由等价关系产生。我们已经证明,使用帕夫拉克粗糙近似所诱导的现代拓扑结构可以处理现实世界中的问题。在这项研究中,我们在医院、保健中心和隔离中心收集了一些病人的实际信息,并通过 "世界卫生组织 "网站记录了一些症状,从而分析了他们的数据。通过建立一个信息系统,利用粗糙拓扑对数据进行分析,从而得出疾病确认中最重要症状的结论。
{"title":"Decision Making in an Information System Via Pawlak’s Rough Approximation","authors":"Mahmoud nasef","doi":"10.21608/kjis.2023.250400.1017","DOIUrl":"https://doi.org/10.21608/kjis.2023.250400.1017","url":null,"abstract":"The original rough set model was based on a special kind of topological structure whose partition resulted from an equivalence relation. We have shown that real-world problems can be dealt with using the modern topological structure induced by Pawlak’s rough approximation. In this research, actual information was collected for some patients in hospitals, health centers, isolation centers and some symptoms were recorded through “ the World Health Organization” website enabled us analyze their data. By establishing an information system in which data can be analyzed using rough topology in order to draw conclusion about the most important symptoms in disease conPirmation.","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139301671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The classification of mushroom using ML 使用 ML 对蘑菇进行分类
Pub Date : 2023-11-01 DOI: 10.21608/kjis.2023.221370.1016
Sara shehab, Eman Shehab, Rahma Nabil
The Mushroom is kind of fungi. Major health benefits of mushrooms include their ability to kill cancer cells. The goal of this research is to determine the most effective method for mushroom classification, with the categories of deadly and nonpoisonous mushrooms being used. Separate from plants and animals, they belong in their own realm. In terms of how they get nutrients, fungi are different from plants and mammals. Mushrooms are classified as edible and poisoned. To distinguish between two varieties of mushrooms, we can use machine learning, which is used in classification. There are numerous machine learning algorithms that perform classification, but in our model, I utilize random forest, MLP, Linear Regression and decision tree on the features of the mushroom to categorize it into edible and poisonous. Random Forest achieves high accuracy 98.70%. from these results, we can use Ml to differentiate between two varieties of mushrooms because it is used in classification efficiently.
蘑菇是一种真菌。蘑菇对健康的主要益处包括能够杀死癌细胞。这项研究的目标是确定最有效的蘑菇分类方法,并将蘑菇分为致命蘑菇和无毒蘑菇。蘑菇有别于动植物,属于自己的领域。就获取营养的方式而言,真菌与植物和哺乳动物不同。蘑菇分为食用蘑菇和毒蘑菇。为了区分两种蘑菇,我们可以使用机器学习来进行分类。进行分类的机器学习算法有很多,但在我们的模型中,我利用随机森林、MLP、线性回归和决策树对蘑菇的特征进行分类,将其分为可食用的和有毒的。随机森林的准确率高达 98.70%。从这些结果来看,我们可以使用 Ml 来区分两种蘑菇,因为它可以有效地用于分类。
{"title":"The classification of mushroom using ML","authors":"Sara shehab, Eman Shehab, Rahma Nabil","doi":"10.21608/kjis.2023.221370.1016","DOIUrl":"https://doi.org/10.21608/kjis.2023.221370.1016","url":null,"abstract":"The Mushroom is kind of fungi. Major health benefits of mushrooms include their ability to kill cancer cells. The goal of this research is to determine the most effective method for mushroom classification, with the categories of deadly and nonpoisonous mushrooms being used. Separate from plants and animals, they belong in their own realm. In terms of how they get nutrients, fungi are different from plants and mammals. Mushrooms are classified as edible and poisoned. To distinguish between two varieties of mushrooms, we can use machine learning, which is used in classification. There are numerous machine learning algorithms that perform classification, but in our model, I utilize random forest, MLP, Linear Regression and decision tree on the features of the mushroom to categorize it into edible and poisonous. Random Forest achieves high accuracy 98.70%. from these results, we can use Ml to differentiate between two varieties of mushrooms because it is used in classification efficiently.","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"9 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139305488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Responce of DC Motor Via Fuzzy Logic and PID Controllers 通过模糊逻辑和 PID 控制器实现直流电机的动态响应
Pub Date : 2023-11-01 DOI: 10.21608/kjis.2023.329059
A. Salama, M. Darwish, M. Shokry, M.A.Nasef
Fuzzy set Theory of Lotfi A. Zadeh (1965) [1] has been one of the most important area for researches due to its advanced applications in many fields which has the ability to deal with non-linearity and independence of plant modeling, especially in Electrical machines and its control techniques to reach optimum Dynamic response with load variations. In this paper control of direct current (DC) motor with conventional controls proportional–integral–derivative (PID) and fuzzy logic control (FLC) has been investigated and compared with each others for different operating conditions. The mathematical model of Dc motor was modeled and simulated in Matlab Simulink (Mathworks) with illustrated graphs and plots. The performance of the model is expected to show a great results for the fuzzy logic control (FLC) over the PID control [2].
Lotfi A. Zadeh(1965 年)的模糊集理论[1] 由于其在许多领域的先进应用而成为最重要的研究领域之一。Zadeh(1965 年)[1] 提出的模糊集理论已成为最重要的研究领域之一,因为它在许多领域都有先进的应用,能够处理非线性和独立的工厂建模,特别是在电机及其控制技术方面,以达到随负载变化的最佳动态响应。本文研究了直流(DC)电机的传统控制比例-积分-派生(PID)和模糊逻辑控制(FLC),并在不同运行条件下进行了比较。直流电机的数学模型是在 Matlab Simulink (Mathworks) 中建模和仿真的,并配有说明图和曲线图。预计模糊逻辑控制 (FLC) 比 PID 控制效果更好 [2]。
{"title":"Dynamic Responce of DC Motor Via Fuzzy Logic and PID Controllers","authors":"A. Salama, M. Darwish, M. Shokry, M.A.Nasef","doi":"10.21608/kjis.2023.329059","DOIUrl":"https://doi.org/10.21608/kjis.2023.329059","url":null,"abstract":"Fuzzy set Theory of Lotfi A. Zadeh (1965) [1] has been one of the most important area for researches due to its advanced applications in many fields which has the ability to deal with non-linearity and independence of plant modeling, especially in Electrical machines and its control techniques to reach optimum Dynamic response with load variations. In this paper control of direct current (DC) motor with conventional controls proportional–integral–derivative (PID) and fuzzy logic control (FLC) has been investigated and compared with each others for different operating conditions. The mathematical model of Dc motor was modeled and simulated in Matlab Simulink (Mathworks) with illustrated graphs and plots. The performance of the model is expected to show a great results for the fuzzy logic control (FLC) over the PID control [2].","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"9 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139305740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid of Information gain and a Coati Optimization Algorithm for gene selection in microarray gene expression data classification. 基于信息增益和Coati优化算法的基因选择微阵列基因表达数据分类。
Pub Date : 2023-06-01 DOI: 10.21608/kjis.2023.216661.1013
Sarah Osama, A. Ali, Hassan Shaban
Gene expression data has become an essen2al tool for cancer classifica2on because it provides substan2al insights into the underlying mechanisms of cancer progression. However, the high-dimensional nature of microarray gene expression data presents a significant challenge. This paper introduces a new method called IG-COA, which combines Informa2on Gain (IG) approach and Coa2 Op2miza2on Algorithm (COA), to iden2fy the biomarkers genes. COA is a recent algorithm that has not been previously examined for feature or gene selec2on, to the best of our knowledge. Firstly, the IG method is used because using COA directly on microarray datasets is ineffec2ve and can make it challenging to train a classifier accurately. Secondly, the COA algorithm is u2lized to select the op2mal subset of genes from the previously selected ones. The effec2veness of the suggested IG-COA method with a Support Vector Machine is tested on several microarray gene expression datasets, and it exceeds other state-of-the-art methods.
基因表达数据已经成为癌症分类的重要工具,因为它为癌症进展的潜在机制提供了实质性的见解。然而,微阵列基因表达数据的高维性提出了一个重大挑战。本文介绍了一种结合Informa2on Gain (IG)法和Coa2 Op2miza2on算法(COA)的生物标记基因鉴定新方法IG-COA。据我们所知,COA是一种最近的算法,以前还没有对特征或基因选择进行过研究。首先,使用IG方法是因为直接在微阵列数据集上使用COA是无效的,并且会给准确训练分类器带来挑战。其次,利用COA算法从先前选择的基因中选择最优基因子集;基于支持向量机的igg - coa方法的有效性在多个微阵列基因表达数据集上进行了测试,并且超过了其他最先进的方法。
{"title":"A hybrid of Information gain and a Coati Optimization Algorithm for gene selection in microarray gene expression data classification.","authors":"Sarah Osama, A. Ali, Hassan Shaban","doi":"10.21608/kjis.2023.216661.1013","DOIUrl":"https://doi.org/10.21608/kjis.2023.216661.1013","url":null,"abstract":"Gene expression data has become an essen2al tool for cancer classifica2on because it provides substan2al insights into the underlying mechanisms of cancer progression. However, the high-dimensional nature of microarray gene expression data presents a significant challenge. This paper introduces a new method called IG-COA, which combines Informa2on Gain (IG) approach and Coa2 Op2miza2on Algorithm (COA), to iden2fy the biomarkers genes. COA is a recent algorithm that has not been previously examined for feature or gene selec2on, to the best of our knowledge. Firstly, the IG method is used because using COA directly on microarray datasets is ineffec2ve and can make it challenging to train a classifier accurately. Secondly, the COA algorithm is u2lized to select the op2mal subset of genes from the previously selected ones. The effec2veness of the suggested IG-COA method with a Support Vector Machine is tested on several microarray gene expression datasets, and it exceeds other state-of-the-art methods.","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129443948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Classification Event Sequences via Compact Big Sequence 基于压缩大序列的事件序列分类
Pub Date : 2022-12-01 DOI: 10.21608/kjis.2022.181419.1012
M. Hassaan
{"title":"Classification Event Sequences via Compact Big Sequence","authors":"M. Hassaan","doi":"10.21608/kjis.2022.181419.1012","DOIUrl":"https://doi.org/10.21608/kjis.2022.181419.1012","url":null,"abstract":"","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121329050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Summarizing Graph Data Via the Compactness of Disjoint Paths 用不相交路径的紧性来总结图数据
Pub Date : 2022-12-01 DOI: 10.21608/kjis.2022.170163.1011
M. Hassaan
: Graphs are widely used to model many real-world data in many application domains such as chemical compounds, protein structures, gene structures, metabolic pathways, communication networks, and images entities. Graph summarization is very important task which searching for a summary of the given graph. There are many benefits of the graph summarization task which are as follows. By graph summarization, we reduce the data volume and storage as much as possible, speedup the query processing algorithms, and apply the interactive analysis. In this paper, we propose a new graph summarization method based on the compactness of disjoint paths. Our algorithm called DJ_Paths. DJ_Paths is edge-grouping technique. The experimental results show that DJ_Path outperforms the state-of-the-art method, Slugger, with respect to compression ratio (It achieves up to 2x better compression), total response time (It outperforms Slugger by more than one order of magnitude), and memory usage (It is 8x less memory consumption).
图被广泛用于在许多应用领域建模许多真实世界的数据,如化合物、蛋白质结构、基因结构、代谢途径、通信网络和图像实体。图的摘要是一项非常重要的任务,即寻找给定图的摘要。图形摘要任务有以下许多好处。通过图的汇总,尽可能减少数据量和存储空间,加快查询处理算法,并应用交互式分析。本文提出了一种新的基于不相交路径紧致性的图摘要方法。我们的算法叫做dj_path。dj_path是边分组技术。实验结果表明,DJ_Path在压缩比(压缩效率提高2倍)、总响应时间(比Slugger高一个数量级以上)和内存使用(内存消耗减少8倍)方面都优于最先进的方法Slugger。
{"title":"Summarizing Graph Data Via the Compactness of Disjoint Paths","authors":"M. Hassaan","doi":"10.21608/kjis.2022.170163.1011","DOIUrl":"https://doi.org/10.21608/kjis.2022.170163.1011","url":null,"abstract":": Graphs are widely used to model many real-world data in many application domains such as chemical compounds, protein structures, gene structures, metabolic pathways, communication networks, and images entities. Graph summarization is very important task which searching for a summary of the given graph. There are many benefits of the graph summarization task which are as follows. By graph summarization, we reduce the data volume and storage as much as possible, speedup the query processing algorithms, and apply the interactive analysis. In this paper, we propose a new graph summarization method based on the compactness of disjoint paths. Our algorithm called DJ_Paths. DJ_Paths is edge-grouping technique. The experimental results show that DJ_Path outperforms the state-of-the-art method, Slugger, with respect to compression ratio (It achieves up to 2x better compression), total response time (It outperforms Slugger by more than one order of magnitude), and memory usage (It is 8x less memory consumption).","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125594257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Survey Paper of Information Hiding by Using Steganography Techniques 基于隐写技术的信息隐藏研究综述
Pub Date : 2022-12-01 DOI: 10.21608/kjis.2022.280155
Abdelmgeid A. Ali, Waled T. A. Mohamed, Mentllah Sayed
{"title":"A Survey Paper of Information Hiding by Using Steganography Techniques","authors":"Abdelmgeid A. Ali, Waled T. A. Mohamed, Mentllah Sayed","doi":"10.21608/kjis.2022.280155","DOIUrl":"https://doi.org/10.21608/kjis.2022.280155","url":null,"abstract":"","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129065613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Kafrelsheikh Journal of Information Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1