首页 > 最新文献

Future Computing and Informatics Journal最新文献

英文 中文
DATA QUALITY DIMENSIONS, METRICS, AND IMPROVEMENT TECHNIQUES 数据质量维度、度量和改进技术
Pub Date : 2021-07-11 DOI: 10.54623/fue.fcij.6.1.3
Menna Ibrahim Gabr, Y. Helmy, Doaa S. Elzanfaly
Achieving high level of data quality is considered one of the most important assets for any small, medium and large size organizations. Data quality is the main hype for both practitioners and researchers who deal with traditional or big data. The level of data quality is measured through several quality dimensions. High percentage of the current studies focus on assessing and applying data quality on traditional data. As we are in the era of big data, the attention should be paid to the tremendous volume of generated and processed data in which 80% of all the generated data is unstructured. However, the initiatives for creating big data quality evaluation models are still under development. This paper investigates the data quality dimensions that are mostly used in both traditional and big data to figure out the metrics and techniques that are used to measure and handle each dimension. A complete definition for each traditional and big data quality dimension, metrics and handling techniques are presented in this paper. Many data quality dimensions can be applied to both traditional and big data, while few number of quality dimensions are either applied to traditional data or big data. Few number of data quality metrics and barely handling techniques are presented in the current works.
实现高水平的数据质量被认为是任何小型、中型和大型组织最重要的资产之一。数据质量是处理传统数据或大数据的从业者和研究人员的主要炒作。数据质量水平是通过几个质量维度来衡量的。目前有很大比例的研究侧重于传统数据的数据质量评估和应用。我们正处于大数据时代,需要注意的是产生和处理的数据量巨大,其中80%的生成数据是非结构化的。然而,创建大数据质量评估模型的举措仍在发展中。本文研究了传统数据和大数据中最常用的数据质量维度,以找出用于测量和处理每个维度的度量和技术。本文给出了每个传统和大数据质量维度、度量和处理技术的完整定义。许多数据质量维度可以同时适用于传统数据和大数据,而少数质量维度既适用于传统数据又适用于大数据。在目前的工作中,很少有数据质量度量和数据处理技术。
{"title":"DATA QUALITY DIMENSIONS, METRICS, AND IMPROVEMENT TECHNIQUES","authors":"Menna Ibrahim Gabr, Y. Helmy, Doaa S. Elzanfaly","doi":"10.54623/fue.fcij.6.1.3","DOIUrl":"https://doi.org/10.54623/fue.fcij.6.1.3","url":null,"abstract":"Achieving high level of data quality is considered one of the most important assets for any small, medium and large size organizations. Data quality is the main hype for both practitioners and researchers who deal with traditional or big data. The level of data quality is measured through several quality dimensions. High percentage of the current studies focus on assessing and applying data quality on traditional data. As we are in the era of big data, the attention should be paid to the tremendous volume of generated and processed data in which 80% of all the generated data is unstructured. However, the initiatives for creating big data quality evaluation models are still under development. This paper investigates the data quality dimensions that are mostly used in both traditional and big data to figure out the metrics and techniques that are used to measure and handle each dimension. A complete definition for each traditional and big data quality dimension, metrics and handling techniques are presented in this paper. Many data quality dimensions can be applied to both traditional and big data, while few number of quality dimensions are either applied to traditional data or big data. Few number of data quality metrics and barely handling techniques are presented in the current works.","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84121433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A DEEP LEARNING APPROACH FOR FORECASTING GLOBAL COMMODITIES PRICES 一种预测全球商品价格的深度学习方法
Pub Date : 2021-07-11 DOI: 10.54623/fue.fcij.6.1.4
A. S. Elberawi, M. Belal
Forecasting future values of time-series data is a critical task in many disciplines including financial planning and decision-making. Researchers and practitioners in statistics apply traditional statistical methods (such as ARMA, ARIMA, ES, and GARCH) for a long time with varying. accuracies. Deep learning provides more sophisticated and non-linear approximation that supersede traditional statistical methods in most cases. Deep learning methods require minimal features engineering compared to other methods; it adopts an end-to-end learning methodology. In addition, it can handle a huge amount of data and variables. Financial time series forecasting poses a challenge due to its high volatility and non-stationarity nature. This work presents a hybrid deep learning model based on recurrent neural network and Autoencoders techniques to forecast commodity materials' global prices. Results showbetter accuracy compared to traditional regression methods for short-term forecast horizons (1,2,3 and 7days).
预测时间序列数据的未来价值是包括财务规划和决策在内的许多学科的关键任务。统计领域的研究者和实践者长期以来应用传统的统计方法(如ARMA、ARIMA、ES和GARCH),结果各不相同。精度。深度学习提供了更复杂和非线性的近似,在大多数情况下取代了传统的统计方法。与其他方法相比,深度学习方法需要最少的特征工程;它采用端到端学习方法。此外,它还可以处理大量的数据和变量。金融时间序列的高波动性和非平稳性给其预测带来了挑战。这项工作提出了一种基于循环神经网络和自动编码器技术的混合深度学习模型,用于预测商品材料的全球价格。结果表明,与传统回归方法相比,短期预测(1、2、3和7天)的精度更高。
{"title":"A DEEP LEARNING APPROACH FOR FORECASTING GLOBAL COMMODITIES PRICES","authors":"A. S. Elberawi, M. Belal","doi":"10.54623/fue.fcij.6.1.4","DOIUrl":"https://doi.org/10.54623/fue.fcij.6.1.4","url":null,"abstract":"Forecasting future values of time-series data is a critical task in many disciplines including financial planning and decision-making. Researchers and practitioners in statistics apply traditional statistical methods (such as ARMA, ARIMA, ES, and GARCH) for a long time with varying. accuracies. Deep learning provides more sophisticated and non-linear approximation that supersede traditional statistical methods in most cases. Deep learning methods require minimal features engineering compared to other methods; it adopts an end-to-end learning methodology. In addition, it can handle a huge amount of data and variables. Financial time series forecasting poses a challenge due to its high volatility and non-stationarity nature. This work presents a hybrid deep learning model based on recurrent neural network and Autoencoders techniques to forecast commodity materials' global prices. Results showbetter accuracy compared to traditional regression methods for short-term forecast horizons (1,2,3 and 7days).","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"45 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86579592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Review of Data Mining Techniques for Detecting Churners in the Telecommunication Industry 电信行业流失检测的数据挖掘技术综述
Pub Date : 2021-07-11 DOI: 10.54623/fue.fcij.6.1.1
Mahmoud Ewieda, Essam M. Shaaban, M. Roushdy
The telecommunication sector has been developed rapidly and with large amounts of data obtained as a result of increasing in the number of subscribers, modern techniques, data-based applications, and services. As well as better awareness of customer requirements and excellent quality that meets their satisfaction. This satisfaction raises rivalry between firms to maintain the quality of their services and upgrade them. These data can be helpfully extracted for analysis and used for predicting churners. Researchers around the world have conducted important research to understand the uses of Data mining (DM) that can be used to predict customers' churn. This paper supplies a review of nearly 73 recent journalistic articles starting in 2003 to introduce the different DM techniques used in many customerbased churning models. It epitomizes the present literature in the field of communications by highlighting the impact of service quality on customer satisfaction, detecting churners in the telecoms industry, in addition to the sample size used, the churn variables used and the results of various DM technologies. Eventually, the most common techniques for predicting telecommunication churning such as classification, regression analysis, and clustering are included, thus presenting a roadmap for new researchers to build new churn management models.
由于用户数量、现代技术、基于数据的应用和服务的增加,电信部门发展迅速,获得了大量数据。以及更好的客户需求意识和卓越的质量,以满足他们的满意。这种满意度提高了公司之间的竞争,以保持服务质量并提高服务质量。这些数据可以被提取出来用于分析和预测流失。世界各地的研究人员进行了重要的研究,以了解数据挖掘(DM)的用途,该方法可用于预测客户的流失。本文回顾了从2003年开始的近73篇新闻文章,介绍了在许多基于客户的流失模型中使用的不同数据挖掘技术。它通过强调服务质量对客户满意度的影响,检测电信行业的流失者,以及使用的样本量,使用的流失者变量和各种DM技术的结果,概括了通信领域的现有文献。最后,最常见的预测电信流失的技术,如分类,回归分析和聚类,从而为新的研究人员提供了一个路线图,以建立新的流失管理模型。
{"title":"Review of Data Mining Techniques for Detecting Churners in the Telecommunication Industry","authors":"Mahmoud Ewieda, Essam M. Shaaban, M. Roushdy","doi":"10.54623/fue.fcij.6.1.1","DOIUrl":"https://doi.org/10.54623/fue.fcij.6.1.1","url":null,"abstract":"The telecommunication sector has been developed rapidly and with large amounts of data obtained as a result of increasing in the number of subscribers, modern techniques, data-based applications, and services. As well as better awareness of customer requirements and excellent quality that meets their satisfaction. This satisfaction raises rivalry between firms to maintain the quality of their services and upgrade them. These data can be helpfully extracted for analysis and used for predicting churners. Researchers around the world have conducted important research to understand the uses of Data mining (DM) that can be used to predict customers' churn. This paper supplies a review of nearly 73 recent journalistic articles starting in 2003 to introduce the different DM techniques used in many customerbased churning models. It epitomizes the present literature in the field of communications by highlighting the impact of service quality on customer satisfaction, detecting churners in the telecoms industry, in addition to the sample size used, the churn variables used and the results of various DM technologies. Eventually, the most common techniques for predicting telecommunication churning such as classification, regression analysis, and clustering are included, thus presenting a roadmap for new researchers to build new churn management models.","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85032189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A CONFIGURABLE MINING APPROACH FOR LEARNING SERVICES CUSTOMIZATION 用于学习服务定制的可配置挖掘方法
Pub Date : 2021-07-11 DOI: 10.54623/fue.fcij.6.1.2
Aya M. Mostafa, Y. Helmy, A. Idrees
There is no doubt that this age is the age of data and technology. Moreover, there is tremendous development in all fields. The personalized material is a good approach in the different fields. It provides a fit material that matches the styles of readers. It supports readers in various reading domains. This research paper aims to support students in the educational system. Additionally, the research paper designs to increase education values for students. Furthermore, the research paper builds the smart appropriate materials through Egyptian Knowledge Banking (EKB) based on the learner question. The Egyptian Knowledge Bank (EKB) is a rich platform for data. The research paper is implemented in the faculty of Commerce and Business Administration, Business Information System program (BIS) at Helwan University, Egypt.
毫无疑问,这个时代是数据和技术的时代。此外,各领域都有巨大的发展。个性化教材在不同的领域是一种很好的方法。它提供了适合读者风格的材料。它支持各种阅读域的阅读器。本研究论文旨在为教育系统中的学生提供支持。此外,研究论文旨在提高学生的教育价值。此外,研究论文通过基于学习者问题的埃及知识库(EKB)构建智能合适的材料。埃及知识银行(EKB)是一个丰富的数据平台。本研究论文是在埃及赫尔万大学商业信息系统(BIS)工商管理学院实施的。
{"title":"A CONFIGURABLE MINING APPROACH FOR LEARNING SERVICES CUSTOMIZATION","authors":"Aya M. Mostafa, Y. Helmy, A. Idrees","doi":"10.54623/fue.fcij.6.1.2","DOIUrl":"https://doi.org/10.54623/fue.fcij.6.1.2","url":null,"abstract":"There is no doubt that this age is the age of data and technology. Moreover, there is tremendous development in all fields. The personalized material is a good approach in the different fields. It provides a fit material that matches the styles of readers. It supports readers in various reading domains. This research paper aims to support students in the educational system. Additionally, the research paper designs to increase education values for students. Furthermore, the research paper builds the smart appropriate materials through Egyptian Knowledge Banking (EKB) based on the learner question. The Egyptian Knowledge Bank (EKB) is a rich platform for data. The research paper is implemented in the faculty of Commerce and Business Administration, Business Information System program (BIS) at Helwan University, Egypt.","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88946055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep feature learning for FoG episodes prediction In patients with PD 深度特征学习用于PD患者FoG发作预测
Pub Date : 2020-12-30 DOI: 10.54623/fue.fcij.5.2.2
Hadeer Elziaat, Nashwa El-Bendary, Ramdan Mowad
A common symptom of Parkinson's Disease is Freezing of Gait (FoG) that causes an interrupt of the forward progression of the patient’s feet while walking. Therefore, Freezing of Gait episodes is always engaged to the patient's falls. This paper proposes a model for Freezing of Gait episodes detection and prediction in patients with Parkinson's disease. Predicting Freezing of Gait in this paper considers as a multi-class classification problem with 3 classes namely, FoG, pre-FoG, and walking episodes. In this paper, the extracted feature scheme applied for the detection and the prediction of FoG is Convolutional Neural Network (CNN) spectrogram time-frequency features. The dataset is collected from three tri-axial accelerometer sensors for PD patients with FoG. The performance of the suggested approach has been distinguished by different machine learning classifiers and accelerometer axes.
帕金森病的一个常见症状是步态冻结(FoG),它会导致患者走路时脚的向前移动中断。因此,步态冻结发作总是与患者跌倒有关。提出了一种用于帕金森病患者步态冻结发作检测和预测的模型。本文认为步态冻结预测是一个多类分类问题,分为三类,即FoG、pre-FoG和walking episodes。本文提取的用于雾霾检测和预测的特征方案是卷积神经网络(CNN)谱图时频特征。数据集收集自三个三轴加速度计传感器,用于PD患者的FoG。所建议的方法的性能被不同的机器学习分类器和加速度计轴所区分。
{"title":"Deep feature learning for FoG episodes prediction In patients with PD","authors":"Hadeer Elziaat, Nashwa El-Bendary, Ramdan Mowad","doi":"10.54623/fue.fcij.5.2.2","DOIUrl":"https://doi.org/10.54623/fue.fcij.5.2.2","url":null,"abstract":"A common symptom of Parkinson's Disease is Freezing of Gait (FoG) that causes an interrupt of the forward progression of the patient’s feet while walking. Therefore, Freezing of Gait episodes is always engaged to the patient's falls. This paper proposes a model for Freezing of Gait episodes detection and prediction in patients with Parkinson's disease. Predicting Freezing of Gait in this paper considers as a multi-class classification problem with 3 classes namely, FoG, pre-FoG, and walking episodes. In this paper, the extracted feature scheme applied for the detection and the prediction of FoG is Convolutional Neural Network (CNN) spectrogram time-frequency features. The dataset is collected from three tri-axial accelerometer sensors for PD patients with FoG. The performance of the suggested approach has been distinguished by different machine learning classifiers and accelerometer axes.","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"179 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80924542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Performance Analysis of Whale optimization based Data Clustering 基于鲸鱼优化的数据聚类性能分析
Pub Date : 2020-12-30 DOI: 10.54623/fue.fcij.5.2.4
Ahamed B M Shafeeq
Data clustering is the method of gathering of data points so that the more similar points will be in the same group. It is a key role in exploratory data mining and a popular technique used in many fields to analyze statistical data. Quality clusters are the key requirement of the cluster analysis result. There will be tradeoffs between the speed of the clustering algorithm and the quality of clusters it produces. Both the quality and speed criteria must be considered for the state-of-the-art clustering algorithm for applications. The Bio-inspired technique has ensured that the process is not trapped in local minima, which is the main bottleneck of the traditional clustering algorithm. The results produced by the bio-inspired clustering algorithms are better than the traditional clustering algorithms. The newly introduced Whale optimization-based clustering is one of the promising algorithms from the bio-inspired family. The quality of clusters produced by Whale optimization-based clustering is compared with k-means, Kohonen self-organizing feature diagram, Grey wolf optimization. Popular quality measures such as the Silhouette index, Davies-Bouldin index, and Calianski-Harabasz index are used in the evaluation.
数据聚类是收集数据点的方法,以便将更多的相似点放在同一组中。它是探索性数据挖掘中的一个关键角色,也是许多领域中常用的统计数据分析技术。聚类质量是聚类分析结果的关键要求。在聚类算法的速度和它产生的聚类质量之间存在权衡。对于应用程序中最先进的聚类算法,必须同时考虑质量和速度标准。仿生聚类技术保证了聚类过程不会陷入局部极小值,这是传统聚类算法的主要瓶颈。仿生聚类算法的聚类结果优于传统的聚类算法。新引入的基于Whale优化的聚类是生物启发家族中有前途的算法之一。将基于Whale优化的聚类方法与k-means、Kohonen自组织特征图、灰狼优化的聚类质量进行了比较。常用的质量指标如Silhouette指数、Davies-Bouldin指数和Calianski-Harabasz指数用于评价。
{"title":"Performance Analysis of Whale optimization based Data Clustering","authors":"Ahamed B M Shafeeq","doi":"10.54623/fue.fcij.5.2.4","DOIUrl":"https://doi.org/10.54623/fue.fcij.5.2.4","url":null,"abstract":"Data clustering is the method of gathering of data points so that the more similar points will be in the same group. It is a key role in exploratory data mining and a popular technique used in many fields to analyze statistical data. Quality clusters are the key requirement of the cluster analysis result. There will be tradeoffs between the speed of the clustering algorithm and the quality of clusters it produces. Both the quality and speed criteria must be considered for the state-of-the-art clustering algorithm for applications. The Bio-inspired technique has ensured that the process is not trapped in local minima, which is the main bottleneck of the traditional clustering algorithm. The results produced by the bio-inspired clustering algorithms are better than the traditional clustering algorithms. The newly introduced Whale optimization-based clustering is one of the promising algorithms from the bio-inspired family. The quality of clusters produced by Whale optimization-based clustering is compared with k-means, Kohonen self-organizing feature diagram, Grey wolf optimization. Popular quality measures such as the Silhouette index, Davies-Bouldin index, and Calianski-Harabasz index are used in the evaluation.","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"223 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90438514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SIGNATURE IDENTIFICATION AND VERIFICATION SYSTEMS: A COMPARATIVE STUDY ON THE ONLINE AND OFFLINE TECHNIQUES 签名识别与验证系统:在线与离线技术的比较研究
Pub Date : 2020-12-26 DOI: 10.54623/fue.fcij.5.1.3
Nehal Hamdy Al-banhawy, H. Mohsen, N. Ghali
Handwritten signature identification and verification has become an active area of research in recent years. Handwritten signature identification systems are used for identifying the user among all users enrolled in the system while handwritten signature verification systems are used for authenticating a user by comparing a specific signature with his signature that is stored in the system. This paper presents a review for commonly used methods for pre-processing, feature extraction and classification techniques in signature identification and verification systems, in addition to a comparison between the systems implemented in the literature for identification techniques and verification techniques in online and offline systems with taking into consideration the datasets used and results for each system
近年来,手写体签名的识别与验证成为研究的热点。手写签名识别系统用于在系统中注册的所有用户中识别用户,而手写签名验证系统用于通过将特定签名与其存储在系统中的签名进行比较来对用户进行身份验证。本文综述了签名识别和验证系统中常用的预处理方法、特征提取和分类技术,并对在线和离线系统中识别技术和验证技术的文献中实现的系统进行了比较,同时考虑了每个系统使用的数据集和结果
{"title":"SIGNATURE IDENTIFICATION AND VERIFICATION SYSTEMS: A COMPARATIVE STUDY ON THE ONLINE AND OFFLINE TECHNIQUES","authors":"Nehal Hamdy Al-banhawy, H. Mohsen, N. Ghali","doi":"10.54623/fue.fcij.5.1.3","DOIUrl":"https://doi.org/10.54623/fue.fcij.5.1.3","url":null,"abstract":"Handwritten signature identification and verification has become an active area of research in recent years. Handwritten signature identification systems are used for identifying the user among all users enrolled in the system while handwritten signature verification systems are used for authenticating a user by comparing a specific signature with his signature that is stored in the system. This paper presents a review for commonly used methods for pre-processing, feature extraction and classification techniques in signature identification and verification systems, in addition to a comparison between the systems implemented in the literature for identification techniques and verification techniques in online and offline systems with taking into consideration the datasets used and results for each system","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"468 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89661203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Recent Advances and Machine Learning Techniques on Sickle Cell Disease 镰状细胞病的最新进展和机器学习技术
Pub Date : 2020-12-26 DOI: 10.54623/fue.fcij.5.1.4
Noorh H. Alharbi, Rana O. Bameer, Shahad S. Geddan, Hajar M Alharbi
Sickle cell disease is a severe hereditary disease caused by an abnormality of the red blood cells. The current therapeutic decision-making process applied to sickle cell disease includes monitoring a patient’s symptoms and complications and then adjusting the treatment accordingly. This process is time-consuming, which might result in serious consequences for patients’ lives and could lead to irreversible disease complications. Artificial intelligence, specifically machine learning, is a powerful technique that has been used to support medical decisions. This paper aims to review the recently developed machine learning models designed to interpret medical data regarding sickle cell disease. To propose an intelligence model, the suggested framework has to be performed in the following sequence. First, the data is preprocessed by imputing missing values and balancing them. Then, suitable feature selection methods are applied, and different classifiers are trained and tested. Finally, the performing model with the highest predefined performance metric over all experiments conducted is nominated. Thus, the aim of developing such a model is to predict the severity of a patient’s case, to determine the clinical complications of the disease, and to suggest the correct dosage of the treatment(s).
镰状细胞病是一种由红细胞异常引起的严重遗传性疾病。目前镰状细胞病的治疗决策过程包括监测患者的症状和并发症,然后相应地调整治疗。这一过程耗时,可能对患者的生命造成严重后果,并可能导致不可逆转的疾病并发症。人工智能,特别是机器学习,是一项强大的技术,已被用于支持医疗决策。本文旨在回顾最近开发的机器学习模型,旨在解释镰状细胞病的医疗数据。要提出智能模型,建议的框架必须按照以下顺序执行。首先,通过输入缺失值并对其进行平衡,对数据进行预处理。然后,采用合适的特征选择方法,对不同的分类器进行训练和测试。最后,在所有实验中,提名具有最高预定义性能指标的执行模型。因此,开发这种模型的目的是预测患者病情的严重程度,确定疾病的临床并发症,并建议正确的治疗剂量。
{"title":"Recent Advances and Machine Learning Techniques on Sickle Cell Disease","authors":"Noorh H. Alharbi, Rana O. Bameer, Shahad S. Geddan, Hajar M Alharbi","doi":"10.54623/fue.fcij.5.1.4","DOIUrl":"https://doi.org/10.54623/fue.fcij.5.1.4","url":null,"abstract":"Sickle cell disease is a severe hereditary disease caused by an abnormality of the red blood cells. The current therapeutic decision-making process applied to sickle cell disease includes monitoring a patient’s symptoms and complications and then adjusting the treatment accordingly. This process is time-consuming, which might result in serious consequences for patients’ lives and could lead to irreversible disease complications. Artificial intelligence, specifically machine learning, is a powerful technique that has been used to support medical decisions. This paper aims to review the recently developed machine learning models designed to interpret medical data regarding sickle cell disease. To propose an intelligence model, the suggested framework has to be performed in the following sequence. First, the data is preprocessed by imputing missing values and balancing them. Then, suitable feature selection methods are applied, and different classifiers are trained and tested. Finally, the performing model with the highest predefined performance metric over all experiments conducted is nominated. Thus, the aim of developing such a model is to predict the severity of a patient’s case, to determine the clinical complications of the disease, and to suggest the correct dosage of the treatment(s).","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"73 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76948338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Twitter Analysis based on Damage Detection and Geoparsing for Event Mapping Management 基于损伤检测和地质分析的推特分析用于事件映射管理
Pub Date : 2020-12-26 DOI: 10.54623/fue.fcij.5.1.1
Yasmeen Ali Ameen, Khaled Bahnasy, Adel Elmahdy
Background: Early event detection, monitor, and response can significantly decrease the impact of disasters. Lately, the usage of social media for detecting events has displayed hopeful results. Objectives: for event detection and mapping; the tweets will locate and monitor them on a map. This new approach uses grouped geoparsing then scoring for each tweet based on three spatial indicators. Method/Approach: Our approach uses a geoparsing technique to match a location in tweets to geographic locations of multiple-events tweets in Egypt country, administrative subdivision. Thus, additional geographic information acquired from the tweet itself to detect the actual locations that the user mentioned in the tweet. Results: The approach was developed from a large pool of tweets related to various crisis events over one year. Only all (very specific) tweets that were plotted on a crisis map to monitor these events. The tweets were analyzed through predefined geo-graphical displays, message content filters (damage, casualties). Conclusion: A method was implemented to predict the effective start of any crisis event and an inequity condition is applied to determine the end of the event. Results indicate that our automated filtering of information provides valuable information for operational response and crisis communication
背景:早期的事件检测、监测和响应可以显著降低灾害的影响。最近,使用社交媒体来检测事件已经显示出有希望的结果。目标:用于事件检测和映射;推特将在地图上定位和监控他们。这种新方法使用分组地质分析,然后根据三个空间指标对每条推文进行评分。方法/方法:我们的方法使用地质解析技术将推文中的位置与埃及国家行政区划的多事件推文的地理位置相匹配。因此,从推文本身获取额外的地理信息,以检测用户在推文中提到的实际位置。结果:该方法是从一年中与各种危机事件相关的大量推文中开发出来的。只有绘制在危机地图上的所有(非常具体的)推文才能监控这些事件。这些推文通过预定义的地理显示、消息内容过滤器(损害、伤亡)进行分析。结论:采用一种方法预测任何危机事件的有效开始,并采用不平等条件来确定事件的结束。结果表明,我们的信息自动过滤为业务响应和危机沟通提供了有价值的信息
{"title":"Twitter Analysis based on Damage Detection and Geoparsing for Event Mapping Management","authors":"Yasmeen Ali Ameen, Khaled Bahnasy, Adel Elmahdy","doi":"10.54623/fue.fcij.5.1.1","DOIUrl":"https://doi.org/10.54623/fue.fcij.5.1.1","url":null,"abstract":"Background: Early event detection, monitor, and response can significantly decrease the impact of disasters. Lately, the usage of social media for detecting events has displayed hopeful results. Objectives: for event detection and mapping; the tweets will locate and monitor them on a map. This new approach uses grouped geoparsing then scoring for each tweet based on three spatial indicators. Method/Approach: Our approach uses a geoparsing technique to match a location in tweets to geographic locations of multiple-events tweets in Egypt country, administrative subdivision. Thus, additional geographic information acquired from the tweet itself to detect the actual locations that the user mentioned in the tweet. Results: The approach was developed from a large pool of tweets related to various crisis events over one year. Only all (very specific) tweets that were plotted on a crisis map to monitor these events. The tweets were analyzed through predefined geo-graphical displays, message content filters (damage, casualties). Conclusion: A method was implemented to predict the effective start of any crisis event and an inequity condition is applied to determine the end of the event. Results indicate that our automated filtering of information provides valuable information for operational response and crisis communication","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"52 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80401149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Supporting Tool for Requirements Prioritization Process in Agile Software Development 敏捷软件开发中需求优先排序过程的支持工具
Pub Date : 2020-12-26 DOI: 10.54623/fue.fcij.5.1.2
Khaled AbdElazim Muhammad, R. Moawad, Essam Elfakharany
Requirements engineering is a crucial phase of software engineering, and requirements prioritization is an essential stage of requirements engineering particularly in agile software development. Requirements prioritization goals at eliciting which requirements of software need to be covered in a particular release. The key point is which requirement will be selected in the next iteration and which one will be delayed to other iterations for minimizing risk during development and meeting stakeholders’ needs. There are many existing techniques for requirement prioritization, but most of these techniques do not cover continuous growth, change of requirements, and requirements dependencies. The prioritization techniques need to be more continuous, scalable, implemented easily and integrated with software development life cycle. This paper introduces a supporting tool for a proposed framework to prioritize requirements in agile software development. This framework tries to find solutions for the challenges facing this prioritization process such as how to make this prioritization continuous and scalable and how to deal with rapidly requirement changes and its dependencies. The proposed framework is validated in a real case study using its supporting tool, and the results are promising
需求工程是软件工程的一个关键阶段,需求优先级是需求工程的一个重要阶段,尤其是在敏捷软件开发中。需求优先级的目标是引出软件的哪些需求需要在一个特定的版本中被覆盖。关键点是哪些需求将在下一个迭代中被选择,哪些需求将被延迟到其他迭代中,以便在开发期间最小化风险并满足涉众的需求。有许多现有的需求优先级技术,但是这些技术中的大多数都没有涵盖持续增长、需求变更和需求依赖。优先级技术需要更加连续、可伸缩、易于实现并与软件开发生命周期集成。本文介绍了一个支持敏捷软件开发中需求优先级划分框架的工具。该框架试图找到解决方案,以应对优先级排序过程所面临的挑战,例如如何使优先级排序持续且可伸缩,以及如何处理快速的需求变更及其依赖关系。利用该框架的支持工具在实际案例研究中进行了验证,结果令人满意
{"title":"A Supporting Tool for Requirements Prioritization Process in Agile Software Development","authors":"Khaled AbdElazim Muhammad, R. Moawad, Essam Elfakharany","doi":"10.54623/fue.fcij.5.1.2","DOIUrl":"https://doi.org/10.54623/fue.fcij.5.1.2","url":null,"abstract":"Requirements engineering is a crucial phase of software engineering, and requirements prioritization is an essential stage of requirements engineering particularly in agile software development. Requirements prioritization goals at eliciting which requirements of software need to be covered in a particular release. The key point is which requirement will be selected in the next iteration and which one will be delayed to other iterations for minimizing risk during development and meeting stakeholders’ needs. There are many existing techniques for requirement prioritization, but most of these techniques do not cover continuous growth, change of requirements, and requirements dependencies. The prioritization techniques need to be more continuous, scalable, implemented easily and integrated with software development life cycle. This paper introduces a supporting tool for a proposed framework to prioritize requirements in agile software development. This framework tries to find solutions for the challenges facing this prioritization process such as how to make this prioritization continuous and scalable and how to deal with rapidly requirement changes and its dependencies. The proposed framework is validated in a real case study using its supporting tool, and the results are promising","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"57 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80655882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Future Computing and Informatics Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1