首页 > 最新文献

Big Data最新文献

英文 中文
Prediction and Big Data Impact Analysis of Telecom Churn by Backpropagation Neural Network Algorithm from the Perspective of Business Model. 基于商业模型的反向传播神经网络算法对电信客户流失的预测与大数据影响分析。
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-10-01 Epub Date: 2023-01-19 DOI: 10.1089/big.2021.0365
Jiabing Xu, Jiarui Liu, Tianen Yao, Yang Li

This study aims to transform the existing telecom operators from traditional Internet operators to digital-driven services, and improve the overall competitiveness of telecom enterprises. Data mining is applied to telecom user classification to process the existing telecom user data through data integration, cleaning, standardization, and transformation. Although the existing algorithms ensure the accuracy of the algorithm on the telecom user analysis platform under big data, they do not solve the limitations of single machine computing and cannot effectively improve the training efficiency of the model. To solve this problem, this article establishes a telecom customer churn prediction model with the help of backpropagation neural network (BPNN) algorithm, and deploys the MapReduce programming framework on Hadoop platform. Using the data of a telecom company, this article analyzes the loss of telecom customers in the big data environment. The research shows that the accuracy of telecom customer churn prediction model in BPNN is 82.12%. After deploying large data sets, the learning and training time of the model is greatly shortened. When the number of nodes is 8, the acceleration ratio of the model remains at 60 seconds. Under big data, the telecom user analysis platform not only ensures the accuracy of the algorithm, but also solves the limitations of single machine computing and effectively improves the training efficiency of the model. Compared with that of the existing research, the accuracy of the model is improved by 25.36%, and the running time is shortened by about twice. This business model based on BPNN algorithm has obvious advantages in processing more data sets, and has great reference value for the digital-driven business model transformation of the telecommunications industry.

本研究旨在将现有的电信运营商从传统的互联网运营商转变为数字驱动的服务,提高电信企业的整体竞争力。数据挖掘应用于电信用户分类,通过数据集成、清理、标准化和转换来处理现有的电信用户数据。现有算法虽然保证了大数据下电信用户分析平台上算法的准确性,但并没有解决单机计算的局限性,也无法有效提高模型的训练效率。为了解决这个问题,本文借助反向传播神经网络(BPNN)算法建立了电信客户流失预测模型,并在Hadoop平台上部署了MapReduce编程框架。本文利用一家电信公司的数据,分析了大数据环境下电信客户的流失情况。研究表明,BPNN中电信客户流失预测模型的准确率为82.12%,部署了大数据集后,模型的学习和训练时间大大缩短。当节点数为8时,模型的加速比保持在60秒。在大数据下,电信用户分析平台不仅保证了算法的准确性,还解决了单机计算的局限性,有效提高了模型的训练效率。与现有研究相比,该模型的精度提高了25.36%,运行时间缩短了约两倍。这种基于BPNN算法的商业模式在处理更多数据集方面具有明显优势,对电信行业数字化驱动的商业模式转型具有很大参考价值。
{"title":"Prediction and Big Data Impact Analysis of Telecom Churn by Backpropagation Neural Network Algorithm from the Perspective of Business Model.","authors":"Jiabing Xu,&nbsp;Jiarui Liu,&nbsp;Tianen Yao,&nbsp;Yang Li","doi":"10.1089/big.2021.0365","DOIUrl":"10.1089/big.2021.0365","url":null,"abstract":"<p><p>This study aims to transform the existing telecom operators from traditional Internet operators to digital-driven services, and improve the overall competitiveness of telecom enterprises. Data mining is applied to telecom user classification to process the existing telecom user data through data integration, cleaning, standardization, and transformation. Although the existing algorithms ensure the accuracy of the algorithm on the telecom user analysis platform under big data, they do not solve the limitations of single machine computing and cannot effectively improve the training efficiency of the model. To solve this problem, this article establishes a telecom customer churn prediction model with the help of backpropagation neural network (BPNN) algorithm, and deploys the MapReduce programming framework on Hadoop platform. Using the data of a telecom company, this article analyzes the loss of telecom customers in the big data environment. The research shows that the accuracy of telecom customer churn prediction model in BPNN is 82.12%. After deploying large data sets, the learning and training time of the model is greatly shortened. When the number of nodes is 8, the acceleration ratio of the model remains at 60 seconds. Under big data, the telecom user analysis platform not only ensures the accuracy of the algorithm, but also solves the limitations of single machine computing and effectively improves the training efficiency of the model. Compared with that of the existing research, the accuracy of the model is improved by 25.36%, and the running time is shortened by about twice. This business model based on BPNN algorithm has obvious advantages in processing more data sets, and has great reference value for the digital-driven business model transformation of the telecommunications industry.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"355-368"},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10549823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
iELMNet: Integrating Novel Improved Extreme Learning Machine and Convolutional Neural Network Model for Traffic Sign Detection. iELMNet:集成新型改进的极限学习机和卷积神经网络模型用于交通标志检测。
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-10-01 Epub Date: 2022-01-06 DOI: 10.1089/big.2021.0279
Aisha Batool, Muhammad Wasif Nisar, Jamal Hussain Shah, Muhammad Attique Khan, Ahmed A Abd El-Latif

Traffic sign detection (TSD) in real-time environment holds great importance for applications such as automated-driven vehicles. Large variety of traffic signs, different appearances, and spatial representations causes a huge intraclass variation. In this article, an extreme learning machine (ELM), convolutional neural network (CNN), and scale transformation (ST)-based model, called improved extreme learning machine network, are proposed to detect traffic signs in real-time environment. The proposed model has a custom DenseNet-based novel CNN architecture, improved version of region proposal networks called accurate anchor prediction model (A2PM), ST, and ELM module. CNN architecture makes use of handcrafted features such as scale-invariant feature transform and Gabor to improvise the edges of traffic signs. The A2PM minimizes the redundancy among extracted features to make the model efficient and ST enables the model to detect traffic signs of different sizes. ELM module enhances the efficiency by reshaping the features. The proposed model is tested on three publicly available data sets, challenging unreal and real environments for traffic sign recognition, Tsinghua-Tencent 100K, and German traffic sign detection benchmark and achieves average precisions of 93.31%, 95.22%, and 99.45%, respectively. These results prove that the proposed model is more efficient than state-of-the-art sign detection techniques.

实时环境中的交通标志检测(TSD)对于自动驾驶车辆等应用具有重要意义。各种各样的交通标志、不同的外观和空间表现导致了巨大的类内变化。本文提出了一种基于极限学习机(ELM)、卷积神经网络(CNN)和尺度变换(ST)的模型,称为改进的极限学习机网络,用于实时环境中的交通标志检测。所提出的模型具有自定义的基于DenseNet的新型CNN架构、称为精确锚预测模型(A2PM)、ST和ELM模块的区域建议网络的改进版本。CNN架构利用手工制作的特征,如尺度不变特征变换和Gabor来即兴制作交通标志的边缘。A2PM使提取的特征之间的冗余最小化,以使模型高效,ST使模型能够检测不同大小的交通标志。ELM模块通过重塑功能来提高效率。该模型在三个公开的数据集上进行了测试,分别挑战了交通标志识别的真实和非真实环境、清华腾讯100K和德国交通标志检测基准,平均精度分别为93.31%、95.22%和99.45%。这些结果证明,所提出的模型比最先进的符号检测技术更有效。
{"title":"iELMNet: Integrating Novel Improved Extreme Learning Machine and Convolutional Neural Network Model for Traffic Sign Detection.","authors":"Aisha Batool,&nbsp;Muhammad Wasif Nisar,&nbsp;Jamal Hussain Shah,&nbsp;Muhammad Attique Khan,&nbsp;Ahmed A Abd El-Latif","doi":"10.1089/big.2021.0279","DOIUrl":"10.1089/big.2021.0279","url":null,"abstract":"<p><p>Traffic sign detection (TSD) in real-time environment holds great importance for applications such as automated-driven vehicles. Large variety of traffic signs, different appearances, and spatial representations causes a huge intraclass variation. In this article, an extreme learning machine (ELM), convolutional neural network (CNN), and scale transformation (ST)-based model, called improved extreme learning machine network, are proposed to detect traffic signs in real-time environment. The proposed model has a custom DenseNet-based novel CNN architecture, improved version of region proposal networks called accurate anchor prediction model (A2PM), ST, and ELM module. CNN architecture makes use of handcrafted features such as scale-invariant feature transform and Gabor to improvise the edges of traffic signs. The A2PM minimizes the redundancy among extracted features to make the model efficient and ST enables the model to detect traffic signs of different sizes. ELM module enhances the efficiency by reshaping the features. The proposed model is tested on three publicly available data sets, challenging unreal and real environments for traffic sign recognition, Tsinghua-Tencent 100K, and German traffic sign detection benchmark and achieves average precisions of 93.31%, 95.22%, and 99.45%, respectively. These results prove that the proposed model is more efficient than state-of-the-art sign detection techniques.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"323-338"},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39655008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Applications of Bayesian Neural Networks in Outlier Detection. 贝叶斯神经网络在异常值检测中的应用。
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-10-01 Epub Date: 2023-01-27 DOI: 10.1089/big.2021.0343
Chen Tao

Anomaly detection is crucial in a variety of domains, such as fraud detection, disease diagnosis, and equipment defect detection. With the development of deep learning, anomaly detection with Bayesian neural networks (BNNs) becomes a novel research topic in recent years. This article aims to propose a widely applicable method of outlier detection (a category of anomaly detection) using BNNs based on uncertainty measurement. There are three kinds of uncertainties generated in the prediction of BNNs: epistemic uncertainty, aleatoric uncertainty, and (model) misspecification uncertainty. Although the approaches in previous studies are adopted to measure epistemic and aleatoric uncertainty, a new method of utilizing loss functions to quantify misspecification uncertainty is proposed in this article. Then, these three uncertainty sources are merged together by specific combination models to construct total prediction uncertainty. In this study, the key idea is that the observations with high total prediction uncertainty should correspond to outliers in the data. The method of this research is applied to the experiments on Modified National Institute of Standards and Technology (MNIST) dataset and Taxi dataset, respectively. From the results, if the network is appropriately constructed and well-trained and model parameters are carefully tuned, most anomalous images in MNIST dataset and all the abnormal traffic periods in Taxi dataset can be nicely detected. In addition, the performance of this method is compared with the BNN anomaly detection methods proposed before and the classical Local Outlier Factor and Density-Based Spatial Clustering of Applications with Noise methods. This study links the classification of uncertainties in essence with anomaly detection and takes the lead to consider combining different uncertainty sources to reform detection outcomes instead of using only single uncertainty each time.

异常检测在欺诈检测、疾病诊断和设备缺陷检测等多个领域都至关重要。随着深度学习的发展,贝叶斯神经网络异常检测成为近年来的一个新的研究课题。本文旨在提出一种广泛适用的基于不确定性测量的使用BNN的异常值检测方法(异常检测的一类)。在BNN的预测中产生了三种不确定性:认知不确定性、任意不确定性和(模型)错误指定不确定性。尽管先前研究中的方法被用来测量认识和假设的不确定性,但本文提出了一种利用损失函数来量化错误指定不确定性的新方法。然后,通过特定的组合模型将这三个不确定性源合并在一起,构建总的预测不确定性。在这项研究中,关键思想是具有高总预测不确定性的观测值应与数据中的异常值相对应。本研究方法分别应用于修改后的国家标准与技术研究所(MNIST)数据集和出租车数据集的实验。从结果来看,如果网络构造得当,训练有素,模型参数经过仔细调整,MNIST数据集中的大多数异常图像和Taxi数据集中的所有异常交通时段都可以很好地检测到。此外,将该方法的性能与之前提出的BNN异常检测方法以及经典的局部异常因子和基于密度的噪声应用空间聚类方法进行了比较。本研究将不确定性的分类本质上与异常检测联系起来,并率先考虑将不同的不确定性来源结合起来,以改变检测结果,而不是每次只使用单个不确定性。
{"title":"Applications of Bayesian Neural Networks in Outlier Detection.","authors":"Chen Tao","doi":"10.1089/big.2021.0343","DOIUrl":"10.1089/big.2021.0343","url":null,"abstract":"<p><p>Anomaly detection is crucial in a variety of domains, such as fraud detection, disease diagnosis, and equipment defect detection. With the development of deep learning, anomaly detection with Bayesian neural networks (BNNs) becomes a novel research topic in recent years. This article aims to propose a widely applicable method of outlier detection (a category of anomaly detection) using BNNs based on uncertainty measurement. There are three kinds of uncertainties generated in the prediction of BNNs: epistemic uncertainty, aleatoric uncertainty, and (model) misspecification uncertainty. Although the approaches in previous studies are adopted to measure epistemic and aleatoric uncertainty, a new method of utilizing loss functions to quantify misspecification uncertainty is proposed in this article. Then, these three uncertainty sources are merged together by specific combination models to construct total prediction uncertainty. In this study, the key idea is that the observations with high total prediction uncertainty should correspond to outliers in the data. The method of this research is applied to the experiments on Modified National Institute of Standards and Technology (MNIST) dataset and Taxi dataset, respectively. From the results, if the network is appropriately constructed and well-trained and model parameters are carefully tuned, most anomalous images in MNIST dataset and all the abnormal traffic periods in Taxi dataset can be nicely detected. In addition, the performance of this method is compared with the BNN anomaly detection methods proposed before and the classical Local Outlier Factor and Density-Based Spatial Clustering of Applications with Noise methods. This study links the classification of uncertainties in essence with anomaly detection and takes the lead to consider combining different uncertainty sources to reform detection outcomes instead of using only single uncertainty each time.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"369-386"},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10681813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Big Data-Driven Futuristic Fabric System in Societal Digital Transformation. 社会数字化转型中大数据驱动的未来织物系统。
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-10-01 DOI: 10.1089/big.2023.29062.editorial
Chinmay Chakraborty, Muhammad Khurram Khan
{"title":"Big Data-Driven Futuristic Fabric System in Societal Digital Transformation.","authors":"Chinmay Chakraborty,&nbsp;Muhammad Khurram Khan","doi":"10.1089/big.2023.29062.editorial","DOIUrl":"10.1089/big.2023.29062.editorial","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"11 5","pages":"321-322"},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41219740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Expert Panel Discussion Embedding Ethics and Equity in Artificial Intelligence and Machine Learning Infrastructure. 专家小组讨论将伦理和公平嵌入人工智能和机器学习基础设施。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-09-01 DOI: 10.1089/big.2023.29061.rtd
Malaika Simmons, Rachele Hendricks-Sturrup, Gabriella Waters, Laurie Novak, Martin Were, Sajid Hussain
{"title":"An Expert Panel Discussion Embedding Ethics and Equity in Artificial Intelligence and Machine Learning Infrastructure.","authors":"Malaika Simmons, Rachele Hendricks-Sturrup, Gabriella Waters, Laurie Novak, Martin Were, Sajid Hussain","doi":"10.1089/big.2023.29061.rtd","DOIUrl":"10.1089/big.2023.29061.rtd","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"11 S1","pages":"i-S13"},"PeriodicalIF":2.6,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10623060/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41166031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Analysis of Education Personalized Recommendation System under Vision of System Science Communication 系统科学传播学视野下的教育个性化推荐系统设计与分析
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-08-21 DOI: 10.3390/engproc2023038091
Manying Shi, Fang Luo, Hanping Ke, Shiliang Zhang
{"title":"Design and Analysis of Education Personalized Recommendation System under Vision of System Science Communication","authors":"Manying Shi, Fang Luo, Hanping Ke, Shiliang Zhang","doi":"10.3390/engproc2023038091","DOIUrl":"https://doi.org/10.3390/engproc2023038091","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"1 1","pages":""},"PeriodicalIF":4.6,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90898197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Realizing the Potential of Stratosphere Utilization via Stratosphere Data Centers 通过平流层数据中心实现平流层利用的潜力
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-08-03 DOI: 10.1109/icABCD59051.2023.10220520
A. Periola, K. Ogudo, A. Alonge
The stratosphere is an aeronautical resource whose use is of benefit to the government in delivering aviation services. It also provides a freely cooling environment making it suitable for hosting non-terrestrial data centers. However, the development of a framework enabling the utilization of the stratosphere requires further research attention. The research presents a multientity architecture that describes the role of a stratosphere-bound airport that supports the deployment and use of future stratosphere-based data centers. The solution being presented is intended to increase the operational duration of future deployed stratosphere-based data centers. The focus here is on enhancing the operational duration of the stratosphere-based data center. This is important for its role in future networks. Analysis shows that the proposed solution improved the operational duration by at least 33% and by up to 76% on average.
平流层是一种航空资源,对政府提供航空服务大有裨益。它还提供了一个自由冷却的环境,使其适合托管非地面数据中心。然而,开发一个能够利用平流层的框架需要进一步的研究注意。该研究提出了一个多实体架构,描述了平流层机场的角色,支持未来基于平流层的数据中心的部署和使用。提出的解决方案旨在增加未来部署的基于平流层的数据中心的运行持续时间。这里的重点是增强基于平流层的数据中心的运行持续时间。这对于它在未来网络中的作用非常重要。分析表明,提出的解决方案将运行持续时间提高了至少33%,平均提高了76%。
{"title":"Realizing the Potential of Stratosphere Utilization via Stratosphere Data Centers","authors":"A. Periola, K. Ogudo, A. Alonge","doi":"10.1109/icABCD59051.2023.10220520","DOIUrl":"https://doi.org/10.1109/icABCD59051.2023.10220520","url":null,"abstract":"The stratosphere is an aeronautical resource whose use is of benefit to the government in delivering aviation services. It also provides a freely cooling environment making it suitable for hosting non-terrestrial data centers. However, the development of a framework enabling the utilization of the stratosphere requires further research attention. The research presents a multientity architecture that describes the role of a stratosphere-bound airport that supports the deployment and use of future stratosphere-based data centers. The solution being presented is intended to increase the operational duration of future deployed stratosphere-based data centers. The focus here is on enhancing the operational duration of the stratosphere-based data center. This is important for its role in future networks. Analysis shows that the proposed solution improved the operational duration by at least 33% and by up to 76% on average.","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"4 1","pages":"1-6"},"PeriodicalIF":4.6,"publicationDate":"2023-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80701322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance Analysis of a Light Weight Ground Robotic Vehicle by Implementing Adaptive Neuro-Fuzzy Inference System (ANFIS) 基于自适应神经模糊推理系统(ANFIS)的轻型地面机器人车辆性能分析
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-08-03 DOI: 10.1109/icABCD59051.2023.10220494
M. Okwu, I. Emovon, O. J. Oyejide, Kingsley C. Ezekiel, Olaye Messiah, Perpetua C. Jones-Iwuagwu
Automated Guided Vehicles (AGVs) are widely used as delivery agents and for material transportation in factories, hospital environment, and other facilities. Conducting performance tests on AGVs has the potential to ratify and improve the efficiency, and reliability of the system. However, published studies on performance analysis focused on classical metrics for such evaluation. In this study, the emphasis is on the performance evaluation of a developed lightweight AGV using the Adaptive Neuro-fuzzy Inference System (ANFIS). The developed line following AGV is flexible, intelligent, and nifty, and can be accessed wirelessly, and controlled by an operator. It was programmed to avoid collision with the help of a proximity sensor attached. The performance test was conducted by drawing black lines on a plain surface for easy navigation of the AGV. A series of experiments was carried out by using realistic test variables like the navigation pattern of AGV, test accuracy, energy efficiency, obstacle avoidance, task accomplishment, and others. Sensitivity analysis was done using the ANFIS surface plot. The total system intelligence (TSI) obtained for the different trials are 76%; 79%; 80%; 81%; 79% and 81 %, for the first, second, third, fourth, fifth, and final trials respectively. The preeminent observable performance was the fourth and sixth trials, obtained at 81 %. The outcome of the investigation reveals that the ANFIS model is an efficient soft computing technique capable of performing TSI tests of AGVs with a high degree of accuracy. The model is also recommended in AGV platooning.
自动导引车(agv)被广泛应用于工厂、医院和其他设施的递送代理和物料运输。在agv上进行性能测试有可能验证和提高系统的效率和可靠性。然而,已发表的关于绩效分析的研究主要集中在此类评估的经典指标上。在本研究中,重点研究了一种基于自适应神经模糊推理系统(ANFIS)的轻型AGV的性能评估。开发的线路跟踪AGV灵活、智能、美观,可以无线接入,由操作员控制。它被编程为在附加的接近传感器的帮助下避免碰撞。为了便于AGV导航,在平面上绘制黑线进行性能测试。采用AGV导航模式、测试精度、能效、避障、任务完成等现实测试变量进行了一系列实验。采用ANFIS地形图进行敏感性分析。不同试验获得的总系统智能(TSI)为76%;79%;80%;81%;分别为第一、第二、第三、第四、第五和最后一次试验的79%和81%。最显著的观察表现是在第四和第六次试验中,达到81%。研究结果表明,ANFIS模型是一种高效的软计算技术,能够对agv进行高精度的TSI测试。该模型也适用于AGV队列调度。
{"title":"Performance Analysis of a Light Weight Ground Robotic Vehicle by Implementing Adaptive Neuro-Fuzzy Inference System (ANFIS)","authors":"M. Okwu, I. Emovon, O. J. Oyejide, Kingsley C. Ezekiel, Olaye Messiah, Perpetua C. Jones-Iwuagwu","doi":"10.1109/icABCD59051.2023.10220494","DOIUrl":"https://doi.org/10.1109/icABCD59051.2023.10220494","url":null,"abstract":"Automated Guided Vehicles (AGVs) are widely used as delivery agents and for material transportation in factories, hospital environment, and other facilities. Conducting performance tests on AGVs has the potential to ratify and improve the efficiency, and reliability of the system. However, published studies on performance analysis focused on classical metrics for such evaluation. In this study, the emphasis is on the performance evaluation of a developed lightweight AGV using the Adaptive Neuro-fuzzy Inference System (ANFIS). The developed line following AGV is flexible, intelligent, and nifty, and can be accessed wirelessly, and controlled by an operator. It was programmed to avoid collision with the help of a proximity sensor attached. The performance test was conducted by drawing black lines on a plain surface for easy navigation of the AGV. A series of experiments was carried out by using realistic test variables like the navigation pattern of AGV, test accuracy, energy efficiency, obstacle avoidance, task accomplishment, and others. Sensitivity analysis was done using the ANFIS surface plot. The total system intelligence (TSI) obtained for the different trials are 76%; 79%; 80%; 81%; 79% and 81 %, for the first, second, third, fourth, fifth, and final trials respectively. The preeminent observable performance was the fourth and sixth trials, obtained at 81 %. The outcome of the investigation reveals that the ANFIS model is an efficient soft computing technique capable of performing TSI tests of AGVs with a high degree of accuracy. The model is also recommended in AGV platooning.","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"52 1","pages":"1-7"},"PeriodicalIF":4.6,"publicationDate":"2023-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85157357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Malware detection using Explainable ML models based on Feature Extraction using API calls 基于API调用的特征提取的可解释ML模型的恶意软件检测
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-08-03 DOI: 10.1109/icABCD59051.2023.10220515
Bhanu Prakash Reddy Banda, Bianca Govan, K. Roy, Kelvin S. Bryant
Malware attacks have become a crucial problem in modern life. From 2015 to 2021 about 56.1billion malware attacks have taken place in the world. A malware attack typically costs a business over 2.5 million dollars to remediate. According to Cybersecurity Ventures, during the next five years, the cost of cybercrime would increase by 15% yearly, reaching 10.5 trillion USD annually by 2025 from 3 trillion USD in 2015. There is a global epidemic of malware. Studies imply that malware's effects are deteriorating. The main defense against malware tools is malware detectors. Therefore, it is crucial that we research malware detection methods to better comprehend their advantages and disadvantages. This research focuses on an Application Pro-gramming Interface (API) call-based malware detection strategy with Machine Learning to further improve malware detection. The Limitations that motivated to work on this project was the lack of datasets with newly attacked malware samples and also lack of detecting the malware with good accuracy. The main goal of this research is to understand the malware behavior on the Windows platform, use a dynamic analysis to identify various aspects or features that have dangerous code patterns from malware samples and employ various malware and benign samples to construct and validate machine learning-based malware detection models. The data was gathered from publicly accessible sites and sampled using a sandbox approach. Machine Learning models were built using the new dataset. The Supervised Learning models and deep Learning models were applied to the dataset and then the results were compared and cross-checked to get the best fit model. This investigation demonstrated the possibility of estab- lishing a high-precision capability for the detection of malware while combining API calls and Machine Learning models., The strategy yielded a high malware detection accuracy of 88.26% (XGBoost) model and 90.70% (MLP classifier) for Windows-based platforms. We have used Explainable Machine Learning, namely the SHapley Additive exPlanations (SHAP) value methods to demonstrate the important component or feature responsible for the prediction of the model.
恶意软件攻击已经成为现代生活中的一个关键问题。从2015年到2021年,全球共发生了561亿次恶意软件攻击。恶意软件攻击通常要花费企业超过250万美元来修复。根据网络安全风险投资公司的数据,在未来五年内,网络犯罪的成本将以每年15%的速度增长,到2025年将从2015年的每年3万亿美元达到10.5万亿美元。恶意软件在全球流行。研究表明,恶意软件的影响正在恶化。针对恶意软件的主要防御工具是恶意软件检测器。因此,研究恶意软件检测方法以更好地了解它们的优缺点是至关重要的。本文研究了一种基于应用程序编程接口(API)调用的恶意软件检测策略,并结合机器学习进一步改进恶意软件检测。这个项目的局限性是缺乏新攻击的恶意软件样本的数据集,也缺乏准确检测恶意软件的能力。本研究的主要目标是了解Windows平台上的恶意软件行为,使用动态分析来识别恶意软件样本中具有危险代码模式的各个方面或特征,并使用各种恶意软件和良性样本来构建和验证基于机器学习的恶意软件检测模型。数据是从可公开访问的站点收集的,并使用沙盒方法进行抽样。使用新的数据集建立了机器学习模型。将有监督学习模型和深度学习模型应用于数据集,然后对结果进行比较和交叉检查,以获得最佳拟合模型。这项调查证明了在结合API调用和机器学习模型的同时,建立高精度恶意软件检测能力的可能性。该策略在windows平台上的恶意软件检测准确率为88.26% (XGBoost)模型和90.70% (MLP分类器)。我们使用了可解释机器学习,即SHapley加性解释(SHAP)值方法来展示负责模型预测的重要成分或特征。
{"title":"Malware detection using Explainable ML models based on Feature Extraction using API calls","authors":"Bhanu Prakash Reddy Banda, Bianca Govan, K. Roy, Kelvin S. Bryant","doi":"10.1109/icABCD59051.2023.10220515","DOIUrl":"https://doi.org/10.1109/icABCD59051.2023.10220515","url":null,"abstract":"Malware attacks have become a crucial problem in modern life. From 2015 to 2021 about 56.1billion malware attacks have taken place in the world. A malware attack typically costs a business over 2.5 million dollars to remediate. According to Cybersecurity Ventures, during the next five years, the cost of cybercrime would increase by 15% yearly, reaching 10.5 trillion USD annually by 2025 from 3 trillion USD in 2015. There is a global epidemic of malware. Studies imply that malware's effects are deteriorating. The main defense against malware tools is malware detectors. Therefore, it is crucial that we research malware detection methods to better comprehend their advantages and disadvantages. This research focuses on an Application Pro-gramming Interface (API) call-based malware detection strategy with Machine Learning to further improve malware detection. The Limitations that motivated to work on this project was the lack of datasets with newly attacked malware samples and also lack of detecting the malware with good accuracy. The main goal of this research is to understand the malware behavior on the Windows platform, use a dynamic analysis to identify various aspects or features that have dangerous code patterns from malware samples and employ various malware and benign samples to construct and validate machine learning-based malware detection models. The data was gathered from publicly accessible sites and sampled using a sandbox approach. Machine Learning models were built using the new dataset. The Supervised Learning models and deep Learning models were applied to the dataset and then the results were compared and cross-checked to get the best fit model. This investigation demonstrated the possibility of estab- lishing a high-precision capability for the detection of malware while combining API calls and Machine Learning models., The strategy yielded a high malware detection accuracy of 88.26% (XGBoost) model and 90.70% (MLP classifier) for Windows-based platforms. We have used Explainable Machine Learning, namely the SHapley Additive exPlanations (SHAP) value methods to demonstrate the important component or feature responsible for the prediction of the model.","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"3 1","pages":"1-7"},"PeriodicalIF":4.6,"publicationDate":"2023-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85206168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Early Detection of Lung Cancer via Breath Analysis Utilising Electronic Nose 利用电子鼻进行呼吸分析的肺癌早期检测
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-08-03 DOI: 10.1109/icABCD59051.2023.10220490
Funmilayo S. Moninuola, E. Adetiba, Anthony A. Atayero, A. Awelewa, A. Adeyeye, Oluwadamilola Oshin, J. Ameh, A. Abayomi, Victor Ezekiel
Lung Cancer (LC), have the highest mortality rate and the second-highest incidence rate of all cancers combined because of a pathophysiological imbalance in the fundamental mechanism of cell proliferation. For patients with LC, prompt diagnosis and treatment are of utmost importance. The orthodox methods employed for detecting LC are characterised by invasiveness, protracted duration, high cost and exhibit reduced efficacy in detecting malignant cells during the initial phases of the ailment. The increasing attention of researchers toward the potential of utilising Volatile Organic Compound (VOC) biomarkers for the non-invasive detection of LC can be attributed to the advancements in techniques and procedures. This study offers a state-of-the-art portable E-nose that has the potential to enhance clinical outcomes associated with the early diagnosis of LC. Three ML models - SVM, AdaBoost, and MLP were employed to discriminate LC from other respiratory breathprint dataset. The MLP model achieved the highest performance accuracy result of 89.05%, specificity 95.12%, and sensitivity of 80%.
肺癌(LC)由于细胞增殖基本机制的病理生理失衡,在所有癌症中死亡率最高,发病率第二高。对于LC患者,及时诊断和治疗至关重要。传统的LC检测方法具有侵袭性、持续时间长、成本高、在疾病初期检测恶性细胞的效率较低等特点。研究人员越来越关注利用挥发性有机化合物(VOC)生物标志物进行LC无创检测的潜力,这可归因于技术和程序的进步。这项研究提供了一种最先进的便携式电子鼻,它有可能提高与LC早期诊断相关的临床结果。使用SVM、AdaBoost和MLP三种机器学习模型将LC与其他呼吸指纹数据进行区分。MLP模型的最高性能准确率为89.05%,特异性为95.12%,灵敏度为80%。
{"title":"Early Detection of Lung Cancer via Breath Analysis Utilising Electronic Nose","authors":"Funmilayo S. Moninuola, E. Adetiba, Anthony A. Atayero, A. Awelewa, A. Adeyeye, Oluwadamilola Oshin, J. Ameh, A. Abayomi, Victor Ezekiel","doi":"10.1109/icABCD59051.2023.10220490","DOIUrl":"https://doi.org/10.1109/icABCD59051.2023.10220490","url":null,"abstract":"Lung Cancer (LC), have the highest mortality rate and the second-highest incidence rate of all cancers combined because of a pathophysiological imbalance in the fundamental mechanism of cell proliferation. For patients with LC, prompt diagnosis and treatment are of utmost importance. The orthodox methods employed for detecting LC are characterised by invasiveness, protracted duration, high cost and exhibit reduced efficacy in detecting malignant cells during the initial phases of the ailment. The increasing attention of researchers toward the potential of utilising Volatile Organic Compound (VOC) biomarkers for the non-invasive detection of LC can be attributed to the advancements in techniques and procedures. This study offers a state-of-the-art portable E-nose that has the potential to enhance clinical outcomes associated with the early diagnosis of LC. Three ML models - SVM, AdaBoost, and MLP were employed to discriminate LC from other respiratory breathprint dataset. The MLP model achieved the highest performance accuracy result of 89.05%, specificity 95.12%, and sensitivity of 80%.","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"9 1","pages":"1-6"},"PeriodicalIF":4.6,"publicationDate":"2023-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82427584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Big Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1