首页 > 最新文献

Big Data最新文献

英文 中文
HUTNet: An Efficient Convolutional Neural Network for Handwritten Uchen Tibetan Character Recognition. HUTNet:一种高效的卷积神经网络,用于乌琴藏文手写体识别。
IF 4.6 4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-01 Epub Date: 2023-01-19 DOI: 10.1089/big.2021.0333
Guowei Zhang, Weilan Wang, Ce Zhang, Penghai Zhao, Mingkai Zhang

Recognition of handwritten Uchen Tibetan characters input has been considered an efficient way of acquiring mass data in the digital era. However, it still faces considerable challenges due to seriously touching letters and various morphological features of identical characters. Thus, deeper neural networks are required to achieve decent recognition accuracy, making an efficient, lightweight model design important to balance the inevitable trade-off between accuracy and latency. To reduce the learnable parameters of the network as much as possible and maintain acceptable accuracy, we introduce an efficient model named HUTNet based on the internal relationship between floating-point operations per second (FLOPs) and Memory Access Cost. The proposed network achieves a ResNet-18-level accuracy of 96.86%, with only a tenth of the parameters. The subsequent pruning and knowledge distillation strategies were applied to further reduce the inference latency of the model. Experiments on the test set (Handwritten Uchen Tibetan Data set by Wang [HUTDW]) containing 562 classes of 42,068 samples show that the compressed model achieves a 96.83% accuracy while maintaining lower FLOPs and fewer parameters. To verify the effectiveness of HUTNet, we tested it on the Chinese Handwriting Data sets Handwriting Database 1.1 (HWDB1.1), in which HUTNet achieved an accuracy of 97.24%, higher than that of ResNet-18 and ResNet-34. In general, we conduct extensive experiments on resource and accuracy trade-offs and show a stronger performance compared with other famous models on HUTDW and HWDB1.1. It also unlocks the critical bottleneck for handwritten Uchen Tibetan recognition on low-power computing devices.

在数字时代,识别输入的乌陈藏文手写体被认为是获取海量数据的有效途径。然而,由于字母的感人性和相同字符的各种形态特征,它仍然面临着相当大的挑战。因此,需要更深入的神经网络来实现良好的识别精度,这使得高效、轻量级的模型设计对于平衡精度和延迟之间不可避免的权衡非常重要。为了尽可能减少网络的可学习参数并保持可接受的精度,我们引入了一个基于每秒浮点运算(FLOP)和内存访问成本之间的内部关系的高效模型HUTNet。所提出的网络实现了96.86%的ResNet-18级精度,仅具有十分之一的参数。随后的修剪和知识提取策略被应用于进一步减少模型的推理延迟。在包含562类42068个样本的测试集(王[HUTDW]的手写乌陈藏文数据集)上的实验表明,压缩模型在保持较低的FLOP和较少的参数的同时,实现了96.83%的准确率。为了验证HUTNet的有效性,我们在中文手写数据集手写数据库1.1(HWDB1.1)上对其进行了测试,其中HUTNet实现了97.24%的准确率,高于ResNet-18和ResNet-34。总的来说,我们在资源和精度权衡方面进行了广泛的实验,并在HUTDW和HWDB1.1上显示出与其他著名模型相比更强的性能。它还解开了在低功耗计算设备上手写乌琴藏文识别的关键瓶颈。
{"title":"HUTNet: An Efficient Convolutional Neural Network for Handwritten Uchen Tibetan Character Recognition.","authors":"Guowei Zhang,&nbsp;Weilan Wang,&nbsp;Ce Zhang,&nbsp;Penghai Zhao,&nbsp;Mingkai Zhang","doi":"10.1089/big.2021.0333","DOIUrl":"10.1089/big.2021.0333","url":null,"abstract":"<p><p>Recognition of handwritten Uchen Tibetan characters input has been considered an efficient way of acquiring mass data in the digital era. However, it still faces considerable challenges due to seriously touching letters and various morphological features of identical characters. Thus, deeper neural networks are required to achieve decent recognition accuracy, making an efficient, lightweight model design important to balance the inevitable trade-off between accuracy and latency. To reduce the learnable parameters of the network as much as possible and maintain acceptable accuracy, we introduce an efficient model named HUTNet based on the internal relationship between floating-point operations per second (FLOPs) and Memory Access Cost. The proposed network achieves a ResNet-18-level accuracy of 96.86%, with only a tenth of the parameters. The subsequent pruning and knowledge distillation strategies were applied to further reduce the inference latency of the model. Experiments on the test set (Handwritten Uchen Tibetan Data set by Wang [HUTDW]) containing 562 classes of 42,068 samples show that the compressed model achieves a 96.83% accuracy while maintaining lower FLOPs and fewer parameters. To verify the effectiveness of HUTNet, we tested it on the Chinese Handwriting Data sets Handwriting Database 1.1 (HWDB1.1), in which HUTNet achieved an accuracy of 97.24%, higher than that of ResNet-18 and ResNet-34. In general, we conduct extensive experiments on resource and accuracy trade-offs and show a stronger performance compared with other famous models on HUTDW and HWDB1.1. It also unlocks the critical bottleneck for handwritten Uchen Tibetan recognition on low-power computing devices.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":null,"pages":null},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10543391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction and Big Data Impact Analysis of Telecom Churn by Backpropagation Neural Network Algorithm from the Perspective of Business Model. 基于商业模型的反向传播神经网络算法对电信客户流失的预测与大数据影响分析。
IF 4.6 4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-01 Epub Date: 2023-01-19 DOI: 10.1089/big.2021.0365
Jiabing Xu, Jiarui Liu, Tianen Yao, Yang Li

This study aims to transform the existing telecom operators from traditional Internet operators to digital-driven services, and improve the overall competitiveness of telecom enterprises. Data mining is applied to telecom user classification to process the existing telecom user data through data integration, cleaning, standardization, and transformation. Although the existing algorithms ensure the accuracy of the algorithm on the telecom user analysis platform under big data, they do not solve the limitations of single machine computing and cannot effectively improve the training efficiency of the model. To solve this problem, this article establishes a telecom customer churn prediction model with the help of backpropagation neural network (BPNN) algorithm, and deploys the MapReduce programming framework on Hadoop platform. Using the data of a telecom company, this article analyzes the loss of telecom customers in the big data environment. The research shows that the accuracy of telecom customer churn prediction model in BPNN is 82.12%. After deploying large data sets, the learning and training time of the model is greatly shortened. When the number of nodes is 8, the acceleration ratio of the model remains at 60 seconds. Under big data, the telecom user analysis platform not only ensures the accuracy of the algorithm, but also solves the limitations of single machine computing and effectively improves the training efficiency of the model. Compared with that of the existing research, the accuracy of the model is improved by 25.36%, and the running time is shortened by about twice. This business model based on BPNN algorithm has obvious advantages in processing more data sets, and has great reference value for the digital-driven business model transformation of the telecommunications industry.

本研究旨在将现有的电信运营商从传统的互联网运营商转变为数字驱动的服务,提高电信企业的整体竞争力。数据挖掘应用于电信用户分类,通过数据集成、清理、标准化和转换来处理现有的电信用户数据。现有算法虽然保证了大数据下电信用户分析平台上算法的准确性,但并没有解决单机计算的局限性,也无法有效提高模型的训练效率。为了解决这个问题,本文借助反向传播神经网络(BPNN)算法建立了电信客户流失预测模型,并在Hadoop平台上部署了MapReduce编程框架。本文利用一家电信公司的数据,分析了大数据环境下电信客户的流失情况。研究表明,BPNN中电信客户流失预测模型的准确率为82.12%,部署了大数据集后,模型的学习和训练时间大大缩短。当节点数为8时,模型的加速比保持在60秒。在大数据下,电信用户分析平台不仅保证了算法的准确性,还解决了单机计算的局限性,有效提高了模型的训练效率。与现有研究相比,该模型的精度提高了25.36%,运行时间缩短了约两倍。这种基于BPNN算法的商业模式在处理更多数据集方面具有明显优势,对电信行业数字化驱动的商业模式转型具有很大参考价值。
{"title":"Prediction and Big Data Impact Analysis of Telecom Churn by Backpropagation Neural Network Algorithm from the Perspective of Business Model.","authors":"Jiabing Xu,&nbsp;Jiarui Liu,&nbsp;Tianen Yao,&nbsp;Yang Li","doi":"10.1089/big.2021.0365","DOIUrl":"10.1089/big.2021.0365","url":null,"abstract":"<p><p>This study aims to transform the existing telecom operators from traditional Internet operators to digital-driven services, and improve the overall competitiveness of telecom enterprises. Data mining is applied to telecom user classification to process the existing telecom user data through data integration, cleaning, standardization, and transformation. Although the existing algorithms ensure the accuracy of the algorithm on the telecom user analysis platform under big data, they do not solve the limitations of single machine computing and cannot effectively improve the training efficiency of the model. To solve this problem, this article establishes a telecom customer churn prediction model with the help of backpropagation neural network (BPNN) algorithm, and deploys the MapReduce programming framework on Hadoop platform. Using the data of a telecom company, this article analyzes the loss of telecom customers in the big data environment. The research shows that the accuracy of telecom customer churn prediction model in BPNN is 82.12%. After deploying large data sets, the learning and training time of the model is greatly shortened. When the number of nodes is 8, the acceleration ratio of the model remains at 60 seconds. Under big data, the telecom user analysis platform not only ensures the accuracy of the algorithm, but also solves the limitations of single machine computing and effectively improves the training efficiency of the model. Compared with that of the existing research, the accuracy of the model is improved by 25.36%, and the running time is shortened by about twice. This business model based on BPNN algorithm has obvious advantages in processing more data sets, and has great reference value for the digital-driven business model transformation of the telecommunications industry.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":null,"pages":null},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10549823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
iELMNet: Integrating Novel Improved Extreme Learning Machine and Convolutional Neural Network Model for Traffic Sign Detection. iELMNet:集成新型改进的极限学习机和卷积神经网络模型用于交通标志检测。
IF 4.6 4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-01 Epub Date: 2022-01-06 DOI: 10.1089/big.2021.0279
Aisha Batool, Muhammad Wasif Nisar, Jamal Hussain Shah, Muhammad Attique Khan, Ahmed A Abd El-Latif

Traffic sign detection (TSD) in real-time environment holds great importance for applications such as automated-driven vehicles. Large variety of traffic signs, different appearances, and spatial representations causes a huge intraclass variation. In this article, an extreme learning machine (ELM), convolutional neural network (CNN), and scale transformation (ST)-based model, called improved extreme learning machine network, are proposed to detect traffic signs in real-time environment. The proposed model has a custom DenseNet-based novel CNN architecture, improved version of region proposal networks called accurate anchor prediction model (A2PM), ST, and ELM module. CNN architecture makes use of handcrafted features such as scale-invariant feature transform and Gabor to improvise the edges of traffic signs. The A2PM minimizes the redundancy among extracted features to make the model efficient and ST enables the model to detect traffic signs of different sizes. ELM module enhances the efficiency by reshaping the features. The proposed model is tested on three publicly available data sets, challenging unreal and real environments for traffic sign recognition, Tsinghua-Tencent 100K, and German traffic sign detection benchmark and achieves average precisions of 93.31%, 95.22%, and 99.45%, respectively. These results prove that the proposed model is more efficient than state-of-the-art sign detection techniques.

实时环境中的交通标志检测(TSD)对于自动驾驶车辆等应用具有重要意义。各种各样的交通标志、不同的外观和空间表现导致了巨大的类内变化。本文提出了一种基于极限学习机(ELM)、卷积神经网络(CNN)和尺度变换(ST)的模型,称为改进的极限学习机网络,用于实时环境中的交通标志检测。所提出的模型具有自定义的基于DenseNet的新型CNN架构、称为精确锚预测模型(A2PM)、ST和ELM模块的区域建议网络的改进版本。CNN架构利用手工制作的特征,如尺度不变特征变换和Gabor来即兴制作交通标志的边缘。A2PM使提取的特征之间的冗余最小化,以使模型高效,ST使模型能够检测不同大小的交通标志。ELM模块通过重塑功能来提高效率。该模型在三个公开的数据集上进行了测试,分别挑战了交通标志识别的真实和非真实环境、清华腾讯100K和德国交通标志检测基准,平均精度分别为93.31%、95.22%和99.45%。这些结果证明,所提出的模型比最先进的符号检测技术更有效。
{"title":"iELMNet: Integrating Novel Improved Extreme Learning Machine and Convolutional Neural Network Model for Traffic Sign Detection.","authors":"Aisha Batool,&nbsp;Muhammad Wasif Nisar,&nbsp;Jamal Hussain Shah,&nbsp;Muhammad Attique Khan,&nbsp;Ahmed A Abd El-Latif","doi":"10.1089/big.2021.0279","DOIUrl":"10.1089/big.2021.0279","url":null,"abstract":"<p><p>Traffic sign detection (TSD) in real-time environment holds great importance for applications such as automated-driven vehicles. Large variety of traffic signs, different appearances, and spatial representations causes a huge intraclass variation. In this article, an extreme learning machine (ELM), convolutional neural network (CNN), and scale transformation (ST)-based model, called improved extreme learning machine network, are proposed to detect traffic signs in real-time environment. The proposed model has a custom DenseNet-based novel CNN architecture, improved version of region proposal networks called accurate anchor prediction model (A2PM), ST, and ELM module. CNN architecture makes use of handcrafted features such as scale-invariant feature transform and Gabor to improvise the edges of traffic signs. The A2PM minimizes the redundancy among extracted features to make the model efficient and ST enables the model to detect traffic signs of different sizes. ELM module enhances the efficiency by reshaping the features. The proposed model is tested on three publicly available data sets, challenging unreal and real environments for traffic sign recognition, Tsinghua-Tencent 100K, and German traffic sign detection benchmark and achieves average precisions of 93.31%, 95.22%, and 99.45%, respectively. These results prove that the proposed model is more efficient than state-of-the-art sign detection techniques.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":null,"pages":null},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39655008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Applications of Bayesian Neural Networks in Outlier Detection. 贝叶斯神经网络在异常值检测中的应用。
IF 4.6 4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-01 Epub Date: 2023-01-27 DOI: 10.1089/big.2021.0343
Chen Tao

Anomaly detection is crucial in a variety of domains, such as fraud detection, disease diagnosis, and equipment defect detection. With the development of deep learning, anomaly detection with Bayesian neural networks (BNNs) becomes a novel research topic in recent years. This article aims to propose a widely applicable method of outlier detection (a category of anomaly detection) using BNNs based on uncertainty measurement. There are three kinds of uncertainties generated in the prediction of BNNs: epistemic uncertainty, aleatoric uncertainty, and (model) misspecification uncertainty. Although the approaches in previous studies are adopted to measure epistemic and aleatoric uncertainty, a new method of utilizing loss functions to quantify misspecification uncertainty is proposed in this article. Then, these three uncertainty sources are merged together by specific combination models to construct total prediction uncertainty. In this study, the key idea is that the observations with high total prediction uncertainty should correspond to outliers in the data. The method of this research is applied to the experiments on Modified National Institute of Standards and Technology (MNIST) dataset and Taxi dataset, respectively. From the results, if the network is appropriately constructed and well-trained and model parameters are carefully tuned, most anomalous images in MNIST dataset and all the abnormal traffic periods in Taxi dataset can be nicely detected. In addition, the performance of this method is compared with the BNN anomaly detection methods proposed before and the classical Local Outlier Factor and Density-Based Spatial Clustering of Applications with Noise methods. This study links the classification of uncertainties in essence with anomaly detection and takes the lead to consider combining different uncertainty sources to reform detection outcomes instead of using only single uncertainty each time.

异常检测在欺诈检测、疾病诊断和设备缺陷检测等多个领域都至关重要。随着深度学习的发展,贝叶斯神经网络异常检测成为近年来的一个新的研究课题。本文旨在提出一种广泛适用的基于不确定性测量的使用BNN的异常值检测方法(异常检测的一类)。在BNN的预测中产生了三种不确定性:认知不确定性、任意不确定性和(模型)错误指定不确定性。尽管先前研究中的方法被用来测量认识和假设的不确定性,但本文提出了一种利用损失函数来量化错误指定不确定性的新方法。然后,通过特定的组合模型将这三个不确定性源合并在一起,构建总的预测不确定性。在这项研究中,关键思想是具有高总预测不确定性的观测值应与数据中的异常值相对应。本研究方法分别应用于修改后的国家标准与技术研究所(MNIST)数据集和出租车数据集的实验。从结果来看,如果网络构造得当,训练有素,模型参数经过仔细调整,MNIST数据集中的大多数异常图像和Taxi数据集中的所有异常交通时段都可以很好地检测到。此外,将该方法的性能与之前提出的BNN异常检测方法以及经典的局部异常因子和基于密度的噪声应用空间聚类方法进行了比较。本研究将不确定性的分类本质上与异常检测联系起来,并率先考虑将不同的不确定性来源结合起来,以改变检测结果,而不是每次只使用单个不确定性。
{"title":"Applications of Bayesian Neural Networks in Outlier Detection.","authors":"Chen Tao","doi":"10.1089/big.2021.0343","DOIUrl":"10.1089/big.2021.0343","url":null,"abstract":"<p><p>Anomaly detection is crucial in a variety of domains, such as fraud detection, disease diagnosis, and equipment defect detection. With the development of deep learning, anomaly detection with Bayesian neural networks (BNNs) becomes a novel research topic in recent years. This article aims to propose a widely applicable method of outlier detection (a category of anomaly detection) using BNNs based on uncertainty measurement. There are three kinds of uncertainties generated in the prediction of BNNs: epistemic uncertainty, aleatoric uncertainty, and (model) misspecification uncertainty. Although the approaches in previous studies are adopted to measure epistemic and aleatoric uncertainty, a new method of utilizing loss functions to quantify misspecification uncertainty is proposed in this article. Then, these three uncertainty sources are merged together by specific combination models to construct total prediction uncertainty. In this study, the key idea is that the observations with high total prediction uncertainty should correspond to outliers in the data. The method of this research is applied to the experiments on Modified National Institute of Standards and Technology (MNIST) dataset and Taxi dataset, respectively. From the results, if the network is appropriately constructed and well-trained and model parameters are carefully tuned, most anomalous images in MNIST dataset and all the abnormal traffic periods in Taxi dataset can be nicely detected. In addition, the performance of this method is compared with the BNN anomaly detection methods proposed before and the classical Local Outlier Factor and Density-Based Spatial Clustering of Applications with Noise methods. This study links the classification of uncertainties in essence with anomaly detection and takes the lead to consider combining different uncertainty sources to reform detection outcomes instead of using only single uncertainty each time.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":null,"pages":null},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10681813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Big Data-Driven Futuristic Fabric System in Societal Digital Transformation. 社会数字化转型中大数据驱动的未来织物系统。
IF 4.6 4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-01 DOI: 10.1089/big.2023.29062.editorial
Chinmay Chakraborty, Muhammad Khurram Khan
{"title":"Big Data-Driven Futuristic Fabric System in Societal Digital Transformation.","authors":"Chinmay Chakraborty,&nbsp;Muhammad Khurram Khan","doi":"10.1089/big.2023.29062.editorial","DOIUrl":"10.1089/big.2023.29062.editorial","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":null,"pages":null},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41219740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Social Listening for Product Design Requirement Analysis and Segmentation: A Graph Analysis Approach with User Comments Mining. 面向产品设计需求分析和细分的社会倾听:基于用户评论挖掘的图分析方法。
IF 4.6 4区 计算机科学 Q1 Computer Science Pub Date : 2023-09-04 DOI: 10.1089/big.2022.0021
Xinjun Lai, Guitao Huang, Ziyue Zhao, Shenhe Lin, Sheng Zhang, Huiyu Zhang, Qingxin Chen, Ning Mao

This study investigates customers' product design requirements through online comments from social media, and quickly translates these needs into product design specifications. First, the exponential discriminative snowball sampling method was proposed to generate a product-related subnetwork. Second, natural language processing (NLP) was utilized to mine user-generated comments, and a Graph SAmple and aggreGatE method was employed to embed the user's node neighborhood information in the network to jointly define a user's persona. Clustering was used for market and product model segmentation. Finally, a deep learning bidirectional long short-term memory with conditional random fields framework was introduced for opinion mining. A comment frequency-invert group frequency indicator was proposed to quantify all user groups' positive and negative opinions for various specifications of different product functions. A case study of smartphone design analysis is presented with data from a large Chinese online community called Baidu Tieba. Eleven layers of social relationships were snowball sampled, with 14,018 users and 30,803 comments. The proposed method produced a more reasonable user group clustering result than the conventional method. With our approach, user groups' dominating likes and dislikes for specifications could be immediately identified, and the similar and different preferences of product features by different user groups were instantly revealed. Managerial and engineering insights were also discussed.

本研究通过社交媒体的在线评论来调查客户的产品设计需求,并将这些需求快速转化为产品设计规范。首先,提出了指数判别滚雪球抽样方法生成乘积相关子网络;其次,利用自然语言处理(NLP)对用户生成的评论进行挖掘,并采用Graph SAmple和aggreGatE方法将用户的节点邻域信息嵌入到网络中,共同定义用户的角色;聚类用于市场和产品模型分割。最后,提出了一种基于条件随机场的深度学习双向长短期记忆框架。提出了一种评论频率逆变组频率指标,量化所有用户组对不同产品功能的各种规格的正面和负面意见。一个智能手机设计分析的案例研究采用了来自中国大型在线社区百度贴吧的数据。11层社会关系被滚雪球抽样,有14018个用户和30803条评论。与传统方法相比,该方法获得了更合理的用户组聚类结果。通过我们的方法,可以立即识别用户群体对规格的主导喜欢和不喜欢,并立即揭示不同用户群体对产品功能的相似和不同偏好。还讨论了管理和工程方面的见解。
{"title":"Social Listening for Product Design Requirement Analysis and Segmentation: A Graph Analysis Approach with User Comments Mining.","authors":"Xinjun Lai,&nbsp;Guitao Huang,&nbsp;Ziyue Zhao,&nbsp;Shenhe Lin,&nbsp;Sheng Zhang,&nbsp;Huiyu Zhang,&nbsp;Qingxin Chen,&nbsp;Ning Mao","doi":"10.1089/big.2022.0021","DOIUrl":"https://doi.org/10.1089/big.2022.0021","url":null,"abstract":"<p><p>This study investigates customers' product design requirements through online comments from social media, and quickly translates these needs into product design specifications. First, the exponential discriminative snowball sampling method was proposed to generate a product-related subnetwork. Second, natural language processing (NLP) was utilized to mine user-generated comments, and a Graph SAmple and aggreGatE method was employed to embed the user's node neighborhood information in the network to jointly define a user's persona. Clustering was used for market and product model segmentation. Finally, a deep learning bidirectional long short-term memory with conditional random fields framework was introduced for opinion mining. A comment frequency-invert group frequency indicator was proposed to quantify all user groups' positive and negative opinions for various specifications of different product functions. A case study of smartphone design analysis is presented with data from a large Chinese online community called Baidu Tieba. Eleven layers of social relationships were snowball sampled, with 14,018 users and 30,803 comments. The proposed method produced a more reasonable user group clustering result than the conventional method. With our approach, user groups' dominating likes and dislikes for specifications could be immediately identified, and the similar and different preferences of product features by different user groups were instantly revealed. Managerial and engineering insights were also discussed.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":null,"pages":null},"PeriodicalIF":4.6,"publicationDate":"2023-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10508327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Expert Panel Discussion Embedding Ethics and Equity in Artificial Intelligence and Machine Learning Infrastructure. 专家小组讨论将伦理和公平嵌入人工智能和机器学习基础设施。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-09-01 DOI: 10.1089/big.2023.29061.rtd
Malaika Simmons, Rachele Hendricks-Sturrup, Gabriella Waters, Laurie Novak, Martin Were, Sajid Hussain
{"title":"An Expert Panel Discussion Embedding Ethics and Equity in Artificial Intelligence and Machine Learning Infrastructure.","authors":"Malaika Simmons, Rachele Hendricks-Sturrup, Gabriella Waters, Laurie Novak, Martin Were, Sajid Hussain","doi":"10.1089/big.2023.29061.rtd","DOIUrl":"10.1089/big.2023.29061.rtd","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10623060/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41166031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Analysis of Education Personalized Recommendation System under Vision of System Science Communication 系统科学传播学视野下的教育个性化推荐系统设计与分析
IF 4.6 4区 计算机科学 Q1 Computer Science Pub Date : 2023-08-21 DOI: 10.3390/engproc2023038091
Manying Shi, Fang Luo, Hanping Ke, Shiliang Zhang
{"title":"Design and Analysis of Education Personalized Recommendation System under Vision of System Science Communication","authors":"Manying Shi, Fang Luo, Hanping Ke, Shiliang Zhang","doi":"10.3390/engproc2023038091","DOIUrl":"https://doi.org/10.3390/engproc2023038091","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":null,"pages":null},"PeriodicalIF":4.6,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90898197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Optimising the Cuckoo Search Algorithm for Improved Quality of Service in Cognitive Radio ad hoc Networks 优化布谷鸟搜索算法以提高认知无线电自组织网络的服务质量
IF 4.6 4区 计算机科学 Q1 Computer Science Pub Date : 2023-08-03 DOI: 10.1109/icABCD59051.2023.10220569
Ramahlapane Lerato Moila, M. Velempini
This study proposes an optimised routing scheme, called OCS-AODV, for Cognitive Radio Ad Hoc Networks (CRAHNs) to enhance Quality of Service (QoS). The scheme applies the Cuckoo Search (CS) algorithm optimised with a fitness function to improve the performance of the Ad Hoc On-Demand Distance Vector (AODV). The objective of the study is to evaluate the proposed scheme's performance with respect to delay, packet loss, packet delivery ratio and throughput. The literature review shows that the existing routing protocols have limitations which impact performance in dynamic environments. The proposed OCS-AODV scheme aims to address these limitations by selecting reliable paths based on a fitness function that considers the lifetime of nodes, reliability, and available buffer capacity. The simulation results have shown that the OCS-AODV scheme outperforms the CS-DSDV and ACO-AODV schemes in terms of PDR, packet loss, delay, and throughput. The study concludes that the proposed scheme improves the QoS of routing in CRAHNs. However, the use of a single fitness function may not be optimal for all network scenarios. Multiple fitness functions may be considered in future and the schemes be evaluated in real-world CRAHNs
本研究提出了一种优化的路由方案,称为OCS-AODV,用于认知无线电自组织网络(CRAHNs),以提高服务质量(QoS)。该方案采用适应度函数优化的布谷鸟搜索(CS)算法来提高Ad Hoc按需距离矢量(AODV)的性能。研究的目的是评估所提出的方案在延迟、丢包、包传送率和吞吐量方面的性能。文献综述表明,现有的路由协议存在局限性,影响动态环境下的性能。提出的OCS-AODV方案旨在通过基于考虑节点生存期、可靠性和可用缓冲容量的适应度函数选择可靠路径来解决这些限制。仿真结果表明,OCS-AODV方案在PDR、丢包、时延和吞吐量方面都优于CS-DSDV和ACO-AODV方案。研究表明,该方案提高了crahn中路由的QoS。然而,对于所有网络场景,使用单一适应度函数可能不是最优的。未来可以考虑多个适应度函数,并在实际的crahn中对方案进行评估
{"title":"Optimising the Cuckoo Search Algorithm for Improved Quality of Service in Cognitive Radio ad hoc Networks","authors":"Ramahlapane Lerato Moila, M. Velempini","doi":"10.1109/icABCD59051.2023.10220569","DOIUrl":"https://doi.org/10.1109/icABCD59051.2023.10220569","url":null,"abstract":"This study proposes an optimised routing scheme, called OCS-AODV, for Cognitive Radio Ad Hoc Networks (CRAHNs) to enhance Quality of Service (QoS). The scheme applies the Cuckoo Search (CS) algorithm optimised with a fitness function to improve the performance of the Ad Hoc On-Demand Distance Vector (AODV). The objective of the study is to evaluate the proposed scheme's performance with respect to delay, packet loss, packet delivery ratio and throughput. The literature review shows that the existing routing protocols have limitations which impact performance in dynamic environments. The proposed OCS-AODV scheme aims to address these limitations by selecting reliable paths based on a fitness function that considers the lifetime of nodes, reliability, and available buffer capacity. The simulation results have shown that the OCS-AODV scheme outperforms the CS-DSDV and ACO-AODV schemes in terms of PDR, packet loss, delay, and throughput. The study concludes that the proposed scheme improves the QoS of routing in CRAHNs. However, the use of a single fitness function may not be optimal for all network scenarios. Multiple fitness functions may be considered in future and the schemes be evaluated in real-world CRAHNs","PeriodicalId":51314,"journal":{"name":"Big Data","volume":null,"pages":null},"PeriodicalIF":4.6,"publicationDate":"2023-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74075792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Underwater Network for Mini-Submarine Underwater Observatory 小型潜艇水下观测站的水下网络
IF 4.6 4区 计算机科学 Q1 Computer Science Pub Date : 2023-08-03 DOI: 10.1109/icABCD59051.2023.10220457
A. Periola, M. Sumbwanyambe
Ice melting in the Arctic enables the conduct of underwater neutrino astronomy in new regions with maritime resources. The presented research proposes a novel underwater network that is integrated with terrestrial computing entities to obtain underwater astronomy-associated data. In addition, the proposed network architecture enhances the conduct of underwater neutrino astronomy. This is done by increasing the potential neutrino presence points. Analysis shows that the use of the arctic region in addition to the existing region of Lake Baikal in comparison to the existing case (where only Lake Baikal is utilized) increases the potential neutrino presence points by an average of (28.3 – 65.7) %.
北极地区的冰融化使在有海洋资源的新地区进行水下中微子天文学成为可能。本研究提出了一种与地面计算实体相结合的新型水下网络,以获取水下天文相关数据。此外,所提出的网络结构增强了水下中微子天文学的进行。这是通过增加潜在的中微子存在点来实现的。分析表明,与现有情况(仅利用贝加尔湖)相比,除了利用贝加尔湖现有区域外,还利用北极地区,使潜在中微子存在点平均增加(28.3 - 65.7%)%。
{"title":"An Underwater Network for Mini-Submarine Underwater Observatory","authors":"A. Periola, M. Sumbwanyambe","doi":"10.1109/icABCD59051.2023.10220457","DOIUrl":"https://doi.org/10.1109/icABCD59051.2023.10220457","url":null,"abstract":"Ice melting in the Arctic enables the conduct of underwater neutrino astronomy in new regions with maritime resources. The presented research proposes a novel underwater network that is integrated with terrestrial computing entities to obtain underwater astronomy-associated data. In addition, the proposed network architecture enhances the conduct of underwater neutrino astronomy. This is done by increasing the potential neutrino presence points. Analysis shows that the use of the arctic region in addition to the existing region of Lake Baikal in comparison to the existing case (where only Lake Baikal is utilized) increases the potential neutrino presence points by an average of (28.3 – 65.7) %.","PeriodicalId":51314,"journal":{"name":"Big Data","volume":null,"pages":null},"PeriodicalIF":4.6,"publicationDate":"2023-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75011344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Big Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1