首页 > 最新文献

Big Data最新文献

英文 中文
Prognostic Modeling for Liver Cirrhosis Mortality Prediction and Real-Time Health Monitoring from Electronic Health Data. 基于电子健康数据的肝硬化死亡率预测和实时健康监测的预后建模。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-09 DOI: 10.1089/big.2024.0071
Chengping Zhang, Muhammad Faisal Buland Iqbal, Imran Iqbal, Minghao Cheng, Nadia Sarhan, Emad Mahrous Awwad, Yazeed Yasin Ghadi

Liver cirrhosis stands as a prominent contributor to mortality, impacting millions across the United States. Enabling health care providers to predict early mortality among patients with cirrhosis holds the potential to enhance treatment efficacy significantly. Our hypothesis centers on the correlation between mortality and laboratory test results along with relevant diagnoses in this patient cohort. Additionally, we posit that a deep learning model could surpass the predictive capabilities of the existing Model for End-Stage Liver Disease score. This research seeks to advance prognostic accuracy and refine approaches to address the critical challenges posed by cirrhosis-related mortality. This study evaluates the performance of an artificial neural network model for liver disease classification using various training dataset sizes. Through meticulous experimentation, three distinct training proportions were analyzed: 70%, 80%, and 90%. The model's efficacy was assessed using precision, recall, F1-score, accuracy, and support metrics, alongside receiver operating characteristic (ROC) and precision-recall (PR) curves. The ROC curves were quantified using the area under the curve (AUC) metric. Results indicated that the model's performance improved with an increased size of the training dataset. Specifically, the 80% training data model achieved the highest AUC, suggesting superior classification ability over the models trained with 70% and 90% data. PR analysis revealed a steep trade-off between precision and recall across all datasets, with 80% training data again demonstrating a slightly better balance. This is indicative of the challenges faced in achieving high precision with a concurrently high recall, a common issue in imbalanced datasets such as those found in medical diagnostics.

肝硬化是导致死亡的一个重要因素,影响着美国数百万人。使卫生保健提供者能够预测肝硬化患者的早期死亡率,具有显著提高治疗效果的潜力。我们的假设集中在死亡率与实验室检测结果以及该患者队列的相关诊断之间的相关性。此外,我们假设深度学习模型可以超越现有终末期肝病评分模型的预测能力。本研究旨在提高预后准确性和改进方法,以解决肝硬化相关死亡率带来的关键挑战。本研究使用不同的训练数据集大小来评估肝脏疾病分类的人工神经网络模型的性能。通过细致的实验,分析了三种不同的训练比例:70%、80%和90%。采用精密度、召回率、f1评分、准确度和支持度指标,以及受试者工作特征(ROC)和精确召回率(PR)曲线来评估模型的有效性。ROC曲线采用曲线下面积(AUC)指标进行量化。结果表明,模型的性能随着训练数据集大小的增加而提高。具体来说,80%训练数据模型的AUC最高,表明其分类能力优于70%和90%训练数据模型。PR分析揭示了所有数据集的准确率和召回率之间的巨大权衡,80%的训练数据再次显示出稍好的平衡。这表明在实现高精度和高召回率方面所面临的挑战,这是医疗诊断等不平衡数据集中的常见问题。
{"title":"Prognostic Modeling for Liver Cirrhosis Mortality Prediction and Real-Time Health Monitoring from Electronic Health Data.","authors":"Chengping Zhang, Muhammad Faisal Buland Iqbal, Imran Iqbal, Minghao Cheng, Nadia Sarhan, Emad Mahrous Awwad, Yazeed Yasin Ghadi","doi":"10.1089/big.2024.0071","DOIUrl":"https://doi.org/10.1089/big.2024.0071","url":null,"abstract":"<p><p>Liver cirrhosis stands as a prominent contributor to mortality, impacting millions across the United States. Enabling health care providers to predict early mortality among patients with cirrhosis holds the potential to enhance treatment efficacy significantly. Our hypothesis centers on the correlation between mortality and laboratory test results along with relevant diagnoses in this patient cohort. Additionally, we posit that a deep learning model could surpass the predictive capabilities of the existing Model for End-Stage Liver Disease score. This research seeks to advance prognostic accuracy and refine approaches to address the critical challenges posed by cirrhosis-related mortality. This study evaluates the performance of an artificial neural network model for liver disease classification using various training dataset sizes. Through meticulous experimentation, three distinct training proportions were analyzed: 70%, 80%, and 90%. The model's efficacy was assessed using precision, recall, F1-score, accuracy, and support metrics, alongside receiver operating characteristic (ROC) and precision-recall (PR) curves. The ROC curves were quantified using the area under the curve (AUC) metric. Results indicated that the model's performance improved with an increased size of the training dataset. Specifically, the 80% training data model achieved the highest AUC, suggesting superior classification ability over the models trained with 70% and 90% data. PR analysis revealed a steep trade-off between precision and recall across all datasets, with 80% training data again demonstrating a slightly better balance. This is indicative of the challenges faced in achieving high precision with a concurrently high recall, a common issue in imbalanced datasets such as those found in medical diagnostics.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Social Listening for Product Design Requirement Analysis and Segmentation: A Graph Analysis Approach with User Comments Mining. 面向产品设计需求分析和细分的社会倾听:基于用户评论挖掘的图分析方法。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-01 Epub Date: 2023-09-04 DOI: 10.1089/big.2022.0021
Xinjun Lai, Guitao Huang, Ziyue Zhao, Shenhe Lin, Sheng Zhang, Huiyu Zhang, Qingxin Chen, Ning Mao

This study investigates customers' product design requirements through online comments from social media, and quickly translates these needs into product design specifications. First, the exponential discriminative snowball sampling method was proposed to generate a product-related subnetwork. Second, natural language processing (NLP) was utilized to mine user-generated comments, and a Graph SAmple and aggreGatE method was employed to embed the user's node neighborhood information in the network to jointly define a user's persona. Clustering was used for market and product model segmentation. Finally, a deep learning bidirectional long short-term memory with conditional random fields framework was introduced for opinion mining. A comment frequency-invert group frequency indicator was proposed to quantify all user groups' positive and negative opinions for various specifications of different product functions. A case study of smartphone design analysis is presented with data from a large Chinese online community called Baidu Tieba. Eleven layers of social relationships were snowball sampled, with 14,018 users and 30,803 comments. The proposed method produced a more reasonable user group clustering result than the conventional method. With our approach, user groups' dominating likes and dislikes for specifications could be immediately identified, and the similar and different preferences of product features by different user groups were instantly revealed. Managerial and engineering insights were also discussed.

本研究通过社交媒体的在线评论来调查客户的产品设计需求,并将这些需求快速转化为产品设计规范。首先,提出了指数判别滚雪球抽样方法生成乘积相关子网络;其次,利用自然语言处理(NLP)对用户生成的评论进行挖掘,并采用Graph SAmple和aggreGatE方法将用户的节点邻域信息嵌入到网络中,共同定义用户的角色;聚类用于市场和产品模型分割。最后,提出了一种基于条件随机场的深度学习双向长短期记忆框架。提出了一种评论频率逆变组频率指标,量化所有用户组对不同产品功能的各种规格的正面和负面意见。本文以智能手机设计分析为例,分析了来自中国大型在线社区百度贴吧的数据。11层社会关系被滚雪球抽样,有14018个用户和30803条评论。与传统方法相比,该方法获得了更合理的用户组聚类结果。通过我们的方法,可以立即识别用户群体对规格的主导喜欢和不喜欢,并立即揭示不同用户群体对产品功能的相似和不同偏好。还讨论了管理和工程方面的见解。
{"title":"Social Listening for Product Design Requirement Analysis and Segmentation: A Graph Analysis Approach with User Comments Mining.","authors":"Xinjun Lai, Guitao Huang, Ziyue Zhao, Shenhe Lin, Sheng Zhang, Huiyu Zhang, Qingxin Chen, Ning Mao","doi":"10.1089/big.2022.0021","DOIUrl":"10.1089/big.2022.0021","url":null,"abstract":"<p><p>This study investigates customers' product design requirements through online comments from social media, and quickly translates these needs into product design specifications. First, the exponential discriminative snowball sampling method was proposed to generate a product-related subnetwork. Second, natural language processing (NLP) was utilized to mine user-generated comments, and a Graph SAmple and aggreGatE method was employed to embed the user's node neighborhood information in the network to jointly define a user's persona. Clustering was used for market and product model segmentation. Finally, a deep learning bidirectional long short-term memory with conditional random fields framework was introduced for opinion mining. A comment frequency-invert group frequency indicator was proposed to quantify all user groups' positive and negative opinions for various specifications of different product functions. A case study of smartphone design analysis is presented with data from a large Chinese online community called Baidu Tieba. Eleven layers of social relationships were snowball sampled, with 14,018 users and 30,803 comments. The proposed method produced a more reasonable user group clustering result than the conventional method. With our approach, user groups' dominating likes and dislikes for specifications could be immediately identified, and the similar and different preferences of product features by different user groups were instantly revealed. Managerial and engineering insights were also discussed.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"456-477"},"PeriodicalIF":2.6,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10508327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IDLIQ: An Incremental Deterministic Finite Automaton Learning Algorithm Through Inverse Queries for Regular Grammar Inference. 基于逆查询的正则语法推理的增量确定性有限自动机学习算法。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-01 Epub Date: 2023-05-18 DOI: 10.1089/big.2022.0158
Farah Haneef, Muddassar A Sindhu

We present an efficient incremental learning algorithm for Deterministic Finite Automaton (DFA) with the help of inverse query (IQ) and membership query (MQ). This algorithm is an extension of the Identification of Regular Languages (ID) algorithm from a complete to an incremental learning setup. The learning algorithm learns by making use of a set of labeled examples and by posing queries to a knowledgeable teacher, which is equipped to answer IQs along with MQs and equivalence query. Based on the examples (elements of the live complete set) and responses against IQs from the minimally adequate teacher (MAT), the learning algorithm constructs the hypothesis automaton, consistent with all observed examples. The Incremental DFA Learning algorithm through Inverse Queries (IDLIQ) takes O(|Σ|N+|Pc||F|) time complexity in the presence of a MAT and ensures convergence to a minimal representation of the target DFA with finite number of labeled examples. Existing incremental learning algorithms; the Incremental ID, the Incremental Distinguishing Strings have polynomial (cubic) time complexity in the presence of a MAT. Therefore, sometimes, these algorithms even fail to learn large complex software systems. In this research work, we have reduced the complexity (from cubic to square form) of the DFA learning in an incremental setup. Finally, we prove the correctness and termination of the IDLIQ algorithm.

提出了一种基于逆查询(IQ)和隶属查询(MQ)的确定性有限自动机(DFA)的高效增量学习算法。该算法是正则语言识别(ID)算法的扩展,从一个完整的学习设置到一个增量的学习设置。学习算法通过使用一组标记的示例并向知识渊博的教师提出问题来学习,该教师配备了回答iq以及MQs和等价查询的设备。基于示例(实时完整集的元素)和对最低适足教师(MAT)智商的响应,学习算法构建假设自动机,与所有观察到的示例一致。通过逆查询的增量DFA学习算法(IDLIQ)在MAT存在下的时间复杂度为0 (|Σ|N+|Pc||F|),并确保收敛到具有有限数量标记示例的目标DFA的最小表示。现有的增量学习算法;在存在MAT的情况下,增量ID、增量区分字符串具有多项式(三次)时间复杂度。因此,有时这些算法甚至无法学习大型复杂软件系统。在这项研究工作中,我们在增量设置中降低了DFA学习的复杂性(从立方形式到平方形式)。最后,我们证明了IDLIQ算法的正确性和终止性。
{"title":"IDLIQ: An Incremental <i>Deterministic Finite Automaton</i> Learning Algorithm Through Inverse Queries for Regular Grammar Inference.","authors":"Farah Haneef, Muddassar A Sindhu","doi":"10.1089/big.2022.0158","DOIUrl":"10.1089/big.2022.0158","url":null,"abstract":"<p><p>We present an efficient incremental learning algorithm for <i>Deterministic Finite Automaton</i> (DFA) with the help of inverse query (IQ) and membership query (MQ). This algorithm is an extension of the <i>Identification of Regular Languages</i> (ID) algorithm from a complete to an incremental learning setup. The learning algorithm learns by making use of a set of labeled examples and by posing queries to a knowledgeable teacher, which is equipped to answer IQs along with MQs and equivalence query. Based on the examples (elements of the live complete set) and responses against IQs from the <i>minimally adequate teacher</i> (MAT), the learning algorithm constructs the hypothesis automaton, consistent with all observed examples. The Incremental DFA Learning algorithm through Inverse Queries (IDLIQ) takes <math><mstyle><mi>O</mi></mstyle><mrow><mo>(</mo><mrow><mo>|</mo><mi>Σ</mi><mo>|</mo><mi>N</mi><mo>+</mo><mo>|</mo><msub><mrow><mi>P</mi></mrow><mrow><mi>c</mi></mrow></msub><mo>|</mo><mo>|</mo><mi>F</mi><mo>|</mo></mrow><mo>)</mo></mrow></math> time complexity in the presence of a MAT and ensures convergence to a minimal representation of the target DFA with finite number of labeled examples. Existing incremental learning algorithms; the Incremental ID, the Incremental Distinguishing Strings have polynomial (cubic) time complexity in the presence of a MAT. Therefore, sometimes, these algorithms even fail to learn large complex software systems. In this research work, we have reduced the complexity (from cubic to square form) of the DFA learning in an incremental setup. Finally, we prove the correctness and termination of the IDLIQ algorithm.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"446-455"},"PeriodicalIF":2.6,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9492270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DGSLSTM: Deep Gated Stacked Long Short-Term Memory Neural Network for Traffic Flow Forecasting of Transportation Networks on Big Data Environment. DGSLSTM:用于大数据环境下交通网络流量预测的深度门控堆叠长短期记忆神经网络。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-01 Epub Date: 2022-02-10 DOI: 10.1089/big.2021.0013
Rajalakshmi Gurusamy, Siva Ranjani Seenivasan

Deep learning and big data techniques have become increasingly popular in traffic flow forecasting. Deep neural networks have also been applied to traffic flow forecasting. Furthermore, it is difficult to determine whether neural networks can be used for accurate traffic flow prediction. Moreover, since the network model is poorly structured and the parameter optimization technique is inappropriate, the traffic flow prediction is inaccurate because of the lack of certainty. The proposed system overcomes these problems by combining multiple simple recurrent long short-term memory (LSTM) neural networks with time traits to predict traffic flow using a deep gated stacked neural network. To deepen the model, the hidden layers have been trained using an unsupervised layer-by-layer approach. This approach provides a systematic representation of the time series data. A systematic representation of hidden layers improves the accuracy of time series forecasting by capturing information at multiple levels. Furthermore, it emphasizes the importance of model structure, random weight initialization, and hyperparameters used in stacked LSTM to enhance predictive performance. The prediction efficacy of the deep gated stacked LSTM model is compared with that of the gated recurrent unit model and the stacked autoencoder model.

深度学习和大数据技术在交通流量预测中越来越受欢迎。深度神经网络也被应用于交通流量预测。但是,神经网络在交通流量预测中的应用并不成熟,而且神经网络能否用于准确的交通流量预测也很难确定。此外,由于网络模型结构不完善,参数优化技术不恰当,交通流量预测因缺乏确定性而不准确。所提出的系统克服了这些问题,将多个简单的递归长短期记忆(LSTM)神经网络与时间特征相结合,使用深度门控堆叠神经网络预测交通流量。为了深化模型,采用无监督逐层方法对隐藏层进行了训练。这种方法可以系统地表示时间序列数据。隐层的系统化表示通过捕捉多层次的信息,提高了时间序列预测的准确性。此外,它还强调了堆叠 LSTM 中使用的模型结构、随机权重初始化和超参数对提高预测性能的重要性。将深度门控堆叠 LSTM 模型的预测效果与门控递归单元模型和堆叠自动编码器模型进行了比较。
{"title":"DGSLSTM: Deep Gated Stacked Long Short-Term Memory Neural Network for Traffic Flow Forecasting of Transportation Networks on Big Data Environment.","authors":"Rajalakshmi Gurusamy, Siva Ranjani Seenivasan","doi":"10.1089/big.2021.0013","DOIUrl":"10.1089/big.2021.0013","url":null,"abstract":"<p><p>Deep learning and big data techniques have become increasingly popular in traffic flow forecasting. Deep neural networks have also been applied to traffic flow forecasting. Furthermore, it is difficult to determine whether neural networks can be used for accurate traffic flow prediction. Moreover, since the network model is poorly structured and the parameter optimization technique is inappropriate, the traffic flow prediction is inaccurate because of the lack of certainty. The proposed system overcomes these problems by combining multiple simple recurrent long short-term memory (LSTM) neural networks with time traits to predict traffic flow using a deep gated stacked neural network. To deepen the model, the hidden layers have been trained using an unsupervised layer-by-layer approach. This approach provides a systematic representation of the time series data. A systematic representation of hidden layers improves the accuracy of time series forecasting by capturing information at multiple levels. Furthermore, it emphasizes the importance of model structure, random weight initialization, and hyperparameters used in stacked LSTM to enhance predictive performance. The prediction efficacy of the deep gated stacked LSTM model is compared with that of the gated recurrent unit model and the stacked autoencoder model.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"504-517"},"PeriodicalIF":2.6,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39906258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Internet of Things Data Visualization for Business Intelligence. 用于商业智能的物联网数据可视化。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-01 Epub Date: 2022-02-08 DOI: 10.1089/big.2021.0200
Sima Attar-Khorasani, Ricardo Chalmeta

This study contributes to the research on Internet of Things data visualization for business intelligence processes, an area of growing interest to scholars, by conducting a systematic review of the literature. A total of 237 articles published over the past 11 years were obtained and compared. This made it possible to identify the top contributing and most influential authors, countries, publishers, institutions, papers, and research findings, together with the challenges facing current research. Based on these results, this work provides a thorough insight into the field by proposing four research categories (Technology infrastructure, Case examples, Final-user experience, and Big Data tools), together with the development of these research streams over time and their future research directions.

本研究通过对文献进行系统性回顾,为物联网数据可视化在商业智能流程中的应用这一学者们日益关注的领域的研究做出了贡献。本研究共获得并比较了过去 11 年间发表的 237 篇文章。这样就有可能找出贡献最大、最有影响力的作者、国家、出版商、机构、论文和研究成果,以及当前研究面临的挑战。基于这些结果,本作品提出了四个研究类别(技术基础架构、案例、最终用户体验和大数据工具),并介绍了这些研究流的长期发展及其未来的研究方向,从而提供了对该领域的透彻见解。
{"title":"Internet of Things Data Visualization for Business Intelligence.","authors":"Sima Attar-Khorasani, Ricardo Chalmeta","doi":"10.1089/big.2021.0200","DOIUrl":"10.1089/big.2021.0200","url":null,"abstract":"<p><p>This study contributes to the research on Internet of Things data visualization for business intelligence processes, an area of growing interest to scholars, by conducting a systematic review of the literature. A total of 237 articles published over the past 11 years were obtained and compared. This made it possible to identify the top contributing and most influential authors, countries, publishers, institutions, papers, and research findings, together with the challenges facing current research. Based on these results, this work provides a thorough insight into the field by proposing four research categories (Technology infrastructure, Case examples, Final-user experience, and Big Data tools), together with the development of these research streams over time and their future research directions.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"478-503"},"PeriodicalIF":2.6,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39899264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Customer Prioritization Integrated Supply Chain Optimization Model with Outsourcing Strategies. 具有外包策略的客户优先级集成供应链优化模型。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-01 Epub Date: 2022-04-29 DOI: 10.1089/big.2021.0292
Iram Mushtaq, Muhammad Umer, Muhammad Attique Khan, Seifedine Kadry

Pre-COVID-19, most of the supply chains functioned with more capacity than demand. However, COVID-19 changed traditional supply chains' dynamics, resulting in more demand than their production capacity. This article presents a multiobjective and multiperiod supply chain network design along with customer prioritization, keeping in view price discounts and outsourcing strategies to deal with the situation when demand exceeds the production capacity. Initially, a multiperiod, multiobjective supply chain network is designed that incorporates prices discounts, customer prioritization, and outsourcing strategies. The main objectives are profit and prioritization maximization and time minimization. The introduction of the prioritization objective function having customer ranking as a parameter and considering less capacity than demand and outsourcing differentiates this model from the literature. A four-valued neutrosophic multiobjective optimization method is introduced to solve the model developed. To validate the model, a case study of the supply chain of a surgical mask is presented as the real-life application of research. The research findings are useful for the managers to make price discounts and preferred customer prioritization decisions under uncertainty and imbalance between supply and demand. In future, the logic in the proposed model can be used to create web application for optimal decision-making in supply chains.

在2019冠状病毒病之前,大多数供应链的运转能力大于需求。然而,2019冠状病毒病改变了传统供应链的动态,导致需求大于产能。本文提出了一种多目标、多周期的供应链网络设计,考虑了客户优先级、价格折扣和外包策略,以应对需求超过生产能力的情况。首先,设计了一个包含价格折扣、客户优先级和外包策略的多周期、多目标供应链网络。主要目标是利润和优先级最大化和时间最小化。引入优先级目标函数,以客户排名为参数,考虑容量小于需求和外包,使该模型与文献不同。引入了一种四值嗜中性多目标优化方法来求解所建立的模型。为了验证该模型,本文提出了一个外科口罩供应链的案例研究,作为研究的实际应用。研究结果对在不确定性和供需不平衡的情况下进行价格折扣和顾客优先排序决策具有指导意义。在未来,该模型中的逻辑可以用于创建web应用程序,以实现供应链中的最优决策。
{"title":"Customer Prioritization Integrated Supply Chain Optimization Model with Outsourcing Strategies.","authors":"Iram Mushtaq, Muhammad Umer, Muhammad Attique Khan, Seifedine Kadry","doi":"10.1089/big.2021.0292","DOIUrl":"10.1089/big.2021.0292","url":null,"abstract":"<p><p>Pre-COVID-19, most of the supply chains functioned with more capacity than demand. However, COVID-19 changed traditional supply chains' dynamics, resulting in more demand than their production capacity. This article presents a multiobjective and multiperiod supply chain network design along with customer prioritization, keeping in view price discounts and outsourcing strategies to deal with the situation when demand exceeds the production capacity. Initially, a multiperiod, multiobjective supply chain network is designed that incorporates prices discounts, customer prioritization, and outsourcing strategies. The main objectives are profit and prioritization maximization and time minimization. The introduction of the prioritization objective function having customer ranking as a parameter and considering less capacity than demand and outsourcing differentiates this model from the literature. A four-valued neutrosophic multiobjective optimization method is introduced to solve the model developed. To validate the model, a case study of the supply chain of a surgical mask is presented as the real-life application of research. The research findings are useful for the managers to make price discounts and preferred customer prioritization decisions under uncertainty and imbalance between supply and demand. In future, the logic in the proposed model can be used to create web application for optimal decision-making in supply chains.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"1 1","pages":"413-428"},"PeriodicalIF":2.6,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41616147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attribute-Based Adaptive Homomorphic Encryption for Big Data Security. 基于属性的大数据安全自适应同态加密。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-10-01 Epub Date: 2021-12-13 DOI: 10.1089/big.2021.0176
R Thenmozhi, S Shridevi, Sachi Nandan Mohanty, Vicente García-Díaz, Deepak Gupta, Prayag Tiwari, Mohammad Shorfuzzaman

There is a drastic increase in Internet usage across the globe, thanks to mobile phone penetration. This extreme Internet usage generates huge volumes of data, in other terms, big data. Security and privacy are the main issues to be considered in big data management. Hence, in this article, Attribute-based Adaptive Homomorphic Encryption (AAHE) is developed to enhance the security of big data. In the proposed methodology, Oppositional Based Black Widow Optimization (OBWO) is introduced to select the optimal key parameters by following the AAHE method. By considering oppositional function, Black Widow Optimization (BWO) convergence analysis was enhanced. The proposed methodology has different processes, namely, process setup, encryption, and decryption processes. The researcher evaluated the proposed methodology with non-abelian rings and the homomorphism process in ciphertext format. Further, it is also utilized in improving one-way security related to the conjugacy examination issue. Afterward, homomorphic encryption is developed to secure the big data. The study considered two types of big data such as adult datasets and anonymous Microsoft web datasets to validate the proposed methodology. With the help of performance metrics such as encryption time, decryption time, key size, processing time, downloading, and uploading time, the proposed method was evaluated and compared against conventional cryptography techniques such as Rivest-Shamir-Adleman (RSA) and Elliptic Curve Cryptography (ECC). Further, the key generation process was also compared against conventional methods such as BWO, Particle Swarm Optimization (PSO), and Firefly Algorithm (FA). The results established that the proposed method is supreme than the compared methods and can be applied in real time in near future.

由于移动电话的普及,全球互联网使用量急剧增加。这种极高的互联网使用率产生了海量数据,换句话说就是大数据。安全和隐私是大数据管理中需要考虑的主要问题。因此,本文开发了基于属性的自适应同态加密(AAHE)来增强大数据的安全性。在所提出的方法中,引入了基于对立函数的黑寡妇优化(OBWO),以按照 AAHE 方法选择最佳密钥参数。通过考虑对立函数,加强了黑寡妇优化(BWO)的收敛性分析。所提出的方法有不同的流程,即流程设置、加密和解密流程。研究人员用非阿贝尔环和密码文本格式中的同构过程对所提出的方法进行了评估。此外,该方法还用于提高与共轭检验问题相关的单向安全性。之后,开发了同态加密技术来保护大数据的安全。研究考虑了两种类型的大数据,如成人数据集和匿名微软网络数据集,以验证所提出的方法。在加密时间、解密时间、密钥大小、处理时间、下载和上传时间等性能指标的帮助下,对所提出的方法进行了评估,并与 Rivest-Shamir-Adleman (RSA)和椭圆曲线加密法(ECC)等传统加密技术进行了比较。此外,还将密钥生成过程与 BWO、粒子群优化(PSO)和萤火虫算法(FA)等传统方法进行了比较。结果表明,所提出的方法比其他方法更优越,可在不久的将来实时应用。
{"title":"Attribute-Based Adaptive Homomorphic Encryption for Big Data Security.","authors":"R Thenmozhi, S Shridevi, Sachi Nandan Mohanty, Vicente García-Díaz, Deepak Gupta, Prayag Tiwari, Mohammad Shorfuzzaman","doi":"10.1089/big.2021.0176","DOIUrl":"10.1089/big.2021.0176","url":null,"abstract":"<p><p>There is a drastic increase in Internet usage across the globe, thanks to mobile phone penetration. This extreme Internet usage generates huge volumes of data, in other terms, big data. Security and privacy are the main issues to be considered in big data management. Hence, in this article, Attribute-based Adaptive Homomorphic Encryption (AAHE) is developed to enhance the security of big data. In the proposed methodology, Oppositional Based Black Widow Optimization (OBWO) is introduced to select the optimal key parameters by following the AAHE method. By considering oppositional function, Black Widow Optimization (BWO) convergence analysis was enhanced. The proposed methodology has different processes, namely, process setup, encryption, and decryption processes. The researcher evaluated the proposed methodology with non-abelian rings and the homomorphism process in ciphertext format. Further, it is also utilized in improving one-way security related to the conjugacy examination issue. Afterward, homomorphic encryption is developed to secure the big data. The study considered two types of big data such as adult datasets and anonymous Microsoft web datasets to validate the proposed methodology. With the help of performance metrics such as encryption time, decryption time, key size, processing time, downloading, and uploading time, the proposed method was evaluated and compared against conventional cryptography techniques such as Rivest-Shamir-Adleman (RSA) and Elliptic Curve Cryptography (ECC). Further, the key generation process was also compared against conventional methods such as BWO, Particle Swarm Optimization (PSO), and Firefly Algorithm (FA). The results established that the proposed method is supreme than the compared methods and can be applied in real time in near future.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"343-356"},"PeriodicalIF":2.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39718084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid Deep Learning Approach for Traffic Speed Prediction. 用于交通速度预测的混合深度学习方法。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-10-01 Epub Date: 2022-02-02 DOI: 10.1089/big.2021.0251
Fei Dai, Pengfei Cao, Penggui Huang, Qi Mo, Bi Huang

Traffic speed prediction plays a fundamental role in traffic management and driving route planning. However, timely accurate traffic speed prediction is challenging as it is affected by complex spatial and temporal correlations. Most existing works cannot simultaneously model spatial and temporal correlations in traffic data, resulting in unsatisfactory prediction performance. In this article, we propose a novel hybrid deep learning approach, named HDL4TSP, to predict traffic speed in each region of a city, which consists of an input layer, a spatial layer, a temporal layer, a fusion layer, and an output layer. Specifically, first, the spatial layer employs graph convolutional networks to capture spatial near dependencies and spatial distant dependencies in the spatial dimension. Second, the temporal layer employs convolutional long short-term memory (ConvLSTM) networks to model closeness, daily periodicity, and weekly periodicity in the temporal dimension. Third, the fusion layer designs a fusion component to merge the outputs of ConvLSTM networks. Finally, we conduct extensive experiments and experimental results to show that HDL4TSP outperforms four baselines on two real-world data sets.

交通速度预测在交通管理和行车路线规划中发挥着重要作用。然而,由于受到复杂的空间和时间相关性的影响,及时准确地预测交通速度具有挑战性。大多数现有研究都无法同时对交通数据中的空间和时间相关性进行建模,导致预测效果不尽如人意。在本文中,我们提出了一种新颖的混合深度学习方法,名为 HDL4TSP,用于预测城市各区域的交通速度,该方法由输入层、空间层、时间层、融合层和输出层组成。具体来说,首先,空间层采用图卷积网络来捕捉空间维度上的空间近依赖关系和空间远依赖关系。其次,时间层采用卷积长短期记忆(ConvLSTM)网络来模拟时间维度上的亲疏关系、日周期性和周周期性。第三,融合层设计了一个融合组件来合并 ConvLSTM 网络的输出。最后,我们进行了大量实验,实验结果表明 HDL4TSP 在两个真实世界数据集上的表现优于四种基线。
{"title":"Hybrid Deep Learning Approach for Traffic Speed Prediction.","authors":"Fei Dai, Pengfei Cao, Penggui Huang, Qi Mo, Bi Huang","doi":"10.1089/big.2021.0251","DOIUrl":"10.1089/big.2021.0251","url":null,"abstract":"<p><p>Traffic speed prediction plays a fundamental role in traffic management and driving route planning. However, timely accurate traffic speed prediction is challenging as it is affected by complex spatial and temporal correlations. Most existing works cannot simultaneously model spatial and temporal correlations in traffic data, resulting in unsatisfactory prediction performance. In this article, we propose a novel hybrid deep learning approach, named HDL4TSP, to predict traffic speed in each region of a city, which consists of an input layer, a spatial layer, a temporal layer, a fusion layer, and an output layer. Specifically, first, the spatial layer employs graph convolutional networks to capture spatial near dependencies and spatial distant dependencies in the spatial dimension. Second, the temporal layer employs convolutional long short-term memory (ConvLSTM) networks to model closeness, daily periodicity, and weekly periodicity in the temporal dimension. Third, the fusion layer designs a fusion component to merge the outputs of ConvLSTM networks. Finally, we conduct extensive experiments and experimental results to show that HDL4TSP outperforms four baselines on two real-world data sets.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"377-389"},"PeriodicalIF":2.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39880866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Weighted GraphSAGE-Based Context-Aware Approach for Big Data Access Control. 基于加权 GraphSAGE 的大数据访问控制情境感知方法。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-10-01 Epub Date: 2023-08-01 DOI: 10.1089/big.2021.0473
Dibin Shan, Xuehui Du, Wenjuan Wang, Aodi Liu, Na Wang

Context information is the key element to realizing dynamic access control of big data. However, existing context-aware access control (CAAC) methods do not support automatic context awareness and cannot automatically model and reason about context relationships. To solve these problems, this article proposes a weighted GraphSAGE-based context-aware approach for big data access control. First, graph modeling is performed on the access record data set and transforms the access control context-awareness problem into a graph neural network (GNN) node learning problem. Then, a GNN model WGraphSAGE is proposed to achieve automatic context awareness and automatic generation of CAAC rules. Finally, weighted neighbor sampling and weighted aggregation algorithms are designed for the model to realize automatic modeling and reasoning of node relationships and relationship strengths simultaneously in the graph node learning process. The experiment results show that the proposed method has obvious advantages in context awareness and context relationship reasoning compared with similar GNN models. Meanwhile, it obtains better results in dynamic access control decisions than the existing CAAC models.

上下文信息是实现大数据动态访问控制的关键要素。然而,现有的上下文感知访问控制(CAAC)方法不支持自动上下文感知,无法自动建模和推理上下文关系。为了解决这些问题,本文提出了一种基于加权 GraphSAGE 的大数据访问控制上下文感知方法。首先,对访问记录数据集进行图建模,将访问控制上下文感知问题转化为图神经网络(GNN)节点学习问题。然后,提出了一个 GNN 模型 WGraphSAGE,以实现自动上下文感知和 CAAC 规则的自动生成。最后,为该模型设计了加权邻居采样和加权聚合算法,以实现图节点学习过程中节点关系和关系强度的自动建模和同时推理。实验结果表明,与同类 GNN 模型相比,本文提出的方法在上下文感知和上下文关系推理方面具有明显优势。同时,与现有的 CAAC 模型相比,它在动态访问控制决策方面取得了更好的效果。
{"title":"A Weighted GraphSAGE-Based Context-Aware Approach for Big Data Access Control.","authors":"Dibin Shan, Xuehui Du, Wenjuan Wang, Aodi Liu, Na Wang","doi":"10.1089/big.2021.0473","DOIUrl":"10.1089/big.2021.0473","url":null,"abstract":"<p><p>Context information is the key element to realizing dynamic access control of big data. However, existing context-aware access control (CAAC) methods do not support automatic context awareness and cannot automatically model and reason about context relationships. To solve these problems, this article proposes a weighted GraphSAGE-based context-aware approach for big data access control. First, graph modeling is performed on the access record data set and transforms the access control context-awareness problem into a graph neural network (GNN) node learning problem. Then, a GNN model WGraphSAGE is proposed to achieve automatic context awareness and automatic generation of CAAC rules. Finally, weighted neighbor sampling and weighted aggregation algorithms are designed for the model to realize automatic modeling and reasoning of node relationships and relationship strengths simultaneously in the graph node learning process. The experiment results show that the proposed method has obvious advantages in context awareness and context relationship reasoning compared with similar GNN models. Meanwhile, it obtains better results in dynamic access control decisions than the existing CAAC models.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"390-411"},"PeriodicalIF":2.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9922924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Special Issue: Big Scientific Data and Machine Learning in Science and Engineering. 特刊:科学与工程中的大科学数据和机器学习。
IF 2.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-01 Epub Date: 2024-07-31 DOI: 10.1089/big.2024.59218.kpa
Farhad Pourkamali-Anaraki
{"title":"Special Issue: Big Scientific Data and Machine Learning in Science and Engineering.","authors":"Farhad Pourkamali-Anaraki","doi":"10.1089/big.2024.59218.kpa","DOIUrl":"10.1089/big.2024.59218.kpa","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"269"},"PeriodicalIF":2.6,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141857096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Big Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1