
Latest publications in PeerJ Computer Science

Schizophrenia diagnosis based on diverse epoch size resting-state EEG using machine learning
IF 3.8 | CAS Tier 4, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | Pub Date: 2024-08-20 | DOI: 10.7717/peerj-cs.2170
Athar Alazzawı, Saif Aljumaili, Adil Deniz Duru, Osman Nuri Uçan, Oğuz Bayat, Paulo Jorge Coelho, Ivan Miguel Pires
Schizophrenia is a severe mental disorder that gradually impairs a person’s mental, social, and emotional faculties. Early detection with an accurate diagnosis is crucial to treating patients effectively. This study proposed a new method to classify schizophrenia in the resting state based on neural signals acquired from the brain by electroencephalography (EEG). The dataset consisted of 28 subjects, 14 in each group (schizophrenia and healthy controls). The data were collected from the scalp with 19 EEG channels at a 250 Hz sampling frequency. Because of brain-signal variability, we decomposed the EEG signals into five sub-bands using a band-pass filter, ensuring the best signal clarity and eliminating artifacts. The work was performed under several scenarios: first, traditional techniques were applied; second, augmented data (additive white Gaussian noise and stretched signals) were used. Additionally, we assessed Minimum Redundancy Maximum Relevance (MRMR) as a feature-reduction method. All these data scenarios were applied with three different window sizes (epochs) of 1, 2, and 5 s, using feature-extraction algorithms including the Fast Fourier Transform (FFT), Approximate Entropy (ApEn), Log Energy entropy (LogEn), Shannon Entropy (ShnEn), and kurtosis. L2 normalization was applied to the derived features, which positively affected the results. For classification, we applied four algorithms: K-nearest neighbor (KNN), support vector machine (SVM), quadratic discriminant analysis (QDA), and an ensemble classifier (EC). Across all scenarios, our evaluation showed that SVM with LogEn features and a 1-s window size achieved remarkable results on all evaluation metrics for schizophrenia diagnosis. This indicates that an accurate diagnosis of schizophrenia can be achieved through the right choice of features and classification model. Finally, we contrasted our results with recently published works using the same and a different dataset, where our method showed a notable improvement.
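To make the pipeline above concrete, the following is a minimal sketch (not the authors' code) of sub-band decomposition plus entropy features feeding an SVM. The band edges, synthetic signals, and helper names such as features_per_epoch are illustrative assumptions; only the 250 Hz sampling rate, five band-pass sub-bands, 1-s epochs, LogEn/ShnEn features, L2 normalization, and the SVM classifier come from the abstract.

```python
import numpy as np
from scipy.signal import butter, filtfilt
from sklearn.preprocessing import normalize
from sklearn.svm import SVC

FS = 250                                        # sampling rate reported in the abstract (Hz)
BANDS = {"delta": (1, 4), "theta": (4, 8), "alpha": (8, 13),
         "beta": (13, 30), "gamma": (30, 45)}   # assumed sub-band edges

def bandpass(x, lo, hi, fs=FS, order=4):
    b, a = butter(order, [lo / (fs / 2), hi / (fs / 2)], btype="band")
    return filtfilt(b, a, x)

def features_per_epoch(x, epoch_s=1):
    """Decompose one channel into sub-bands, then compute LogEn and ShnEn per epoch."""
    bands = [bandpass(x, lo, hi) for lo, hi in BANDS.values()]
    n = FS * epoch_s
    rows = []
    for start in range(0, len(x) - n + 1, n):
        row = []
        for band in bands:
            e = band[start:start + n] ** 2 + 1e-12
            p = e / e.sum()
            row += [np.sum(np.log(e)), -np.sum(p * np.log(p))]   # log-energy, Shannon entropy
        rows.append(row)
    return np.array(rows)

# Toy two-class demonstration on synthetic 60-s signals (stand-ins for real EEG).
rng = np.random.default_rng(0)
X = np.vstack([features_per_epoch(s * rng.standard_normal(FS * 60)) for s in (1.0, 2.0)])
y = np.r_[np.zeros(60), np.ones(60)]
clf = SVC(kernel="rbf").fit(normalize(X), y)    # L2-normalized features, SVM classifier
print("training accuracy:", clf.score(normalize(X), y))
```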
Citations: 0
Multi-modal deep learning framework for damage detection in social media posts
IF 3.8 | CAS Tier 4, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | Pub Date: 2024-08-20 | DOI: 10.7717/peerj-cs.2262
Jiale Zhang, Manyu Liao, Yanping Wang, Yifan Huang, Fuyu Chen, Chiba Makiko
In crisis management, quickly identifying and helping affected individuals is key, especially when there is limited information about the survivors’ conditions. Traditional emergency systems often face issues with reachability and handling large volumes of requests. Social media has become crucial in disaster response, providing important information and aiding in rescues when standard communication systems fail. Due to the large amount of data generated on social media during emergencies, there is a need for automated systems to process this information effectively and help improve emergency responses, potentially saving lives. Therefore, accurately understanding visual scenes and their meanings is important for identifying damage and obtaining useful information. Our research introduces a framework for detecting damage in social media posts, combining the Bidirectional Encoder Representations from Transformers (BERT) architecture with advanced convolutional processing. This framework includes a BERT-based network for analyzing text and multiple convolutional neural network blocks for processing images. The results show that this combination is very effective, outperforming existing methods in accuracy, recall, and F1 score. In the future, this method could be enhanced by including more types of information, such as human voices or background sounds, to improve its prediction efficiency.
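As a rough illustration of the fusion idea described above, here is a hedged PyTorch sketch: a text embedding (standing in for the pooled output of the BERT-based branch) is concatenated with features from a small convolutional image branch and classified. All layer sizes and the two-class head are assumptions, not the paper's architecture.

```python
import torch
import torch.nn as nn

class DamageClassifier(nn.Module):
    def __init__(self, text_dim=768, n_classes=2):
        super().__init__()
        self.image_branch = nn.Sequential(             # stand-in for the CNN blocks
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.fusion = nn.Sequential(                   # joint head over text + image features
            nn.Linear(text_dim + 32, 128), nn.ReLU(), nn.Linear(128, n_classes))

    def forward(self, text_emb, image):
        return self.fusion(torch.cat([text_emb, self.image_branch(image)], dim=1))

model = DamageClassifier()
text_emb = torch.randn(4, 768)           # e.g. BERT [CLS] embeddings for four posts
images = torch.randn(4, 3, 64, 64)       # the attached images
print(model(text_emb, images).shape)     # -> torch.Size([4, 2])
```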
Citations: 0
A novel adaptive weight bi-directional long short-term memory (AWBi-LSTM) classifier model for heart stroke risk level prediction in IoT
IF 3.8 | CAS Tier 4, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | Pub Date: 2024-08-20 | DOI: 10.7717/peerj-cs.2196
S Thumilvannan, R Balamanigandan
Stroke prediction has become a significant research area due to the increasing fatality rate. Hence, this article proposes a novel Adaptive Weight Bi-Directional Long Short-Term Memory (AWBi-LSTM) classifier model for stroke risk level prediction on IoT data. To train the classifier efficiently, missing data are removed with the hybrid genetic K-means algorithm (HKGA) and the data are aggregated. The features are then reduced with independent component analysis (ICA) to shrink the dataset, after which the correlated features are identified using T-test-based uniform distribution-gradient search rule-based elephant herding optimization for cluster analysis (T-test-UD-GSRBEHO). Next, fuzzy rule-based decisions are created from the T-test-UDEHOA correlated features to classify the risk levels accurately. The feature values obtained from the fuzzy logic are given to the AWBi-LSTM classifier, which predicts and classifies the risk level of heart disease and diabetes. After the risk level is predicted, the data are securely stored in the database using the MD5-Elliptic Curve Cryptography (MD5-ECC) technique. Testing the suggested risk prediction model on the stroke prediction dataset reveals its potential efficacy. With an accuracy of 99.6%, the results demonstrate that the proposed model outperforms existing techniques.
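The sketch below illustrates only the classifier component named in the abstract: a bidirectional LSTM whose forward and backward summaries are mixed by a learnable "adaptive" weight before classification. The weighting scheme, layer sizes, and three risk levels are my assumptions; the HKGA imputation, ICA, fuzzy-rule, and MD5-ECC storage steps are omitted.

```python
import torch
import torch.nn as nn

class AWBiLSTM(nn.Module):
    """Bi-LSTM whose two directions are mixed by a learnable weight before classification."""
    def __init__(self, n_features, hidden=64, n_classes=3):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True, bidirectional=True)
        self.alpha = nn.Parameter(torch.tensor(0.5))     # adaptive direction weight
        self.head = nn.Linear(hidden, n_classes)

    def forward(self, x):                                # x: (batch, time, n_features)
        out, _ = self.lstm(x)
        h = out.size(-1) // 2
        fwd, bwd = out[:, -1, :h], out[:, 0, h:]         # summary of each direction
        return self.head(self.alpha * fwd + (1 - self.alpha) * bwd)

model = AWBiLSTM(n_features=10)
print(model(torch.randn(8, 20, 10)).shape)               # -> torch.Size([8, 3])
```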
Citations: 0
Joint coordinate attention mechanism and instance normalization for COVID online comments text classification
IF 3.8 | CAS Tier 4, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | Pub Date: 2024-08-19 | DOI: 10.7717/peerj-cs.2240
Rong Zhu, Hua-Hui Gao, Yong Wang
Background: The majority of extant methodologies for text classification prioritize the extraction of feature representations from texts with high degrees of distinction, a process that may result in computational inefficiencies. To address this limitation, the current study proposes a novel approach that directly leverages label information to construct text representations. This integration aims to optimize the use of label data alongside textual content. Methods: The methodology began with separate pre-processing of texts and labels, followed by encoding through a projection layer. The research then utilized a conventional self-attention model enhanced by instance normalization (IN) and Gaussian Error Linear Unit (GELU) functions to assess emotional valences in review texts. An advanced self-attention mechanism was further developed to enable the efficient integration of text and label information. In the final stage, an adaptive label encoder was employed to extract relevant label information from the combined text-label data efficiently. Results: Empirical evaluations demonstrate that the proposed model achieves a significant improvement in classification performance, outperforming existing methodologies. This enhancement is quantitatively evidenced by its superior micro-F1 score, indicating the efficacy of integrating label information into text classification processes. This suggests that the model not only addresses computational inefficiencies but also enhances the accuracy of text classification.
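A rough PyTorch sketch of the kind of block the Methods describe is shown below: a self-attention layer whose residual connections are normalized with instance normalization (IN) instead of the usual layer normalization, with GELU in the feed-forward part. The dimensions and the exact placement of IN are assumptions for illustration.

```python
import torch
import torch.nn as nn

class INAttentionBlock(nn.Module):
    def __init__(self, dim=128, heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm = nn.InstanceNorm1d(dim)
        self.ff = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))

    def _in(self, x):                        # InstanceNorm1d expects (batch, channels, length)
        return self.norm(x.transpose(1, 2)).transpose(1, 2)

    def forward(self, x):                    # x: (batch, seq_len, dim)
        a, _ = self.attn(x, x, x)            # self-attention over the token sequence
        x = self._in(x + a)                  # residual + instance normalization
        return self._in(x + self.ff(x))      # GELU feed-forward, normalized again

block = INAttentionBlock()
tokens = torch.randn(2, 16, 128)             # e.g. projected text-plus-label embeddings
print(block(tokens).shape)                   # -> torch.Size([2, 16, 128])
```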
Citations: 0
Live software documentation of design pattern instances
IF 3.8 | CAS Tier 4, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | Pub Date: 2024-08-16 | DOI: 10.7717/peerj-cs.2090
Filipe Lemos, Filipe F. Correia, Ademar Aguiar, Paulo G. G. Queiroz
Background: Approaches to documenting the software patterns of a system can support intentionally and manually documenting them or automatically extracting them from the source code. Some of the approaches that we review do not maintain proximity between code and documentation. Others do not update the documentation after the code is changed. All of them present a low level of liveness. Approach: This work proposes an approach to improve the understandability of a software system by documenting the design patterns it uses. We regard the creation and the documentation of software as part of the same process and attempt to streamline the two activities. We achieve this by increasing the feedback about the pattern instances present in the code during development—i.e., by increasing liveness. Moreover, our approach maintains proximity between code and documentation and allows us to visualize the pattern instances under the same environment. We developed a prototype—DesignPatternDoc—for IntelliJ IDEA that continuously identifies pattern instances in the code, suggests them to the developer, generates the respective pattern-instance documentation, and enables live editing and visualization of that documentation. Results: To evaluate this approach, we conducted a controlled experiment with 21 novice developers. We asked participants to complete three tasks that involved understanding and evolving small software systems—up to six classes and 100 lines of code—and recorded the duration and the number of context switches. The results show that our approach helps developers spend less time understanding and documenting a software system when compared to using tools with a lower degree of liveness. Additionally, embedding documentation in the IDE and maintaining it close to the source code reduces context switching significantly.
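As a toy illustration of the detect-and-document idea (unrelated to the DesignPatternDoc prototype itself), the sketch below scans Java files with a crude Singleton heuristic and writes a documentation stub next to the source file, so the docs stay close to the code. The regex, file layout, and output format are assumptions.

```python
import pathlib
import re

SINGLETON_HINT = re.compile(r"private\s+static\s+(?:final\s+)?(\w+)\s+instance\b")

def document_patterns(root="src"):
    """Write a <file>.pattern.md stub next to every Java file that looks like a Singleton."""
    for path in pathlib.Path(root).rglob("*.java"):
        match = SINGLETON_HINT.search(path.read_text(errors="ignore"))
        if match:
            doc = path.with_suffix(".pattern.md")        # documentation stays beside the code
            doc.write_text(f"# Singleton instance\n\nClass `{match.group(1)}` in "
                           f"`{path.name}` appears to hold a static singleton field.\n")
            print("documented", path)

document_patterns()      # does nothing if ./src does not exist
```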
Citations: 0
Distilroberta2gnn: a new hybrid deep learning approach for aspect-based sentiment analysis
IF 3.8 | CAS Tier 4, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | Pub Date: 2024-08-16 | DOI: 10.7717/peerj-cs.2267
Aseel Alhadlaq, Alaa Altheneyan
In the field of natural language processing (NLP), aspect-based sentiment analysis (ABSA) is crucial for extracting insights from complex human sentiments towards specific text aspects. Despite significant progress, the field still faces challenges such as accurately interpreting subtle language nuances and the scarcity of high-quality, domain-specific annotated datasets. This study introduces the DistilRoBERTa2GNN model, an innovative hybrid approach that combines the DistilRoBERTa pre-trained model’s feature extraction capabilities with the dynamic sentiment classification abilities of graph neural networks (GNN). Our comprehensive, four-phase data preprocessing strategy is designed to enrich model training with domain-specific, high-quality data. In this study, we analyze four publicly available benchmark datasets: Rest14, Rest15, Rest16-EN, and Rest16-ESP, to rigorously evaluate the effectiveness of our novel DistilRoBERTa2GNN model in ABSA. For the Rest14 dataset, our model achieved an F1 score of 77.98%, precision of 78.12%, and recall of 79.41%. The Rest15 dataset shows that our model achieves an F1 score of 76.86%, precision of 80.70%, and recall of 79.37%. For the Rest16-EN dataset, our model reached an F1 score of 84.96%, precision of 82.77%, and recall of 87.28%. For Rest16-ESP (Spanish dataset), our model achieved an F1 score of 74.87%, with a precision of 73.11% and a recall of 76.80%. These metrics highlight our model’s competitive edge over different baseline models used in ABSA studies. This study addresses critical ABSA challenges and sets a new benchmark for sentiment analysis research, guiding future efforts toward enhancing model adaptability and performance across diverse datasets.
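A simplified sketch of the hybrid idea follows: contextual token embeddings (which the paper takes from DistilRoBERTa) are refined by a hand-rolled graph convolution over a token graph and pooled for three-way sentiment. The adjacency matrix, sizes, and pooling here are illustrative assumptions, and no graph library is required.

```python
import torch
import torch.nn as nn

class GCNLayer(nn.Module):
    def __init__(self, dim_in, dim_out):
        super().__init__()
        self.lin = nn.Linear(dim_in, dim_out)

    def forward(self, x, adj):                     # x: (n_tokens, dim), adj: (n_tokens, n_tokens)
        a_hat = adj + torch.eye(adj.size(0))       # add self-loops
        deg = a_hat.sum(dim=1, keepdim=True)
        return torch.relu(self.lin(a_hat @ x / deg))   # mean-aggregate neighbouring tokens

n_tokens, dim = 12, 768
x = torch.randn(n_tokens, dim)                          # e.g. DistilRoBERTa last-hidden states
adj = (torch.rand(n_tokens, n_tokens) > 0.7).float()    # stand-in token/dependency graph
head = nn.Linear(128, 3)                                # negative / neutral / positive
logits = head(GCNLayer(dim, 128)(x, adj).mean(dim=0))   # pool the refined token features
print(logits.shape)                                     # -> torch.Size([3])
```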
Citations: 0
Pairing algorithm for varying data in cluster based heterogeneous wireless sensor networks
IF 3.8 | CAS Tier 4, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | Pub Date: 2024-08-16 | DOI: 10.7717/peerj-cs.2243
Zahida Shaheen, Kashif Sattar, Mukhtar Ahmed
In wireless sensor networks (WSNs), clustering is employed to extend the network’s lifespan. Each cluster has a designated cluster head. Pairing is another technique used within clustering to enhance network longevity. In this technique, nodes are grouped into pairs, with one node in an active state and the other in a sleep state to conserve energy. However, this pairing can lead to communication issues with the cluster head, as nodes in sleep mode cannot transmit data, potentially causing data loss. To address this issue, this study introduces an innovative approach called the “Awake Sleep Heterogeneous Nodes’ Pairing” (ASHNP) algorithm, which aims to improve transmission efficiency in WSNs operating in heterogeneous environments. In contrast, the Energy Efficient Sleep Awake Aware (EESAA) algorithm is customized for homogeneous environments; while suitable for such settings, it encounters challenges in handling data loss from sleep nodes. On the other hand, Energy and Traffic Aware Sleep Awake (ETASA) struggles with listening problems, limiting its efficiency in diverse environments. Through comprehensive comparative analysis, ASHNP demonstrates higher performance in data transmission efficiency, overcoming the shortcomings of EESAA and ETASA. Additionally, comparisons across various parameters, including energy consumption and the number of dead nodes, highlight ASHNP’s effectiveness in enhancing network reliability and resource utilization. These findings underscore the significance of ASHNP as a promising solution for optimizing data transmission in WSNs, particularly in heterogeneous environments. The analysis discloses that ASHNP reliably outperforms EESAA in maintaining node energy, with differences ranging from 1.5% to 10% across various rounds. Specifically, ASHNP achieves a data transmission rate 5.23% higher than EESAA and 21.73% higher than ETASA. These findings underscore the strength of ASHNP in sustaining node activity levels, showcasing its superiority in preserving network integrity and ensuring efficient data transmission across multiple rounds.
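The toy simulation below captures only the pairing intuition: one node of each pair sleeps while its partner senses and transmits, and the roles swap so the pair's energy drains evenly instead of one node dying early. It is a simplification of mine, not the ASHNP algorithm, and the energy costs are arbitrary.

```python
import random

class Node:
    def __init__(self, nid, energy):
        self.nid, self.energy, self.awake = nid, energy, True

def run_round(pairs, tx_cost=1.0, idle_cost=0.1):
    for a, b in pairs:
        active, sleeping = (a, b) if a.awake else (b, a)
        active.energy -= tx_cost                 # awake node senses and transmits
        sleeping.energy -= idle_cost             # sleeping partner only keeps a timer
        if active.energy < sleeping.energy:      # swap roles to balance the drain
            active.awake, sleeping.awake = False, True

random.seed(0)
nodes = [Node(i, energy=random.uniform(80, 120)) for i in range(10)]
pairs = [(nodes[i], nodes[i + 1]) for i in range(0, 10, 2)]
for _, partner in pairs:
    partner.awake = False                        # exactly one node per pair starts awake
for _ in range(50):
    run_round(pairs)
print([round(n.energy, 1) for n in nodes])       # energies stay roughly balanced per pair
```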
Citations: 0
Dynamic stacking ensemble for cross-language code smell detection
IF 3.8 | CAS Tier 4, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | Pub Date: 2024-08-15 | DOI: 10.7717/peerj-cs.2254
Hamoud Aljamaan
Code smells refer to poor design and implementation choices by software engineers that might affect the overall software quality. Code smells detection using machine learning models has become a popular area for building effective models that are capable of detecting different code smells in multiple programming languages. However, the process of building such effective models has not reached a state of stability, and most of the existing research focuses on Java code smells detection. The main objective of this article is to propose dynamic ensembles using two strategies, namely greedy search and backward elimination, which are capable of accurately detecting code smells in two programming languages (i.e., Java and Python), and which are less complex than full stacking ensembles. The detection performance of dynamic ensembles was investigated within the context of four Java and two Python code smells. The greedy search and backward elimination strategies yielded different base model lists for building dynamic ensembles. In comparison to full stacking ensembles, dynamic ensembles yielded less complex models when they were used to detect most of the investigated Java and Python code smells, with the backward elimination strategy resulting in less complex models. Dynamic ensembles were able to perform comparably against full stacking ensembles with no significant detection loss. This article concludes that dynamic stacking ensembles were able to facilitate effective and stable detection performance of Java and Python code smells over all base models and with less complexity than full stacking ensembles.
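A hedged scikit-learn sketch of the backward-elimination strategy follows: starting from all base models, a base model is dropped whenever its removal does not reduce cross-validated F1, yielding a smaller stacking ensemble. The dataset, base model list, and stopping rule are toy assumptions rather than the article's experimental setup.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=400, n_features=20, random_state=0)   # toy "smell" data

def stack_score(estimators):
    stack = StackingClassifier(estimators=estimators, final_estimator=LogisticRegression())
    return cross_val_score(stack, X, y, cv=5, scoring="f1").mean()

models = [("rf", RandomForestClassifier(n_estimators=50, random_state=0)),
          ("dt", DecisionTreeClassifier(random_state=0)),
          ("knn", KNeighborsClassifier())]
best = stack_score(models)
improved = True
while improved and len(models) > 2:
    improved = False
    for i in range(len(models)):
        candidate = models[:i] + models[i + 1:]        # try removing one base model
        score = stack_score(candidate)
        if score >= best:                              # removal does not hurt: keep it out
            models, best, improved = candidate, score, True
            break
print([name for name, _ in models], round(best, 3))
```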
Citations: 0
A novel approach to secure communication in mega events through Arabic text steganography utilizing invisible Unicode characters
IF 3.8 | CAS Tier 4, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | Pub Date: 2024-08-15 | DOI: 10.7717/peerj-cs.2236
Esam Ali Khan
Mega events attract mega crowds, and many data exchange transactions take place among organizers, stakeholders, and individuals, which increases the risk of covert eavesdropping. Data hiding is essential for safeguarding the security, confidentiality, and integrity of information during mega events. It plays a vital role in reducing cyber risks and ensuring the seamless execution of these extensive gatherings. In this paper, a steganographic approach suitable for mega events communication is proposed. The proposed method utilizes the characteristics of Arabic letters and invisible Unicode characters to hide secret data, where each Arabic letter can hide two secret bits. The secret messages hidden using the proposed technique can be exchanged via emails, text messages, and social media, as these are the main communication channels in mega events. The proposed technique demonstrated notable performance, with a high capacity ratio averaging 178% and a perfect imperceptibility ratio of 100%, outperforming most of the previous work. In addition, it provides security performance comparable to previous approaches, with an average ratio of 72%. Furthermore, it is more robust than all related work, resisting 70% of the possible attacks.
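A simplified sketch of the hiding idea follows: after each Arabic letter of a cover text, one of four invisible Unicode characters encodes two secret bits, so the stego text renders identically to the cover. The specific invisible characters and bit mapping chosen here are assumptions; the paper's exact scheme may differ.

```python
INVISIBLE = ["\u200b", "\u200c", "\u200d", "\u2060"]   # ZWSP, ZWNJ, ZWJ, word joiner (assumed)

def is_arabic(ch: str) -> bool:
    return "\u0600" <= ch <= "\u06ff"

def hide(cover: str, secret_bits: str) -> str:
    out, i = [], 0
    for ch in cover:
        out.append(ch)
        if is_arabic(ch) and i + 2 <= len(secret_bits):
            out.append(INVISIBLE[int(secret_bits[i:i + 2], 2)])   # two bits per Arabic letter
            i += 2
    if i < len(secret_bits):
        raise ValueError("cover text too short for the secret message")
    return "".join(out)

def reveal(stego: str) -> str:
    return "".join(f"{INVISIBLE.index(ch):02b}" for ch in stego if ch in INVISIBLE)

stego = hide("السلام عليكم ورحمة الله", "1011001110")
print(stego == "السلام عليكم ورحمة الله")   # False: the stego text carries hidden characters
print(reveal(stego))                        # -> 1011001110
```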
Citations: 0
A pre-averaged pseudo nearest neighbor classifier
IF 3.8 | CAS Tier 4, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | Pub Date: 2024-08-13 | DOI: 10.7717/peerj-cs.2247
Dapeng Li
The k-nearest neighbor algorithm is a powerful classification method. However, its classification performance degrades on small samples that contain outliers. To address this issue, a pre-averaged pseudo nearest neighbor classifier (PAPNN) is proposed to improve classification performance. In the PAPNN rule, the pre-averaged categorical vectors are calculated by taking the average of any two points of the training sets in each class. Then, k pseudo nearest neighbors are chosen from the preprocessed vectors of every class to determine the category of a query point. The pre-averaged vectors can reduce the negative impact of outliers to some degree. Extensive experiments are conducted on nineteen numerical real data sets and three high-dimensional real data sets, comparing PAPNN to twelve other classification methods. The experimental results demonstrate that the proposed PAPNN rule is effective for classification tasks in the case of small samples with outliers.
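A small NumPy sketch of the PAPNN rule as summarized above: average every pair of training points within a class, then assign the query to the class with the smallest rank-weighted sum of distances to its k nearest pre-averaged vectors. The 1/rank weighting follows the usual pseudo nearest neighbor rule and is an assumption here.

```python
from itertools import combinations

import numpy as np

def pre_average(X):
    """Midpoints of every pair of training points in one class."""
    return np.array([(X[i] + X[j]) / 2 for i, j in combinations(range(len(X)), 2)])

def papnn_predict(query, class_data, k=3):
    best_label, best_dist = None, np.inf
    for label, X in class_data.items():
        P = pre_average(X)
        d = np.sort(np.linalg.norm(P - query, axis=1))[:k]    # k nearest pre-averaged vectors
        pseudo = np.sum(d / np.arange(1, len(d) + 1))         # rank-weighted pseudo distance
        if pseudo < best_dist:
            best_label, best_dist = label, pseudo
    return best_label

rng = np.random.default_rng(1)
class_data = {0: rng.normal(0, 1, size=(20, 2)), 1: rng.normal(3, 1, size=(20, 2))}
print(papnn_predict(np.array([2.8, 3.1]), class_data))        # -> 1
```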
Citations: 0