PeerJ Computer Science最新文献

英文中文

Leveraging sentiment analysis of food delivery services reviews using deep learning and word embedding. 利用深度学习和词嵌入对送餐服务评论进行情感分析。

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science

Pub Date : 2025-02-19 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2669

Dheya Mustafa, Safaa M Khabour, Mousa Al-Kfairy, Ahmed Shatnawi

Companies that deliver food (food delivery services, or FDS) try to use customer feedback to identify aspects where the customer experience could be improved. Consumer feedback on purchasing and receiving goods via online platforms is a crucial tool for learning about a company's performance. Many English-language studies have been conducted on sentiment analysis (SA). Arabic is becoming one of the most extensively written languages on the World Wide Web, but because of its morphological and grammatical difficulty as well as the lack of openly accessible resources for Arabic SA, like as dictionaries and datasets, there has not been much research done on the language. Using a manually annotated FDS dataset, the current study conducts extensive sentiment analysis using reviews related to FDS that include Modern Standard Arabic and dialectal Arabic. It does this by utilizing word embedding models, deep learning techniques, and natural language processing to extract subjective opinions, determine polarity, and recognize customers' feelings in the FDS domain. Convolutional neural network (CNN), bidirectional long short-term memory recurrent neural network (BiLSTM), and an LSTM-CNN hybrid model were among the deep learning approaches to classification that we evaluated. In addition, the article investigated different effective approaches for word embedding and stemming techniques. Using a dataset of Modern Standard Arabic and dialectal Arabic corpus gathered from Talabat.com, we trained and evaluated our suggested models. Our best accuracy was approximately 84% for multiclass classification and 92.5% for binary classification on the FDS. To verify that the proposed approach is suitable for analyzing human perceptions in diversified domains, we designed and carried out excessive experiments on other existing Arabic datasets. The highest obtained multi-classification accuracy is 88.9% on the Hotels Arabic-Reviews Dataset (HARD) dataset, and the highest obtained binary classification accuracy is 97.2% on the same dataset.

{"title":"Leveraging sentiment analysis of food delivery services reviews using deep learning and word embedding.","authors":"Dheya Mustafa, Safaa M Khabour, Mousa Al-Kfairy, Ahmed Shatnawi","doi":"10.7717/peerj-cs.2669","DOIUrl":"10.7717/peerj-cs.2669","url":null,"abstract":"Companies that deliver food (food delivery services, or FDS) try to use customer feedback to identify aspects where the customer experience could be improved. Consumer feedback on purchasing and receiving goods via online platforms is a crucial tool for learning about a company's performance. Many English-language studies have been conducted on sentiment analysis (SA). Arabic is becoming one of the most extensively written languages on the World Wide Web, but because of its morphological and grammatical difficulty as well as the lack of openly accessible resources for Arabic SA, like as dictionaries and datasets, there has not been much research done on the language. Using a manually annotated FDS dataset, the current study conducts extensive sentiment analysis using reviews related to FDS that include Modern Standard Arabic and dialectal Arabic. It does this by utilizing word embedding models, deep learning techniques, and natural language processing to extract subjective opinions, determine polarity, and recognize customers' feelings in the FDS domain. Convolutional neural network (CNN), bidirectional long short-term memory recurrent neural network (BiLSTM), and an LSTM-CNN hybrid model were among the deep learning approaches to classification that we evaluated. In addition, the article investigated different effective approaches for word embedding and stemming techniques. Using a dataset of Modern Standard Arabic and dialectal Arabic corpus gathered from Talabat.com, we trained and evaluated our suggested models. Our best accuracy was approximately 84% for multiclass classification and 92.5% for binary classification on the FDS. To verify that the proposed approach is suitable for analyzing human perceptions in diversified domains, we designed and carried out excessive experiments on other existing Arabic datasets. The highest obtained multi-classification accuracy is 88.9% on the Hotels Arabic-Reviews Dataset (HARD) dataset, and the highest obtained binary classification accuracy is 97.2% on the same dataset.","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2669"},"PeriodicalIF":3.5,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888876/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143588210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Diagnostic test accuracy of AI-assisted mammography for breast imaging: a narrative review.

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science

Pub Date : 2025-02-19 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2476

Daksh Dave, Adnan Akhunzada, Nikola Ivković, Sujan Gyawali, Korhan Cengiz, Adeel Ahmed, Ahmad Sami Al-Shamayleh

The integration of artificial intelligence into healthcare, particularly in mammography, holds immense potential for improving breast cancer diagnosis. Artificial intelligence (AI), with its ability to process vast amounts of data and detect intricate patterns, offers a solution to the limitations of traditional mammography, including missed diagnoses and false positives. This review focuses on the diagnostic accuracy of AI-assisted mammography, synthesizing findings from studies across different clinical settings and algorithms. The motivation for this research lies in addressing the need for enhanced diagnostic tools in breast cancer screening, where early detection can significantly impact patient outcomes. Although AI models have shown promising improvements in sensitivity and specificity, challenges such as algorithmic bias, interpretability, and the generalizability of models across diverse populations remain. The review concludes that while AI holds transformative potential in breast cancer screening, collaborative efforts between radiologists, AI developers, and policymakers are crucial for ensuring ethical, reliable, and inclusive integration into clinical practice.

{"title":"Diagnostic test accuracy of AI-assisted mammography for breast imaging: a narrative review.","authors":"Daksh Dave, Adnan Akhunzada, Nikola Ivković, Sujan Gyawali, Korhan Cengiz, Adeel Ahmed, Ahmad Sami Al-Shamayleh","doi":"10.7717/peerj-cs.2476","DOIUrl":"10.7717/peerj-cs.2476","url":null,"abstract":"The integration of artificial intelligence into healthcare, particularly in mammography, holds immense potential for improving breast cancer diagnosis. Artificial intelligence (AI), with its ability to process vast amounts of data and detect intricate patterns, offers a solution to the limitations of traditional mammography, including missed diagnoses and false positives. This review focuses on the diagnostic accuracy of AI-assisted mammography, synthesizing findings from studies across different clinical settings and algorithms. The motivation for this research lies in addressing the need for enhanced diagnostic tools in breast cancer screening, where early detection can significantly impact patient outcomes. Although AI models have shown promising improvements in sensitivity and specificity, challenges such as algorithmic bias, interpretability, and the generalizability of models across diverse populations remain. The review concludes that while AI holds transformative potential in breast cancer screening, collaborative efforts between radiologists, AI developers, and policymakers are crucial for ensuring ethical, reliable, and inclusive integration into clinical practice.","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2476"},"PeriodicalIF":3.5,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888881/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143588003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The technique of fuzzy analytic hierarchy process (FAHP) based on the triangular q-rung fuzzy numbers (TR-q-ROFNS) with applications in best African coffee brand selection.

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science

Pub Date : 2025-02-19 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2555

Yupei Huang, Muhammad Gulistan, Amir Rafique, Wathek Chammam, Khursheed Aurangzeb, Ateeq Ur Rehman

The African coffee market offers a rich and diverse range of coffee profiles. The coffee producers of Africa face numerous challenges like climate change, market fluctuations, diseases, soil degradation and limited access to finance. These challenges badly affect their productivity, quality and livelihood. There are different factors like social and cultural, which can affect the coffee production. This study aims to develop multi criteria decision making (MCDM) methods and their applications in coffee market specifically in identifying factors influencing consumers' coffee brand preferences in South Africa, which is known for its vibrant coffee culture. For this purpose, first we developed the technique of analytic hierarchy process (AHP) in the environment of triangular q-rung orthopair fuzzy numbers. The triangular q-rung fuzzy numbers can effectively handle the uncertainity. The AHP technique has widely been used in decision making due to its flexibility in assigning weights and dealing with vagueness. The weights of critera plays a very important role in an MCDM problem. The development of AHP technique in triangular q-rung orthopair fuzzy environment can improve the decision making (DM) by handling vagueness in data and by using the most appropriate weights. Furthermore this new proposed method improves accuracy and minimize the information loss. The newly peoposed method is applied to different MCDM problems and comparative analysis is conducted to check the validity of results.

{"title":"The technique of fuzzy analytic hierarchy process (FAHP) based on the triangular q-rung fuzzy numbers (TR-q-ROFNS) with applications in best African coffee brand selection.","authors":"Yupei Huang, Muhammad Gulistan, Amir Rafique, Wathek Chammam, Khursheed Aurangzeb, Ateeq Ur Rehman","doi":"10.7717/peerj-cs.2555","DOIUrl":"10.7717/peerj-cs.2555","url":null,"abstract":"The African coffee market offers a rich and diverse range of coffee profiles. The coffee producers of Africa face numerous challenges like climate change, market fluctuations, diseases, soil degradation and limited access to finance. These challenges badly affect their productivity, quality and livelihood. There are different factors like social and cultural, which can affect the coffee production. This study aims to develop multi criteria decision making (MCDM) methods and their applications in coffee market specifically in identifying factors influencing consumers' coffee brand preferences in South Africa, which is known for its vibrant coffee culture. For this purpose, first we developed the technique of analytic hierarchy process (AHP) in the environment of triangular q-rung orthopair fuzzy numbers. The triangular q-rung fuzzy numbers can effectively handle the uncertainity. The AHP technique has widely been used in decision making due to its flexibility in assigning weights and dealing with vagueness. The weights of critera plays a very important role in an MCDM problem. The development of AHP technique in triangular q-rung orthopair fuzzy environment can improve the decision making (DM) by handling vagueness in data and by using the most appropriate weights. Furthermore this new proposed method improves accuracy and minimize the information loss. The newly peoposed method is applied to different MCDM problems and comparative analysis is conducted to check the validity of results.","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2555"},"PeriodicalIF":3.5,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888883/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143588118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimizing power allocation for URLLC-D2D in 5G networks with Rician fading channel. 在具有里氏衰落信道的 5G 网络中优化 URLLC-D2D 的功率分配。

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science

Pub Date : 2025-02-18 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2712

Owais Muhammad, Hong Jiang, Muhammad Bilal, Mushtaq Muhammad Umer

The rapid evolution of wireless technologies within the 5G network brings significant challenges in managing the increased connectivity and traffic of mobile devices. This enhanced connectivity brings challenges for base stations, which must handle increased traffic and efficiently serve a growing number of mobile devices. One of the key solutions to address these challenges is integrating device-to-device (D2D) communication with ultra-reliable and low-latency communication (URLLC). This study examines the impact of the Rician fading channel on the performance of D2D communication under URLLC. It addresses the critical problem of optimizing power allocation to maximize the minimum data rate in D2D communication. A significant challenge arises due to interference issues, as the problem of maximizing the minimum data rate is non-convex, which leads to high computational complexity. This complexity makes it difficult to derive optimal solutions efficiently. To address this challenge, we introduce an algorithm that is based on derivatives to find the optimal power allocation. Comparisons are made with the branch and bound (B&B) algorithm, heuristic algorithm, and particle swarm optimization (PSO) algorithm. Our proposed algorithm improves power allocation performance and also achieves faster execution with lower computational complexity compared to the B&B, PSO, and heuristic algorithms.

{"title":"Optimizing power allocation for URLLC-D2D in 5G networks with Rician fading channel.","authors":"Owais Muhammad, Hong Jiang, Muhammad Bilal, Mushtaq Muhammad Umer","doi":"10.7717/peerj-cs.2712","DOIUrl":"10.7717/peerj-cs.2712","url":null,"abstract":"The rapid evolution of wireless technologies within the 5G network brings significant challenges in managing the increased connectivity and traffic of mobile devices. This enhanced connectivity brings challenges for base stations, which must handle increased traffic and efficiently serve a growing number of mobile devices. One of the key solutions to address these challenges is integrating device-to-device (D2D) communication with ultra-reliable and low-latency communication (URLLC). This study examines the impact of the Rician fading channel on the performance of D2D communication under URLLC. It addresses the critical problem of optimizing power allocation to maximize the minimum data rate in D2D communication. A significant challenge arises due to interference issues, as the problem of maximizing the minimum data rate is non-convex, which leads to high computational complexity. This complexity makes it difficult to derive optimal solutions efficiently. To address this challenge, we introduce an algorithm that is based on derivatives to find the optimal power allocation. Comparisons are made with the branch and bound (B&B) algorithm, heuristic algorithm, and particle swarm optimization (PSO) algorithm. Our proposed algorithm improves power allocation performance and also achieves faster execution with lower computational complexity compared to the B&B, PSO, and heuristic algorithms.","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2712"},"PeriodicalIF":3.5,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888938/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143588198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing fraud detection in the Ethereum blockchain using ensemble learning.

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science

Pub Date : 2025-02-18 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2716

Zhexian Gu, Omar Dib

The Ethereum blockchain operates as a decentralized platform, utilizing blockchain technology to distribute smart contracts across a global network. It enables currency and digital value exchange without centralized control. However, the exponential growth of online commerce has created a fertile ground for a surge in fraudulent activities such as money laundering and phishing, thereby exacerbating significant security vulnerabilities. To combat this, our article introduces an ensemble learning approach to accurately detect fraudulent Ethereum blockchain transactions. Our goal is to integrate a decision-making tool into the decentralized validation process of Ethereum, allowing blockchain miners to identify and flag fraudulent transactions. Additionally, our system can assist governmental organizations in overseeing the blockchain network and identifying fraudulent activities. Our framework incorporates various data pre-processing techniques and evaluates multiple machine learning algorithms, including logistic regression, Isolation Forest, support vector machine, Random Forest, XGBoost, and recurrent neural network. These models are fine-tuned using grid search to enhance their performance. The proposed approach utilizes an ensemble of three distinct models (Random Forest, extreme gradient boosting (XGBoost), and support vector machine) to further improve classification performance. It achieves high scores of over 98% across key classification metrics like accuracy, precision, recall, and F1-score. Moreover, the approach is suitable for real-world usage, with an inference time of 0.13 s.

{"title":"Enhancing fraud detection in the Ethereum blockchain using ensemble learning.","authors":"Zhexian Gu, Omar Dib","doi":"10.7717/peerj-cs.2716","DOIUrl":"10.7717/peerj-cs.2716","url":null,"abstract":"The Ethereum blockchain operates as a decentralized platform, utilizing blockchain technology to distribute smart contracts across a global network. It enables currency and digital value exchange without centralized control. However, the exponential growth of online commerce has created a fertile ground for a surge in fraudulent activities such as money laundering and phishing, thereby exacerbating significant security vulnerabilities. To combat this, our article introduces an ensemble learning approach to accurately detect fraudulent Ethereum blockchain transactions. Our goal is to integrate a decision-making tool into the decentralized validation process of Ethereum, allowing blockchain miners to identify and flag fraudulent transactions. Additionally, our system can assist governmental organizations in overseeing the blockchain network and identifying fraudulent activities. Our framework incorporates various data pre-processing techniques and evaluates multiple machine learning algorithms, including logistic regression, Isolation Forest, support vector machine, Random Forest, XGBoost, and recurrent neural network. These models are fine-tuned using grid search to enhance their performance. The proposed approach utilizes an ensemble of three distinct models (Random Forest, extreme gradient boosting (XGBoost), and support vector machine) to further improve classification performance. It achieves high scores of over 98% across key classification metrics like accuracy, precision, recall, and F1-score. Moreover, the approach is suitable for real-world usage, with an inference time of 0.13 s.","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2716"},"PeriodicalIF":3.5,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888915/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143588088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Stock market trading via actor-critic reinforcement learning and adaptable data structure.

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science

Pub Date : 2025-02-18 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2690

Cesar Guevara

Currently, the stock market is attractive, and it is challenging to develop an efficient investment model with high accuracy due to changes in the values of the shares for political, economic, and social reasons. This article presents an innovative proposal for a short-term, automatic investment model to reduce capital loss during trading, applying a reinforcement learning (RL) model. On the other hand, we propose an adaptable data window structure to enhance the learning and accuracy of investment agents in three foreign exchange markets: crude oil, gold, and the Euro. In addition, the RL model employs an actor-critic neural network with rectified linear unit (ReLU) neurons to generate specialized investment agents, enabling more efficient trading, minimizing investment losses across different time periods, and reducing the model's learning time. The proposed RL model obtained a reduction average loss of 0.03% in Euro, 0.25% in gold, and 0.13% in crude oil in the test phase with varying initial conditions.

引用次数: 0

Optimized hybrid SVM-RF multi-biometric framework for enhanced authentication using fingerprint, iris, and face recognition.

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science

Pub Date : 2025-02-17 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2699

Sonal, Ajit Singh, Chander Kant

This article introduces a hybrid multi-biometric system incorporating fingerprint, face, and iris recognition to enhance individual authentication. The system addresses limitations of uni-modal approaches by combining multiple biometric modalities, exhibiting superior performance and heightened security in practical scenarios, making it more dependable and resilient for real-world applications. The integration of support vector machine (SVM) and random forest (RF) classifiers, along with optimization techniques like bacterial foraging optimization (BFO) and genetic algorithms (GA), improves efficiency and robustness. Additionally, integrating feature-level fusion and utilizing methods such as Gabor filters for feature extraction enhances overall performance of the model. The system demonstrates superior accuracy and reliability, making it suitable for real-world applications requiring secure and dependable identification solutions.

引用次数: 0

A lightweight coal-gangue detection model based on parallel deep residual networks. 基于并行深度残差网络的轻量级煤矸石检测模型。

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science

Pub Date : 2025-02-17 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2677

Shexiang Jiang, Xinrui Zhou

To realize the accurate identification of coal-gangue in the process of underground coal transportation and the low-cost deployment of the model, a lightweight coal-gangue detection model based on the parallel depth residual network, called P-RNet, is proposed. For the problem of images of coal-gangue taken under complex conditions, the feature extraction module (FEM) is designed using decoupling training and inference methods. Furthermore, for the problem of the nearest neighbor interpolation upsampling method being prone to produce mosaic blocks and edge jagged edges, a lightweight upsampling operator is used to optimize the feature fusion module (FFM). Finally, to solve the problem, the stochastic gradient descent algorithm is prone to local suboptimal solutions and saddle point problems in the error function optimization process, numerous experiments are carried out on selecting the initial learning rate, and the Lookahead optimizer is used to optimize parameters during backpropagation. Experimental results show that the proposed model can effectively improve the recognition effect, with a corresponding low deployment cost.

{"title":"A lightweight coal-gangue detection model based on parallel deep residual networks.","authors":"Shexiang Jiang, Xinrui Zhou","doi":"10.7717/peerj-cs.2677","DOIUrl":"10.7717/peerj-cs.2677","url":null,"abstract":"To realize the accurate identification of coal-gangue in the process of underground coal transportation and the low-cost deployment of the model, a lightweight coal-gangue detection model based on the parallel depth residual network, called P-RNet, is proposed. For the problem of images of coal-gangue taken under complex conditions, the feature extraction module (FEM) is designed using decoupling training and inference methods. Furthermore, for the problem of the nearest neighbor interpolation upsampling method being prone to produce mosaic blocks and edge jagged edges, a lightweight upsampling operator is used to optimize the feature fusion module (FFM). Finally, to solve the problem, the stochastic gradient descent algorithm is prone to local suboptimal solutions and saddle point problems in the error function optimization process, numerous experiments are carried out on selecting the initial learning rate, and the Lookahead optimizer is used to optimize parameters during backpropagation. Experimental results show that the proposed model can effectively improve the recognition effect, with a corresponding low deployment cost.","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2677"},"PeriodicalIF":3.5,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888869/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143588186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An improved termite life cycle optimizer algorithm for global function optimization.

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science

Pub Date : 2025-02-17 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2671

Yanjiao Wang, Mengjiao Wei

The termite life cycle optimizer algorithm (TLCO) is a new bionic meta-heuristic algorithm that emulates the natural behavior of termites in their natural habitat. This work presents an improved TLCO (ITLCO) to increase the speed and accuracy of convergence. A novel strategy for worker generation is established to enhance communication between individuals in the worker population and termite population. This strategy would prevent the original worker generation strategy from effectively balancing algorithm convergence and population diversity to reduce the risk of the algorithm in reaching a local optimum. A novel soldier generation strategy is proposed, which incorporates a step factor that adheres to the principles of evolution to further enhance the algorithm's convergence speed. Furthermore, a novel replacement update mechanism is executed when the new individual is of lower quality than the original individual. This mechanism ensures a balance between the convergence of the algorithm and the diversity of the population. The findings from CEC2013, CEC2019, and CEC2020 test sets indicate that ITLCO exhibits notable benefits regarding convergence speed, accuracy, and stability in comparison with the basic TLCO algorithm and the four most exceptional meta-heuristic algorithms thus far.

{"title":"An improved termite life cycle optimizer algorithm for global function optimization.","authors":"Yanjiao Wang, Mengjiao Wei","doi":"10.7717/peerj-cs.2671","DOIUrl":"10.7717/peerj-cs.2671","url":null,"abstract":"The termite life cycle optimizer algorithm (TLCO) is a new bionic meta-heuristic algorithm that emulates the natural behavior of termites in their natural habitat. This work presents an improved TLCO (ITLCO) to increase the speed and accuracy of convergence. A novel strategy for worker generation is established to enhance communication between individuals in the worker population and termite population. This strategy would prevent the original worker generation strategy from effectively balancing algorithm convergence and population diversity to reduce the risk of the algorithm in reaching a local optimum. A novel soldier generation strategy is proposed, which incorporates a step factor that adheres to the principles of evolution to further enhance the algorithm's convergence speed. Furthermore, a novel replacement update mechanism is executed when the new individual is of lower quality than the original individual. This mechanism ensures a balance between the convergence of the algorithm and the diversity of the population. The findings from CEC2013, CEC2019, and CEC2020 test sets indicate that ITLCO exhibits notable benefits regarding convergence speed, accuracy, and stability in comparison with the basic TLCO algorithm and the four most exceptional meta-heuristic algorithms thus far.","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2671"},"PeriodicalIF":3.5,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888856/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143588245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Combination of machine learning and data envelopment analysis to measure the efficiency of the Tax Service Office.

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science

Pub Date : 2025-02-17 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.2672

Shofinurdin Soffan, Arif Bramantoro, Ahmad A Alzahrani

The Tax Service Office, a division of the Directorate General of Taxes, is responsible for providing taxation services to the public and collecting taxes. Achieving tax targets efficiently while utilizing available resources is crucial. To assess the performance efficiency of decision-making units (DMUs), data envelopment analysis (DEA) is commonly employed. However, ensuring homogeneity among the DMUs is often necessary and requires the application of machine learning clustering techniques. In this study, we propose a three-stage approach: Clustering, DEA, and Regression, to measure the efficiency of all tax service office units. Real datasets from Indonesian tax service offices were used while maintaining strict confidentiality. Unlike previous studies that considered both input and output variables, we focus solely on clustering input variables, as it leads to more objective efficiency values when combining the results from each cluster. The results revealed three clusters with a silhouette score of 0.304 and Davies Bouldin Index of 1.119, demonstrating the effectiveness of fuzzy c-means clustering. Out of 352 DMUs, 225 or approximately 64% were identified as efficient using DEA calculations. We propose a regression algorithm to measure the efficiency of DMUs in new office planning, by determining the values of input and output variables. The optimization of multilayer perceptrons using genetic algorithms reduced the mean squared error by about 75.75%, from 0.0144 to 0.0035. Based on our findings, the overall performance of tax service offices in Indonesia has reached an efficiency level of 64%. These results show a significant improvement over the previous study, in which only about 18% of offices were considered efficient. The main contribution of this research is the development of a comprehensive framework for evaluating and predicting tax office efficiency, providing valuable insights for improving performance.

{"title":"Combination of machine learning and data envelopment analysis to measure the efficiency of the Tax Service Office.","authors":"Shofinurdin Soffan, Arif Bramantoro, Ahmad A Alzahrani","doi":"10.7717/peerj-cs.2672","DOIUrl":"10.7717/peerj-cs.2672","url":null,"abstract":"The Tax Service Office, a division of the Directorate General of Taxes, is responsible for providing taxation services to the public and collecting taxes. Achieving tax targets efficiently while utilizing available resources is crucial. To assess the performance efficiency of decision-making units (DMUs), data envelopment analysis (DEA) is commonly employed. However, ensuring homogeneity among the DMUs is often necessary and requires the application of machine learning clustering techniques. In this study, we propose a three-stage approach: Clustering, DEA, and Regression, to measure the efficiency of all tax service office units. Real datasets from Indonesian tax service offices were used while maintaining strict confidentiality. Unlike previous studies that considered both input and output variables, we focus solely on clustering input variables, as it leads to more objective efficiency values when combining the results from each cluster. The results revealed three clusters with a silhouette score of 0.304 and Davies Bouldin Index of 1.119, demonstrating the effectiveness of fuzzy c-means clustering. Out of 352 DMUs, 225 or approximately 64% were identified as efficient using DEA calculations. We propose a regression algorithm to measure the efficiency of DMUs in new office planning, by determining the values of input and output variables. The optimization of multilayer perceptrons using genetic algorithms reduced the mean squared error by about 75.75%, from 0.0144 to 0.0035. Based on our findings, the overall performance of tax service offices in Indonesia has reached an efficiency level of 64%. These results show a significant improvement over the previous study, in which only about 18% of offices were considered efficient. The main contribution of this research is the development of a comprehensive framework for evaluating and predicting tax office efficiency, providing valuable insights for improving performance.","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2672"},"PeriodicalIF":3.5,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888853/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143588335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

PeerJ Computer Science

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀