首页 > 最新文献

Egyptian Informatics Journal最新文献

英文 中文
IoT enabled data protection with substitution box for lightweight ciphers
IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-31 DOI: 10.1016/j.eij.2025.100620
K.B. Sarmila , S.V. Manisekaran
Rapid growth in communication and networking demands the protection of highly sensitive data in the system. The cryptographic techniques used in various traditional devices and cloud environments are not applicable to resource-constrained devices like sensors, industrial controllers, and RFID tags. A lightweight cryptographic design is required for securing the data revolving around constrained devices. Symmetric block cipher techniques shaped using substitution-permutation network (SPN) structure use the powerful component, the substitution box, which is the only component that contributes to non-linearity. In this paper, a modified 5-bit Dynamic Airy Chaotic (DAC) substitution box is proposed, which uses tent-logistic mapping for obtaining confusion property. This chaotic behavior is incorporated with an improved and crafted logical function. The substitution box exhibits high dynamic chaotic behavior and maintains the structure, balancing the composition of good security strength and resource utilization. The chaotic behavior and security resistance are evaluated based on the standard parameters. The DAC substitution box demonstrates improved security with 66% less memory footprint on an average gate count compared with standard 4- and 5-bit competitors. The solution was able to obtain equally good resistance against differential attacks and increased resistance against linear attacks with 40% less linear probability value in comparison with its competitors. With the increased bit length of 5, it is observed that DAC exhibits excellent flexibility with traditional block cipher techniques, thus simplifying the use of such a solution as a building block of cryptographic primitives.
{"title":"IoT enabled data protection with substitution box for lightweight ciphers","authors":"K.B. Sarmila ,&nbsp;S.V. Manisekaran","doi":"10.1016/j.eij.2025.100620","DOIUrl":"10.1016/j.eij.2025.100620","url":null,"abstract":"<div><div>Rapid growth in communication and networking demands the protection of highly sensitive data in the system. The cryptographic techniques used in various traditional devices and cloud environments are not applicable to resource-constrained devices like sensors, industrial controllers, and RFID tags. A lightweight cryptographic design is required for securing the data revolving around constrained devices. Symmetric block cipher techniques shaped using substitution-permutation network (SPN) structure use the powerful component, the substitution box, which is the only component that contributes to non-linearity. In this paper, a modified 5-bit Dynamic Airy Chaotic (DAC) substitution box is proposed, which uses tent-logistic mapping for obtaining confusion property. This chaotic behavior is incorporated with an improved and crafted logical function. The substitution box exhibits high dynamic chaotic behavior and maintains the structure, balancing the composition of good security strength and resource utilization. The chaotic behavior and security resistance are evaluated based on the standard parameters. The DAC substitution box demonstrates improved security with 66% less memory footprint on an average gate count compared with standard 4- and 5-bit competitors. The solution was able to obtain equally good resistance against differential attacks and increased resistance against linear attacks with 40% less linear probability value in comparison with its competitors. With the increased bit length of 5, it is observed that DAC exhibits excellent flexibility with traditional block cipher techniques, thus simplifying the use of such a solution as a building block of cryptographic primitives.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100620"},"PeriodicalIF":5.0,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143175769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
End-to-end neural automatic speech recognition system for low resource languages
IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-28 DOI: 10.1016/j.eij.2025.100615
Sami Dhahbi , Nasir Saleem , Sami Bourouis , Mouhebeddine Berrima , Elena Verdú
The rising popularity of end-to-end (E2E) automatic speech recognition (ASR) systems can be attributed to their ability to learn complex speech patterns directly from raw data, eliminating the need for intricate feature extraction pipelines and handcrafted language models. E2E-ASR systems have consistently outperformed traditional ASRs. However, training E2E-ASR systems for low-resource languages remains challenging due to the dependence on data from well-resourced languages. ASR is vital for promoting under-resourced languages, especially in developing human-to-human and human-to-machine communication systems. Using synthetic speech and data augmentation techniques can enhance E2E-ASR performance for low-resource languages, reducing word error rates (WERs) and character error rates (CERs). This study leverages a non-autoregressive neural text-to-speech (TTS) engine to generate high-quality speech, converting a series of phonemes into speech waveforms (mel-spectrograms). An on-the-fly data augmentation method is applied to these mel-spectrograms, treating them as images from which features are extracted to train a convolutional neural network (CNN) and a bidirectional long short-term memory (BLSTM)-based ASR. The E2E architecture of this system achieves optimal WER and CER performance. The proposed deep learning-based E2E-ASR, trained with synthetic speech and data augmentation, shows significant performance improvements, with a 20.75% reduction in WERs and a 10.34% reduction in CERs.
{"title":"End-to-end neural automatic speech recognition system for low resource languages","authors":"Sami Dhahbi ,&nbsp;Nasir Saleem ,&nbsp;Sami Bourouis ,&nbsp;Mouhebeddine Berrima ,&nbsp;Elena Verdú","doi":"10.1016/j.eij.2025.100615","DOIUrl":"10.1016/j.eij.2025.100615","url":null,"abstract":"<div><div>The rising popularity of end-to-end (E2E) automatic speech recognition (ASR) systems can be attributed to their ability to learn complex speech patterns directly from raw data, eliminating the need for intricate feature extraction pipelines and handcrafted language models. E2E-ASR systems have consistently outperformed traditional ASRs. However, training E2E-ASR systems for low-resource languages remains challenging due to the dependence on data from well-resourced languages. ASR is vital for promoting under-resourced languages, especially in developing human-to-human and human-to-machine communication systems. Using synthetic speech and data augmentation techniques can enhance E2E-ASR performance for low-resource languages, reducing word error rates (WERs) and character error rates (CERs). This study leverages a non-autoregressive neural text-to-speech (TTS) engine to generate high-quality speech, converting a series of phonemes into speech waveforms (mel-spectrograms). An on-the-fly data augmentation method is applied to these mel-spectrograms, treating them as images from which features are extracted to train a convolutional neural network (CNN) and a bidirectional long short-term memory (BLSTM)-based ASR. The E2E architecture of this system achieves optimal WER and CER performance. The proposed deep learning-based E2E-ASR, trained with synthetic speech and data augmentation, shows significant performance improvements, with a 20.75% reduction in WERs and a 10.34% reduction in CERs.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100615"},"PeriodicalIF":5.0,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143175746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Equalizer Design: HBOA-DE-trained radial basis function neural networks
IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-27 DOI: 10.1016/j.eij.2025.100617
Santosh Kumar Das , Satya Ranjan Pattanaik , Pradyumna Kumar Mohapatra , Saroja Kumar Rout , Abdulaziz S. Almazyad , Muhammed Basheer Jasser , Guojiang Xiong , Ali Wagdy Mohamed
Communication systems that rely on wireless technology require signal processing techniques to improve their channel performance. Wireless communications are susceptible to various signal distortions during transmission, including inter-symbol interference, adjacent channel interference, and co-channel interference. As a result, achieving error-free signal transmission in wireless communication is often challenging. To make sure the signal is recovered with a minimum bit error rate, equalizers are needed at the front end of the receiver. As an optimization algorithm, a nature-inspired hybrid algorithm is applied, namely BOA/DE, which is a combination of the Butterfly optimization algorithm (BOA) and differential evolution (DE). Based on a suitable network topology and transfer function, the presented work proposes an algorithm for training radial basis function neural networks (RBFNNs) that is applied to the problem of channel equalization. Both BOA and DE are advantageous in the proposed algorithm, which permits it to produce efficient results by balancing exploration and exploitation. Several methods have also been discussed in the literature that use optimization techniques to deal with the problem of equalization. The same problem is treated in this article as a classification issue. As a further step in the evaluation of the HBOA-DE-based RBFNN equalizer, three non-linear channels and adding different nonlinearities have been simulated. The proposed algorithm is compared with well-known algorithms in terms of Mean Square Error (MSE) and Bit Error Rate (BER). Additionally, the algorithm has been tested against a situation in burst error and evaluated via bit error probability (BEP) to establish its robustness and performance. Results showed that the method performed better in handling burst errors compared to others. It has been shown that the projected method outclasses other methods even in poor signal-to-noise ratio conditions, which is borne out by extensive simulation studies.
{"title":"Equalizer Design: HBOA-DE-trained radial basis function neural networks","authors":"Santosh Kumar Das ,&nbsp;Satya Ranjan Pattanaik ,&nbsp;Pradyumna Kumar Mohapatra ,&nbsp;Saroja Kumar Rout ,&nbsp;Abdulaziz S. Almazyad ,&nbsp;Muhammed Basheer Jasser ,&nbsp;Guojiang Xiong ,&nbsp;Ali Wagdy Mohamed","doi":"10.1016/j.eij.2025.100617","DOIUrl":"10.1016/j.eij.2025.100617","url":null,"abstract":"<div><div>Communication systems that rely on wireless technology require signal processing techniques to improve their channel performance. Wireless communications are susceptible to various signal distortions during transmission, including inter-symbol interference, adjacent channel interference, and co-channel interference. As a result, achieving error-free signal transmission in wireless communication is often challenging. To make sure the signal is recovered with a minimum bit error rate, equalizers are needed at the front end of the receiver. As an optimization algorithm, a nature-inspired hybrid algorithm is applied, namely BOA/DE, which is a combination of the Butterfly optimization algorithm (BOA) and differential evolution (DE). Based on a suitable network topology and transfer function, the presented work proposes an algorithm for training radial basis function neural networks (RBFNNs) that is applied to the problem of channel equalization. Both BOA and DE are advantageous in the proposed algorithm, which permits it to produce efficient results by balancing exploration and exploitation. Several methods have also been discussed in the literature that use optimization techniques to deal with the problem of equalization. The same problem is treated in this article as a classification issue. As a further step in the evaluation of the HBOA-DE-based RBFNN equalizer, three non-linear channels and adding different nonlinearities have been simulated. The proposed algorithm is compared with well-known algorithms in terms of Mean Square Error (MSE) and Bit Error Rate (BER). Additionally, the algorithm has been tested against a situation in burst error and evaluated via bit error probability (BEP) to establish its robustness and performance. Results showed that the method performed better in handling burst errors compared to others. It has been shown that the projected method outclasses other methods even in poor signal-to-noise ratio conditions, which is borne out by extensive simulation studies.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100617"},"PeriodicalIF":5.0,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143175768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel feature selection technique: Detection and classification of Android malware
IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-27 DOI: 10.1016/j.eij.2025.100618
Sandeep Sharma , Prachi , Rita Chhikara , Kavita Khanna
Android operating system is not just the most commonly employed mobile operating system, but also the most lucrative target for cybercriminals due to its extensive user base. In light of this, the objective of this research is to uncover a few features that can significantly enhance the detection of Android malware through utilization of feature engineering. This work introduces a novel approach to feature selection that can discover a promising subset of features for effective malware detection. The proposed technique, Multi-Wrapper Hybrid Feature Selection Technique (MWHFST), integrates wrapper-based feature selection techniques to address the limitations of individual wrapper-based feature selection methods. The research employs extensive experiments on the Kronodroid dataset, a comprehensive and large-scale dataset, to gauge how well the proposed technique identifies and classifies malicious Android applications. Experimental results using machine learning algorithms demonstrate that the technique proposed in this research effectively integrates the advantages of individual feature selection techniques and exhibits the potential to identify a brief set of pivotal features for detecting Android malware. The proposed approach successfully identifies and categorizes malicious Android applications, achieving an accuracy of 98.8 % and 88 %, respectively, using only 31 features. This approach surpasses existing methods by delivering comparable performance with a significantly reduced number of features compared to individual approaches.
{"title":"A novel feature selection technique: Detection and classification of Android malware","authors":"Sandeep Sharma ,&nbsp;Prachi ,&nbsp;Rita Chhikara ,&nbsp;Kavita Khanna","doi":"10.1016/j.eij.2025.100618","DOIUrl":"10.1016/j.eij.2025.100618","url":null,"abstract":"<div><div>Android operating system is not just the most commonly employed mobile operating system, but also the most lucrative target for cybercriminals due to its extensive user base. In light of this, the objective of this research is to uncover a few features that can significantly enhance the detection of Android malware through utilization of feature engineering. This work introduces a novel approach to feature selection that can discover a promising subset of features for effective malware detection. The proposed technique, Multi-Wrapper Hybrid Feature Selection Technique (MWHFST), integrates wrapper-based feature selection techniques to address the limitations of individual wrapper-based feature selection methods. The research employs extensive experiments on the Kronodroid dataset, a comprehensive and large-scale dataset, to gauge how well the proposed technique identifies and classifies malicious Android applications. Experimental results using machine learning algorithms demonstrate that the technique proposed in this research effectively integrates the advantages of individual feature selection techniques and exhibits the potential to identify a brief set of pivotal features for detecting Android malware. The proposed approach successfully identifies and categorizes malicious Android applications, achieving an accuracy of 98.8 % and 88 %, respectively, using only 31 features. This approach surpasses existing methods by delivering comparable performance with a significantly reduced number of features compared to individual approaches.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100618"},"PeriodicalIF":5.0,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143175765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fuzzy decision support system for english language teaching with corpus data
IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-26 DOI: 10.1016/j.eij.2025.100612
Meilin Huang
Corpus data from the past and novice observations are useful in improving the adeptness of English language teaching in high schools. Optimization methods support this adeptness through fine-tuning processes acquired from the corpus data. Hence, this article introduces a Progression-focused Teaching System (PTS) optimized by Fuzzy Decision (FD). This system focuses on identifying and providing solutions for lexical placement errors. Focusing on the progression of English teaching with high-quality outputs, lexical arrangement, and error reduction is pursued. The fuzzy decision system identifies a maximum precision output from the possible lexical placement in teaching vocabulary. In this decision process, the teaching efficiency towards the specific output is tuned through personalized training recommendations. The fuzzy output is used to benchmark the maximum precision output for further teaching references. Therefore, a consistent progression in teaching English vocabulary is attained by rectifying the errors in the previous corpus inputs.
{"title":"Fuzzy decision support system for english language teaching with corpus data","authors":"Meilin Huang","doi":"10.1016/j.eij.2025.100612","DOIUrl":"10.1016/j.eij.2025.100612","url":null,"abstract":"<div><div>Corpus data from the past and novice observations are useful in improving the adeptness of English language teaching in high schools. Optimization methods support this adeptness through fine-tuning processes acquired from the corpus data. Hence, this article introduces a Progression-focused Teaching System (PTS) optimized by Fuzzy Decision (FD). This system focuses on identifying and providing solutions for lexical placement errors. Focusing on the progression of English teaching with high-quality outputs, lexical arrangement, and error reduction is pursued. The fuzzy decision system identifies a maximum precision output from the possible lexical placement in teaching vocabulary. In this decision process, the teaching efficiency towards the specific output is tuned through personalized training recommendations. The fuzzy output is used to benchmark the maximum precision output for further teaching references. Therefore, a consistent progression in teaching English vocabulary is attained by rectifying the errors in the previous corpus inputs.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100612"},"PeriodicalIF":5.0,"publicationDate":"2025-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143175767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Novel validity indices for dynamic clustering and an Improved Dynamic Fuzzy C-Means
IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-17 DOI: 10.1016/j.eij.2025.100613
Ramiro Saltos , Ignacio Carvajal , Fernando Crespo , Richard Weber
Dynamic clustering algorithms play a crucial role in numerous real-world applications by continuously adapting to evolving data patterns and identifying changes within the underlying cluster structure. However, unlike static clustering, where a plethora of validation indices exist to assess the solution’s quality, evaluating the effectiveness of dynamic clustering algorithms remains a challenge. This paper addresses this gap by proposing a novel set of six validation indices specifically designed for dynamic clustering. These indices assess the quality of solutions generated at three distinct granularities: individual clusters, individual observation periods, and the entire observation horizon. Our focus centers on cluster creation and elimination, recognized as the most critical structural changes within the dynamic clustering literature. To illustrate the application of these novel indices, we introduce an improved version of the dynamic fuzzy c-means algorithm (I-DFCM) which offers enhanced computational stability for handling dynamic data. We demonstrate the effectiveness of both the I-DFCM algorithm and the new validation indices through computational experiments using both synthetic and real-world datasets. The experiments showcase how these indices can effectively validate dynamic clustering solutions and guide parameter tuning for optimal performance, and support practical applications such as dynamic community detection in social networks and informed decision-making in dynamic environments. The results highlight the significant potential of these new validation indices and the I-DFCM algorithm in advancing the field of dynamic clustering.
{"title":"Novel validity indices for dynamic clustering and an Improved Dynamic Fuzzy C-Means","authors":"Ramiro Saltos ,&nbsp;Ignacio Carvajal ,&nbsp;Fernando Crespo ,&nbsp;Richard Weber","doi":"10.1016/j.eij.2025.100613","DOIUrl":"10.1016/j.eij.2025.100613","url":null,"abstract":"<div><div>Dynamic clustering algorithms play a crucial role in numerous real-world applications by continuously adapting to evolving data patterns and identifying changes within the underlying cluster structure. However, unlike static clustering, where a plethora of validation indices exist to assess the solution’s quality, evaluating the effectiveness of dynamic clustering algorithms remains a challenge. This paper addresses this gap by proposing a novel set of six validation indices specifically designed for dynamic clustering. These indices assess the quality of solutions generated at three distinct granularities: individual clusters, individual observation periods, and the entire observation horizon. Our focus centers on cluster creation and elimination, recognized as the most critical structural changes within the dynamic clustering literature. To illustrate the application of these novel indices, we introduce an improved version of the dynamic fuzzy c-means algorithm (I-DFCM) which offers enhanced computational stability for handling dynamic data. We demonstrate the effectiveness of both the I-DFCM algorithm and the new validation indices through computational experiments using both synthetic and real-world datasets. The experiments showcase how these indices can effectively validate dynamic clustering solutions and guide parameter tuning for optimal performance, and support practical applications such as dynamic community detection in social networks and informed decision-making in dynamic environments. The results highlight the significant potential of these new validation indices and the I-DFCM algorithm in advancing the field of dynamic clustering.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100613"},"PeriodicalIF":5.0,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143175764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benchmark Arabic news posts and analyzes Arabic sentiment through RMuBERT and SSL with AMCFFL technique
IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-15 DOI: 10.1016/j.eij.2024.100601
Mustafa Mhamed , Richard Sutcliffe , Jun Feng
Sentiment analysis aims to extract emotions from textual data; sentiment analysis and text recognition are two of the most common tasks associated with natural language processing. Emergent technologies have been developed and employed in various fields, including marketing, health care, and policy making. However, with the growth of social media platforms and the flow of data, especially in the Arabic language, substantial difficulties have emerged that call for the creation of new frameworks to address problems, such as the lack of datasets related to news platforms, the complicated formation of the Arabic language, and complications with classifying, and system challenges, whether in machine learning, deep learning, or online analysis tools. This paper provides a new framework that helps address ASA challenges and work on various tasks based on the state-of-the-art ASA. First, it presents a new collection named (ANP5) from Arabic news posts from several Arabic platforms, then uses SSL with AMCFFL technique to analyze the Arabic sentiment and generate a second dataset (ANPS2). Next, applied ML classifiers, RF and SVM, do the best among the other classifiers, with an accuracy of 82.00%; however, the measurement distributions for each class are different (Experiment 1). Following that, DL models, BIGRU, CNN-LSTM, LSTM, and CNN, had accuracies of 88.10%, 89.30%, 89.85%, and 90.10% (Experiment 2). Experiments 1 and 2 represent the initial benchmark classification as the first baseline. Afterward, a new RMuBERT Model was developed and compared with four transformers on the two datasets: ANPS2 accuracy (90.87%) and ANP5 (90.33%). RMuBERT performed better than the baselines (Experiment 3). Further testing of RMuBERT on various Arabic corpora with different classes, lengths, and sizes: ArSarcasm (3C), STD (2C), AJGT (2C), and AAQ (2C), revealed accuracies of 77.76%, 91.79%, 94.07%, and 93.48%, respectively. Still, RMuBERT performed better than the baselines (Experiment 4). Finally, on the largest Arabic sentiment corpora with six million Arabic tweets, the performance is up to (91.12%); RMuBERT works efficiently with less training time (Experiment 5).
{"title":"Benchmark Arabic news posts and analyzes Arabic sentiment through RMuBERT and SSL with AMCFFL technique","authors":"Mustafa Mhamed ,&nbsp;Richard Sutcliffe ,&nbsp;Jun Feng","doi":"10.1016/j.eij.2024.100601","DOIUrl":"10.1016/j.eij.2024.100601","url":null,"abstract":"<div><div>Sentiment analysis aims to extract emotions from textual data; sentiment analysis and text recognition are two of the most common tasks associated with natural language processing. Emergent technologies have been developed and employed in various fields, including marketing, health care, and policy making. However, with the growth of social media platforms and the flow of data, especially in the Arabic language, substantial difficulties have emerged that call for the creation of new frameworks to address problems, such as the lack of datasets related to news platforms, the complicated formation of the Arabic language, and complications with classifying, and system challenges, whether in machine learning, deep learning, or online analysis tools. This paper provides a new framework that helps address ASA challenges and work on various tasks based on the state-of-the-art ASA. First, it presents a new collection named (ANP5) from Arabic news posts from several Arabic platforms, then uses SSL with AMCFFL technique to analyze the Arabic sentiment and generate a second dataset (ANPS2). Next, applied ML classifiers, RF and SVM, do the best among the other classifiers, with an accuracy of 82.00%; however, the measurement distributions for each class are different (Experiment 1). Following that, DL models, BIGRU, CNN-LSTM, LSTM, and CNN, had accuracies of 88.10%, 89.30%, 89.85%, and 90.10% (Experiment 2). Experiments 1 and 2 represent the initial benchmark classification as the first baseline. Afterward, a new RMuBERT Model was developed and compared with four transformers on the two datasets: ANPS2 accuracy (90.87%) and ANP5 (90.33%). RMuBERT performed better than the baselines (Experiment 3). Further testing of RMuBERT on various Arabic corpora with different classes, lengths, and sizes: ArSarcasm (3C), STD (2C), AJGT (2C), and AAQ (2C), revealed accuracies of 77.76%, 91.79%, 94.07%, and 93.48%, respectively. Still, RMuBERT performed better than the baselines (Experiment 4). Finally, on the largest Arabic sentiment corpora with six million Arabic tweets, the performance is up to (91.12%); RMuBERT works efficiently with less training time (Experiment 5).</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100601"},"PeriodicalIF":5.0,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143175766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Next-Generation energy Management: Particle Density algorithm for residential microgrid optimization
IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-11 DOI: 10.1016/j.eij.2025.100611
Liang Wang , Nan Sun
This study presents an innovative approach to optimize energy management in residential microgrids, in light of the rising demand for energy and mounting environmental concerns. The research underscores the vital role of efficient energy management and responsive load control to improve energy efficiency and reduce consumer costs. To this end, a framework is proposed in which a power aggregator operates within a microgrid to manage residential electricity consumption. The primary goal of this framework is to minimize energy costs while considering subscriber preferences and the capacity limitations of the distribution network. The improved particle swarm optimization (IPSO) algorithm is employed to optimize energy management, resolve convergence challenges, and ensure user requirements are effectively prioritized. Integrating emergency, economic, and planned strategies provides cost savings, ensures grid stability, and enhances user satisfaction. The incorporation of Internet of Things (IoT) technology enables seamless communication, precise device control, and data-driven decision-making, empowering households to manage their energy loads more effectively and contribute to grid efficiency. Through scenario analysis, this research demonstrates the IPSO algorithm’s potential for significant cost reductions and improved grid stability. In Scenario 1, focused exclusively on affordability, numerical analyses present the total cost of electricity under different load conditions over three months. Scenario 2, also prioritizing affordability, highlights the impact of economic considerations on electricity expenses. Furthermore, Scenario 3 (80 % emergency + 20 % affordable) and Scenario 4 (50 % emergency + 20 % affordable + 30 % planned) showcase the potential for cost reduction through various priority combinations. These insights reflect the effectiveness of load management strategies facilitated by IoT technology. This comprehensive energy management approach lays a strong foundation for a resilient and adaptable energy infrastructure that meets society’s evolving demands.
{"title":"Next-Generation energy Management: Particle Density algorithm for residential microgrid optimization","authors":"Liang Wang ,&nbsp;Nan Sun","doi":"10.1016/j.eij.2025.100611","DOIUrl":"10.1016/j.eij.2025.100611","url":null,"abstract":"<div><div>This study presents an innovative approach to optimize energy management in residential microgrids, in light of the rising demand for energy and mounting environmental concerns. The research underscores the vital role of efficient energy management and responsive load control to improve energy efficiency and reduce consumer costs. To this end, a framework is proposed in which a power aggregator operates within a microgrid to manage residential electricity consumption. The primary goal of this framework is to minimize energy costs while considering subscriber preferences and the capacity limitations of the distribution network. The improved particle swarm optimization (IPSO) algorithm is employed to optimize energy management, resolve convergence challenges, and ensure user requirements are effectively prioritized. Integrating emergency, economic, and planned strategies provides cost savings, ensures grid stability, and enhances user satisfaction. The incorporation of Internet of Things (IoT) technology enables seamless communication, precise device control, and data-driven decision-making, empowering households to manage their energy loads more effectively and contribute to grid efficiency. Through scenario analysis, this research demonstrates the IPSO algorithm’s potential for significant cost reductions and improved grid stability. In Scenario 1, focused exclusively on affordability, numerical analyses present the total cost of electricity under different load conditions over three months. Scenario 2, also prioritizing affordability, highlights the impact of economic considerations on electricity expenses. Furthermore, Scenario 3 (80 % emergency + 20 % affordable) and Scenario 4 (50 % emergency + 20 % affordable + 30 % planned) showcase the potential for cost reduction through various priority combinations. These insights reflect the effectiveness of load management strategies facilitated by IoT technology. This comprehensive energy management approach lays a strong foundation for a resilient and adaptable energy infrastructure that meets society’s evolving demands.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100611"},"PeriodicalIF":5.0,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143175763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The financial impact of human resources configuration: A quantitative analysis based on modified single candidate optimizer
IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-11 DOI: 10.1016/j.eij.2024.100584
Zhuozhuo Zhang , Jun Lu , Qi Wang
Recently, by complicated and fast changing business environments, the effective allocation of Human Resources (HR) is considered as an important task to achieve success within organizations. However, the optimal allocation of HR is considered as a complicated challenge due to the uncertainties that are inherent in the process. Traditional approaches often rely on manual decision-making, which can result in less effective allocations and reduced productivity. With the rise of big data and advanced analytics, there is an increasing demand for data-driven methodologies to enhance HR allocation. This paper presents an innovative HR optimization framework that uses a modified metaheuristic model, called the Modified Single Candidate Optimizer (MSCO) algorithm to resolve this task. The framework integrates big data analytics and system analysis to establish a quantitative management strategy for optimizing HR configurations. By using the advantages of the proposed MSCO, the framework can effectively address the HR allocation problems to provide an optimal solution. The results indicate that the proposed framework significantly improves HR utilization rates, labor productivity.
{"title":"The financial impact of human resources configuration: A quantitative analysis based on modified single candidate optimizer","authors":"Zhuozhuo Zhang ,&nbsp;Jun Lu ,&nbsp;Qi Wang","doi":"10.1016/j.eij.2024.100584","DOIUrl":"10.1016/j.eij.2024.100584","url":null,"abstract":"<div><div>Recently, by complicated and fast changing business environments, the effective allocation of Human Resources (HR) is considered as an important task to achieve success within organizations. However, the optimal allocation of HR is considered as a complicated challenge due to the uncertainties that are inherent in the process. Traditional approaches often rely on manual decision-making, which can result in less effective allocations and reduced productivity. With the rise of big data and advanced analytics, there is an increasing demand for data-driven methodologies to enhance HR allocation. This paper presents an innovative HR optimization framework that uses a modified metaheuristic model, called the Modified Single Candidate Optimizer (MSCO) algorithm to resolve this task. The framework integrates big data analytics and system analysis to establish a quantitative management strategy for optimizing HR configurations. By using the advantages of the proposed MSCO, the framework can effectively address the HR allocation problems to provide an optimal solution. The results indicate that the proposed framework significantly improves HR utilization rates, labor productivity.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100584"},"PeriodicalIF":5.0,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143174187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The role of hybrid models in financial decision-making: Forecasting stock prices with advanced algorithms
IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-11 DOI: 10.1016/j.eij.2025.100610
Xiaoyi Zhu
Stock price volatility is influenced by many factors, which are significant obstacles to achieving accurate stock price forecasting in the financial market. This study introduces a novel hybrid model to tackle the abovementioned issues by integrating various algorithms, including bidirectional long short-term memory and random forest. Additionally, it incorporates ensemble empirical mode decomposition, sample entropy clustering, and sea-horse optimizer as part of its methodology. Exponential moving average 30, relative strength index 14, simple moving average 30, moving average convergence divergence, on-balance-volume, and daily open price, high price, low price, close price, and trading volume of the S&P 500 index between 01/04/2013 and 12/29/2022 were utilized as the dataset. To reduce the complexity of the time series decomposition and clustering methods were employed. Then, the high sequences underwent processing using the optimized random forest algorithm, and the remaining sequences were subjected to processing utilizing optimized bidirectional long short-term memory. This approach allowed the model to generalize effectively across a variety of global indices, as demonstrated by its high prediction accuracy (coefficient of determination (R2) values exceeding 0.98 for the Dow Jones, CSI, Nikkei, and DAX indices). Additionally, robustness testing was conducted by introducing incremental noise levels to simulate real-market conditions, which demonstrated that the model remains highly accurate even at the highest noise level. In comparison to other methods, the proposed model demonstrated superior performance on the S&P 500, with an R2 of 0.99 and low error metrics. This model’s adaptability and reliability in diverse and volatile market conditions are emphasized by this robust framework, which renders it a potent financial forecasting tool.
{"title":"The role of hybrid models in financial decision-making: Forecasting stock prices with advanced algorithms","authors":"Xiaoyi Zhu","doi":"10.1016/j.eij.2025.100610","DOIUrl":"10.1016/j.eij.2025.100610","url":null,"abstract":"<div><div>Stock price volatility is influenced by many factors, which are significant obstacles to achieving accurate stock price forecasting in the financial market. This study introduces a novel hybrid model to tackle the abovementioned issues by integrating various algorithms, including bidirectional long short-term memory and random forest. Additionally, it incorporates ensemble empirical mode decomposition, sample entropy clustering, and sea-horse optimizer as part of its methodology. Exponential moving average 30, relative strength index 14, simple moving average 30, moving average convergence divergence, on-balance-volume, and daily open price, high price, low price, close price, and trading volume of the S&amp;P 500 index between 01/04/2013 and 12/29/2022 were utilized as the dataset. To reduce the complexity of the time series decomposition and clustering methods were employed. Then, the high sequences underwent processing using the optimized random forest algorithm, and the remaining sequences were subjected to processing utilizing optimized bidirectional long short-term memory. This approach allowed the model to generalize effectively across a variety of global indices, as demonstrated by its high prediction accuracy (coefficient of determination (<span><math><msup><mrow><mi>R</mi></mrow><mn>2</mn></msup></math></span>) values exceeding 0.98 for the Dow Jones, CSI, Nikkei, and DAX indices). Additionally, robustness testing was conducted by introducing incremental noise levels to simulate real-market conditions, which demonstrated that the model remains highly accurate even at the highest noise level. In comparison to other methods, the proposed model demonstrated superior performance on the S&amp;P 500, with an R<sup>2</sup> of 0.99 and low error metrics. This model’s adaptability and reliability in diverse and volatile market conditions are emphasized by this robust framework, which renders it a potent financial forecasting tool.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100610"},"PeriodicalIF":5.0,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143175762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Egyptian Informatics Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1