首页 > 最新文献

IAES International Journal of Artificial Intelligence (IJ-AI)最新文献

英文 中文
Bangla song genre recognition using artificial neural network 利用人工神经网络识别孟加拉语歌曲流派
Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp2413-2422
Mariam Akter, Nishat Sultana, S. R. H. Noori, Md Zahid Hasan
Music has a control over human moods and it can make someone calm or excited. It allows us to feel all emotions we experience. Nowadays, people are often attached with their phones and computers listening to music on Spotify, Soundcloud or any other internet platform. Music Information retrieval plays an important role for music recommendation according to lyrics, pitch, pattern of choices, and genre. In this study, we have tried to recognize the music genre for a better music recommendation system. We have collected an amount of 1820 Bangla songs from six different genres including Adhunik, Rock, Hip hop, Nazrul, Rabindra and Folk music. We have started with some traditional machine learning algorithms having K-Nearest Neighbor, Logistic Regression, Random Forest, Support Vector Machine and Decision Tree but ended up with a deep learning algorithm named Artificial Neural Network with an accuracy of 78% for recognizing music genres from six different genres. All mentioned algorithms are experimented with transformed mel-spectrograms and Mean Chroma Frequency Values of that raw amplitude data. But we found that music Tempo having Beats per Minute value with two previous features present better accuracy.
音乐能控制人的情绪,它能让人平静或兴奋。它能让我们感受到自己的所有情绪。如今,人们经常通过手机和电脑收听 Spotify、Soundcloud 或其他网络平台上的音乐。音乐信息检索在根据歌词、音调、选择模式和流派推荐音乐方面发挥着重要作用。在这项研究中,我们尝试识别音乐流派,以建立更好的音乐推荐系统。我们收集了 1820 首孟加拉歌曲,这些歌曲来自六种不同的音乐流派,包括阿杜尼克(Adhunik)、摇滚(Rock)、嘻哈(Hip Hop)、纳兹鲁尔(Nazrul)、拉宾德拉(Rabindra)和民间音乐。我们从 K-近邻、逻辑回归、随机森林、支持向量机和决策树等一些传统的机器学习算法入手,但最终采用了一种名为人工神经网络的深度学习算法,其识别六种不同类型音乐的准确率高达 78%。所有上述算法都是通过原始振幅数据的转换后的旋律谱图和平均色度频率值进行实验的。但我们发现,具有每分钟节拍值的音乐节奏和前两个特征的准确率更高。
{"title":"Bangla song genre recognition using artificial neural network","authors":"Mariam Akter, Nishat Sultana, S. R. H. Noori, Md Zahid Hasan","doi":"10.11591/ijai.v13.i2.pp2413-2422","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp2413-2422","url":null,"abstract":"Music has a control over human moods and it can make someone calm or excited. It allows us to feel all emotions we experience. Nowadays, people are often attached with their phones and computers listening to music on Spotify, Soundcloud or any other internet platform. Music Information retrieval plays an important role for music recommendation according to lyrics, pitch, pattern of choices, and genre. In this study, we have tried to recognize the music genre for a better music recommendation system. We have collected an amount of 1820 Bangla songs from six different genres including Adhunik, Rock, Hip hop, Nazrul, Rabindra and Folk music. We have started with some traditional machine learning algorithms having K-Nearest Neighbor, Logistic Regression, Random Forest, Support Vector Machine and Decision Tree but ended up with a deep learning algorithm named Artificial Neural Network with an accuracy of 78% for recognizing music genres from six different genres. All mentioned algorithms are experimented with transformed mel-spectrograms and Mean Chroma Frequency Values of that raw amplitude data. But we found that music Tempo having Beats per Minute value with two previous features present better accuracy.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141229135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Strategies for improving the quality of community detection based on modularity optimization 基于模块化优化的群落检测质量改进策略
Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1794-1804
Tedy Setiadi, Mohd Ridzwan Yaakub, Azuraliza Abu Bakar
Community detection is a field of interest in social networks. Many new methods have emerged for community detection solution, however the modularity optimization method is the most prominent. Community detection based on modularity optimization (CDMO) has fundamental problems in the form of solution degeneration and resolution limits. From the two problems, the resolution limit is more concerned because it affects the resulting community's quality. During the last decade, many studies have attempted to address the problems, but so far they have been carried out partially, no one has thoroughly discussed efforts to improve the quality of CDMO. In this paper, we aim to investigate works in handling resolution limit and improving the quality of CDMO, along with their strengths and limitations. We derive six categories of strategies to improve the quality of CDMO, namely developing multi-resolution modularity, creating local modularity, creating modularity density, creating new metrics as an alternative to modularity, creating new quality metrics as a substitute for modularity, involving node attributes in determining community detection, and extending the single objective function into a multi-objective function. These strategies can be used as a guide in developing community detection methods. By considering network size, network type, and community distribution, we can choose the appropriate strategy in improving the quality of community detection.
社群检测是社交网络中一个备受关注的领域。社区检测解决方案出现了许多新方法,但模块化优化方法最为突出。基于模块化优化的社群检测(CDMO)存在解退化和分辨率限制两个基本问题。在这两个问题中,分辨率限制更受关注,因为它会影响所得到的群落质量。近十年来,许多研究都试图解决这些问题,但迄今为止,这些研究都是局部性的,没有人深入探讨过如何提高 CDMO 的质量。本文旨在研究处理分辨率限制和提高 CDMO 质量的工作及其优势和局限性。我们得出了六类提高 CDMO 质量的策略,即发展多分辨率模块化、创建局部模块化、创建模块化密度、创建新指标作为模块化的替代、创建新质量指标作为模块化的替代、让节点属性参与确定群落检测,以及将单一目标函数扩展为多目标函数。这些策略可作为开发社群检测方法的指南。通过考虑网络规模、网络类型和社群分布,我们可以选择合适的策略来提高社群检测的质量。
{"title":"Strategies for improving the quality of community detection based on modularity optimization","authors":"Tedy Setiadi, Mohd Ridzwan Yaakub, Azuraliza Abu Bakar","doi":"10.11591/ijai.v13.i2.pp1794-1804","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp1794-1804","url":null,"abstract":"Community detection is a field of interest in social networks. Many new methods have emerged for community detection solution, however the modularity optimization method is the most prominent. Community detection based on modularity optimization (CDMO) has fundamental problems in the form of solution degeneration and resolution limits. From the two problems, the resolution limit is more concerned because it affects the resulting community's quality. During the last decade, many studies have attempted to address the problems, but so far they have been carried out partially, no one has thoroughly discussed efforts to improve the quality of CDMO. In this paper, we aim to investigate works in handling resolution limit and improving the quality of CDMO, along with their strengths and limitations. We derive six categories of strategies to improve the quality of CDMO, namely developing multi-resolution modularity, creating local modularity, creating modularity density, creating new metrics as an alternative to modularity, creating new quality metrics as a substitute for modularity, involving node attributes in determining community detection, and extending the single objective function into a multi-objective function. These strategies can be used as a guide in developing community detection methods. By considering network size, network type, and community distribution, we can choose the appropriate strategy in improving the quality of community detection.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141229699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-time indoor tracking for augmented reality using computer vision technique 利用计算机视觉技术实现增强现实的实时室内跟踪
Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1845-1857
Ashraf Saad Shewail, Hala H. Zayed, Neven A. M. Elsayed
In recent times, there has been an increase in the stability and integration of augmented reality (AR) technology in everyday applications. AR relies on tracking techniques to capture the characteristics of the surrounding environment. Tracking falls into two categories: outdoor and indoor. While outdoor tracking predominantly relies on the global positioning system (GPS), it is performance indoors is hindered by imprecise GPS signals. Indoor tracking offers a solution for navigating complex indoor environments. This paper introduces an indoor tracking system that combines smartphone sensor data and computer vision using the oriented features from accelerated and segments test and rotated binary robust independent elementary features (ORB) algorithm for feature extraction, along with brute force match (BFM) and k-nearest neighbor (KNN) for matching. This approach outperforms previous systems, offering efficient navigation without relying on pre-existing maps. The system uses the A* algorithm to find the shortest path and cloud computing for data storage. Experimental results demonstrate an impressive 99% average accuracy within a 7-10 cm error range, even in scenarios with varying distances. Moreover, all users successfully reached their destinations during the experiments. This innovative model presents a promising advancement in indoor tracking, enhancing the accuracy and effectiveness of navigation in complex indoor spaces
近来,增强现实(AR)技术在日常应用中的稳定性和集成度不断提高。AR 依靠跟踪技术来捕捉周围环境的特征。跟踪分为两类:室外和室内。室外追踪主要依靠全球定位系统(GPS),但由于 GPS 信号不精确,其在室内的性能受到影响。室内追踪为复杂的室内环境导航提供了一种解决方案。本文介绍了一种结合智能手机传感器数据和计算机视觉的室内跟踪系统,该系统使用加速和分段测试的定向特征、旋转二进制鲁棒独立基本特征(ORB)算法进行特征提取,并使用蛮力匹配(BFM)和k-近邻(KNN)进行匹配。这种方法优于以往的系统,无需依赖预先存在的地图即可提供高效的导航。该系统使用 A* 算法查找最短路径,并使用云计算进行数据存储。实验结果表明,即使在距离不同的情况下,平均准确率也达到了令人印象深刻的 99%,误差范围在 7-10 厘米之间。此外,在实验过程中,所有用户都成功到达了目的地。这一创新模型为室内追踪带来了希望,提高了在复杂室内空间导航的准确性和有效性。
{"title":"Real-time indoor tracking for augmented reality using computer vision technique","authors":"Ashraf Saad Shewail, Hala H. Zayed, Neven A. M. Elsayed","doi":"10.11591/ijai.v13.i2.pp1845-1857","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp1845-1857","url":null,"abstract":"In recent times, there has been an increase in the stability and integration of augmented reality (AR) technology in everyday applications. AR relies on tracking techniques to capture the characteristics of the surrounding environment. Tracking falls into two categories: outdoor and indoor. While outdoor tracking predominantly relies on the global positioning system (GPS), it is performance indoors is hindered by imprecise GPS signals. Indoor tracking offers a solution for navigating complex indoor environments. This paper introduces an indoor tracking system that combines smartphone sensor data and computer vision using the oriented features from accelerated and segments test and rotated binary robust independent elementary features (ORB) algorithm for feature extraction, along with brute force match (BFM) and k-nearest neighbor (KNN) for matching. This approach outperforms previous systems, offering efficient navigation without relying on pre-existing maps. The system uses the A* algorithm to find the shortest path and cloud computing for data storage. Experimental results demonstrate an impressive 99% average accuracy within a 7-10 cm error range, even in scenarios with varying distances. Moreover, all users successfully reached their destinations during the experiments. This innovative model presents a promising advancement in indoor tracking, enhancing the accuracy and effectiveness of navigation in complex indoor spaces","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141229764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combination of gray level co-occurrence matrix and artificial neural networks for classification of COVID-19 based on chest X-ray images 结合灰度共现矩阵和人工神经网络,基于胸部 X 光图像对 COVID-19 进行分类
Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1625-1631
Bahtiar Imran, Lalu Delsi Samsumar, Ahmad Subki, Zaeniah Zaeniah, Salman Salman, Muhammad Rijal Alfian
This research uses the gray level co-occurrence matrix (GLCM) and artificial neural networks to classify COVID-19 images based on chest X-ray images. According to previous studies, there has never been a researcher who has integrated GLCM with artificial neural networks. Epochs 10, 30, 50, 70, 100, and 120 were used in this research. The total number of data points used in this investigation was 600, divided into 300 normal chests and 300 COVID-19 data points. Epoch 10 had 91% accuracy, epoch 30 had 91% accuracy, epoch 50 had 92% accuracy, epoch 70 had 91% accuracy, epoch 100 had 92% accuracy, and epoch 120 had 90% accuracy in categorization. As indicated by the results of the classification tests, combining GLCM and artificial neural networks can produce good results; a combination of these methods can yield a classification for COVID-19.
本研究利用灰度共现矩阵(GLCM)和人工神经网络对基于胸部 X 光图像的 COVID-19 图像进行分类。根据以往的研究,从未有研究人员将 GLCM 与人工神经网络相结合。本研究使用了 10、30、50、70、100 和 120 个时间点。本次调查使用的数据点总数为 600 个,分为 300 个正常胸部数据点和 300 个 COVID-19 数据点。在分类方面,第 10 个纪元的准确率为 91%,第 30 个纪元的准确率为 91%,第 50 个纪元的准确率为 92%,第 70 个纪元的准确率为 91%,第 100 个纪元的准确率为 92%,第 120 个纪元的准确率为 90%。分类测试结果表明,将 GLCM 和人工神经网络结合起来可以产生良好的效果;将这些方法结合起来可以对 COVID-19 进行分类。
{"title":"Combination of gray level co-occurrence matrix and artificial neural networks for classification of COVID-19 based on chest X-ray images","authors":"Bahtiar Imran, Lalu Delsi Samsumar, Ahmad Subki, Zaeniah Zaeniah, Salman Salman, Muhammad Rijal Alfian","doi":"10.11591/ijai.v13.i2.pp1625-1631","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp1625-1631","url":null,"abstract":"This research uses the gray level co-occurrence matrix (GLCM) and artificial neural networks to classify COVID-19 images based on chest X-ray images. According to previous studies, there has never been a researcher who has integrated GLCM with artificial neural networks. Epochs 10, 30, 50, 70, 100, and 120 were used in this research. The total number of data points used in this investigation was 600, divided into 300 normal chests and 300 COVID-19 data points. Epoch 10 had 91% accuracy, epoch 30 had 91% accuracy, epoch 50 had 92% accuracy, epoch 70 had 91% accuracy, epoch 100 had 92% accuracy, and epoch 120 had 90% accuracy in categorization. As indicated by the results of the classification tests, combining GLCM and artificial neural networks can produce good results; a combination of these methods can yield a classification for COVID-19.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141233077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tuning the k value in k-nearest neighbors for malware detection 调整 k 近邻中的 k 值以检测恶意软件
Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp2275-2282
Mosleh M. Abualhaj, A. Abu-Shareha, Qusai Y. Shambour, S. Al-Khatib, Mohammad O. Hiari
Malicious software, also referred to as malware, poses a serious threat to computer networks, user privacy, and user systems. Effective cybersecurity depends on the correct detection and classification of malware. In order to improve its effectiveness, the K-Nearest Neighbors (KNN) method is applied systematically in this study to the task of malware detection. The study investigates the effect of the number of neighbors (K) parameter on the KNN's performance. MalMem-2022 malware datasets and relevant evaluation criteria like accuracy, precision, recall, and F1-score will be used to assess the efficacy of the suggested technique. The experiments evaluate how parameter tuning affects the accuracy of malware detection by comparing the performance of various parameter setups. The study findings show that careful parameter adjustment considerably boosts the KNN method's malware detection capability. The research also highlights the potential of KNN with parameter adjustment as a useful tool for malware detection in real-world circumstances, allowing for prompt and precise identification of malware.
恶意软件(也称为恶意软件)对计算机网络、用户隐私和用户系统构成严重威胁。有效的网络安全取决于对恶意软件的正确检测和分类。为了提高其有效性,本研究将 K-Nearest Neighbors (KNN) 方法系统地应用于恶意软件检测任务中。本研究探讨了邻居数(K)参数对 KNN 性能的影响。将使用 MalMem-2022 恶意软件数据集和相关评估标准(如准确率、精确度、召回率和 F1 分数)来评估所建议技术的功效。实验通过比较各种参数设置的性能,评估参数调整如何影响恶意软件检测的准确性。研究结果表明,精心的参数调整大大提高了 KNN 方法的恶意软件检测能力。这项研究还凸显了参数调整 KNN 作为实际环境中恶意软件检测的有用工具的潜力,可以迅速准确地识别恶意软件。
{"title":"Tuning the k value in k-nearest neighbors for malware detection","authors":"Mosleh M. Abualhaj, A. Abu-Shareha, Qusai Y. Shambour, S. Al-Khatib, Mohammad O. Hiari","doi":"10.11591/ijai.v13.i2.pp2275-2282","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp2275-2282","url":null,"abstract":"Malicious software, also referred to as malware, poses a serious threat to computer networks, user privacy, and user systems. Effective cybersecurity depends on the correct detection and classification of malware. In order to improve its effectiveness, the K-Nearest Neighbors (KNN) method is applied systematically in this study to the task of malware detection. The study investigates the effect of the number of neighbors (K) parameter on the KNN's performance. MalMem-2022 malware datasets and relevant evaluation criteria like accuracy, precision, recall, and F1-score will be used to assess the efficacy of the suggested technique. The experiments evaluate how parameter tuning affects the accuracy of malware detection by comparing the performance of various parameter setups. The study findings show that careful parameter adjustment considerably boosts the KNN method's malware detection capability. The research also highlights the potential of KNN with parameter adjustment as a useful tool for malware detection in real-world circumstances, allowing for prompt and precise identification of malware.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141235173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the machine learning models based on natural language processing tasks 根据自然语言处理任务评估机器学习模型
Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp1954-1968
Meeradevi Meeradevi, S. B. J., Swetha B. N.
In the realm of natural language processing (NLP), a diverse array of language models has emerged, catering to a wide spectrum of tasks, ranging from speaker recognition and auto-correction to sentiment analysis and stock prediction. The significance of language models in enabling the execution of these NLP tasks cannot be overstated. This study proposes an approach to enhance accuracy by leveraging a hybrid language model, combining the strengths of long short-term memory (LSTM) and gated recurrent unit (GRU). LSTM excels in preserving long-term dependencies in data, while GRU's simpler gating mechanism expedites the training process. The research endeavors to evaluate four variations of this hybrid model: LSTM, GRU, bidirectional long short-term memory (Bi-LSTM), and a combination of LSTM with GRU. These models are subjected to rigorous testing on two distinct datasets: one focused on IBM stock price prediction, and the other on Jigsaw toxic comment classification (sentiment analysis). This work represents a significant stride towards democratizing NLP capabilities, ensuring that even in resource-constrained settings, NLP models can exhibit improved performance. The anticipated implications of these findings span a wide spectrum of real-world applications and hold the potential to stimulate further research in the field of NLP. 
在自然语言处理(NLP)领域,出现了各种各样的语言模型,可满足从说话人识别和自动纠错到情感分析和股票预测等各种任务。语言模型在执行这些 NLP 任务中的重要性无论怎样强调都不为过。本研究提出了一种利用混合语言模型提高准确性的方法,它结合了长短期记忆(LSTM)和门控递归单元(GRU)的优势。LSTM 擅长保存数据中的长期依赖关系,而 GRU 更简单的门控机制则加快了训练过程。本研究致力于评估这种混合模型的四种变体:LSTM、GRU、双向长短期记忆(Bi-LSTM)以及 LSTM 与 GRU 的组合。这些模型在两个不同的数据集上进行了严格测试:一个侧重于 IBM 股票价格预测,另一个侧重于 Jigsaw 有毒评论分类(情感分析)。这项工作在实现 NLP 能力民主化方面迈出了重要一步,确保了即使在资源有限的情况下,NLP 模型也能表现出更高的性能。这些发现的预期影响涵盖了现实世界的广泛应用,并有可能促进 NLP 领域的进一步研究。
{"title":"Evaluating the machine learning models based on natural language processing tasks","authors":"Meeradevi Meeradevi, S. B. J., Swetha B. N.","doi":"10.11591/ijai.v13.i2.pp1954-1968","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp1954-1968","url":null,"abstract":"In the realm of natural language processing (NLP), a diverse array of language models has emerged, catering to a wide spectrum of tasks, ranging from speaker recognition and auto-correction to sentiment analysis and stock prediction. The significance of language models in enabling the execution of these NLP tasks cannot be overstated. This study proposes an approach to enhance accuracy by leveraging a hybrid language model, combining the strengths of long short-term memory (LSTM) and gated recurrent unit (GRU). LSTM excels in preserving long-term dependencies in data, while GRU's simpler gating mechanism expedites the training process. The research endeavors to evaluate four variations of this hybrid model: LSTM, GRU, bidirectional long short-term memory (Bi-LSTM), and a combination of LSTM with GRU. These models are subjected to rigorous testing on two distinct datasets: one focused on IBM stock price prediction, and the other on Jigsaw toxic comment classification (sentiment analysis). This work represents a significant stride towards democratizing NLP capabilities, ensuring that even in resource-constrained settings, NLP models can exhibit improved performance. The anticipated implications of these findings span a wide spectrum of real-world applications and hold the potential to stimulate further research in the field of NLP. ","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141232860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature selection techniques for microarray dataset: a review 微阵列数据集的特征选择技术:综述
Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp2395-2402
Avinash Nagaraja, S. Sinha, Shivamurthaiah Mallaiah
For many researchers working on feature selection techniques, finding an appropriate feature from the microarray dataset has turned into a bottleneck. Researchers often create feature selection approaches and algorithms with the goal of improving accuracy in microarray datasets. The main goal of this study is to present a variety of contemporary feature selection techniques, such as Filter, Wrapper, and Embedded methods proposed for microarray datasets to work on multi-class classification problems and different approaches to enhance the performance of learning algorithms, to address the imbalance issue in the data set, and to support research efforts on microarray dataset. This study is based on Critical Review Questions (CRQ) constructed using feature election methods described in the review methodology and applied to a microarray dataset. We discussed the analysed findings and future prospects of feature selection strategies for multi-class classification issues using microarray datasets, as well as prospective ways to speed up computing environment
对于许多研究特征选择技术的人员来说,从微阵列数据集中找到合适的特征已成为一个瓶颈。研究人员经常创建特征选择方法和算法,目的是提高微阵列数据集的准确性。本研究的主要目的是介绍各种当代特征选择技术,如针对微阵列数据集提出的过滤器、封装器和嵌入式方法,以解决多类分类问题和不同方法,从而提高学习算法的性能,解决数据集中的不平衡问题,并为微阵列数据集的研究工作提供支持。本研究基于 "关键评论问题"(Critical Review Questions,CRQ),采用了评论方法学中描述的特征选择方法,并应用于微阵列数据集。我们讨论了使用微阵列数据集解决多类分类问题的特征选择策略的分析结果和未来前景,以及加速计算环境的前瞻性方法。
{"title":"Feature selection techniques for microarray dataset: a review","authors":"Avinash Nagaraja, S. Sinha, Shivamurthaiah Mallaiah","doi":"10.11591/ijai.v13.i2.pp2395-2402","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp2395-2402","url":null,"abstract":"For many researchers working on feature selection techniques, finding an appropriate feature from the microarray dataset has turned into a bottleneck. Researchers often create feature selection approaches and algorithms with the goal of improving accuracy in microarray datasets. The main goal of this study is to present a variety of contemporary feature selection techniques, such as Filter, Wrapper, and Embedded methods proposed for microarray datasets to work on multi-class classification problems and different approaches to enhance the performance of learning algorithms, to address the imbalance issue in the data set, and to support research efforts on microarray dataset. This study is based on Critical Review Questions (CRQ) constructed using feature election methods described in the review methodology and applied to a microarray dataset. We discussed the analysed findings and future prospects of feature selection strategies for multi-class classification issues using microarray datasets, as well as prospective ways to speed up computing environment","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141229041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid deep learning optimization for predicting the spread of a new emerging infectious disease 预测新发传染病传播的混合深度学习优化方法
Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp2036-2048
F. E. Nastiti, Shahrulniza Musa, Eiad Yafi
In this study, a novel approach geared toward predicting the estimated number of coronavirus disease (COVID-19) cases was developed. Combining long short-term memory (LSTM) neural networks with particle swarm optimization (PSO) along with grey wolf optimization (GWO) employ hybrid optimization algorithm techniques. This investigation utilizes COVID-19 original data from the Ministry of Health of Indonesia, period 2020-2021. The developed LSTM-PSO-GWO hybrid optimization algorithm can improve the performance and accuracy of predicting the spread of the COVID-19 virus in Indonesia. In initiating LSTM initial weights with weaknesses, using the hybrid optimization algorithm helps overcome these problems and improve model performance. The results of this study suggest that the LSTM-PSO-GWO model can be utilized as an effective and reliable predictive tool to gauge the COVID-19 virus’s spread in Indonesia. 
本研究开发了一种新方法,用于预测冠状病毒病(COVID-19)的估计病例数。将长短期记忆(LSTM)神经网络与粒子群优化(PSO)和灰狼优化(GWO)相结合,采用了混合优化算法技术。这项调查利用了印度尼西亚卫生部 2020-2021 年期间的 COVID-19 原始数据。所开发的 LSTM-PSO-GWO 混合优化算法可以提高预测 COVID-19 病毒在印尼传播的性能和准确性。在初始 LSTM 初始权重存在缺陷时,使用混合优化算法有助于克服这些问题,提高模型性能。本研究的结果表明,LSTM-PSO-GWO 模型可作为一种有效、可靠的预测工具,用于评估 COVID-19 病毒在印度尼西亚的传播情况。
{"title":"A hybrid deep learning optimization for predicting the spread of a new emerging infectious disease","authors":"F. E. Nastiti, Shahrulniza Musa, Eiad Yafi","doi":"10.11591/ijai.v13.i2.pp2036-2048","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp2036-2048","url":null,"abstract":"In this study, a novel approach geared toward predicting the estimated number of coronavirus disease (COVID-19) cases was developed. Combining long short-term memory (LSTM) neural networks with particle swarm optimization (PSO) along with grey wolf optimization (GWO) employ hybrid optimization algorithm techniques. This investigation utilizes COVID-19 original data from the Ministry of Health of Indonesia, period 2020-2021. The developed LSTM-PSO-GWO hybrid optimization algorithm can improve the performance and accuracy of predicting the spread of the COVID-19 virus in Indonesia. In initiating LSTM initial weights with weaknesses, using the hybrid optimization algorithm helps overcome these problems and improve model performance. The results of this study suggest that the LSTM-PSO-GWO model can be utilized as an effective and reliable predictive tool to gauge the COVID-19 virus’s spread in Indonesia. ","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141230497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The performance analysis of hyper-heuristics algorithms over examination timetabling problems 超启发式算法在考试时间安排问题上的性能分析
Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp2155-2164
A. Muklason, Yusnardo Tendio, Helena Angelita Depari, Muhammad Arif Nuriman, Gusti Agung Premananda
In general, uncapacitated exam timetabling is conducted manually, which can be time-consuming. Many studies aim to automate and optimize uncapacitated exam timetabling. However, pinpointing the most efficient algorithm is challenging since most studies assert that their algorithms surpass previous ones. To identify the optimal algorithm, this research evaluates the performance of four algorithms: Hill climbing (HC), simulated annealing (SA), great deluge (GD), and tabu search (TS) in addressing the exam timetabling problem. The Kempe chain operator’s influence on optimization solutions is also examined. A simple random method is employed to select the low-level heuristic (LLH). The Carter (Toronto) dataset served as the test material, with each algorithm undergoing 200,000 iterations for comparison. The results indicate that the TS algorithm is superior, providing the best solution in 13 instances. The use of a tabu list enhanced the search process’s efficiency by preventing redundant modifications. The Kempe chain LLH exhibited a tendency towards achieving better solutions.
一般来说,无空位考试时间安排都是人工进行的,这可能会很耗时。许多研究旨在自动化和优化无空位考试时间安排。然而,确定最有效的算法具有挑战性,因为大多数研究都声称他们的算法超越了以前的算法。为了找出最佳算法,本研究评估了四种算法在解决考试时间安排问题中的性能:爬山算法(HC)、模拟退火算法(SA)、大洪水算法(GD)和塔布搜索算法(TS)。此外,还研究了 Kempe 链算子对优化解决方案的影响。在选择低级启发式(LLH)时,采用了一种简单的随机方法。卡特(多伦多)数据集作为测试材料,每种算法都进行了 200,000 次迭代比较。结果表明,TS 算法更胜一筹,在 13 个实例中提供了最佳解决方案。塔布列表的使用避免了多余的修改,从而提高了搜索过程的效率。Kempe 链 LLH 显示出获得更好解决方案的趋势。
{"title":"The performance analysis of hyper-heuristics algorithms over examination timetabling problems","authors":"A. Muklason, Yusnardo Tendio, Helena Angelita Depari, Muhammad Arif Nuriman, Gusti Agung Premananda","doi":"10.11591/ijai.v13.i2.pp2155-2164","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp2155-2164","url":null,"abstract":"In general, uncapacitated exam timetabling is conducted manually, which can be time-consuming. Many studies aim to automate and optimize uncapacitated exam timetabling. However, pinpointing the most efficient algorithm is challenging since most studies assert that their algorithms surpass previous ones. To identify the optimal algorithm, this research evaluates the performance of four algorithms: Hill climbing (HC), simulated annealing (SA), great deluge (GD), and tabu search (TS) in addressing the exam timetabling problem. The Kempe chain operator’s influence on optimization solutions is also examined. A simple random method is employed to select the low-level heuristic (LLH). The Carter (Toronto) dataset served as the test material, with each algorithm undergoing 200,000 iterations for comparison. The results indicate that the TS algorithm is superior, providing the best solution in 13 instances. The use of a tabu list enhanced the search process’s efficiency by preventing redundant modifications. The Kempe chain LLH exhibited a tendency towards achieving better solutions.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141231003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative adversarial network-based phishing URL detection with variational autoencoder and transformer 利用变异自动编码器和变换器进行基于生成对抗网络的钓鱼 URL 检测
Pub Date : 2024-06-01 DOI: 10.11591/ijai.v13.i2.pp2165-2172
Jishnu Kaitholikkal Sasi, Arthi Balakrishnan
Phishing attacks pose a constant threat to online security, necessitating the development of efficient tools for identifying malicious URLs. In this article, we propose a novel approach to detect phishing URLs employing a generative adversarial network (GAN) with a variational autoencoder (VAE) as the generator and a transformer model with self-attention as the discriminator. The VAE generator is trained to produce synthetic URLs. In contrast, the transformer discriminator uses its self-attention mechanism to focus on the different parts of the input URLs to extract crucial features. Our model uses adversarial training to distinguish between legitimate and phishing URLs. We evaluate the effectiveness of the proposed method using a large set of one million URLs that incorporate both authentic and phishing URLs. Experimental results show that our model is effective, with an impressive accuracy of 97.75%, outperforming the baseline models. This study significantly improves online security by offering a novel and highly accurate phishing URL detection method.
网络钓鱼攻击对网络安全构成持续威胁,因此有必要开发高效的工具来识别恶意 URL。在本文中,我们提出了一种检测网络钓鱼 URL 的新方法,该方法采用了一种生成式对抗网络 (GAN),以变异自动编码器 (VAE) 作为生成器,以具有自我关注功能的变压器模型作为判别器。VAE 生成器经过训练,可以生成合成 URL。与此相反,变换器判别器利用其自我注意机制,关注输入 URL 的不同部分,以提取关键特征。我们的模型利用对抗训练来区分合法 URL 和网络钓鱼 URL。我们使用包含真实网址和网络钓鱼网址的 100 万个大型网址集来评估所提出方法的有效性。实验结果表明,我们的模型非常有效,准确率高达 97.75%,优于基线模型。这项研究提供了一种新颖、高度准确的网络钓鱼 URL 检测方法,从而大大提高了网络安全性。
{"title":"Generative adversarial network-based phishing URL detection with variational autoencoder and transformer","authors":"Jishnu Kaitholikkal Sasi, Arthi Balakrishnan","doi":"10.11591/ijai.v13.i2.pp2165-2172","DOIUrl":"https://doi.org/10.11591/ijai.v13.i2.pp2165-2172","url":null,"abstract":"Phishing attacks pose a constant threat to online security, necessitating the development of efficient tools for identifying malicious URLs. In this article, we propose a novel approach to detect phishing URLs employing a generative adversarial network (GAN) with a variational autoencoder (VAE) as the generator and a transformer model with self-attention as the discriminator. The VAE generator is trained to produce synthetic URLs. In contrast, the transformer discriminator uses its self-attention mechanism to focus on the different parts of the input URLs to extract crucial features. Our model uses adversarial training to distinguish between legitimate and phishing URLs. We evaluate the effectiveness of the proposed method using a large set of one million URLs that incorporate both authentic and phishing URLs. Experimental results show that our model is effective, with an impressive accuracy of 97.75%, outperforming the baseline models. This study significantly improves online security by offering a novel and highly accurate phishing URL detection method.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141231612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IAES International Journal of Artificial Intelligence (IJ-AI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1