首页 > 最新文献

PeerJ Computer Science最新文献

英文 中文
A new era in identification of tick genera; artificial intelligence for precision and speed. 蜱属鉴定的新时代人工智能的精度和速度。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 eCollection Date: 2026-01-01 DOI: 10.7717/peerj-cs.3291
Ibrahim A Ame, Abdullahi Ibrahim Umar, Cenk S Ozverel, Erdal Şanlıdağ, Ayse Seyer, Fadi Al-Turjman, Tamer Sanlidag

Background: The occurrence of pandemics in the last 20 years highlighted the unpreparedness of healthcare systems. There is a worldwide increased trend in the vector borne diseases. Ticks are one of the most common organisms that play a vital role in global ecosystem as well as being vectors of diseases affecting human and livestock. They are able to carry infectious agents that might cause illnesses including paralysis and to some certain extend death. Therefore, it is crucial to identify different genera of ticks to track infectious agents. Conventionally, tick classification is done by acarologists who are experts in the field. For this reason, the identification process is carried out in a difficult and time-consuming manner.

Method: The aim of the study was to develop a web-based application by using artificial intelligence-based algorithms to easily identify Hyalomma and Rhipicephalus ticks, which are the most abundant genera in Northern Cyprus, with high sensitivity and accuracy. The experimental procedure is structured based on five phases. Phase 1 revolves around data collection in which pictures of 35 identified ticks are taken by experienced acarologists and the curation of non-tick images (spiders, beetles, mites, mosquitos and scorpions). Phase 2 revolves around pre-processing steps and data split. Phase 3 involves training and testing custom Convolutional Neural Network (CNN), Visual Geometry Group 16 (VGG16), Residual Network 50 (ResNet-50) using 6,972 images (3,486 images for each class) for discrimination between ticks and non-ticks and 9,556 images (4,778 images for each class) for the discrimination between Hyalomma and Rhipicephalus. Phase 4 revolves around performance evaluation. Phase 5 is characterized by development of a web-based application (I-TickNet), created to enable a widespread use of the tick classifier.

Results: The performance evaluation and comparison of the model performance has shown that ResNet50 achieved the best result for binary classification of tick and non-tick (experiment A) with accuracy of 100% and Area Under the Curve (AUC) score of 100%. Moreover, VGG16 achieved the best result for binary classification of ticks (experiment B) with an accuracy of 96.97% and AUC score of 99.55% respectively. All the three models were employed for the development of artificial intelligence/Internet of Things (AI/IoT) framework known as I-TickNet for real-time and on-spot classification of tick images. In conclusion, this study provided a web-based application that can identify two distinct tick genera with high accuracy and sensitivity. The application developed enabled a user-friendly interface to identify genera without requiring any expertise.

背景:过去20年流行病的发生突出了卫生保健系统的准备不足。病媒传播的疾病在世界范围内有增加的趋势。蜱是最常见的生物之一,在全球生态系统中起着至关重要的作用,也是影响人类和牲畜疾病的媒介。它们能够携带可能导致包括瘫痪在内的疾病的传染性病原体,并在一定程度上导致死亡。因此,识别不同种类的蜱虫对追踪感染源至关重要。通常,蜱虫分类是由该领域的专家动物学家完成的。由于这个原因,身份查验过程是以困难和耗时的方式进行的。方法:采用基于人工智能的算法开发基于网络的应用程序,以方便地识别北塞浦路斯最丰富的属透明蜱和头蜱,具有较高的灵敏度和准确性。实验过程分为五个阶段。第一阶段围绕数据收集展开,由经验丰富的动物学家拍摄35种已识别蜱虫的照片,并对非蜱虫图像(蜘蛛、甲虫、螨虫、蚊子和蝎子)进行管理。阶段2围绕预处理步骤和数据分割展开。第三阶段包括训练和测试自定义卷积神经网络(CNN)、视觉几何组16 (VGG16)、残余网络50 (ResNet-50),使用6,972张图像(每类3,486张图像)来区分蜱虫和非蜱虫,使用9,556张图像(每类4,778张图像)来区分Hyalomma和Rhipicephalus。阶段4围绕性能评估展开。第5阶段的特点是开发基于web的应用程序(I-TickNet),创建该应用程序是为了使蜱类分类器得到广泛使用。结果:对模型性能的性能评价和比较表明,ResNet50在蜱虫和非蜱虫(实验A)的二元分类中取得了最好的结果,准确率为100%,曲线下面积(Area Under The Curve, AUC)得分为100%。其中,VGG16对蜱虫的二元分类(实验B)的准确率为96.97%,AUC得分为99.55%,达到最佳效果。这三种模型都被用于开发人工智能/物联网(AI/IoT)框架,即I-TickNet,用于实时和现场分类蜱虫图像。总之,本研究提供了一种基于网络的应用程序,可以以较高的准确性和灵敏度识别两种不同的蜱属。开发的应用程序支持用户友好的界面,无需任何专业知识即可识别属。
{"title":"A new era in identification of tick genera; artificial intelligence for precision and speed.","authors":"Ibrahim A Ame, Abdullahi Ibrahim Umar, Cenk S Ozverel, Erdal Şanlıdağ, Ayse Seyer, Fadi Al-Turjman, Tamer Sanlidag","doi":"10.7717/peerj-cs.3291","DOIUrl":"10.7717/peerj-cs.3291","url":null,"abstract":"<p><strong>Background: </strong>The occurrence of pandemics in the last 20 years highlighted the unpreparedness of healthcare systems. There is a worldwide increased trend in the vector borne diseases. Ticks are one of the most common organisms that play a vital role in global ecosystem as well as being vectors of diseases affecting human and livestock. They are able to carry infectious agents that might cause illnesses including paralysis and to some certain extend death. Therefore, it is crucial to identify different genera of ticks to track infectious agents. Conventionally, tick classification is done by acarologists who are experts in the field. For this reason, the identification process is carried out in a difficult and time-consuming manner.</p><p><strong>Method: </strong>The aim of the study was to develop a web-based application by using artificial intelligence-based algorithms to easily identify Hyalomma and Rhipicephalus ticks, which are the most abundant genera in Northern Cyprus, with high sensitivity and accuracy. The experimental procedure is structured based on five phases. Phase 1 revolves around data collection in which pictures of 35 identified ticks are taken by experienced acarologists and the curation of non-tick images (spiders, beetles, mites, mosquitos and scorpions). Phase 2 revolves around pre-processing steps and data split. Phase 3 involves training and testing custom Convolutional Neural Network (CNN), Visual Geometry Group 16 (VGG16), Residual Network 50 (ResNet-50) using 6,972 images (3,486 images for each class) for discrimination between ticks and non-ticks and 9,556 images (4,778 images for each class) for the discrimination between Hyalomma and Rhipicephalus. Phase 4 revolves around performance evaluation. Phase 5 is characterized by development of a web-based application (I-TickNet), created to enable a widespread use of the tick classifier.</p><p><strong>Results: </strong>The performance evaluation and comparison of the model performance has shown that ResNet50 achieved the best result for binary classification of tick and non-tick (experiment A) with accuracy of 100% and Area Under the Curve (AUC) score of 100%. Moreover, VGG16 achieved the best result for binary classification of ticks (experiment B) with an accuracy of 96.97% and AUC score of 99.55% respectively. All the three models were employed for the development of artificial intelligence/Internet of Things (AI/IoT) framework known as I-TickNet for real-time and on-spot classification of tick images. In conclusion, this study provided a web-based application that can identify two distinct tick genera with high accuracy and sensitivity. The application developed enabled a user-friendly interface to identify genera without requiring any expertise.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"12 ","pages":"e3291"},"PeriodicalIF":2.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12851202/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MS-YieldStackNet: multi-source data fusion for wheat yield estimation using a stacked ensemble neural network. MS-YieldStackNet:基于堆叠集成神经网络的小麦产量估计多源数据融合。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-13 eCollection Date: 2026-01-01 DOI: 10.7717/peerj-cs.3434
Waqas Ali, Zeeshan Ramzan, Muhammad Shahbaz, Qamar Ul Zaman Bhutta, Muhammad Talha, Mohammed J AlGhamdi

Accurate crop yield prediction is vital for ensuring food security and informing agricultural policy, particularly in wheat-dependent regions like Pakistan where manual estimation methods are labor-intensive and imprecise. This study introduces a novel algorithmic framework, MS-YieldStackNet, to predict wheat yield with high spatial resolution by integrating multispectral satellite imagery, in-situ soil analytics, and meteorological variables. A unified feature space is constructed using Normalized Difference Vegetation Index (NDVI) and Difference Vegetation Index (DVI), soil physicochemical attributes, and temporal climate patterns, processed through a stacked ensemble neural architecture (MS-YieldStackNet) combining three parallel feed-forward neural networks (FFNNs) and a Random Forest meta-learner. The model achieved robust performance with an R-squared of 0.81, Mean Squared Error (MSE) of 6,114.30 kg/ha, root mean squared error (RMSE) of 78.19 kg/ha, mean absolute error (MAE) of 59.07 kg/ha, and mean absolute percentage error (MAPE) of 3.55%, demonstrating its potential for precise and scalable crop yield forecasting.

准确的作物产量预测对于确保粮食安全和为农业政策提供信息至关重要,特别是在像巴基斯坦这样依赖小麦的地区,人工估算方法是劳动密集型的,而且不精确。本研究引入了一种新的算法框架MS-YieldStackNet,通过整合多光谱卫星图像、原位土壤分析和气象变量,以高空间分辨率预测小麦产量。利用归一化植被指数(NDVI)和差异植被指数(DVI)、土壤理化属性和时间气候模式构建统一的特征空间,通过结合三个并行前馈神经网络(ffnn)和随机森林元学习器的堆叠集成神经结构(MS-YieldStackNet)进行处理。该模型的r平方为0.81,均方误差(MSE)为6114.30 kg/ha,均方根误差(RMSE)为78.19 kg/ha,平均绝对误差(MAE)为59.07 kg/ha,平均绝对百分比误差(MAPE)为3.55%,显示了其精确和可扩展的作物产量预测潜力。
{"title":"MS-YieldStackNet: multi-source data fusion for wheat yield estimation using a stacked ensemble neural network.","authors":"Waqas Ali, Zeeshan Ramzan, Muhammad Shahbaz, Qamar Ul Zaman Bhutta, Muhammad Talha, Mohammed J AlGhamdi","doi":"10.7717/peerj-cs.3434","DOIUrl":"10.7717/peerj-cs.3434","url":null,"abstract":"<p><p>Accurate crop yield prediction is vital for ensuring food security and informing agricultural policy, particularly in wheat-dependent regions like Pakistan where manual estimation methods are labor-intensive and imprecise. This study introduces a novel algorithmic framework, MS-YieldStackNet, to predict wheat yield with high spatial resolution by integrating multispectral satellite imagery, <i>in-situ</i> soil analytics, and meteorological variables. A unified feature space is constructed using Normalized Difference Vegetation Index (NDVI) and Difference Vegetation Index (DVI), soil physicochemical attributes, and temporal climate patterns, processed through a stacked ensemble neural architecture (MS-YieldStackNet) combining three parallel feed-forward neural networks (FFNNs) and a Random Forest meta-learner. The model achieved robust performance with an R-squared of 0.81, Mean Squared Error (MSE) of 6,114.30 kg/ha, root mean squared error (RMSE) of 78.19 kg/ha, mean absolute error (MAE) of 59.07 kg/ha, and mean absolute percentage error (MAPE) of 3.55%, demonstrating its potential for precise and scalable crop yield forecasting.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"12 ","pages":"e3434"},"PeriodicalIF":2.5,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12818367/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146020675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust coffee plant disease classification using deep learning and advanced feature engineering techniques. 基于深度学习和先进特征工程技术的强健咖啡植物病害分类。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-09 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3386
Hanin Ardah, Maher Alrahhal, Walaa M Abd-Elhafiez, Doaa Trabay

Coffee, the world's most traded tropical crop, is vital to the economies of many producing countries. However, coffee leaf diseases pose a serious threat to coffee quality and sustainable production. Deep learning has shown strong performance in plant disease identification through automatic image classification. Nevertheless, reliance on a single convolutional neural networks (CNNs) architecture restricts feature variability and real-world generalization. Moreover, limited work has systematically combined feature selection/reduction with CNNs, which constrains the advancement of hybrid models capable of capturing complementary features while ensuring computational efficiency without accuracy loss. This article presents an enhanced deep learning-based framework for coffee disease classification incorporating a hybrid strategy that integrates CNNs and advanced feature selection algorithms. GoogLeNet and ResNet18 are paired for complementary feature extraction, Principal Component Analysis (PCA) and Singular Value Decomposition (SVD) are employed for dimensionality reduction, and ANOVA and Chi-square are applied to select the most informative features. An Adam optimizer (learning rate = 0.001, batch size = 20, epochs = 50) with early stopping is used for training. Experiments on the BRACOL dataset achieved 99.78% accuracy, with precision, recall, and F1-score all exceeding 99% across classes. To the best of our knowledge, this study systematically integrates GoogLeNet and ResNet18 with PCA/SVD dimensionality reduction and analysis of variance (ANOVA)/Chi-square feature selection, for coffee disease classification, thereby addressing a key gap in prior research.

咖啡是世界上交易量最大的热带作物,对许多生产国的经济至关重要。然而,咖啡叶病害对咖啡品质和可持续生产构成严重威胁。深度学习在植物病害自动分类识别方面表现优异。然而,对单一卷积神经网络(cnn)架构的依赖限制了特征的可变性和现实世界的泛化。此外,有限的工作将特征选择/约简与cnn系统地结合在一起,这限制了能够捕获互补特征同时保证计算效率而不损失精度的混合模型的进步。本文提出了一种增强的基于深度学习的咖啡疾病分类框架,该框架结合了集成cnn和高级特征选择算法的混合策略。将GoogLeNet和ResNet18配对进行互补特征提取,采用主成分分析(PCA)和奇异值分解(SVD)进行降维,采用方差分析(ANOVA)和卡方分析(χ 2)选择信息量最大的特征。使用提前停止的Adam优化器(学习率= 0.001,批大小= 20,epoch = 50)进行训练。在BRACOL数据集上的实验达到了99.78%的准确率,准确率、召回率和F1-score都超过了99%。据我们所知,本研究系统地将GoogLeNet和ResNet18与PCA/SVD降维和方差分析(ANOVA)/卡方特征选择相结合,用于咖啡疾病分类,从而解决了先前研究中的一个关键空白。
{"title":"Robust coffee plant disease classification using deep learning and advanced feature engineering techniques.","authors":"Hanin Ardah, Maher Alrahhal, Walaa M Abd-Elhafiez, Doaa Trabay","doi":"10.7717/peerj-cs.3386","DOIUrl":"10.7717/peerj-cs.3386","url":null,"abstract":"<p><p>Coffee, the world's most traded tropical crop, is vital to the economies of many producing countries. However, coffee leaf diseases pose a serious threat to coffee quality and sustainable production. Deep learning has shown strong performance in plant disease identification through automatic image classification. Nevertheless, reliance on a single convolutional neural networks (CNNs) architecture restricts feature variability and real-world generalization. Moreover, limited work has systematically combined feature selection/reduction with CNNs, which constrains the advancement of hybrid models capable of capturing complementary features while ensuring computational efficiency without accuracy loss. This article presents an enhanced deep learning-based framework for coffee disease classification incorporating a hybrid strategy that integrates CNNs and advanced feature selection algorithms. GoogLeNet and ResNet18 are paired for complementary feature extraction, Principal Component Analysis (PCA) and Singular Value Decomposition (SVD) are employed for dimensionality reduction, and ANOVA and Chi-square are applied to select the most informative features. An Adam optimizer (learning rate = 0.001, batch size = 20, epochs = 50) with early stopping is used for training. Experiments on the BRACOL dataset achieved 99.78% accuracy, with precision, recall, and F1-score all exceeding 99% across classes. To the best of our knowledge, this study systematically integrates GoogLeNet and ResNet18 with PCA/SVD dimensionality reduction and analysis of variance (ANOVA)/Chi-square feature selection, for coffee disease classification, thereby addressing a key gap in prior research.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3386"},"PeriodicalIF":2.5,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12695173/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KomoTrip: a multi-day travel itinerary recommendation method based on the discrete komodo mlipir algorithm. KomoTrip:一种基于离散komodo mlipir算法的多日游行程推荐方法。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-12 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3350
Z K Abdurahman Baizal, Soni Fajar Surya Gumilang, Rio Nurtantyana, Rahmat Hendrawan

Technological developments in recent years led to the emergence of increasingly sophisticated recommender systems to support multi-day travel itineraries that fall under the Tourist Trip Design Problem (TTDP). Various problem analogies are widely used to solve TTDP, such as Traveling Salesman Problem (TSP), Vehicle Routing Problem (VRP), Orienteering Problem (OP), and Team Orienteering Problem with Time Windows (TOPTW). For multi-day route recommendation, TOPTW is suitable as a problem analogy since there is a per-day travel duration constraint. So far, TTDP with TOPTW does not consider the weighting (priority level of users) for each requirement attribute in a multi-attribute-based TOPTW to ensure personalized recommendations. In addition, running time remains a challenge in many studies in the TOPTW area. Many metaheuristic algorithms have been adopted to TOPTW for generating a time-efficient approach. Komodo Mlipir Algorithm (KMA) emerges as a new algorithm that promises good scalability. Therefore, we propose KomoTrip, a method that adopts the discrete version of KMA and Multi-Attribute Utility Theory (MAUT) to recommend optimal travel routes per day by accommodating the multi-attribute preferences of users. We perform three evaluation scenarios, i.e., general performance, Degree of Interest (DOI) combinations, and varying numbers of Points of Interest (POI), consistently demonstrating that KomoTrip outperforms several benchmark algorithms in terms of computational time efficiency and also exhibits robust fitness values across different problem dimension scales. Thus, KomoTrip can be regarded as an efficient algorithm to recommend optimal multi-day tour routes, effectively incorporating weighted multi-attribute preferences into its optimization process. We further benchmarked KomoTrip against state-of-the-art TOPTW heuristics on the public Solomon dataset, where it demonstrated competitive profit values, particularly for a larger number of days (tours), and consistently achieved superior runtime performance.

近年来的技术发展导致越来越复杂的推荐系统的出现,以支持属于旅游行程设计问题(TTDP)的多日旅行行程。各种各样的问题类比被广泛用于解决TTDP问题,如旅行商问题(TSP)、车辆路线问题(VRP)、定向问题(OP)和带时间窗口的团队定向问题(TOPTW)。对于多日路线推荐,由于存在每日行程时间约束,所以TOPTW适合作为问题类比。到目前为止,使用TOPTW的TTDP没有考虑基于多属性的TOPTW中每个需求属性的权重(用户的优先级),以确保个性化的推荐。此外,在TOPTW领域的许多研究中,运行时间仍然是一个挑战。许多元启发式算法已被采用到TOPTW中,以生成一种时间效率高的方法。Komodo Mlipir算法(KMA)是一种具有良好可扩展性的新算法。因此,我们提出了KomoTrip方法,该方法采用离散版本的KMA和多属性效用理论(MAUT),通过容纳用户的多属性偏好来推荐每天最优的旅行路线。我们执行了三种评估场景,即一般性能、兴趣度(DOI)组合和不同数量的兴趣点(POI),一致地证明KomoTrip在计算时间效率方面优于几种基准算法,并且在不同的问题维度尺度上也表现出稳健的适应度值。因此,KomoTrip可以看作是一种高效的多日游路线推荐算法,它有效地将加权多属性偏好融入到优化过程中。我们进一步将KomoTrip与公共Solomon数据集上最先进的TOPTW启发式方法进行了基准测试,在那里它展示了具有竞争力的利润值,特别是对于更长的天数(旅行),并且始终取得了卓越的运行时性能。
{"title":"KomoTrip: a multi-day travel itinerary recommendation method based on the discrete komodo mlipir algorithm.","authors":"Z K Abdurahman Baizal, Soni Fajar Surya Gumilang, Rio Nurtantyana, Rahmat Hendrawan","doi":"10.7717/peerj-cs.3350","DOIUrl":"10.7717/peerj-cs.3350","url":null,"abstract":"<p><p>Technological developments in recent years led to the emergence of increasingly sophisticated recommender systems to support multi-day travel itineraries that fall under the Tourist Trip Design Problem (TTDP). Various problem analogies are widely used to solve TTDP, such as Traveling Salesman Problem (TSP), Vehicle Routing Problem (VRP), Orienteering Problem (OP), and Team Orienteering Problem with Time Windows (TOPTW). For multi-day route recommendation, TOPTW is suitable as a problem analogy since there is a per-day travel duration constraint. So far, TTDP with TOPTW does not consider the weighting (priority level of users) for each requirement attribute in a multi-attribute-based TOPTW to ensure personalized recommendations. In addition, running time remains a challenge in many studies in the TOPTW area. Many metaheuristic algorithms have been adopted to TOPTW for generating a time-efficient approach. Komodo Mlipir Algorithm (KMA) emerges as a new algorithm that promises good scalability. Therefore, we propose KomoTrip, a method that adopts the discrete version of KMA and Multi-Attribute Utility Theory (MAUT) to recommend optimal travel routes per day by accommodating the multi-attribute preferences of users. We perform three evaluation scenarios, <i>i.e</i>., general performance, Degree of Interest (DOI) combinations, and varying numbers of Points of Interest (POI), consistently demonstrating that KomoTrip outperforms several benchmark algorithms in terms of computational time efficiency and also exhibits robust fitness values across different problem dimension scales. Thus, KomoTrip can be regarded as an efficient algorithm to recommend optimal multi-day tour routes, effectively incorporating weighted multi-attribute preferences into its optimization process. We further benchmarked KomoTrip against state-of-the-art TOPTW heuristics on the public Solomon dataset, where it demonstrated competitive profit values, particularly for a larger number of days (tours), and consistently achieved superior runtime performance.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3350"},"PeriodicalIF":2.5,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12704617/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145770019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal image fusion for enhanced vehicle identification in intelligent transport. 智能交通中增强车辆识别的多模式图像融合。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-30 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3270
Naif Al Mudawi, Muhammad Waqas Ahmed, Haifa F Alhasson, Naif S Alshassari, Abdulwahab Alazeb, Mohammed Alshehri, Bayan Alabdullah

Target detection in remote sensing is essential for applications such as law enforcement, military surveillance, and search-and-rescue. With advancements in computational power, deep learning methods have excelled in processing unimodal aerial imagery. The availability of diverse imaging modalities including, infrared, hyperspectral, multispectral, synthetic aperture radar, and Light Detection and Ranging (LiDAR) allows researchers to leverage complementary data sources. Integrating these multi-modal datasets has significantly enhanced detection performance, making these technologies more effective in real-world scenarios. In this work, we propose a novel approach that employs a deep learning-based attention mechanism to generate depth maps from aerial images. These depth maps are fused with RGB images to achieve enhanced feature representation. For image segmentation, we use Markov Random Fields (MRF), and for object detection, we adopt the You Only Look Once (YOLOv4) framework. Furthermore, we introduce a hybrid feature extraction technique that combines Histogram of Oriented Gradients (HOG) and Binary Robust Invariant Scalable Keypoints (BRISK) descriptors within the Vision Transformer (ViT) framework. Finally, a Residual Network with 18 layers (ResNet-18) is used for classification. Our model is evaluated on three benchmark datasets Roundabout Aerial, AU-Air, and Vehicle Aerial Imagery Dataset (VAID) achieving precision scores of 98.4%, 96.2%, and 97.4%, respectively, for object detection. Experimental results demonstrate that our approach outperforms existing state-of-the-art methods in vehicle detection and classification for aerial imagery.

遥感中的目标探测对于执法、军事监视和搜索救援等应用至关重要。随着计算能力的进步,深度学习方法在处理单峰航空图像方面表现出色。多种成像模式的可用性,包括红外、高光谱、多光谱、合成孔径雷达和光探测和测距(LiDAR),使研究人员能够利用互补的数据源。整合这些多模态数据集大大提高了检测性能,使这些技术在现实场景中更加有效。在这项工作中,我们提出了一种采用基于深度学习的注意机制从航空图像生成深度图的新方法。这些深度图与RGB图像融合以实现增强的特征表示。对于图像分割,我们使用马尔可夫随机场(MRF),对于目标检测,我们采用You Only Look Once (YOLOv4)框架。此外,我们引入了一种混合特征提取技术,该技术结合了视觉转换(ViT)框架中的定向梯度直方图(HOG)和二值鲁棒不变可扩展关键点(BRISK)描述符。最后,使用一个18层的残差网络(ResNet-18)进行分类。我们的模型在三个基准数据集Roundabout Aerial、AU-Air和Vehicle Aerial Imagery Dataset (VAID)上进行了评估,目标检测的精度分别为98.4%、96.2%和97.4%。实验结果表明,我们的方法在航空图像的车辆检测和分类方面优于现有的最先进的方法。
{"title":"Multimodal image fusion for enhanced vehicle identification in intelligent transport.","authors":"Naif Al Mudawi, Muhammad Waqas Ahmed, Haifa F Alhasson, Naif S Alshassari, Abdulwahab Alazeb, Mohammed Alshehri, Bayan Alabdullah","doi":"10.7717/peerj-cs.3270","DOIUrl":"10.7717/peerj-cs.3270","url":null,"abstract":"<p><p>Target detection in remote sensing is essential for applications such as law enforcement, military surveillance, and search-and-rescue. With advancements in computational power, deep learning methods have excelled in processing unimodal aerial imagery. The availability of diverse imaging modalities including, infrared, hyperspectral, multispectral, synthetic aperture radar, and Light Detection and Ranging (LiDAR) allows researchers to leverage complementary data sources. Integrating these multi-modal datasets has significantly enhanced detection performance, making these technologies more effective in real-world scenarios. In this work, we propose a novel approach that employs a deep learning-based attention mechanism to generate depth maps from aerial images. These depth maps are fused with RGB images to achieve enhanced feature representation. For image segmentation, we use Markov Random Fields (MRF), and for object detection, we adopt the You Only Look Once (YOLOv4) framework. Furthermore, we introduce a hybrid feature extraction technique that combines Histogram of Oriented Gradients (HOG) and Binary Robust Invariant Scalable Keypoints (BRISK) descriptors within the Vision Transformer (ViT) framework. Finally, a Residual Network with 18 layers (ResNet-18) is used for classification. Our model is evaluated on three benchmark datasets Roundabout Aerial, AU-Air, and Vehicle Aerial Imagery Dataset (VAID) achieving precision scores of 98.4%, 96.2%, and 97.4%, respectively, for object detection. Experimental results demonstrate that our approach outperforms existing state-of-the-art methods in vehicle detection and classification for aerial imagery.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3270"},"PeriodicalIF":2.5,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12594218/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145483889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CrossAlignNet: a self-supervised feature learning framework for 3D point cloud understanding. crosssalignnet:用于三维点云理解的自监督特征学习框架。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-19 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3194
Fei Wang, Xingzhen Dong, Jia Wu, Weishi Zhang, Tuo Zhou

We propose a self-supervised point cloud representation learning framework CrossAlignNet based on cross-modal mask alignment strategy, to solve the problems of imbalance between global semantic and local geometric feature learning, as well as cross-modal information asymmetry in existing methods. A geometrically consistent mask region is established between the point cloud patches and the corresponding image patches through a synchronized mask alignment strategy to ensure cross-modal information symmetry. A dual-task learning framework is designed: the global semantic alignment task enhances the cross-modal semantic consistency through contrastive learning, and the local mask reconstruction task fuses the image cues using the cross-attention mechanism to recover the local geometric structure of the masked point cloud. In addition, the ShapeNet3D-CMA dataset is constructed to provide accurate point cloud-image spatial mapping relations to support cross-modal learning. Our framework shows superior or comparative results against existing methods on three point cloud understanding tasks including object classification, few-shot classification, and part segmentation.

针对现有方法中存在的全局语义学习与局部几何特征学习不平衡以及跨模态信息不对称等问题,提出了一种基于跨模态掩模对齐策略的自监督点云表示学习框架CrossAlignNet。通过同步掩模对齐策略,在点云补丁和相应图像补丁之间建立几何一致的掩模区域,保证跨模态信息对称。设计了双任务学习框架:全局语义对齐任务通过对比学习增强跨模态语义一致性;局部掩模重建任务利用交叉注意机制融合图像线索,恢复被掩点云的局部几何结构。此外,构建ShapeNet3D-CMA数据集,提供准确的点云图空间映射关系,支持跨模态学习。我们的框架在三个点云理解任务(包括目标分类、少镜头分类和部分分割)上显示出优于或与现有方法比较的结果。
{"title":"CrossAlignNet: a self-supervised feature learning framework for 3D point cloud understanding.","authors":"Fei Wang, Xingzhen Dong, Jia Wu, Weishi Zhang, Tuo Zhou","doi":"10.7717/peerj-cs.3194","DOIUrl":"10.7717/peerj-cs.3194","url":null,"abstract":"<p><p>We propose a self-supervised point cloud representation learning framework CrossAlignNet based on cross-modal mask alignment strategy, to solve the problems of imbalance between global semantic and local geometric feature learning, as well as cross-modal information asymmetry in existing methods. A geometrically consistent mask region is established between the point cloud patches and the corresponding image patches through a synchronized mask alignment strategy to ensure cross-modal information symmetry. A dual-task learning framework is designed: the global semantic alignment task enhances the cross-modal semantic consistency through contrastive learning, and the local mask reconstruction task fuses the image cues using the cross-attention mechanism to recover the local geometric structure of the masked point cloud. In addition, the ShapeNet3D-CMA dataset is constructed to provide accurate point cloud-image spatial mapping relations to support cross-modal learning. Our framework shows superior or comparative results against existing methods on three point cloud understanding tasks including object classification, few-shot classification, and part segmentation.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3194"},"PeriodicalIF":2.5,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453720/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A method for semantic textual similarity on long texts. 长文本语义相似度的一种方法。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-19 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3202
Omar Zatarain, Juan Carlos González-Castolo, Silvia Ramos-Cabral

This work introduces a method for the semantic similarity of long documents using sentence transformers and large language models. The method detects relevant information from a pair of long texts by exploiting sentence transformers and large language models. The degree of similarity is obtained with an analytical fuzzy strategy that enables selective iterative retrieval under noisy conditions. The method discards the least similar pairs of sentences and selects the most similar. The preprocessing consists of splitting texts into sentences. The analytical strategy classifies pairs of texts by a degree of similarity without prior training on a dataset of long documents. Instead, it uses pre-trained models with any token capacity, a set of fuzzy parameters is tuned based on a few assessment iterations, and the parameters are updated based on criteria to detect four classes of similarity: identical, same topic, concept related, and non-related. This method can be employed in both small sentence transformers and large language models to detect similarity between pairs of documents of random sizes and avoid truncation of texts by testing pairs of sentences. A dataset of long texts in English from Wikipedia and other public sources, jointly with its gold standard, is provided and reviewed to test the method's performance. The method's performance is tested with small-token-size sentence transformers, large language models (LLMs), and text pairs split into sentences. Results prove that smaller sentence transformers are reliable for obtaining the similarity on long texts and indicate this method is an economical alternative to the increasing need for larger language models to find the degree of similarity between two long texts and extract the relevant information. Code and datasets are available at: https://github.com/omarzatarain/long-texts-similarity. Results of the adjustment of parameters can be found at https://doi.org/10.6084/m9.figshare.29082791.

本文介绍了一种使用句子转换器和大型语言模型的长文档语义相似度的方法。该方法通过利用句子转换器和大型语言模型从一对长文本中检测相关信息。采用模糊分析策略,在噪声条件下实现选择性迭代检索,从而获得相似度。该方法丢弃最不相似的句子对,选择最相似的句子对。预处理包括将文本分成句子。该分析策略通过相似度对文本进行分类,而无需事先在长文档数据集上进行训练。相反,它使用具有任何标记容量的预训练模型,根据几次评估迭代调整一组模糊参数,并根据标准更新参数,以检测四类相似性:相同、相同主题、概念相关和不相关。该方法既可以应用于小型句子转换器,也可以应用于大型语言模型,通过对句子的测试来检测随机大小的文档对之间的相似度,避免文本的截断。本文提供了一个来自维基百科和其他公共资源的英文长文本数据集,并对其黄金标准进行了审查,以测试该方法的性能。该方法的性能通过小标记大小的句子转换器、大型语言模型(llm)和分割成句子的文本对来测试。结果证明,较小的句子转换器对于获得长文本的相似度是可靠的,并且表明该方法是一种经济的替代方法,以满足越来越需要更大的语言模型来查找两个长文本之间的相似程度并提取相关信息。代码和数据集可在:https://github.com/omarzatarain/long-texts-similarity。参数调整结果可在https://doi.org/10.6084/m9.figshare.29082791上找到。
{"title":"A method for semantic textual similarity on long texts.","authors":"Omar Zatarain, Juan Carlos González-Castolo, Silvia Ramos-Cabral","doi":"10.7717/peerj-cs.3202","DOIUrl":"10.7717/peerj-cs.3202","url":null,"abstract":"<p><p>This work introduces a method for the semantic similarity of long documents using sentence transformers and large language models. The method detects relevant information from a pair of long texts by exploiting sentence transformers and large language models. The degree of similarity is obtained with an analytical fuzzy strategy that enables selective iterative retrieval under noisy conditions. The method discards the least similar pairs of sentences and selects the most similar. The preprocessing consists of splitting texts into sentences. The analytical strategy classifies pairs of texts by a degree of similarity without prior training on a dataset of long documents. Instead, it uses pre-trained models with any token capacity, a set of fuzzy parameters is tuned based on a few assessment iterations, and the parameters are updated based on criteria to detect four classes of similarity: identical, same topic, concept related, and non-related. This method can be employed in both small sentence transformers and large language models to detect similarity between pairs of documents of random sizes and avoid truncation of texts by testing pairs of sentences. A dataset of long texts in English from Wikipedia and other public sources, jointly with its gold standard, is provided and reviewed to test the method's performance. The method's performance is tested with small-token-size sentence transformers, large language models (LLMs), and text pairs split into sentences. Results prove that smaller sentence transformers are reliable for obtaining the similarity on long texts and indicate this method is an economical alternative to the increasing need for larger language models to find the degree of similarity between two long texts and extract the relevant information. Code and datasets are available at: https://github.com/omarzatarain/long-texts-similarity. Results of the adjustment of parameters can be found at https://doi.org/10.6084/m9.figshare.29082791.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3202"},"PeriodicalIF":2.5,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453783/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid ARIMA-LSTM for COVID-19 forecasting: a comparative AI modeling study. 混合ARIMA-LSTM预测新冠肺炎:人工智能模型的比较研究。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-19 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3195
Al Mahmud, Syed Husni Noor Syed Hatim Noor, Kamarul Imran Musa, Firdaus Mohamad Hamzah, Zainab Mat Yudin, Noorshaida Kamaruddin, Ashwini M Madawana, Mohamad Arif Awang Nawi

Pandemics present critical challenges to global health systems, economies, and societal structures, necessitating the development of accurate forecasting models for effective intervention and resource allocation. Classical statistical models such as the autoregressive integrated moving average (ARIMA) have been widely employed in epidemiological forecasting; however, they struggle to capture the nonlinear trends and dynamic fluctuations inherent in pandemic data. Conversely, deep learning models such as long short-term memory (LSTM) networks demonstrate strong capabilities in modeling complex dependencies but often require substantial data and computational resources. To boost forecasting precision, hybrid models such as ARIMA-LSTM integrate the advantages of traditional and deep learning methods. This study evaluates and compares the performance of ARIMA, LSTM, and hybrid ARIMA-LSTM models in predicting pandemic trends, using COVID-19 data from the Malaysian Ministry of Health as a case study. The dataset covers the period from 4 January 2021 to 18 September 2021, and model performance is evaluated using key metrics, including mean squared error (MSE), mean absolute error (MAE), mean absolute percentage error (MAPE), root mean squared error (RMSE), relative root mean squared error (RRMSE), normalized root mean squared error (NRMSE), and the coefficient of determination (R2). The results demonstrate that ARIMA performs poorly in capturing pandemic trends, while LSTM improves forecasting accuracy. However, the hybrid ARIMA-LSTM model consistently achieves the lowest error rates, confirming the advantage of integrating statistical and deep learning methodologies. All findings support the adoption of hybrid modeling approaches for pandemic forecasting, contributing to more accurate and reliable predictive analytics in epidemiology. Future research should investigate the generalizability of hybrid models across various infectious diseases and integrate additional real-time external variables to improve forecasting reliability.

大流行对全球卫生系统、经济和社会结构构成重大挑战,需要开发准确的预测模型,以进行有效干预和资源分配。自回归综合移动平均(ARIMA)等经典统计模型已广泛应用于流行病学预测;然而,它们难以捕捉大流行数据中固有的非线性趋势和动态波动。相反,长短期记忆(LSTM)网络等深度学习模型在建模复杂依赖关系方面表现出强大的能力,但通常需要大量的数据和计算资源。为了提高预测精度,ARIMA-LSTM等混合模型整合了传统和深度学习方法的优点。本研究以马来西亚卫生部的COVID-19数据为例,评估和比较了ARIMA、LSTM和ARIMA-LSTM混合模型在预测大流行趋势方面的表现。该数据集涵盖了2021年1月4日至2021年9月18日期间,模型性能使用关键指标进行评估,包括均方误差(MSE)、平均绝对误差(MAE)、平均绝对百分比误差(MAPE)、均方根误差(RMSE)、相对均方根误差(RRMSE)、标准化均方根误差(NRMSE)和决定系数(R2)。结果表明,ARIMA在捕捉大流行趋势方面表现不佳,而LSTM提高了预测准确性。然而,混合ARIMA-LSTM模型始终实现最低的错误率,证实了将统计和深度学习方法相结合的优势。所有研究结果都支持采用混合建模方法进行大流行预测,有助于在流行病学中进行更准确和可靠的预测分析。未来的研究应探讨混合模型在各种传染病中的通用性,并整合额外的实时外部变量以提高预测的可靠性。
{"title":"Hybrid ARIMA-LSTM for COVID-19 forecasting: a comparative AI modeling study.","authors":"Al Mahmud, Syed Husni Noor Syed Hatim Noor, Kamarul Imran Musa, Firdaus Mohamad Hamzah, Zainab Mat Yudin, Noorshaida Kamaruddin, Ashwini M Madawana, Mohamad Arif Awang Nawi","doi":"10.7717/peerj-cs.3195","DOIUrl":"10.7717/peerj-cs.3195","url":null,"abstract":"<p><p>Pandemics present critical challenges to global health systems, economies, and societal structures, necessitating the development of accurate forecasting models for effective intervention and resource allocation. Classical statistical models such as the autoregressive integrated moving average (ARIMA) have been widely employed in epidemiological forecasting; however, they struggle to capture the nonlinear trends and dynamic fluctuations inherent in pandemic data. Conversely, deep learning models such as long short-term memory (LSTM) networks demonstrate strong capabilities in modeling complex dependencies but often require substantial data and computational resources. To boost forecasting precision, hybrid models such as ARIMA-LSTM integrate the advantages of traditional and deep learning methods. This study evaluates and compares the performance of ARIMA, LSTM, and hybrid ARIMA-LSTM models in predicting pandemic trends, using COVID-19 data from the Malaysian Ministry of Health as a case study. The dataset covers the period from 4 January 2021 to 18 September 2021, and model performance is evaluated using key metrics, including mean squared error (MSE), mean absolute error (MAE), mean absolute percentage error (MAPE), root mean squared error (RMSE), relative root mean squared error (RRMSE), normalized root mean squared error (NRMSE), and the coefficient of determination (R<sup>2</sup>). The results demonstrate that ARIMA performs poorly in capturing pandemic trends, while LSTM improves forecasting accuracy. However, the hybrid ARIMA-LSTM model consistently achieves the lowest error rates, confirming the advantage of integrating statistical and deep learning methodologies. All findings support the adoption of hybrid modeling approaches for pandemic forecasting, contributing to more accurate and reliable predictive analytics in epidemiology. Future research should investigate the generalizability of hybrid models across various infectious diseases and integrate additional real-time external variables to improve forecasting reliability.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3195"},"PeriodicalIF":2.5,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453849/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Arabic hate speech detection using deep learning: a state-of-the-art survey of advances, challenges, and future directions (2020-2024). 使用深度学习的阿拉伯仇恨言论检测:最新进展、挑战和未来方向的调查(2020-2024)。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-17 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3133
Mariam Itriq, Mohd Halim Mohd Noor

The proliferation of social media has intensified concerns about the societal and psychological impacts of hate speech, particularly in Arabic-speaking communities, where dialectal diversity, morphological complexity, and sociopolitical factors complicate detection. Despite platform efforts, the automated detection of Arabic hate speech remains challenging owing to limited annotated datasets and linguistic nuances. This survey reviews the advances (2020-2024) in deep learning approaches, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), transformer-based models (e.g., bidirectional encoder representations from transformers (BERT) and AraBERT), and hybrid architectures for Arabic hate speech detection. It further examines the dataset constraints involving dialectal variation, annotation inconsistencies, and scarcity. The analysis identified critical research gaps and proposed future directions: expanding multilingual datasets, enhancing contextual modeling, and developing ethically grounded frameworks. This review consolidates state-of-the-art methodologies to guide effective countermeasures against Arabic online hate speech.

社交媒体的激增加剧了人们对仇恨言论的社会和心理影响的担忧,特别是在讲阿拉伯语的社区,那里的方言多样性、形态复杂性和社会政治因素使检测复杂化。尽管有平台的努力,由于有限的注释数据集和语言的细微差别,阿拉伯仇恨言论的自动检测仍然具有挑战性。本调查回顾了深度学习方法的进展(2020-2024),包括卷积神经网络(cnn)、循环神经网络(rnn)、基于变压器的模型(例如,来自变压器的双向编码器表示(BERT)和AraBERT),以及用于阿拉伯仇恨言论检测的混合架构。它进一步检查了涉及方言变化、注释不一致和稀缺性的数据集约束。该分析确定了关键的研究差距,并提出了未来的方向:扩展多语言数据集,增强上下文建模,开发基于伦理的框架。本综述整合了最先进的方法,以指导针对阿拉伯语在线仇恨言论的有效对策。
{"title":"Arabic hate speech detection using deep learning: a state-of-the-art survey of advances, challenges, and future directions (2020-2024).","authors":"Mariam Itriq, Mohd Halim Mohd Noor","doi":"10.7717/peerj-cs.3133","DOIUrl":"10.7717/peerj-cs.3133","url":null,"abstract":"<p><p>The proliferation of social media has intensified concerns about the societal and psychological impacts of hate speech, particularly in Arabic-speaking communities, where dialectal diversity, morphological complexity, and sociopolitical factors complicate detection. Despite platform efforts, the automated detection of Arabic hate speech remains challenging owing to limited annotated datasets and linguistic nuances. This survey reviews the advances (2020-2024) in deep learning approaches, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), transformer-based models (<i>e.g</i>., bidirectional encoder representations from transformers (BERT) and AraBERT), and hybrid architectures for Arabic hate speech detection. It further examines the dataset constraints involving dialectal variation, annotation inconsistencies, and scarcity. The analysis identified critical research gaps and proposed future directions: expanding multilingual datasets, enhancing contextual modeling, and developing ethically grounded frameworks. This review consolidates state-of-the-art methodologies to guide effective countermeasures against Arabic online hate speech.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3133"},"PeriodicalIF":2.5,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453721/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring mHealth interventions for medication management: a scoping review of digital tools, implementation barriers, and patient outcomes. 探索移动医疗干预药物管理:数字工具、实施障碍和患者结果的范围审查。
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-17 eCollection Date: 2025-01-01 DOI: 10.7717/peerj-cs.3190
Xuye Wang, Beibei Wang, Wan Yin Tew, Xiaoning Yang, Xiangyang Xu, Yifang Gao, Yongjia Chen, Mun Fei Yam

Background: Medication non-adherence remains a significant global healthcare challenge, resulting in inadequate disease management, increased hospitalisations, and higher healthcare costs. Mobile health (mHealth) applications have emerged as promising digital health tools for enhancing medication adherence through real-time monitoring, personalised reminders, artificial intelligence (AI)-driven interventions, and improved patient engagement.

Objectives: This scoping review examines the effectiveness, key features, and challenges of mHealth applications in promoting medication adherence across diverse patient populations and healthcare settings. It also seeks to identify research gaps and inform future development and implementation strategies for digital therapeutics.

Eligibility criteria: Studies published between 2020 and 2024 were included if they investigated the use of mHealth applications to improve medication adherence and reported outcomes related to adherence rates, patient health indicators, or user engagement. Only studies with empirical data, including randomised controlled trials, observational studies, or mixed-methods research, were considered.

Sources of evidence: A comprehensive search was conducted across Scopus, Web of Science, PubMed/MEDLINE, Google Scholar, and CINAHL databases. In total, 319 studies met the inclusion criteria following a systematic screening process based on Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) guidelines.

Charting methods: Data were extracted on study design, app functionalities, patient demographics, adherence outcomes, and barriers to adoption. The charted data were thematically synthesised to identify trends, success factors, and limitations.

Results: Among the included studies, 85% reported improved medication adherence associated with features such as personalised medication reminders, real-time health tracking, and AI-powered adherence prediction. Clinical outcomes were also frequently observed, including improved blood pressure, glucose control, and patient-reported quality of life. Key barriers to adoption included limited digital literacy, concerns about data privacy, socioeconomic disparities, and a lack of integration with electronic health records (EHRs).

Conclusions: mHealth applications show significant potential to improve medication adherence and health outcomes, particularly in the management of chronic diseases. However, inclusive design, robust data privacy frameworks, and evidence-based implementation strategies are essential for scalability and sustained impact. Future research should focus on long-term effectiveness, cost-efficiency, and integration of mHealth tools within broader healthcare systems.

背景:药物依从性不遵医嘱仍然是一个重大的全球卫生保健挑战,导致疾病管理不足、住院人数增加和卫生保健费用增加。移动医疗(mHealth)应用程序已经成为一种很有前途的数字医疗工具,通过实时监测、个性化提醒、人工智能(AI)驱动的干预措施和提高患者参与度来增强药物依从性。目的:本综述考察了移动健康应用在不同患者群体和医疗环境中促进药物依从性的有效性、关键特征和挑战。它还寻求确定研究差距,并为数字治疗的未来发展和实施战略提供信息。入选标准:在2020年至2024年间发表的研究,如果调查了移动医疗应用程序的使用,以提高药物依从性,并报告了与依从率、患者健康指标或用户参与度相关的结果,则纳入研究。仅考虑具有经验数据的研究,包括随机对照试验、观察性研究或混合方法研究。证据来源:在Scopus、Web of Science、PubMed/MEDLINE、谷歌Scholar和CINAHL数据库中进行了全面的搜索。总共有319项研究符合纳入标准,系统筛选过程基于系统评价的首选报告项目和扩展范围评价的荟萃分析(PRISMA-ScR)指南。图表方法:提取研究设计、应用程序功能、患者人口统计学、依从性结果和采用障碍方面的数据。图表数据按主题进行综合,以确定趋势、成功因素和局限性。结果:在纳入的研究中,85%的研究报告了与个性化药物提醒、实时健康跟踪和人工智能依从性预测等功能相关的药物依从性改善。临床结果也经常被观察到,包括改善的血压、血糖控制和患者报告的生活质量。采用的主要障碍包括数字素养有限、对数据隐私的担忧、社会经济差异以及与电子健康记录(EHRs)缺乏整合。结论:移动健康应用显示出改善药物依从性和健康结果的巨大潜力,特别是在慢性病管理方面。然而,包容性设计、稳健的数据隐私框架和基于证据的实施策略对于可扩展性和持续影响至关重要。未来的研究应该集中在长期有效性、成本效益和移动医疗工具在更广泛的医疗系统中的集成上。
{"title":"Exploring mHealth interventions for medication management: a scoping review of digital tools, implementation barriers, and patient outcomes.","authors":"Xuye Wang, Beibei Wang, Wan Yin Tew, Xiaoning Yang, Xiangyang Xu, Yifang Gao, Yongjia Chen, Mun Fei Yam","doi":"10.7717/peerj-cs.3190","DOIUrl":"10.7717/peerj-cs.3190","url":null,"abstract":"<p><strong>Background: </strong>Medication non-adherence remains a significant global healthcare challenge, resulting in inadequate disease management, increased hospitalisations, and higher healthcare costs. Mobile health (mHealth) applications have emerged as promising digital health tools for enhancing medication adherence through real-time monitoring, personalised reminders, artificial intelligence (AI)-driven interventions, and improved patient engagement.</p><p><strong>Objectives: </strong>This scoping review examines the effectiveness, key features, and challenges of mHealth applications in promoting medication adherence across diverse patient populations and healthcare settings. It also seeks to identify research gaps and inform future development and implementation strategies for digital therapeutics.</p><p><strong>Eligibility criteria: </strong>Studies published between 2020 and 2024 were included if they investigated the use of mHealth applications to improve medication adherence and reported outcomes related to adherence rates, patient health indicators, or user engagement. Only studies with empirical data, including randomised controlled trials, observational studies, or mixed-methods research, were considered.</p><p><strong>Sources of evidence: </strong>A comprehensive search was conducted across Scopus, Web of Science, PubMed/MEDLINE, Google Scholar, and CINAHL databases. In total, 319 studies met the inclusion criteria following a systematic screening process based on Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) guidelines.</p><p><strong>Charting methods: </strong>Data were extracted on study design, app functionalities, patient demographics, adherence outcomes, and barriers to adoption. The charted data were thematically synthesised to identify trends, success factors, and limitations.</p><p><strong>Results: </strong>Among the included studies, 85% reported improved medication adherence associated with features such as personalised medication reminders, real-time health tracking, and AI-powered adherence prediction. Clinical outcomes were also frequently observed, including improved blood pressure, glucose control, and patient-reported quality of life. Key barriers to adoption included limited digital literacy, concerns about data privacy, socioeconomic disparities, and a lack of integration with electronic health records (EHRs).</p><p><strong>Conclusions: </strong>mHealth applications show significant potential to improve medication adherence and health outcomes, particularly in the management of chronic diseases. However, inclusive design, robust data privacy frameworks, and evidence-based implementation strategies are essential for scalability and sustained impact. Future research should focus on long-term effectiveness, cost-efficiency, and integration of mHealth tools within broader healthcare systems.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3190"},"PeriodicalIF":2.5,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453750/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PeerJ Computer Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1