首页 > 最新文献

Frontiers in Artificial Intelligence最新文献

英文 中文
Optimized ensemble machine learning model for cyberattack classification in industrial IoT. 工业物联网网络攻击分类的优化集成机器学习模型。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1685376
Batool Alabdullah, Suresh Sankaranarayanan

Introduction: The increasing cyber threats targeting industrial control systems (ICS) and the Internet of Things (IoT) pose significant risks, especially in critical infrastructures like the oil and gas sector. Existing machine learning (ML) approaches for cyberattack detection often rely on binary classification and lack computational efficiency.

Methods: This study proposes two optimized stacked ensemble models to enhance attack detection accuracy while reducing computational overhead. The main contribution lies in the strategic selection and integration of diverse base models, such as Logistic Regression, Extra Tree Classifier, XGBoost, and LGBM, with RFC as the final estimator. These models are chosen to address unique characteristics of security datasets, such as class imbalance, noise, and complex attack patterns. This combination aims to leverage different decision boundaries and learning mechanisms.

Results: Evaluations show that the Stacked Ensemble_2 model achieves 97% accuracy with a training and testing computation time of 54 minutes. Stacked Ensemble_2, which excelled over the traditional Stacked Ensemble_1, was also evaluated on the CICIDS 2017 dataset, achieving an impressive 100% accuracy with an AUROC of 99%.

Discussion: The results indicate that the proposed Stacked Ensemble_2 model provides a scalable, real-time detection mechanism for securing ICS and IoT environments. By proving its effectiveness on unseen data, this model demonstrates a significant advancement over traditional methods, offering enhanced accuracy and efficiency in detecting sophisticated cyber threats in critical infrastructure sectors.

导语:越来越多的针对工业控制系统(ICS)和物联网(IoT)的网络威胁构成了重大风险,特别是在石油和天然气行业等关键基础设施中。现有的机器学习(ML)网络攻击检测方法往往依赖于二进制分类,缺乏计算效率。方法:提出两种优化的堆叠集成模型,在降低计算开销的同时提高攻击检测精度。主要贡献在于策略性地选择和整合各种基本模型,如Logistic回归、Extra Tree Classifier、XGBoost和LGBM,并以RFC作为最终的估计器。选择这些模型是为了解决安全数据集的独特特征,例如类不平衡、噪声和复杂的攻击模式。这种组合旨在利用不同的决策边界和学习机制。结果:评价表明,该模型的训练和测试计算时间为54分钟,准确率达到97%。在CICIDS 2017数据集上,对优于传统堆叠Ensemble_1的堆叠Ensemble_2进行了评估,达到了令人印象深刻的100%准确率和99%的AUROC。讨论:结果表明,所提出的堆叠集成模型为保护ICS和物联网环境提供了一种可扩展的实时检测机制。通过证明其在看不见的数据上的有效性,该模型显示了比传统方法的重大进步,在检测关键基础设施部门的复杂网络威胁方面提供了更高的准确性和效率。
{"title":"Optimized ensemble machine learning model for cyberattack classification in industrial IoT.","authors":"Batool Alabdullah, Suresh Sankaranarayanan","doi":"10.3389/frai.2025.1685376","DOIUrl":"10.3389/frai.2025.1685376","url":null,"abstract":"<p><strong>Introduction: </strong>The increasing cyber threats targeting industrial control systems (ICS) and the Internet of Things (IoT) pose significant risks, especially in critical infrastructures like the oil and gas sector. Existing machine learning (ML) approaches for cyberattack detection often rely on binary classification and lack computational efficiency.</p><p><strong>Methods: </strong>This study proposes two optimized stacked ensemble models to enhance attack detection accuracy while reducing computational overhead. The main contribution lies in the strategic selection and integration of diverse base models, such as Logistic Regression, Extra Tree Classifier, XGBoost, and LGBM, with RFC as the final estimator. These models are chosen to address unique characteristics of security datasets, such as class imbalance, noise, and complex attack patterns. This combination aims to leverage different decision boundaries and learning mechanisms.</p><p><strong>Results: </strong>Evaluations show that the Stacked Ensemble_2 model achieves 97% accuracy with a training and testing computation time of 54 minutes. Stacked Ensemble_2, which excelled over the traditional Stacked Ensemble_1, was also evaluated on the CICIDS 2017 dataset, achieving an impressive 100% accuracy with an AUROC of 99%.</p><p><strong>Discussion: </strong>The results indicate that the proposed Stacked Ensemble_2 model provides a scalable, real-time detection mechanism for securing ICS and IoT environments. By proving its effectiveness on unseen data, this model demonstrates a significant advancement over traditional methods, offering enhanced accuracy and efficiency in detecting sophisticated cyber threats in critical infrastructure sectors.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1685376"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12832753/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RiCoRecA: rich cooking recipe annotation schema. RiCoRecA:丰富的烹饪食谱注释模式。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1550604
Filippos Ventirozos, Mauricio Jacobo-Romero, Haifa Alrdahi, Sarah Clinch, Riza Batista-Navarro

Despite recent advancements, modern kitchens, at best, have one or more isolated (non-communicating) "smart" devices. The vision of having a fully-fledged ambient kitchen where devices know what to do and when has yet to be realized. To address this, we present RiCoRecA, a novel schema for parsing cooking recipes into a workflow representation suitable for automation, a step toward that direction. Methodologically, the schema requires a number of information extraction tasks, i.e., annotating named entities, identifying relations between them, coreference resolution, and entity tracking. RiCoRecA differs from previously reported approaches in that it learns these different information extraction tasks using one joint model. We also provide a dataset containing annotations that follow this schema. Furthermore, we compared two transformer-based models for parsing recipes into workflows, namely, PEGASUS-X and LongT5. Our results demonstrate that PEGASUS-X surpassed LongT5 on all of the annotation tasks. Specifically, PEGASUS-X surpassed LongT5 by 39% in terms of F-Score when averaging the performance on all the tasks; it demonstrated almost human-like performance.

尽管最近取得了进步,但现代厨房最多只有一个或多个孤立的(非通信的)“智能”设备。拥有一个完全成熟的环境厨房,设备知道什么时候做什么,这一愿景尚未实现。为了解决这个问题,我们提出了RiCoRecA,这是一种新颖的模式,用于将烹饪食谱解析为适合自动化的工作流表示形式,朝着这个方向迈出了一步。在方法上,该模式需要许多信息提取任务,即注释命名实体、识别它们之间的关系、共同引用解析和实体跟踪。RiCoRecA与之前报道的方法不同,它使用一个联合模型来学习这些不同的信息提取任务。我们还提供了一个包含遵循此模式的注释的数据集。此外,我们比较了用于将食谱解析为工作流的两个基于转换器的模型,即PEGASUS-X和LongT5。我们的结果表明PEGASUS-X在所有注释任务上都超过了LongT5。具体来说,PEGASUS-X在所有任务的平均性能方面的F-Score超过了LongT5 39%;它展示了几乎与人类相似的表现。
{"title":"RiCoRecA: <i>ri</i>ch <i>co</i>oking <i>rec</i>ipe <i>a</i>nnotation schema.","authors":"Filippos Ventirozos, Mauricio Jacobo-Romero, Haifa Alrdahi, Sarah Clinch, Riza Batista-Navarro","doi":"10.3389/frai.2025.1550604","DOIUrl":"10.3389/frai.2025.1550604","url":null,"abstract":"<p><p>Despite recent advancements, modern kitchens, at best, have one or more isolated (non-communicating) \"smart\" devices. The vision of having a fully-fledged ambient kitchen where devices know what to do and when has yet to be realized. To address this, we present RiCoRecA, a novel schema for parsing cooking recipes into a workflow representation suitable for automation, a step toward that direction. Methodologically, the schema requires a number of information extraction tasks, i.e., annotating named entities, identifying relations between them, coreference resolution, and entity tracking. RiCoRecA differs from previously reported approaches in that it learns these different information extraction tasks using one joint model. We also provide a dataset containing annotations that follow this schema. Furthermore, we compared two transformer-based models for parsing recipes into workflows, namely, PEGASUS-X and LongT5. Our results demonstrate that PEGASUS-X surpassed LongT5 on all of the annotation tasks. Specifically, PEGASUS-X surpassed LongT5 by 39% in terms of F-Score when averaging the performance on all the tasks; it demonstrated almost human-like performance.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1550604"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12833278/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced multi-class object detector for bone fracture diagnosis with prescription recommendation. 基于处方推荐的增强多类目标检测器用于骨折诊断。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1692894
Daudi Mashauri Migayo, Shubi Kaijage, Stephen Swetala, Devotha G Nyambo

Bone fractures are among the most prominent injuries in the modern world that affect all ages and races. Traditional treatment involves radiographic imaging that relies heavily on radiologists manually analyzing images. There have been efforts to develop computer-aided diagnosis tools that employ artificial intelligence and deep learning approaches. Existing literature focuses on developing tools that only detect and classify bone fractures, rather than addressing the broader issue of bone fracture management. However, evidence of scholarly works that include treatment recommendations is still lacking. Furthermore, deep learning-based object detectors that achieve state-of-the-art results are computationally expensive and considered as black-box solutions. Developing countries, such as Sub-Saharan Africa, face a shortage of radiologists and orthopedists. For this reason, this paper proposes a methodological approach that uses a more efficient object detection model to diagnose long bone fractures and provide prescription recommendations. An enhanced anchoring process, known as adaptive anchoring, is proposed to improve the performance of the Regional Proposal Network and the object detection model. A Faster R-CNN model with ResNet-50/101 and ResNext-50/101 backbones was used to develop an object detection model that uses X-ray images as input. To understand and interpret the model's decision, a Gradient-based Class Activation Mapping method was used to assess the model's learnability. The results indicate that the proposed adaptive anchoring approach can improve computational efficiency, reducing training time by up to 29% compared to the traditional approach. Model accuracy during training and validation ranged between 94% and 98%. Overall, adaptive anchoring performed better when applied with the ResNet-101 backbone, yielding an Average Precision of 92.73%, an F1 score of 96.01%, a precision of 96.80%, and a recall of 95.23%. The study provides valuable insights into the use of computationally efficient deep learning models for medical recommendation systems. Future studies should develop models to diagnose fractures using input images from various modalities and to provide prescription recommendations.

骨折是现代世界中影响所有年龄和种族的最突出的伤害之一。传统的治疗包括放射成像,严重依赖放射科医生手动分析图像。人们一直在努力开发利用人工智能和深度学习方法的计算机辅助诊断工具。现有文献侧重于开发仅检测和分类骨折的工具,而不是解决骨折管理的更广泛问题。然而,包括治疗建议的学术著作的证据仍然缺乏。此外,基于深度学习的目标检测器实现了最先进的结果,计算成本很高,被认为是黑盒解决方案。发展中国家,如撒哈拉以南非洲,面临着放射科医生和骨科医生的短缺。因此,本文提出了一种方法学方法,使用更有效的目标检测模型来诊断长骨骨折并提供处方建议。提出了一种增强的锚定过程,称为自适应锚定,以提高区域建议网络和目标检测模型的性能。采用基于ResNet-50/101和ResNext-50/101主干的Faster R-CNN模型,开发以x射线图像为输入的目标检测模型。为了理解和解释模型的决策,使用基于梯度的类激活映射方法来评估模型的可学习性。结果表明,与传统方法相比,所提出的自适应锚定方法可以提高计算效率,减少高达29%的训练时间。模型在训练和验证期间的准确率在94%到98%之间。总体而言,自适应锚定在ResNet-101骨干网中表现更好,平均精度为92.73%,F1分数为96.01%,精度为96.80%,召回率为95.23%。该研究为医疗推荐系统使用计算效率高的深度学习模型提供了有价值的见解。未来的研究应该建立模型,利用不同模式的输入图像来诊断骨折,并提供处方建议。
{"title":"Enhanced multi-class object detector for bone fracture diagnosis with prescription recommendation.","authors":"Daudi Mashauri Migayo, Shubi Kaijage, Stephen Swetala, Devotha G Nyambo","doi":"10.3389/frai.2025.1692894","DOIUrl":"10.3389/frai.2025.1692894","url":null,"abstract":"<p><p>Bone fractures are among the most prominent injuries in the modern world that affect all ages and races. Traditional treatment involves radiographic imaging that relies heavily on radiologists manually analyzing images. There have been efforts to develop computer-aided diagnosis tools that employ artificial intelligence and deep learning approaches. Existing literature focuses on developing tools that only detect and classify bone fractures, rather than addressing the broader issue of bone fracture management. However, evidence of scholarly works that include treatment recommendations is still lacking. Furthermore, deep learning-based object detectors that achieve state-of-the-art results are computationally expensive and considered as black-box solutions. Developing countries, such as Sub-Saharan Africa, face a shortage of radiologists and orthopedists. For this reason, this paper proposes a methodological approach that uses a more efficient object detection model to diagnose long bone fractures and provide prescription recommendations. An enhanced anchoring process, known as adaptive anchoring, is proposed to improve the performance of the Regional Proposal Network and the object detection model. A Faster R-CNN model with ResNet-50/101 and ResNext-50/101 backbones was used to develop an object detection model that uses X-ray images as input. To understand and interpret the model's decision, a Gradient-based Class Activation Mapping method was used to assess the model's learnability. The results indicate that the proposed adaptive anchoring approach can improve computational efficiency, reducing training time by up to 29% compared to the traditional approach. Model accuracy during training and validation ranged between 94% and 98%. Overall, adaptive anchoring performed better when applied with the ResNet-101 backbone, yielding an Average Precision of 92.73%, an F1 score of 96.01%, a precision of 96.80%, and a recall of 95.23%. The study provides valuable insights into the use of computationally efficient deep learning models for medical recommendation systems. Future studies should develop models to diagnose fractures using input images from various modalities and to provide prescription recommendations.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1692894"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12833394/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved attention-based PCNN with GhostNet for epilepsy seizure detection using EEG and fMRI modalities: extractive pattern and histogram feature set. 基于GhostNet的改进的基于注意力的PCNN用于脑电图和功能磁共振成像的癫痫发作检测:提取模式和直方图特征集。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1679218
Sunkara Mounika, Reeja S R

Introduction: Detecting epileptic seizures remains a major challenge in clinical neurology due to the complex, heterogeneous, and non-stationary characteristics of electroencephalogram (EEG) signals. Although recent machine learning (ML) and deep learning (DL) approaches have improved detection performance, most methods still struggle with limited interpretability, inadequate spatial-temporal modeling, and suboptimal generalization. To address these limitations, this study proposes an enhanced hybrid parallel convolutional-GhostNet framework (HPG-ESD) for robust seizure detection using multimodal EEG and functional Magnetic Resonance Imaging (fMRI) data.

Methods: The experimental data consist of pediatric scalp EEG recordings from 24 subjects in the CHB-MIT dataset (22-channel 10-20 system, 256 Hz sampling, continuous multi-hour recordings) and resting-state 3T fMRI scans from 52 participants in the UNAM TLE dataset (26 epilepsy patients and 26 healthy controls). EEG data underwent Gauss-based median filtering, while fMRI images were denoised using an adaptive weight-based Wiener filter. Spatial, temporal, and spectral EEG features were extracted alongside an enhanced common spatial pattern (E-CSP) representation, whereas fMRI features were obtained using deep 3D CNN embeddings combined with a smoothened pyramid histogram of oriented gradients (S-PHOG) descriptor. These multimodal features were fused within a soft voting hybrid parallel convolutional-GhostNet (S-HPCGN) model, integrating an improved attention based parallel convolutional network (IAPCNet) and GhostNet to capture complementary spatial-temporal patterns.

Results: The proposed HPG-ESD framework achieved an accuracy of 0.941, precision of 0.939, and sensitivity of 0.944, outperforming conventional unimodal and state-of-the-art methods.

Discussion: These results demonstrate the potential of multi-modal learning and lightweight attention-enhanced architectures for reliable and clinically relevant seizure detection.

由于脑电图(EEG)信号的复杂性、异质性和非平稳性,检测癫痫发作仍然是临床神经病学的主要挑战。尽管最近的机器学习(ML)和深度学习(DL)方法提高了检测性能,但大多数方法仍然存在有限的可解释性、不充分的时空建模和次优泛化的问题。为了解决这些限制,本研究提出了一种增强的混合并行卷积- ghostnet框架(HPG-ESD),用于使用多模态脑电图和功能磁共振成像(fMRI)数据进行稳健的癫痫检测。方法:实验数据包括来自CHB-MIT数据集(22通道10-20系统,256 Hz采样,连续多小时记录)的24名儿童头皮EEG记录和来自UNAM TLE数据集的52名参与者(26名癫痫患者和26名健康对照)的静息状态3T fMRI扫描。EEG数据采用高斯中值滤波,fMRI图像采用自适应加权维纳滤波去噪。通过增强的共同空间模式(E-CSP)表示提取EEG的空间、时间和频谱特征,而使用深度3D CNN嵌入结合平滑的定向梯度金字塔直方图(S-PHOG)描述符获得fMRI特征。这些多模态特征融合在软投票混合并行卷积-GhostNet (S-HPCGN)模型中,整合改进的基于注意力的并行卷积网络(IAPCNet)和GhostNet来捕捉互补的时空模式。结果:所提出的HPG-ESD框架的准确度为0.941,精密度为0.939,灵敏度为0.944,优于传统的单峰方法和最先进的方法。讨论:这些结果证明了多模式学习和轻量级注意力增强架构在可靠和临床相关的癫痫检测方面的潜力。
{"title":"Improved attention-based PCNN with GhostNet for epilepsy seizure detection using EEG and fMRI modalities: extractive pattern and histogram feature set.","authors":"Sunkara Mounika, Reeja S R","doi":"10.3389/frai.2025.1679218","DOIUrl":"10.3389/frai.2025.1679218","url":null,"abstract":"<p><strong>Introduction: </strong>Detecting epileptic seizures remains a major challenge in clinical neurology due to the complex, heterogeneous, and non-stationary characteristics of electroencephalogram (EEG) signals. Although recent machine learning (ML) and deep learning (DL) approaches have improved detection performance, most methods still struggle with limited interpretability, inadequate spatial-temporal modeling, and suboptimal generalization. To address these limitations, this study proposes an enhanced hybrid parallel convolutional-GhostNet framework (HPG-ESD) for robust seizure detection using multimodal EEG and functional Magnetic Resonance Imaging (fMRI) data.</p><p><strong>Methods: </strong>The experimental data consist of pediatric scalp EEG recordings from 24 subjects in the CHB-MIT dataset (22-channel 10-20 system, 256 Hz sampling, continuous multi-hour recordings) and resting-state 3T fMRI scans from 52 participants in the UNAM TLE dataset (26 epilepsy patients and 26 healthy controls). EEG data underwent Gauss-based median filtering, while fMRI images were denoised using an adaptive weight-based Wiener filter. Spatial, temporal, and spectral EEG features were extracted alongside an enhanced common spatial pattern (E-CSP) representation, whereas fMRI features were obtained using deep 3D CNN embeddings combined with a smoothened pyramid histogram of oriented gradients (S-PHOG) descriptor. These multimodal features were fused within a soft voting hybrid parallel convolutional-GhostNet (S-HPCGN) model, integrating an improved attention based parallel convolutional network (IAPCNet) and GhostNet to capture complementary spatial-temporal patterns.</p><p><strong>Results: </strong>The proposed HPG-ESD framework achieved an accuracy of 0.941, precision of 0.939, and sensitivity of 0.944, outperforming conventional unimodal and state-of-the-art methods.</p><p><strong>Discussion: </strong>These results demonstrate the potential of multi-modal learning and lightweight attention-enhanced architectures for reliable and clinically relevant seizure detection.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1679218"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12850516/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146087263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Perception and awareness of healthcare professionals toward the applications of artificial intelligence in Egyptian healthcare settings. 感知和医疗保健专业人员对人工智能在埃及医疗保健设置的应用意识。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1700493
Shimaa Azzam, El-Morsy Ahmed El-Morsy, Amira S A Said, Nermin Eissa, Doaa Mahmoud Khalil

Background: Healthcare professionals' awareness and handling of artificial intelligence applications in healthcare enhance patient outcomes and improve processes. This study aimed to evaluate the perception, attitude, knowledge, and practice of healthcare professionals regarding the application of artificial intelligence in Egyptian healthcare settings.

Method: A cross-sectional study in which 367 healthcare professionals responded to an electronic questionnaire.

Results: Out of 367 participants (234 female), radiology and lab test specialty (36.2%) was the predominant. The mean age was 27.03 years; 51.8% of respondents showed positive perception, 68.7% experienced sub-optimal knowledge, 52.9% expressed negative attitudes, and 53.4% demonstrated a low practice level of AI tools. Younger age was significantly associated with positive perception (adjusted odds ratio (AOR) = 0.905, p = 0.020) and higher AI practice (AOR = 0.907, p = 0.026). University hospital professionals had 61.4% lower odds of optimal knowledge than private hospital professionals (AOR = 0.386, p = 0.046). Men had higher odds of both positive attitudes (AOR = 1.844, p = 0.010) and high practice level (AOR = 2.92, p < 0.001). Pre-bachelor's holders had lower odds of positive attitudes (AOR = 0.361, p = 0.036), as well as physicians compared to nurses and others (AOR = 0.424, p = 0.005). Bachelor's holders showed lower odds of high AI practice (AOR = 0.388, p = 0.017).

Conclusion: Despite moderate perception, most professionals have knowledge, attitude, and practice defects. Mainly, younger age and men showed higher engagement, indicating a need for targeted AI training, especially for older and female professionals.

背景:医疗保健专业人员对医疗保健中人工智能应用的认识和处理可以提高患者的治疗效果并改善流程。本研究旨在评估埃及医疗保健专业人员对人工智能应用的看法、态度、知识和实践。方法:一项横断面研究,其中367名医疗保健专业人员回答了一份电子问卷。结果:在367名参与者中(234名女性),放射学和实验室检测专业占36.2%。平均年龄27.03 岁;51.8%的受访者对人工智能工具持积极态度,68.7%的受访者认为知识不够理想,52.9%的受访者持消极态度,53.4%的受访者表示人工智能工具的实践水平较低。年龄越小,积极感知能力越强(调整优势比(AOR) = 0.905,p = 0.020),人工智能水平越高(AOR = 0.907,p = 0.026)。大学医院专业人员获得最佳知识的几率比私立医院专业人员低61.4% (AOR = 0.386,p = 0.046)。男性有更高的几率都积极的态度(AOR = 1.844,p = 0.010)和高实践水平(AOR = 2.92,p  = 0.036),以及医生比护士和其他(AOR = 0.424,p = 0.005)。学士学位持有者的高人工智能实践的几率较低(AOR = 0.388,p = 0.017)。结论:大多数专业人员在认知上存在一定的缺陷,但在知识、态度和实践上存在一定的缺陷。主要是年轻人和男性表现出更高的参与度,这表明需要有针对性的人工智能培训,尤其是对老年人和女性专业人士。
{"title":"Perception and awareness of healthcare professionals toward the applications of artificial intelligence in Egyptian healthcare settings.","authors":"Shimaa Azzam, El-Morsy Ahmed El-Morsy, Amira S A Said, Nermin Eissa, Doaa Mahmoud Khalil","doi":"10.3389/frai.2025.1700493","DOIUrl":"10.3389/frai.2025.1700493","url":null,"abstract":"<p><strong>Background: </strong>Healthcare professionals' awareness and handling of artificial intelligence applications in healthcare enhance patient outcomes and improve processes. This study aimed to evaluate the perception, attitude, knowledge, and practice of healthcare professionals regarding the application of artificial intelligence in Egyptian healthcare settings.</p><p><strong>Method: </strong>A cross-sectional study in which 367 healthcare professionals responded to an electronic questionnaire.</p><p><strong>Results: </strong>Out of 367 participants (234 female), radiology and lab test specialty (36.2%) was the predominant. The mean age was 27.03 years; 51.8% of respondents showed positive perception, 68.7% experienced sub-optimal knowledge, 52.9% expressed negative attitudes, and 53.4% demonstrated a low practice level of AI tools. Younger age was significantly associated with positive perception (adjusted odds ratio (AOR) = 0.905, <i>p</i> = 0.020) and higher AI practice (AOR = 0.907, <i>p</i> = 0.026). University hospital professionals had 61.4% lower odds of optimal knowledge than private hospital professionals (AOR = 0.386, <i>p</i> = 0.046). Men had higher odds of both positive attitudes (AOR = 1.844, <i>p</i> = 0.010) and high practice level (AOR = 2.92, <i>p</i> < 0.001). Pre-bachelor's holders had lower odds of positive attitudes (AOR = 0.361, <i>p</i> = 0.036), as well as physicians compared to nurses and others (AOR = 0.424, <i>p</i> = 0.005). Bachelor's holders showed lower odds of high AI practice (AOR = 0.388, <i>p</i> = 0.017).</p><p><strong>Conclusion: </strong>Despite moderate perception, most professionals have knowledge, attitude, and practice defects. Mainly, younger age and men showed higher engagement, indicating a need for targeted AI training, especially for older and female professionals.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1700493"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12833234/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Physics-constrained GAN boosts OAM correction in ocean turbulence. 物理约束的GAN增强了海洋湍流中的OAM校正。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1702056
Xiaoji Li, Zhiyuan Wang

Introduction: This study addresses the challenge of improving wavefront correction for Orbital Angular Momentum (OAM) in oceanic turbulence using a physics-constrained Generative Adversarial Network (GAN).

Methods: We integrated physical constraints into a deep learning framework to reconstruct degraded input images (SSIM = 0.62). The model was trained with varied loss settings, including a baseline model, spectral constraints (+Spec), and spatial constraints (+Ortho).

Results: The dual-constraint approach (+Ortho+Spec) reached a near-optimal SSIM of 0.98. Ablation studies revealed that while +Ortho boosted modal purity to 95.7%, the dual-constraints achieved 98.4% purity. Power spectral density analysis via KL divergence confirmed the dual-constraints' superiority (KL = 0.56) over the baseline (KL = 2.47).

Discussion: These results demonstrate that integrating both spatial and spectral constraints effectively optimizes reconstruction, purity, and spectral fidelity, offering a robust solution for OAM correction in underwater optical communication systems.

本研究利用物理约束生成对抗网络(GAN)解决了改善海洋湍流中轨道角动量(OAM)波前校正的挑战。方法:我们将物理约束整合到深度学习框架中来重建退化的输入图像(SSIM = 0.62)。该模型使用不同的损失设置进行训练,包括基线模型、光谱约束(+Spec)和空间约束(+Ortho)。结果:双约束方法(+Ortho+Spec)达到了接近最优的SSIM为0.98。消融研究显示,虽然+Ortho将模态纯度提高到95.7%,但双重约束的纯度达到98.4%。通过KL散度进行的功率谱密度分析证实了双约束(KL = 0.56)优于基线(KL = 2.47)。讨论:这些结果表明,空间和光谱约束的整合有效地优化了重建、纯度和光谱保真度,为水下光通信系统的OAM校正提供了一个强大的解决方案。
{"title":"Physics-constrained GAN boosts OAM correction in ocean turbulence.","authors":"Xiaoji Li, Zhiyuan Wang","doi":"10.3389/frai.2025.1702056","DOIUrl":"10.3389/frai.2025.1702056","url":null,"abstract":"<p><strong>Introduction: </strong>This study addresses the challenge of improving wavefront correction for Orbital Angular Momentum (OAM) in oceanic turbulence using a physics-constrained Generative Adversarial Network (GAN).</p><p><strong>Methods: </strong>We integrated physical constraints into a deep learning framework to reconstruct degraded input images (SSIM = 0.62). The model was trained with varied loss settings, including a baseline model, spectral constraints (+Spec), and spatial constraints (+Ortho).</p><p><strong>Results: </strong>The dual-constraint approach (+Ortho+Spec) reached a near-optimal SSIM of 0.98. Ablation studies revealed that while +Ortho boosted modal purity to 95.7%, the dual-constraints achieved 98.4% purity. Power spectral density analysis via KL divergence confirmed the dual-constraints' superiority (KL = 0.56) over the baseline (KL = 2.47).</p><p><strong>Discussion: </strong>These results demonstrate that integrating both spatial and spectral constraints effectively optimizes reconstruction, purity, and spectral fidelity, offering a robust solution for OAM correction in underwater optical communication systems.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1702056"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12833431/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Persona pedagogica in crisis: are educators becoming data custodians in the age of AI? 危机中的人物角色教学法:教育工作者正在成为人工智能时代的数据保管人吗?
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1743016
Anusree Ambady, Thomas K V
{"title":"Persona pedagogica in crisis: are educators becoming data custodians in the age of AI?","authors":"Anusree Ambady, Thomas K V","doi":"10.3389/frai.2025.1743016","DOIUrl":"https://doi.org/10.3389/frai.2025.1743016","url":null,"abstract":"","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1743016"},"PeriodicalIF":4.7,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12827719/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal graph neural networks in healthcare: a review of fusion strategies across biomedical domains. 医疗保健中的多模态图神经网络:跨生物医学领域的融合策略综述。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1716706
Maria Vaida, Ziyuan Huang

Graph Neural Networks (GNNs) have transformed multimodal healthcare data integration by capturing complex, non-Euclidean relationships across diverse sources such as electronic health records, medical imaging, genomic profiles, and clinical notes. This review synthesizes GNN applications in healthcare, highlighting their impact on clinical decision-making through multimodal integration, advanced fusion strategies, and attention mechanisms. Key applications include drug interaction and discovery, cancer detection and prognosis, clinical status prediction, infectious disease modeling, genomics, and the diagnosis of mental health and neurological disorders. Various GNN architectures demonstrate consistent applications in modeling both intra- and intermodal relationships. GNN architectures, such as Graph Convolutional Networks and Graph Attention Networks, are integrated with Convolutional Neural Networks (CNNs), transformer-based models, temporal encoders, and optimization algorithms to facilitate robust multimodal integration. Early, intermediate, late, and hybrid fusion strategies, enhanced by attention mechanisms like multi-head attention, enable dynamic prioritization of critical relationships, improving accuracy and interpretability. However, challenges remain, including data heterogeneity, computational demands, and the need for greater interpretability. Addressing these challenges presents opportunities to advance GNN adoption in medicine through scalable, transparent GNN models.

图神经网络(gnn)通过捕获不同来源(如电子健康记录、医学成像、基因组档案和临床记录)之间复杂的非欧几里得关系,改变了多模式医疗保健数据集成。本文综述了GNN在医疗保健中的应用,强调了它们通过多模式集成、先进融合策略和注意机制对临床决策的影响。主要应用包括药物相互作用和发现、癌症检测和预后、临床状态预测、传染病建模、基因组学以及精神健康和神经疾病的诊断。各种GNN体系结构在建模内部和多式联运关系方面表现出一致的应用。GNN架构,如图卷积网络和图注意网络,与卷积神经网络(cnn)、基于变压器的模型、时间编码器和优化算法集成在一起,以促进鲁棒多模态集成。早期、中期、晚期和混合融合策略,通过多头注意等注意机制的增强,实现了关键关系的动态优先排序,提高了准确性和可解释性。然而,挑战仍然存在,包括数据异构性、计算需求和对更高可解释性的需求。应对这些挑战为通过可扩展、透明的GNN模型推进GNN在医学中的应用提供了机会。
{"title":"Multimodal graph neural networks in healthcare: a review of fusion strategies across biomedical domains.","authors":"Maria Vaida, Ziyuan Huang","doi":"10.3389/frai.2025.1716706","DOIUrl":"10.3389/frai.2025.1716706","url":null,"abstract":"<p><p>Graph Neural Networks (GNNs) have transformed multimodal healthcare data integration by capturing complex, non-Euclidean relationships across diverse sources such as electronic health records, medical imaging, genomic profiles, and clinical notes. This review synthesizes GNN applications in healthcare, highlighting their impact on clinical decision-making through multimodal integration, advanced fusion strategies, and attention mechanisms. Key applications include drug interaction and discovery, cancer detection and prognosis, clinical status prediction, infectious disease modeling, genomics, and the diagnosis of mental health and neurological disorders. Various GNN architectures demonstrate consistent applications in modeling both intra- and intermodal relationships. GNN architectures, such as Graph Convolutional Networks and Graph Attention Networks, are integrated with Convolutional Neural Networks (CNNs), transformer-based models, temporal encoders, and optimization algorithms to facilitate robust multimodal integration. Early, intermediate, late, and hybrid fusion strategies, enhanced by attention mechanisms like multi-head attention, enable dynamic prioritization of critical relationships, improving accuracy and interpretability. However, challenges remain, including data heterogeneity, computational demands, and the need for greater interpretability. Addressing these challenges presents opportunities to advance GNN adoption in medicine through scalable, transparent GNN models.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1716706"},"PeriodicalIF":4.7,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12827511/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An improved YOLOv8n with multi-scale feature fusion for real time and high precision railway track defect detection. 基于多尺度特征融合的改进YOLOv8n实时高精度轨道缺陷检测。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1711309
Zhihong Zhang, Liling Zhang, Xin Lu, Tingting Ma, Feng Huang, Sheng Zhong

Introduction: Railway transportation is increasingly critical for modern urban and intercity mobility. However, the expanding scale and intensifying operational intensity of rail networks have elevated track defect detection to a key concern. Traditional inspection methods (manual, ultrasonic, eddy current, magnetic flux leakage testing) are limited by insufficient accuracy, low efficiency, or poor adaptability to complex environmental conditions.

Methods: An enhanced defect detection framework based on an improved YOLOv8 algorithm was proposed, tailored for small targets and complex backgrounds. Three core improvements were integrated: 1) AVCStem module with variable convolution kernels to dynamically adapt to defects of different shapes and scales; 2) ADSPPF module using multi-scale pooling and multi-branch attention mechanisms to preserve fine-grained features across scales; 3) MSF module for enhanced multi-scale feature fusion via partial convolution and hierarchical feature alignment.

Results and discussion: Experiments on a real-world track defect dataset showed the proposed model achieved 90.2% detection precision, 90.2% mAP@0.5, and 73.2% mAP@0.5:0.95. Meanwhile, the model size was reduced to 5.2MB with 2.45M parameters. Comparative and ablation studies confirmed the complementary advantages of each module and the model's superior performance over existing lightweight detectors. The proposed model provides a robust, accurate, and efficient solution for real-time railway defect detection. It exhibits strong potential for deployment in edge AI devices and mobile inspection robots, addressing the limitations of traditional inspection methods.

铁路运输在现代城市和城际交通中越来越重要。然而,随着铁路网络规模的扩大和运营强度的提高,轨道缺陷检测成为人们关注的焦点。传统的检测方法(人工、超声波、涡流、漏磁检测)精度不足,效率低,对复杂环境条件适应性差。方法:提出了一种基于改进YOLOv8算法的改进缺陷检测框架,并针对小目标和复杂背景进行了改进。集成了三个核心改进:1)AVCStem模块采用可变卷积核,动态适应不同形状和规模的缺陷;2) ADSPPF模块采用多尺度池和多分支关注机制,跨尺度保留细粒度特征;3)基于部分卷积和层次化特征对齐的增强多尺度特征融合MSF模块。结果与讨论:在真实轨迹缺陷数据集上的实验表明,该模型的检测精度为90.2%,mAP@0.5为90.2%,mAP@0.5为73.2%:0.95。同时,模型尺寸减小到5.2MB,参数为2.45M。对比和烧蚀研究证实了每个模块的互补优势,以及该模型优于现有轻型探测器的性能。该模型为铁路实时缺陷检测提供了鲁棒、准确、高效的解决方案。它在边缘人工智能设备和移动检测机器人中显示出强大的部署潜力,解决了传统检测方法的局限性。
{"title":"An improved YOLOv8n with multi-scale feature fusion for real time and high precision railway track defect detection.","authors":"Zhihong Zhang, Liling Zhang, Xin Lu, Tingting Ma, Feng Huang, Sheng Zhong","doi":"10.3389/frai.2025.1711309","DOIUrl":"10.3389/frai.2025.1711309","url":null,"abstract":"<p><strong>Introduction: </strong>Railway transportation is increasingly critical for modern urban and intercity mobility. However, the expanding scale and intensifying operational intensity of rail networks have elevated track defect detection to a key concern. Traditional inspection methods (manual, ultrasonic, eddy current, magnetic flux leakage testing) are limited by insufficient accuracy, low efficiency, or poor adaptability to complex environmental conditions.</p><p><strong>Methods: </strong>An enhanced defect detection framework based on an improved YOLOv8 algorithm was proposed, tailored for small targets and complex backgrounds. Three core improvements were integrated: 1) AVCStem module with variable convolution kernels to dynamically adapt to defects of different shapes and scales; 2) ADSPPF module using multi-scale pooling and multi-branch attention mechanisms to preserve fine-grained features across scales; 3) MSF module for enhanced multi-scale feature fusion via partial convolution and hierarchical feature alignment.</p><p><strong>Results and discussion: </strong>Experiments on a real-world track defect dataset showed the proposed model achieved 90.2% detection precision, 90.2% mAP@0.5, and 73.2% mAP@0.5:0.95. Meanwhile, the model size was reduced to 5.2MB with 2.45M parameters. Comparative and ablation studies confirmed the complementary advantages of each module and the model's superior performance over existing lightweight detectors. The proposed model provides a robust, accurate, and efficient solution for real-time railway defect detection. It exhibits strong potential for deployment in edge AI devices and mobile inspection robots, addressing the limitations of traditional inspection methods.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1711309"},"PeriodicalIF":4.7,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12827737/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting freebooted content in social media ads: multimodal provenance and e-commerce implications. 检测社交媒体广告中的免费内容:多模式来源和电子商务影响。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1717129
Petr Weinlich, Tereza Semeradova

This study examines the phenomenon of content freebooting on social media and its exploitation for marketing counterfeit and "dupe" products. Using a four-week dataset of TikTok ads linked to 32 distinct e-commerce domains, we develop and evaluate a multimodal provenance pipeline-combining perceptual hashing, audio fingerprinting, vision embeddings, and natural-language clustering-applied to 54 ads, 180 landing pages, and over 3,000 extracted video frames. The primary contribution is methodological: multimodal late-fusion substantially outperforms single-modality detectors in identifying copyright-infringing reuse of creator content under adversarial transformations. Empirically, we document systematic asset theft from legitimate fashion creators, with several videos and review images reappearing across more than 10 separate domains. Purchases from three advertised shops, alongside control items, reveal systematic misrepresentation of product quality and unreliable fulfillment, situating freebooted ads at the intersection of copyright infringement, trademark-like "dupe" positioning, deceptive advertising, and consumer fraud. Network analysis of ad handles and domains indicates a coordinated cluster of shell actors, with a median time-to-reupload of 18 h. As a secondary contribution, the study uses this provenance pipeline to illuminate how freebooted cultural assets are rapidly converted into counterfeit-linked sales, and to surface gaps in platform integrity and consumer protection. By integrating computer vision, audio analysis, and NLP techniques with network and fulfillment audits, the paper offers both a methodological framework for analyzing freebooting pipelines and socio-technical insights for platform governance in digital commerce.

本研究探讨了社交媒体上的免费内容现象及其用于营销假冒和“欺骗”产品的利用。使用与32个不同电子商务领域相关的为期四周的TikTok广告数据集,我们开发并评估了一个多模式来源管道——结合感知哈希、音频指纹、视觉嵌入和自然语言聚类——应用于54个广告、180个登陆页面和3000多个提取的视频帧。主要贡献在于方法:在识别对抗性转换下侵犯版权的创作者内容重用方面,多模态后期融合实质上优于单模态检测器。根据经验,我们记录了合法时尚创作者的系统性资产盗窃,其中有几个视频和评论图像在10多个不同的域中重复出现。从三家有广告的商店购买,以及控制项目,揭示了产品质量的系统性虚假陈述和不可靠的履行,将免费广告置于侵犯版权,类似商标的“欺骗”定位,欺骗性广告和消费者欺诈的交叉点。对广告句柄和域名的网络分析表明,一个协调的shell参与者集群,重新加载的中位数时间为18 h。作为第二项贡献,该研究利用这一来源渠道阐明了免费文化资产如何迅速转化为与假货相关的销售,并揭示了平台完整性和消费者保护方面的差距。通过将计算机视觉、音频分析和NLP技术与网络和履行审计相结合,本文提供了分析免费启动管道的方法框架和数字商务平台治理的社会技术见解。
{"title":"Detecting freebooted content in social media ads: multimodal provenance and e-commerce implications.","authors":"Petr Weinlich, Tereza Semeradova","doi":"10.3389/frai.2025.1717129","DOIUrl":"10.3389/frai.2025.1717129","url":null,"abstract":"<p><p>This study examines the phenomenon of content freebooting on social media and its exploitation for marketing counterfeit and \"dupe\" products. Using a four-week dataset of TikTok ads linked to 32 distinct e-commerce domains, we develop and evaluate a multimodal provenance pipeline-combining perceptual hashing, audio fingerprinting, vision embeddings, and natural-language clustering-applied to 54 ads, 180 landing pages, and over 3,000 extracted video frames. The primary contribution is methodological: multimodal late-fusion substantially outperforms single-modality detectors in identifying copyright-infringing reuse of creator content under adversarial transformations. Empirically, we document systematic asset theft from legitimate fashion creators, with several videos and review images reappearing across more than 10 separate domains. Purchases from three advertised shops, alongside control items, reveal systematic misrepresentation of product quality and unreliable fulfillment, situating freebooted ads at the intersection of copyright infringement, trademark-like \"dupe\" positioning, deceptive advertising, and consumer fraud. Network analysis of ad handles and domains indicates a coordinated cluster of shell actors, with a median time-to-reupload of 18 h. As a secondary contribution, the study uses this provenance pipeline to illuminate how freebooted cultural assets are rapidly converted into counterfeit-linked sales, and to surface gaps in platform integrity and consumer protection. By integrating computer vision, audio analysis, and NLP techniques with network and fulfillment audits, the paper offers both a methodological framework for analyzing freebooting pipelines and socio-technical insights for platform governance in digital commerce.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1717129"},"PeriodicalIF":4.7,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12827671/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146053993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Frontiers in Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1