首页 > 最新文献

Frontiers in Artificial Intelligence最新文献

英文 中文
User perceptions of RBI-approved P2P digital lending apps: an NLP, machine learning, and deep learning approach. 用户对印度储备银行批准的P2P数字借贷应用程序的看法:NLP、机器学习和深度学习方法。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1708080
Kunchakara Raja Sekhar, Shaiku Shahida Saheb

Introduction: Digital lending, also known as alternative lending, refers to fintech platforms that offer quick and easy loans through digital channels, bypassing many of the limitations of traditional banking. Since the mid-2000s, digital lending has become a major fintech innovation, with rapid growth in India driven by financial inclusion measures. However, the sector continues to face challenges, including fraud, transparency issues, and consumer dissatisfaction. The primary objective of this study was to understand how consumers perceive and assess India's RBI-approved P2P digital lending apps by analyzing a large dataset of customer feedback to identify strengths, weaknesses, and overall satisfaction levels.

Methods: The study analyzed a final dataset of 15,408 user reviews collected from seven RBI-approved digital lending platforms: 5Paisa, Faircent, i2iFunding, LenDenClub, CashKumar, Lendbox, and IndiaMoneyMart derived from an initial 15,537 reviews. The cleaned data was then examined using natural language processing, topic modeling, and supervised machine learning and deep learning models to identify key themes and evaluate predictive performance.

Results: Topic modeling identified 11 recurring topics. Sentiment analysis revealed that 55% of evaluations were positive, 41% were negative, and 4% were neutral. Strengths included loan disbursement, withdrawals, and EMI payments, while weaknesses involved interface design, transparency around rejections, and login functionality. Comparative data revealed that IndiaMoneyMart and i2iFunding received the highest user satisfaction, while 5Paisa and Lendbox trailed due to recurring complaints about transparency, accessibility, and overall user experience. In terms of modeling, the deep learning model VGG16 and ensemble machine learning techniques (XGBoost, CatBoost, and LightGBM) consistently achieved the highest predictive accuracy (up to 0.88), outperforming simpler models such as decision trees and ResNets.

Discussion: The findings indicate that digital lending platforms support financial inclusion but require improvements in user interface and user experience, better transparency in loan decisions, and stronger customer support. Addressing these areas can help strengthen trust and promote long term adoption of digital lending services.

简介:数字借贷,又称另类借贷,是指金融科技平台通过数字渠道提供快速便捷的贷款,绕过了传统银行的诸多限制。自2000年代中期以来,数字贷款已成为一项主要的金融科技创新,在金融普惠措施的推动下,印度的数字贷款增长迅速。然而,该行业继续面临挑战,包括欺诈、透明度问题和消费者不满。本研究的主要目的是通过分析客户反馈的大型数据集来确定优势、劣势和总体满意度,了解消费者如何看待和评估印度央行批准的P2P数字借贷应用程序。方法:该研究分析了从7个印度央行批准的数字借贷平台收集的15408条用户评论的最终数据集:5Paisa, Faircent, i2iffunding, LenDenClub, CashKumar, Lendbox和indiammoneymart,这些平台收集了最初的15537条评论。然后使用自然语言处理、主题建模、监督机器学习和深度学习模型检查清理后的数据,以确定关键主题并评估预测性能。结果:主题建模确定了11个重复主题。情绪分析显示,55%的评价是正面的,41%是负面的,4%是中性的。优点包括贷款支付、取款和EMI支付,而缺点涉及界面设计、拒绝的透明度和登录功能。比较数据显示,indiammoneymart和i2iffunding获得了最高的用户满意度,而5Paisa和Lendbox则因为反复出现的透明度、可访问性和整体用户体验方面的投诉而落后。在建模方面,深度学习模型VGG16和集成机器学习技术(XGBoost、CatBoost和LightGBM)始终实现了最高的预测精度(高达0.88),优于决策树和ResNets等简单模型。讨论:研究结果表明,数字借贷平台支持普惠金融,但需要改进用户界面和用户体验,提高贷款决策的透明度,并加强客户支持。解决这些问题有助于加强信任,促进数字借贷服务的长期采用。
{"title":"User perceptions of RBI-approved P2P digital lending apps: an NLP, machine learning, and deep learning approach.","authors":"Kunchakara Raja Sekhar, Shaiku Shahida Saheb","doi":"10.3389/frai.2025.1708080","DOIUrl":"10.3389/frai.2025.1708080","url":null,"abstract":"<p><strong>Introduction: </strong>Digital lending, also known as alternative lending, refers to fintech platforms that offer quick and easy loans through digital channels, bypassing many of the limitations of traditional banking. Since the mid-2000s, digital lending has become a major fintech innovation, with rapid growth in India driven by financial inclusion measures. However, the sector continues to face challenges, including fraud, transparency issues, and consumer dissatisfaction. The primary objective of this study was to understand how consumers perceive and assess India's RBI-approved P2P digital lending apps by analyzing a large dataset of customer feedback to identify strengths, weaknesses, and overall satisfaction levels.</p><p><strong>Methods: </strong>The study analyzed a final dataset of 15,408 user reviews collected from seven RBI-approved digital lending platforms: 5Paisa, Faircent, i2iFunding, LenDenClub, CashKumar, Lendbox, and IndiaMoneyMart derived from an initial 15,537 reviews. The cleaned data was then examined using natural language processing, topic modeling, and supervised machine learning and deep learning models to identify key themes and evaluate predictive performance.</p><p><strong>Results: </strong>Topic modeling identified 11 recurring topics. Sentiment analysis revealed that 55% of evaluations were positive, 41% were negative, and 4% were neutral. Strengths included loan disbursement, withdrawals, and EMI payments, while weaknesses involved interface design, transparency around rejections, and login functionality. Comparative data revealed that IndiaMoneyMart and i2iFunding received the highest user satisfaction, while 5Paisa and Lendbox trailed due to recurring complaints about transparency, accessibility, and overall user experience. In terms of modeling, the deep learning model VGG16 and ensemble machine learning techniques (XGBoost, CatBoost, and LightGBM) consistently achieved the highest predictive accuracy (up to 0.88), outperforming simpler models such as decision trees and ResNets.</p><p><strong>Discussion: </strong>The findings indicate that digital lending platforms support financial inclusion but require improvements in user interface and user experience, better transparency in loan decisions, and stronger customer support. Addressing these areas can help strengthen trust and promote long term adoption of digital lending services.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1708080"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12832989/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From the logic of coordination to goal-directed reasoning: the agentic turn in artificial intelligence. 从协调逻辑到目标导向推理:人工智能中的代理转向。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1728738
Tsehaye Haidemariam

The rise of agentic artificial intelligence (Agentic AI) marks a transition from systems that optimize externally specified objectives to systems capable of representing, evaluating, and revising their own goals. Whereas earlier AI architectures executed fixed task specifications, agentic systems maintain recursive loops of perception, evaluation, goal-updating, and action, allowing them to sustain and adapt purposive activity across temporal and organizational scales. This paper argues that Agentic AI is not an incremental extension of large language models (LLMs) or autonomous agents in the sense we know it from classical AI and multi-agent systems, but a reconstitution of agency itself within computational substrates. Building on the logic of coordination, delegation, and self-regulation developed in early agent-based process management systems, we propose a general theory of synthetic purposiveness, where agency emerges as a distributed and self-maintaining property of artificial systems operating in open-ended environments. We develop the concept of synthetic teleology-the engineered capacity of artificial systems to generate and regulate goals through ongoing self-evaluation-and we formalize its dynamics through a recursive goal-maintenance equation. We further outline design patterns, computational semantics, and measurable indicators of purposiveness (e.g., teleological coherence, adaptive recovery, and reflective efficiency), providing a foundation for the systematic design and empirical investigation of agentic behaviour. By reclaiming agency as a first-class construct in artificial intelligence, we argue for a paradigm shift from algorithmic optimization toward goal-directed reasoning and purposive orchestration-one with far-reaching epistemic, societal, and institutional consequences.

人工智能(agent AI)的兴起标志着从优化外部指定目标的系统向能够表示、评估和修改自己目标的系统的转变。早期的人工智能架构执行固定的任务规范,而代理系统维持感知、评估、目标更新和行动的递归循环,允许它们在时间和组织尺度上维持和适应有目的的活动。本文认为,人工智能不是大型语言模型(llm)的增量扩展,也不是我们从经典人工智能和多智能体系统中了解到的自主智能体,而是在计算基础上对代理本身的重构。在早期基于代理的过程管理系统中发展的协调、授权和自我调节逻辑的基础上,我们提出了一种综合合意性的一般理论,其中代理作为在开放式环境中运行的人工系统的分布式和自我维护属性而出现。我们发展了合成目的论的概念——人工系统通过持续的自我评估产生和调节目标的工程能力——我们通过递归的目标维持方程形式化了它的动力学。我们进一步概述了设计模式、计算语义和可测量的合向性指标(例如,目的性一致性、适应性恢复和反射效率),为代理行为的系统设计和实证研究提供了基础。通过将代理重新定义为人工智能中的一流结构,我们主张从算法优化到目标导向推理和有目的的编排的范式转变-具有深远的认知,社会和制度后果。
{"title":"From the logic of coordination to goal-directed reasoning: the agentic turn in artificial intelligence.","authors":"Tsehaye Haidemariam","doi":"10.3389/frai.2025.1728738","DOIUrl":"10.3389/frai.2025.1728738","url":null,"abstract":"<p><p>The rise of agentic artificial intelligence (Agentic AI) marks a transition from systems that optimize externally specified objectives to systems capable of representing, evaluating, and revising their own goals. Whereas earlier AI architectures executed fixed task specifications, agentic systems maintain recursive loops of perception, evaluation, goal-updating, and action, allowing them to sustain and adapt purposive activity across temporal and organizational scales. This paper argues that Agentic AI is not an incremental extension of large language models (LLMs) or autonomous agents in the sense we know it from classical AI and multi-agent systems, but a reconstitution of agency itself within computational substrates. Building on the logic of coordination, delegation, and self-regulation developed in early agent-based process management systems, we propose a general theory of synthetic purposiveness, where agency emerges as a distributed and self-maintaining property of artificial systems operating in open-ended environments. We develop the concept of synthetic teleology-the engineered capacity of artificial systems to generate and regulate goals through ongoing self-evaluation-and we formalize its dynamics through a recursive goal-maintenance equation. We further outline design patterns, computational semantics, and measurable indicators of purposiveness (e.g., teleological coherence, adaptive recovery, and reflective efficiency), providing a foundation for the systematic design and empirical investigation of agentic behaviour. By reclaiming agency as a first-class construct in artificial intelligence, we argue for a paradigm shift from algorithmic optimization toward goal-directed reasoning and purposive orchestration-one with far-reaching epistemic, societal, and institutional consequences.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1728738"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12833085/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimized ensemble machine learning model for cyberattack classification in industrial IoT. 工业物联网网络攻击分类的优化集成机器学习模型。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1685376
Batool Alabdullah, Suresh Sankaranarayanan

Introduction: The increasing cyber threats targeting industrial control systems (ICS) and the Internet of Things (IoT) pose significant risks, especially in critical infrastructures like the oil and gas sector. Existing machine learning (ML) approaches for cyberattack detection often rely on binary classification and lack computational efficiency.

Methods: This study proposes two optimized stacked ensemble models to enhance attack detection accuracy while reducing computational overhead. The main contribution lies in the strategic selection and integration of diverse base models, such as Logistic Regression, Extra Tree Classifier, XGBoost, and LGBM, with RFC as the final estimator. These models are chosen to address unique characteristics of security datasets, such as class imbalance, noise, and complex attack patterns. This combination aims to leverage different decision boundaries and learning mechanisms.

Results: Evaluations show that the Stacked Ensemble_2 model achieves 97% accuracy with a training and testing computation time of 54 minutes. Stacked Ensemble_2, which excelled over the traditional Stacked Ensemble_1, was also evaluated on the CICIDS 2017 dataset, achieving an impressive 100% accuracy with an AUROC of 99%.

Discussion: The results indicate that the proposed Stacked Ensemble_2 model provides a scalable, real-time detection mechanism for securing ICS and IoT environments. By proving its effectiveness on unseen data, this model demonstrates a significant advancement over traditional methods, offering enhanced accuracy and efficiency in detecting sophisticated cyber threats in critical infrastructure sectors.

导语:越来越多的针对工业控制系统(ICS)和物联网(IoT)的网络威胁构成了重大风险,特别是在石油和天然气行业等关键基础设施中。现有的机器学习(ML)网络攻击检测方法往往依赖于二进制分类,缺乏计算效率。方法:提出两种优化的堆叠集成模型,在降低计算开销的同时提高攻击检测精度。主要贡献在于策略性地选择和整合各种基本模型,如Logistic回归、Extra Tree Classifier、XGBoost和LGBM,并以RFC作为最终的估计器。选择这些模型是为了解决安全数据集的独特特征,例如类不平衡、噪声和复杂的攻击模式。这种组合旨在利用不同的决策边界和学习机制。结果:评价表明,该模型的训练和测试计算时间为54分钟,准确率达到97%。在CICIDS 2017数据集上,对优于传统堆叠Ensemble_1的堆叠Ensemble_2进行了评估,达到了令人印象深刻的100%准确率和99%的AUROC。讨论:结果表明,所提出的堆叠集成模型为保护ICS和物联网环境提供了一种可扩展的实时检测机制。通过证明其在看不见的数据上的有效性,该模型显示了比传统方法的重大进步,在检测关键基础设施部门的复杂网络威胁方面提供了更高的准确性和效率。
{"title":"Optimized ensemble machine learning model for cyberattack classification in industrial IoT.","authors":"Batool Alabdullah, Suresh Sankaranarayanan","doi":"10.3389/frai.2025.1685376","DOIUrl":"10.3389/frai.2025.1685376","url":null,"abstract":"<p><strong>Introduction: </strong>The increasing cyber threats targeting industrial control systems (ICS) and the Internet of Things (IoT) pose significant risks, especially in critical infrastructures like the oil and gas sector. Existing machine learning (ML) approaches for cyberattack detection often rely on binary classification and lack computational efficiency.</p><p><strong>Methods: </strong>This study proposes two optimized stacked ensemble models to enhance attack detection accuracy while reducing computational overhead. The main contribution lies in the strategic selection and integration of diverse base models, such as Logistic Regression, Extra Tree Classifier, XGBoost, and LGBM, with RFC as the final estimator. These models are chosen to address unique characteristics of security datasets, such as class imbalance, noise, and complex attack patterns. This combination aims to leverage different decision boundaries and learning mechanisms.</p><p><strong>Results: </strong>Evaluations show that the Stacked Ensemble_2 model achieves 97% accuracy with a training and testing computation time of 54 minutes. Stacked Ensemble_2, which excelled over the traditional Stacked Ensemble_1, was also evaluated on the CICIDS 2017 dataset, achieving an impressive 100% accuracy with an AUROC of 99%.</p><p><strong>Discussion: </strong>The results indicate that the proposed Stacked Ensemble_2 model provides a scalable, real-time detection mechanism for securing ICS and IoT environments. By proving its effectiveness on unseen data, this model demonstrates a significant advancement over traditional methods, offering enhanced accuracy and efficiency in detecting sophisticated cyber threats in critical infrastructure sectors.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1685376"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12832753/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RiCoRecA: rich cooking recipe annotation schema. RiCoRecA:丰富的烹饪食谱注释模式。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1550604
Filippos Ventirozos, Mauricio Jacobo-Romero, Haifa Alrdahi, Sarah Clinch, Riza Batista-Navarro

Despite recent advancements, modern kitchens, at best, have one or more isolated (non-communicating) "smart" devices. The vision of having a fully-fledged ambient kitchen where devices know what to do and when has yet to be realized. To address this, we present RiCoRecA, a novel schema for parsing cooking recipes into a workflow representation suitable for automation, a step toward that direction. Methodologically, the schema requires a number of information extraction tasks, i.e., annotating named entities, identifying relations between them, coreference resolution, and entity tracking. RiCoRecA differs from previously reported approaches in that it learns these different information extraction tasks using one joint model. We also provide a dataset containing annotations that follow this schema. Furthermore, we compared two transformer-based models for parsing recipes into workflows, namely, PEGASUS-X and LongT5. Our results demonstrate that PEGASUS-X surpassed LongT5 on all of the annotation tasks. Specifically, PEGASUS-X surpassed LongT5 by 39% in terms of F-Score when averaging the performance on all the tasks; it demonstrated almost human-like performance.

尽管最近取得了进步,但现代厨房最多只有一个或多个孤立的(非通信的)“智能”设备。拥有一个完全成熟的环境厨房,设备知道什么时候做什么,这一愿景尚未实现。为了解决这个问题,我们提出了RiCoRecA,这是一种新颖的模式,用于将烹饪食谱解析为适合自动化的工作流表示形式,朝着这个方向迈出了一步。在方法上,该模式需要许多信息提取任务,即注释命名实体、识别它们之间的关系、共同引用解析和实体跟踪。RiCoRecA与之前报道的方法不同,它使用一个联合模型来学习这些不同的信息提取任务。我们还提供了一个包含遵循此模式的注释的数据集。此外,我们比较了用于将食谱解析为工作流的两个基于转换器的模型,即PEGASUS-X和LongT5。我们的结果表明PEGASUS-X在所有注释任务上都超过了LongT5。具体来说,PEGASUS-X在所有任务的平均性能方面的F-Score超过了LongT5 39%;它展示了几乎与人类相似的表现。
{"title":"RiCoRecA: <i>ri</i>ch <i>co</i>oking <i>rec</i>ipe <i>a</i>nnotation schema.","authors":"Filippos Ventirozos, Mauricio Jacobo-Romero, Haifa Alrdahi, Sarah Clinch, Riza Batista-Navarro","doi":"10.3389/frai.2025.1550604","DOIUrl":"10.3389/frai.2025.1550604","url":null,"abstract":"<p><p>Despite recent advancements, modern kitchens, at best, have one or more isolated (non-communicating) \"smart\" devices. The vision of having a fully-fledged ambient kitchen where devices know what to do and when has yet to be realized. To address this, we present RiCoRecA, a novel schema for parsing cooking recipes into a workflow representation suitable for automation, a step toward that direction. Methodologically, the schema requires a number of information extraction tasks, i.e., annotating named entities, identifying relations between them, coreference resolution, and entity tracking. RiCoRecA differs from previously reported approaches in that it learns these different information extraction tasks using one joint model. We also provide a dataset containing annotations that follow this schema. Furthermore, we compared two transformer-based models for parsing recipes into workflows, namely, PEGASUS-X and LongT5. Our results demonstrate that PEGASUS-X surpassed LongT5 on all of the annotation tasks. Specifically, PEGASUS-X surpassed LongT5 by 39% in terms of F-Score when averaging the performance on all the tasks; it demonstrated almost human-like performance.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1550604"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12833278/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced multi-class object detector for bone fracture diagnosis with prescription recommendation. 基于处方推荐的增强多类目标检测器用于骨折诊断。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1692894
Daudi Mashauri Migayo, Shubi Kaijage, Stephen Swetala, Devotha G Nyambo

Bone fractures are among the most prominent injuries in the modern world that affect all ages and races. Traditional treatment involves radiographic imaging that relies heavily on radiologists manually analyzing images. There have been efforts to develop computer-aided diagnosis tools that employ artificial intelligence and deep learning approaches. Existing literature focuses on developing tools that only detect and classify bone fractures, rather than addressing the broader issue of bone fracture management. However, evidence of scholarly works that include treatment recommendations is still lacking. Furthermore, deep learning-based object detectors that achieve state-of-the-art results are computationally expensive and considered as black-box solutions. Developing countries, such as Sub-Saharan Africa, face a shortage of radiologists and orthopedists. For this reason, this paper proposes a methodological approach that uses a more efficient object detection model to diagnose long bone fractures and provide prescription recommendations. An enhanced anchoring process, known as adaptive anchoring, is proposed to improve the performance of the Regional Proposal Network and the object detection model. A Faster R-CNN model with ResNet-50/101 and ResNext-50/101 backbones was used to develop an object detection model that uses X-ray images as input. To understand and interpret the model's decision, a Gradient-based Class Activation Mapping method was used to assess the model's learnability. The results indicate that the proposed adaptive anchoring approach can improve computational efficiency, reducing training time by up to 29% compared to the traditional approach. Model accuracy during training and validation ranged between 94% and 98%. Overall, adaptive anchoring performed better when applied with the ResNet-101 backbone, yielding an Average Precision of 92.73%, an F1 score of 96.01%, a precision of 96.80%, and a recall of 95.23%. The study provides valuable insights into the use of computationally efficient deep learning models for medical recommendation systems. Future studies should develop models to diagnose fractures using input images from various modalities and to provide prescription recommendations.

骨折是现代世界中影响所有年龄和种族的最突出的伤害之一。传统的治疗包括放射成像,严重依赖放射科医生手动分析图像。人们一直在努力开发利用人工智能和深度学习方法的计算机辅助诊断工具。现有文献侧重于开发仅检测和分类骨折的工具,而不是解决骨折管理的更广泛问题。然而,包括治疗建议的学术著作的证据仍然缺乏。此外,基于深度学习的目标检测器实现了最先进的结果,计算成本很高,被认为是黑盒解决方案。发展中国家,如撒哈拉以南非洲,面临着放射科医生和骨科医生的短缺。因此,本文提出了一种方法学方法,使用更有效的目标检测模型来诊断长骨骨折并提供处方建议。提出了一种增强的锚定过程,称为自适应锚定,以提高区域建议网络和目标检测模型的性能。采用基于ResNet-50/101和ResNext-50/101主干的Faster R-CNN模型,开发以x射线图像为输入的目标检测模型。为了理解和解释模型的决策,使用基于梯度的类激活映射方法来评估模型的可学习性。结果表明,与传统方法相比,所提出的自适应锚定方法可以提高计算效率,减少高达29%的训练时间。模型在训练和验证期间的准确率在94%到98%之间。总体而言,自适应锚定在ResNet-101骨干网中表现更好,平均精度为92.73%,F1分数为96.01%,精度为96.80%,召回率为95.23%。该研究为医疗推荐系统使用计算效率高的深度学习模型提供了有价值的见解。未来的研究应该建立模型,利用不同模式的输入图像来诊断骨折,并提供处方建议。
{"title":"Enhanced multi-class object detector for bone fracture diagnosis with prescription recommendation.","authors":"Daudi Mashauri Migayo, Shubi Kaijage, Stephen Swetala, Devotha G Nyambo","doi":"10.3389/frai.2025.1692894","DOIUrl":"10.3389/frai.2025.1692894","url":null,"abstract":"<p><p>Bone fractures are among the most prominent injuries in the modern world that affect all ages and races. Traditional treatment involves radiographic imaging that relies heavily on radiologists manually analyzing images. There have been efforts to develop computer-aided diagnosis tools that employ artificial intelligence and deep learning approaches. Existing literature focuses on developing tools that only detect and classify bone fractures, rather than addressing the broader issue of bone fracture management. However, evidence of scholarly works that include treatment recommendations is still lacking. Furthermore, deep learning-based object detectors that achieve state-of-the-art results are computationally expensive and considered as black-box solutions. Developing countries, such as Sub-Saharan Africa, face a shortage of radiologists and orthopedists. For this reason, this paper proposes a methodological approach that uses a more efficient object detection model to diagnose long bone fractures and provide prescription recommendations. An enhanced anchoring process, known as adaptive anchoring, is proposed to improve the performance of the Regional Proposal Network and the object detection model. A Faster R-CNN model with ResNet-50/101 and ResNext-50/101 backbones was used to develop an object detection model that uses X-ray images as input. To understand and interpret the model's decision, a Gradient-based Class Activation Mapping method was used to assess the model's learnability. The results indicate that the proposed adaptive anchoring approach can improve computational efficiency, reducing training time by up to 29% compared to the traditional approach. Model accuracy during training and validation ranged between 94% and 98%. Overall, adaptive anchoring performed better when applied with the ResNet-101 backbone, yielding an Average Precision of 92.73%, an F1 score of 96.01%, a precision of 96.80%, and a recall of 95.23%. The study provides valuable insights into the use of computationally efficient deep learning models for medical recommendation systems. Future studies should develop models to diagnose fractures using input images from various modalities and to provide prescription recommendations.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1692894"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12833394/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved attention-based PCNN with GhostNet for epilepsy seizure detection using EEG and fMRI modalities: extractive pattern and histogram feature set. 基于GhostNet的改进的基于注意力的PCNN用于脑电图和功能磁共振成像的癫痫发作检测:提取模式和直方图特征集。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1679218
Sunkara Mounika, Reeja S R

Introduction: Detecting epileptic seizures remains a major challenge in clinical neurology due to the complex, heterogeneous, and non-stationary characteristics of electroencephalogram (EEG) signals. Although recent machine learning (ML) and deep learning (DL) approaches have improved detection performance, most methods still struggle with limited interpretability, inadequate spatial-temporal modeling, and suboptimal generalization. To address these limitations, this study proposes an enhanced hybrid parallel convolutional-GhostNet framework (HPG-ESD) for robust seizure detection using multimodal EEG and functional Magnetic Resonance Imaging (fMRI) data.

Methods: The experimental data consist of pediatric scalp EEG recordings from 24 subjects in the CHB-MIT dataset (22-channel 10-20 system, 256 Hz sampling, continuous multi-hour recordings) and resting-state 3T fMRI scans from 52 participants in the UNAM TLE dataset (26 epilepsy patients and 26 healthy controls). EEG data underwent Gauss-based median filtering, while fMRI images were denoised using an adaptive weight-based Wiener filter. Spatial, temporal, and spectral EEG features were extracted alongside an enhanced common spatial pattern (E-CSP) representation, whereas fMRI features were obtained using deep 3D CNN embeddings combined with a smoothened pyramid histogram of oriented gradients (S-PHOG) descriptor. These multimodal features were fused within a soft voting hybrid parallel convolutional-GhostNet (S-HPCGN) model, integrating an improved attention based parallel convolutional network (IAPCNet) and GhostNet to capture complementary spatial-temporal patterns.

Results: The proposed HPG-ESD framework achieved an accuracy of 0.941, precision of 0.939, and sensitivity of 0.944, outperforming conventional unimodal and state-of-the-art methods.

Discussion: These results demonstrate the potential of multi-modal learning and lightweight attention-enhanced architectures for reliable and clinically relevant seizure detection.

由于脑电图(EEG)信号的复杂性、异质性和非平稳性,检测癫痫发作仍然是临床神经病学的主要挑战。尽管最近的机器学习(ML)和深度学习(DL)方法提高了检测性能,但大多数方法仍然存在有限的可解释性、不充分的时空建模和次优泛化的问题。为了解决这些限制,本研究提出了一种增强的混合并行卷积- ghostnet框架(HPG-ESD),用于使用多模态脑电图和功能磁共振成像(fMRI)数据进行稳健的癫痫检测。方法:实验数据包括来自CHB-MIT数据集(22通道10-20系统,256 Hz采样,连续多小时记录)的24名儿童头皮EEG记录和来自UNAM TLE数据集的52名参与者(26名癫痫患者和26名健康对照)的静息状态3T fMRI扫描。EEG数据采用高斯中值滤波,fMRI图像采用自适应加权维纳滤波去噪。通过增强的共同空间模式(E-CSP)表示提取EEG的空间、时间和频谱特征,而使用深度3D CNN嵌入结合平滑的定向梯度金字塔直方图(S-PHOG)描述符获得fMRI特征。这些多模态特征融合在软投票混合并行卷积-GhostNet (S-HPCGN)模型中,整合改进的基于注意力的并行卷积网络(IAPCNet)和GhostNet来捕捉互补的时空模式。结果:所提出的HPG-ESD框架的准确度为0.941,精密度为0.939,灵敏度为0.944,优于传统的单峰方法和最先进的方法。讨论:这些结果证明了多模式学习和轻量级注意力增强架构在可靠和临床相关的癫痫检测方面的潜力。
{"title":"Improved attention-based PCNN with GhostNet for epilepsy seizure detection using EEG and fMRI modalities: extractive pattern and histogram feature set.","authors":"Sunkara Mounika, Reeja S R","doi":"10.3389/frai.2025.1679218","DOIUrl":"10.3389/frai.2025.1679218","url":null,"abstract":"<p><strong>Introduction: </strong>Detecting epileptic seizures remains a major challenge in clinical neurology due to the complex, heterogeneous, and non-stationary characteristics of electroencephalogram (EEG) signals. Although recent machine learning (ML) and deep learning (DL) approaches have improved detection performance, most methods still struggle with limited interpretability, inadequate spatial-temporal modeling, and suboptimal generalization. To address these limitations, this study proposes an enhanced hybrid parallel convolutional-GhostNet framework (HPG-ESD) for robust seizure detection using multimodal EEG and functional Magnetic Resonance Imaging (fMRI) data.</p><p><strong>Methods: </strong>The experimental data consist of pediatric scalp EEG recordings from 24 subjects in the CHB-MIT dataset (22-channel 10-20 system, 256 Hz sampling, continuous multi-hour recordings) and resting-state 3T fMRI scans from 52 participants in the UNAM TLE dataset (26 epilepsy patients and 26 healthy controls). EEG data underwent Gauss-based median filtering, while fMRI images were denoised using an adaptive weight-based Wiener filter. Spatial, temporal, and spectral EEG features were extracted alongside an enhanced common spatial pattern (E-CSP) representation, whereas fMRI features were obtained using deep 3D CNN embeddings combined with a smoothened pyramid histogram of oriented gradients (S-PHOG) descriptor. These multimodal features were fused within a soft voting hybrid parallel convolutional-GhostNet (S-HPCGN) model, integrating an improved attention based parallel convolutional network (IAPCNet) and GhostNet to capture complementary spatial-temporal patterns.</p><p><strong>Results: </strong>The proposed HPG-ESD framework achieved an accuracy of 0.941, precision of 0.939, and sensitivity of 0.944, outperforming conventional unimodal and state-of-the-art methods.</p><p><strong>Discussion: </strong>These results demonstrate the potential of multi-modal learning and lightweight attention-enhanced architectures for reliable and clinically relevant seizure detection.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1679218"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12850516/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146087263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Perception and awareness of healthcare professionals toward the applications of artificial intelligence in Egyptian healthcare settings. 感知和医疗保健专业人员对人工智能在埃及医疗保健设置的应用意识。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1700493
Shimaa Azzam, El-Morsy Ahmed El-Morsy, Amira S A Said, Nermin Eissa, Doaa Mahmoud Khalil

Background: Healthcare professionals' awareness and handling of artificial intelligence applications in healthcare enhance patient outcomes and improve processes. This study aimed to evaluate the perception, attitude, knowledge, and practice of healthcare professionals regarding the application of artificial intelligence in Egyptian healthcare settings.

Method: A cross-sectional study in which 367 healthcare professionals responded to an electronic questionnaire.

Results: Out of 367 participants (234 female), radiology and lab test specialty (36.2%) was the predominant. The mean age was 27.03 years; 51.8% of respondents showed positive perception, 68.7% experienced sub-optimal knowledge, 52.9% expressed negative attitudes, and 53.4% demonstrated a low practice level of AI tools. Younger age was significantly associated with positive perception (adjusted odds ratio (AOR) = 0.905, p = 0.020) and higher AI practice (AOR = 0.907, p = 0.026). University hospital professionals had 61.4% lower odds of optimal knowledge than private hospital professionals (AOR = 0.386, p = 0.046). Men had higher odds of both positive attitudes (AOR = 1.844, p = 0.010) and high practice level (AOR = 2.92, p < 0.001). Pre-bachelor's holders had lower odds of positive attitudes (AOR = 0.361, p = 0.036), as well as physicians compared to nurses and others (AOR = 0.424, p = 0.005). Bachelor's holders showed lower odds of high AI practice (AOR = 0.388, p = 0.017).

Conclusion: Despite moderate perception, most professionals have knowledge, attitude, and practice defects. Mainly, younger age and men showed higher engagement, indicating a need for targeted AI training, especially for older and female professionals.

背景:医疗保健专业人员对医疗保健中人工智能应用的认识和处理可以提高患者的治疗效果并改善流程。本研究旨在评估埃及医疗保健专业人员对人工智能应用的看法、态度、知识和实践。方法:一项横断面研究,其中367名医疗保健专业人员回答了一份电子问卷。结果:在367名参与者中(234名女性),放射学和实验室检测专业占36.2%。平均年龄27.03 岁;51.8%的受访者对人工智能工具持积极态度,68.7%的受访者认为知识不够理想,52.9%的受访者持消极态度,53.4%的受访者表示人工智能工具的实践水平较低。年龄越小,积极感知能力越强(调整优势比(AOR) = 0.905,p = 0.020),人工智能水平越高(AOR = 0.907,p = 0.026)。大学医院专业人员获得最佳知识的几率比私立医院专业人员低61.4% (AOR = 0.386,p = 0.046)。男性有更高的几率都积极的态度(AOR = 1.844,p = 0.010)和高实践水平(AOR = 2.92,p  = 0.036),以及医生比护士和其他(AOR = 0.424,p = 0.005)。学士学位持有者的高人工智能实践的几率较低(AOR = 0.388,p = 0.017)。结论:大多数专业人员在认知上存在一定的缺陷,但在知识、态度和实践上存在一定的缺陷。主要是年轻人和男性表现出更高的参与度,这表明需要有针对性的人工智能培训,尤其是对老年人和女性专业人士。
{"title":"Perception and awareness of healthcare professionals toward the applications of artificial intelligence in Egyptian healthcare settings.","authors":"Shimaa Azzam, El-Morsy Ahmed El-Morsy, Amira S A Said, Nermin Eissa, Doaa Mahmoud Khalil","doi":"10.3389/frai.2025.1700493","DOIUrl":"10.3389/frai.2025.1700493","url":null,"abstract":"<p><strong>Background: </strong>Healthcare professionals' awareness and handling of artificial intelligence applications in healthcare enhance patient outcomes and improve processes. This study aimed to evaluate the perception, attitude, knowledge, and practice of healthcare professionals regarding the application of artificial intelligence in Egyptian healthcare settings.</p><p><strong>Method: </strong>A cross-sectional study in which 367 healthcare professionals responded to an electronic questionnaire.</p><p><strong>Results: </strong>Out of 367 participants (234 female), radiology and lab test specialty (36.2%) was the predominant. The mean age was 27.03 years; 51.8% of respondents showed positive perception, 68.7% experienced sub-optimal knowledge, 52.9% expressed negative attitudes, and 53.4% demonstrated a low practice level of AI tools. Younger age was significantly associated with positive perception (adjusted odds ratio (AOR) = 0.905, <i>p</i> = 0.020) and higher AI practice (AOR = 0.907, <i>p</i> = 0.026). University hospital professionals had 61.4% lower odds of optimal knowledge than private hospital professionals (AOR = 0.386, <i>p</i> = 0.046). Men had higher odds of both positive attitudes (AOR = 1.844, <i>p</i> = 0.010) and high practice level (AOR = 2.92, <i>p</i> < 0.001). Pre-bachelor's holders had lower odds of positive attitudes (AOR = 0.361, <i>p</i> = 0.036), as well as physicians compared to nurses and others (AOR = 0.424, <i>p</i> = 0.005). Bachelor's holders showed lower odds of high AI practice (AOR = 0.388, <i>p</i> = 0.017).</p><p><strong>Conclusion: </strong>Despite moderate perception, most professionals have knowledge, attitude, and practice defects. Mainly, younger age and men showed higher engagement, indicating a need for targeted AI training, especially for older and female professionals.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1700493"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12833234/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Physics-constrained GAN boosts OAM correction in ocean turbulence. 物理约束的GAN增强了海洋湍流中的OAM校正。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-12 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1702056
Xiaoji Li, Zhiyuan Wang

Introduction: This study addresses the challenge of improving wavefront correction for Orbital Angular Momentum (OAM) in oceanic turbulence using a physics-constrained Generative Adversarial Network (GAN).

Methods: We integrated physical constraints into a deep learning framework to reconstruct degraded input images (SSIM = 0.62). The model was trained with varied loss settings, including a baseline model, spectral constraints (+Spec), and spatial constraints (+Ortho).

Results: The dual-constraint approach (+Ortho+Spec) reached a near-optimal SSIM of 0.98. Ablation studies revealed that while +Ortho boosted modal purity to 95.7%, the dual-constraints achieved 98.4% purity. Power spectral density analysis via KL divergence confirmed the dual-constraints' superiority (KL = 0.56) over the baseline (KL = 2.47).

Discussion: These results demonstrate that integrating both spatial and spectral constraints effectively optimizes reconstruction, purity, and spectral fidelity, offering a robust solution for OAM correction in underwater optical communication systems.

本研究利用物理约束生成对抗网络(GAN)解决了改善海洋湍流中轨道角动量(OAM)波前校正的挑战。方法:我们将物理约束整合到深度学习框架中来重建退化的输入图像(SSIM = 0.62)。该模型使用不同的损失设置进行训练,包括基线模型、光谱约束(+Spec)和空间约束(+Ortho)。结果:双约束方法(+Ortho+Spec)达到了接近最优的SSIM为0.98。消融研究显示,虽然+Ortho将模态纯度提高到95.7%,但双重约束的纯度达到98.4%。通过KL散度进行的功率谱密度分析证实了双约束(KL = 0.56)优于基线(KL = 2.47)。讨论:这些结果表明,空间和光谱约束的整合有效地优化了重建、纯度和光谱保真度,为水下光通信系统的OAM校正提供了一个强大的解决方案。
{"title":"Physics-constrained GAN boosts OAM correction in ocean turbulence.","authors":"Xiaoji Li, Zhiyuan Wang","doi":"10.3389/frai.2025.1702056","DOIUrl":"10.3389/frai.2025.1702056","url":null,"abstract":"<p><strong>Introduction: </strong>This study addresses the challenge of improving wavefront correction for Orbital Angular Momentum (OAM) in oceanic turbulence using a physics-constrained Generative Adversarial Network (GAN).</p><p><strong>Methods: </strong>We integrated physical constraints into a deep learning framework to reconstruct degraded input images (SSIM = 0.62). The model was trained with varied loss settings, including a baseline model, spectral constraints (+Spec), and spatial constraints (+Ortho).</p><p><strong>Results: </strong>The dual-constraint approach (+Ortho+Spec) reached a near-optimal SSIM of 0.98. Ablation studies revealed that while +Ortho boosted modal purity to 95.7%, the dual-constraints achieved 98.4% purity. Power spectral density analysis via KL divergence confirmed the dual-constraints' superiority (KL = 0.56) over the baseline (KL = 2.47).</p><p><strong>Discussion: </strong>These results demonstrate that integrating both spatial and spectral constraints effectively optimizes reconstruction, purity, and spectral fidelity, offering a robust solution for OAM correction in underwater optical communication systems.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1702056"},"PeriodicalIF":4.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12833431/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Persona pedagogica in crisis: are educators becoming data custodians in the age of AI? 危机中的人物角色教学法:教育工作者正在成为人工智能时代的数据保管人吗?
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1743016
Anusree Ambady, Thomas K V
{"title":"Persona pedagogica in crisis: are educators becoming data custodians in the age of AI?","authors":"Anusree Ambady, Thomas K V","doi":"10.3389/frai.2025.1743016","DOIUrl":"https://doi.org/10.3389/frai.2025.1743016","url":null,"abstract":"","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1743016"},"PeriodicalIF":4.7,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12827719/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal graph neural networks in healthcare: a review of fusion strategies across biomedical domains. 医疗保健中的多模态图神经网络:跨生物医学领域的融合策略综述。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1716706
Maria Vaida, Ziyuan Huang

Graph Neural Networks (GNNs) have transformed multimodal healthcare data integration by capturing complex, non-Euclidean relationships across diverse sources such as electronic health records, medical imaging, genomic profiles, and clinical notes. This review synthesizes GNN applications in healthcare, highlighting their impact on clinical decision-making through multimodal integration, advanced fusion strategies, and attention mechanisms. Key applications include drug interaction and discovery, cancer detection and prognosis, clinical status prediction, infectious disease modeling, genomics, and the diagnosis of mental health and neurological disorders. Various GNN architectures demonstrate consistent applications in modeling both intra- and intermodal relationships. GNN architectures, such as Graph Convolutional Networks and Graph Attention Networks, are integrated with Convolutional Neural Networks (CNNs), transformer-based models, temporal encoders, and optimization algorithms to facilitate robust multimodal integration. Early, intermediate, late, and hybrid fusion strategies, enhanced by attention mechanisms like multi-head attention, enable dynamic prioritization of critical relationships, improving accuracy and interpretability. However, challenges remain, including data heterogeneity, computational demands, and the need for greater interpretability. Addressing these challenges presents opportunities to advance GNN adoption in medicine through scalable, transparent GNN models.

图神经网络(gnn)通过捕获不同来源(如电子健康记录、医学成像、基因组档案和临床记录)之间复杂的非欧几里得关系,改变了多模式医疗保健数据集成。本文综述了GNN在医疗保健中的应用,强调了它们通过多模式集成、先进融合策略和注意机制对临床决策的影响。主要应用包括药物相互作用和发现、癌症检测和预后、临床状态预测、传染病建模、基因组学以及精神健康和神经疾病的诊断。各种GNN体系结构在建模内部和多式联运关系方面表现出一致的应用。GNN架构,如图卷积网络和图注意网络,与卷积神经网络(cnn)、基于变压器的模型、时间编码器和优化算法集成在一起,以促进鲁棒多模态集成。早期、中期、晚期和混合融合策略,通过多头注意等注意机制的增强,实现了关键关系的动态优先排序,提高了准确性和可解释性。然而,挑战仍然存在,包括数据异构性、计算需求和对更高可解释性的需求。应对这些挑战为通过可扩展、透明的GNN模型推进GNN在医学中的应用提供了机会。
{"title":"Multimodal graph neural networks in healthcare: a review of fusion strategies across biomedical domains.","authors":"Maria Vaida, Ziyuan Huang","doi":"10.3389/frai.2025.1716706","DOIUrl":"10.3389/frai.2025.1716706","url":null,"abstract":"<p><p>Graph Neural Networks (GNNs) have transformed multimodal healthcare data integration by capturing complex, non-Euclidean relationships across diverse sources such as electronic health records, medical imaging, genomic profiles, and clinical notes. This review synthesizes GNN applications in healthcare, highlighting their impact on clinical decision-making through multimodal integration, advanced fusion strategies, and attention mechanisms. Key applications include drug interaction and discovery, cancer detection and prognosis, clinical status prediction, infectious disease modeling, genomics, and the diagnosis of mental health and neurological disorders. Various GNN architectures demonstrate consistent applications in modeling both intra- and intermodal relationships. GNN architectures, such as Graph Convolutional Networks and Graph Attention Networks, are integrated with Convolutional Neural Networks (CNNs), transformer-based models, temporal encoders, and optimization algorithms to facilitate robust multimodal integration. Early, intermediate, late, and hybrid fusion strategies, enhanced by attention mechanisms like multi-head attention, enable dynamic prioritization of critical relationships, improving accuracy and interpretability. However, challenges remain, including data heterogeneity, computational demands, and the need for greater interpretability. Addressing these challenges presents opportunities to advance GNN adoption in medicine through scalable, transparent GNN models.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1716706"},"PeriodicalIF":4.7,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12827511/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Frontiers in Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1