首页 > 最新文献

Machine learning with applications最新文献

英文 中文
KerisRDNet: Mask-aware augmentation and residual dilated networks for cultural heritage blade classification 基于掩码感知的文化遗产刀片分类增强和残差扩展网络
IF 4.9 Pub Date : 2026-06-01 Epub Date: 2026-02-05 DOI: 10.1016/j.mlwa.2026.100852
Khafiizh Hastuti, Erwin Yudi Hidayat, Abu Salam, Usman Sudibyo
Fine-grained recognition of cultural artifacts remains challenging because of the scarcity of annotated data, subtle intra-class differences, and heterogeneous imaging conditions. This study addresses these issues through a domain-specific deep learning pipeline, demonstrated on Indonesian keris classification across three tasks: pamor (27 classes), dhapur (42), and tangguh (5). The pipeline integrates background homogenization, orientation normalization, and YOLOv8-based blade cropping with mask-aware augmentation restricted to the blade regions. For classification, we propose KerisRDNet, which extends InceptionResNetV2 with Inception-Residual-Dilated (IRD) blocks and squeeze-and-excitation to model the elongated geometries and subtle forging motifs. Experiments show that baseline networks collapse under fine-grained settings, with macro-F1 near zero, whereas the proposed approach achieves 0.268 (pamor), 0.276 (dhapur), and 0.635 (tangguh) with Top-3 accuracy above 0.5 and AUC up to 0.853. Across three stratified resamplings, paired non-parametric tests (Wilcoxon signed-rank) indicated directionally consistent improvements; given the small number of repetitions (n=3), these results are interpreted conservatively. These results demonstrate the feasibility of practically viable keris recognition as a decision-support tool for cultural heritage curation, while also offering a transferable workflow for low-data fine-grained recognition tasks.
由于注释数据的稀缺性、微妙的类内差异和不同的成像条件,对文化文物的细粒度识别仍然具有挑战性。本研究通过特定领域的深度学习管道解决了这些问题,并在印度尼西亚keris分类中展示了三个任务:pamor(27类)、dhapur(42类)和tangguh(5类)。该管道集成了背景均匀化、方向归一化和基于yolov8的叶片裁剪,以及仅限于叶片区域的掩模感知增强。对于分类,我们提出KerisRDNet,它扩展了Inception-Residual-Dilated (IRD)块和挤压-激励的Inception-Residual-Dilated (IRD)块来建模细长的几何形状和微妙的锻造图案。实验表明,在细粒度设置下,基线网络崩溃,宏f1接近于零,而该方法达到0.268 (pamor), 0.276 (dhapur)和0.635 (tangguh), Top-3精度高于0.5,AUC高达0.853。在三次分层重采样中,配对非参数检验(Wilcoxon符号秩)显示方向一致的改善;考虑到重复次数很少(n=3),这些结果被保守地解释。这些结果证明了keris识别作为文化遗产管理决策支持工具的可行性,同时也为低数据细粒度识别任务提供了可转移的工作流程。
{"title":"KerisRDNet: Mask-aware augmentation and residual dilated networks for cultural heritage blade classification","authors":"Khafiizh Hastuti,&nbsp;Erwin Yudi Hidayat,&nbsp;Abu Salam,&nbsp;Usman Sudibyo","doi":"10.1016/j.mlwa.2026.100852","DOIUrl":"10.1016/j.mlwa.2026.100852","url":null,"abstract":"<div><div>Fine-grained recognition of cultural artifacts remains challenging because of the scarcity of annotated data, subtle intra-class differences, and heterogeneous imaging conditions. This study addresses these issues through a domain-specific deep learning pipeline, demonstrated on Indonesian keris classification across three tasks: <em>pamor</em> (27 classes), <em>dhapur</em> (42), and <em>tangguh</em> (5). The pipeline integrates background homogenization, orientation normalization, and YOLOv8-based blade cropping with mask-aware augmentation restricted to the blade regions. For classification, we propose KerisRDNet, which extends InceptionResNetV2 with Inception-Residual-Dilated (IRD) blocks and squeeze-and-excitation to model the elongated geometries and subtle forging motifs. Experiments show that baseline networks collapse under fine-grained settings, with macro-F1 near zero, whereas the proposed approach achieves 0.268 (<em>pamor</em>), 0.276 (<em>dhapur</em>), and 0.635 (<em>tangguh</em>) with Top-3 accuracy above 0.5 and AUC up to 0.853. Across three stratified resamplings, paired non-parametric tests (Wilcoxon signed-rank) indicated directionally consistent improvements; given the small number of repetitions (<span><math><mrow><mi>n</mi><mo>=</mo><mn>3</mn></mrow></math></span>), these results are interpreted conservatively. These results demonstrate the feasibility of practically viable keris recognition as a decision-support tool for cultural heritage curation, while also offering a transferable workflow for low-data fine-grained recognition tasks.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"24 ","pages":"Article 100852"},"PeriodicalIF":4.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146161644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards an intelligent review helpfulness estimation: A novel dataset and machine learning framework 迈向智能评论有用估计:一个新的数据集和机器学习框架
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2026-01-21 DOI: 10.1016/j.mlwa.2026.100849
Rakibul Hassan, Shubhashish Kar, Jorge Fonseca Cacho, Shaikh Arifuzzaman
The rise of online rental platforms has led to an overwhelming amount of user-generated content, making it difficult for prospective consumers to discern which reviews are helpful. Existing approaches often rely on raw helpfulness votes, which are sparse, subjective, and temporally inconsistent. Also, there is lack of labeled dataset in the field of rental review usefulness prediction. This paper introduces a novel dataset of apartment reviews collected from online website and proposes an intelligent machine learning framework to predict the helpfulness of rental reviews. To address the challenge of obtaining reliable labels from sparse and subjective user votes, a scoring-based labeling strategy is developed that uses helpful vote count and timeliness. A diverse set of features including TF–IDF vectors, sentiment polarity, rating deviation, and review length are used to capture both textual and behavioral aspects of the reviews. Multiple classifiers, including Logistic Regression, Naive Bayes, and XGBoost, are systematically evaluated under 5-fold cross-validation, along with a rule-based and deep learning models.
Experimental results show that XGBoost consistently achieves the best overall performance with an accuracy of 0.71 and ROC-AUC of 0.75 when leveraging all features. This research makes three key contributions: (i) the first large-scale dataset for rental review, (ii) auto annotation technique that uses clustering approach with score from user votes and time since posted, and (iii) comprehensive evaluation pipeline spanning rule-based, traditional, and deep learning classifiers. Together, these advances establish a foundation for intelligent rental review helpfulness estimation, with broader implications for e-commerce, hospitality, and user-generated content analysis.
在线租赁平台的兴起导致了大量的用户生成内容,这使得潜在的消费者很难辨别哪些评论是有用的。现有的方法通常依赖于原始的有用性投票,这些投票是稀疏的、主观的,并且在时间上不一致。此外,在租房评论有用性预测领域也缺乏标记数据集。本文介绍了一个从在线网站收集的公寓评论数据集,并提出了一个智能机器学习框架来预测租房评论的有用性。为了解决从稀疏和主观的用户投票中获得可靠标签的挑战,开发了一种基于分数的标签策略,该策略使用有用的投票计数和及时性。包括TF-IDF向量、情感极性、评级偏差和评论长度在内的一系列不同的特征被用来捕获评论的文本和行为方面。多个分类器,包括逻辑回归,朴素贝叶斯和XGBoost,在5倍交叉验证下进行系统评估,以及基于规则和深度学习模型。实验结果表明,在利用所有特征时,XGBoost始终保持最佳的整体性能,精度为0.71,ROC-AUC为0.75。本研究做出了三个关键贡献:(i)第一个用于租赁评论的大规模数据集,(ii)使用用户投票和发布时间评分的聚类方法的自动注释技术,以及(iii)跨越基于规则的、传统的和深度学习分类器的综合评估管道。总之,这些进步为智能租赁评论有用性评估奠定了基础,并对电子商务、酒店业和用户生成内容分析产生了更广泛的影响。
{"title":"Towards an intelligent review helpfulness estimation: A novel dataset and machine learning framework","authors":"Rakibul Hassan,&nbsp;Shubhashish Kar,&nbsp;Jorge Fonseca Cacho,&nbsp;Shaikh Arifuzzaman","doi":"10.1016/j.mlwa.2026.100849","DOIUrl":"10.1016/j.mlwa.2026.100849","url":null,"abstract":"<div><div>The rise of online rental platforms has led to an overwhelming amount of user-generated content, making it difficult for prospective consumers to discern which reviews are helpful. Existing approaches often rely on raw helpfulness votes, which are sparse, subjective, and temporally inconsistent. Also, there is lack of labeled dataset in the field of rental review usefulness prediction. This paper introduces a novel dataset of apartment reviews collected from online website and proposes an intelligent machine learning framework to predict the helpfulness of rental reviews. To address the challenge of obtaining reliable labels from sparse and subjective user votes, a scoring-based labeling strategy is developed that uses helpful vote count and timeliness. A diverse set of features including TF–IDF vectors, sentiment polarity, rating deviation, and review length are used to capture both textual and behavioral aspects of the reviews. Multiple classifiers, including Logistic Regression, Naive Bayes, and XGBoost, are systematically evaluated under 5-fold cross-validation, along with a rule-based and deep learning models.</div><div>Experimental results show that XGBoost consistently achieves the best overall performance with an accuracy of 0.71 and ROC-AUC of 0.75 when leveraging all features. This research makes three key contributions: (i) the first large-scale dataset for rental review, (ii) auto annotation technique that uses clustering approach with score from user votes and time since posted, and (iii) comprehensive evaluation pipeline spanning rule-based, traditional, and deep learning classifiers. Together, these advances establish a foundation for intelligent rental review helpfulness estimation, with broader implications for e-commerce, hospitality, and user-generated content analysis.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100849"},"PeriodicalIF":4.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146077163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of major segmentation models for intracranial artery time-of-flight magnetic resonance angiography images 颅内动脉飞行时间磁共振血管造影图像的主要分割模型分析
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2026-01-19 DOI: 10.1016/j.mlwa.2026.100843
Mekhla Sarkar , Yen-Chu Huang , Tsong-Hai Lee , Jiann-Der Lee , Prasan Kumar Sahoo
Intracranial arterial stenosis (ICAS) is a leading cause of cerebrovascular accidents, and accurate morphological assessment of intracranial arteries is critical for diagnosis and treatment planning. Complex vascular structures, imaging noise, and variability in time-of-flight magnetic resonance angiography (TOF-MRA) images are challenging issues for the manual delineation that motivates the use of deep learning (DL) for automatic segmentation of the intracranial arteries. DL based automatic segmentation offers a promising solution by providing consistent and noise-reduced vessel delineation. However, selecting an optimal segmentation architecture remains challenging due to the diversity of network designs and encoder backbones. Therefore, this study presents a systematic benchmarking of five widely used DL segmentation architectures, UNet, LinkNet, Feature Pyramid Networks (FPN), Pyramid Scene Parsing Network (PSPNet), and DeepLabV3+, each combined with nine backbone networks, yielding 45 model variants, including previously unexplored configurations for intracranial artery segmentation in TOF-MRA. Models were trained and cross-validated on four datasets: in-house, CereVessMRA, IXI and ADAM, and evaluated on held-out independent test set. Performance metrics included Intersection over Union (IoU), Dice Similarity Coefficient (DSC), and a Stability Score, combining the coefficient of variation of IoU and DSC to quantify segmentation consistency and reproducibility. Experimental results demonstrated highest DSC score was achieved with UNet–SE-ResNeXt50, LinkNet-SE-ResNeXt50, FPN-DenseNet169, FPN-SENet154. The most stable configurations were LinkNet–EfficientNetB6, LinkNet–SENet154, UNet–DenseNet169, and UNet–EfficientNetB6. Conversely, DeepLabV3+ and PSPNet variants consistently underperformed. These findings provide actionable guidance for selecting backbone–segmentation pairs and highlight trade-offs between accuracy, robustness, and reproducibility for complex intracranial artery TOF-MRA segmentation tasks.
颅内动脉狭窄(ICAS)是脑血管意外的主要原因,准确的颅内动脉形态学评估对诊断和治疗计划至关重要。复杂的血管结构、成像噪声和飞行时间磁共振血管造影(TOF-MRA)图像的可变性是人工描绘的挑战问题,这促使使用深度学习(DL)来自动分割颅内动脉。基于深度学习的自动分割提供了一个很有前途的解决方案,它提供了一致的、降噪的血管描绘。然而,由于网络设计和编码器主干网的多样性,选择一个最佳的分割架构仍然是一个挑战。因此,本研究对五种广泛使用的深度学习分割架构UNet、LinkNet、特征金字塔网络(FPN)、金字塔场景解析网络(PSPNet)和DeepLabV3+进行了系统的基准测试,每一种都与9个骨干网络相结合,产生45种模型变体,包括以前未开发的TOF-MRA颅内动脉分割配置。模型在内部、CereVessMRA、IXI和ADAM四个数据集上进行训练和交叉验证,并在独立测试集上进行评估。性能指标包括交联(Intersection over Union, IoU)、骰子相似系数(Dice Similarity Coefficient, DSC)和稳定性评分,结合IoU和DSC的变异系数来量化分割的一致性和可重复性。实验结果表明,UNet-SE-ResNeXt50、LinkNet-SE-ResNeXt50、FPN-DenseNet169、FPN-SENet154的DSC得分最高。最稳定的配置是LinkNet-EfficientNetB6、LinkNet-SENet154、UNet-DenseNet169和UNet-EfficientNetB6。相反,DeepLabV3+和PSPNet变体一直表现不佳。这些发现为选择骨干分割对提供了可操作的指导,并突出了复杂颅内动脉TOF-MRA分割任务的准确性,稳健性和可重复性之间的权衡。
{"title":"Analysis of major segmentation models for intracranial artery time-of-flight magnetic resonance angiography images","authors":"Mekhla Sarkar ,&nbsp;Yen-Chu Huang ,&nbsp;Tsong-Hai Lee ,&nbsp;Jiann-Der Lee ,&nbsp;Prasan Kumar Sahoo","doi":"10.1016/j.mlwa.2026.100843","DOIUrl":"10.1016/j.mlwa.2026.100843","url":null,"abstract":"<div><div>Intracranial arterial stenosis (ICAS) is a leading cause of cerebrovascular accidents, and accurate morphological assessment of intracranial arteries is critical for diagnosis and treatment planning. Complex vascular structures, imaging noise, and variability in time-of-flight magnetic resonance angiography (TOF-MRA) images are challenging issues for the manual delineation that motivates the use of deep learning (DL) for automatic segmentation of the intracranial arteries. DL based automatic segmentation offers a promising solution by providing consistent and noise-reduced vessel delineation. However, selecting an optimal segmentation architecture remains challenging due to the diversity of network designs and encoder backbones. Therefore, this study presents a systematic benchmarking of five widely used DL segmentation architectures, UNet, LinkNet, Feature Pyramid Networks (FPN), Pyramid Scene Parsing Network (PSPNet), and DeepLabV3+, each combined with nine backbone networks, yielding 45 model variants, including previously unexplored configurations for intracranial artery segmentation in TOF-MRA. Models were trained and cross-validated on four datasets: in-house, CereVessMRA, IXI and ADAM, and evaluated on held-out independent test set. Performance metrics included Intersection over Union (IoU), Dice Similarity Coefficient (DSC), and a Stability Score, combining the coefficient of variation of IoU and DSC to quantify segmentation consistency and reproducibility. Experimental results demonstrated highest DSC score was achieved with UNet–SE-ResNeXt50, LinkNet-SE-ResNeXt50, FPN-DenseNet169, FPN-SENet154. The most stable configurations were LinkNet–EfficientNetB6, LinkNet–SENet154, UNet–DenseNet169, and UNet–EfficientNetB6. Conversely, DeepLabV3+ and PSPNet variants consistently underperformed. These findings provide actionable guidance for selecting backbone–segmentation pairs and highlight trade-offs between accuracy, robustness, and reproducibility for complex intracranial artery TOF-MRA segmentation tasks.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100843"},"PeriodicalIF":4.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146077164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive multi-domain uncertainty quantification for digital twin water forecasting 数字孪生水预报的自适应多域不确定性量化
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2025-12-13 DOI: 10.1016/j.mlwa.2025.100812
Mohammadhossein Homaei , Mehran Tarif , Pablo García Rodríguez , Mar Ávila , Andrés Caro
Machine learning (ML) models are often used to predict demand in digital twins (DTs) of water distribution systems (WDS). However, most models do not provide uncertainty estimation, and this makes risk evaluation limited. In this work, we introduce the first systematic framework for hierarchical uncertainty transfer in regional water networks, because until now no method existed for DT of regional water systems. We propose Adaptive Multi-Village Conformal Prediction (AMV-CP), a method that keeps theoretical guarantees and also allows transfer of uncertainty information between villages that are similar in structure but different in operation. The main ideas are: (i) village-adaptive conformity scores that capture local patterns, (ii) a meta-learning algorithm that reduces calibration cost by 88.6%, and (iii) regime-aware calibration that keeps 94.2% coverage when seasons change. We use eight years of data from six villages with 6174 users in one regional network. The results show a theoretical basis for cross-village transfer and 95.1% empirical coverage (target was 95%), with real-time speed of 120 predictions per second. Early multi-step tests also show 93.7% coverage for 24-hour horizons, with controlled trade-offs. This framework is the first systematic method for controlled uncertainty transfer in infrastructure DTs, with theoretical guarantees under ϕ-mixing and practical deployment. Our multi-village tests demonstrate the value of meta-learning for uncertainty estimation and make a base method that can be used in other hierarchical infrastructure systems. The system is validated in a Mediterranean rural network, but generalization to other climates, urban settings, and cascading systems needs further empirical study.
机器学习(ML)模型通常用于预测供水系统(WDS)的数字双胞胎(dt)的需求。然而,大多数模型不提供不确定性估计,这使得风险评估受到限制。在这项工作中,我们引入了第一个系统框架,用于区域水网的层次不确定性传递,因为到目前为止还没有方法用于区域水系的DT。本文提出了一种既保持理论保证,又允许结构相似但运行方式不同的村落间不确定性信息传递的自适应多村落共形预测(AMV-CP)方法。主要思想是:(i)捕获当地模式的村庄适应性一致性分数,(ii)将校准成本降低88.6%的元学习算法,以及(iii)在季节变化时保持94.2%覆盖率的制度感知校准。我们使用了来自6个村庄的8年数据,在一个区域网络中有6174名用户。结果表明,跨村转移具有理论基础,经验覆盖率为95.1%(目标为95%),实时预测速度为120次/秒。早期的多步骤测试也显示,24小时的覆盖率为93.7%,并有控制的权衡。该框架是基础设施dt中受控不确定性传递的第一个系统方法,具有在ϕ-mix和实际部署下的理论保证。我们的多村测试证明了元学习对不确定性估计的价值,并为其他分层基础设施系统提供了一个基础方法。该系统在地中海农村网络中得到验证,但推广到其他气候、城市环境和级联系统需要进一步的实证研究。
{"title":"Adaptive multi-domain uncertainty quantification for digital twin water forecasting","authors":"Mohammadhossein Homaei ,&nbsp;Mehran Tarif ,&nbsp;Pablo García Rodríguez ,&nbsp;Mar Ávila ,&nbsp;Andrés Caro","doi":"10.1016/j.mlwa.2025.100812","DOIUrl":"10.1016/j.mlwa.2025.100812","url":null,"abstract":"<div><div>Machine learning (ML) models are often used to predict demand in digital twins (DTs) of water distribution systems (WDS). However, most models do not provide uncertainty estimation, and this makes risk evaluation limited. In this work, we introduce the first systematic framework for hierarchical uncertainty transfer in regional water networks, because until now no method existed for DT of regional water systems. We propose Adaptive Multi-Village Conformal Prediction (AMV-CP), a method that keeps theoretical guarantees and also allows transfer of uncertainty information between villages that are similar in structure but different in operation. The main ideas are: (i) village-adaptive conformity scores that capture local patterns, (ii) a meta-learning algorithm that reduces calibration cost by 88.6%, and (iii) regime-aware calibration that keeps 94.2% coverage when seasons change. We use eight years of data from six villages with 6174 users in one regional network. The results show a theoretical basis for cross-village transfer and 95.1% empirical coverage (target was 95%), with real-time speed of 120 predictions per second. Early multi-step tests also show 93.7% coverage for 24-hour horizons, with controlled trade-offs. This framework is the first systematic method for controlled uncertainty transfer in infrastructure DTs, with theoretical guarantees under <span><math><mi>ϕ</mi></math></span>-mixing and practical deployment. Our multi-village tests demonstrate the value of meta-learning for uncertainty estimation and make a base method that can be used in other hierarchical infrastructure systems. The system is validated in a Mediterranean rural network, but generalization to other climates, urban settings, and cascading systems needs further empirical study.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100812"},"PeriodicalIF":4.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Beyond text: Multimodal stance detection in Arabic tweets 超越文本:阿拉伯语推文的多模态姿态检测
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2025-12-16 DOI: 10.1016/j.mlwa.2025.100823
Nouf AlShenaifi, Nourah Alangari
Despite the growing importance of multimodal signals on social media, Arabic stance detection has remained largely text-only, overlooking the visual context that often accompanies user posts. To bridge this gap, we present MAWQIF-MM, the first publicly available Arabic multimodal stance detection corpus of tweet–image pairs annotated with three stance labels: Favor, Against, and Neutral. Building on this resource, we propose a novel attention-based cross-modal fusion model that jointly encodes text and images. Textual content is processed using AraBERT v2, a transformer-based language model optimized for Arabic, while visual features are extracted using BLIP with a ViT-B backbone, a state-of-the-art vision-language model. These two modalities are integrated via multi-head cross-attention to capture cross-modal interactions. Experimental results demonstrate the effectiveness of our approach: on a held-out test set, the model achieves 88% accuracy, outperforming a text-only AraBERT baseline by 12 percentage points and an image-only BLIP baseline by 4 points. To further probe large vision–language models (VLMs) in low-resource settings, we benchmark Gemini 2.5 Flash and GPT-4o under zero-shot and few-shot prompting. While these models show promising generalization, they struggle with nuanced stances without fine-tuning, underscoring the value of domain-specific supervised training.
尽管多模式信号在社交媒体上的重要性日益增加,但阿拉伯语的立场检测仍然主要是文本检测,忽略了用户帖子中经常出现的视觉背景。为了弥补这一差距,我们提出了MAWQIF-MM,这是第一个公开可用的阿拉伯语多模态姿态检测语料库,该语料库使用三个姿态标签进行注释:赞成、反对和中立。在此基础上,我们提出了一种新的基于注意力的跨模态融合模型,该模型联合编码文本和图像。文本内容使用AraBERT v2处理,AraBERT v2是一种针对阿拉伯语进行优化的基于转换器的语言模型,而视觉特征则使用BLIP与最先进的视觉语言模型ViT-B主干进行提取。这两种模式通过多头交叉注意来整合,以捕捉跨模式的相互作用。实验结果证明了我们方法的有效性:在一个固定测试集上,该模型达到88%的准确率,比纯文本的AraBERT基线高出12个百分点,比纯图像的BLIP基线高出4个百分点。为了进一步探索低资源环境下的大型视觉语言模型(VLMs),我们在零射击和少射击提示下对Gemini 2.5 Flash和gpt - 40进行了基准测试。虽然这些模型显示出有希望的泛化,但它们在没有微调的情况下与细微的立场作斗争,强调了特定领域监督训练的价值。
{"title":"Beyond text: Multimodal stance detection in Arabic tweets","authors":"Nouf AlShenaifi,&nbsp;Nourah Alangari","doi":"10.1016/j.mlwa.2025.100823","DOIUrl":"10.1016/j.mlwa.2025.100823","url":null,"abstract":"<div><div>Despite the growing importance of multimodal signals on social media, Arabic stance detection has remained largely text-only, overlooking the visual context that often accompanies user posts. To bridge this gap, we present MAWQIF-MM, the first publicly available Arabic multimodal stance detection corpus of tweet–image pairs annotated with three stance labels: Favor, Against, and Neutral. Building on this resource, we propose a novel attention-based cross-modal fusion model that jointly encodes text and images. Textual content is processed using AraBERT v2, a transformer-based language model optimized for Arabic, while visual features are extracted using BLIP with a ViT-B backbone, a state-of-the-art vision-language model. These two modalities are integrated via multi-head cross-attention to capture cross-modal interactions. Experimental results demonstrate the effectiveness of our approach: on a held-out test set, the model achieves 88% accuracy, outperforming a text-only AraBERT baseline by 12 percentage points and an image-only BLIP baseline by 4 points. To further probe large vision–language models (VLMs) in low-resource settings, we benchmark Gemini 2.5 Flash and GPT-4o under zero-shot and few-shot prompting. While these models show promising generalization, they struggle with nuanced stances without fine-tuning, underscoring the value of domain-specific supervised training.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100823"},"PeriodicalIF":4.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid machine learning and IoT system for driver fatigue monitoring in connected electric vehicles 用于联网电动汽车驾驶员疲劳监测的混合机器学习和物联网系统
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2026-01-18 DOI: 10.1016/j.mlwa.2026.100845
Obaida AlHousrya, Aseel Bennagi, Petru A. Cotfas, Daniel T. Cotfas
Driver fatigue remains a critical factor in road accidents, particularly in long duration or cognitively demanding driving scenarios. This study presents a comprehensive, low cost, and real time system for monitoring driver health and electric vehicle status through physiological signal analysis. By integrating heart rate, eye movement, and breathing rate sensors, both simulated and real, this hybrid framework detects signs of fatigue using machine learning classifiers trained on publicly available datasets including OpenDriver, DriveDB, MAUS, YawDD, TinyML, and the Driver Respiration Dataset. The system architecture combines Arduino based hardware, cloud integration via Microsoft Azure, and advanced classification and anomaly detection algorithms such as Random Forest and Isolation Forest. Evaluation across diverse datasets revealed robust fatigue detection capabilities, with OpenDriver achieving 97.6% cross validation accuracy and 95.8% F1-score, while image and respiration-based models complemented the electrocardiogram-based analysis. These results demonstrate the feasibility of affordable, multimodal health monitoring in EVs, offering a scalable and deployable solution for enhancing road safety.
驾驶员疲劳仍然是道路交通事故的一个关键因素,特别是在长时间或认知要求高的驾驶情况下。本研究提出了一种全面、低成本、实时的、通过生理信号分析来监测驾驶员健康和电动汽车状态的系统。通过整合模拟和真实的心率、眼动和呼吸频率传感器,这个混合框架使用公开可用数据集(包括OpenDriver、DriveDB、MAUS、YawDD、TinyML和驾驶员呼吸数据集)训练的机器学习分类器检测疲劳迹象。系统架构结合了基于Arduino的硬件,通过Microsoft Azure进行云集成,以及Random Forest和Isolation Forest等高级分类和异常检测算法。对不同数据集的评估显示,OpenDriver具有强大的疲劳检测能力,交叉验证准确率为97.6%,f1评分为95.8%,而基于图像和呼吸的模型补充了基于心电图的分析。这些结果证明了在电动汽车中进行经济实惠的多模式健康监测的可行性,为提高道路安全提供了可扩展和可部署的解决方案。
{"title":"A hybrid machine learning and IoT system for driver fatigue monitoring in connected electric vehicles","authors":"Obaida AlHousrya,&nbsp;Aseel Bennagi,&nbsp;Petru A. Cotfas,&nbsp;Daniel T. Cotfas","doi":"10.1016/j.mlwa.2026.100845","DOIUrl":"10.1016/j.mlwa.2026.100845","url":null,"abstract":"<div><div>Driver fatigue remains a critical factor in road accidents, particularly in long duration or cognitively demanding driving scenarios. This study presents a comprehensive, low cost, and real time system for monitoring driver health and electric vehicle status through physiological signal analysis. By integrating heart rate, eye movement, and breathing rate sensors, both simulated and real, this hybrid framework detects signs of fatigue using machine learning classifiers trained on publicly available datasets including OpenDriver, DriveDB, MAUS, YawDD, TinyML, and the Driver Respiration Dataset. The system architecture combines Arduino based hardware, cloud integration via Microsoft Azure, and advanced classification and anomaly detection algorithms such as Random Forest and Isolation Forest. Evaluation across diverse datasets revealed robust fatigue detection capabilities, with OpenDriver achieving 97.6% cross validation accuracy and 95.8% F1-score, while image and respiration-based models complemented the electrocardiogram-based analysis. These results demonstrate the feasibility of affordable, multimodal health monitoring in EVs, offering a scalable and deployable solution for enhancing road safety.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100845"},"PeriodicalIF":4.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146037558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-MLP-Mixer based surrogate model for seismic ground-motion with spatial source and geological parameters 基于Multi-MLP-Mixer的地震地震动空间震源和地质参数代理模型
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2026-01-30 DOI: 10.1016/j.mlwa.2026.100855
Hirotaka Hachiya , Yuto Kuroki , Asako Iwaki , Takahiro Maeda , Naonori Ueda , Hiroyuki Fujiwara
Seismic ground-motion simulations provide high-fidelity predictions but are computationally prohibitive for large-scale scenario analyses. Surrogate models based on Multi-Layer Perceptrons (MLPs) or Fourier Neural Operators (FNOs) have been studied, yet each has limitations: MLPs fail to capture spatial correlations, while FNOs incur high costs from repeated Fourier transforms on full-resolution grids. To overcome these issues, we propose a surrogate model based on the MLP-Mixer architecture that operates on a patch grid, enabling efficient extraction of global spatial correlations. In addition, we introduce a multi-stream design with source and geology inputs fused through a learnable element-wise multi-modal mixer, allowing period-dependent, data-driven fusion of modalities. Experiments on Nankai Trough simulations demonstrate that the proposed method, referred to as Multi-MLP-Mixer, achieves accuracy comparable to state-of-the-art surrogate models while reducing training and inference time, thereby balancing predictive performance with computational efficiency.
地震地面运动模拟提供了高保真的预测,但在计算上不利于大规模的情景分析。基于多层感知器(mlp)或傅立叶神经算子(FNOs)的替代模型已经得到了研究,但每种模型都有局限性:mlp无法捕获空间相关性,而FNOs在全分辨率网格上进行重复傅立叶变换会产生高昂的成本。为了克服这些问题,我们提出了一个基于MLP-Mixer架构的代理模型,该模型在补丁网格上运行,能够有效地提取全局空间相关性。此外,我们还引入了一种多流设计,通过可学习的多模态混合器将源和地质输入融合在一起,从而实现与周期相关的、数据驱动的模态融合。在南开槽的仿真实验表明,所提出的方法(Multi-MLP-Mixer)在减少训练和推理时间的同时,达到了与最先进的代理模型相当的精度,从而平衡了预测性能和计算效率。
{"title":"Multi-MLP-Mixer based surrogate model for seismic ground-motion with spatial source and geological parameters","authors":"Hirotaka Hachiya ,&nbsp;Yuto Kuroki ,&nbsp;Asako Iwaki ,&nbsp;Takahiro Maeda ,&nbsp;Naonori Ueda ,&nbsp;Hiroyuki Fujiwara","doi":"10.1016/j.mlwa.2026.100855","DOIUrl":"10.1016/j.mlwa.2026.100855","url":null,"abstract":"<div><div>Seismic ground-motion simulations provide high-fidelity predictions but are computationally prohibitive for large-scale scenario analyses. Surrogate models based on Multi-Layer Perceptrons (MLPs) or Fourier Neural Operators (FNOs) have been studied, yet each has limitations: MLPs fail to capture spatial correlations, while FNOs incur high costs from repeated Fourier transforms on full-resolution grids. To overcome these issues, we propose a surrogate model based on the MLP-Mixer architecture that operates on a patch grid, enabling efficient extraction of global spatial correlations. In addition, we introduce a multi-stream design with source and geology inputs fused through a learnable element-wise multi-modal mixer, allowing period-dependent, data-driven fusion of modalities. Experiments on Nankai Trough simulations demonstrate that the proposed method, referred to as Multi-MLP-Mixer, achieves accuracy comparable to state-of-the-art surrogate models while reducing training and inference time, thereby balancing predictive performance with computational efficiency.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100855"},"PeriodicalIF":4.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146187667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable DEA–ensemble approach with golden jackal optimization: efficiency evaluation and prediction for United States information technology firms 金豺狼优化下的可解释dea -集合方法:美国信息技术企业效率评价与预测
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2025-11-29 DOI: 10.1016/j.mlwa.2025.100798
Temitope Olubanjo Kehinde , Azeez A. Oyedele , Morenikeji Kabirat Kareem , Joseph Akpan , Oludolapo A. Olanrewaju
This study presents an integrated Data Envelopment Analysis (DEA) and ensemble learning framework optimized with the Golden Jackal Optimization (GJO) algorithm to evaluate and predict the efficiency of United States information technology firms. Both Constant Returns to Scale and Variable Returns to Scale models were applied to measure firm efficiency and compute scale efficiency, providing a clearer distinction between managerial and scale-related effects. Using data from 3940 firms over the period 2013 to 2023, a robustness test introducing ±20% random noise to a 10% random sample confirmed that the CCR model achieved stronger stability, with a correlation coefficient of 0.795 compared to 0.773 for the BCC model. Consequently, the CCR results were adopted as the basis for predictive modeling. DEA efficiency scores were predicted using six ensemble learners, including XGBoost, Gradient Boosting Regressor, AdaBoost, Extra Trees Regressor, Random Forest, and LightGBM, with GJO employed for hyperparameter tuning. The Gradient Boosting Regressor optimized with GJO achieved the best predictive performance, accurately reproducing the observed efficiency scores. SHAP and feature importance analyses revealed that Total Equity, Operating Income, and Total Assets were the most influential determinants of efficiency. This research contributes a scalable and interpretable approach to efficiency prediction, offering actionable insights for managers, investors, and policymakers in volatile financial markets.
本研究提出一个整合数据包络分析(DEA)与金豺优化(GJO)算法的集成学习框架来评估和预测美国信息技术公司的效率。恒定规模回报和可变规模回报模型都被应用于衡量企业效率和计算规模效率,在管理效应和规模相关效应之间提供了更清晰的区分。利用2013年至2023年期间3940家企业的数据,对10%随机样本引入±20%随机噪声的稳健性检验证实,CCR模型具有更强的稳定性,其相关系数为0.795,而BCC模型的相关系数为0.773。因此,采用CCR结果作为预测建模的基础。使用XGBoost、Gradient Boosting Regressor、AdaBoost、Extra Trees Regressor、Random Forest和LightGBM等6个集成学习器预测DEA效率得分,并使用GJO进行超参数调优。使用GJO优化的梯度增强回归器获得了最佳的预测性能,准确地再现了观察到的效率得分。SHAP和特征重要性分析显示,总股本、营业收入和总资产是效率的最具影响力的决定因素。本研究为效率预测提供了一种可扩展和可解释的方法,为动荡的金融市场中的管理者、投资者和政策制定者提供了可操作的见解。
{"title":"Explainable DEA–ensemble approach with golden jackal optimization: efficiency evaluation and prediction for United States information technology firms","authors":"Temitope Olubanjo Kehinde ,&nbsp;Azeez A. Oyedele ,&nbsp;Morenikeji Kabirat Kareem ,&nbsp;Joseph Akpan ,&nbsp;Oludolapo A. Olanrewaju","doi":"10.1016/j.mlwa.2025.100798","DOIUrl":"10.1016/j.mlwa.2025.100798","url":null,"abstract":"<div><div>This study presents an integrated Data Envelopment Analysis (DEA) and ensemble learning framework optimized with the Golden Jackal Optimization (GJO) algorithm to evaluate and predict the efficiency of United States information technology firms. Both Constant Returns to Scale and Variable Returns to Scale models were applied to measure firm efficiency and compute scale efficiency, providing a clearer distinction between managerial and scale-related effects. Using data from 3940 firms over the period 2013 to 2023, a robustness test introducing ±20% random noise to a 10% random sample confirmed that the CCR model achieved stronger stability, with a correlation coefficient of 0.795 compared to 0.773 for the BCC model. Consequently, the CCR results were adopted as the basis for predictive modeling. DEA efficiency scores were predicted using six ensemble learners, including XGBoost, Gradient Boosting Regressor, AdaBoost, Extra Trees Regressor, Random Forest, and LightGBM, with GJO employed for hyperparameter tuning. The Gradient Boosting Regressor optimized with GJO achieved the best predictive performance, accurately reproducing the observed efficiency scores. SHAP and feature importance analyses revealed that Total Equity, Operating Income, and Total Assets were the most influential determinants of efficiency. This research contributes a scalable and interpretable approach to efficiency prediction, offering actionable insights for managers, investors, and policymakers in volatile financial markets.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100798"},"PeriodicalIF":4.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145694191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring multimodal, non-invasive stress assessment through audio-visual and textual cues integrated with psychometric survey data 探索多模式,非侵入性的压力评估,通过视听和文字线索与心理测量调查数据相结合
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2025-11-26 DOI: 10.1016/j.mlwa.2025.100803
Xin Yu Huang , Venkat Margapuri
Stress is a widespread psychological concern that often manifests alongside conditions such as anxiety and depression. Traditional self-report tools like the Perceived Stress Scale (PSS-10) may not fully capture an individual’s stress experience. This study explores whether integrating multimodal biometric data through video, audio, and transcriptions can enhance stress detection by providing a more comprehensive and interpretive point of view. Participants completed the PSS-10 while being recorded, and emotional features were extracted using machine learning models across the three biometric modalities. Results revealed weak correlations among the modalities, indicating that each captures distinct aspects of stress. Notably, the combined biometric score demonstrated greater sensitivity than the PSS-10 alone, suggesting that multimodal models may detect stress-related states that self-reports overlook. These findings support the development of more comprehensive stress assessment tools, although they are not intended to replace professional clinical evaluation.
压力是一种普遍存在的心理问题,通常与焦虑和抑郁等症状一起表现出来。传统的自我报告工具,如感知压力量表(PSS-10)可能无法完全捕捉到个人的压力体验。本研究探讨了通过视频、音频和转录整合多模态生物识别数据是否可以通过提供更全面和更具解释性的观点来增强应力检测。参与者在被记录的同时完成了PSS-10,并使用跨三种生物识别模式的机器学习模型提取了情绪特征。结果显示,模式之间的弱相关性,表明每个捕获不同的方面的压力。值得注意的是,联合生物特征评分比单独使用PSS-10表现出更高的灵敏度,这表明多模态模型可以检测到自我报告忽略的压力相关状态。这些发现支持开发更全面的压力评估工具,尽管它们并不打算取代专业的临床评估。
{"title":"Exploring multimodal, non-invasive stress assessment through audio-visual and textual cues integrated with psychometric survey data","authors":"Xin Yu Huang ,&nbsp;Venkat Margapuri","doi":"10.1016/j.mlwa.2025.100803","DOIUrl":"10.1016/j.mlwa.2025.100803","url":null,"abstract":"<div><div>Stress is a widespread psychological concern that often manifests alongside conditions such as anxiety and depression. Traditional self-report tools like the Perceived Stress Scale (PSS-10) may not fully capture an individual’s stress experience. This study explores whether integrating multimodal biometric data through video, audio, and transcriptions can enhance stress detection by providing a more comprehensive and interpretive point of view. Participants completed the PSS-10 while being recorded, and emotional features were extracted using machine learning models across the three biometric modalities. Results revealed weak correlations among the modalities, indicating that each captures distinct aspects of stress. Notably, the combined biometric score demonstrated greater sensitivity than the PSS-10 alone, suggesting that multimodal models may detect stress-related states that self-reports overlook. These findings support the development of more comprehensive stress assessment tools, although they are not intended to replace professional clinical evaluation.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100803"},"PeriodicalIF":4.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145624849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic discovery of robust risk groups from limited survival data across biomedical modalities 从生物医学模式的有限生存数据中自动发现稳健的风险群体
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2025-12-08 DOI: 10.1016/j.mlwa.2025.100814
Ethar Alzaid , George Wright , Mark Eastwood , Piotr Keller , Fayyaz Minhas
Survival prediction from medical data is often constrained by scarce labels, limiting the effectiveness of fully supervised models. In addition, most existing approaches produce deterministic risk scores without conveying reliability, which hinders interpretability and clinical trustworthiness. To address these challenges, we introduce T-SURE, a transductive survival ranking and risk-stratification framework that learns jointly from labeled and unlabeled patients to reduce dependence on large annotated cohorts. It also estimates a rejection score that identifies high-uncertainty cases, enabling selective abstention when confidence is low. T-SURE generates a single risk score that enables (1) patient ranking based on survival risk, (2) automatic assignment to risk groups, and (3) optional rejection of uncertain predictions. We extensively evaluated the model on pan-cancer datasets from The Cancer Genome Atlas (TCGA), using gene expression profiles, whole slide images, pathology reports, and clinical information. The model outperformed existing approaches in both ranking and risk stratification, especially in the limited labeled data regimen. It also showed consistent improvements in performance as uncertain samples were rejected, while maintaining statistically significant stratification across datasets. T-SURE integrates as a reliable component within computational pathology pipelines by guiding risk-specific therapeutic and monitoring decisions and flagging ambiguous or rare cases via a high rejection score for further investigation. To support reproducibility, the full implementation of T-SURE is publicly available at: (Anonymized).
基于医疗数据的生存预测通常受到稀缺标签的限制,从而限制了完全监督模型的有效性。此外,大多数现有方法产生的确定性风险评分没有传达可靠性,这阻碍了可解释性和临床可信度。为了解决这些挑战,我们引入了T-SURE,这是一个转导生存排名和风险分层框架,可以从标记和未标记的患者中共同学习,以减少对大型注释队列的依赖。它还估计一个拒绝分数,识别高不确定性的情况下,使选择性弃权时,信心是低的。T-SURE生成一个单一的风险评分,实现(1)基于生存风险的患者排名,(2)对风险组的自动分配,以及(3)对不确定预测的选择性拒绝。我们在来自癌症基因组图谱(TCGA)的泛癌症数据集上广泛评估了该模型,使用了基因表达谱、全幻灯片图像、病理报告和临床信息。该模型在排名和风险分层方面优于现有方法,特别是在有限的标记数据方案中。当不确定样本被拒绝时,它也显示出性能的持续改进,同时在数据集上保持统计学上显著的分层。T-SURE作为一个可靠的组件集成在计算病理学管道中,通过指导风险特异性治疗和监测决策,并通过高排斥评分标记不明确或罕见的病例,以供进一步研究。为了支持可重复性,T-SURE的完整实现公开可在:(匿名)。
{"title":"Automatic discovery of robust risk groups from limited survival data across biomedical modalities","authors":"Ethar Alzaid ,&nbsp;George Wright ,&nbsp;Mark Eastwood ,&nbsp;Piotr Keller ,&nbsp;Fayyaz Minhas","doi":"10.1016/j.mlwa.2025.100814","DOIUrl":"10.1016/j.mlwa.2025.100814","url":null,"abstract":"<div><div>Survival prediction from medical data is often constrained by scarce labels, limiting the effectiveness of fully supervised models. In addition, most existing approaches produce deterministic risk scores without conveying reliability, which hinders interpretability and clinical trustworthiness. To address these challenges, we introduce T-SURE, a transductive survival ranking and risk-stratification framework that learns jointly from labeled and unlabeled patients to reduce dependence on large annotated cohorts. It also estimates a rejection score that identifies high-uncertainty cases, enabling selective abstention when confidence is low. T-SURE generates a single risk score that enables (1) patient ranking based on survival risk, (2) automatic assignment to risk groups, and (3) optional rejection of uncertain predictions. We extensively evaluated the model on pan-cancer datasets from The Cancer Genome Atlas (TCGA), using gene expression profiles, whole slide images, pathology reports, and clinical information. The model outperformed existing approaches in both ranking and risk stratification, especially in the limited labeled data regimen. It also showed consistent improvements in performance as uncertain samples were rejected, while maintaining statistically significant stratification across datasets. T-SURE integrates as a reliable component within computational pathology pipelines by guiding risk-specific therapeutic and monitoring decisions and flagging ambiguous or rare cases via a high rejection score for further investigation. To support reproducibility, the full implementation of T-SURE is publicly available at: (Anonymized).</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"23 ","pages":"Article 100814"},"PeriodicalIF":4.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145748776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Machine learning with applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1