Scalable data fusion via a scale-based hierarchical framework: Adapting to multi-source and multi-scale scenarios

IF 15.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Information Fusion Pub Date : 2025-02-01 Epub Date: 2024-09-12 DOI:10.1016/j.inffus.2024.102694

Xiaoyan Zhang, Jiajia Lin

{"title":"Scalable data fusion via a scale-based hierarchical framework: Adapting to multi-source and multi-scale scenarios","authors":"Xiaoyan Zhang, Jiajia Lin","doi":"10.1016/j.inffus.2024.102694","DOIUrl":null,"url":null,"abstract":"<div>Multi-source information fusion addresses challenges in integrating and transforming complementary data from diverse sources to facilitate unified information representation for centralized knowledge discovery. However, traditional methods face difficulties when applied to multi-scale data, where optimal scale selection can effectively resolve these issues but typically lack the advantage of identifying the optimal and simplest data from different data source relationships. Moreover, in multi-source, multi-scale environments, heterogeneous data (where identical samples have different features and scales in different sources) is prone to occur. To address these challenges, this study proposes a novel approach in two key stages: first, aggregating heterogeneous data sources and refining datasets using information gain; second, employing a customized Scale-based Tree (SbT) structure for each attribute to help extract specific scale information value, thereby achieving effective data fusion goals. Extensive experimental evaluations cover ten different datasets, reporting detailed performance across multiple metrics including Approximation Precision (AP), Approximation Quality (AQ) values, classification accuracy, and computational efficiency. The results highlight the robustness and effectiveness of our proposed algorithm in handling complex multi-source, multi-scale data environments, demonstrating its potential and practicality in addressing real-world data fusion challenges.</div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"114 ","pages":"Article 102694"},"PeriodicalIF":15.5000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156625352400472X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/9/12 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Multi-source information fusion addresses challenges in integrating and transforming complementary data from diverse sources to facilitate unified information representation for centralized knowledge discovery. However, traditional methods face difficulties when applied to multi-scale data, where optimal scale selection can effectively resolve these issues but typically lack the advantage of identifying the optimal and simplest data from different data source relationships. Moreover, in multi-source, multi-scale environments, heterogeneous data (where identical samples have different features and scales in different sources) is prone to occur. To address these challenges, this study proposes a novel approach in two key stages: first, aggregating heterogeneous data sources and refining datasets using information gain; second, employing a customized Scale-based Tree (SbT) structure for each attribute to help extract specific scale information value, thereby achieving effective data fusion goals. Extensive experimental evaluations cover ten different datasets, reporting detailed performance across multiple metrics including Approximation Precision (AP), Approximation Quality (AQ) values, classification accuracy, and computational efficiency. The results highlight the robustness and effectiveness of our proposed algorithm in handling complex multi-source, multi-scale data environments, demonstrating its potential and practicality in addressing real-world data fusion challenges.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过基于规模的分层框架实现可扩展的数据融合：适应多源和多尺度场景

多源信息融合解决了整合和转换来自不同来源的互补数据的难题，从而为集中式知识发现提供统一的信息表示。然而，传统方法在应用于多尺度数据时面临着困难，最佳尺度选择可以有效解决这些问题，但通常缺乏从不同数据源关系中识别最佳和最简单数据的优势。此外，在多源多尺度环境中，容易出现异构数据（相同样本在不同源中具有不同特征和尺度）。为了应对这些挑战，本研究提出了一种分两个关键阶段的新方法：第一，聚合异构数据源，利用信息增益提炼数据集；第二，为每个属性采用定制的基于尺度的树（SbT）结构，帮助提取特定的尺度信息值，从而实现有效的数据融合目标。广泛的实验评估涵盖了十个不同的数据集，报告了多个指标的详细性能，包括近似精度（AP）、近似质量（AQ）值、分类准确性和计算效率。这些结果凸显了我们提出的算法在处理复杂的多源、多尺度数据环境时的稳健性和有效性，证明了它在应对现实世界数据融合挑战方面的潜力和实用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Information Fusion 工程技术-计算机：理论方法

CiteScore

33.20

自引率

4.30%

发文量

161

审稿时长

7.9 months

期刊介绍： Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.