Elisa Frutos-Bernal, Eva Ceulemans, Purificación Galindo-Villardón, Tom F Wilderjans
{"title":"Data fusion by T3-PCA: A global model for the simultaneous analysis of coupled three-way and two-way real-valued data.","authors":"Elisa Frutos-Bernal, Eva Ceulemans, Purificación Galindo-Villardón, Tom F Wilderjans","doi":"10.1111/bmsp.12372","DOIUrl":null,"url":null,"abstract":"<p><p>In various areas of science, researchers try to gain insight into important processes by jointly analysing different datasets containing information regarding common aspects of these processes. For example, to explain individual differences in personality, researchers collect, for the same set of persons, data regarding behavioural signatures (i.e., the reaction profile of a person across different situations), on the one hand, and traits or dispositions, on the other hand. To uncover the processes underlying such coupled data, to all N-way <math> <semantics><mrow><mi>N</mi></mrow> <annotation>$$ N $$</annotation></semantics> </math> -mode data blocks simultaneously a global model is fitted, in which each data block is represented by an <math> <semantics><mrow><mi>N</mi></mrow> <annotation>$$ N $$</annotation></semantics> </math> -way <math> <semantics><mrow><mi>N</mi></mrow> <annotation>$$ N $$</annotation></semantics> </math> -mode decomposition model (e.g., principal component analysis [PCA], Parafac, Tucker3) and the parameters underlying the common mode are required to be the same for all data blocks this mode belongs to. To estimate the parameters underlying the common mode, a simultaneous strategy is used that pools the information present in all data blocks (i.e., data fusion). In this paper, we propose the T3-PCA model, which represents three- and two-way data with Tucker3 and PCA respectively. This model is less restrictive than the already proposed LMPCA model in which the three-way data block is decomposed according to a Parafac model. To estimate the T3-PCA model parameters, an alternating least-squares algorithm is proposed. The superior performance of the simultaneous T3-PCA strategy over a sequential strategy (i.e., estimating common parameters using information from the three-way data block only) is demonstrated in an extensive simulation study and an application to empirical coupled anxiety data.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":" ","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of Mathematical & Statistical Psychology","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1111/bmsp.12372","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
In various areas of science, researchers try to gain insight into important processes by jointly analysing different datasets containing information regarding common aspects of these processes. For example, to explain individual differences in personality, researchers collect, for the same set of persons, data regarding behavioural signatures (i.e., the reaction profile of a person across different situations), on the one hand, and traits or dispositions, on the other hand. To uncover the processes underlying such coupled data, to all N-way -mode data blocks simultaneously a global model is fitted, in which each data block is represented by an -way -mode decomposition model (e.g., principal component analysis [PCA], Parafac, Tucker3) and the parameters underlying the common mode are required to be the same for all data blocks this mode belongs to. To estimate the parameters underlying the common mode, a simultaneous strategy is used that pools the information present in all data blocks (i.e., data fusion). In this paper, we propose the T3-PCA model, which represents three- and two-way data with Tucker3 and PCA respectively. This model is less restrictive than the already proposed LMPCA model in which the three-way data block is decomposed according to a Parafac model. To estimate the T3-PCA model parameters, an alternating least-squares algorithm is proposed. The superior performance of the simultaneous T3-PCA strategy over a sequential strategy (i.e., estimating common parameters using information from the three-way data block only) is demonstrated in an extensive simulation study and an application to empirical coupled anxiety data.
在不同的科学领域,研究人员试图通过联合分析包含这些过程的共同方面的信息的不同数据集来深入了解重要的过程。例如,为了解释个性的个体差异,研究人员收集了同一组人的行为特征数据(即,一个人在不同情况下的反应概况),以及另一方面的特征或性格。为了揭示这些耦合数据背后的过程,以所有N-way N $$ N $$ -mode数据块同时拟合一个全局模型,其中每个数据块用N表示 $$ N $$ -way N $$ N $$ -模态分解模型(如主成分分析[PCA]、Parafac、Tucker3)和公共模态的底层参数对于该模态所属的所有数据块都要求相同。为了估计公共模式下的参数,使用了一种同步策略,将所有数据块中的信息集中在一起(即数据融合)。本文提出了T3-PCA模型,分别用Tucker3和PCA表示三向和双向数据。该模型比已经提出的LMPCA模型约束更少,在LMPCA模型中,根据Parafac模型对三向数据块进行分解。为了估计T3-PCA模型参数,提出了一种交替最小二乘算法。在广泛的模拟研究和经验耦合焦虑数据的应用中,证明了同步T3-PCA策略优于顺序策略(即仅使用来自三方数据块的信息估计共同参数)。
期刊介绍:
The British Journal of Mathematical and Statistical Psychology publishes articles relating to areas of psychology which have a greater mathematical or statistical aspect of their argument than is usually acceptable to other journals including:
• mathematical psychology
• statistics
• psychometrics
• decision making
• psychophysics
• classification
• relevant areas of mathematics, computing and computer software
These include articles that address substantitive psychological issues or that develop and extend techniques useful to psychologists. New models for psychological processes, new approaches to existing data, critiques of existing models and improved algorithms for estimating the parameters of a model are examples of articles which may be favoured.