Dietary Patterns and Cancer Risk: An Overview with Focus on Methods

The New England Journal of Statistics in Data Science Pub Date : 2022-01-01 DOI:10.51387/23-nejsds35

V. Edefonti, R. De Vito, M. Parpinel, M. Ferraroni

{"title":"Dietary Patterns and Cancer Risk: An Overview with Focus on Methods","authors":"V. Edefonti, R. De Vito, M. Parpinel, M. Ferraroni","doi":"10.51387/23-nejsds35","DOIUrl":null,"url":null,"abstract":"Traditionally, research in nutritional epidemiology has focused on specific foods/food groups or single nutrients in their relation with disease outcomes, including cancer. Dietary pattern analysis have been introduced to examine potential cumulative and interactive effects of individual dietary components of the overall diet, in which foods are consumed in combination. Dietary patterns can be identified by using evidence-based investigator-defined approaches or by using data-driven approaches, which rely on either response independent (also named “a posteriori” dietary patterns) or response dependent (also named “mixed-type” dietary patterns) multivariate statistical methods. Within the open methodological challenges related to study design, dietary assessment, identification of dietary patterns, confounding phenomena, and cancer risk assessment, the current paper provides an updated landscape review of novel methodological developments in the statistical analysis of a posteriori/mixed-type dietary patterns and cancer risk. The review starts from standard a posteriori dietary patterns from principal component, factor, and cluster analyses, including mixture models, and examines mixed-type dietary patterns from reduced rank regression, partial least squares, classification and regression tree analysis, and least absolute shrinkage and selection operator. Novel statistical approaches reviewed include Bayesian factor analysis with modeling of sparsity through shrinkage and sparse priors and frequentist focused principal component analysis. Most novelties relate to the reproducibility of dietary patterns across studies where potentialities of the Bayesian approach to factor and cluster analysis work at best.","PeriodicalId":94360,"journal":{"name":"The New England Journal of Statistics in Data Science","volume":"25 3 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The New England Journal of Statistics in Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.51387/23-nejsds35","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Traditionally, research in nutritional epidemiology has focused on specific foods/food groups or single nutrients in their relation with disease outcomes, including cancer. Dietary pattern analysis have been introduced to examine potential cumulative and interactive effects of individual dietary components of the overall diet, in which foods are consumed in combination. Dietary patterns can be identified by using evidence-based investigator-defined approaches or by using data-driven approaches, which rely on either response independent (also named “a posteriori” dietary patterns) or response dependent (also named “mixed-type” dietary patterns) multivariate statistical methods. Within the open methodological challenges related to study design, dietary assessment, identification of dietary patterns, confounding phenomena, and cancer risk assessment, the current paper provides an updated landscape review of novel methodological developments in the statistical analysis of a posteriori/mixed-type dietary patterns and cancer risk. The review starts from standard a posteriori dietary patterns from principal component, factor, and cluster analyses, including mixture models, and examines mixed-type dietary patterns from reduced rank regression, partial least squares, classification and regression tree analysis, and least absolute shrinkage and selection operator. Novel statistical approaches reviewed include Bayesian factor analysis with modeling of sparsity through shrinkage and sparse priors and frequentist focused principal component analysis. Most novelties relate to the reproducibility of dietary patterns across studies where potentialities of the Bayesian approach to factor and cluster analysis work at best.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

饮食模式与癌症风险:以方法为重点的综述

传统上，营养流行病学的研究侧重于特定食物/食物组或单一营养素与疾病结局(包括癌症)的关系。饮食模式分析已被引入，以检查整体饮食中单个饮食成分的潜在累积和相互作用效应，其中食物被组合食用。饮食模式可以通过使用基于证据的研究者定义的方法或使用数据驱动的方法来确定，这些方法依赖于反应独立(也称为“后验”饮食模式)或反应依赖(也称为“混合型”饮食模式)的多变量统计方法。在与研究设计、饮食评估、饮食模式识别、混杂现象和癌症风险评估相关的开放式方法学挑战中，本文对后验/混合型饮食模式和癌症风险统计分析方面的新方法学发展进行了最新的综述。本综述从包括混合模型在内的主成分、因子和聚类分析的标准后验饮食模式开始，并从降秩回归、偏最小二乘法、分类和回归树分析、最小绝对收缩和选择算子等方法检验混合饮食模式。回顾了新的统计方法，包括贝叶斯因子分析，通过收缩和稀疏先验来建模稀疏性，以及频率集中的主成分分析。大多数新奇之处都与饮食模式的可重复性有关，在这些研究中，贝叶斯方法的潜力在因子和聚类分析中发挥了最大的作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

The New England Journal of Statistics in Data Science

自引率

0.00%

发文量

期刊最新文献

Practical Considerations for Variable Screening in the Super Learner. Meta-analysis of Censored Adverse Events. Biomarker Panel Development Using Logic Regression in the Presence of Missing Data. Modeling Multivariate Spatial Dependencies Using Graphical Models. Effect of model space priors on statistical inference with model uncertainty.