Genetic Epidemiology最新文献_第6页

Ethical, Legal, and Social Implications of Gene-Environment Interaction Research 基因与环境相互作用研究的伦理、法律和社会影响。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2024-09-24 DOI: 10.1002/gepi.22591

Stephanie Calluori, Kaitlin Kirkpatrick Heimke, Charlisse Caga-anan, David Kaufman, Leah E. Mechanic, Kimberly A. McAllister

Many complex disorders are impacted by the interplay of genetic and environmental factors. In gene-environment interactions (GxE), an individual's genetic and epigenetic makeup impacts the response to environmental exposures. Understanding GxE can impact health at the individual, community, and population levels. The rapid expansion of GxE research in biomedical studies for complex diseases raises many unique ethical, legal, and social implications (ELSIs) that have not been extensively explored and addressed. This review article builds on discussions originating from a workshop held by the National Institute of Environmental Health Sciences (NIEHS) and the National Human Genome Research Institute (NHGRI) in January 2022, entitled: “Ethical, Legal, and Social Implications of Gene-Environment Interaction Research.” We expand upon multiple key themes to inform broad recommendations and general guidance for addressing some of the most unique and challenging ELSI in GxE research. Key takeaways include strategies and approaches for establishing sustainable community partnerships, incorporating social determinants of health and environmental justice considerations into GxE research, effectively communicating and translating GxE findings, and addressing privacy and discrimination concerns in all GxE research going forward. Additional guidelines, resources, approaches, training, and capacity building are required to further support innovative GxE research and multidisciplinary GxE research teams.

许多复杂的疾病都受到遗传和环境因素相互作用的影响。在基因-环境相互作用（GxE）中，个体的基因和表观遗传构成会影响对环境暴露的反应。了解 GxE 可以影响个人、社区和人群的健康。在针对复杂疾病的生物医学研究中，GxE 研究的迅速扩展引发了许多独特的伦理、法律和社会影响 (ELSI)，而这些影响尚未得到广泛的探讨和解决。这篇综述文章以美国国家环境健康科学研究所（NIEHS）和美国国家人类基因组研究所（NHGRI）于 2022 年 1 月举办的题为 "基因-环境相互作用研究的伦理、法律和社会影响 "的研讨会上的讨论为基础。我们扩展了多个关键主题，为解决基因与环境相互作用研究中一些最独特、最具挑战性的 ELSI 问题提供了广泛的建议和一般指导。主要收获包括建立可持续社区伙伴关系的策略和方法、将健康的社会决定因素和环境正义因素纳入 GxE 研究、有效交流和转化 GxE 研究结果，以及在今后的所有 GxE 研究中解决隐私和歧视问题。需要更多的指导方针、资源、方法、培训和能力建设，以进一步支持创新性 GxE 研究和多学科 GxE 研究团队。

{"title":"Ethical, Legal, and Social Implications of Gene-Environment Interaction Research","authors":"Stephanie Calluori, Kaitlin Kirkpatrick Heimke, Charlisse Caga-anan, David Kaufman, Leah E. Mechanic, Kimberly A. McAllister","doi":"10.1002/gepi.22591","DOIUrl":"10.1002/gepi.22591","url":null,"abstract":"<p>Many complex disorders are impacted by the interplay of genetic and environmental factors. In gene-environment interactions (GxE), an individual's genetic and epigenetic makeup impacts the response to environmental exposures. Understanding GxE can impact health at the individual, community, and population levels. The rapid expansion of GxE research in biomedical studies for complex diseases raises many unique ethical, legal, and social implications (ELSIs) that have not been extensively explored and addressed. This review article builds on discussions originating from a workshop held by the National Institute of Environmental Health Sciences (NIEHS) and the National Human Genome Research Institute (NHGRI) in January 2022, entitled: “Ethical, Legal, and Social Implications of Gene-Environment Interaction Research.” We expand upon multiple key themes to inform broad recommendations and general guidance for addressing some of the most unique and challenging ELSI in GxE research. Key takeaways include strategies and approaches for establishing sustainable community partnerships, incorporating social determinants of health and environmental justice considerations into GxE research, effectively communicating and translating GxE findings, and addressing privacy and discrimination concerns in all GxE research going forward. Additional guidelines, resources, approaches, training, and capacity building are required to further support innovative GxE research and multidisciplinary GxE research teams.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22591","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142307513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

PSAP-Genomic-Regions: A Method Leveraging Population Data to Prioritize Coding and Non-Coding Variants in Whole Genome Sequencing for Rare Disease Diagnosis PSAP-Genomic-Regions：利用群体数据优先处理全基因组测序中编码和非编码变异以诊断罕见病的方法。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2024-09-24 DOI: 10.1002/gepi.22593

Marie-Sophie C. Ogloblinsky, Ozvan Bocher, Chaker Aloui, Anne-Louise Leutenegger, Ozan Ozisik, Anaïs Baudot, Elisabeth Tournier-Lasserve, Helen Castillo-Madeen, Daniel Lewinsohn, Donald F. Conrad, Emmanuelle Génin, Gaëlle Marenne

The introduction of Next-Generation Sequencing technologies in the clinics has improved rare disease diagnosis. Nonetheless, for very heterogeneous or very rare diseases, more than half of cases still lack molecular diagnosis. Novel strategies are needed to prioritize variants within a single individual. The Population Sampling Probability (PSAP) method was developed to meet this aim but only for coding variants in exome data. Here, we propose an extension of the PSAP method to the non-coding genome called PSAP-genomic-regions. In this extension, instead of considering genes as testing units (PSAP-genes strategy), we use genomic regions defined over the whole genome that pinpoint potential functional constraints. We conceived an evaluation protocol for our method using artificially generated disease exomes and genomes, by inserting coding and non-coding pathogenic ClinVar variants in large data sets of exomes and genomes from the general population. PSAP-genomic-regions significantly improves the ranking of these variants compared to using a pathogenicity score alone. Using PSAP-genomic-regions, more than 50% of non-coding ClinVar variants were among the top 10 variants of the genome. On real sequencing data from six patients with Cerebral Small Vessel Disease and nine patients with male infertility, all causal variants were ranked in the top 100 variants with PSAP-genomic-regions. By revisiting the testing units used in the PSAP method to include non-coding variants, we have developed PSAP-genomic-regions, an efficient whole-genome prioritization tool which offers promising results for the diagnosis of unresolved rare diseases.

下一代测序技术在临床上的应用改善了罕见病的诊断。然而，对于非常异质性或非常罕见的疾病，仍有一半以上的病例缺乏分子诊断。我们需要新的策略来确定单个个体内变异的优先次序。为实现这一目标，我们开发了群体采样概率（PSAP）方法，但该方法仅适用于外显子组数据中的编码变异。在这里，我们提出将 PSAP 方法扩展到非编码基因组，称为 PSAP-基因组-区域。在这一扩展中，我们不再将基因作为测试单元（PSAP-基因策略），而是使用定义在整个基因组上的基因组区域，以精确定位潜在的功能限制。我们设想了一个评估方案，利用人工生成的疾病外显子组和基因组，将编码和非编码致病性 ClinVar 变异插入来自普通人群的外显子组和基因组的大型数据集中，对我们的方法进行评估。与单独使用致病性评分相比，PSAP-基因组-区域能显著提高这些变异体的排序。使用 PSAP 基因组区域，50% 以上的 ClinVar 非编码变异跻身基因组前 10 个变异之列。在六名脑小血管疾病患者和九名男性不育症患者的真实测序数据中，所有因果变异都进入了使用 PSAP 基因组区域的前 100 个变异之列。通过重新审视 PSAP 方法中使用的检测单元，将非编码变异纳入其中，我们开发出了 PSAP-基因组-区域这一高效的全基因组优先排序工具，为诊断未解决的罕见病提供了可喜的成果。

{"title":"PSAP-Genomic-Regions: A Method Leveraging Population Data to Prioritize Coding and Non-Coding Variants in Whole Genome Sequencing for Rare Disease Diagnosis","authors":"Marie-Sophie C. Ogloblinsky, Ozvan Bocher, Chaker Aloui, Anne-Louise Leutenegger, Ozan Ozisik, Anaïs Baudot, Elisabeth Tournier-Lasserve, Helen Castillo-Madeen, Daniel Lewinsohn, Donald F. Conrad, Emmanuelle Génin, Gaëlle Marenne","doi":"10.1002/gepi.22593","DOIUrl":"10.1002/gepi.22593","url":null,"abstract":"<div>\u0000 \u0000 <p>The introduction of Next-Generation Sequencing technologies in the clinics has improved rare disease diagnosis. Nonetheless, for very heterogeneous or very rare diseases, more than half of cases still lack molecular diagnosis. Novel strategies are needed to prioritize variants within a single individual. The Population Sampling Probability (PSAP) method was developed to meet this aim but only for coding variants in exome data. Here, we propose an extension of the PSAP method to the non-coding genome called PSAP-genomic-regions. In this extension, instead of considering genes as testing units (PSAP-genes strategy), we use genomic regions defined over the whole genome that pinpoint potential functional constraints. We conceived an evaluation protocol for our method using artificially generated disease exomes and genomes, by inserting coding and non-coding pathogenic ClinVar variants in large data sets of exomes and genomes from the general population. PSAP-genomic-regions significantly improves the ranking of these variants compared to using a pathogenicity score alone. Using PSAP-genomic-regions, more than 50% of non-coding ClinVar variants were among the top 10 variants of the genome. On real sequencing data from six patients with Cerebral Small Vessel Disease and nine patients with male infertility, all causal variants were ranked in the top 100 variants with PSAP-genomic-regions. By revisiting the testing units used in the PSAP method to include non-coding variants, we have developed PSAP-genomic-regions, an efficient whole-genome prioritization tool which offers promising results for the diagnosis of unresolved rare diseases.</p></div>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142345037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comparing Ancestry Standardization Approaches for a Transancestry Colorectal Cancer Polygenic Risk Score 比较跨宗族结直肠癌多基因风险评分的宗族标准化方法。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2024-09-24 DOI: 10.1002/gepi.22590

Elisabeth A. Rosenthal, Li Hsu, Minta Thomas, Ulrike Peters, Christopher Kachulis, Karynne Patterson, Gail P. Jarvik

Colorectal cancer (CRC) is a complex disease with monogenic, polygenic and environmental risk factors. Polygenic risk scores (PRSs) aim to identify high polygenic risk individuals. Due to differences in genetic background, PRS distributions vary by ancestry, necessitating standardization. We compared four post-hoc methods using the All of Us Research Program Whole Genome Sequence data for a transancestry CRC PRS. We contrasted results from linear models trained on A. the entire data or an ancestrally diverse subset AND B. covariates including principal components of ancestry or admixture. Standardization with the training subset also adjusted the variance. All methods performed similarly within ancestry, OR (95% C.I.) per s.d. change in PRS: African 1.5 (1.02, 2.08), Admixed American 2.2 (1.27, 3.85), European 1.6 (1.43, 1.89), and Middle Eastern 1.1 (0.71, 1.63). Using admixture and an ancestrally diverse training set provided distributions closest to standard Normal. Training a model on ancestrally diverse participants, adjusting both the mean and variance using admixture as covariates, created standard Normal z-scores, which can be used to identify patients at high polygenic risk. These scores can be incorporated into comprehensive risk calculation including other known risk factors, allowing for more precise risk estimates.

结直肠癌（CRC）是一种复杂的疾病，具有单基因、多基因和环境风险因素。多基因风险评分（PRS）旨在识别多基因高风险个体。由于遗传背景的差异，PRS 的分布因血统而异，因此有必要进行标准化。我们使用 "全人类研究计划 "的全基因组序列数据比较了四种用于跨血统 CRC PRS 的事后分析方法。我们对比了 A. 整个数据或祖先多样性子集和 B. 辅变量（包括祖先或混血的主成分）所训练的线性模型的结果。用训练子集进行标准化还可以调整方差。所有方法在祖先、PRS 每 s.d. 变化的 OR（95% C.I.）方面的表现相似：非洲人 1.5 (1.02, 2.08)，混血美国人 2.2 (1.27, 3.85)，欧洲人 1.6 (1.43, 1.89)，中东人 1.1 (0.71, 1.63)。使用掺杂和祖先多样化的训练集提供了最接近标准正态分布的分布。对祖先多样化的参与者进行模型训练，使用掺杂作为协变量来调整均值和方差，可得到标准正态 Z 值，用于识别多基因高风险患者。这些分数可以纳入包括其他已知风险因素在内的综合风险计算中，从而得出更精确的风险估计值。

{"title":"Comparing Ancestry Standardization Approaches for a Transancestry Colorectal Cancer Polygenic Risk Score","authors":"Elisabeth A. Rosenthal, Li Hsu, Minta Thomas, Ulrike Peters, Christopher Kachulis, Karynne Patterson, Gail P. Jarvik","doi":"10.1002/gepi.22590","DOIUrl":"10.1002/gepi.22590","url":null,"abstract":"<div>\u0000 \u0000 <p>Colorectal cancer (CRC) is a complex disease with monogenic, polygenic and environmental risk factors. Polygenic risk scores (PRSs) aim to identify high polygenic risk individuals. Due to differences in genetic background, PRS distributions vary by ancestry, necessitating standardization. We compared four <i>post-hoc</i> methods using the All of Us Research Program Whole Genome Sequence data for a transancestry CRC PRS. We contrasted results from linear models trained on A. the entire data or an ancestrally diverse subset AND B. covariates including principal components of ancestry or admixture. Standardization with the training subset also adjusted the variance. All methods performed similarly within ancestry, OR (95% C.I.) per s.d. change in PRS: African 1.5 (1.02, 2.08), Admixed American 2.2 (1.27, 3.85), European 1.6 (1.43, 1.89), and Middle Eastern 1.1 (0.71, 1.63). Using admixture and an ancestrally diverse training set provided distributions closest to standard Normal. Training a model on ancestrally diverse participants, adjusting both the mean and variance using admixture as covariates, created standard Normal <i>z</i>-scores, which can be used to identify patients at high polygenic risk. These scores can be incorporated into comprehensive risk calculation including other known risk factors, allowing for more precise risk estimates.</p>\u0000 </div>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142307512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The 2024 Annual Meeting of the International Genetic Epidemiology Society 国际遗传流行病学学会 2024 年年会。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2024-09-23 DOI: 10.1002/gepi.22598

引用次数: 0

Predicting Lung Cancer in Korean Never-Smokers With Polygenic Risk Scores 用多基因风险评分预测韩国从不吸烟者的肺癌发病率

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2024-09-23 DOI: 10.1002/gepi.22586

Juyeon Kim, Young Sik Park, Jin Hee Kim, Yun-Chul Hong, Young-Chul Kim, In-Jae Oh, Sun Ha Jee, Myung-Ju Ahn, Jong-Won Kim, Jae-Joon Yim, Sungho Won

In the last few decades, genome-wide association studies (GWAS) with more than 10,000 subjects have identified several loci associated with lung cancer and these loci have been used to develop novel risk prediction tools for cancer. The present study aimed to establish a lung cancer prediction model for Korean never-smokers using polygenic risk scores (PRSs); PRSs were calculated using a pruning-thresholding-based approach based on 11 genome-wide significant single nucleotide polymorphisms (SNPs). Overall, the odds ratios tended to increase as PRSs were larger, with the odds ratio of the top 5% PRSs being 1.71 (95% confidence interval: 1.31–2.23) using the 40%–60% percentile group as the reference, and the area under the curve (AUC) of the prediction model being of 0.76 (95% confidence interval: 0.747–0.774). The receiver operating characteristic (ROC) curves of the prediction model with and without PRSs as covariates were compared using DeLong's test, and a significant difference was observed. Our results suggest that PRSs can be valuable tools for predicting the risk of lung cancer.

在过去的几十年里，对超过 10,000 名受试者进行的全基因组关联研究（GWAS）发现了多个与肺癌相关的基因位点，这些基因位点已被用于开发新型癌症风险预测工具。本研究旨在利用多基因风险评分（PRSs）为韩国从不吸烟者建立一个肺癌预测模型；PRSs 是基于 11 个全基因组重要的单核苷酸多态性（SNPs），采用剪枝-阈值法计算得出的。总体而言，PRS越大，几率比越大，以40%-60%百分位数组为参照，前5% PRS的几率比为1.71（95%置信区间：1.31-2.23），预测模型的曲线下面积（AUC）为0.76（95%置信区间：0.747-0.774）。使用 DeLong 检验比较了以 PRS 为辅变量和不以 PRS 为辅变量的预测模型的接收器操作特征曲线（ROC），结果发现两者有显著差异。我们的研究结果表明，PRS 是预测肺癌风险的重要工具。

{"title":"Predicting Lung Cancer in Korean Never-Smokers With Polygenic Risk Scores","authors":"Juyeon Kim, Young Sik Park, Jin Hee Kim, Yun-Chul Hong, Young-Chul Kim, In-Jae Oh, Sun Ha Jee, Myung-Ju Ahn, Jong-Won Kim, Jae-Joon Yim, Sungho Won","doi":"10.1002/gepi.22586","DOIUrl":"10.1002/gepi.22586","url":null,"abstract":"<div>\u0000 \u0000 <p>In the last few decades, genome-wide association studies (GWAS) with more than 10,000 subjects have identified several loci associated with lung cancer and these loci have been used to develop novel risk prediction tools for cancer. The present study aimed to establish a lung cancer prediction model for Korean never-smokers using polygenic risk scores (PRSs); PRSs were calculated using a pruning-thresholding-based approach based on 11 genome-wide significant single nucleotide polymorphisms (SNPs). Overall, the odds ratios tended to increase as PRSs were larger, with the odds ratio of the top 5% PRSs being 1.71 (95% confidence interval: 1.31–2.23) using the 40%–60% percentile group as the reference, and the area under the curve (AUC) of the prediction model being of 0.76 (95% confidence interval: 0.747–0.774). The receiver operating characteristic (ROC) curves of the prediction model with and without PRSs as covariates were compared using DeLong's test, and a significant difference was observed. Our results suggest that PRSs can be valuable tools for predicting the risk of lung cancer.</p>\u0000 </div>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142284342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Exploring and Accounting for Genetically Driven Effect Heterogeneity in Mendelian Randomization 探索和解释孟德尔随机化中基因驱动的效应异质性。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2024-09-22 DOI: 10.1002/gepi.22587

Annika Jaitner, Krasimira Tsaneva-Atanasova, Rachel M. Freathy, Jack Bowden

Mendelian randomization (MR) is a framework to estimate the causal effect of a modifiable health exposure, drug target or pharmaceutical intervention on a downstream outcome by using genetic variants as instrumental variables. A crucial assumption allowing estimation of the average causal effect in MR, termed homogeneity, is that the causal effect does not vary across levels of any instrument used in the analysis. In contrast, the science of pharmacogenetics seeks to actively uncover and exploit genetically driven effect heterogeneity for the purposes of precision medicine. In this study, we consider a recently proposed method for performing pharmacogenetic analysis on observational data—the Triangulation WIthin a STudy (TWIST) framework—and explore how it can be combined with traditional MR approaches to properly characterise average causal effects and genetically driven effect heterogeneity. We propose two new methods which not only estimate the genetically driven effect heterogeneity but also enable the estimation of a causal effect in the genetic group with and without the risk allele separately. Both methods utilise homogeneity-respecting and homogeneity-violating genetic variants and rely on a different set of assumptions. Using data from the ALSPAC study, we apply our new methods to estimate the causal effect of smoking before and during pregnancy on offspring birth weight in mothers whose genetics mean they find it (relatively) easier or harder to quit smoking.

孟德尔随机化（MR）是一种利用遗传变异作为工具变量来估算可改变的健康暴露、药物目标或药物干预对下游结果的因果效应的框架。在 MR 中，估算平均因果效应的一个重要假设（称为同质性）是，因果效应不会因分析中使用的任何工具的不同水平而变化。与此相反，药物遗传学试图积极发现和利用基因驱动的效应异质性，以实现精准医疗的目的。在本研究中，我们考虑了最近提出的一种对观察数据进行药物遗传学分析的方法--TWIST（Triangulation WIthin a STudy）框架--并探讨了如何将其与传统的 MR 方法相结合，以正确描述平均因果效应和基因驱动的效应异质性。我们提出了两种新方法，它们不仅能估算基因驱动效应异质性，还能分别估算有风险等位基因和无风险等位基因基因组的因果效应。这两种方法都利用了尊重同质性和违反同质性的遗传变异，并依赖于不同的假设。利用 ALSPAC 研究的数据，我们运用新方法估算了母亲在怀孕前和怀孕期间吸烟对后代出生体重的因果效应，这些母亲的遗传意味着戒烟（相对）更容易或更困难。

{"title":"Exploring and Accounting for Genetically Driven Effect Heterogeneity in Mendelian Randomization","authors":"Annika Jaitner, Krasimira Tsaneva-Atanasova, Rachel M. Freathy, Jack Bowden","doi":"10.1002/gepi.22587","DOIUrl":"10.1002/gepi.22587","url":null,"abstract":"<p>Mendelian randomization (MR) is a framework to estimate the causal effect of a modifiable health exposure, drug target or pharmaceutical intervention on a downstream outcome by using genetic variants as instrumental variables. A crucial assumption allowing estimation of the average causal effect in MR, termed <i>homogeneity</i>, is that the causal effect does not vary across levels of any instrument used in the analysis. In contrast, the science of pharmacogenetics seeks to actively uncover and exploit genetically driven effect heterogeneity for the purposes of precision medicine. In this study, we consider a recently proposed method for performing pharmacogenetic analysis on observational data—the Triangulation WIthin a STudy (TWIST) framework—and explore how it can be combined with traditional MR approaches to properly characterise average causal effects and genetically driven effect heterogeneity. We propose two new methods which not only estimate the genetically driven effect heterogeneity but also enable the estimation of a causal effect in the genetic group with and without the risk allele separately. Both methods utilise homogeneity-respecting and homogeneity-violating genetic variants and rely on a different set of assumptions. Using data from the ALSPAC study, we apply our new methods to estimate the causal effect of smoking before and during pregnancy on offspring birth weight in mothers whose genetics mean they find it (relatively) easier or harder to quit smoking.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22587","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142284341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Using clustering of genetic variants in Mendelian randomization to interrogate the causal pathways underlying multimorbidity from a common risk factor 利用孟德尔随机化中的遗传变异聚类，从一个共同的风险因素出发，探究多病致病的因果途径。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2024-08-13 DOI: 10.1002/gepi.22582

Xiaoran Liang, Ninon Mounier, Nicolas Apfel, Sara Khalid, Timothy M. Frayling, Jack Bowden

Mendelian randomization (MR) is an epidemiological approach that utilizes genetic variants as instrumental variables to estimate the causal effect of an exposure on a health outcome. This paper investigates an MR scenario in which genetic variants aggregate into clusters that identify heterogeneous causal effects. Such variant clusters are likely to emerge if they affect the exposure and outcome via distinct biological pathways. In the multi-outcome MR framework, where a shared exposure causally impacts several disease outcomes simultaneously, these variant clusters can provide insights into the common disease-causing mechanisms underpinning the co-occurrence of multiple long-term conditions, a phenomenon known as multimorbidity. To identify such variant clusters, we adapt the general method of agglomerative hierarchical clustering to multi-sample summary-data MR setup, enabling cluster detection based on variant-specific ratio estimates. Particularly, we tailor the method for multi-outcome MR to aid in elucidating the causal pathways through which a common risk factor contributes to multiple morbidities. We show in simulations that our “MR-AHC” method detects clusters with high accuracy, outperforming the existing methods. We apply the method to investigate the causal effects of high body fat percentage on type 2 diabetes and osteoarthritis, uncovering interconnected cellular processes underlying this multimorbid disease pair.

孟德尔随机化（MR）是一种流行病学方法，它利用遗传变异作为工具变量来估计暴露对健康结果的因果效应。本文研究了一种 MR 情景，在这种情景中，遗传变异聚集成群，从而确定了异质性因果效应。如果基因变异通过不同的生物途径影响暴露和结果，就有可能出现这种变异集群。在多结果 MR 框架中，共同的暴露会同时对几种疾病结果产生因果影响，这些变异集群可以让人们深入了解多种长期病症并发的共同致病机制，这种现象被称为多病共存。为了识别这种变异集群，我们将聚类分层聚类的一般方法调整为多样本汇总数据磁共振设置，从而能够根据变异特异性比率估计值进行集群检测。特别是，我们为多结果 MR 定制了方法，以帮助阐明一个共同风险因素导致多种疾病的因果途径。我们的模拟结果表明，我们的 "MR-AHC "方法能高精度地检测到集群，优于现有方法。我们应用该方法研究了高体脂率对 2 型糖尿病和骨关节炎的因果效应，揭示了这对多病组合背后相互关联的细胞过程。

{"title":"Using clustering of genetic variants in Mendelian randomization to interrogate the causal pathways underlying multimorbidity from a common risk factor","authors":"Xiaoran Liang, Ninon Mounier, Nicolas Apfel, Sara Khalid, Timothy M. Frayling, Jack Bowden","doi":"10.1002/gepi.22582","DOIUrl":"10.1002/gepi.22582","url":null,"abstract":"<p>Mendelian randomization (MR) is an epidemiological approach that utilizes genetic variants as instrumental variables to estimate the causal effect of an exposure on a health outcome. This paper investigates an MR scenario in which genetic variants aggregate into clusters that identify heterogeneous causal effects. Such variant clusters are likely to emerge if they affect the exposure and outcome via distinct biological pathways. In the multi-outcome MR framework, where a shared exposure causally impacts several disease outcomes simultaneously, these variant clusters can provide insights into the common disease-causing mechanisms underpinning the co-occurrence of multiple long-term conditions, a phenomenon known as multimorbidity. To identify such variant clusters, we adapt the general method of agglomerative hierarchical clustering to multi-sample summary-data MR setup, enabling cluster detection based on variant-specific ratio estimates. Particularly, we tailor the method for multi-outcome MR to aid in elucidating the causal pathways through which a common risk factor contributes to multiple morbidities. We show in simulations that our “MR-AHC” method detects clusters with high accuracy, outperforming the existing methods. We apply the method to investigate the causal effects of high body fat percentage on type 2 diabetes and osteoarthritis, uncovering interconnected cellular processes underlying this multimorbid disease pair.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22582","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141975524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Exploring pleiotropy in Mendelian randomisation analyses: What are genetic variants associated with ‘cigarette smoking initiation’ really capturing? 探索孟德尔随机分析中的多义性：与 "开始吸烟 "相关的基因变异到底在捕捉什么？

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2024-08-04 DOI: 10.1002/gepi.22583

Zoe E. Reed, Robyn E. Wootton, Jasmine N. Khouja, Tom G. Richardson, Eleanor Sanderson, George Davey Smith, Marcus R. Munafò

Genetic variants used as instruments for exposures in Mendelian randomisation (MR) analyses may have horizontal pleiotropic effects (i.e., influence outcomes via pathways other than through the exposure), which can undermine the validity of results. We examined the extent of this using smoking behaviours as an example. We first ran a phenome-wide association study in UK Biobank, using a smoking initiation genetic instrument. From the most strongly associated phenotypes, we selected those we considered could either plausibly or not plausibly be caused by smoking. We examined associations between genetic instruments for smoking initiation, smoking heaviness and lifetime smoking and these phenotypes in UK Biobank and the Avon Longitudinal Study of Parents and Children (ALSPAC). We conducted negative control analyses among never smokers, including children. We found evidence that smoking-related genetic instruments were associated with phenotypes not plausibly caused by smoking in UK Biobank and (to a lesser extent) ALSPAC. We observed associations with phenotypes among never smokers. Our results demonstrate that smoking-related genetic risk scores are associated with unexpected phenotypes that are less plausibly downstream of smoking. This may reflect horizontal pleiotropy in these genetic risk scores, and we would encourage researchers to exercise caution this when using these and genetic risk scores for other complex behavioural exposures. We outline approaches that could be taken to consider this and overcome issues caused by potential horizontal pleiotropy, for example, in genetically informed causal inference analyses (e.g., MR) it is important to consider negative control outcomes and triangulation approaches, to avoid arriving at incorrect conclusions.

在孟德尔随机化（MR）分析中，作为暴露工具的基因变异可能会产生水平多向效应（即通过暴露以外的途径影响结果），这可能会损害结果的有效性。我们以吸烟行为为例，研究了这种影响的程度。我们首先在英国生物库中使用吸烟起始基因工具进行了全表型关联研究。从关联性最强的表型中，我们选择了那些我们认为可能由吸烟引起或不可能由吸烟引起的表型。我们研究了英国生物数据库和雅芳父母与子女纵向研究（ALSPAC）中吸烟起始、吸烟量和终生吸烟的基因工具与这些表型之间的关联。我们对包括儿童在内的从不吸烟者进行了阴性对照分析。我们发现有证据表明，在英国生物数据库和（在较小程度上）ALSPAC 中，与吸烟相关的遗传工具与非由吸烟引起的表型相关。我们观察到从未吸烟者的表型与吸烟相关。我们的研究结果表明，与吸烟相关的遗传风险评分与意外的表型相关，而这些表型不太可能是吸烟的下游因素。这可能反映了这些遗传风险评分的横向多效性，我们鼓励研究人员在将这些评分和遗传风险评分用于其他复杂的行为暴露时谨慎行事。我们概述了可以采取哪些方法来考虑这一点并克服潜在的横向褶积性所造成的问题，例如，在遗传信息因果推断分析（如 MR）中，必须考虑负对照结果和三角测量方法，以避免得出不正确的结论。

{"title":"Exploring pleiotropy in Mendelian randomisation analyses: What are genetic variants associated with ‘cigarette smoking initiation’ really capturing?","authors":"Zoe E. Reed, Robyn E. Wootton, Jasmine N. Khouja, Tom G. Richardson, Eleanor Sanderson, George Davey Smith, Marcus R. Munafò","doi":"10.1002/gepi.22583","DOIUrl":"10.1002/gepi.22583","url":null,"abstract":"<p>Genetic variants used as instruments for exposures in Mendelian randomisation (MR) analyses may have horizontal pleiotropic effects (i.e., influence outcomes via pathways other than through the exposure), which can undermine the validity of results. We examined the extent of this using smoking behaviours as an example. We first ran a phenome-wide association study in UK Biobank, using a smoking initiation genetic instrument. From the most strongly associated phenotypes, we selected those we considered could either plausibly or not plausibly be caused by smoking. We examined associations between genetic instruments for smoking initiation, smoking heaviness and lifetime smoking and these phenotypes in UK Biobank and the Avon Longitudinal Study of Parents and Children (ALSPAC). We conducted negative control analyses among never smokers, including children. We found evidence that smoking-related genetic instruments were associated with phenotypes not plausibly caused by smoking in UK Biobank and (to a lesser extent) ALSPAC. We observed associations with phenotypes among never smokers. Our results demonstrate that smoking-related genetic risk scores are associated with unexpected phenotypes that are less plausibly downstream of smoking. This may reflect horizontal pleiotropy in these genetic risk scores, and we would encourage researchers to exercise caution this when using these and genetic risk scores for other complex behavioural exposures. We outline approaches that could be taken to consider this and overcome issues caused by potential horizontal pleiotropy, for example, in genetically informed causal inference analyses (e.g., MR) it is important to consider negative control outcomes and triangulation approaches, to avoid arriving at incorrect conclusions.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7616876/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141888953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Use of genetic correlations to examine selection bias 利用基因相关性研究选择偏差。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2024-07-30 DOI: 10.1002/gepi.22584

Chin Yang Shapland, Apostolos Gkatzionis, Gibran Hemani, Kate Tilling

Observational studies are rarely representative of their target population because there are known and unknown factors that affect an individual's choice to participate (the selection mechanism). Selection can cause bias in a given analysis if the outcome is related to selection (conditional on the other variables in the model). Detecting and adjusting for selection bias in practice typically requires access to data on nonselected individuals. Here, we propose methods to detect selection bias in genetic studies by comparing correlations among genetic variants in the selected sample to those expected under no selection. We examine the use of four hypothesis tests to identify induced associations between genetic variants in the selected sample. We evaluate these approaches in Monte Carlo simulations. Finally, we use these approaches in an applied example using data from the UK Biobank (UKBB). The proposed tests suggested an association between alcohol consumption and selection into UKBB. Hence, UKBB analyses with alcohol consumption as the exposure or outcome may be biased by this selection.

观察性研究很少能代表其目标人群，因为有已知和未知的因素会影响个人对参与的选择（选择机制）。如果结果与选择有关（以模型中的其他变量为条件），选择就会导致特定分析出现偏差。在实践中，检测和调整选择偏差通常需要获取非选择个体的数据。在此，我们提出了在遗传研究中检测选择偏倚的方法，即比较所选样本中遗传变异的相关性和无选择情况下的相关性。我们研究了使用四种假设检验来识别所选样本中遗传变异之间的诱导关联。我们在蒙特卡罗模拟中对这些方法进行了评估。最后，我们利用英国生物库 (UKBB) 的数据将这些方法用于一个应用实例中。所提出的测试表明，酒精消费与英国生物库的选择之间存在关联。因此，以酒精消费为暴露或结果的英国生物库分析可能会因这种选择而产生偏差。

引用次数: 0

Polygenic hazard score models for the prediction of Alzheimer's free survival using the lasso for Cox's proportional hazards model 利用考克斯比例危险模型的套索，建立预测阿尔茨海默氏症患者自由生存期的多基因危险评分模型。

IF 1.7 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology

Pub Date : 2024-07-09 DOI: 10.1002/gepi.22581

Georg Hahn, Dmitry Prokopenko, Julian Hecker, Sharon M. Lutz, Kristina Mullin, Rudolph E. Tanzi, Stacia DeSantis, Christoph Lange

The prediction of the susceptibility of an individual to a certain disease is an important and timely research area. An established technique is to estimate the risk of an individual with the help of an integrated risk model, that is, a polygenic risk score with added epidemiological covariates. However, integrated risk models do not capture any time dependence, and may provide a point estimate of the relative risk with respect to a reference population. The aim of this work is twofold. First, we explore and advocate the idea of predicting the time-dependent hazard and survival (defined as disease-free time) of an individual for the onset of a disease. This provides a practitioner with a much more differentiated view of absolute survival as a function of time. Second, to compute the time-dependent risk of an individual, we use published methodology to fit a Cox's proportional hazard model to data from a genetic SNP study of time to Alzheimer's disease (AD) onset, using the lasso to incorporate further epidemiological variables such as sex, APOE (apolipoprotein E, a genetic risk factor for AD) status, 10 leading principal components, and selected genomic loci. We apply the lasso for Cox's proportional hazards to a data set of 6792 AD patients (composed of 4102 cases and 2690 controls) and 87 covariates. We demonstrate that fitting a lasso model for Cox's proportional hazards allows one to obtain more accurate survival curves than with state-of-the-art (likelihood-based) methods. Moreover, the methodology allows one to obtain personalized survival curves for a patient, thus giving a much more differentiated view of the expected progression of a disease than the view offered by integrated risk models. The runtime to compute personalized survival curves is under a minute for the entire data set of AD patients, thus enabling it to handle datasets with 60,000–100,000 subjects in less than 1 h.

预测个人对某种疾病的易感性是一个重要而及时的研究领域。一种成熟的技术是借助综合风险模型来估计个体的风险，即多基因风险评分加上流行病学协变量。然而，综合风险模型无法捕捉任何时间依赖性，只能提供相对于参照人群的相对风险点估算值。这项工作有两个目的。首先，我们探索并倡导预测个体发病时与时间相关的危险性和生存期（定义为无病时间）。这为从业者提供了一个更有区别的绝对生存时间函数。其次，为了计算个体的时间相关风险，我们使用已公布的方法，对阿尔茨海默病（AD）发病时间的遗传 SNP 研究数据拟合 Cox 比例危险模型，并使用套索法纳入更多流行病学变量，如性别、APOE（载脂蛋白 E，AD 的遗传风险因素）状态、10 个主要主成分和选定的基因组位点。我们在一个包含 6792 例 AD 患者（由 4102 例病例和 2690 例对照组成）和 87 个协变量的数据集上应用了 lasso 的 Cox 比例危险度模型。我们证明，与最先进的（基于似然法的）方法相比，拟合 Cox 比例危险度的套索模型可以获得更准确的生存曲线。此外，该方法还能获得患者的个性化生存曲线，因此，与综合风险模型相比，该方法能提供更有区别的疾病预期进展情况。对整个 AD 患者数据集而言，计算个性化生存曲线的运行时间不到一分钟，因此可以在 1 小时内处理 60,000 至 100,000 个受试者的数据集。

{"title":"Polygenic hazard score models for the prediction of Alzheimer's free survival using the lasso for Cox's proportional hazards model","authors":"Georg Hahn, Dmitry Prokopenko, Julian Hecker, Sharon M. Lutz, Kristina Mullin, Rudolph E. Tanzi, Stacia DeSantis, Christoph Lange","doi":"10.1002/gepi.22581","DOIUrl":"10.1002/gepi.22581","url":null,"abstract":"<p>The prediction of the susceptibility of an individual to a certain disease is an important and timely research area. An established technique is to estimate the risk of an individual with the help of an integrated risk model, that is, a polygenic risk score with added epidemiological covariates. However, integrated risk models do not capture any time dependence, and may provide a point estimate of the relative risk with respect to a reference population. The aim of this work is twofold. First, we explore and advocate the idea of predicting the time-dependent hazard and survival (defined as disease-free time) of an individual for the onset of a disease. This provides a practitioner with a much more differentiated view of absolute survival as a function of time. Second, to compute the time-dependent risk of an individual, we use published methodology to fit a Cox's proportional hazard model to data from a genetic SNP study of time to Alzheimer's disease (AD) onset, using the lasso to incorporate further epidemiological variables such as sex, APOE (apolipoprotein E, a genetic risk factor for AD) status, 10 leading principal components, and selected genomic loci. We apply the lasso for Cox's proportional hazards to a data set of 6792 AD patients (composed of 4102 cases and 2690 controls) and 87 covariates. We demonstrate that fitting a lasso model for Cox's proportional hazards allows one to obtain more accurate survival curves than with state-of-the-art (likelihood-based) methods. Moreover, the methodology allows one to obtain personalized survival curves for a patient, thus giving a much more differentiated view of the expected progression of a disease than the view offered by integrated risk models. The runtime to compute personalized survival curves is under a minute for the entire data set of AD patients, thus enabling it to handle datasets with 60,000–100,000 subjects in less than 1 h.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141563235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0