Efficient inference of parent-of-origin effect using case-control mother–child genotype data

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY Journal of Statistical Planning and Inference Pub Date : 2024-05-09 DOI:10.1016/j.jspi.2024.106190

Yuang Tian , Hong Zhang , Alexandre Bureau , Hagit Hochner , Jinbo Chen

{"title":"Efficient inference of parent-of-origin effect using case-control mother–child genotype data","authors":"Yuang Tian , Hong Zhang , Alexandre Bureau , Hagit Hochner , Jinbo Chen","doi":"10.1016/j.jspi.2024.106190","DOIUrl":null,"url":null,"abstract":"<div><p>Parent-of-origin effect plays an important role in mammal development and disorder. Case-control mother–child pair genotype data can be used to detect parent-of-origin effect and is often convenient to collect in practice. Most existing methods for assessing parent-of-origin effect do not incorporate any covariates, which may be required to control for confounding factors. We propose to model the parent-of-origin effect through a logistic regression model, with predictors including maternal and child genotypes, parental origins, and covariates. The parental origins may not be fully inferred from genotypes of a target genetic marker, so we propose to use genotypes of markers tightly linked to the target marker to increase inference efficiency. A robust statistical inference procedure is developed based on a modified profile log-likelihood in a retrospective way. A computationally feasible expectation–maximization algorithm is devised to estimate all unknown parameters involved in the modified profile log-likelihood. This algorithm differs from the conventional expectation–maximization algorithm in the sense that it is based on a modified instead of the original profile log-likelihood function. The convergence of the algorithm is established under some mild regularity conditions. This expectation–maximization algorithm also allows convenient handling of missing child genotypes. Large sample properties, including weak consistency, asymptotic normality, and asymptotic efficiency, are established for the proposed estimator under some mild regularity conditions. Finite sample properties are evaluated through extensive simulation studies and the application to a real dataset.</p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"233 ","pages":"Article 106190"},"PeriodicalIF":0.8000,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Statistical Planning and Inference","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378375824000478","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 0

Abstract

Parent-of-origin effect plays an important role in mammal development and disorder. Case-control mother–child pair genotype data can be used to detect parent-of-origin effect and is often convenient to collect in practice. Most existing methods for assessing parent-of-origin effect do not incorporate any covariates, which may be required to control for confounding factors. We propose to model the parent-of-origin effect through a logistic regression model, with predictors including maternal and child genotypes, parental origins, and covariates. The parental origins may not be fully inferred from genotypes of a target genetic marker, so we propose to use genotypes of markers tightly linked to the target marker to increase inference efficiency. A robust statistical inference procedure is developed based on a modified profile log-likelihood in a retrospective way. A computationally feasible expectation–maximization algorithm is devised to estimate all unknown parameters involved in the modified profile log-likelihood. This algorithm differs from the conventional expectation–maximization algorithm in the sense that it is based on a modified instead of the original profile log-likelihood function. The convergence of the algorithm is established under some mild regularity conditions. This expectation–maximization algorithm also allows convenient handling of missing child genotypes. Large sample properties, including weak consistency, asymptotic normality, and asymptotic efficiency, are established for the proposed estimator under some mild regularity conditions. Finite sample properties are evaluated through extensive simulation studies and the application to a real dataset.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用病例对照母子基因型数据有效推断原生父母效应

亲本效应在哺乳动物的发育和疾病中发挥着重要作用。病例对照母子配对基因型数据可用于检测亲本效应，而且在实践中通常很容易收集。大多数现有的评估亲本效应的方法都没有纳入任何协变量，而控制混杂因素可能需要协变量。我们建议通过逻辑回归模型来模拟父母-原籍效应，预测因子包括母子基因型、父母原籍和协变量。根据目标遗传标记的基因型可能无法完全推断出父母的来源，因此我们建议使用与目标标记紧密相连的标记的基因型来提高推断效率。我们开发了一种稳健的统计推断程序，该程序以追溯的方式基于修正的轮廓对数概率。设计了一种计算上可行的期望最大化算法来估计修正的剖面对数似然所涉及的所有未知参数。该算法与传统的期望最大化算法不同，它是基于修正后的轮廓对数似然函数，而不是原始的轮廓对数似然函数。该算法的收敛性是在一些温和的正则条件下确定的。这种期望最大化算法还能方便地处理缺失的子基因型。在一些温和的正则性条件下，为所提出的估计器建立了大样本特性，包括弱一致性、渐近正则性和渐近效率。通过广泛的模拟研究和对真实数据集的应用，对有限样本特性进行了评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Statistical Planning and Inference 数学-统计学与概率论

CiteScore

2.10

自引率

11.10%

发文量

审稿时长

3-6 weeks

期刊介绍： The Journal of Statistical Planning and Inference offers itself as a multifaceted and all-inclusive bridge between classical aspects of statistics and probability, and the emerging interdisciplinary aspects that have a potential of revolutionizing the subject. While we maintain our traditional strength in statistical inference, design, classical probability, and large sample methods, we also have a far more inclusive and broadened scope to keep up with the new problems that confront us as statisticians, mathematicians, and scientists. We publish high quality articles in all branches of statistics, probability, discrete mathematics, machine learning, and bioinformatics. We also especially welcome well written and up to date review articles on fundamental themes of statistics, probability, machine learning, and general biostatistics. Thoughtful letters to the editors, interesting problems in need of a solution, and short notes carrying an element of elegance or beauty are equally welcome.