Efficient inference of parent-of-origin effect using case-control mother–child genotype data

Pub Date : 2024-05-09 DOI:10.1016/j.jspi.2024.106190
Yuang Tian , Hong Zhang , Alexandre Bureau , Hagit Hochner , Jinbo Chen
{"title":"Efficient inference of parent-of-origin effect using case-control mother–child genotype data","authors":"Yuang Tian ,&nbsp;Hong Zhang ,&nbsp;Alexandre Bureau ,&nbsp;Hagit Hochner ,&nbsp;Jinbo Chen","doi":"10.1016/j.jspi.2024.106190","DOIUrl":null,"url":null,"abstract":"<div><p>Parent-of-origin effect plays an important role in mammal development and disorder. Case-control mother–child pair genotype data can be used to detect parent-of-origin effect and is often convenient to collect in practice. Most existing methods for assessing parent-of-origin effect do not incorporate any covariates, which may be required to control for confounding factors. We propose to model the parent-of-origin effect through a logistic regression model, with predictors including maternal and child genotypes, parental origins, and covariates. The parental origins may not be fully inferred from genotypes of a target genetic marker, so we propose to use genotypes of markers tightly linked to the target marker to increase inference efficiency. A robust statistical inference procedure is developed based on a modified profile log-likelihood in a retrospective way. A computationally feasible expectation–maximization algorithm is devised to estimate all unknown parameters involved in the modified profile log-likelihood. This algorithm differs from the conventional expectation–maximization algorithm in the sense that it is based on a modified instead of the original profile log-likelihood function. The convergence of the algorithm is established under some mild regularity conditions. This expectation–maximization algorithm also allows convenient handling of missing child genotypes. Large sample properties, including weak consistency, asymptotic normality, and asymptotic efficiency, are established for the proposed estimator under some mild regularity conditions. Finite sample properties are evaluated through extensive simulation studies and the application to a real dataset.</p></div>","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378375824000478","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Parent-of-origin effect plays an important role in mammal development and disorder. Case-control mother–child pair genotype data can be used to detect parent-of-origin effect and is often convenient to collect in practice. Most existing methods for assessing parent-of-origin effect do not incorporate any covariates, which may be required to control for confounding factors. We propose to model the parent-of-origin effect through a logistic regression model, with predictors including maternal and child genotypes, parental origins, and covariates. The parental origins may not be fully inferred from genotypes of a target genetic marker, so we propose to use genotypes of markers tightly linked to the target marker to increase inference efficiency. A robust statistical inference procedure is developed based on a modified profile log-likelihood in a retrospective way. A computationally feasible expectation–maximization algorithm is devised to estimate all unknown parameters involved in the modified profile log-likelihood. This algorithm differs from the conventional expectation–maximization algorithm in the sense that it is based on a modified instead of the original profile log-likelihood function. The convergence of the algorithm is established under some mild regularity conditions. This expectation–maximization algorithm also allows convenient handling of missing child genotypes. Large sample properties, including weak consistency, asymptotic normality, and asymptotic efficiency, are established for the proposed estimator under some mild regularity conditions. Finite sample properties are evaluated through extensive simulation studies and the application to a real dataset.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
利用病例对照母子基因型数据有效推断原生父母效应
亲本效应在哺乳动物的发育和疾病中发挥着重要作用。病例对照母子配对基因型数据可用于检测亲本效应,而且在实践中通常很容易收集。大多数现有的评估亲本效应的方法都没有纳入任何协变量,而控制混杂因素可能需要协变量。我们建议通过逻辑回归模型来模拟父母-原籍效应,预测因子包括母子基因型、父母原籍和协变量。根据目标遗传标记的基因型可能无法完全推断出父母的来源,因此我们建议使用与目标标记紧密相连的标记的基因型来提高推断效率。我们开发了一种稳健的统计推断程序,该程序以追溯的方式基于修正的轮廓对数概率。设计了一种计算上可行的期望最大化算法来估计修正的剖面对数似然所涉及的所有未知参数。该算法与传统的期望最大化算法不同,它是基于修正后的轮廓对数似然函数,而不是原始的轮廓对数似然函数。该算法的收敛性是在一些温和的正则条件下确定的。这种期望最大化算法还能方便地处理缺失的子基因型。在一些温和的正则性条件下,为所提出的估计器建立了大样本特性,包括弱一致性、渐近正则性和渐近效率。通过广泛的模拟研究和对真实数据集的应用,对有限样本特性进行了评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1