Theoretical Biology Forum最新文献

英文中文

A third way to the selected effect/causal role distinction in the great encode debate. 第三种方式选择的影响/因果作用的区别在大编码辩论。

IF 1.5 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum

Pub Date : 2023-07-01 DOI: 10.19272/202311402004

Ehud Lamm, Sophie Juliane Veigl

Since the ENCODE project published its final results in a series of articles in 2012, there is no consensus on what its implications are. ENCODE's central and most controversial claim was that there is essentially no junk DNA: most sections of the human genome believed to be «junk» are functional. This claim was met with many reservations. If researchers disagree about whether there is junk DNA, they have first to agree on a concept of function and how function, given a particular definition, can be discovered. The ENCODE debate centered on a notion of function that assumes a strong dichotomy between evolutionary and non-evolutionary function and causes, prevalent in the Modern Evolutionary Synthesis. In contrast to how the debate is typically portrayed, both sides share a commitment to this distinction. This distinction is, however, much debated in alternative approaches to evolutionary theory, such as the EES. We show that because the ENCODE debate is grounded in a particular notion of function, it is unclear how it connects to broader debates about what is the correct evolutionary framework. Furthermore, we show how arguments brought forward in the controversy, particularly arguments from mathematical population genetics, are deeply embedded in their particular disciplinary contexts, and reflect substantive assumptions about the evolution of genomes. With this article, we aim to provide an anatomy of the ENCODE debate that offers a new perspective on the notions of function both sides employed, as well as to situate the ENCODE debate within wider debates regarding the forces operating in evolution.

自从ENCODE项目于2012年在一系列文章中发表了最终结果以来，人们对其含义没有达成共识。ENCODE的核心和最具争议的主张是，本质上没有垃圾DNA:大多数被认为是“垃圾”的人类基因组片段都是功能性的。这种说法遭到许多人的保留。如果研究人员在是否存在垃圾DNA的问题上存在分歧，他们必须首先就功能的概念以及如何在特定的定义下发现功能达成一致。ENCODE的争论集中在一个功能概念上，该概念假定在进化和非进化的功能和原因之间存在强烈的二分法，这在现代进化综合理论中很普遍。与辩论的典型描述相反，双方都致力于这一区别。然而，这种区别在进化理论的替代方法(如EES)中备受争议。我们表明，由于ENCODE争论是基于一个特定的功能概念，因此尚不清楚它如何与更广泛的关于什么是正确的进化框架的争论联系起来。此外，我们还展示了争论中提出的论点，特别是来自数学群体遗传学的论点，如何深深植根于其特定的学科背景中，并反映了关于基因组进化的实质性假设。在这篇文章中，我们的目标是对ENCODE辩论进行剖析，为双方所采用的功能概念提供一个新的视角，并将ENCODE辩论置于关于进化中起作用的力量的更广泛的辩论中。

{"title":"A third way to the selected effect/causal role distinction in the great encode debate.","authors":"Ehud Lamm, Sophie Juliane Veigl","doi":"10.19272/202311402004","DOIUrl":"https://doi.org/10.19272/202311402004","url":null,"abstract":"<p><p>Since the ENCODE project published its final results in a series of articles in 2012, there is no consensus on what its implications are. ENCODE's central and most controversial claim was that there is essentially no junk DNA: most sections of the human genome believed to be «junk» are functional. This claim was met with many reservations. If researchers disagree about whether there is junk DNA, they have first to agree on a concept of function and how function, given a particular definition, can be discovered. The ENCODE debate centered on a notion of function that assumes a strong dichotomy between evolutionary and non-evolutionary function and causes, prevalent in the Modern Evolutionary Synthesis. In contrast to how the debate is typically portrayed, both sides share a commitment to this distinction. This distinction is, however, much debated in alternative approaches to evolutionary theory, such as the EES. We show that because the ENCODE debate is grounded in a particular notion of function, it is unclear how it connects to broader debates about what is the correct evolutionary framework. Furthermore, we show how arguments brought forward in the controversy, particularly arguments from mathematical population genetics, are deeply embedded in their particular disciplinary contexts, and reflect substantive assumptions about the evolution of genomes. With this article, we aim to provide an anatomy of the ENCODE debate that offers a new perspective on the notions of function both sides employed, as well as to situate the ENCODE debate within wider debates regarding the forces operating in evolution.</p>","PeriodicalId":55980,"journal":{"name":"Theoretical Biology Forum","volume":"116 1-2","pages":"53-74"},"PeriodicalIF":1.5,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10165575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Trust and Credit in Organizations and Institutions 组织和机构中的信任和信用

IF 1.5 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum

Pub Date : 2022-01-01 DOI: 10.1007/978-981-19-4979-1

M. Nakamaru

引用次数: 0

A Primer on Population Dynamics Modeling 人口动力学建模入门

IF 1.5 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum

Pub Date : 2022-01-01 DOI: 10.1007/978-981-19-6016-1

H. Seno

引用次数: 3

Getting Started in Mathematical Life Sciences 数学生命科学入门

IF 1.5 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum

Pub Date : 2022-01-01 DOI: 10.1007/978-981-19-8257-6

Makoto Sato

引用次数: 0

Nonparametric clustering for longitudinal functional data with the application to H-NMR spectra of kidney transplant patients. Longitudinal functional data clustering. 纵向功能数据的非参数聚类及其在肾移植患者H-NMR谱中的应用。纵向功能数据聚类。

IF 1.5 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum

Pub Date : 2021-01-01 DOI: 10.19272/202111401003

Minzhen Xie, Haiyan Liu, Jeanine Houwing-Duistermaat

Longitudinal functional data are increasingly common in the health domain. The motivated dataset for this paper comprises H-NMR spectra of kidney transplant patients [8]. Our aim is to cluster patients into different clinical outcome subgoups to reveal the success of the transplantation. The NMR spectra of each patient at each time point are functional data and the data are longitudinally collected at up to nine different time points. Existing methods are available for functional data collected at one time point, but not for longitudinal functional data collected at a grid of time points subject to missingness. We therefore first apply a method to extract the same number of functional feactures for each subject. Next we propose a novel nonparametric clustering method for mulitivariate functional data. We applied our proposed clustering method to the kidney transplant dataset both to a subset of the raw data with only two time points and the extacted functional features. It appeared that the proposed method achieves better clustering performance on the extracted functional features than on the subset of raw data. A data simulation study was performed to further evaluate the method. The design mimiced the kidney transplant dataset but with a larger sample size. Scenarios which have different levels of noise were considered. The simulation study showed the accuarcy of our proposed method.

纵向功能数据在健康领域越来越常见。本文的动机数据集包括肾移植患者的H-NMR光谱[8]。我们的目的是将患者分为不同的临床结果亚组，以揭示移植的成功。每个患者在每个时间点的NMR光谱都是功能数据，并且在多达九个不同的时间点纵向收集数据。现有方法可用于在一个时间点收集的功能数据，但不适用于在可能丢失的时间点网格收集的纵向功能数据。因此，我们首先应用一种方法为每个受试者提取相同数量的功能特征。接下来，我们提出了一种新的多变量函数数据的非参数聚类方法。我们将我们提出的聚类方法应用于肾移植数据集，既应用于只有两个时间点的原始数据子集，也应用于提取的功能特征。与原始数据子集相比，所提出的方法在提取的功能特征上实现了更好的聚类性能。进行了数据模拟研究以进一步评估该方法。该设计模仿了肾移植数据集，但样本量更大。考虑了具有不同噪声水平的场景。仿真研究表明了该方法的准确性。

{"title":"Nonparametric clustering for longitudinal functional data with the application to H-NMR spectra of kidney transplant patients. Longitudinal functional data clustering.","authors":"Minzhen Xie, Haiyan Liu, Jeanine Houwing-Duistermaat","doi":"10.19272/202111401003","DOIUrl":"https://doi.org/10.19272/202111401003","url":null,"abstract":"Longitudinal functional data are increasingly common in the health domain. The motivated dataset for this paper comprises H-NMR spectra of kidney transplant patients [8]. Our aim is to cluster patients into different clinical outcome subgoups to reveal the success of the transplantation. The NMR spectra of each patient at each time point are functional data and the data are longitudinally collected at up to nine different time points. Existing methods are available for functional data collected at one time point, but not for longitudinal functional data collected at a grid of time points subject to missingness. We therefore first apply a method to extract the same number of functional feactures for each subject. Next we propose a novel nonparametric clustering method for mulitivariate functional data. We applied our proposed clustering method to the kidney transplant dataset both to a subset of the raw data with only two time points and the extacted functional features. It appeared that the proposed method achieves better clustering performance on the extracted functional features than on the subset of raw data. A data simulation study was performed to further evaluate the method. The design mimiced the kidney transplant dataset but with a larger sample size. Scenarios which have different levels of noise were considered. The simulation study showed the accuarcy of our proposed method.","PeriodicalId":55980,"journal":{"name":"Theoretical Biology Forum","volume":"114 1-2 1","pages":"15-28"},"PeriodicalIF":1.5,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49105459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Use of shared gamma frailty model in analysis of survival data in twins. 共享伽玛脆弱性模型在双胞胎生存数据分析中的应用。

IF 1.5 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum

Pub Date : 2021-01-01 DOI: 10.19272/202111402005

Annah Mwikali Muli, A. Gusnanto, Jeanine Houwing-Duistermaat

In survival analysis, the effect of a covariate on the outcome is reported in a hazard rate. However, hazards rates are hard to interpret. Here we consider differences in survival probabilities instead. Using data on twins is interesting due to the fact that many observed and unobserved factors are controlled or matched. To model the correlation between twins, some authors have proposed survival models with frailties or random effects. However, there is a potential danger of bias in the estimation if the frailty distribution is misspecified. Frailties are often assumed to follow a gamma distribution. To safeguard us from the impact of the misspecification of this distribution, we consider a flexible non-parametric baseline hazard in addition to a parametric one. We will apply this methodology to the TwinsUK cohort to predict the probability of experiencing a fracture in the next five or ten years, given their bone mineral densities (BMD) and their frailty index. The models with parametric and non-parametric baseline hazards yield very close results in estimating survival probabilities and thus a choice of parametric baseline hazard is generally preferred. We find that bone mineral density is a significant predictor in the model whereas frailty index is not. Low BMD leads to a larger probability of fracture; e.g, in 10 years, the probability of fracture is 21% for low BMD group, 16% for medium BMD group and 8% for high BMD group.

在生存分析中，协变量对结果的影响以危险率报告。然而，危险率很难解释。这里我们考虑的是生存概率的差异。使用双胞胎的数据是有趣的，因为许多观察到的和未观察到的因素是控制或匹配的。为了模拟双胞胎之间的相关性，一些作者提出了带有脆弱性或随机效应的生存模型。然而，如果脆弱分布被错误地指定，则在估计中存在潜在的偏倚危险。弱点通常被认为遵循伽玛分布。为了保护我们免受该分布的错误规范的影响，除了参数基线之外，我们还考虑了一个灵活的非参数基线危害。我们将把这种方法应用到TwinsUK队列中，根据他们的骨密度(BMD)和脆弱指数来预测未来5年或10年发生骨折的概率。具有参数和非参数基线风险的模型在估计生存概率方面产生非常接近的结果，因此通常首选参数基线风险的选择。我们发现骨密度在模型中是一个重要的预测因子，而虚弱指数不是。低骨密度导致骨折的可能性较大;例如，在10年内，低骨密度组发生骨折的概率为21%，中等骨密度组为16%，高骨密度组为8%。

{"title":"Use of shared gamma frailty model in analysis of survival data in twins.","authors":"Annah Mwikali Muli, A. Gusnanto, Jeanine Houwing-Duistermaat","doi":"10.19272/202111402005","DOIUrl":"https://doi.org/10.19272/202111402005","url":null,"abstract":"In survival analysis, the effect of a covariate on the outcome is reported in a hazard rate. However, hazards rates are hard to interpret. Here we consider differences in survival probabilities instead. Using data on twins is interesting due to the fact that many observed and unobserved factors are controlled or matched. To model the correlation between twins, some authors have proposed survival models with frailties or random effects. However, there is a potential danger of bias in the estimation if the frailty distribution is misspecified. Frailties are often assumed to follow a gamma distribution. To safeguard us from the impact of the misspecification of this distribution, we consider a flexible non-parametric baseline hazard in addition to a parametric one. We will apply this methodology to the TwinsUK cohort to predict the probability of experiencing a fracture in the next five or ten years, given their bone mineral densities (BMD) and their frailty index. The models with parametric and non-parametric baseline hazards yield very close results in estimating survival probabilities and thus a choice of parametric baseline hazard is generally preferred. We find that bone mineral density is a significant predictor in the model whereas frailty index is not. Low BMD leads to a larger probability of fracture; e.g, in 10 years, the probability of fracture is 21% for low BMD group, 16% for medium BMD group and 8% for high BMD group.","PeriodicalId":55980,"journal":{"name":"Theoretical Biology Forum","volume":"114 1-2 1","pages":"45-58"},"PeriodicalIF":1.5,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43590946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Editorial. 社论。

IF 1.5 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum

Pub Date : 2021-01-01 DOI: 10.5771/0490-1606-2013-9-10-354

D. Lambert

引用次数: 0

Estimation of the effect of surrogate multi-omic biomarkers. 替代多组学生物标志物作用的评估。

IF 1.5 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum

Pub Date : 2021-01-01 DOI: 10.19272/202111402006

A. Fuady, S. el Bouhaddani, H. Uh, Jeanine Houwing-Duistermaat

Multiple technologies which measure the same omics data set but are based on different aspects of the molecules exist. In practice, studies use different technologies and have therefore different biomarkers. An example is the glycan age index, which is constructed by three different ultra-performance liquid chromatography (UPLC) IgG glycans, and is a biomarker for biological age. A second technology is liquid chromatography- mass spectrometry (LCMS). To estimate the effect of a biomarker on an outcome variable, two issues need to be addressed. Firstly, a measurement error is needed to map one technology to the other one using a calibration study. Here, we consider two approaches, namely one based on the chemical properties of the two technologies and one based on the estimation of this relationship using O2PLS. Secondly, the use of an approximation of the biomarker in the main study needs to be taken into account by use of a regression calibration method. The performance of the two approaches is studied via simulations. The methods are used to estimate the relationship between glycan age and menopause. We have data from two cohorts, namely Korcula and Vis. In conclusion, (1) both measurement error models give similar results and suggest that there is an association between the glycan age index and the menopause status, (2) the chemical mapping approach outperforms O2PLS in the low measurement error variance, while on the larger measurement error variance, O2PLS works better, (3) statistical efficiency is lost due to increased noise level by adding irrelevant information.

存在多种测量相同组学数据集但基于分子不同方面的技术。在实践中，研究使用不同的技术，因此具有不同的生物标志物。一个例子是聚糖年龄指数，它由三种不同的超高效液相色谱（UPLC）IgG聚糖构建，是生物年龄的生物标志物。第二种技术是液相色谱-质谱法（LCMS）。为了估计生物标志物对结果变量的影响，需要解决两个问题。首先，使用校准研究将一种技术映射到另一种技术需要测量误差。在这里，我们考虑两种方法，即一种基于这两种技术的化学性质，另一种基于使用O2PLS对这种关系的估计。其次，需要通过使用回归校准方法来考虑在主要研究中使用生物标志物的近似值。通过仿真研究了这两种方法的性能。这些方法用于估计聚糖年龄与更年期之间的关系。我们有来自两个队列的数据，即Korcula和Vis。总之，（1）两个测量误差模型给出了相似的结果，并表明聚糖年龄指数与更年期状态之间存在关联。（2）化学作图方法在低测量误差方差方面优于O2PLS，而在较大的测量误差方差上，O2PLS效果更好，（3）由于添加了不相关的信息而增加了噪声水平，从而损失了统计效率。

{"title":"Estimation of the effect of surrogate multi-omic biomarkers.","authors":"A. Fuady, S. el Bouhaddani, H. Uh, Jeanine Houwing-Duistermaat","doi":"10.19272/202111402006","DOIUrl":"https://doi.org/10.19272/202111402006","url":null,"abstract":"Multiple technologies which measure the same omics data set but are based on different aspects of the molecules exist. In practice, studies use different technologies and have therefore different biomarkers. An example is the glycan age index, which is constructed by three different ultra-performance liquid chromatography (UPLC) IgG glycans, and is a biomarker for biological age. A second technology is liquid chromatography- mass spectrometry (LCMS). To estimate the effect of a biomarker on an outcome variable, two issues need to be addressed. Firstly, a measurement error is needed to map one technology to the other one using a calibration study. Here, we consider two approaches, namely one based on the chemical properties of the two technologies and one based on the estimation of this relationship using O2PLS. Secondly, the use of an approximation of the biomarker in the main study needs to be taken into account by use of a regression calibration method. The performance of the two approaches is studied via simulations. The methods are used to estimate the relationship between glycan age and menopause. We have data from two cohorts, namely Korcula and Vis. In conclusion, (1) both measurement error models give similar results and suggest that there is an association between the glycan age index and the menopause status, (2) the chemical mapping approach outperforms O2PLS in the low measurement error variance, while on the larger measurement error variance, O2PLS works better, (3) statistical efficiency is lost due to increased noise level by adding irrelevant information.","PeriodicalId":55980,"journal":{"name":"Theoretical Biology Forum","volume":"114 1-2 1","pages":"59-73"},"PeriodicalIF":1.5,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42019619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Multivariate functional partial least squares for classification using longitudinal data. 多元泛函偏最小二乘分类使用纵向数据。

IF 1.5 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum

Pub Date : 2021-01-01 DOI: 10.19272/202111402007

Sonia Dembowska, Alejandro F Frangi, Jeanine Houwing-Duistermaat, Haiyan Liu

The use of statistical methods to predict outcomes using high dimensional datasets in medicine is becoming increasingly popular for forecasting and monitoring patient health. Our work is motivated by a longitudinal dataset containing 1H NMR spectra of metabolites of 18 patients undergoing a kidney transplant alongside their graft outcomes that fall into one of three categories: acute rejection, delayed graft function and primary function. We proposed a functional partial least squares (FPLS) model that extends existing PLS methods for the analysis of longitudinally measured scalar omics datasets to the case of longitudinally measured functional datasets. We designed an iterative algorithm to link multiple time points, and then applied our proposed method to analyse the data from kidney transplant patients. Finally, we compared the AUC of our method to the AUC of the univariate methods which only use the information of one time-point information. It appeared that our method outperforms the existing methods. A simulation study was performed to mimic the kidney transplant dataset but with a larger sample size and different scenarios performed to evaluate the performance of the new method in larger datasets. We consider scenarios which vary in the difficulty to distinguish the two groups. It appeared that the three time-points model performs better than any of the individual models with average AUCs of 0.909 and 0.811 respectively.

在医学中，使用统计方法预测使用高维数据集的结果在预测和监测患者健康方面越来越受欢迎。我们的工作是由一个纵向数据集激发的，该数据集包含18名接受肾移植的患者的代谢物的1H NMR光谱，以及他们的移植结果，这些移植结果属于三类之一:急性排斥反应，移植功能延迟和主要功能。我们提出了一个功能偏最小二乘(FPLS)模型，将现有的用于纵向测量标量组学数据集分析的PLS方法扩展到纵向测量功能数据集的情况。我们设计了一个迭代算法来链接多个时间点，然后将我们提出的方法应用于肾移植患者的数据分析。最后，将该方法的AUC与仅使用一个时间点信息的单变量方法的AUC进行了比较。看来我们的方法优于现有的方法。为了模拟肾脏移植数据集，进行了一项模拟研究，但样本量更大，并进行了不同的场景，以评估新方法在更大数据集中的性能。我们考虑不同难度的场景来区分这两组。结果表明，三个时间点模型的平均auc分别为0.909和0.811，优于任何单个模型。

{"title":"Multivariate functional partial least squares for classification using longitudinal data.","authors":"Sonia Dembowska, Alejandro F Frangi, Jeanine Houwing-Duistermaat, Haiyan Liu","doi":"10.19272/202111402007","DOIUrl":"https://doi.org/10.19272/202111402007","url":null,"abstract":"The use of statistical methods to predict outcomes using high dimensional datasets in medicine is becoming increasingly popular for forecasting and monitoring patient health. Our work is motivated by a longitudinal dataset containing 1H NMR spectra of metabolites of 18 patients undergoing a kidney transplant alongside their graft outcomes that fall into one of three categories: acute rejection, delayed graft function and primary function. We proposed a functional partial least squares (FPLS) model that extends existing PLS methods for the analysis of longitudinally measured scalar omics datasets to the case of longitudinally measured functional datasets. We designed an iterative algorithm to link multiple time points, and then applied our proposed method to analyse the data from kidney transplant patients. Finally, we compared the AUC of our method to the AUC of the univariate methods which only use the information of one time-point information. It appeared that our method outperforms the existing methods. A simulation study was performed to mimic the kidney transplant dataset but with a larger sample size and different scenarios performed to evaluate the performance of the new method in larger datasets. We consider scenarios which vary in the difficulty to distinguish the two groups. It appeared that the three time-points model performs better than any of the individual models with average AUCs of 0.909 and 0.811 respectively.","PeriodicalId":55980,"journal":{"name":"Theoretical Biology Forum","volume":"114 1-2 1","pages":"75-88"},"PeriodicalIF":1.5,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45744157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

DNA methylation correlation structure of chromosome 21 in Down syndrome. 唐氏综合征21号染色体DNA甲基化相关结构。

IF 1.5 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum

Pub Date : 2021-01-01 DOI: 10.19272/202111402008

I. Budimir, C. Sala, M. G. Bacalini, P. Garagnani, G. Castellani

DNA methylation studies usually focus on the groups of CpG sites. Neighbouring CpG sites are analyzed together due to their group behaviour. However, this approach ignores the possible interaction between more distant CpG sites. In this work, we investigate the complete methylation correlation structure of chromosome 21. Two data sets were used for the correlation analysis, smaller data set with methylation measurements from Down syndrome patients and their family members and larger data set with healthy subjects. This allowed us to examine the general properties of the methylation correlation structure as well as its modifications in presence of an extra copy of the chromosome. We observed that the CpG sites work in small highly correlated groups. While some groups coincided with CpG islands, other groups contained CpG sites scattered across the whole chromosome. Groups of highly correlated CpG sites remained preserved in the case of Down syndrome. Moreover, the methylome of a Down syndrome patient had newly formed correlations between CpG sites suggesting that the methylation correlation structure in Down syndrome is stronger than in case of an unaffected individual.

DNA甲基化研究通常集中在CpG位点组上。由于它们的群体行为，相邻的CpG位点被一起分析。然而，这种方法忽略了更远的CpG位点之间可能的相互作用。在这项工作中，我们研究了21号染色体的完全甲基化相关结构。相关性分析使用了两个数据集，一个是唐氏综合症患者及其家庭成员甲基化测量的较小数据集，另一个是健康受试者的较大数据集。这使我们能够检查甲基化相关结构的一般性质，以及在存在额外染色体拷贝的情况下对其进行的修饰。我们观察到CpG位点在高度相关的小群体中起作用。虽然一些组与CpG岛重合，但其他组包含分散在整个染色体上的CpG位点。在唐氏综合征的病例中，高度相关的CpG位点组仍然保留。此外，唐氏综合症患者的甲基化组在CpG位点之间有新形成的相关性，这表明唐氏综合症的甲基化相关性结构比未受影响的个体更强。

{"title":"DNA methylation correlation structure of chromosome 21 in Down syndrome.","authors":"I. Budimir, C. Sala, M. G. Bacalini, P. Garagnani, G. Castellani","doi":"10.19272/202111402008","DOIUrl":"https://doi.org/10.19272/202111402008","url":null,"abstract":"DNA methylation studies usually focus on the groups of CpG sites. Neighbouring CpG sites are analyzed together due to their group behaviour. However, this approach ignores the possible interaction between more distant CpG sites. In this work, we investigate the complete methylation correlation structure of chromosome 21. Two data sets were used for the correlation analysis, smaller data set with methylation measurements from Down syndrome patients and their family members and larger data set with healthy subjects. This allowed us to examine the general properties of the methylation correlation structure as well as its modifications in presence of an extra copy of the chromosome. We observed that the CpG sites work in small highly correlated groups. While some groups coincided with CpG islands, other groups contained CpG sites scattered across the whole chromosome. Groups of highly correlated CpG sites remained preserved in the case of Down syndrome. Moreover, the methylome of a Down syndrome patient had newly formed correlations between CpG sites suggesting that the methylation correlation structure in Down syndrome is stronger than in case of an unaffected individual.","PeriodicalId":55980,"journal":{"name":"Theoretical Biology Forum","volume":"114 1-2 1","pages":"89-113"},"PeriodicalIF":1.5,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44461506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Theoretical Biology Forum

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀