Pub Date : 2007-12-01DOI: 10.1017/S0016672308009580
B. Charlesworth
{"title":"A hitch-hiking guide to the genome: a commentary on 'The hitch-hiking effect of a favourable gene' by John Maynard Smith and John Haigh.","authors":"B. Charlesworth","doi":"10.1017/S0016672308009580","DOIUrl":"https://doi.org/10.1017/S0016672308009580","url":null,"abstract":"","PeriodicalId":12777,"journal":{"name":"Genetical research","volume":"75 1","pages":"389-90"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86090561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-12-01DOI: 10.1017/S001667230800966X
S. Otto
{"title":"Unravelling the evolutionary advantage of sex: a commentary on 'Mutation-selection balance and the evolutionary advantage of sex and recombination' by Brian Charlesworth.","authors":"S. Otto","doi":"10.1017/S001667230800966X","DOIUrl":"https://doi.org/10.1017/S001667230800966X","url":null,"abstract":"","PeriodicalId":12777,"journal":{"name":"Genetical research","volume":"24 1","pages":"447-9"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89235708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-12-01DOI: 10.1017/S0016672308009725
R. Doerge
{"title":"Proper experimental design and sound statistical inference win every time: a commentary on 'Statistical design and the analysis of gene expression microarray data' by M. Kathleen Kerr and Gary A. Churchill.","authors":"R. Doerge","doi":"10.1017/S0016672308009725","DOIUrl":"https://doi.org/10.1017/S0016672308009725","url":null,"abstract":"","PeriodicalId":12777,"journal":{"name":"Genetical research","volume":"16 1","pages":"505-7"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90471256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-12-01DOI: 10.1017/S0016672308009701
T. Mackay
where H0 is the heterozygosity of the population before the bottleneck, Ht is the heterozygosity after t generations of maintenance with 2N individuals and F is the inbreeding coefficient (Falconer & Mackay, 1996). Clearly heterozygosity decreases, and inbreeding increases, as N decreases ; and these effects accumulate over time. However, no real population fits the ideal model on which this theory is based, which includes self-fertilization in random amounts. The concept of effective population size enables us to utilize this expression by replacing the N in the equation with Ne, where Ne, the effective population size, is the number of individuals that would give rise to the same variance in gene frequency or rate of inbreeding as an ideal population of that size (Falconer & Mackay, 1996). Major departures from the ideal population model that affect Ne are unequal numbers of males and females, unequal numbers of individuals in different generations, non-random distribution of family size, and overlapping generations. Analytical expressions relating the census size of the population (N) to the effective population size have been derived for each of these cases (Frankham, 1995; Falconer & Mackay, 1996) ; under most scenarios Ne
{"title":"Wild populations are smaller than we think: a commentary on 'Effective population size/adult population size ratios in wildlife: a review' by Richard Frankham.","authors":"T. Mackay","doi":"10.1017/S0016672308009701","DOIUrl":"https://doi.org/10.1017/S0016672308009701","url":null,"abstract":"where H0 is the heterozygosity of the population before the bottleneck, Ht is the heterozygosity after t generations of maintenance with 2N individuals and F is the inbreeding coefficient (Falconer & Mackay, 1996). Clearly heterozygosity decreases, and inbreeding increases, as N decreases ; and these effects accumulate over time. However, no real population fits the ideal model on which this theory is based, which includes self-fertilization in random amounts. The concept of effective population size enables us to utilize this expression by replacing the N in the equation with Ne, where Ne, the effective population size, is the number of individuals that would give rise to the same variance in gene frequency or rate of inbreeding as an ideal population of that size (Falconer & Mackay, 1996). Major departures from the ideal population model that affect Ne are unequal numbers of males and females, unequal numbers of individuals in different generations, non-random distribution of family size, and overlapping generations. Analytical expressions relating the census size of the population (N) to the effective population size have been derived for each of these cases (Frankham, 1995; Falconer & Mackay, 1996) ; under most scenarios Ne<N. Knowledge of the ratio ofNe toN is critical in wildlife populations and particularly endangered species, if we are to predict the rate of inbreeding and loss of heterozygosity. In this meta-analysis, Frankham (1995) synthesizes data from 192 estimates of Ne/N from 102 species. The estimates of Ne/N from insects, molluscs, amphibians, reptiles, birds, mammals and plants ranged from 10x6 in Pacific oysters to 0.99 in humans, and averaged 0.34 overall. However, these studies differed in whether they included fluctuating population size, variable family size and/or different numbers of males and females – less than one-third of the studies included all three of these factors. In addition, different measures of census size were used as the denominator. Some studies utilized the total census size (NT, the total number of adults and juveniles), some the number of adults (NA, the number of breeding plus senescent adults), while others counted only the number of breeding individuals (NB). Finally, both genetic and demographic methods were used. Frankham (1995) capitalized on this variability to perform stepwise regression analyses in order to determine the major variables affecting Ne/N. The significant variables, in decreasing order of importance, were fluctuating population size, variable family size, method of determining census number, taxonomic group, and the sex ratio. The most striking conclusion was that the comprehensive estimates of Ne/N in wild species, including all variables, were of the order of 10%. This is much smaller than had been thought previously and a cause for concern in terms of long-term population viability. This influential review stimulated many studies estimating Ne/N in a wide variety of wild specie","PeriodicalId":12777,"journal":{"name":"Genetical research","volume":"38 1","pages":"489"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75552551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-12-01DOI: 10.1017/S0016672308009506
Z. Zeng
{"title":"The Hill-Robertson effect is a consequence of interplay between linkage, selection and drift: a commentary on 'The effect of linkage on limits to artificial selection' by W. G. Hill and A. Robertson.","authors":"Z. Zeng","doi":"10.1017/S0016672308009506","DOIUrl":"https://doi.org/10.1017/S0016672308009506","url":null,"abstract":"","PeriodicalId":12777,"journal":{"name":"Genetical research","volume":"69 1","pages":"309-10"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89974311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-12-01DOI: 10.1017/S0016672308009609
D. Finnegan
One of the many striking insights to have come from sequencing complete genomes is the realization that transposable elements make up a large proportion of the DNA of most eukaryotes. These elements are far from minor players on the genomic stage. They have had a major role in genome evolution and are a significant source of genome instability. Transposable elements are driven to high copy number by transposition, but their activity is difficult to study experimentally as most copies are defective, and the few that are transpositionally competent move infrequently. This is good for the host organism, but not for anyone wanting to study them. A small number of transposable elements in Drosophila transpose at high levels in the progeny of particular crosses. This results in genetic instability manifest by reduced fertility of one or both sexes and a high frequency of germ line mutations, a phenomenon known as hybrid dysgenesis (Kidwell, 1977). P–M hybrid dysgenesis, for example, is seen when males of a P-type strain are crossed with females of an M-type strain, but not in the progeny of the reciprocal cross. At the heart of hybrid dysgenesis is a breakdown in the control of transposition. This was not immediately apparent and for several years hybrid dysgenesis was regarded as a strange phenomenon that appeared to run counter to accepted ‘ rules ’ of genetics. Bill Engels was one of the first to address this, and his analysis of the genetic basis for the difference between P and M strains established a paradigm for subsequent research (Engels, 1979). In an elegant series of experiments, Engels was able to show that hybrid dysgenesis could be explained by the interaction of polygenic chromosomal factors that appeared to be inherited in a Mendelian fashion, and a maternally inherited cytoplasmic state that he called ‘cytotype’. He proposed that the cytotype of M strain females is permissive for the activity of the chromosomal factors, P factors, which, in dysgenic flies, would be inherited from P strain males. The cytotype of a P strain would be non-permissive, thus accounting for the non-reciprocal nature of hybrid dysgenesis. He further suggested that P cytotype is determined by P factors themselves, explaining why P strains are genetically stable. Finally, he proposed that P factors might be transposable elements, as was thought to be the case for chromosomal factors responsible for a second form of hybrid dysgenesis, I–R hybrid dysgenesis (Picard, 1976). The suggestion that the chromosomal determinants responsible for P–M hybrid dysgenesis are transposable elements was confirmed by the identification of insertions of repeated sequences in the white gene in white-eye mutations isolated from the progeny of P–M dysgenic flies (Rubin et al., 1982). These insertions appeared to be deletion derivatives of a longer element, probably the P factor itself. This was cloned and shown to have P factor activity by injection into M strain embryos, an experiment
从全基因组测序中获得的许多惊人见解之一是认识到转座因子构成了大多数真核生物DNA的很大一部分。在基因组的舞台上,这些元素远不是次要的角色。它们在基因组进化中起着重要作用,也是基因组不稳定的重要来源。转座因子是由转座驱动的高拷贝数,但由于大多数拷贝是有缺陷的,而少数具有转座能力的拷贝很少移动,因此它们的活性很难通过实验研究。这对宿主生物是有好处的,但对任何想要研究它们的人来说都不是。在果蝇中,少数转座因子在特定杂交的后代中有高水平的转座。这导致遗传不稳定,表现为一种或两性的生育能力降低,种系突变频率高,这种现象被称为杂交发育不良(Kidwell, 1977)。例如,当p型株的雄性与m型株的雌性杂交时,可以看到P-M杂交发育不良,但在反向杂交的后代中则不会出现。杂交发育不良的核心是对转位控制的崩溃。这并没有立即显现出来,几年来,杂交发育不良被认为是一种奇怪的现象,似乎与公认的遗传学“规则”背道而驰。比尔·恩格斯是最早解决这个问题的人之一,他对P和M菌株之间差异的遗传基础的分析为随后的研究建立了一个范例(恩格斯,1979)。在一系列优雅的实验中,恩格斯证明了杂交发育不良可以用多基因染色体因素的相互作用来解释,多基因染色体因素似乎是以孟德尔的方式遗传的,而母系遗传的细胞质状态被他称为“细胞型”。他提出,M株雌性的细胞类型允许染色体因子P因子的活性,在基因异常果蝇中,这些因子将遗传自P株雄性。P菌株的细胞型是非允许的,因此说明了杂交发育不良的非互易性质。他进一步提出P细胞型是由P因子本身决定的,这解释了为什么P菌株在遗传上是稳定的。最后,他提出P因子可能是转座因子,就像人们认为的那样,染色体因子负责第二种形式的杂交发育不良,即I-R杂交发育不良(Picard, 1976)。从P-M基因异常果蝇的后代中分离出的白眼突变中,在白色基因中插入了重复序列,证实了导致P-M杂交基因异常的染色体决定因素是转座因子(Rubin et al., 1982)。这些插入似乎是一个更长的元素的缺失衍生物,可能是P因子本身。这是克隆的,并通过注射到M株胚胎中显示具有P因子活性,这是一个基于Engel对细胞类型在控制P因子活性中的作用的见解的实验。由这些胚胎产生的果蝇表现出杂交后代的一些特征
{"title":"Hybrid dysgenesis: from darkness into light: a commentary on 'Hybrid dysgenesis in Drosophila melanogaster: rules of inheritance of female sterility' by William R. Engels.","authors":"D. Finnegan","doi":"10.1017/S0016672308009609","DOIUrl":"https://doi.org/10.1017/S0016672308009609","url":null,"abstract":"One of the many striking insights to have come from sequencing complete genomes is the realization that transposable elements make up a large proportion of the DNA of most eukaryotes. These elements are far from minor players on the genomic stage. They have had a major role in genome evolution and are a significant source of genome instability. Transposable elements are driven to high copy number by transposition, but their activity is difficult to study experimentally as most copies are defective, and the few that are transpositionally competent move infrequently. This is good for the host organism, but not for anyone wanting to study them. A small number of transposable elements in Drosophila transpose at high levels in the progeny of particular crosses. This results in genetic instability manifest by reduced fertility of one or both sexes and a high frequency of germ line mutations, a phenomenon known as hybrid dysgenesis (Kidwell, 1977). P–M hybrid dysgenesis, for example, is seen when males of a P-type strain are crossed with females of an M-type strain, but not in the progeny of the reciprocal cross. At the heart of hybrid dysgenesis is a breakdown in the control of transposition. This was not immediately apparent and for several years hybrid dysgenesis was regarded as a strange phenomenon that appeared to run counter to accepted ‘ rules ’ of genetics. Bill Engels was one of the first to address this, and his analysis of the genetic basis for the difference between P and M strains established a paradigm for subsequent research (Engels, 1979). In an elegant series of experiments, Engels was able to show that hybrid dysgenesis could be explained by the interaction of polygenic chromosomal factors that appeared to be inherited in a Mendelian fashion, and a maternally inherited cytoplasmic state that he called ‘cytotype’. He proposed that the cytotype of M strain females is permissive for the activity of the chromosomal factors, P factors, which, in dysgenic flies, would be inherited from P strain males. The cytotype of a P strain would be non-permissive, thus accounting for the non-reciprocal nature of hybrid dysgenesis. He further suggested that P cytotype is determined by P factors themselves, explaining why P strains are genetically stable. Finally, he proposed that P factors might be transposable elements, as was thought to be the case for chromosomal factors responsible for a second form of hybrid dysgenesis, I–R hybrid dysgenesis (Picard, 1976). The suggestion that the chromosomal determinants responsible for P–M hybrid dysgenesis are transposable elements was confirmed by the identification of insertions of repeated sequences in the white gene in white-eye mutations isolated from the progeny of P–M dysgenic flies (Rubin et al., 1982). These insertions appeared to be deletion derivatives of a longer element, probably the P factor itself. This was cloned and shown to have P factor activity by injection into M strain embryos, an experiment","PeriodicalId":12777,"journal":{"name":"Genetical research","volume":"38 1","pages":"405-6"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77719417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-12-01DOI: 10.1017/S001667230800952X
D. Gessler
{"title":"Genetic variability and neutral mutations: a commentary on 'Genetic variability maintained in a finite population due to mutational production of neutral and nearly neutral isoalleles' by Motoo Kimura.","authors":"D. Gessler","doi":"10.1017/S001667230800952X","DOIUrl":"https://doi.org/10.1017/S001667230800952X","url":null,"abstract":"","PeriodicalId":12777,"journal":{"name":"Genetical research","volume":"1 1","pages":"337-9"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82953842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-12-01DOI: 10.1017/S0016672308009683
N. Barton
{"title":"Identity and coalescence in structured populations: a commentary on 'Inbreeding coefficients and coalescence times' by Montgomery Slatkin.","authors":"N. Barton","doi":"10.1017/S0016672308009683","DOIUrl":"https://doi.org/10.1017/S0016672308009683","url":null,"abstract":"","PeriodicalId":12777,"journal":{"name":"Genetical research","volume":"43 1","pages":"475-7"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85105232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-12-01DOI: 10.1017/S0016672308009543
J. Hey
In the early 1970s, protein electrophoresis was the primary tool geneticists used to discover and measure allelic variation in natural populations. It was a relatively simple and inexpensive technique and, most importantly, it permitted the detection of multiple alleles regardless of polymorphism levels. This was a critical point because before the age of protein electrophoresis, segregating alleles were usually discovered only in cases where clearly discrete patterns of phenotypic variation were first observed. With protein electrophoresis a geneticist’s ability to identify multiple alleles did not depend on a prior indication of the presence of genetic variation (Hubby & Lewontin, 1966; Lewontin, 1974). The purpose of the short paper that Tomoko Ohta and her mentor Motoo Kimura published in Genetical Research in 1973 was to devise a mutation model that was explicitly appropriate for protein electrophoretic data and that would permit such data to be analysed with regard to questions on the relative roles of natural selection and genetic drift. As data on electrophoretic alleles began to accumulate, it was discovered that individuals were heterozygous, and many species were polymorphic, at a substantial fraction of the proteins that could be surveyed. These numbers on heterozygosity and polymorphism immediately began to feed a long-standing hunger, that had built up from decades of sophisticated modelling, for data on such topics as mutation rates, genetic load, the rate of neutral mutations, and the relative roles of natural selection and genetic drift in shaping levels and patterns of variation. Ohta and Kimura were the primary theoreticians of the neutral theory of molecular evolution and they had a very strong interest (as did most population geneticists of that age) in understanding how well the neutral theory explained the levels of polymorphism discovered by electrophoresis. The models they developed focused on amounts and patterns of genetic variation, and they tended to include explicitly a neutral mutation rate as well as assumptions about the nature of the mutation process. One prediction of the neutral theory was that the number of alleles in a population was expected to co-vary strongly with the effective population size. Earlier in 1964, Kimura and James Crow had developed the infinite alleles model, in which every mutation gives rise to a new allele (Kimura & Crow, 1964), and under this model the number of neutral alleles varies linearly with both effective population size and neutral mutation rate. Ohta and Kimura’s key idea in 1973 was a mutation model that explicitly gave rise to new allelic states in single steps that differed in net protein charge. Because four of the amino acids are normally charged at physiological pH, the surface of a soluble protein will carry a charge that affects its behaviour in gel electrophoresis, and mutations that raise or lower this charge will increase or decrease the rate of electrophoresis. In O
{"title":"A model in two acts: a commentary on 'A model detectable alleles in a finite population' by Timoko Ohta and Motoo Kimura.","authors":"J. Hey","doi":"10.1017/S0016672308009543","DOIUrl":"https://doi.org/10.1017/S0016672308009543","url":null,"abstract":"In the early 1970s, protein electrophoresis was the primary tool geneticists used to discover and measure allelic variation in natural populations. It was a relatively simple and inexpensive technique and, most importantly, it permitted the detection of multiple alleles regardless of polymorphism levels. This was a critical point because before the age of protein electrophoresis, segregating alleles were usually discovered only in cases where clearly discrete patterns of phenotypic variation were first observed. With protein electrophoresis a geneticist’s ability to identify multiple alleles did not depend on a prior indication of the presence of genetic variation (Hubby & Lewontin, 1966; Lewontin, 1974). The purpose of the short paper that Tomoko Ohta and her mentor Motoo Kimura published in Genetical Research in 1973 was to devise a mutation model that was explicitly appropriate for protein electrophoretic data and that would permit such data to be analysed with regard to questions on the relative roles of natural selection and genetic drift. As data on electrophoretic alleles began to accumulate, it was discovered that individuals were heterozygous, and many species were polymorphic, at a substantial fraction of the proteins that could be surveyed. These numbers on heterozygosity and polymorphism immediately began to feed a long-standing hunger, that had built up from decades of sophisticated modelling, for data on such topics as mutation rates, genetic load, the rate of neutral mutations, and the relative roles of natural selection and genetic drift in shaping levels and patterns of variation. Ohta and Kimura were the primary theoreticians of the neutral theory of molecular evolution and they had a very strong interest (as did most population geneticists of that age) in understanding how well the neutral theory explained the levels of polymorphism discovered by electrophoresis. The models they developed focused on amounts and patterns of genetic variation, and they tended to include explicitly a neutral mutation rate as well as assumptions about the nature of the mutation process. One prediction of the neutral theory was that the number of alleles in a population was expected to co-vary strongly with the effective population size. Earlier in 1964, Kimura and James Crow had developed the infinite alleles model, in which every mutation gives rise to a new allele (Kimura & Crow, 1964), and under this model the number of neutral alleles varies linearly with both effective population size and neutral mutation rate. Ohta and Kimura’s key idea in 1973 was a mutation model that explicitly gave rise to new allelic states in single steps that differed in net protein charge. Because four of the amino acids are normally charged at physiological pH, the surface of a soluble protein will carry a charge that affects its behaviour in gel electrophoresis, and mutations that raise or lower this charge will increase or decrease the rate of electrophoresis. In O","PeriodicalId":12777,"journal":{"name":"Genetical research","volume":"116 1","pages":"365-6"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79773891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-12-01DOI: 10.1017/S0016672308009622
B. Weir
In 1987, Hudson proposed an estimator for the scaled recombination parameter C=4Nc, where N is the population size and c is the recombination rate between the two most distant of a set of segregating sites. This work came shortly after Kreitman (1983) published the first set of population genetic data at the DNA sequence level. Kreitman had been able to sequence 2.7 kilobases of the Drosophila melanogaster genome in 11 samples. It was felt at that time that population genetics was entering a new era, although Hudson cautioned that sufficiently large data sets for his new estimator ‘may require prohibitively large research efforts ’. Hudson’s estimator is based on the variance of the number of site differences between pairs of haplotypes and an estimate of the scaled mutation rate h=4Nm. The variance of the number of differences had already been shown by Brown et al. (1980) to be a convenient single-statistic summary of all the pairwise linkage disequilibria among a set of loci. The need for such a statistic continues as there is still doubt as to how well two-locus associations capture the full multilocus structure. Hudson provided an elegant derivation of the expected value of his statistic as a function of the unknown value C. His method of moments approach to estimation has the great virtue of simplicity although it would not be expected to behave as well as the maximum-likelihood methods that he (Hudson, 1993) and others (e.g. Kuhner et al., 2000; Wall, 2000; Fearnhead and Donnelly, 2001) developed later. Likelihood methods exploit all the information in a data set rather than just the information in a summary statistic and will do well provided the underlying evolutionary model is appropriate for the data being addressed. Writing 10 years after Hudson, Wakeley kept the same moment approach but provided modifications to Hudson’s method that improved its performance. Since 1983 the human genome has been sequenced, as have the genomes of several other species. There is now a ‘1000 genomes’ project (http://www.1000 genomes.org) under way for humans, and new sequencing techniques will make it possible very soon for population geneticists to obtain large samples of DNA sequence data. In 1987, Hudsonwished formore extensive DNA sequence data but he could not have foreseen the remarkable explosion of intermediate data – single-nucleotide polymorphisms (SNPs). Human geneticists are now generating 1 million SNP profiles for samples of thousands of individuals. By 2002, Hudson had produced a simulation procedure for SNP data (Hudson, 2002), and this has been used in studies such as Li and Stephens (2003) to detect recombination rate ‘hotspots ’. Hudson’s 1987 paper has the hallmarks of a classic paper. It introduced a new and simple method for estimating recombination rates from population samples rather than from pedigree data. More sophisticated methods have since been introduced, including composite-likelihood (Hudson, 2001) and others reviewed by H
1987年,Hudson提出了缩放重组参数C=4Nc的估计量,其中N为种群大小,C为一组分离位点中距离最远的两个位点之间的重组率。这项工作是在Kreitman(1983)发表第一组DNA序列水平的种群遗传数据后不久进行的。克雷特曼已经能够在11个样本中对黑腹果蝇基因组的2.7万个碱基进行测序。当时人们认为,群体遗传学正在进入一个新时代,尽管哈德森警告说,足够大的数据集对于他的新估计器来说“可能需要大量的研究努力”。Hudson的估计值是基于单倍型对之间的位点差异数的方差和缩放突变率h=4Nm的估计值。Brown等人(1980)已经证明,差异数的方差是一组基因座中所有成对连锁不平衡的方便的单统计汇总。对这种统计的需求仍在继续,因为对于双位点关联如何很好地捕获完整的多位点结构仍然存在疑问。Hudson提供了他的统计值期望值作为未知值c的函数的优雅推导。他的矩量方法估计具有简单的优点,尽管它不会像他(Hudson, 1993)和其他人(例如Kuhner等人,2000;墙,2000;Fearnhead and Donnelly, 2001)发展较晚。似然方法利用数据集中的所有信息,而不仅仅是汇总统计数据中的信息,如果底层进化模型适合于所处理的数据,它将会做得很好。在哈德逊10年后,韦克利继续沿用了哈德逊的方法,但对哈德逊的方法进行了修改,以提高其性能。自1983年以来,人类基因组已被测序,其他几个物种的基因组也已测序。现在有一个针对人类的“1000个基因组”项目(http://www.1000 genomes.org)正在进行中,新的测序技术将使群体遗传学家很快有可能获得大量DNA序列数据样本。1987年,哈德逊希望获得更广泛的DNA序列数据,但他无法预见到中间数据——单核苷酸多态性(snp)的惊人爆炸式增长。人类遗传学家现在正在为数千个人的样本生成100万个SNP图谱。到2002年,Hudson已经制作了一个SNP数据的模拟程序(Hudson, 2002), Li和Stephens(2003)等研究已使用该程序来检测重组率的“热点”。哈德森1987年的论文具有经典论文的特点。它提出了一种新的和简单的方法估计重组率从总体样本,而不是从系谱数据。后来引入了更复杂的方法,包括复合似然法(Hudson, 2001)以及Hellenthal和Stephens(2006)评述的其他方法,但最初的方法在进化研究中仍然有用(例如Meikeljohn等人,2004)。
{"title":"Estimating the recombination parameter: a commentary on 'Estimating the recombination parameter of a finite population model without selection' by Richard R. Hudson.","authors":"B. Weir","doi":"10.1017/S0016672308009622","DOIUrl":"https://doi.org/10.1017/S0016672308009622","url":null,"abstract":"In 1987, Hudson proposed an estimator for the scaled recombination parameter C=4Nc, where N is the population size and c is the recombination rate between the two most distant of a set of segregating sites. This work came shortly after Kreitman (1983) published the first set of population genetic data at the DNA sequence level. Kreitman had been able to sequence 2.7 kilobases of the Drosophila melanogaster genome in 11 samples. It was felt at that time that population genetics was entering a new era, although Hudson cautioned that sufficiently large data sets for his new estimator ‘may require prohibitively large research efforts ’. Hudson’s estimator is based on the variance of the number of site differences between pairs of haplotypes and an estimate of the scaled mutation rate h=4Nm. The variance of the number of differences had already been shown by Brown et al. (1980) to be a convenient single-statistic summary of all the pairwise linkage disequilibria among a set of loci. The need for such a statistic continues as there is still doubt as to how well two-locus associations capture the full multilocus structure. Hudson provided an elegant derivation of the expected value of his statistic as a function of the unknown value C. His method of moments approach to estimation has the great virtue of simplicity although it would not be expected to behave as well as the maximum-likelihood methods that he (Hudson, 1993) and others (e.g. Kuhner et al., 2000; Wall, 2000; Fearnhead and Donnelly, 2001) developed later. Likelihood methods exploit all the information in a data set rather than just the information in a summary statistic and will do well provided the underlying evolutionary model is appropriate for the data being addressed. Writing 10 years after Hudson, Wakeley kept the same moment approach but provided modifications to Hudson’s method that improved its performance. Since 1983 the human genome has been sequenced, as have the genomes of several other species. There is now a ‘1000 genomes’ project (http://www.1000 genomes.org) under way for humans, and new sequencing techniques will make it possible very soon for population geneticists to obtain large samples of DNA sequence data. In 1987, Hudsonwished formore extensive DNA sequence data but he could not have foreseen the remarkable explosion of intermediate data – single-nucleotide polymorphisms (SNPs). Human geneticists are now generating 1 million SNP profiles for samples of thousands of individuals. By 2002, Hudson had produced a simulation procedure for SNP data (Hudson, 2002), and this has been used in studies such as Li and Stephens (2003) to detect recombination rate ‘hotspots ’. Hudson’s 1987 paper has the hallmarks of a classic paper. It introduced a new and simple method for estimating recombination rates from population samples rather than from pedigree data. More sophisticated methods have since been introduced, including composite-likelihood (Hudson, 2001) and others reviewed by H","PeriodicalId":12777,"journal":{"name":"Genetical research","volume":"20 1","pages":"425-6"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82373357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}