{"title":"A New Theoretical Approach to Ancestry Estimation as Applied to Human Crania","authors":"Michael W. Kenyhercz","doi":"10.13110/humanbiology.93.2.03","DOIUrl":null,"url":null,"abstract":"Since Frank Livingstone proposed the idea that there are no races, only clines, in 1962, little has changed in how anthropologists study and, ultimately, estimate ancestry. How we talk about the study of human variation may have changed—shifting away from “racial” labels and toward those of supposed ancestral origins—but the methods we use to label and analyze groups, however termed, have remained the same. The author suggests a new theoretical approach to ancestry estimation that does not rely on group labels, using the Howells Craniometric Data Set as an example. In the suggested workflow, the data structures itself into natural clusters, referred to as “morphogroups,” without relying on a group label. Each morphogroup is explored for subgroups, and the process is repeated until no further distinctions can be made. At each level an individual is compared to the morphogroup in a descriptive manner, focusing on similarities and differences. Lastly, a multi-iteration classification procedure, using random forest modeling, classifies by morphogroup. In this test, hierarchical clustering identifies the optimal number of natural clusters within the data, and principal components analysis is used to explore morphogroups. (The author provides a markdown file of all code used, at https://rpubs.com/kenyhercz2/717620.) Using this suggested workflow, the author identifies three main morphogroups in the Howells data set, each with different numbers of subclusters ranging from 0 to 8. Morphogroup correct classifications are typically in the mid-90% range, and the accompanying sex estimations, between 93% and 100% correct. The author emphasizes that this is but one of myriad ways ancestry could be estimated. Human variation and identity are not static, and we should help one another rethink and redefine what is possible for our field.","PeriodicalId":13053,"journal":{"name":"Human Biology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.13110/humanbiology.93.2.03","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Since Frank Livingstone proposed the idea that there are no races, only clines, in 1962, little has changed in how anthropologists study and, ultimately, estimate ancestry. How we talk about the study of human variation may have changed—shifting away from “racial” labels and toward those of supposed ancestral origins—but the methods we use to label and analyze groups, however termed, have remained the same. The author suggests a new theoretical approach to ancestry estimation that does not rely on group labels, using the Howells Craniometric Data Set as an example. In the suggested workflow, the data structures itself into natural clusters, referred to as “morphogroups,” without relying on a group label. Each morphogroup is explored for subgroups, and the process is repeated until no further distinctions can be made. At each level an individual is compared to the morphogroup in a descriptive manner, focusing on similarities and differences. Lastly, a multi-iteration classification procedure, using random forest modeling, classifies by morphogroup. In this test, hierarchical clustering identifies the optimal number of natural clusters within the data, and principal components analysis is used to explore morphogroups. (The author provides a markdown file of all code used, at https://rpubs.com/kenyhercz2/717620.) Using this suggested workflow, the author identifies three main morphogroups in the Howells data set, each with different numbers of subclusters ranging from 0 to 8. Morphogroup correct classifications are typically in the mid-90% range, and the accompanying sex estimations, between 93% and 100% correct. The author emphasizes that this is but one of myriad ways ancestry could be estimated. Human variation and identity are not static, and we should help one another rethink and redefine what is possible for our field.
期刊介绍:
Human Biology publishes original scientific articles, brief communications, letters to the editor, and review articles on the general topic of biological anthropology. Our main focus is understanding human biological variation and human evolution through a broad range of approaches.
We encourage investigators to submit any study on human biological diversity presented from an evolutionary or adaptive perspective. Priority will be given to interdisciplinary studies that seek to better explain the interaction between cultural processes and biological processes in our evolution. Methodological papers are also encouraged. Any computational approach intended to summarize cultural variation is encouraged. Studies that are essentially descriptive or concern only a limited geographic area are acceptable only when they have a wider relevance to understanding human biological variation.
Manuscripts may cover any of the following disciplines, once the anthropological focus is apparent: human population genetics, evolutionary and genetic demography, quantitative genetics, evolutionary biology, ancient DNA studies, biological diversity interpreted in terms of adaptation (biometry, physical anthropology), and interdisciplinary research linking biological and cultural diversity (inferred from linguistic variability, ethnological diversity, archaeological evidence, etc.).