{"title":"A comprehensive workflow for compositional data analysis in archaeometry, with code in R","authors":"Michael Greenacre, Jonathan R. Wood","doi":"10.1007/s12520-024-02070-w","DOIUrl":null,"url":null,"abstract":"<div><p>Compositional data, which have relative rather than absolute meaning, are common in quantitative archaeological research. Such multivariate data are usually expressed as proportions, summing to 1, or equivalently as percentages. We present a comprehensive and defensible workflow for processing compositional data in archaeometry, using both the original compositional values and their transformation to logratios. The most useful logratio transformations are illustrated and how they affect the interpretation of the final results in the context of both unsupervised and supervised learning. The workflow is demonstrated on compositional data from bronze ritual vessels to provide compositional fingerprints for the Shang and Zhou periods of the Chinese Bronze Age. Predictions, with caveats, of the fabrication age of the vessels are made from the compositional data – in effect, compositional rather than typological seriation of the bronzes. In the Supplementary Material, we further explore the effect of zeros in the dataset and compare logratio analyses with the chiPower approach, where we replace any value in the original data determined as being below the detection limit of the instruments for the element, with zeros. The data and R code for reproducing all the analyses are provided both in the Supplementary Information and online.</p></div>","PeriodicalId":8214,"journal":{"name":"Archaeological and Anthropological Sciences","volume":"16 10","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archaeological and Anthropological Sciences","FirstCategoryId":"89","ListUrlMain":"https://link.springer.com/article/10.1007/s12520-024-02070-w","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ANTHROPOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Compositional data, which have relative rather than absolute meaning, are common in quantitative archaeological research. Such multivariate data are usually expressed as proportions, summing to 1, or equivalently as percentages. We present a comprehensive and defensible workflow for processing compositional data in archaeometry, using both the original compositional values and their transformation to logratios. The most useful logratio transformations are illustrated and how they affect the interpretation of the final results in the context of both unsupervised and supervised learning. The workflow is demonstrated on compositional data from bronze ritual vessels to provide compositional fingerprints for the Shang and Zhou periods of the Chinese Bronze Age. Predictions, with caveats, of the fabrication age of the vessels are made from the compositional data – in effect, compositional rather than typological seriation of the bronzes. In the Supplementary Material, we further explore the effect of zeros in the dataset and compare logratio analyses with the chiPower approach, where we replace any value in the original data determined as being below the detection limit of the instruments for the element, with zeros. The data and R code for reproducing all the analyses are provided both in the Supplementary Information and online.
具有相对意义而非绝对意义的组合数据在定量考古研究中很常见。此类多元数据通常以比例表示,总和为 1,或等同于百分比。我们为考古计量学中的成分数据处理提出了一套全面、合理的工作流程,既可使用原始成分值,也可将其转换为对数。我们说明了最有用的对数比率转换,以及它们在无监督和有监督学习中如何影响最终结果的解释。工作流程在青铜礼器的成分数据上进行了演示,以提供中国青铜时代商周时期的成分指纹。根据成分数据对青铜器的制作年代进行了预测,但有一些注意事项--实际上是对青铜器进行成分而非类型学上的排序。在补充材料中,我们进一步探讨了数据集中零的影响,并将对数比率分析与chiPower方法进行了比较,在chiPower方法中,我们将原始数据中被确定为低于仪器对该元素检测极限的任何值替换为零。用于重现所有分析的数据和 R 代码在补充信息和在线版中提供。
期刊介绍:
Archaeological and Anthropological Sciences covers the full spectrum of natural scientific methods with an emphasis on the archaeological contexts and the questions being studied. It bridges the gap between archaeologists and natural scientists providing a forum to encourage the continued integration of scientific methodologies in archaeological research.
Coverage in the journal includes: archaeology, geology/geophysical prospection, geoarchaeology, geochronology, palaeoanthropology, archaeozoology and archaeobotany, genetics and other biomolecules, material analysis and conservation science.
The journal is endorsed by the German Society of Natural Scientific Archaeology and Archaeometry (GNAA), the Hellenic Society for Archaeometry (HSC), the Association of Italian Archaeometrists (AIAr) and the Society of Archaeological Sciences (SAS).