Journal of Archaeological Science: Reports. Published 2024-08-23. DOI: 10.1016/j.jasrep.2024.104730. Source: https://www.sciencedirect.com/science/article/pii/S2352409X24003584
Sex estimation using long bones in the largest burial site of the Copper Age: Linear discriminant analysis and random forest
Sex estimation of the individuals in a sample is fundamental for any bioarchaeological study to define a particular demographic assemblage or to classify isolated remains. Long bones are an excellent alternative for sex estimation when the most dimorphic anatomical parts are not preserved or are highly altered. Here we propose a set of discriminant functions and classification models to estimate the sex of prehistoric individuals using linear discriminant analysis and machine learning approaches. Different osteometric variables were taken from the humeri, ulnae, radii, femurs and tibias of a sample of 109 articulated skeletons buried in the collective tomb of Camino del Molino (Region of Murcia, SE-Spain), dated to the 3rd millennium BC. Sex was estimated based on standard anthropological methods and ancient DNA analysis of a control sample. Fifty-two discriminant functions with prediction thresholds higher than 0.8 on the ROC curve were obtained using independent (22) and combined variables (30). The best LDA models for sex prediction were those based on proximal epiphyseal widths or their combination with other variables, reaching values close to 0.98 on the ROC curve. The random forest-based model obtained an accuracy of 0.94 and confirmed the importance of epiphyseal widths in sex classification. This analysis is more comprehensive than univariate LDA, as it allows for ranking the importance of bones in sex discrimination and considers correlations between long bones rather than treating them as independent observations. In contrast, applying LDA to each bone makes it easier to predict the sex of other coeval collections that do not have such a complete sample. This work aims to overcome the scarcity of methods that can be applied to sex estimation of the large volume of isolated remains from Camino del Molino and for other Mediterranean skeletal series from the Late Prehistory with high biological affinity and that share similar environmental conditions.
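As a rough illustration of what a single-variable discriminant function of the kind described above involves, the following Python sketch fits a two-class LDA on one measurement and scores it with the area under the ROC curve. It is a minimal sketch under stated assumptions, not the authors' functions: the measurement values are invented for illustration, the variable is a hypothetical proximal epiphyseal width in millimetres, equal prior probabilities are assumed for both sexes, and the function names are made up here. The random forest model is not reproduced.

```python
from statistics import mean


def fit_univariate_lda(female, male):
    """Fit a two-class, one-variable LDA; returns (w, b) for d(x) = w*x + b."""
    m_f, m_m = mean(female), mean(male)
    # LDA assumes both groups share one variance, so pool the within-group
    # sums of squares.
    ss = sum((x - m_f) ** 2 for x in female) + sum((x - m_m) ** 2 for x in male)
    pooled_var = ss / (len(female) + len(male) - 2)
    w = (m_m - m_f) / pooled_var
    # Assumption: equal priors, so the cut-off sits midway between the means.
    b = -w * (m_f + m_m) / 2
    return w, b


def discriminant_score(x, w, b):
    """Positive scores fall on the male side of the cut-off."""
    return w * x + b


def classify(x, w, b):
    return "M" if discriminant_score(x, w, b) > 0 else "F"


def roc_auc(scores_f, scores_m):
    """AUC as the share of (male, female) pairs ranked correctly; ties count half."""
    wins = sum((sm > sf) + 0.5 * (sm == sf) for sm in scores_m for sf in scores_f)
    return wins / (len(scores_m) * len(scores_f))


# Invented measurements (mm) for a hypothetical proximal epiphyseal width.
female_widths = [38.1, 39.4, 40.2, 37.6, 41.0, 39.9, 42.3, 38.8]
male_widths = [44.5, 46.1, 43.2, 47.0, 41.8, 45.3, 44.9, 46.6]

w, b = fit_univariate_lda(female_widths, male_widths)
cutoff = -b / w  # measurement value at which the score changes sign

predicted = [classify(x, w, b) for x in female_widths + male_widths]
actual = ["F"] * len(female_widths) + ["M"] * len(male_widths)
accuracy = sum(p == a for p, a in zip(predicted, actual)) / len(actual)

auc_value = roc_auc(
    [discriminant_score(x, w, b) for x in female_widths],
    [discriminant_score(x, w, b) for x in male_widths],
)
print(f"cut-off = {cutoff:.2f} mm, accuracy = {accuracy:.3f}, AUC = {auc_value:.3f}")
```

Because each function of this kind needs only one measurement, it can be applied to an isolated bone, which is the practical advantage the abstract attributes to per-bone LDA. Accuracy and AUC are computed here on the same data used for fitting, so they are optimistic; a real evaluation would use cross-validation or a held-out sample.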
About the journal:
Journal of Archaeological Science: Reports is aimed at archaeologists and scientists engaged with the application of scientific techniques and methodologies to all areas of archaeology. The journal focuses on the results of the application of scientific methods to archaeological problems and debates. It will provide a forum for reviews and scientific debate of issues in scientific archaeology and their impact in the wider subject. Journal of Archaeological Science: Reports will publish papers of excellent archaeological science, with regional or wider interest. This will include case studies, reviews and short papers where an established scientific technique sheds light on archaeological questions and debates.